Reproducible Analytical Pipelines (RAP) using R
- You should be familiar with R and the RStudio Integrated Development Environment.
- You should be familiar with git and GitHub.
- You should be familiar with writing functions in R.
At the end of my course, students will be able to identify suitable Reproducible Analytical Pipelines (RAP) opportunities in their organisation. From their chosen report they will derive the minimal tidy data set required to produce all the figures, tables and statistics therein. They will confidently use basic git functionality for version control, providing an audit trail of their progress. They will collaborate on GitHub using a standard workflow that relies on pull requests for peer review, ensuring quality assurance throughout the project. They will build an R package, providing a single corpus to enshrine and encapsulate the business knowledge. The package will have all the hallmarks of reproducibility and quality assurance through the students’ prudent application of Open Source software development tools and principles, including functional programming, unit testing, continuous integration and dependency management. The outcome will be a software package that shortens the production time of the statistical report while improving the quality of the statistics. This will free up the students' time to do more interesting things.
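As a rough illustration of what "functional programming" and "unit testing" can look like in this setting, here is a minimal sketch in base R. The function name `summarise_by_group`, the column names and the toy data are all hypothetical and do not come from the course; in a real package the checks would usually live under `tests/` and use a testing framework such as testthat rather than bare `stopifnot()` calls.

```r
# Hypothetical helper (not from the course): total a value column by group.
# Expects tidy data: one row per observation, one column per variable.
summarise_by_group <- function(data, group_col, value_col) {
  stopifnot(
    is.data.frame(data),
    group_col %in% names(data),
    value_col %in% names(data)
  )
  # tapply() sums the values within each group; groups are sorted by name.
  totals <- tapply(data[[value_col]], data[[group_col]], sum)
  data.frame(
    group = names(totals),
    total = as.numeric(totals),
    stringsAsFactors = FALSE
  )
}

# Toy input, invented purely for illustration.
toy <- data.frame(
  region = c("North", "North", "South"),
  count = c(1, 2, 5),
  stringsAsFactors = FALSE
)

# Unit-test-style checks: the script stops with an error if any of them fail.
result <- summarise_by_group(toy, "region", "count")
stopifnot(identical(result$group, c("North", "South")))
stopifnot(identical(result$total, c(3, 5)))
```

Keeping each step of a report as a small function like this is what makes it testable: the same checks can then be run automatically by continuous integration whenever the code changes.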
DISCLAIMER: The views and opinions expressed in this course are those of the author and do not reflect the official policy or position of GDS or the UK Government.
Who this course is for:
- Anyone who has produced the same report or publication more than once.
- Anyone who is frustrated by, or bored with, manually processing data.
- Anyone keen to automate their workflow for the regular analysis of the same kind of data input.
- 02:53 Introduction to Reproducible Analytical Pipelines (RAP)
- 06:59 Why RAP? + ACTIVITY
- 6 questions: Identifying RAP battles in your organisation
- 05:15 RAP is Open Source + ACTIVITY
- 3 questions: Open Source Trivia
- 01:00 Evaluating RAP + ACTIVITY
- 4 questions: RAP business benefit
- 08:25 Finding a RAP buddy + ACTIVITY
- 2 questions: Finding a buddy to collaborate with
I am a Data Scientist in the Public Sector, working at the Government Digital Service (GDS). I am dedicated to building data science capability across Government and society; building capability is transformation.
Prior to life as a Civil Servant, I completed a PhD in genetic engineering at Oxford and qualified as a teacher on the Teach First graduate scheme.
Disclaimer: my views and opinions are my own and do not reflect Government or GDS policy.