Statistical analysis

In this 3h. session we take an overview of the main ideas behind making your data analysis reproducible and transparent. For that, we use the R statistical platform, likely the most popular choice among academia and industry, as the main tool.

We begin with the main ideas behind a reproducible approach to data analysis, provide some background on the history of R and use a popular replication example to show how to read, manipulate, interact, analyze, visualize and write your data. In particular, we use the famous Reinhart and Rogoff replication exercise by Thomas Herndon et al.

Slides

Requirements

To follow on this session on your own machine, you will need both R and RStudio installed. For links to download and install them, head over to the Requirements page to see how to install them if you haven’t yet.

No previous background on R or any specific statistical background is needed for this session, although some basic understanding of statistics and data (at an undergraduate level) is required to comfortably follow along.

Outcomes

After completing the session, you should be able to:

References

R:

Reinhart & Rogoff: