Prerequisites and Computational Setup
Assumed Knowledge
We assume you know the following from StatProg1, Stat1 & Stat2:
- basics of programming in R (scripts, RStudio, installing packages etc.)
- literate programming with Quarto markdown documents (qmd)
- tidy data principles and basic data wrangling with dplyr and tidyr (pivoting, missing values, joins)
- data visualisation with ggplot2, accessibility best practices
- high level understanding of different data types and data sources (e.g. panel data, time-series, text)
- introductory statistical methods: descriptive statistics, linear regression, hypothesis testing
Useful sources for revising these topics:
- LMU OSC Introduction to version Control with git and GitHub within RStudio: https://lmu-osc.github.io/Introduction-RStudio-Git-GitHub
- Monash StartR modules: https://startr.numbat.space
- R4DS: https://r4ds.hadley.nz
- ModernDive textbook (v2): https://moderndive.com/v2/
- What the forgot to teach you about R: https://rstats.wtf/projects.html
Computational Requirements
In this course we will use:
- R
- RStudio
- Quarto
- Command Line Interface
git- GitHub
And numerous R packages including tidyverse, and palmerpenguins.
By the end of Week 1, please ensure you:
- Install R and Rstudio
- Can install packages in R:
install.packages("tidyverse")