Week 3
Data wrangling: Data import and manipulations
W03-00 Data Workflow + Tidy data + Wrangling
- Recommended. Practice:
- Primers: Programming basics
- Recommended. Read:
- Optional. Watch:
- Tidy data video + slides
- Grammar of data wrangling video + slides
- webinar: Data wrangling with R and RStudio
W03-01 Import data
- Recommended. Watch:
- Recommended. Read:
- Optional. Read:
- CHEATSHEET: Data import with the tidyverse
- tidyverse/readxl + tidyverse/readr + janitor
- Optional. Watch:
- Importing and recoding data
- Data classes video + slides
- Importing data video + slides
- webinar: Whatβs new with readxl?
W03-02 Exploring numerical data
- Recommended. Practice:
- Primers: Derive Information with dplyr
- Open Intro. Interactive. Visualizing numerical data (takes time to load)
- Open Intro. Interactive. Summarizing data (takes time to load)
- Primers: Data Visualization Basics
- Primers: Histograms
- Primers: Boxplots and Counts
- Primers: Scatterplots
- Optional. Read:
- Optional. Watch + Practice:
π₯ W03-03 Correlation
- Recommended. Practice:
- Primers: Exploratory data analysis
- Recommended. Read:
- Optional. Read:
π W03AE - application exercise:
Download project from π ae03-data-wrangling.zip
or the same on Ilias: π ae03-data-wrangling.zip
on Ilias.
π₯ SELF STUDY
Exploring categorical data
- Recommended. Practice:
- Open Intro. Interactive. Visualizing categorical data (takes time to load)
- Recommended. Watch + Practice:
- webinar: Tidyverse visualization manipulation basics
- Visualising categorical data video + slides
- Optional. Read:
π₯ SELF STUDY
Wrangle with dplyr
We use dplyr
every day in any exercise. Thus, there is no point to dedicate specific time to it. Use this list of materials to guide your learning process. Most of the core functions of dplyr
are covered in data other parts of the course.
Materials listed here may repeat the ones listed before.
Recommended. Read:
Recommended. Practice:
- Primers: Working with Tibbles
- Primers: Isolating Data with dplyr
- Primers: Derive Information with dplyr
- Primers: Filter observations
- Open Intro. Interactive. Summarizing data (takes time to load)
- Interactive tutorial based on R4DS Ch. 5.6 Summarizing data
π₯ SELF STUDY
Data visualization with ggplot2
There are many ways how data could be visualized in R. We do touch numerous visualization examples over the course. Therefore, dedicated mastering of the ggplot2
package is scheduled for self learning.
- Recommended. Read:
- Recommended. Practice:
- Primers: Data Visualization Basics
- Primers: Exploratory data analysis
- Primers: Bar Charts
- Primers: Histograms
- Primers: Boxplots and Counts
- Primers: Scatterplots
- Primers: Overplotting
- Primers: Customize plots
- Optional. Practice:
- Open Intro. Interactive. Visualizing categorical data (takes time to load)
- Open Intro. Interactive. Visualizing numerical data (takes time to load)
- Optional. Watch + Practice: