Week 3
Data wrangling: Data import and manipulations
W03-00 Data Workflow + Tidy data + Wrangling
In class
- SLIDES: π₯W03-00 Data Workflow + Tidy data + Wrangling
- Recommended. Practice:
Primers: Programming basics
- Recommended. Read:
- Optional. Watch:
Tidy data video + slides
Grammar of data wrangling video + slideswebinar: Data wrangling with R and RStudio
W03-01 Import data
In class
- Application Exercise: π
ae03-01-import-clean-sum-plot.Rmd
SELF STUDY
- SLIDES: π₯W03-02 Import data
- Recommended. Watch:
- Recommended. Read:
- Optional. Read:
- CHEATSHEET: Data import with the tidyverse
- tidyverse/readxl + tidyverse/readr + janitor
- Optional. Watch:
Importing and recoding data
Data classes video + slides
Importing data video + slideswebinar: Whatβs new with readxl?
W03-02 Exploring numerical data
SELF STUDY
- SLIDES: π₯W03-02 Exploring numerical data
- Application Exercise: π
AE03-02 Exploring data on plstic waste+ Solution to the ae03-02:ae03-02-plastic-waste-solutions.rmd
- Recommended. Practice:
Primers: Derive Information with dplyr
Open Intro. Interactive. Visualizing numerical data (takes time to load)
Open Intro. Interactive. Summarizing data (takes time to load)Primers: Data Visualization Basics
Primers: Histograms
Primers: Boxplots and Counts
Primers: Scatterplots
- Optional. Read:
- Optional. Watch + Practice:
π₯ W03-03 Correlation
In class
- SLIDES: π₯W03-04 correlation correlation
SELF STUDY
- Application Exercise: π
ae03-03-correlation.Rmd
- Recommended. Practice:
Primers: Exploratory data analysis
- Recommended. Read:
- Optional. Read:
π W03AE - application exercise:
Download project from π ae03-data-wrangling.zip or the same on Ilias: π ae03-data-wrangling.zip on Ilias.
In class
- Application Exercise: π
ae03-01-import-clean-sum-plot.Rmd
SELF STUDY
- Application Exercise: π
ae03-02-correlation.Rmd - Application Exercise: π
ae03-02-plastic-waste.rmd
π₯ SELF STUDY Exploring categorical data
- Recommended. Practice:
Open Intro. Interactive. Visualizing categorical data (takes time to load)
- Recommended. Watch + Practice:
webinar: Tidyverse visualization manipulation basics
Visualising categorical data video + slides
- Optional. Read:
π₯ SELF STUDY Wrangle with dplyr
We use dplyr every day in any exercise. Thus, there is no point to dedicate specific time to it. Use this list of materials to guide your learning process. Most of the core functions of dplyr are covered in data other parts of the course.
Materials listed here may repeat the ones listed before.
Recommended. Read:
Recommended. Practice:
Primers: Working with Tibbles
Primers: Isolating Data with dplyr
Primers: Derive Information with dplyr
Primers: Filter observations
Open Intro. Interactive. Summarizing data (takes time to load)- Interactive tutorial based on R4DS Ch. 5.6 Summarizing data
π₯ SELF STUDY Data visualization with ggplot2
There are many ways how data could be visualized in R. We do touch numerous visualization examples over the course. Therefore, dedicated mastering of the ggplot2 package is scheduled for self learning.
- Recommended. Read:
- Recommended. Practice:
Primers: Data Visualization Basics
Primers: Exploratory data analysis
Primers: Bar Charts
Primers: Histograms
Primers: Boxplots and Counts
Primers: Scatterplots
Primers: Overplotting
Primers: Customize plots
- Optional. Practice:
Open Intro. Interactive. Visualizing categorical data (takes time to load)
Open Intro. Interactive. Visualizing numerical data (takes time to load)
- Optional. Watch + Practice: