This website (harris-coding-lab.github.io) contains the content for the Harris School's Summer and Fall Coding Labs. The workshop aims to introduce R programming concepts with a focus on preparing and analyzing data, and conducting statistical simulations. We will cover how to read data into R, manipulate data with a suite of tools from the tidyverse
package dplyr
. We will also discuss some basic programming concepts including data types, operators, control flow with if statements and for loops, and how to write your own functions.
We ask that Coding Lab attendees have the latest stable versions of R and RStudio pre-installed on their local machine. If you need help getting set up, Harris IT is hosting office hours during on Sept 1 and 2 (you should have an email with the details.) and you can email the amazing staff of Harris IT at [email protected].
Accelerated coding lab meets twice in the quarter. You will either attend week 1 and week 3 or week 2 and week 4.
Class | Videos | Slides & Code | Problem sets | Additional Resources |
---|---|---|---|---|
Accelerated Class 1: Functions | video | slides slide code |
- basics review - acc. lab 1 - acc. lab 1 solved |
|
Accelerated Class 2: Iteration | video | slides slide code |
- basics review - acc lab 2 |
Recap slides for all of our meetings here.
Your final project is quite simple. You will pick a data set that speaks to you and try to uncover something interesting which you will visualize in a plot. You will also compute some summary statistics that you will show in a summary table. We'll provide feedback on your submission. Click on the link for details.
Links to materials for each week's workshop will be posted here as provided. For each class, watch the video. Then, go to lab and attempt the lab
The first three weeks of fall coding lab covers the same material (minus ggplot
).
Week 4 and 5 cover new material on for loops and functions.
Accelerated fall coding lab consists solely of new material on for loops and functions.
Post questions on Piazza
- tidyverse cheetsheets start with
dplyr
andggplot
- R for Data Science: free online book with clear explanations of many
tidyverse
functions, the book to read on data analysis with R - DataQuest.io: online modules about specific programming concepts, access provided by Harris. For students who would like additional guided practice we recommend:
- Vectors
- Data frames
- Control flow and if statements
- Functions
- writing custom functions
- working with functionals (includes discussion of map)
- Extensions
- random sampling with
sample()
- basic string manipulation
gather()
and correlations- "step 2" is all about
ggplot
and potentially useful.
- "step 2" is all about
- random sampling with
- They're always adding content, so we probably missed some good ones.
The instructors for this course are (see Canvas for emails):
- Ari Anisfeld
- Terence Chau
The teaching assistants for this course are:
- Michael Gorman
- Nguyen Luong
- Alysha Rashid
- Milan Rivas
- Paola Riveros
- Dominic Teo
- Ryan Webb
- Fanmei Xia