Jan 09, 2020, Location TBD
Haley Jeppson, hjeppson@iastate.edu.
Sam Tyner, sctyner@iastate.edu.
Heike Hofmann, hofmann@iastate.edu.
HThis workshop will to prepare you for dealing with messy data by walking you through real-life examples. We will work on improving your programming skills and help you move beyond using copy-and-paste. We will discuss how to write functions in order to reduce duplication in your code and automate common tasks and how to use iteration in order to further reduce duplication. You will leave with skills that will allow you to both tackle problems with more ease.
The course will be data centric, with lots of different data sets that illustrate examples of the different techniques used for different problems.
Date | Notes | Lecture and Resources |
---|---|---|
9:00 - 9:45 | Reading Files | Excel files vs text, data organization 1-Files.R 1-Files.Rmd midwest.csv midwest.xls |
9:45 - 12:15 | Summarizing with dplyr | pipe operator, dplyr verbs 2-dplyr.R 2-dplyr.Rmd baseball.csv |
12:15 - 1:15 | Lunch | |
1:15 - 2:45 | Tidy Data | gather, spread, separate 3-tidyr.R 3-tidyr.Rmd frenchfries.csv billboard.csv flights.csv |
2:45 - 4:00 | Joining Data | joining dataframes 4-joining.R 4-joining.Rmd boxoffice.csv baseball.csv |
After the end of the this one course, we expect you to be able to do the following:
You can bring your own laptop and work on it, if you like to. In order to keep up with everybody else, we're asking that you start installing the following pieces of software ahead of coming to the workshop:
Recommended Reading: