Programming with data

Jan 09, 2020, Location TBD

Haley Jeppson, hjeppson@iastate.edu.
Sam Tyner, sctyner@iastate.edu.
Heike Hofmann, hofmann@iastate.edu.

HThis workshop will to prepare you for dealing with messy data by walking you through real-life examples. We will work on improving your programming skills and help you move beyond using copy-and-paste. We will discuss how to write functions in order to reduce duplication in your code and automate common tasks and how to use iteration in order to further reduce duplication. You will leave with skills that will allow you to both tackle problems with more ease.
The course will be data centric, with lots of different data sets that illustrate examples of the different techniques used for different problems.

The course will be a mix of instruction and follow-up exercises. It will be held in a computer lab but you are encouraged to bring your own laptops, with software already loaded. A list of software will be available at this site several days prior to the workshop.

Lectures and timetable

Date Notes Lecture and Resources
9:00 - 9:45 Reading Files Excel files vs text, data organization
1-Files.R
1-Files.Rmd
midwest.csv
midwest.xls
9:45 - 12:15 Summarizing with dplyr pipe operator, dplyr verbs
2-dplyr.R
2-dplyr.Rmd
baseball.csv
12:15 - 1:15 Lunch
1:15 - 2:45 Tidy Data gather, spread, separate
3-tidyr.R
3-tidyr.Rmd
frenchfries.csv
billboard.csv
flights.csv
2:45 - 4:00 Joining Data joining dataframes
4-joining.R
4-joining.Rmd
boxoffice.csv
baseball.csv

Learning outcomes

After the end of the this one course, we expect you to be able to do the following:

Getting Ready on your own machine

You can bring your own laptop and work on it, if you like to. In order to keep up with everybody else, we're asking that you start installing the following pieces of software ahead of coming to the workshop:

Useful links

Recommended Reading: