2022-05-16
1:30 - 2:50 - Setup: Installation of R and RStudio
First Example
Some R basics
3:00 - 4:30
the tidyverse
examples with the tidyverse
R is a language and environment for statistical computing and graphics
20th annual poll on software for Analytics, Data Mining, Data Science, Machine Learning (KDnuggets): “Which data mining/analytic tools you used in the past 12 months for a real project (not just evaluation)”
R has a share of 46.6% versus Python’s 65.8%
On average, data scientists use 6.1 tools in their daily work.
We will touch on all of these aspects. Slides and material are available at http://heike.github.io/summerschool-2022
Make sure to have R version 4.x.x installed
Click the CRAN link on the left, and pick a download site (0-Cloud is a good choice)
Choose link based on your OS
On Windows, choose the “base” subdirectory to install R.
On OS X, choose the .pkg file to install R.
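Once the installer has finished, the version requirement above can be checked from the R console. This is a minimal sketch using base R only; the variable name r_is_recent is made up for illustration.

```r
# Show the full version string of the running R installation
print(R.version.string)

# Compare the running version against 4.0.0
# (r_is_recent is a hypothetical name used only in this sketch)
r_is_recent <- getRversion() >= "4.0.0"
print(r_is_recent)
```

If the last line prints FALSE, the installed R is older than 4.x.x and should be updated from CRAN.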
RStudio-2022 supports both R and Python (Jupyter notebooks)
Browse to https://www.rstudio.com/
Mouse over Products and click RStudio
Choose RStudio Desktop
Click Download RStudio Desktop
Choose the installer appropriate for your platform
If R were an airplane, RStudio would be the airport, providing many, many supporting services that make it easier for you, the pilot, to take off and go to awesome places. Sure, you can fly an airplane without an airport, but having those runways and supporting infrastructure is a game-changer.
Start-here.Rproj
Projects in RStudio make it easy to start an R session. They also:
keep track of all relevant files (R history, data, profile, …)
set the working directory to the current directory
keep track of settings for next time …
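The working-directory behaviour described above can be inspected from the console. This is a small sketch using base R only; it assumes the session was started by opening a project file, and the variable names are illustrative.

```r
# Print the current working directory; in a session opened through a
# project file this should be the folder containing the .Rproj file
wd <- getwd()
print(wd)

# Look for project files in that folder
# (the result is empty if R was not started from a project)
rproj_files <- list.files(wd, pattern = "\\.Rproj$")
print(rproj_files)
```

Because paths are then relative to the project folder, scripts do not need hard-coded absolute paths.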
… double-click folder 01-Introducton-to-R
code
contains R scripts (files ending with extension .R)
knitr
contains slides (files ending with extension .Rmd)
R Markdown is an authoring format that enables easy creation of dynamic documents, presentations, and reports from R. It combines the core syntax of markdown (an easy-to-write plain text format) with embedded R code chunks that are run so their output can be included in the final document. R Markdown documents are fully reproducible (they can be automatically regenerated whenever underlying R code or data changes).
RStudio’s R Markdown cheatsheet gives a nice, concise overview of R Markdown’s capabilities.
RStudio’s R Markdown reference guide lists its options.
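To make the combination of markdown text and embedded R code chunks concrete, here is a minimal sketch that writes a tiny .Rmd file from R. The file name, title, and contents are invented for illustration and are not part of the course material; rendering assumes the rmarkdown package is installed.

```r
# Three backticks delimit a code chunk in R Markdown
fence <- strrep("`", 3)

# A minimal R Markdown document: YAML header, one sentence of
# markdown text, and one R code chunk (all content is hypothetical)
rmd_lines <- c(
  "---",
  "title: \"A first report\"",
  "output: html_document",
  "---",
  "",
  "The mean of the numbers 1 to 10 is computed below.",
  "",
  paste0(fence, "{r}"),
  "mean(1:10)",
  fence
)

# Write the document to a temporary file
rmd_file <- file.path(tempdir(), "first-report.Rmd")
writeLines(rmd_lines, rmd_file)

# To render it (assumes the rmarkdown package is installed):
# rmarkdown::render(rmd_file)
```

When rendered, the chunk is run and its result appears in the output document below the sentence, which is what makes the report regenerate itself when the code changes.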