An introduction to using R for data science (zero prerequisites required!)

Details

R is a freely-available, cross-platform, open-source programming language and statistical software environment that is well-suited to doing data science but can have a high barrier to entry. This KnoxData presentation aims to provides an introduction to using for data science, with a focus on developing a foundation in capabilities (and confidence!) that can be applied in a variety of data science contexts. It is open to anyone with an interest in R or data science. Most of the time will be spent on developing the following skills: a) Getting started with RStudio, b) and visualizing (with the ggplot2 R package), processing (with dplyr), and modeling data and presenting results (using a regression model), There are zero prerequisites, though, to get the most from the workshop, please bring your own laptop computer with you with R and RStudio installed.

Installation instructions

If you have issues with any of the installations below (and don’t worry, they’re very small and won’t take up much space on your computer), please send a message to [masked] to try to get it resolved before the presentation. If we’re unsuccessful, we’ll try to address the issues together at the start of the session, and will also have an RStudio Cloud workspace available for anyone to use (requires an easy login process).

To download R:
* Visit this page to download R: https://cran.r-project.org/
* Find your operating system (Mac, Windows, or Linux)
* Download the 'latest release' on the page for your operating system and download and install the application

To download RStudio:
* Visit this page to download RStudio: https://www.rstudio.com/products/rstudio/download/
* Find your operating system (Mac, Windows, or Linux)
* Download the 'latest release' on the page for your operating system and download and install the application