Both R and Python have been growing in popularity as go-to languages for data science.
In this session we will introduce Python and PySpark/SparkR by comparing them to their R equivalents.
We will start with basic language elements and then cover some of the most widely used libraries. This will be a hands-on session, so we recommend bringing a laptop with the RStudio, Jupyter, and Spark environments pre-installed (see the instructions and options below).
Vishwanath (Vish) Kamat – Data Science Architect, IBM Hybrid Cloud
Vish has been working with IBM for over 18 years in various capacities. Most recently, he has been working on IBM Analytics offerings, focusing on the Watson Studio, Data Science Experience Local, and IBM Cloud Private for Data products.
Dr Arvind Betrabet (PhD) – Data Scientist, IBM Cloud
Arvind, an electrical engineer, has been working in data analytics for over 15 years. Apart from working at IBM as a data scientist, Arvind has been teaching Python at Collin County College for the past 3 years.
To install Jupyter Notebook and RStudio on your laptop: https://cse.buffalo.edu/~bina/cse487/spring2018/Lectures/JupyterHandoutJan31.pdf
Alternatively, you can sign up for a free environment on IBM Cloud. The Watson Studio service offers RStudio, Jupyter, and Spark, all pre-installed, and is available at: https://www.ibm.com/cloud/watson-studio