Mar 2018 - Data analysis and ML with Spark & Docker in R (Jaehyeon (Bernie) Kim)


Details
We look forward to seeing you at our March meetup.
Arrive from 5:45pm for a 6pm talk.
Data analysis and ML with Spark & Docker in R
Talk Outline:
In this talk, Jaehyeon (Bernie) will introduce data analysis and machine learning with Spark in a Docker-based environment. After a quick introduction to the development environment, he will explain how to manipulate data, in comparison with the dplyr package, and how to apply user-defined functions. He will then demonstrate an end-to-end example of executing a machine learning algorithm. Finally, if time permits, further topics on application development with Spark will be discussed.
BIO:
Jaehyeon (Bernie) has a mixed background in quantitative analysis and programming. After studying economics and actuarial studies, he had a chance to learn database and application development at work using the Microsoft BI stack (C#, MS SQL Server, MS SharePoint, etc.). While working as an analyst developer, he taught himself R again, this time as a programmer. Since then, he has used R in large-scale engineering and data analysis projects in his later positions. He currently works for CoreLogic Australia, where he is developing a web service with Docker and Rserve on AWS.
