Mar 2018 - Data analysis and ML with Spark & Docker in R (Jaehyeon (Bernie) Kim)

Hosted By
Paul D. and 3 others
Details

We look forward to seeing you at our March meetup.

Arrive from 5:45pm for a 6pm talk.

Data analysis and ML with Spark & Docker in R

Talk Outline:
In this talk, Jaehyeon (Bernie) will introduce data analysis and machine learning with Spark in a Docker-based environment. After a quick introduction to the development environment, he will explain how to manipulate data, comparing the approach with the dplyr package, and how to apply user-defined functions. He will then demonstrate an end-to-end example of running a machine learning algorithm. Finally, if time permits, he will discuss further topics in application development with Spark.

BIO:
Jaehyeon (Bernie) has a mixed background in quantitative analysis and programming. After studying economics and actuarial studies, he had the chance to learn database and application development at work with the MS BI stack (C#, MS SQL Server, MS SharePoint, etc.). While working as an analyst developer, he taught himself R again, this time as a programmer. Since then, he has used R in large-scale engineering and data analysis projects in his subsequent positions. He currently works for CoreLogic Australia, where he is developing a web service with Docker and Rserve on AWS.

Sydney Users of R Forum (SURF)
SMSA
280 Pitt St · Sydney