Big Data All the Things


Details
Should You Jump Into the Data Lake? : Doing Data Work with and without Hadoop
Whether and how to incorporate big data programs continues to be an important consideration for many companies. In this talk, we'll cover things to think about when considering a big data platform, including whether you have enough data and tradeoffs. We'll also review big data best practices to consider, including Hadoop native file formats and Hadoop/Spark programming language choice.
About the Speaker
Vicki Boykis works in all parts of the data stack, including predictive analytics, data engineering, and data visualization. Her most recent work has been with Spark. She has experience in healthcare, education, telecommunications, and finance. While not data science-ing, she enjoys writing and Nutella. Her tech blog is http://veekaybee.github.io
http://photos3.meetupstatic.com/photos/event/7/7/8/7/600_458370599.jpeg
Data Science Jawn: Getting started with Python Data Science Tools
Let's explore the Philadelphia data science community using Jupyter notebook and other Python data science tools! In this talk, I'll introduce you to several tools in the Python data science toolkit, all while exploring the rich and vibrant data science community we have right here in Philadelphia.
About the Speaker
Michael Becker is a Senior Data Scientist at Penn Medicine where he is building machine learning systems to improve patient outcomes by providing real-time predictive applications that empower clinicians to identify at risk individuals. In his spare time Michael organizes the DataPhilly Meetup group and despite being terrified of public speaking he presents regularly at community events and conferences. On the internet he can be found at http://beckerfuffle.com/
http://photos2.meetupstatic.com/photos/event/7/7/6/b/600_458370571.jpeg

Big Data All the Things