Piero Ferrante will provide a high-level overview of time series analysis in Python. This talk will cover how to work with time series data in Pandas, various filtering and smoothing techniques, outlier/breakout/mean-shift detection, interpolation for missing data, and forecasting techniques ranging from simple to sophisticated.
Piero Ferrante is the Director of Data Science at C2FO and a member of the adjunct faculty at Rockhurst University and the University of Kansas. He also advises a local digital health startup, Play-It Health, on algorithms and data strategy. His areas of expertise include quantitative finance and machine learning.
Matt Habiger will show an example of building a model to find faces in pictures. He will focus on building a dataset, extracting features from images, and applying the model. Time permitting, he will show how to take code that was built on a single machine and parallelize it using Spark.
Matt Habiger manages the Data Science and Business Intelligence teams at Pinsight Media. He thinks Spark is amazing and is very focused on building amazing data teams.