Building and Scaling Data Pipelines with Airbnb, Keen IO and more!


Details
The rise of software applications and their data outputs means a wealth of data to harvest and gather insights from, but it also means centralizing diverse data sources into a single data pipeline, which can be a pain.
Join us at Keen IO HQ on January 27th as we explore the challenges and best practices around building, scaling and maintaining these large-scale data pipelines and analytics infrastructure. We'll explore technology options, scalable data models, architecture standards, best practices in data warehousing, and more.
SPEAKERS
Maxime Beauchemin, Data Engineer at Airbnb
Maxime Beauchemin recently joined Airbnb as a data engineer developing tools to help streamline and automate data-engineering processes. He mastered his data-warehousing fundamentals at Ubisoft and was an early adopter of Hadoop/PIG while at Yahoo in 2007. More recently, at Facebook he developed “analytics as a service” frameworks around engagement and growth-metrics computation, anomaly detection, and cohort analysis. He’s a father of three, and in his free time, he’s a digital artist. You can read more about his projects on his blog, Digital Artifacts (http://mistercrunch.blogspot.com/). Maxime will be walking us through Airflow (https://github.com/airbnb/airflow), an open source system to programmatically author, schedule and monitor data pipelines.
Samantha Zeitlin, Data Scientist, Sighten.io
Sam is a recovering cancer research scientist who had to rely on other people to help her reformat high-throughput imaging data before she could analyze it. These days Sam works at a solar financing startup called Sighten (http://www.sighten.io/), and enjoys data pipelining with pandas, mostly because doing it well means getting to do better, faster, more meaningful data analysis. You can follow Sam on Twitter at @samanthazeitlin (https://twitter.com/samanthazeitlin).
Bradford Stephens, Founder, 22Acacia
Bradford is the Founder of 22Acacia (http://www.22acacia.com/), a company that bridges the gap between data infrastructure, data science, and revenue impact. He has three startup exits and one public M&A deal in his wheelhouse, and loves to make company-changing plans succeed with data architecture and data scientists. He formerly built scalable machine learning platforms at Projector and is also the co-founder of Cloth, the biggest mobile social network for fashion. He was the Founder and CEO of Drawn to Scale, a venture-backed startup. They built Spire, the first highly scalable SQL database on Hadoop, capable of storing billions of records.
Dan Kador, CTO, Keen IO
Dan is a co-founder and CTO of Keen IO and is responsible for our technology stack, including pooling terabytes of data from thousands of Keen customers into one robust data pipeline. Prior to Keen, Dan spent 5 years building APIs for Salesforce. Dan will be walking us through Keen's data technology stack, and how we channel trillions of events daily into meaningful insights. You can follow Dan on Twitter at @dkador (http://twitter.com/dkador).
AGENDA
6:00pm - 6:30pm: Drinks, nibbles and hobnobbing
6:30pm - 7:30pm: Talks from our speakers!
7:30pm - 8:00pm: Panel Q&A
8:00pm - 8:30pm: More drinks, nibbling and hobnobbery
WHO SHOULD ATTEND?
If you're an engineer, data scientist, CTO, or just interested in learning more about how to bring together, scale and manage multiple sources of data in a single, scalable pipeline, join us for an evening of learning, networking and knowledge exchange!
