• From Data to Insights with Google Cloud


    This time it's all about Google Cloud and their awesome data management products. Let's get together to learn best practices from Google experts. Agenda: 6:30 - 7:00pm: Doors open. Networking! 7:00 - 7:45pm: From Data to Insights with Google Cloud (Felipe Hoffa) 7:45 - 8:15pm: Panel on working with Data and Google Cloud: • Andreas Sekine, Software Engineer at Thumbtack • Felipe Hoffa, Developer Advocate at Google Cloud • Luis Bitencourt-Emilio, Principal Director of Engineering, Machine Learning at Reddit • Robin Li, Director of Data Engineering and Data Science at Tapjoy 8:15 - 9:00pm: Hang out and chat some more. Abstract: How awesome would it be if you could bring new and underutilized data sources alive? How hard is to scale up to ingest millions of messages per second? Can you give your teams a secure, productive collaborative data environment? Focus on what matters - your data and your insights. In this session, we are going to learn how Google Cloud can help you go from data to insights - while abstracting away infrastructure complexities. With Cloud Dataprep we are going to interactively explore, clean, and ingest existing datasets. Cloud Dataflow will allow us to merge both batch and streaming pipelines into one unified model, sunsetting the lambda architecture. Distributing hundreds of millions of messages per second? Easy with Cloud Pub/Sub. We'll discover how to analyze terabytes of data in seconds BigQuery. To finish we'll make all this data accessible - from data scientists to executives - with Datalab and Data Studio. Expect live demos - these tools are real, and you could start using them right now. About Felipe: In 2011 Felipe Hoffa moved from Chile to San Francisco to join Google as a Software Engineer. Since 2013 he's been a Developer Advocate on big data - to inspire developers around the world to leverage the Google Cloud Platform tools to analyze and understand their data in ways they could never before. You can find him in several YouTube videos, blog posts, and conferences around the world. Follow Felipe at @felipehoffa (https://twitter.com/felipehoffa). About Andreas: Andreas Sekine is a software engineer at Thumbtack working on the data platform and technical infrastructure. Prior to Thumbtack he was at Apple on the Customer Feedback team, wrangling user data and feedback to drive improvements of OS X and iOS. About Luis: Luis Bitencourt-Emilio is the Principal Director of Engineering for Machine Learning at Reddit. He leads this new team in building an industry-leading data and AI discipline, encompassing our search, relevance, data engineering and anti-evil efforts. Luis was previously VP of Engineering at Workpop, and also spent 10 years as Principal Engineering Manager at Microsoft shipping the v1s of Excel Online, Office for iPad and Office for iPhone. Originally from Brazil, Luis can now be found exploring San Francisco with his Siberian Husky, Vix. About Robin: Robin Li, Director of Data Engineering and Data Science at Tapjoy, leads the team to build Tapjoy's Data Platforms and Decision Engine for optimization & personalization. Prior to Tapjoy, he worked in multiple roles at Credit Suisse. Robin received Master degree in Computer Science from Imperial College London.

  • The Evolving Landscape of Data Engineering & How Systems Fail

    Welcome to our first edition of the Data Engineering Club, hosted by Twitter! In order to attend, you must RSVP @ https://twitterdataeng.splashthat.com/ Agenda: 6:30 - 7:00pm: Doors open. Networking, eat, drink, mingle! 7:00 - 7:30pm: The Evolving Landscape of Data Engineering (Andrei Savu) 7:30 - 8:15pm: How Systems Fail (Alexander Huras) 8:15 - 9:00pm: Hang out Details on Talks and Speakers: "The Evolving Landscape of Data Engineering" Abstract: Data Engineering is a relatively new, but fast evolving discipline that spans multiple environments and technologies, from traditional data centers to hyper-scale cloud providers, a discipline that combines closed-source, homegrown and open source software to create scalable data pipelines and power incredible new product features. In this presentation, we will go over the last 5-10 years of technology trends and advancements and bring all of that together in a story about modern day Data Engineering and the magic behind it. About Andrei Savu: Andrei is a software engineer in the MoPub team. He got the "data-bug" in college while developing an application to explore EXIF metadata from a very large collection of photos stored by Adobe. That led to multiple open source contributions (Apache Whirr, Zookeeper etc.) and a startup (Axemblr.com) that was acquired by Cloudera to bootstrap the Cloud Engineering team. As a Tech Lead at Cloudera, he had the opportunity to drive and witness first-hand many of the recent developments in the field of Data Engineering and Advanced Analytics and as of now he applies that knowledge to advance mobile monetization at Twitter part of MoPub. As a co-organizer of the Data Engineering Club, he is excited to learn about the latest challenges faced by Data Engineers in the Bay Area and beyond. "How Systems Fail" Abstract: With modern tools, building and maintaining data pipelines are as easy and simple as describing your system's process graph. There is no concept of "deployment", and "versioning" is just something you read about in old textbooks---as if things weren't always correctly built the first time. Somewhere, a phone rings. You wake up, it's 4:00 am and you're on call. Your phone has been ringing for the last 15 minutes and likely escalated the page to the secondary on-call: also you. Great. You wipe the sleep from your eyes, backflip out of bed, and quickly get into the VPN. It looks like a couple queue subscribers are lagging---you thought the team had fixed that problem hope that it isn't another case of data corruption... When we talk about cool data technology, we rarely mention what happens when things go wrong, when the abstractions start leaking, when product requirements change or appear out of nowhere. This will not be one of those talks. About Alexander Huras: Alex is a systems engineer on Twitter's Revenue-Platform team. He's the tech lead for the performance forecasting work group, and has designed and implemented multiple mission-critical "Big Data" systems. Before that, Alex worked on Traffic network optimization, brain simulations, and guitar soloing. Alex is a functional programming evangelist and is currently working on the next generation of Twitter's analytics products. Welcome to the club!