addressalign-toparrow-leftarrow-leftarrow-right-10x10arrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcredit-cardcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobe--smallglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1languagelaunch-new-window--smalllight-bulblinklocation-pinlockm-swarmSearchmailmediummessagesminusmobilemoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahooyoutube

Data By the Bay: 150 talks, 3 trainings

From: Alexy K.
Sent on: Thursday, March 24, 2016 8:27 AM

We're happy to report that two of our veteran speakers, Andy Petrella ( and Chris Richardson ( are giving full-day trainings on Agile Data Science Data Pipelines and Developing Microservices at Data By the Bay.  The conference itself is now at 150 talks, with the recent additions of Google BigQuery, Pandora and Quora Machine Learning, and more.  Here's the summary:

Data By the Bay ( 2016 is a seven-conference matrix running on May 16-20 in the Bay Area.  It is a by-data engineers, for-data engineers developer and data scientist meeting expanding on the highly successful Text By the Bay (  It fits 150 talks over 5 days, plus three innovative training sessions (micro services, data pipelines, NLP).

You can attend any combination of days.  Early Bird registration is in effect until April 1st.  Use the code SFSCALA10 to get 10% off the registration:

The days and topics for Data By the Bay 2016 include
— Data Pipelines:
— NLP:
— Data for Society: and
— AI, IoT, and streaming and sensor data:
— Life Sciences:
— User Experience and Data Visualization:

Data By the Bay is a new kind of data conference -- a Data Grid conference matrix. The verticals comprise application domains (text/NLP, society/democracy/law, life sciences, UX), and the horizontals consist of the platforms (like Spark), data pipelines (the SMACK stack), and algorithms (such as Deep Learning) underpinning them all.  More about the Data Grid:

The speakers include:

— Jay Kreps (CEO/Confluent)
— Ricardo Baeza-Yates (VP Research/Yahoo Labs)
— Lukas Biewald (CEO/Crowdflower)
— Anthony Goldbloom (CEO/Kaggle)
— Carlos Guestrin (CEO/Dato)
— Jeremy Howard (CEO/Enlitic)
— Grant Ingersoll (CTO/Lucidworks)
— Matt Mahan (CEO/Brigade)
— Monty Zweben (CEO/Splice Machine)
— 50+ other founders, CEOs and CTOs
— and 100+ more speakers

Companies presenting include Google, Facebook, Microsoft, IBM, Netflix, LinkedIn, Yahoo, Bloomberg, Samsung, Baidu, Salesforce, Workday,  Cloudera, Pivotal, Zillow, Databricks, Mesosphere, ClearStory, H2O, Pandora, Quora, Nitro, Lucidworks, Mattermark, Credit Karma, Alpine Labs, MetaMind, University of California-Berkeley, Stanford University, City of San Francisco, and many others.

We’re partnering with top Bay Area startups, academics, and portfolio companies of data-centric VCs such as DCVC, Battery, KPCB, Greylock, Shasta, and more.

The three all-day trainings are
Agile Data Science and End-to-end data pipelines with Spark Notebook
Developing Microservices
Introduction to Natural Language Processing

Come join us for the inaugural Data By the Bay week in May!



People in this
group are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy