Agile Data Science 2.0

Research Methods and Data Science Meetup
Research Methods and Data Science Meetup
Public group

IBM Glendale

400 N Brand Blvd 8th Floor · Glendale, CA

How to find us

please go to the 8th fl OR ask the security for assistance OR contact Alex at 626-552-2351

Location image of event venue


The demand is high for data science, and the current trend is favoring agile data science and prescriptive data science. So, let meet to learn and discuss about agile data science, together with the author of the book Agile Data Science 2.0.

This talk will be based on the book Agile Data Sciecne 2.0 published by O'Reilly 2017. Specifically, the book defines a methodology and a software stack with which to apply the agile data science methods. Here, the methodology seeks to deliver data products in short sprints by going meta and putting the focus on the applied research process itself. The stack is but an example of one meeting the requirements that it be utterly scalable and utterly efficient in use by application developers as well as data engineers. It includes everything needed to build a full-blown predictive system: Apache Spark, Apache Kafka, Apache Incubating Airflow, MongoDB, ElasticSearch, Apache Parquet, Python/Flask, JQuery.

This talk will cover the full lifecycle of large data application development and will show how to use lessons from agile software engineering to apply data science using this full-stack to build better analytics applications. The system starts with plumbing, moving on to data tables, charts and search, through interactive reports, and building towards predictions in both batch and realtime (and defining the role for both), the deployment of predictive systems and how to iteratively improve predictions that prove valuable by building an experimental setup.

Our speaker will be Mr. Russell Jorney, the book author. Mr. Russell Jurney is principal consultant at Data Syndrome, a product analytics consultancy dedicated to advancing the adoption of the development methodology - Agile Data Science, as outlined in the book Agile Data Science 2.0 (O'Reilly, 2017). He has worked as a data scientist building data products for over a decade, starting in interactive web visualization and then moving towards full-stack data products, machine learning and artificial intelligence at companies such as Ning, LinkedIn, Hortonworks and Relato.

As before, Starbucks coffee and cookies will be provided.

The meeting will start at 7pm sharp, with presentation scheduled for about one hour, and a one hour Q&As expected to follow, plus networking after Q&A as always.

Parking along the Brand Blvd costs about $1.50 per hour, and you may find free parking in nearby small streets.