Data Product Architectures - with Benjamin Bengfort

Hosted by Seattle Data Geeks

Public group

This is a past event

79 people went

Whitepages

1301 5th Avenue #1600 · Seattle, WA

How to find us

only the West doors of the Rainier Square building will be unlocked after 5 pm. Please make sure to enter the building from 4th St.

Location image of event venue

Details

NOTE: only the West doors of the Rainier Square building will be unlocked after 5 pm. Please make sure to enter the building from 4th St.

Benjamin Bengfort (https://www.linkedin.com/in/bbengfort) of District Data Labs (http://www.districtdatalabs.com/) is coming to town for Data Day Seattle (http://datadayseattle.com/). We asked him if, while he was in town, he would spend an evening with the community. He said yes. This is a great opportunity.

If your company would like to host this presentation, send a note to data at lynnbender dot com ([masked]).

Data Product Architectures

Data products derive their value from data and generate new data in return; as a result, machine learning techniques must be applied to their architecture and their development. Machine learning fits models to make predictions on unknown inputs and must be _generalizable_ and _adaptable_. As such, fitted models cannot exist in isolation; they must be operationalized and user facing so that applications can benefit from the new data, respond to it, and feed it back in to the data product. Data product architectures are therefore _life cycles_ and understanding the data product life cycle will enable architects to develop robust, failure free workflows and applications. In this talk we will discuss the data product life cycle, explore how to engage a model build, evaluation, and selection phase with an operation and interaction phase. Following the lambda architecture, we will investigate wrapping a central computational store for speed and querying, as well as incorporating a discussion of monitoring, management, and data exploration for hypothesis driven development. From web applications to big data appliances; this architecture serves as a blueprint for handling data services of all sizes!

Speaker Bio

Benjamin Bengfort (https://www.linkedin.com/in/bbengfort) is a Data Scientist who lives inside the beltway but ignores politics (the normal business of DC) favoring technology instead. He is currently working to finish his PhD at the University of Maryland where he studies machine learning and distributed computing. His focus is on highly consistent local distributed storage and visual diagnostics for data modeling. The lab next door does have robots and, much to his chagrin, they seem to constantly arm said robots with knives and tools; presumably to pursue culinary accolades. Having seen a robot attempt to slice a tomato, Benjamin prefers his own adventures in the kitchen where he specializes in fusion French and Guyanese cuisine as well as BBQ of all types. A professional programmer by trade, a Data Scientist by vocation, Benjamin's writing pursues a diverse range of subjects from Natural Language Processing, to Data Science with Python to analytics with Hadoop and Spark.

Agenda

6:30PM - Networking

7:00PM - Featured Talk

8:30PM - Adjourn to pub

If your company would like to host this presentation, send a note to data at lynnbender dot com ([masked]).