• Spark Structured Streaming : Introduction and Internals

    extraSlice - The Place for Tech

    Structured Streaming is a new stream processing engine built on Spark SQL, which enables developers to express queries using powerful high-level APIs including DataFrames, Dataset and SQL. In this meetup, we'll walk through the basics of Structured Streaming, its programming model and processing the data in Kafka with Structured Streaming. We'll use the techniques including Window operations, Watermarking and Triggers. The concepts will be illustrated using code examples. Presenter: Revin Chalil. Big Data Development Engineer. Expedia.

    8
  • Python for Data Science. Getting Started.

    extraSlice - The Place for Tech

    We will get started using Python for Data Science. We will explore 3 popular libraries used for data science : numpy, pandas and matplotlib. Participants who want to follow along the code examples are encouraged to have Python on their laptops. Instructions for getting started are provided here (https://github.com/abgoswam/ml-100-intro-to-dsml/tree/master/week0). This material is from Week 1 of the 6-week training program "Introduction to Data Science and Machine Learning (https://github.com/abgoswam/ml-100-intro-to-dsml)" offered by Z2 DataLabs (https://www.z2datalabs.com/data-science) Presenter: Abhishek Goswami. Software Engineer. Azure Machine Learning. Microsoft.

    4
  • Kafka : Introduction and Internals

    Location visible to members

    Apache Kafka (http://kafka.apache.org/) is publish-subscribe messaging rethought as a distributed, partitioned, replicated commit log service. In this meetup we will take a gentle introduction to Kafka, and also discuss some internals and usage patterns. Presenter: Abhishek Goswami. Software Engineer. Azure Machine Learning. Microsoft.

    4
  • Redis : Introduction and Internals

    Location visible to members

    Abstract : Redis has become one of the critical tools in a Data Engineers toolkit. In this meetup we will take a gentle introduction to Redis, and also discuss some internals and usage patterns. Presenter: Abhishek Goswami. Software Engineer. Azure Machine Learning. Microsoft.

    12
  • Data Science Fundamentals: An Introduction to Classifiers

    Location visible to members

    RSVP to this free workshop via Eventbrite HERE (https://www.eventbrite.com/e/data-science-fundamentals-an-introduction-to-classifiers-tickets-29212024953). An Introduction to Classifiers Classification is perhaps the most fundamental task in machine learning. Based on observations of a known training set, we infer which of several classes a new item belongs to. An example use of a classifier is inferring if a patient has an illness based on several early test results. We will use a publicly available data set and apply several of the most common classifiers. All code will use the standard python data science package scikit-learn. We will implement several classifiers and discuss how to compare their performance. Presenter: Brendan Farrell Brendan Farrell has a PhD in Applied Mathematics, taught at Caltech and has a dozen publications. He is the Founder and primary data scientist at HowLoud, Inc. He has implemented Machine Learning algorithms for commercial problems ranging from automated text analysis to traffic modeling to financial modeling. Prerequisites This is an introductory evening and will be accessible and informative for people starting to learn data science. Those with more experience will gain some new insights into the methods we discuss. Setup • Bring your laptop • Install python 2.7.x from https://www.python.org/downloads/ • Install scikit-learn using the instructions from http://scikit-learn.org/stable/install.html Schedule • 6:30 PM: Quick intro to Python and Scikit-learn • 7:00 PM: Classification Methods • 7:45 PM: Evaluating Performance and Comparing Methods • 8:15 PM:Question and Answer Period About Sponsor Z2 DataLabs is the Technology learning community in the Eastside and helps to skill up professionals in Data Science and Big Data Technologies by offering class-room training in Bellevue. To learn more about the full-fledged 48 Hours Data Science and Machine Learning training program, visit http://www.z2datalabs.com/data-science RSVP to this free workshop via Eventbrite HERE (https://www.eventbrite.com/e/data-science-fundamentals-an-introduction-to-classifiers-tickets-29212024953).

    2