Join part of the team at Overstock for lightning talks and discussions about how Overstock used Scala and other tools to build out their data flow and machine learning architecture. We'll have an intro to the Overstock architecture followed by three sessions of about 20 minutes each, with plenty of time for discussion and questions.

5:30 - Pizza
Starting at 6:00

Intro (Chris Robison)
Overview of the machine learning architecture and data flow at Overstock.

Getting Started with Scala, Spark, and Science (Victor Siu)
Let’s take a deep but shallow dive into using Scala with Spark and how you can use them to build an analytics pipeline. First, I’ll go over some of the basic distributed datatypes and how you can use them to perform calculations and aggregations. Next, we’ll build a simple model and then chain some ETL to the model with a simple Pipeline.

Moving to Streaming (Sean Seamonds)
We'll cover how we turned a batch classification problem into a near real-time process using Structured Streaming. We'll talk about Structured Streaming at a high level, then move on to a real business problem and how we migrated to a streaming architecture.

Fighting Fraud with Data (Stephen Merrill)
Large e-commerce companies like Overstock are targets for fraudulent activity. Detecting and preventing fraud is a combative affair, as we run up against bad actors who systematically and intelligently attempt to beat the system. Our best path to mitigation is developing a machine learning system that uses a plethora of data points to uncover trends and stop fraudulent orders in near real time. I’ll discuss our development of this ML system as well as the low-latency data flow that powers it.

The Utah Scala Enthusiasts (USE) group is dedicated to providing a strong and vibrant community for software developers interested in the Scala language and ecosystem.

