Harnessing Data within Hadoop in the Connected World

Details
Join us at Streetside Brewery for the inaugural meetup: two great talks, plus food, drinks, and networking.
Agenda
• 6:00-6:30pm Food, drinks, networking
• 6:30-8:00pm Tech Talks
Talk #1: Harnessing Data within Hadoop in the Connected World
The Internet of Things (IoT) is starting to get more attention in the Big Data space, and with an estimated 30 billion things, that attention is well deserved. The number of instrumented items will continue to increase, but how do we keep up with all of that data? And better yet, how do we gain more value from it?
Hadoop has generally been known as a batch-oriented framework, but as the ecosystem evolves it is able to handle a more diverse set of workloads. With modern components within Hadoop, businesses and users can gain more value from IoT data by combining disparate controller data and external data sources in a single system while still serving low-latency requests.
At this meetup, attendees will learn more about the challenges of handling IoT data, the value of combining IoT data with various sources, and how to scale your solution to meet growing data speed and volume with Hadoop. This will include a demonstration of collecting, processing, and presenting IoT data from a collection of mocked things as well as combining that data with an external source to gain new insights.
Speaker #1:
Jason Hubbard is a Systems Engineer at Cloudera. He has five years of experience designing and building solutions on Hadoop. In his spare time he enjoys annoying his family with his various maker and home automation projects.
Talk #2: Integrating Realtime Data Streams with Spark and Kafka for Video and Data-Stream Analysis
Join Talend as they bring you a deep discussion on the real-world data technologies underlying modern professional sports. Working from actual data collected in the English Premier League, and using tools that you can use yourself, we will walk through the process of building an analytics package using Spark and Kafka to collect real-time instrumentation data and produce meaningful results in minutes. The code, tools, and content for this session are readily available online, and you can apply them to your own data science projects to accelerate your own real-time applications. If you are using Kafka, Spark, or any real-time data technologies, or even if you are just trying to get a better understanding of them, this event is for you.
Speaker #2:
Norbert Krupa is a Solutions Engineer with Talend. He has over 10 years in the data space, working in different industries and in various roles, from analyst to BI, database administration, consulting, and architecting high-volume distributed systems.
