Skip to content

Hadoop, the Data Lake, and a New World of Analytics

Photo of Kai Larsen
Hosted By
Kai L.
Hadoop, the Data Lake, and a New World of Analytics

Details

Join us for a night of truly big data as Hortonworks and Professor Han discuss cutting-edge developments including the Data Lake and Elastic OS. A short overview of the new Leeds M.S. in Analytics will be offered, and per tradition, free parking and Half Fast Subs to stave off the hunger after a productive day at work!

Presenters:
Jim Walker, Director of Product of Marketing, Hortonworks
Richard Han, Associate Professor, Computer Science, CU
Anne Sandoe and Gregg Macaluso (Directors , Graduate Programs)

Jim Walker: Hadoop, the Data Lake, and a New World of Analytics
Abstract: There has been an architectural shift in the data center… The Data Lake. Data architects have inserted Hadoop as a platform to present data as a service to applications and systems throughout the enterprise. More importantly, this new approach has become a central component of modern data analytics. Why?

Hadoop presents three key benefits that have not been possible until now.

  1. Cost effectively store everything so that you can run long term analysis over a more complete set of data
  2. Apply schema on analysis as opposed to schema on write so that you have the freedom to ask new questions
  3. Enable a data scientist to interact and iterate over data in a variety of ways from batch to interactive to search and streaming.

In this presentation, Jim Walker, Director of Product Marketing at Hortonworks will present the concepts of a Hadoop driven data lake and outline how they are affecting modern IT architecture and analytics teams.
Richard Han: Elasticizing the Linux Operating System for the Cloud

Abstract: Cloud systems today permit applications to scale to large numbers of servers, but this scaling often requires cumbersome manual scripting of virtual machines, the addition of distributed synchronization logic, and even code refactoring by developers. We propose to elastically and automatically scale applications, like large in-memory databases, web servers and data mining processes, by incorporating novel concepts that "stretch" memory address spaces and execution simultaneously over multiple cloud machines at the operating system level.

Anne Sandoe and Gregg Macaluso: Presenting the New Leeds M.S. in Analytics

Photo of CU Leeds Business Analytics group
CU Leeds Business Analytics
See more events
Koelbel building, Room S125 (downstairs from main entrance) · Boulder, CO