Skip to content

Building a system for machine and event-oriented data, and analytics with Rocana

Building a system for machine and event-oriented data, and analytics with Rocana

Details

Introduction

Eric Sammer is the co-founder and CTO of Rocana. He is deeply entrenched in the open source community with a passion for solving difficult scaling and processing problems. Prior to Rocana, Eric most recently served as an Engineering Manager at Cloudera, responsible for developer tools and partner integrations. Eric’s team worked with hundreds of partners to develop robust solutions and integrate them tightly with Cloudera’s Enterprise Data Hub. He was previously a Principal Solutions Architect, working with customers and strategic partners to support and integrate Hadoop clusters and related infrastructure. While working with some of Cloudera’s largest customers, Eric developed many of the best practices for developing large, distributed, data processing infrastructure.

http://photos1.meetupstatic.com/photos/event/d/2/e/7/600_440873991.jpeg

Eric is a committer on a number of open source projects including Apache Flume, MRUnit, and the Kite SDK. Prior to Cloudera, Eric served as a Senior Engineer and Architect at several large scale data driven organizations including Experian and Conductor. Eric is the author of Hadoop Operations published by O’Reilly Media. He speaks frequently on technology and techniques for large scale data processing, integration, and system management.

http://photos4.meetupstatic.com/photos/event/d/8/c/3/600_440635491.jpeg

Session information:

In this session, we’ll follow the flow of data through an end-to-end system built to handle tens of terabytes an hour of event-oriented data, providing real-time streaming, in-memory, SQL, and batch access to this data. We’ll go into detail on how open source systems such as Hadoop, Kafka, Solr, and Impala/Hive can be stitched together to form the base platform; describe how and where to perform data transformation and aggregation; provide a simple and pragmatic way of managing event metadata; and talk about how applications built on top of this platform get access to data and extend its functionality. Finally, a brief demo of Rocana Ops, an application for large scale data center operations, will be given, along with an explanation about how it uses the underlying platform.

Schedule

6:30-7:00PM Mixer
7:30 - 8:15PM Intro and Presentation
8:15- 8:30PM Q&A

Photo of Silicon Valley Data Engineering group
Silicon Valley Data Engineering
See more events
Nest GSV
425 Broadway Street · Redwood City, CA