Skip to content

Big Data Science Meetup Event

Photo of Shyam Sarkar
Hosted By
Shyam S. and 2 others
Big Data Science Meetup Event

Details

This event is sponsored by Huawei Technologies, Santa Clara, CA. Our new location is Huawei Technologies, Building A Cafeteria, Santa Clara, CA 95050.

5:30 P.M. - 6:00 P.M. Networking

6:00 P.M. - 7:30 P.M. Session 1

Title: The State of OpenStack Data Processing: Sahara

Speaker(s): Sergey Lukjanov, Principal Software Engineer, Sahara PTL, Mirantis

and Andrew Lazarev, Senior Engineer, Mirantis

Abstract:

The Sahara project (ex. Savanna), an official integrated OpenStack project in the Data Processing program, provides users the ability to provision and manage Hadoop clusters on OpenStack.

The focus of the project is on two primary use cases:
on-demand cluster provisioning and on-demand Hadoop task execution (Elastic Data Processing).

In this talk, we will provide an overview of project Sahara, its main goals and focus, and a tour of the most interesting Sahara features. We will discuss:

  • Cluster provisioning using different plugins - Vanilla (Apache Hadoop) plugin, Hortonworks Data Platform plugin, Cloudera Distribution including Hadoop
    plugin, Spark plugin;

  • Pluggable EDP that provides implementation for Oozie-based workloads execution for all Hadoop plugins and implementation for Spark;

  • Roadmap for the next OpenStack release.

We'll show the demo of main Sahara features including cluster provisioning and demonstration of running workloads using EDP.

After attending this session, you will have a good understanding for what is the Sahara now and where it’s going.

Speaker's bio:

Sergey Lukjanov is the Project Technical Lead of Sahara project and Principal Software Engineer in Mirantis. He has been involved in the project from the first days. One of his main responsibilities is architecture design and community-related work in Savanna. Also he is a top contributor and reviewer of Savanna and he oversees
all Launchpad and Gerrit activity. Sergey is experienced in Big Data projects and technologies (Hadoop, HDFS, Cassandra, Twitter Storm, etc.) and enterprise-grade solutions. He implemented HA for Twitter Storm and Sergey is contributing to different open source projects now including Twitter Storm and OpenStack. Also, he's currently the OpenStack Infrastructure core/root team
member.

Andrew Lazarev is a Senior Engineer in the Savanna group at Mirantis. Andrew joined Mirantis 9 years ago and has walked the whole road along with the company, from
low level networking for Cisco to Big Data for PayPal and Attensity and finally to open source OpenStack. Andrew started work on OpenStack in early 2013 and now actively contributes to the Savanna project.

7:30 P.M. - 8:00 P.M. Q/A

8:00 P.M. - 9:00 P.M. Networking

Coffee and Snacks will be available.

Photo of Big Data Science group
Big Data Science
See more events
Huawei Technologies, Building A Cafeteria
2330 Central Expressway · Santa Clara, CA