2 Talks: Hadoop and Spark - a perfect duo, Apache Toree...


Details
*Note, expedite check in at Galvanize; register here (https://www.eventbrite.com/e/hadoop-and-spark-a-perfect-duo-for-big-data-tickets-21794170952)
Agenda:
6 pm - 6:30 pm - Food and Networking
6:30 pm - 8:30pm - 2 Talks, Q&A
Talk 1: Hadoop and Spark - a perfect duo for Big Data
Hadoop is the original platform for Big Data. Hadoop is about 10 years old now. And we are seeing '2nd generation' technologies like Spark becoming popular.So is Spark replacing Hadoop? Is Hadoop at the end of its days?In this talk we will evaluate Hadoop and Spark with a neutral, practical perspective. We will high light their strengths and appropriate use cases.
Meet the Speaker:
Sujee Maniyam is a seasoned Big Data practitioner. He teaches and consults in Big Data technologies (Hadoop, Spark, NoSQL and Cloud). He is an open source contributor and author of 'Hadoop illuminated (http://hadoopilluminated.com/)' (an open-source book on Hadoop) and 'HBase Design Patterns (http://elephantscale.com/books/hbase-design-patterns/)'. Sujee is a frequent speaker at various conferences and meetups. He also advises and mentors various firms.
Sujee is a founder and principal at Elephant Scale (http://elephantscale.com/) that specializes in training and consulting around Big Data technologies.
Sujee's work can be found @ http://sujee.net (http://sujee.net/)
Talk 2: Apache Toree: How To Develop Spark/Scala Apps as Interactive Notebooks
Apache Toree provides the interactive notebook for Spark/Scala. Toree is a IPython/Jupyter kernel. It lets you mix Spark/Scala code with markdown, execute the notebook, and publish it on the web.
Asim will talk about how to install and get started with Apache Toree, how to use it to develop Spark applications interactively in notebooks, and how to publish your notebooks.
Prerequisites
Beginner. Familiarity with a programming language will be helpful.
Why should someone attend this talk?
After this talk you will be able to:
- Create Spark/Scala applications as Apache Toree notebooks.
- Publish your Toree notebooks on the web through GitHub.
- Use Apache Toree to develop Spark/Scala applications interactively.
Meet the Speaker:
Asim Jalis runs the Data Engineering program (http://www.galvanize.com/courses/data-engineering) at Galvanize. He has worked at Cloudera, Microsoft, and Salesforce. He has an MS in Computer Science from the University of Virginia, and an MA in Mathematics from the University of Wisconsin–Madison.

2 Talks: Hadoop and Spark - a perfect duo, Apache Toree...