align-toparrow-leftarrow-rightbackbellblockcalendarcamerachatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-crosscrosseditfacebookglobegoogleimagesinstagramlocation-pinmagnifying-glassmailmoremuplabelShape 3 + Rectangle 1outlookpersonplusImported LayersImported LayersImported Layersshieldstartwitteryahoo

Large Scale ETL for Hadoop and Cloudera Search using Morphlines


6:00 - 7:00 Networking and Light Dinner (Thanks Google)

7:00 - 7:10 Announcements

7:10 - 8:45 Wolfgang Hoschek - Large Scale ETL for Hadoop and Cloudera Search using Morphlines

8:45 - 9:00 Q&A

Cloudera Morphlines is a new, embeddable, open source Java framework that reduces the time and skills necessary to integrate and build Hadoop applications that extract, transform, and load data into Apache Solr, Apache HBase, HDFS, enterprise data warehouses, analytic online dashboards, or other consumers. If you want to integrate, build, or facilitate streaming or batch transformation pipelines without programming and without MapReduce skills, and get the job done with a minimum amount of fuss and support costs, Morphlines is for you.

In this talk, you'll get an overview of Morphlines internals and explore sample use cases that can be widely applied.

Wolfgang Hoschek is a Software Engineer on the Platform team and the lead developer on Morphlines. He is a former CERN fellow and former Computer Scientist at Lawrence Berkeley Lab, and received his Ph.D from the Technical University of Vienna, Austria.

Join or login to comment.

Our Sponsors

  • Google

    Google provides the venue for most meetings.

  • New Relic

    New Relic, performance monitoring & analytics company, pays for pizza!

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy