SHUG 10. Adding Search to the Hadoop Ecosystem

NOTE: The meeting will start at 18:00 as usual (8 PM was previously chosen by mistake in the web UI on this page).

Title: Finding a needle in a stack of needles - adding Search to the Hadoop Ecosystem

Speaker: Wolfgang Hoschek


Apache Hadoop is enabling organizations to collect larger, more varied data - but after it's collected, how will it be found? Your users expect to be able to search for information using simple text-based queries -- regardless of data location, size, and complexity.

How do they quickly find information that has just been created, or that has been stored for months or even years? Cloudera Search engineer Wolfgang Hoschek will present their solution to this problem: what architecture is necessary to search HDFS and HBase? How were Apache Solr, Lucene, Flume, MapReduce, HBase and Morphlines integrated to allow for Near Real Time and batch indexing of documents? What problems have been solved, and what's still to come? Join us for an exciting discussion on this new technology.


Wolfgang Hoschek is a Software Engineer on the Platform and Cloudera Search team. He is a committer on the Apache Flume and Apache Lucene/Solr projects, a committer on the Kite project and the lead developer on Morphlines. He is a former CERN fellow and former Computer Scientist at Lawrence Berkeley Lab. He has 15+ years of experience in large-scale distributed systems, data-intensive computing and real-time analytics. He received his Ph.D. from the Technical University of Vienna, Austria.


Additional information

RSVP to the meetup 

Please RSVP to this meetup, since we need to put everybody on a guest list for entering the Spotify office. The event will be held in the cafeteria of the Spotify office, so don’t go to the normal entrance but to the 11th floor.

Pizza and drinks

Thanks to Spotify, pizza and beverages will be available for participants during the meetup. This is another reason to RSVP if you plan to come: it helps us estimate the number of pizzas and drinks based on declared attendance.

The entrance

The door will be open between 17:45 and 18:15.* Because of fire regulations, we need to keep a list of everybody in the building, so please make sure that your name gets ticked off the list at the entrance or, in the case of a +1, that the person at the door adds your name to the list.

*Unfortunately, we cannot leave the door open all the time (company security policy), nor have a person constantly watching for guests who arrive late. If you need to come later, please let us know in the comments below, so that somebody can come to the door and open it at the agreed time.

See you soon!

