I am excited to announce our next meetup, featuring Patrick Hunt from Cloudera. Patrick will talk about text search on Hadoop and about the new search product released by Cloudera.
6pm - 6:30pm : enjoy food + drinks and network
6:30pm - 7:30pm : talk : text search on Hadoop by Patrick Hunt
7:30pm - 8pm : more food & networking
This event is generously hosted by Samsung! (Yummy food & drinks!!)
The event is FREE to attend.
Talk Abstract: Finding a needle in a stack of needles - adding Search to the Hadoop Ecosystem
Apache Hadoop is enabling organizations to collect larger,
more varied data - but after it's collected, how will it be found? Your
users expect to be able to search for information using simple
text-based queries -- regardless of data location, size, and complexity.
How do they quickly find information that's just been created, or been
stored for months or even years? Cloudera Search team lead Patrick
Hunt will present their solution to this problem. What architecture is
necessary to search HDFS and HBase? How were Apache Solr, Lucene, Flume,
and MapReduce integrated to allow for Near Real Time and Batch
indexing of documents? What are the solved problems and what's still
to come? Join us for an exciting discussion on this new technology.
About the Speaker -- Patrick Hunt
Patrick Hunt is a major contributor and PMC member on the Apache
ZooKeeper project. He joined Cloudera more than three years ago and
currently leads the Cloudera Search team. Before joining Cloudera, he
spent five years at Yahoo! where he specialized in large-scale
distributed systems architecture. During this time he also drove
development of the critically acclaimed Delicious Firefox extension.
Prior to joining Yahoo, Patrick worked on enterprise software, founded
a startup, designed high performance network software, and dabbled a
bit with rocket science (NASA). He has a bachelor's degree in Computer
Systems Engineering from Rensselaer Polytechnic Institute.
See you all there!