Skip to content

Big Data + Search

Photo of Arató Bence
Hosted By
Arató B.
Big Data + Search

Details

The topic for this meetup is Big Data + Search.

18:00 Welcome!

18:15 Short talks

Hadoop based ETL and Solr based semantic search behind Jobmonitor.hu
This talk will discuss how Hadoop and Solr is used to power the Hungarian job search site Jobmonitor.hu

Speaker: Károly Kása, Precognox Károly is the development manager at Precognox, the semantic search and text mining specialist company.

Search based user experience
Quick introduction to how National Instruments powers it’s web presence with Attivio (AIE) search engine based user experience. The talk will present how content is being pushed to the search index with a custom document enrichment process and what are the challenges.

Speakers: Barnabas Szasz, IT Manager for Search, CMS, eCRM at National Instruments and Tibor Borbely, Senior Programmer Analyst, Enterprise Search at National Instruments

Elastifying Workflow Dashboard
This talk discusses how we used Elasticsearch to build an unified dashboard for tracking and actioning work requests.

Speaker: Tamás Németh, senior Developer at Morgan Stanley

19:00-19:15 Break

19:15 Community announcements
Big Data related job offers, interesting upcoming meetups and conferences, and similar stuff.

19:20 Finding a needle in a stack of needles - adding Search to the Hadoop Ecosystem

Apache Hadoop is enabling organizations to collect larger, more varied data - but after it's collected how will it be found? Your users expect to be able to search for information using simple text based queries -- regardless of data location, size, and complexity.How do they quickly find information that's just been created, or been stored for months or even years? Cloudera Search Engineer Wolfgang Hoschek will present their solution to this problem; what architecture is necessary to search HDFS and HBase? How was Apache Solr, Lucene, Flume, MapReduce, HBase and Morphlines integrated to allow for Near Real Time and Batch indexing of documents? What are the solved problems and what's still to come? Join us for an exciting discussion on this new technology.

Speaker: Wolfgang Hoschek, Cloudera Wolfgang is a Software Engineer on the Platform and Cloudera Search team. He is a committer on the Apache Flume and Apache Lucene/Solr projects, a committer on the Kite project and the lead developer on Morphlines. He is a former CERN fellow and former Computer Scientist at Lawrence Berkeley Lab. He has 15+ years of experience in large-scale distributed systems, data intensive computing and real time analytics. He received his Ph.D from the Technical University of Vienna, Austria

20:00 Followup discussions

===========================================

Venue, drinks and snacks are provided by BalaBit IT Security (http://www.balabit.com/). This is an English speaking event.

Access map to the Balabit offices is here. (http://photos3.meetupstatic.com/photos/event/9/b/8/4/highres_332559812.jpeg)

Photo of Budapest Data & Analytics Meetup group
Budapest Data & Analytics Meetup
See more events
BalaBit
1117, Alíz utca 2 · Budapest