Big Data + Search

The topic for this meetup is Big Data + Search. 

18:00 Welcome!

18:15 Short talks

Hadoop based ETL and Solr based semantic search behind Jobmonitor.hu 
This talk will discuss how Hadoop and Solr are used to power the Hungarian job search site Jobmonitor.hu.

Speaker: Károly Kása, Precognox
Károly is the development manager at Precognox, a company specializing in semantic search and text mining.


Search based user experience
A quick introduction to how National Instruments powers its web presence with a user experience based on the Attivio (AIE) search engine. The talk will present how content is pushed to the search index with a custom document enrichment process, and what the challenges are.

Speakers: Barnabas Szasz, IT Manager for Search, CMS, eCRM at National Instruments and Tibor Borbely, Senior Programmer Analyst, Enterprise Search at National Instruments 


Elastifying Workflow Dashboard  
This talk discusses how we used Elasticsearch to build a unified dashboard for tracking and actioning work requests.

Speaker: Tamás Németh, Senior Developer at Morgan Stanley

19:00-19:15 Break

19:15 Community announcements
Big Data related job offers, interesting upcoming meetups and conferences, and similar announcements.

19:20 Finding a needle in a stack of needles - adding Search to the Hadoop Ecosystem

Apache Hadoop is enabling organizations to collect larger, more varied data, but after it's collected, how will it be found? Your users expect to be able to search for information using simple text-based queries, regardless of data location, size, and complexity. How do they quickly find information that has just been created, or that has been stored for months or even years? Cloudera Search engineer Wolfgang Hoschek will present Cloudera's solution to this problem. What architecture is necessary to search HDFS and HBase? How were Apache Solr, Lucene, Flume, MapReduce, HBase and Morphlines integrated to allow for Near Real Time and Batch indexing of documents? Which problems are solved, and what is still to come? Join us for an exciting discussion on this new technology.

Speaker: Wolfgang Hoschek, Cloudera
Wolfgang is a Software Engineer on the Platform and Cloudera Search team. He is a committer on the Apache Flume and Apache Lucene/Solr projects, a committer on the Kite project and the lead developer on Morphlines. He is a former CERN fellow and former Computer Scientist at Lawrence Berkeley Lab. He has 15+ years of experience in large-scale distributed systems, data-intensive computing and real-time analytics. He received his Ph.D. from the Technical University of Vienna, Austria.

20:00 Followup discussions

=========================================== 

Venue, drinks and snacks are provided by BalaBit IT Security. This is an English speaking event. 

Access map to the Balabit offices is here.



  • Arató B.

    The slides for all the talks are now available in the Files area: http://www.meetup.com/Big-Data-Meetup-Budapest/files/

    1 · February 20, 2014

  • Arató B.

    Thanks to everyone for the feedback, much appreciated.

    February 20, 2014

  • Tamás K.

    As I mentioned at the last meetup, our team will be an exhibitor at the next CEBIT, March 10-14. If you or anyone at your company would like to come, contact me via logdrill.com for free visitor tickets. We have A LOT of them.

    February 18, 2014

  • Bela E.

    Wolfgang's presentation was really informative, it saved the evening. :)

    February 13, 2014

  • Daniel D.

    Excellent host/organizer, okay presenters, good crowd.

    February 12, 2014

    • Daniel D.

      Whoops, I didn't realize this review would be public. I'm sorry, the presenters were great. The presentations were okay. It was my first meetup and it was great to see descriptions of actual projects. Very inspiring! But I wish there were more technical details. Thanks!

      February 13, 2014

  • Lorant D.

    The topics were interesting, but I expected deeper presentations with more details, even if that meant fewer presenters.

    February 13, 2014

  • Pajor G.

    I gave 4 stars, because I believe everything could be better :)

    February 13, 2014

  • Arató B.

    The slides for Wolfgang's talk are here:
    http://files.meetup.com/6895392/SearchOnHadoopBudapest.pdf

    February 13, 2014

  • Kovács B.

    I really enjoyed the Cloudera presentation by Wolfgang, but the others were also valuable! Thanks for that!

    February 12, 2014

  • Arató B.

    Thanks to everyone for updating the RSVPs. The waitlist is gone.

    February 12, 2014

  • Zoltán B.

    I'll fu at least... :)

    February 12, 2014

  • Ponori A.

    Many thanks for opening a spot for me!

    February 11, 2014

  • Laszlo T.

    I'm at the Startup Day on Wednesday which ends at about 6PM. Are you going to be tight on the agenda?

    February 10, 2014

    • Arató B.

      We try to be on time as much as possible, but delays still happen. As you can see in the schedule, we will be there at least until 8pm, so do come!

      February 11, 2014

  • Arató B.

    The schedule for the meetup is up!

    February 10, 2014

  • Arató B.

    This meetup will be hosted by Balabit, who will also provide snacks and drinks for us. The attendee limit has been raised to 140, so everyone on the waitlist is now in.

    1 · February 2, 2014

  • Arató B.

    The fourth speaker will be Tamás Németh from Morgan Stanley, who will talk about using Elasticsearch in a dashboard environment.

    February 2, 2014

  • Arató B.

    The third talk for the Meetup will be given by Barnabas Szasz and Tibor Borbely from National Instruments on search-based user experience.

    January 19, 2014

  • Arató B.

    Our second speaker will be Karoly Kasa from Precognox, who will present a case study about using Hadoop and Solr.

    2 · January 8, 2014

  • Arató B.

    I'm happy to announce our international guest speaker, Wolfgang Hoschek from Cloudera. Wolfgang is a software engineer on the Platform and Cloudera Search team. He is a committer on the Apache Flume and Apache Lucene/Solr projects, a committer on the Kite project and the lead developer on Morphlines. His talk will be titled: "Finding a needle in a stack of needles - adding Search to the Hadoop Ecosystem"

    5 · January 6, 2014
