• The Open reception for all prior to the meetup will held on April 14, 2015 from 5:30 pm – 6:15 pm at Panoramic Hall, Level 5 The Square
• Your meetup is at Studio 211 at Level 2 at The Square
• The Meetup Room doors will open at 6:30 pm (they’ve to prepare the room from prior engagement and setting)
• Maximum Capacity is 77 people.
We start with an open reception from 17:30 to 18:15 offered by Hortonworks.
We already have 5 top speakers - 3 from the US / Hortonworks and 2 from Belgium.
Ofer Mendelevitch (http://www.linkedin.com/in/ofermend) Hortonworks
Ofer Mendelevitch is Director of data sciences at Hortonworks, where he is responsible for professional services involving data science with Hadoop. Prior to joining Hortonworks, Ofer served as Entrepreneur in Residence at XSeed Capital where he developed an investment strategy around big data. Before XSeed, Ofer served as VP of Engineering at Nor1, and before that he was Director of engineering at Yahoo! where he led multiple engineering and data science teams responsible for R&D of large scale computational advertising projects including CTR prediction (with Hadoop), a new front-end ad-serving system and sales tools.
Title: Using Natural Language Processing on Non-Textual Data with MLLib
Abstract: Word2Vec (https://code.google.com/p/word2vec/) is an interesting unsupervised way to construct vector representations of words to act as features for downstream algorithms or as a basis for similarity searches. We look at using the Spark implementation of Word2Vec shipped in MLLib to help us organize and make sense of some non-textual data by treating discrete clinical events (I.e. Diagnoses, drugs prescribed, etc.) in a medical dataset as non-textual "words".
Title: From Notebook to Data Storytelling (and beyond...)
Abstract: The Data Lake allows WEB innovative solutions such as the Datalayer Notebook to perform data analysis and communicate results with users. Let's see how we can make better to go to a real “Storytelling” approach.