GOTO Night on NoSQL with Hadoop, Apache Drill and Physical design on Graph Databases
We would like to invite you to a GOTO Night held on the occasion of the NoSQL Search Roadshow Zurich 2013.
GOTO Night Events are free evening community events, driven by involvement from both speakers and participants. Come along and learn something new!
1. Denormalizing Data Sets in Hadoop while Building Lucene Indexes by Dragan Milosevic
The invention of Hadoop made it possible to cost-efficiently collect and analyze petabytes of data in order to extract valuable information that critically supports the daily business of more and more companies. The collected data that supports the zanox business comes from its tracking systems, which generate billions of events on a daily basis, from search engines that provide cost information, and from master data about millions of advertisers and publishers. Efficient querying of the extracted information is then achieved by denormalizing the data sets and storing the results in Lucene indexes.
Dr. Dragan Milosevic is a certified Solr/Lucene, Hadoop and HBase developer and currently works as a senior architect at zanox in the distributed computing team, which uses various open-source projects to create a world-class reporting framework. He is the author of the book "Beyond Centralized Search Engines: An Agent-Based Filtering Framework", which describes the application of various machine-learning techniques for solving cooperation and coordination challenges in distributed systems.
2. Drill for interactive querying of heterogeneous, large-scale datasets by Michael Hausenblas
Apache Drill is a distributed system for the interactive analysis of large-scale datasets, inspired by Google's Dremel technology. It is designed to scale to thousands of servers and to process petabytes of data in seconds, enabling SQL-on-Hadoop and supporting a variety of data sources.
Michael works at MapR Technologies as Chief Data Engineer EMEA. His background is in large-scale data integration research and development, advocacy and standardisation. He has experience with NoSQL databases and the Hadoop ecosystem.
3. Physical design on Graph Databases - an example from InfiniteGraph presented by Timo Wagner
Database application design is often described as a combination of logical and physical design: logical design is the process of creating a logical schema that provides the persistence-related functionality needed by the application, while physical design is the process of determining how that logical schema is to be represented in the database.
Typical physical design for applications built upon relational databases involves creating an internal schema (defining the tables), mapping the logical schema to the internal schema (classes to tables), and choosing various DB-specific storage options for the internal schema components (tables and sometimes table fragments).
Timo Wagner currently works as Senior Technical Consultant EMEA for Objectivity, Inc. He has more than 10 years of experience in the SQL and NoSQL database field as a software architect and developer. He was part of the core development team that redesigned the Sones Graph/DB. With his experience in developing and managing large data sets, he is a specialist in distributed graph databases, data quality and big data.
More info and online registration: http://nosqlroadshow.com/nosql-zurich-2013/upcomingevents/
(To save a spot, please click RSVP and also register on the NoSQL website!)