Apache Drill: Learn the basics with Neeraja Rentachintala (MapR)

This is a past event

242 people went

Location image of event venue

Details

SQL is one of the most widely used languages to access, analyze, and manipulate structured data. As Hadoop gains traction within enterprise data architectures across industries, the need for SQL for both structured and loosely-structured data on Hadoop is growing rapidly Apache Drill started off with the audacious goal of delivering consistent, millisecond ANSI SQL query capability across wide range of data formats. At a high level, this translates to two key requirements – Schema Flexibility and Performance.

Apache Drill provides the users the ability to interact with big data on Hadoop much faster and far more easily using the familiar SQL language. Users are no longer dependent on central IT teams and DBAs to produce schemas and then maintain them when the structure changes for a few records.Drill alleviates the pain associated with structuring unstructured data before one gains any insights by providing a simple mechanism to query any dataset on Hadoop - be it flat files, parquet or JSON files or tables within an HBase table. Learn the basics of Apache Drill, from installing the tool to running your first query within minutes. The demo will show users how to install and setup instantly and start getting value out of data on their computers. For Drill queries, the data can be in any format or could be from any data source - including JSON, CSV, HBase or even MongoDB data. The talk will also cover interesting use cases, update on 1.0 Drill release and code snippets to query different data formats and data sources in a self service fashion without going through the pain of creating centralized schemas or metadata stores.

Meet the speaker:

Neeraja Rentachintala is Director of Product Management for MapR, where she is responsible for the product strategy, roadmap and requirements of all MapR SQL initiatives. Prior to MapR, Neeraja held numerous product management an engineering roles at Informatica, Microsoft, Oracle and Expedia, and was most recently the principal product manager for Informatica Data Services/Data Virtualization. Neeraja received a BS in Electronics and Communications from the National Institute of Technology in India, and a certification in software product management from the University of Washington.

Agenda:

6.00 - 6.45pm: Registration, Networking
6:45 - 7.00pm: Introductions
7:00 - 7:45pm: Talk + Demo by Neeraja
7:45 - 8.00pm: Q&A session
8.00 - 8.30pm: Networking