Skip to content

Self-Service Data Exploration and Nested Data Analytics on Hadoop

L
Hosted By
Luca F.
Self-Service Data Exploration and Nested Data Analytics on Hadoop

Details

Dear all,

On April 7th, at 7pm we'll hear from John Benninghoff (https://www.linkedin.com/in/johnbenninghoff), Chief Data Engineer at MapR Technologies, about Apache Drill (http://drill.apache.org/).

Apache Drill is a new solution from The Apache Software Foundation that holds the promise to make data exploration on pretty much any data source as easy as writing SQL queries.

Here's the abstract of the proposed talk:

Self-Service Data Exploration and Nested Data Analytics on Hadoop

SQL is one of the most widely used languages to access, analyze, and manipulate structured data. As Hadoop gains traction within enterprise data architectures across industries, the need for SQL for both structured and loosely-structured data on Hadoop is growing rapidly Apache Drill started off with the audacious goal of delivering consistent, millisecond ANSI SQL query capability across wide range of data formats. At a high level, this translates to two key requirements – Schema Flexibility and Performance.
Apache Drill provides the users the ability to interact with big data on Hadoop much faster and far more easily using the familiar SQL language. Users are no longer dependent on central IT teams and DBAs to produce schemas and then maintain them when the structure changes for a few records. Drill alleviates the pain associated with structuring unstructured data before one gains any insights by providing a simple mechanism to query any dataset on Hadoop - be it flat files, parquet or JSON files or tables within an HBase table.
This session will give you an overview of several different use cases that enterprises are testing Drill for.

---

I think that a technology such as Apache Drill that allows to quickly capitalize on legacy datasources without the need of deploying custom ETL for each of them could be very beneficial for many small/medium size companies in the Santa Barbara area.

Please give widest distribution!

Pizza and soft drinks will be provided by MapR.

The talk will be in Adams Center 217.

Photo of Santa Barbara Data Science group
Santa Barbara Data Science
See more events
Westmont College
Montecito, CA · Montecito, CA