Intro on Drill: Self-Service Data Exploration & Nested Data Analytics on Hadoop


Details
Talk Description
SQL is one of the most widely used languages to access, analyze, and manipulate structured data. As Hadoop gains traction within enterprise data architectures across industries, the need for SQL for both structured and loosely structured data on Hadoop is growing rapidly. Apache Drill started off with the audacious goal of delivering consistent, millisecond ANSI SQL query capability across a wide range of data formats. At a high level, this translates to two key requirements: schema flexibility and performance.
Apache Drill lets users interact with big data on Hadoop much faster and far more easily using the familiar SQL language. Users are no longer dependent on central IT teams and DBAs to produce schemas and then maintain them when the structure changes for a few records. Drill alleviates the pain of structuring unstructured data before gaining any insight by providing a simple mechanism to query any dataset on Hadoop, be it flat files, Parquet or JSON files, or HBase tables.
This session will give you an overview of several different use cases that enterprises are testing Drill for.
Speaker Bio
William Peterson, MapR, Director of Product Marketing
William “Bill” Peterson is the Director of Product Marketing for MapR. Prior to MapR, William was the Director of Product and Solutions Marketing for CenturyLink Technology Solutions, where he was responsible for marketing, strategy and leadership for the company’s big data efforts. Prior to CenturyLink, Bill ran Product and Solutions Marketing for NetApp’s Big Analytics and Hadoop solutions. In addition to his marketing role at NetApp, Bill was the Marketing Co-Chair for the Analytics and Big Data committee, SNIA.
Prior to joining NetApp, Bill held leadership positions at IDC within the Software Consulting Group, and at Page One PR. Bill has also served as a research analyst at IDC and The Hurwitz Group, covering the operating environments, portals, content management and business intelligence markets. In addition, Bill was Director of Marketing for TurboLinux, where he led the S-1 team. Earlier in his career, he served as Vice President of Marketing for Venturcom, ran vertical solutions marketing for Computer Associates, and was an IT manager at Harvard University.
