
SF:Accessing External Hadoop Data Sources using Pivotal Xtension Framework (PXF)

Hosted By
Tamao N.

Details

Accessing External Hadoop Data Sources using Pivotal Xtension Framework (PXF) with Sameer Tiwari

Schedule:
5:30-6:30 pm Pizza and networking
6:30-8:00 pm Talk and Q&A
8:00-8:30 pm Wind down

Pivotal Xtension Framework (PXF) is an external table interface that provides SQL access to data stored within the Hadoop ecosystem. It enables loading and querying of data stored in HDFS, HBase, and Hive, and it supports a wide range of data formats, including Text, Avro, SequenceFile, and RCFile, as well as Hive and HBase tables.
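
For readers new to external tables, here is a minimal sketch of what defining one over an HDFS text file might look like. It only assembles the SQL text; it does not connect to HAWQ. The helper name, table name, columns, host, port 51200, file path, and the `HdfsTextSimple` profile are all illustrative assumptions, and the exact `LOCATION` syntax may differ between PXF releases.

```python
def pxf_external_table_ddl(table, columns, host, port, path, profile,
                           fmt="TEXT", delimiter=","):
    """Build a CREATE EXTERNAL TABLE statement pointing at a PXF location.

    Hypothetical helper for illustration only: it performs no SQL escaping
    and does not validate the profile name against a PXF installation.
    """
    # Column definitions, e.g. "id int, amount float8"
    cols = ", ".join(f"{name} {sqltype}" for name, sqltype in columns)
    # Assumed URI shape: pxf://<host>:<port>/<path>?PROFILE=<profile>
    location = f"pxf://{host}:{port}/{path}?PROFILE={profile}"
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols})\n"
        f"LOCATION ('{location}')\n"
        f"FORMAT '{fmt}' (DELIMITER '{delimiter}');"
    )


# All values below are made up for the example.
ddl = pxf_external_table_ddl(
    table="sales_ext",
    columns=[("id", "int"), ("amount", "float8")],
    host="namenode",
    port=51200,
    path="data/sales.csv",
    profile="HdfsTextSimple",
)
print(ddl)
```

Once such a table exists, it can be queried with ordinary SQL alongside in-database tables.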

Example use cases include applying statistical and analytical functions from HAWQ (e.g. MADlib) to HBase or Hive data, joining in-database dimensions with HBase facts, leveraging analytical capabilities on Hadoop data files of various kinds, and fast ingest of data into HAWQ for in-database processing and analytics.

PXF is in the process of being open-sourced.

About the speaker: Sameer Tiwari

Sameer has been building platform products for large deployments since the Application Server days at Sun Microsystems. He started working on big data prior to the invention of Hadoop, in the field of email archiving/search. Recently he was working on Ad-Serving and User Platform systems at Yahoo.

He is the Hadoop Architect at Pivotal, Inc., building next-generation systems for Big Data analytics. See more at: http://blog.gopivotal.com/author/sameertiwari

Data Engineers Guild
Pivotal Labs
875 Howard Street 5th Floor · San Francisco, CA