SF:Accessing External Hadoop Data Sources using Pivotal Xtension Framework (PXF)


Details
Accessing External Hadoop Data Sources using Pivotal Xtension Framework (PXF) with Sameer Tiwari
Schedule:
5:30-6:30 pm Pizza and networking
6:30-8:00pm Talk and Q&A
8:00-8:30 pm Wind down
Pivotal Xtension Framework (PXF) is an external table interface that gives SQL access on top of data stored within the Hadoop ecosystem. It enables loading and querying of data stored in HDFS, HBase and Hive. It supports a wide range of data formats such as Text, AVRO, Hive, Sequence, RCFile formats and HBase.
Example uses cases include using statistical and analytical functions from HAWQ (e.g. Madlib) on HBase or Hive data, Joining in-database dimensions with HBase facts, leveraging analytical capabilities on Hadoop data files of various kinds and fast ingest of data into HAWQ for in database processing and analytics.
PXF is in the process of being open-sourced.
About the speaker: Sameer Tiwari
Sameer has been building platform products for large deployments since the Application Server days at Sun Microsystems. He started working on big data prior to the invention of Hadoop, in the field of email archiving/search. Recently he was working on Ad-Serving and User Platform systems at Yahoo.
He is the Hadoop Architect at Pivotal, Inc. building the next generation systems for Big Data Analytics. - See more at: http://blog.gopivotal.com/author/sameertiwari#sthash.D6NH4fMa.dpuf

SF:Accessing External Hadoop Data Sources using Pivotal Xtension Framework (PXF)