Emerging Big Data Technologies/Frameworks - Series 1

Details
We will discuss on new emerging big data technologies/frameworks:
Series 1 : (5th Sept 2015)
- Parquet: Efficient columnar storage (to Hadoop/Spark) 2. Apache Drill: Schema free SQL query engine for Hadoop, NoSQL
- Apache Flink : scalable batch and stream data processing
Agenda:
10.00 - 10.15am - Basic Introduction to Columnar Storage
10.15 - 10.45am - Parquet
10.45 - 11.15am - Dremel and Apache Drill (Demo)
11.15 - 11.30am - Break
11.30 - 12.00pm - Apache Flink
12.00 - 12.30pm - Flash Talks
Postponed to Series 2:
- Presto : Distributed SQL Query Engine for Big Data 5. Tachyon : A memory-centric distributed storage system 6. BlinkDB : Queries with Bounded Errors and Bounded Response Times on Very Large Data e.g. query on 10TB in 2sec using sampling (If time permits)
Please come with laptop (having unix/linux os), if you are planning to explore more. We would be trying out one or two frameworks.
All, As RSVP is closed, so please fill up this form, if you have not RSVPed earlier : http://goo.gl/forms/XXhj4MNyi6
Location Related queries:
Mayank : 9030710846
Rahul : 9908599937

BigData-Blockchain-AI-ML-XR-3Dprinting-Automation-Robotics
See more events
Thoughtworks
3rd Floor, Apurupa Silpi, Beside H.P. Petrol Bunk (KFC Building), Gachibowli, Hyderabad - 500032 · Hyderabad
Emerging Big Data Technologies/Frameworks - Series 1