Emerging Big Data Technologies/Frameworks - Series 1

Details

We will discuss new and emerging big data technologies/frameworks:

Series 1 (5th Sept 2015):

  1. Parquet: Efficient columnar storage (for Hadoop/Spark)
  2. Apache Drill: Schema-free SQL query engine for Hadoop, NoSQL
  3. Apache Flink: Scalable batch and stream data processing

Agenda:
10.00 - 10.15am - Basic Introduction to Columnar Storage
10.15 - 10.45am - Parquet
10.45 - 11.15am - Dremel and Apache Drill (Demo)
11.15 - 11.30am - Break
11.30 - 12.00pm - Apache Flink
12.00 - 12.30pm - Flash Talks

Postponed to Series 2:

  4. Presto: Distributed SQL Query Engine for Big Data
  5. Tachyon: A memory-centric distributed storage system
  6. BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data, e.g. a query on 10 TB in 2 sec using sampling (if time permits)

Please bring a laptop (running a Unix/Linux OS) if you plan to explore further. We will be trying out one or two frameworks.

All: since RSVP is closed, please fill out this form if you have not RSVPed earlier: http://goo.gl/forms/XXhj4MNyi6

Location-related queries:
Mayank : 9030710846
Rahul : 9908599937

BigData-Blockchain-AI-ML-XR-3Dprinting-Automation-Robotics
Thoughtworks
3rd Floor, Apurupa Silpi, Beside H.P. Petrol Bunk (KFC Building), Gachibowli, Hyderabad - 500032