What's New & Tuning Hive 0.11
- New SQL Features
- ORCFile Use
- Partitioning, Clustering, Bucketing
- Practical data workflow
We will take a break in June to enjoy the summer and Hadoop Summit 2013!
There will be minimal hands on for this one. I just need to hand over some Hive Tuning knowledge to the larger group. Most of what we cover will work with Hive 0.10 or Hive 0.11. All the new features, like the columnar format ORCFile, will be Hive 0.11 only.
Note that this will not cover tuning Cloudera's Impala, as the execution engine (and scheduling engine) is completely different from YARN or MapReduce. I'm happy to entertain a co presentation with anyone that has the knowledge and wants to cover the current tuning knobs in Impala however.