Transformatorweg 38, Amsterdam
The second meetup in the Future of Data series has speakers from Pivotal and Hortonworks, and will focus on real-time (and near-real-time) reporting tasks on the basis of the Hadoop stack.
We thank Dell for kindly hosting us in their great office in the Amsterdam Sloterdijk area.
17:00 - Arrive, drink, eat
17:45 - Presentations
1. Install and Admin of Apache HAWQ on Hortonworks with Apache Ambari
Learn how to install and manage Apache HAWQ on a Hadoop Cluster.
Speaker: Tony van Büüren van Heijst, Pivotal
If you bring your laptop and complete the installation exercise, you will leave with a functioning HAWQ instance in a Hortonworks Sandbox for you to play around with on your own time.
Apache HAWQ <- Note LInk, is an elastic, parallel processing query engine that operates on all your data directly within Hadoop. It provides the highest degree of ANSI-SQL completeness to execute sophisticated queries for advanced analytics and data science.
In this session we'll cover:
1. Installation of baseline Hortonworks Sandbox
2. Installation and configuration of Apache HAWQ using Ambari
3. Tour of administrative capabilities of HAWQ using Ambari
4. Running a smoke test by executing queries with HAWQ
5. Connecting Apache Zeppelin to your HAWQ instance
To participate in the hands-on portion, please bring a laptop with the following:
VirtualBox 4.2 or later, or VMWare 5.0 or later installed Pre-downloaded Sandbox VM with HAWQ
Please download the lab image VM and uncompress it ahead of time.
Download size: 6GBUncompressed size: 13GB
You will need total drive capacity of 20 GB to expand, and can then delete the download file to reclaim 6GB.
Please download the lab VM at: https://s3.amazonaws.com/dbbhdb/HDB-Sandbox-2.01-vmware-2016101704064516.zip
2. To be announced
19:30 - Drinks and Networking
20:30 - Everybody out