Rohit will talk about Docker and Spark. Here's his description: "I have created a single node Dockerized development environment showing a sample Spark pipeline using Python API for RDD for reading and writing to HDFS. It includes history server to check the lineage, parsing sysstat metrics using AWK and charting the output using R. Everything is automated using a single make command. Please let me know if there is an interest for me to present it. A mobile friendly presentation, complete with source code is available on Github at the location copied below.
We have room for a second speaker. I realize it is short notice, but if you have something you want to present, just let me know.
I will be there to help set up, but I have to leave before the meeting starts for a memorial service for a good friend of mine. I need someone to volunteer to pack up the projection equipment and hold on to it either until I can pick it up or until the next meeting.