Our own Alvin Henrick (https://www.linkedin.com/in/alvinhenrick) will lead a follow-up presentation on Apache Spark. He will take us on the next step of our learning journey, picking up where Paul (https://www.linkedin.com/in/pmhargis) so graciously left us in the last meeting. From my own experience working with Alvin, he is a pragmatic programmer, and I believe you will find his presentation full of practical examples.
In this meeting, he will discuss what Apache Spark is, why to use it, and how. Alvin will start from the basics and cover some background on functional programming. He will show how RDDs (Resilient Distributed Datasets) are at the heart of Apache Spark. Then he will review the job execution flow from Apache Spark's perspective.
Alvin's discussion will include some awesome hands-on exercises that perform transformations and actions on RDDs:
1) Simple Map Example.
2) Word Count.
3) Stack Overflow user popularity stats.
4) Twitter streaming example.
5) Naive Bayes Classifier (MLlib) to classify the sentiment of tweets.
I bet you will not be disappointed. Alvin will change your perspective on the way you program, design, and code. Of course, he is expecting to learn from you as well. You can follow him on Twitter; his handle is @alvinhenrick (https://twitter.com/alvinhenrick).
As an added bonus, Jim Bates (https://www.linkedin.com/in/jimsbates) from MapR (https://www.mapr.com) will provide a teaser on Apache Drill, which we will cover in a deep dive at our next meetup.
Thank you again to Improving Enterprise for providing the venue, food and drink.
So come and join us for food, drink, and a sparkful conversation.
See you soon.