Meetup at Allstate - Hadoop + Spark


Details
3/25/15 - We have speakers!
Tanya Schlusser is going to give a hadoop intro and Paul Koester is going to talk about Spark.
Hadoop is ubiquitous for big data storage and processing across all industries and Spark is rapidly becoming the preferred way of interacting with an hadoop cluster. Knowledge about these tools is a huge boon to any data scientist.
Remember that I'll need your full name if you're coming to the talk, and you'll have to bring ID.
3/11/15: I'm in the process of getting space at Allstate's campus in Northbrook. Topic is not yet decided, but some options are (1) another talk on GBM (I'm leaning towards the more mathy stuff but it's up to you all), (2)a Spark/MLLib talk, (3) something else. Any preferences?
You'll need to be signed up to get in, I'll need your full name in advance and you'll need to bring a photo ID.
old message:
Any ideas for a next talk? I got one suggestion for monte carlo / markov chain methods and/or regular expressions. I'm not an expert on either of those, so someone else would have to present. Here are some additional ideas:
-
A deep dive into the algorithm behind decision/regression trees and gbm. This would be more on the math side and hopefully you'd come away with an idea of exactly what is going on under the hood. I could give this.
-
General intro to R, with an emphasis on data science. R is a great tool (despite my constant complaints) for data scientists and is very commonly used in the field. I could give this.
-
Intro to some sort of data visualization / interactivity. I'm currently learning Shiny (an R package), but this could also be about Tableau, ggplot (R package), whatever. I could do this if it's on Shiny (assuming I learn it in time), but it would have to be someone else for those other packages.
-
Intro to hadoop: general infrastructure, hdfs, map reduce, some newer tools, etc. I could give this.
-Other ideas?
Also, on the ones where I said "I could give this", that doesn't mean I have to give it, just that I'm capable. If someone else knows about that stuff and wants to give a talk on that topic, let me know.
Also, how do people feel about doing a meetup at Allstate in Northbrook? I'm in the process of figuring out what I need to do to host an event there.

Meetup at Allstate - Hadoop + Spark