Scala is a functional programming language on the JVM. Hadoop uses a functional programming model to represent large-scale distributed computation. Scala is thus a very natural match for Hadoop. Twitter uses Scalding for data analysis and machine learning, particularly in cases where we need more than sql-like queries on the logs, for instance fitting models and matrix processing. It scales beautifully from simple, grep-like jobs all the way up to jobs with hundreds of map-reduce pairs.
We are pleased to have Dr. Oscar Boykin and Dr. Argyris Zymnis from Twitter to give us some insight on Scalding DSL and provide some example jobs for common use cases. They will also be including a hands-on session where participants can try scalding on their local machine to run a realistic job.
MORE ABOUT THE PRESENTERS...
Presenters Dr. Oscar Boykin and Dr. Argyris Zymnis are the main scalding committers. Both are at Twitter and work on the Revenue Quality Engineering team. Both are often bearded but generally discourage use of the term "neckbeard".