Mar 2015 #2 - Hadoop with Python


Details
Join us Tuesday March 10th at the new iStrategyLabs for Hadoop with Python!
Donald Miner will do a quick introduction to Apache Hadoop, then discuss the different ways Python can be used to get the job done in Hadoop. This includes writing MapReduce jobs in Python in various different ways, interacting with HBase, writing custom behavior in Pig and Hive, interacting with the Hadoop Distributed File System, using Spark, and integration with other corners of the Hadoop ecosystem. The state of Python with Hadoop is far from stable, so we'll spend some honest time talking about the state of these open source projects and what's missing will also be discussed.
Donald Miner is an avid user of Apache Hadoop, a practitioner of data science, and a long-time Python user. He serves as Chief Technology Officer at ClearEdge IT Solutions, a company that provides Big Data professional services. He is author of the O’Reilly book MapReduce Design Patterns, which is based on his experiences as a MapReduce developer. Donald has architected and implemented a number of mission-critical and large-scale Hadoop systems within the U.S. Government and Fortune 500 companies. He received his PhD from the University of Maryland, Baltimore County in Computer Science, where he focused on Machine Learning and Multi-Agent Systems. He lives in Maryland with his wife and two young sons.

Mar 2015 #2 - Hadoop with Python