We're pleased to welcome Dr. Donald Miner, author of MapReduce Design Patterns, to the NYDS meetup.
While many people know Hadoop is an important part of the data scientist's toolkit, understanding when and how to use Hadoop for data science isn't necessarily immediately obvious. All too often, Hadoop and data science are mashed together without an appreciation for the work that goes into making a data scientist productive on Hadoop.
Donald's talk will cover why and how to use Hadoop for data science, including diving into the nuts and bolts of doing data science work with Hadoop.
Donald's bio: Before joining ClearEdge, Donald was a Solutions Architect at EMC Greenplum, where he advised and helped customers implement and use Greenplum's big data systems. Prior to working with Greenplum, Donald architected several large-scale and mission-critical Hadoop deployments with the U.S. Government as a contractor. He is also involved in teaching, having previously instructed industry classes on Hadoop and a variety of artificial intelligence courses at the University of Maryland, BC. Donald received his PhD from the University of Maryland, BC in Computer Science, where he focused on Machine Learning and Multi-Agent Systems in his dissertation.
Want to hear Donald's perspective on when to use Pig vs. MapReduce? Come to the NYC Pig User Group Meetup on Wed, 8/21: http://www.meetup.com/NYC-Pig-User-Group/events/132731672/