Topic: Introduction to Pig with Live Demonstration
Abstract: Pig is a language for expressing data analysis and infrastructure processes. Pig is translated into a series of MapReduce jobs that are run by the Hadoop cluster. Pig is extensible through user-defined functions that can be written in Java and other languages. Pig scripts provide a high level language to create the MapReduce jobs needed to process data in a Hadoop cluster.
This presentation will briefly introduce Pig, then the majority of the time will be used to demonstrate sample Pig jobs running on Hadoop.
Speaker Bio: Dan Marshall has over 30 years’ experience in the IT industry and has been in the trenches for several major shifts - Big Data being the latest. He currently works at Well Fargo in Chandler, AZ as a Big Data Engineer. Prior to that he worked at IBM for 17 years in various management and technical roles and had various IT roles before joining IBM. Dan has an M.S. in Computer Science from Rensselaer Polytechnic Institute and an MBA from Thunderbird School of Global Management.
Theatre Room - First Floor
The University of Advancing Technology
2625 W. BASELINE RD., Tempe, AZ