Big Data Topic: Applying Testing Techniques to Hadoop Development

Details

FREE EVENT! Big Data Topic: Applying Testing Techniques to Hadoop Development

Find us on Twitter: @knowledgent. For info about this event, follow #BigDataPalooza.

Speaker: Mark Johnson, Sr. Director, Hortonworks, President of New England Java User Group (largest Java user group in the US)

Title: Applying Testing Techniques to Hadoop Development

"For many, Big Data application development means writing some MapReduce code here, some Pig scripts there, and some Pig or Hive user-defined functions somewhere else, then deploying the resulting code to the cluster and hoping it all works, or at least hoping that the data is so big that no one will notice the errors. Any testing that is performed often takes hours to run on large datasets in the cluster, making it hard to test early and test often. Fortunately, many of the testing techniques we have learned in more traditional data environments are also applicable in the world of 'Big Data', and there are even some new tools, such as PigUnit, available to efficiently validate our code prior to cluster deployment.

During this overview presentation, we will explore automated testing techniques for different types of Hadoop development tools (MapReduce and Pig)."

*Admission to this event is FREE. Knowledgent will sponsor pizza, beer, and non-alcoholic beverages for all attendees. Many thanks to AlleyNYC for hosting our event.

BIG DATA PALOOZA
Alley NYC
500 7th Ave, 17th Floor · New York, NY