Free registration at
Topics: "Infrastructure Enablers for Effective Large-Scale Data Pipelines” by Jeromy Carriere, Tech Lead, Google
Processing large amounts of data in a corporate setting requires infrastructural enablers to build and support effective large-scale data pipelines. Specifically, Jeromy will be pointing out some of the practicalities we uncovered as the space grew: reliable large dataset movement, metadata (e.g. for dup/replication management, provenance), reliable workflow systems, abstractions for programming, infrastructure version management. Writing a MapReduce function to run over a petabyte of data is the "easy" part.
BYOB (Bring Your Own Business-Case)
This is a new feature for the TDWI Boston chapter meeting. Bring your own business case or problem and we’ll discuss the issue together. Take advantage of the assembled brain-power spanning the technical and business side as well as multiple verticals. There’s always something to be learned from the issues that others are facing. TDWI chapter officers will have a couple of problem sets ready to seed the discussion.