Mar 26, 2013 · 7:00 PM
Ideally I would like this to be a sort of panel where people share their large Data Integration and Change Data Capture problems. I would like to hear the larger or unique integration challenges regardless of whether or not you are using Hadoop.
1. Data Integration (ETL and otherwise) challenges
2. Change Data Capture (CDC) challenges
3. Hadoop, NoSQL, MPP Data Integration challenges
If you can, please submit 1-3 slides describing the issue (A diagram and some points) and I will put the whole deck together for all to consume. Alternatively you can whiteboard the concept but then no one can take it away after. The goal here is for people with relevant experience to come together and be aware of different approaches, how Hadoop may or may not fit in, and how other newer technologies are handling it.
I will cover some common Hadoop Integration scenarios that my colleagues and I have come across. I will also summarize LinkedIn's recently Open Sourced DataBus as an example of new technologies available (if anyone else has concrete experience with this or would like to summarize it then please feel free to take over that aspect of the discussion). http://engineering.linkedin.com/data-replication/open-sourcing-databus-linkedins-low-latency-change-data-capture-system
To that end, I would rather have more participants than observers if possible, but I suspect there will be many that would benefit from listening to the conversion. Please consider what you will bring to the discussion (questions or answers are welcome) before you RSVP.