Real-Time Ingestion & Event Processing with Apache NiFi


Details
UPDATE: New location. We are in the Havana Room at the 4240 Duncan Building. It's the same location as last month.
UPDATE: Apache NiFi committer Joey Frazee will be in town to present, and go over uses cases across multiple verticals and industries! We’ll have plenty of beer for everyone!
As one of the fastest growing Apache project, the 100% open-sourced Apache NiFi became a top-level Apache project within just a few months, and has gained tremendous attraction over the past year.
Apache NiFi is an integrated data logistics platform for automating the movement of data between disparate systems. It provides real-time control that makes it easy to manage the movement of data between any source and any destination. It is data source agnostic, supporting disparate and distributed sources of differing formats, schemas, protocols, speeds and sizes such as machines, geo location devices, click streams, files, social feeds, log files and videos and more. It is configurable plumbing for moving data around, similar to how Fedex, UPS or other courier delivery services move parcels around. And just like those services, Apache NiFi allows you to trace your data in real time, just like you could trace a delivery. Seamless integration with other top-level Apache projects, such as Ambari, Ranger & etc., provides the necessary enterprise capabilities: end-to-end security, Audit, Compliance, Operations, Compliance, and Governance.
Speaker Bio:
Derek Sun is a Solutions Engineer with Hortonworks local to St. Louis. Derek is a seasoned IT professional with more than 15 years’ experience building high performance applications.
Before joining Hortonworks, Derek was a Big Data Architect at MasterCard for two years. During that period, he led the onshore and offshore support teams, and implemented the World’s 1st PCI compliant Hadoop enterprise cluster with encryption enabled at in-motion and at-rest levels for all Hadoop services. Well exceeded the PCI DSS 3.0 requirements to meet the more stringent MasterCard internal top security standards. During this period, he was also responsible for the architecture design as well as implementation for several MasterCard core projects utilizing Big Data technologies.
Before MasterCard, Derek was a Lead Software Engineer at ThomsonReuters for 9 years. He worked on several high-performance products, including offloading 30 years of TS history data, 300+ billion rows, from Oracle 11g to HBase.
We'll have food/drink at 6:30, and begin the presentation at 7:00pm.
The speaker and refreshments is sponsored by Hortonworks. The meeting space is sponsored by Cloudera.

Real-Time Ingestion & Event Processing with Apache NiFi