Solving the "first mile" of the data pipeline problem can accelerate analytics

Future of Data: Chicago
Future of Data: Chicago
Public group

Online event

This event has passed


This meetup is for technical and leadership audiences who want to learn how modernizing a data pipeline can transform their business.

Enterprises today deploy many types of stream processing and analysis applications to satisfy their business needs. Security Information and Event Management (SIEM) is a specific kind of enterprise application maintained to assist cybersecurity teams looking through billions of events to detect and prevent fraud and cyber attacks. Behind all of these applications are data pipelines used to collect, curate, and distribute data to support analytics.

As enterprises transition to collecting data from edge devices to either their on-premises data centers or the cloud, they realize that the traditional approach to data pipelines will not sustain the new need for faster, real-time analytics. For example, potential fraud indicators can no longer wait to be detected months after the fact when prevention needs to be done immediately.

In this meetup, Gal Shpantzer, a vCISO and Data Pipeline Architect, and Laura Chu, Senior Product Marketing Manager for Cloudera DataFlow, will discuss how he employed Apache MiNiFi, NiFi, and Kafka as the basis for a new data pipeline that transformed a Fortune 500 company's analytics strategy.

At the meetup we will discuss:
- Initial data pipeline challenges at the Fortune 500 company
- Key components of the data pipeline solution: Apache MiNiFi, NiFi, and Kafka
- Business value added by using this solution for streaming data pipelines

About the Speaker
Gal Shpantzer is an independent security professional working in the information security field since 2000. Gal is a vCISO and trusted advisor to CISOs of large corporations, startups, and non-profits such as top research universities and think tanks. Since 2014, Gal has focused on threats to availability such as ransomware and destructive attacks. As part of his research into lateral movement, he began to focus on high-speed telemetry for security response to fast-moving threats, which brought him to streaming analytics and forward-compatible data pipelines.

Gal brought this approach to his client, a Fortune 100 company’s security team, from concept to full implementation, using open-source tools such as MiniFi, NiFi, Kafka and Spark, enabling a petabyte data lake serving on-prem and multi-cloud sources, and multiple analytical tools, from cloud-native storage to commercial SIEM. This project enabled CISO-level objectives such as significantly reducing MTTD/MTTR and enabling a scalable approach to onboarding new sources, as well as distributing data to new consumers in a cost-effective manner, reducing licensing costs.

Gal has been involved in multiple SANS Institute projects, including co-editing the popular Newsbites newsletter since 2001, as well as articles, papers and webinars on CAPTCHAs, cyberstalking, hardware roots of trust, and most recently on GDPR’s impact on security programs as they balance privacy issues with visibility.

More of a “thought bleeder” than a thought leader, Gal contributed to the energy sector’s global privacy standard (NIST 7628) and to the ESC2M2 maturity model for Department of Energy’s smart grid security assessment framework. He has presented on various topics at conferences such as RSA, BSides, Shmoocon, Blackhat CISO Summit and at invite-only blue team conferences.

About the Host
Laura Chu is a tech marketeer over the last 10 years positioning and selling event-driven analytics at SAP in industries for Utilities, Retail, Transportation, and Manufacturing. She has extensive experience positioning modern analytics and AI for IoT, Network Analytics, Infrastructure Analytics, Regulatory Compliance, and Cybersecurity solutions at Pentaho, Hitachi Vantara, and Pure Storage.

In order to do our part to help "flatten the curve" of the spread of COVID-19, this will be an online event that will be held in the Central Daylight Time Zone.