Toronto Apache Spark #12


Details
Delivery of modern data processing applications in Transitioning Enterprises: Navigating common road bumps!
Agenda:
6:30PM to 7:00PM - Opening and networking
7:00PM to 8:00PM - Presentation - EPAM
8:00PM to 8:30PM - Networking
Presentation abstract:
Building useful near-real time data collection solution in an enterprise-level company is often a challenge. Introducing a new tool-set is only part of the challenge; introducing best practice emerging engineering methodologies is also required.
This talk is about the delivery of effective solutions based on Spark and satellite technologies - when the ‘square peg’ of modern data application development does not fit into the ‘round hole’ of enterprise company standards.
It focuses on our recent experiences in designing and delivering solutions for our clients:
• Spark for data collection and processing
• How to use Jenkins for (not just) CI/CD
• Spark apps automated deployment with minimal ops infrastructure
• Sensible automated testing that users can understand
Speakers:
Yuriy Bodnar (https://ca.linkedin.com/in/ybodnar)is tech lead and architect at EPAM Systems Canada and specializes in data apps. Most of the time Yuriy works with teams building data processing oriented applications with Spark, Apache Hadoop and Apache Kafka.
Robert Wierdsma (https://www.linkedin.com/in/robwierdsma) is a senior solution architect at EPAM. He has been working with data for many years and is now designing and delivering solutions that include a big data component.
Level: Intermediate
Target Audience: Data Engineer, DevOps
Broadcast Link:
-
YouTube Live (https://www.youtube.com/watch?v=9M_npJzH7ik)
-
Google Hangouts (https://hangouts.google.com/call/um66i2nfqnaoxfkpdxnazpxiwie)
Sponsor:
https://a248.e.akamai.net/secure.meetupstatic.com/photos/event/b/2/e/0/600_453045792.jpeg

Toronto Apache Spark #12