Skip to content

Know Your Distributed Tools, Apache Tez and Spark

Photo of Demi Ben-Ari
Hosted By
Demi B. and shlomi h.
Know Your Distributed Tools, Apache Tez and Spark

Details

http://photos2.meetupstatic.com/photos/event/e/a/0/8/600_445799912.jpeg

18:00 - 18:30 - Mingling
18:30 - 19:15 - Monitoring Apache Spark Applications - Real World Examples - Tzach Zohar @ Kenshoo

Abstract:
Spark Monitoring - Know Your Cluster
Spark is quickly becoming the most popular framework in the MapReduce family. With better performance and much better APIs - it's easier than ever to perform the actual data wrangling; But as always - the challenges of operating, verifying and optimizing your application over time are much greater than the initial setup - and all the more so with distributes systems. In Kenshoo, we've used and developed some tools and techniques to monitor the state of our Spark application: health, correctness, performance, utilization, and business KPIs. We'll discuss some standard tools and less standard techniques to get the most information out of your Spark cluster. Bio: Tzach Zohar
Architect and developer with 10 years experience, specializing in building high-scale enterprise solutions, from whiteboard brainstorming to hands-on coding. Interested in optimizing development throughput, quality and efficiency by using the best tools and techniques. Mostly Java oriented, but fluent in python and Scala as well. Was part of the Kenshoo team from the very beginning, playing various roles (developer, dev team lead, chief architect and architect), nowadays focused on improving software craftsmanship and scale of our team and systems.

19:15 -20:00 - Data Plumbing with Apache Tez - Gal Vinograd - Big Data Kolboynic @ Crosswise

Abstract:

At Crosswise we try to find devices that belong to the same user, based only on the their behaviour and characteristics without actually knowing anything about the user ;).
We do this by running an embarrassingly complexed and large-scaled pipeline in which we constantly optimize. In this session, I'll share my experience with Apache Tez, pros and cons, and how it helped us to improve both our run times and resource utilization. After this session you'll have a better understanding of what kind of problems Tez solves, how it does it and where it should be applied.

Bio: Gal Vinograd
I’m a Programmer, a techie and everything in between :)
I'm a very hands-on kind of guy, and currently focused on solving scalability and big data challenges at Crosswise.

http://photos1.meetupstatic.com/photos/event/5/a/3/d/600_445823101.jpeg

Photo of Big Things group
Big Things
See more events
Kenshoo
HaBarzel 8 · Tel Aviv-Yafo