"High Performance BigData Techniques Should Be Easy"

Hosted By
shlomi h. and Demi B.

Details

18:00 - 18:30 - Mingling
18:30 - 19:15 - A brave new object store world - deep dive into high performance analytics with the Stocator storage connector for Apache Spark - Effi Ofer @ IBM Research
19:15 - 20:00 - Big Data should be simple - Dori Waldman @ Inneractive

Title:
A brave new object store world - deep dive into high performance analytics with the Stocator storage connector for Apache Spark

Abstract:
With data growing voluminously and at high velocity, there is a need for an inexpensive and secure way to store and access it. Object store technologies, such as IBM Cloud Object Storage, enable big data analytics engines such as Apache Spark to cost-effectively analyze and derive value from vast quantities of data. In this talk we will review the differences between file stores and object stores and introduce Stocator, a new connector between Apache Spark and object stores which leverages object store semantics to achieve high performance while maintaining fault tolerance and speculative execution. We will deep dive into the low-level details of how Apache Spark communicates with an object store, compare different connectors such as s3a, Stocator, and Hadoop Swift, and present best practices for achieving high performance.

Bio:
Effi Ofer (https://www.linkedin.com/in/effi-ofer-91a261b0/) is a researcher at IBM Research in Haifa, currently researching data analytics and cloud object stores. Before joining IBM Research, Effi was a lead developer on IBM DB2, where he led successful projects in the areas of transaction management, high availability, and concurrency.

Title:
Big Data should be simple

Abstract:
For those in ad tech, big data means facing some very complex (and exciting) challenges. With data collected at every touchpoint, vendors are seeking to show the end user the product of massive aggregations in the simplest of ways. Everything is driven by data, from audience targeting to insights on user behavior, with the aim of driving better conversions and ROI. In an ever-changing and evolving industry, data is the competitive edge. In this session, we will touch the tip of the iceberg when it comes to the use of new-generation data services, and we will share insights from Inneractive’s experience in leveraging them to answer the market’s requirements.

Bio:
Dori Waldman (https://www.linkedin.com/in/doriwaldman/) is the Big Data Lead at Inneractive, a leading mobile advertising platform. For the past 2 years, Dori has been leading the effort to maximize the use of data in the company. Before joining Inneractive, Dori worked at HP as a Mobile & Full-stack expert and at RSA as a Software Engineer. Today, at Inneractive, he’s using technologies (Spark / Druid / Kafka / Cassandra) to develop the next generation of big data solutions to support the company’s growing data-based products.

Big Things
HaBarzel 8, Tel Aviv-Yafo