Aug 7 Meetup: Accelerate Your Big Data Analytics with Open Source Software

This is a past event

25 people went

99 Almaden Blvd

99 Almaden Blvd · San Jose, CA

How to find us

Our building is the one in brown color, with Union Bank logo, next to Zoom’s building. Free parking on nearby streets after 6 PM.

Location image of event venue


Enjoy a summer evening with pizza, beer, and some insightful tech talks, hosted and sponsored by Kyligence.

6:30 – 7:00 PM: Socialize over food and beverages
7:00 – 8:30 PM: Tech Talks

* Talk One: Best Practices for Using Apache Spark and Alluxio for Blazingly Fast Analytics

Speaker: Bin Fan, founding engineer & VP of Open Source at Alluxio. Bin Fan is the founding member of Alluxio, Inc. and the PMC maintainer of the Alluxio open source project. Prior to Alluxio, he worked for Google to build the next-generation of storage infrastructure and won Google's Technical Infrastructure award. Bin received his Ph.D. in CS from CMU.
Abstract: Apache Spark and Alluxio are cousin open-source projects that originated from UC Berkeley’s AMPLab. Running Spark with Alluxio is a popular decision, particularly for cloud and hybrid cloud environments. In this session, Bin Fan will briefly introduce Apache Spark and Alluxio, share the top ten tips for performance-tuning for real-world workloads, and demo Alluxio with Spark.

* Talk Two: Real-time OLAP in Apache Kylin v3.0

Speaker: Shaofeng Shi, one of the early creators of Apache Kylin, Apache Kylin committer & PMC, and Kyligence Chief Architect.
Abstract: At the beginning, Apache Kylin focused on massive historical data using OLAP, with Apache Hive as the main data source. In v1.6, Kylin started to support near real-time streaming from Apache Kafka, which reduces the time to inspect from hours to minutes. Meanwhile, the demand for real-time analytics hasn’t stopped. Last year, eBay contributed their Real-time OLAP feature to Kylin, and it has been evaluated and improved by the community ever since. As planned, this feature will be released in Kylin v3.0. During this session, Shaofeng will introduce the architecture and technologies behind Kylin 3.0’s Real-time OLAP.

99 Almaden Blvd, Ste 150, Cultivate Room, San Jose, CA 95113

Free parking on nearby streets after 6 PM.