
Spark as part of a Hybrid RDBMS Architecture - John Leach, Cofounder, Splice Machine

Hosted By
Subash D.

Details

This meetup is held in collaboration with the LA HBase Users Group and the LA Spark Users Group.

In this talk, we will discuss how we use Spark as part of a hybrid RDBMS architecture that includes Hadoop and HBase. The optimizer evaluates each query and sends OLTP traffic (including CRUD queries) to HBase and OLAP traffic to Spark. We will focus on the challenges of handling the tradeoffs inherent in an integrated architecture that simultaneously handles real-time and batch traffic. Lessons learned include:

- Embedding Spark into an RDBMS
- Running Spark on YARN and isolating OLTP traffic from OLAP traffic
- Accelerating the generation of Spark RDDs from HBase
- Customizing the Spark UI

The lessons learned can also be applied to other hybrid systems, such as Lambda architectures.
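
To make the routing idea in the abstract concrete, here is a minimal sketch of an optimizer that sends short OLTP queries to HBase and large scans to Spark. It is not Splice Machine's implementation: the `QueryPlan` fields, the `route` function, and the row-count threshold are all hypothetical names and values chosen for illustration, and a real optimizer would weigh far more than a single row estimate.

```python
from dataclasses import dataclass
from enum import Enum


class Engine(Enum):
    HBASE = "hbase"  # low-latency OLTP path
    SPARK = "spark"  # batch/analytic OLAP path


@dataclass
class QueryPlan:
    """Hypothetical optimizer output; field names are illustrative only."""
    sql: str
    estimated_rows_scanned: int
    is_write: bool = False


# Assumed cutoff for illustration, not a value from the talk.
OLAP_ROW_THRESHOLD = 100_000


def route(plan: QueryPlan, threshold: int = OLAP_ROW_THRESHOLD) -> Engine:
    """Pick an execution engine from a row-count estimate.

    Writes (the CRUD traffic mentioned in the abstract) stay on HBase;
    reads go to Spark once the estimated scan reaches the threshold.
    """
    if plan.is_write:
        return Engine.HBASE
    if plan.estimated_rows_scanned >= threshold:
        return Engine.SPARK
    return Engine.HBASE


if __name__ == "__main__":
    lookup = QueryPlan("SELECT * FROM orders WHERE id = 42", 1)
    report = QueryPlan(
        "SELECT region, SUM(total) FROM orders GROUP BY region", 5_000_000
    )
    print(route(lookup).value, route(report).value)
```

A single threshold keeps the sketch easy to follow; the isolation of OLTP from OLAP traffic that the talk lists as a lesson learned would sit on top of a decision like this one.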

Bio:

John Leach is the CTO and Co-Founder of Splice Machine. With over 15 years of software experience under his belt, John’s expertise in analytics and BI drives his role as Chief Technology Officer. Prior to Splice Machine, John founded Incite Retail in June 2008 and led the company’s strategy and development efforts. At Incite Retail, he built custom Big Data systems (leveraging HBase and Hadoop) for Fortune 500 companies. Prior to Incite Retail, he ran the business intelligence practice at Blue Martini Software and built strategic partnerships with integration partners. John was a key subject matter expert for Blue Martini Software in many strategic implementations across the world. His focus at Blue Martini was helping clients incorporate decision support knowledge into their current business processes utilizing advanced algorithms and machine learning. John received dual bachelor’s degrees in biomedical and mechanical engineering from Washington University in Saint Louis. Leach is the organizer emeritus for the Saint Louis Hadoop Users Group and is active in the Washington University Eliot Society.

Parking:

The best place for attendees to park is at the Westfield Century City Mall. Please note that they no longer offer free parking for the first 3 hours. Parking rates are as follows:

• 0-1 Hours: $1.00
• 1-2 Hours: $2.00
• 2-3 Hours: $3.00
• Every 30 minutes thereafter: $2.00
• 6+ Hours: $28.00

About the Venue:

Factual is a location platform that enables personalized and contextually relevant mobile experiences by enriching mobile location signals with definitive global data. Factual’s real-time data stack builds and maintains data on a global scale, with Factual’s core Global Places data covering over 65 million local businesses and points of interest in 50 countries. Factual’s platform also informs location with contextual demographic and commercial data, and offers cleaning and mapping services for business listings and points of interest.

Data Con LA Users Group
Factual Inc
1999 Avenue of the Stars, 35th Floor · Los Angeles, CA