Past Meetup

Big Data Pipelining - Discover how to use Spark, Cassandra and Docker

This Meetup is past

154 people went


**** You can find Simon's material from the Meetup on the "Files" section of the group (under the "more" drop down menu. ****

As part of the Big Data Week ( in London, the organisers have very kindly offered to join forces with us for this Meetup which will take place at the Westminster Impact Hub ( close the successful week. They have also very kindly offered us 5 tickets for the One track, so if you are interested please email me and I will send you the code. (First come first served basis - [masked]).

We know is Friday, so the venue is perfect for you to go out to central London after the Meetup finishes and we have made it a bit shorter than usual so you can enjoy your Friday night! We promise it will be worth it with an amazing Demo by Simon Ambridge!


6:00pm - 6:30pm: Doors open (Pizza and Beer to be served)
6:30pm - 7:30pm: Simon Ambridge
7:30pm - 8pm: Q&A & Networking


Simon Ambridge


Building Big Data Applications that scale from small micro services to large scale mission critical applications requires a scalable data architecture designed to ensure the integrity of the data ingested. This requires architecting the application so that it is highly-available and resilient in order to guarantee that no data is lost and service quality is maintained.

Simon will provide an introduction to a working environment that attendees can build to familiarise themselves with Docker, Spark and Cassandra and Spark-Notebook.This is an abbreviated version of the same content that was delivered in a half-day workshop at Devoxx last week.

The focus of the talk is to demonstrate how to quickly build a working environment that uses a Genomics dataset published by Data Fellas. It will show how to ingest, analyze and save this data. Importantly it will show how to visualize the data analytics process using the Spark Notebook.


Simon Ambridge is a DataStax Solution Engineer based in the UK. Simon has 25 years of experience in designing, building, implementing and supporting complex data management solutions with traditional RDBMS technology. Simons’s day job is to enthuse about Apache Cassandra and DataStax explaining to customers that the traditional approaches to data management just don’t cut it anymore in the new always on, no single point of failure, high volume, high velocity, real time distributed data management world.

Twitter Handle: @stratman1958


About the Big Data Week London, the organisers want you to know:

Since it’s a community driven event, one of its main focuses will be to connect data passionate professionals with fellow technologists and industry influencers and offer them the opportunity to talk through ideas, explore new solutions and learn from each other.

You can find more details about the talks and workshop sessions here (

Remember, using the code Cassandra_20off will bring you an extra discount of 20% from the standard prices! Also, as we are closing the end of sales, we are running these days our last Flash Sale on regular priced tickets – Buy any kind of ticket, and get the second one free.