Getting Started with Spark & Cassandra And using pySpark with Google Colab

Questo è un evento passato

Hanno partecipato 64 persone

Immagine del luogo dell'evento


Another must-attend meetup is coming with two interesting talks, given respectively by Bruno Guedes (Datastax) and Mario Cartia (AgileLab).

Below the event program:

1- Getting Started with Spark & Cassandra

How do we analyze big amount of data, whether it be massive batching or as it’s ingested via streaming ?
Enter Apache Spark. Get ready to rock with the most powerful distributed database and analytics platform on the planet!
Massively scalable, always on, and very fast. Apache Cassandra is the database chosen by Apple, Netflix, and 30 of the Fortune 100 to power their critical infrastructure.
Challenging MapReduce head on, Apache Spark offers powerful constructs that make it possible to slice and dice your data, as well as transformations with functional programming
backgrounds such as map, filter, and reduce.

2- Getting started with pySpark in 5 minutes using Google Colab

Apache Spark is the Big Data opensource framework used by the world's leading companies for the implementation of advanced analytics. The talk will introduce the architecture, the modules and the main functionalities of the framework showing some practical examples in Python that do not require the installation of any software on your machine using the Colab tool made available by Google.

And in the end FREE Drinks for everyone offered by Agile Lab!!!

We hope you will join us!
Please, send your RSVP as soon as possible in order to let us organize the event in the best way possible.

See you all on the 19th