PyData Cyprus #10 — Introduction to (Py)Spark
Détails
For this months’ meetup, the topic will be Introduction to big-data using (Py)Spark. Apache Spark is a unified analytics engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
This time we will have a lightning talk about uderline, by Argyris e. Argyrou - a STEM—run space in Limassol, an initiative committed to accelerating tech transformation in Cyprus.
After that, expect two 20-minute talks. The first one will an introduction to (Py)Spark by Argyris e. Argyrou, PhD candidate at CUT. The second one will be “Writing pyspark programs in Jupyter notebook” by Christos Christodoulou, Head of Data Insights at Motionlogic—Berlin, Germany.
Location:
Visit the link below and look for "1" (Andreas Themistokleous building).
Google map link
https://www.google.com/maps/d/u/0/viewer?mid=1s57_JLTiHNGg3jnKWwt-2upTRLU&ll=34.6748887910182%2C33.04482915928861&z=19
After you enter the building-Andreas Themistokleous, look for the "Lemesos" room. See ya on Thursday at 19:00.
#Talk 1
Title:
Introduction to big-data using (Py)Spark
Speaker:
Argyris e. Argyrou
Abstract:
Apache Spark is quickly gaining steam both in the headlines and real-world adoption, mainly because of its ability to process streaming data. With so much data being processed on a daily basis, it has become essential for us to be able to stream and analyze it in real time. I will do a introduction to (Py)spark for beginners.
Bio:
Entrepreneur, full stack developer, product manager, community leader, Ph.D. candidate in NLP, who loves to get hands-on with strategy, product, R&D, design, and coding.
Linkedin:
https://www.linkedin.com/in/argyrisargyrou/
#Talk 1
Title:
Writing pyspark programs in Jupyter notebook
Speaker:
Christos Christodoulou
Abstract:
Spark is a general purpose in-memory cluster computing framework suitable for large scale data processing. It evaluates in a lazy fashion, optimizes execution plans and smartly handles memory usage. In this presentation, I will show some examples of transformations and analysis of large-scale data sets written in jupyter notebook.
Bio:
I currently work with spatiotemporal data as the Head of Data Insights at Motionlogic, a daughter company of Deutsche Telekom, where we build algorithms that translate mobile signaling data to aggregated movement streams.
Before that I spent a lot of time in synchrotron facilities, investigating the complex interactions between molecules and two-dimensional materials to complete my Ph.D. in solid state physics.
I like being involved in communities that promote scientific communication and knowledge exchange.
Linkedin:
https://www.linkedin.com/in/chr7stos/

