MLMU #20 - Using Apache Spark


Details
(Meetup will be held in Czech)
Prezentace: Peter Zvirinský - Using Apache Spark
Abstrakt: Apache Spark is a fast and general engine for large-scale data processing, it is currently one of the most (hyped) active projects in the Hadoop ecosystem. Learn what Apache Spark is, how it works and how it differs from Hadoop MapReduce. This talk will cover the basics of Apache Spark and its various components like MLlib and SQL. We will also cover some of Spark’s latest features, which are supposed to make interactive data science easier. Live demo on an existing Hadoop Cluster will be included as well.
Jazyk prezentace: Slovenština
Místo: Paralelní Polis, Dělnická 43, Praha 7(Doporučujeme dopravu tramvají na zastávku Dělnická)
Program:
18:30 - (19:15) Prezentace
(19:15) - 20:00 Diskuze
20:00 - ... Networking v restauraci Kozlovna (cca 100m od Paralelní Polis)

MLMU #20 - Using Apache Spark