Apache Spark

Hosted By
David V.

Details

Join us to learn about Apache Spark with guest speaker Nicolas A. Perez!

Apache Spark is a general-purpose, high-performance cluster computing framework widely used for big data. Spark is the new MapReduce: it removes much of the pain of writing long, complicated applications on Hadoop. Spark unifies different workflows under a simple, concise API, so we can build very complex pipelines in minutes.

In the new IoT world, enterprises want streaming solutions that let us use SQL and machine learning algorithms (over hundreds of terabytes) while we create and analyze complex social graphs. Spark lets us do all of that without compromising performance, running up to 100x faster than classic solutions on Hadoop.

There are a few key points about using Apache Spark. The first is ease of use. The next is performance, where no other framework can match it. The last is extensibility: every component is extensible enough that we can adapt it to any workload.

Spark is the future of high-performance computing, and it is something we want to get our hands on.

Our speaker is Nicolas A. Perez, a software engineer at IPC working on their Big Data platform. He is passionate about computing problems, compilers, and distributed systems. You can reach him here:

http://twitter.com/@anicolaspp

https://medium.com/@anicolaspp

Food and drinks will be provided.

Data Science Salon | South Florida
FIU Modesto A. Maidique Campus
11200 SW 8th St · Miami, FL