Skip to content

Taking Spark to the Clouds

Photo of Jim   Palmeri
Hosted By
Jim P.
Taking Spark to the Clouds

Details

Topic: Spark provides significant speed boosts over competing tools thanks to its memory-based architecture.According to stats on Apache.org, Spark can “run programs up to 100 times faster than Hadoop MapReduce in memory, or 10 times faster on disk.” Spark is typically deployed in a dedicated data center as a next step in an organizations big data deployment strategy to gain deeper and faster insights. However, as the advantages of big data in the cloud become more apparent and gain wider adoption, can organizations also reap the benefits of Spark as a service without sacrificing its primary benefit—speed? In other words, is Spark ready for the cloud? In this session, Ashish Dubey, Solutions Architect with Qubole, will explore the benefits of separating compute and storage. He’ll also explore use cases utilizing SparkSQL, SparkR, and other languages that can be used with Qubole’s Notebook UI.

BIO: Ashish Dubey is a Solutions Architect at Qubole with about 12 years of industry experience in various technology domains. Prior to Qubole Ashish has spent four years at Microsoft, contributed on Windows XP OS development. Later he worked for Teradata's consulting division and built several large scale BI/Big Data systems for some of the Fortune 500 clients in different industry verticals like finance, healthcare, retails and multimedia. For last 2.5 years Ashish has been helping Qubole customers, building large scale data solutions using technologies like Spark, Hadoop, Presto etc

Sponsors: Food and drink will be served

Amazon $50 Gift Card - You must be present to win.

http://bit.ly/1rUojBC

Sign up for a 15 day trial and bring your S3 data to the meeting and you will be shown how to get immediate value with simplified provisioning, management and scaling of your big data analytics workloads: http://bigdata.agitaretech.com/request-a-qubole-trial/

Qubole: QDS is a self-service platform for big data analytics that runs on the three major public clouds: Amazon AWS, Google Compute Engine and Microsoft Azure.

http://photos4.meetupstatic.com/photos/event/e/5/c/4/600_450118820.jpeg

Agitare Technologies, Inc. provides strategy consulting, big data and managed services for enterprises.

http://photos3.meetupstatic.com/photos/event/a/3/c/c/600_337961932.jpeg

Photo of CloudTalk group
CloudTalk
See more events