Past Meetup

Apache Cassandra, Apache Spark and Hadoop Meetup

This Meetup is past

58 people went

Location image of event venue

Details

Title: A leap forward for SQL on Hadoop

Abstract:

We will discuss the various SQL on Hadoop alternatives and explore why such
technologies have become an area of keen interest for many organizations.
We will discuss various alternatives such as Hive and Impala and their
pros/cons. We will share customer use cases for Warehouse Modernization
and the role of SQL for Hadoop in this space, as well as discuss
performance characteristics of SQL on Hadoop when running common query
workloads. We will provide an intro to Big SQL and demo. Big SQL delivers
some exciting capabilities including comprehensive SQL functionality that
leverages advanced SQL compiler/runtime all with low latency and high
performance.

Learn more about Big SQL here:
http://www.livestream.com/newchannel/popoutplayer?channel=ibmiod&clip=pla_e4b28ad8-f647-4a02-88ad-6679b78f7e8b&time=1481
. Check out the SQL on Hadoop white paper:
http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?subtype=WH&infotype=SA&appname=SWGE_SW_SW_USEN&htmlfid=SWW14019USEN&attachment=SWW14019USEN.PDF

Speaker:

Claus Samuelsen, IBM

Claus Samuelsen has many years experience within data management, and has for more than 3 years had big data as his primary focus area.
Working with customers all over Europe devolping Hadoop based solutions in several industries from banking, insurance, healthcare to automotive and gaming.

--------------

Title: Apache Cassandra & Apache Spark for Time Series Data (45 minutes)

Presenter: Patrick McFadin

Bio: Patrick McFadin is regarded as one of the experts of Apache Cassandra and data modeling techniques. As the Chief Evangelist for Apache Cassandra and consultant for DataStax, he has helped build some of the largest deployments in the world. Previous to DataStax, he was Chief Architect at Hobsons, an education services company. There, he spoke often on Web Application design and performance.

Synopsis: Apache Cassandra has proven to be one of the best solutions for storing and retrieving time series data at high velocity and high volume. This talk will discuss how the storage model of Cassandra is ideal for time series use cases and go over examples of how to best build data models. We will also cover pairing Apache Spark with Apache Cassandra to create a real time data analytics platform. Attendees will leave this session knowing how to build their own real time data analytics platform, and will be shocked at how easy it is!