Skip to content

Continuous Data Management for Hadoop and Spark – On-Premise or in the Cloud

Photo of IBM Big Data
Hosted By
IBM Big D.
Continuous Data Management for Hadoop and Spark – On-Premise or in the Cloud

Details

Happy New Year!!!

We hope you had great holidays and wish you all the best in 2016! In case you have not done it yet, it is time to RSVP to our event coming up next week!

We are excited to announce that with the help of our generous sponsors, we will be holding a raffle for all attending group members at our event on January 13th. We will be giving away some awesome prizes.

First prize: 2 tickets for:

Washington Capitals at Chicago Blackhawks
United Center – Chicago
Sunday, February 28, 2016 @ 11:30AM
2 tickets

Second prize: a Chipolo https://chipolo.net/

---------------------------------------------------------------

Hi, everyone. Please join us and our awesome presenters on January 13th at 5:30 pm . Refreshments will be provided. See you there!

AGENDA

• Continuous Data Management for Hadoop and Spark – On-Premise or in the Cloud.

KEYNOTE SPEAKER:

http://photos4.meetupstatic.com/photos/event/c/e/b/600_444663307.jpeg

James Campigli, Chief Product Officer and Co-Founder Wandisco

Jim has over 25 years of software industry experience at both early-stage and public companies. In his current role he is responsible for overseeing WANdisco's product strategy. In his previous role as a founder and chief technology officer (CTO) of Librados, an application integration software provider, Jim was responsible for overall product strategy and product messaging. He was also a member of the management team that led the company’s acquisition by NetManage, Inc. Following its acquisition, Jim joined NetManage as CTO for the Librados products group.

Prior to Librados, he was the vice president of product management for Insevo, a middleware company specializing in enterprise application integration. Jim also held senior product management, product marketing and consulting management positions at BEA Systems and SAP AG.

• Big SQL - Making all of your big data (http://www.ibm.com/software/data/bigdata/) SQL accessible using an optimal execution strategy. Presenter: Benjamin Lumbert, Big Data Systems Engineer, IBM

Continuous Data Management for Hadoop and Spark – On-Premise or in the Cloud

Big Data makes it possible to inexpensively store and process petabytes of structured, unstructured and semi-structured data generated at incredible speeds. However, the ultimate benefits of big data are lost if fresh, fast-moving data is not analyzed as it happens. Fast data is about data in motion—immediate response and action.

The collection process for data in motion is essentially the same as data at rest, but the key difference is the analysis occurs in real time as data is generated and captured. However, this analysis has to include the historical context provided by data at rest in order to be meaningful. This requires an enterprise-ready architecture that efficiently handles both data at rest and data in motion with the following components:

  • An enterprise grade Big Data platform to support real-time analytics applications without downtime or data loss
  • A flexible and agile cloud environment for cost-effective burst-out processing
  • A data migration/replication engine that exceeds the most demanding application SLAs.

This meet up will provide an overview of the “best in class” architecture required to harness the benefits of Big Data with “Continuous Data Management for Hadoop and Spark”

BigSQL 4.1. Presenter: Benjamin Lumbert, Big Data Systems Engineer, IBM

Big SQL provides ANSI SQL access to data across any system from Hadoop, via JDBC or ODBC - seamlessly whether that data exists in Hadoop or a relational data base. This means that developers familiar with the SQL programming language can access data in Hadoop without having to learn new languages or skills. Big SQL sets a new bar: performance. Benchmark tests indicate that Big SQL executes queries 20 times faster, on average, over Apache Hive 12 with performance improvements ranging up to 70 times faster. It can query and combine data from many data sources, including (but not limited to) DB2 for Linux, Teradata, Oracle, UNIX and Windows database software, IBM PureData System for Analytics.

With Big SQL, all of your big data is SQL accessible. It presents a structured view of your existing data, using an optimal execution strategy, given your available resources.

Parking

  • On-site valet $8 per car
Photo of Big Data and AI Developers in Chicago group
Big Data and AI Developers in Chicago
See more events