April Big Data Meetup – Introduction To Pentaho

  • April 19, 2012 · 5:30 PM
  • Orbitz Worldwide Headquarters

Dave Reinke of OpenBI will be providing an introduction to Pentaho's new functionality targeted at supporting big data. Pentaho's software has been getting a lot of attention lately in the big data space, and OpenBI has extensive experience helping it's customers achieve success with Pentaho, so this is sure to be a valuable and enlightening presentation. More info is below. Looking forward to seeing you all there! 

An Introduction to Pentaho’s Big Data Technology

Recently, a series of product announcements has moved Pentaho to the center of the Big Datamarketplace. Over the past few months, Pentaho has:

  • Switched to the Apache license while open-sourcing a broad swath of commercial and Big Data functionality of its flagship ETL software, Pentaho Data Integration (PDI).
  • Developed strategic alliances with Big Data vendors such as MapR, Cloudera, DataStax, Actian,Greenplum/EMC, Vertica and Infobright
  • And has been Acknowledged as a Strong Performer in the Forrester Wave for Enterprise HadoopSolutions

Pentaho Big Data technology makes big data programming accessible to traditional BI developers,providing an integrated data architecture that spans a continuum from big data platform to datawarehouse to data mart -- while also enabling reporting and analysis against Hadoop, NoSQL and highperformance analytical databases.
This session serves to educate on the Pentaho Big Data functionality. We will demonstrate a commondata architecture pattern that:

  • Loads data into a big data platform: HDFS & NoSQL
  • Processes data within Hadoop via Pentaho MapReduce, Pig and Hive scripts
  • Extracts data from big data platforms to an RDBMS data mart, and
  • Creates a job that orchestrates the entire process

Dave's Bio:

Dave Reinke is a member of the Chicago Big Data and Hadoop user groups and co-founder of OpenBI, a business intelligence and big data analytics professional services company.     He has over 20 years of BI consulting experience across a variety of industries and domains.    Dave is Cloudera Hadoop certified and currently an active member of the Pentaho Big Data community.

Join or login to comment.

  • Jonathan Bordoli

    First meetup so learned loads. Alas had other engagement and I needed to head off straight after the presentation so no chance for networking this time

    April 20, 2012

  • A former member
    A former member

    Good intro to Pentaho, and a sample dataset that is easy to understand in terms of content and value of extracted information.

    April 20, 2012

  • Ameena Lalani

    Dave Reinke was a great presenter. Not only he demoed Pentaho but he aslo showed us the same process workings through other platforms such as Pig, Hbase, Mongodb etc. He packed a lot in one hour presentation and it was good.

    April 20, 2012

  • Jeff Sippel

    Great explanation on what Pentaho is bringing to the Hadoop party, and how you can use a tool that is also used for traditional data movement to help accelerate your Big Data adoption.

    April 19, 2012

  • David Douglas

    Still at work...will not be able to come sorry

    April 19, 2012

Our Sponsors

  • Orbitz Worldwide

    A leading global online travel company and technology innovator.

  • Cloudera

    The leader in Apache Hadoop-based software and services.

  • HortonWorks

    A leading provider of support and services for Apache Hadoop.

  • TechNexus

    Chicago’s first collaborative ecosystem for tech entrepreneurs.

  • Oracle

    Industry leading hardware and software solutions for data management.

  • Couchbase

    Open source NoSQL for mission-critical systems.

  • Terracotta

    In-memory data management for the enterprise.

People in this
Meetup are also in:

Share your interests and spark new friendships

Log in

Not registered with us yet?

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy