addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

April Big Data Meetup – Introduction To Pentaho

  • Apr 19, 2012 · 5:30 PM
  • Orbitz Worldwide Headquarters

Dave Reinke of OpenBI will be providing an introduction to Pentaho's new functionality targeted at supporting big data. Pentaho's software has been getting a lot of attention lately in the big data space, and OpenBI has extensive experience helping it's customers achieve success with Pentaho, so this is sure to be a valuable and enlightening presentation. More info is below. Looking forward to seeing you all there! 

An Introduction to Pentaho’s Big Data Technology

Recently, a series of product announcements has moved Pentaho to the center of the Big Datamarketplace. Over the past few months, Pentaho has:

  • Switched to the Apache license while open-sourcing a broad swath of commercial and Big Data functionality of its flagship ETL software, Pentaho Data Integration (PDI).
  • Developed strategic alliances with Big Data vendors such as MapR, Cloudera, DataStax, Actian,Greenplum/EMC, Vertica and Infobright
  • And has been Acknowledged as a Strong Performer in the Forrester Wave for Enterprise HadoopSolutions

Pentaho Big Data technology makes big data programming accessible to traditional BI developers,providing an integrated data architecture that spans a continuum from big data platform to datawarehouse to data mart -- while also enabling reporting and analysis against Hadoop, NoSQL and highperformance analytical databases.
This session serves to educate on the Pentaho Big Data functionality. We will demonstrate a commondata architecture pattern that:

  • Loads data into a big data platform: HDFS & NoSQL
  • Processes data within Hadoop via Pentaho MapReduce, Pig and Hive scripts
  • Extracts data from big data platforms to an RDBMS data mart, and
  • Creates a job that orchestrates the entire process

Dave's Bio:

Dave Reinke is a member of the Chicago Big Data and Hadoop user groups and co-founder of OpenBI, a business intelligence and big data analytics professional services company.     He has over 20 years of BI consulting experience across a variety of industries and domains.    Dave is Cloudera Hadoop certified and currently an active member of the Pentaho Big Data community.

Join or login to comment.

  • Jonathan B.

    First meetup so learned loads. Alas had other engagement and I needed to head off straight after the presentation so no chance for networking this time

    April 20, 2012

  • A former member
    A former member

    Good intro to Pentaho, and a sample dataset that is easy to understand in terms of content and value of extracted information.

    April 20, 2012

  • Ameena L.

    Dave Reinke was a great presenter. Not only he demoed Pentaho but he aslo showed us the same process workings through other platforms such as Pig, Hbase, Mongodb etc. He packed a lot in one hour presentation and it was good.

    April 20, 2012

  • Jeff S.

    Great explanation on what Pentaho is bringing to the Hadoop party, and how you can use a tool that is also used for traditional data movement to help accelerate your Big Data adoption.

    April 19, 2012

Our Sponsors

  • Orbitz Worldwide

    A leading global online travel company and technology innovator.

  • Cloudera

    The leader in Apache Hadoop-based software and services.

  • HortonWorks

    A leading provider of support and services for Apache Hadoop.

  • TechNexus

    Chicago’s first collaborative ecosystem for tech entrepreneurs.

  • Oracle

    Industry leading hardware and software solutions for data management.

  • Couchbase

    Open source NoSQL for mission-critical systems.

  • Terracotta

    In-memory data management for the enterprise.

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy