addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1linklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

LineageDB Architecture for Big Data Analytics & Data Quality

University of Colorado Boulder - Wednesday December 3, 2014 @ 6:00pm MST

NOTE: For folks unable to attend in person register and we will email you a livestream link 2 hours prior to event.

Location: ATLAS -[masked]th St Bldg 223, Boulder, CO - Room 100 - Map:


6:00 - 6:20 Schmooze - Food shall be served in Lobby

6:20 - 6:30 Announcements

6:30 - 7:30 LineageDB Architecture for Big Data Analytics by Charles Clifford

7:30 - 8:30 Top 20 Data Quality Solutions for Data Science by Ken Farmer

8:30 - 9:30 Network at Old Chicago at 1102 Pearl St. (western end of Pearl Street pedestrian mall, directly facing Boulder Bookstore). Please support our sponsor, Old Chicago in Boulder, and make new friends. See:

LineageDB Architecture for Big Data Analytics - Abstract

The traditional approach to data analytic platforms are:

• tightly coupled to expensive relational data services;
• limited to star and snow-flake schema (notoriously difficult to maintain); and
• heavily dependent on brittle, expensive ETLs.

RDBMS can be scaled vertically (at a big price point), but eventually you run out of run-way because a b-tree does not scale linearly. The morphing of relational services into MPP appliances have resulted in platforms that are not flexible enough to support rapidly changing data analytic needs. These limitations in can be overcome by adopting the LineageDB architecture, a polyglot composed from loosely coupled, open-source:

• key-value storage service;
• index service;
• graph service;
• SQL service; and
• in-memory data service.

Charles Clifford - Bio

Charles Clifford has been designing and developing both transaction, as well as analytic, business solutions since the early 90s. He has delivered distributed solutions to a variety of industries, from tel-com, to capital markets, to health care, to software powerhouses. His current focus is on the design and delivery of DaaS solutions. 

Top 20 Data Quality Solutions for Data Science - Abstract

Data quality continues to be one of the chief challenges, costs and reasons for project failure in data science. Problems in this space limit accuracy, destroy credibility and can result in harmful solutions. And unlike challenges such as scalability and cost it has seen no major breakthrough improvements. This presentation will cover the types of problems, as well as their impacts, causes and various solutions.

Ken Farmer - Bio

Ken Farmer is the senior data architect/wrangler/librarian for ProtectWise where he is developing their analytical data solution. Previously, he has developed, maintained, managed and consulted on analytical data architectures for IBM, MapQuest, Verizon, and others.

Join or login to comment.

  • Michael W.

    Apologize for technical livestreaming problems - no video for this presentation. Should be fixed for next event on January 21, 2015.

    Ken Farmer will present on Top 20 Data Quality Solutions at CU Denver on January 21, 2015 - register @

    I will interview Charles Clifford on LineageDB Architecture for real-time Big Data Analytics for video/podcast and post in future.

    See Slide Decks:

    LineageDB Architecture for real-time Big Data Analytics - Slides -­

    Top 20 Data Quality Solutions for Data Science - Slides -­

    December 11, 2014

  • Nitin k.

    Michael, any chances of you putting up the recording...
    Thanks in advance.

    December 8, 2014

  • Nitin k.

    Do we have this video posted.... Being online there were problems with both audio and video and this is a very interesting topic, if someone can share the recording details.

    1 · December 8, 2014

  • Karl K.

    Re Claudia, I thought Charles gave a great intro to new technologies and approaches that are coming down the pike, and will assume a place in new analytic architectures. I'm still looking for things I don't know I don't know, but hoping Claudia and others pipe in to clarify the mix. BTW, you can give a great preso and still be wrong :-) I see it done all the time.

    December 4, 2014

  • Claudia I.

    Unfortunately I have to disagree with the previous comments on this presentation. In my opinion, the presentation: 1. confused the relational database model with relational database implementations. 2. contained misleading, inaccurate and confusing information in terms of the application of relational technology for analytical processing and data warehousing.

    3. showed a serious lack of understanding of data integration issues in enterprises today.

    December 4, 2014

  • Thom R.

    Geat presentation of thought provoking concepts

    December 4, 2014

  • Karl K.

    Excellent presentation!

    December 3, 2014

  • Mark C.

    One of the best talks I've ever heard at a Meetup tonight!!

    December 3, 2014

  • Michael W.

    Livestream Link for LineageDB Architecture for real-time Big Data Analytics and Data Quality Solutions

    Livestream Link - ­

    Show time: 6:30pm MST

    Slide Decks:

    Top 20 Data Quality Solutions for Data Science - Slides -

    LineageDB Architecture for real-time Big Data Analytics - Slides -

    Embed Code:

    <iframe width="480" height="302"
    scrolling="no" frameborder="0" style="border: 0px none transparent;">
    <br /><a href=""; style="font-size: 12px;
    line-height: 20px; font-weight: normal; text-align: left;"
    target="_blank">Broadcast live streaming video on Ustream</a>

    December 3, 2014

  • Cheryl G.

    do we have to specifically ask for the livestream??? if so, please include me...


    December 3, 2014

  • Wyatt

    Please send me the live link. Great event idea, wish I could be there in person. Thank you.

    December 3, 2014

  • Michael W.

    Free Data Science Books at Old Chicago

    We will be giving away free data science books at Old Chicago after the event at Atlas.

    Please support our sponsor Old Chicago by joining us. Make new friends and professional connections!

    Directions from Atlas building to Old Chicago:

    1. Exit the Euclid St. parking deck/underground garage by turning right onto Euclid St.

    2. Turn right to go north on Broadway.

    3. After driving through the middle of the Pearl St. pedestrian mall, turn left onto Spruce St.

    4. Halfway down the block, turn left into the Spruce St. parking garage ($).

    5. Old Chicago is at the western end of the Pearl St. pedestrian mall at the intersection of 11th & Pearl. It faces Boulder Bookstore, so as you emerge from the garage look for Boulder Bookstore and then it's
    right across the pedestrian mall from that.

    1 · December 3, 2014

  • A former member
    A former member

    Please send me the link

    December 3, 2014

  • Prateek

    Please send me link

    December 3, 2014

  • Brian D.

    Please send me a link as I would like to attend remotely, thx.

    December 2, 2014

  • Steven W.

    Attending remotely...

    November 29, 2014

  • Felicity R.

    Please email me a livestream link 2 hours prior to event

    November 11, 2014

Our Sponsors

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy