align-toparrow-leftarrow-rightbackbellblockcalendarcamerachatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-crosscrosseditfacebookglobegoogleimagesinstagramlocation-pinmagnifying-glassmailmoremuplabelShape 3 + Rectangle 1outlookpersonplusImported LayersImported LayersImported Layersshieldstartwitteryahoo

Python for Data Logistics & Relational Technologies - Tues. June 25 @6pm MST

University of Colorado Denver - Tuesday June 25, 2013 @ 6:00pm MST

Large auditorium (170 person capacity) with 20' screen.

Location: CU Denver - North Classroom #1539 - 1200 Larimer Street 
Denver, CO[masked] - Map: http://bit.ly/Tyznzg

Agenda:

6:00 - 6:15 Schmooze - Old Chicago Pizza will be served.

6:15 - 7:30 Using Python for Data Logistics by Ken Farmer

7:30 - 8:30 With the Emergence of Big Data, Where do Relational Technologies Fit? by       Donna Burbank

8:30 - 9:30 Network at Old Chicago at 14th and Market.

See: http://www.oldchicago.com/denver-market-street

Using Python for Data Logistics - Abstract

Big Data and Data Science projects looking to reduce the risks, costs, and nightmares associated with managing dozens of data feeds have discovered the ETL (Extract, Transform, Load) product category. But there's no such thing as a silver bullet, and while there are practices and lessons to be learned from ETL, the tools are mostly the legacy of early 90s thinking in which data feeds were fewer, the alternative were COBOL or C, and writing code was deemed risky by DBAs and management. Ken will show how a high-level language like Python, when matched with certain practices and design patterns can offer a very successful alternative to these diagram-driven development tools. The discussion will focus on concepts, designs and patterns, and will include examples of successes and failures with a small amount of code.

Bio

Ken Farmer is a data architect at IBM where he has built and led their Security & Compliance Data Warehouses.  These projects used Python extensively for systems management, general data management, ETL, and analytics. He writes about data management at www.ken-far.com, and writes data analysis tools like DataGristle for fun on the side.

With the Emergence of Big Data, Where do Relational Technologies Fit? - Abstract

The recent focus on Big Data in the data management community brings with it a paradigm shift—from the more traditional top-down, “design then build” approach to data warehousing and business intelligence, to the more bottom up, “discover and analyze” approach to analytics on Big Data.  Where do relational data bases, data modeling, and data warehousing fit in this new world of Big Data?  Do they go away, or can they evolve to meet the emerging needs of this exciting new technology?  Join industry expert Donna Burbank as she discusses the issues and opportunities that exist for data management professionals in the Big Data environment.

Bio

Donna Burbank is a recognized industry expert and author, with more than 15 years of experience in data management, metadata management, and enterprise architecture. Donna currently is VP of product marketing for CA Technologies’ data modeling solutions.  Previous to this role, she has served in key brand strategy and product management roles at Computer Associates and Embarcadero Technologies and as a senior consultant for PLATINUM technology’s information management consulting division in both the U.S. and EMEA.  She has worked with dozens of Fortune 500 companies worldwide in the U.S., Europe, Asia, and Africa and speaks regularly at industry conferences.  She has recently co-authored two books: Data Modeling for the Business and Data Modeling Made Simple with CA ERwin Data Modeler r8.

Join or login to comment.

  • Michael M.

    Ken Farmer's slides for "Using Python for Data Logistics" are available at http://www.slideshare.net/kenfar719/python-for-datalogisticsv3

    June 26, 2013

  • Eric C.

    Great presentations!

    June 26, 2013

  • Michael M.

    Part 1: http://www.youtube.com/watch?v=JQcSTWY8FNE
    Part 2: http://www.youtube.com/watch?v=mzq-zLiOJWA

    Interruption was due to complaint on chat about no audio (but now realize it was limited to just that one person).

    Also, ustream.tv started failing the very last 10 minutes.

    June 26, 2013

  • Michael W.

    Thank you all for a great event. Slides and video will be posted asap.

    June 26, 2013

  • Mark C.

    Great experience meeting like-minded people who have a passion for big data

    June 26, 2013

  • Jamie O.

    Also did anyone see that Joyent launched a new Big Data platform today? Looks amazing.
    http://www.joyent.com/company/press/joyent-expands-cloud-offering-with-object-storage-and-data-services-platform

    June 25, 2013

  • Jamie O.

    Will the slides be made available? I wasn't able to see much of the text from the first presentation.

    June 25, 2013

  • Alex V.

    Formalizing "data science" -- earlier this month Rensselaer Polytechnic Institute, our oldest technological research university, announced a $100M university-wide initiative ("IDEA") that "brings together and fortifies the wealth of data science, high performance computing, predictive analytics, data visualization, and cognitive computing research at Rensselaer."

    See http://www.timesunion.com/local/article/100M-for-Big-Data-at-RPI-4599723.php and http://www.thegovlab.org/big-data-and-academia-the-launch-of-rennselaer-idea/

    June 25, 2013

  • Jamie O.

    Thanks for the live stream link. Running late at the office.

    June 25, 2013

  • Michael W.

    Livestream Link for Python for Data Logistics & Relational Technologies 6:15pm MST - http://ustre.am/VhHg


    Starts at 6:15pm MST / 5:15 PST / 8:15pm EST


    Embed the live video player anywhere using the code below


    <iframe src="http://www.ustream.tv/embed/13652726"; width="608" height="368" scrolling="no" frameborder="0" style="border: 0px none transparent;"></iframe><br /><a href="http://www.ustream.tv/everywhere"; style="padding: 2px 0px 4px; width: 400px; background: #ffffff; display: block; color: #000000; font-weight: normal; font-size: 10px; text-decoration: underline; text-align: center;" target="_blank">Live video for mobile from Ustream</a>

    June 25, 2013

  • Clayton Auzenne J.

    online

    June 25, 2013

  • Tom R.

    I'll have to stream this one, sorry I can't make it in person, it's a great and very salient topic.

    June 25, 2013

  • Christina

    I will not be able to be there in person. Please send me a link to watch via livestream video. Thanks!

    June 25, 2013

  • Chris O.

    Sounds really interesting, can't make it in person so thanks so much for the livestream option.

    June 25, 2013

  • Michael W.

    Livestream if unable to attend in person - register and we will email you a link to watch via livestream video 2 hours prior to start.

    1 · June 25, 2013

  • Carey G. B.

    I want to observe the event online.

    June 25, 2013

  • Randy T.

    Geospatial Database Consultant

    June 24, 2013

  • Masroor F.

    Looking forward to two of my favorite technologies working together

    June 19, 2013

  • A former member
    A former member

    Online.

    June 18, 2013

  • A former member
    A former member

    I'll be there online.

    June 18, 2013

  • Michael W.

    Some vendors (Teradata, Oracle) are pitching Hadoop as a data prep - ETL tool. At this time I do not think so. See this interesting Gartner piece: http://gtnr.it/ZTmlzZ

    1 · May 29, 2013

    • Tom R.

      What I meant by 'Hive is non-relational' is that Hive is just translating HQL into M/R. You can do joins, but not in the sense that you would using, say, Netezza. You're just translating joins into M/R, and lacking zone mapping or indexing Hive is pretty slow. I suppose I should have been more specific about what the salient features of MPP DBs are vs. Hive. And I didn't say Impala wasn't in use. We use it ourselves. I specifically said EMC's Hawq wasn't in use by any enterprise customers that I knew of, and it's the only fast query engine for Hadoop that I know of that utilizes a well developed query planner.

      May 31, 2013

    • Tom R.

      But I certainly wouldn't argue with you about expandability. If you see your data storage requirements growing rapidly in the near future and you don't need that data to be available for real time query (though as I pointed out, there are several Hadoopish solutions for getting near real time, including the Berkley stack which I know you're fond of), then Hadoop is a great solution.

      May 31, 2013

  • Nikhil V.

    Python and Hadoop. Sounds good to me!

    May 22, 2013

Our Sponsors

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy