Climate Data Analysis Cyberinfrastructure & Data Scientists vs. Data Engineers

University of Colorado Boulder - Tuesday July 23, 2013 @ 6:00pm MST

For folks unable to attend in person register and we will email you a livestream link 2 hours prior to event.

Note: Dr. Arvind Sathi has a scheduling conflict and will present at a future event. Michael Walker will present on Data Scientists vs. Data Engineers.

Location: ATLAS -[masked]th St Bldg 223, Boulder, CO - Room 100
Map: http://goo.gl/maps/XTJ9v

Agenda:

6:00 - 6:15 Schmooze - Food will be served in Lobby.

6:15 - 7:30 Rethinking Cyberinfrastructure for Climate Data Analysis Workflows by Dr. Richard Loft

7:30 - 8:30 Data Scientists vs. Data Engineers by Michael Walker

8:30 - 9:30 Network at The Sink at[masked]th Street.

See: http://bit.ly/ND8Kp

Rethinking Cyberinfrastructure for Climate Data Analysis Workflows - Abstract

Advancements in the computational capability of massively parallel supercomputers have offered the Earth system science community an unprecedented opportunity to dramatically improve its understanding of the Earth system. This has spurred a focused effort, over many years, to improve Earth system model scalability and performance. However it has recently become painfully evident that the ancillary data analysis software and hardware systems have become the rate-limiting step in advancing scientific understanding. There are three reasons for this development: first, the rate of improvement in computing system has outpaced improvements in storage system performance; second, many workflows and tool remain serial, while applications have become increasingly parallelized; and third, many analysis tools and applications make inefficient use of the underlying hardware.

This talk will cover the history and current state of Earth system modeling and data analysis, show how capabilities of the NCAR Wyoming Supercomputing Center are advancing that state, and suggest how infrastructure and the analysis software can and must coevolve to address the massive amounts of data. The discussion will be framed through experiences at NCAR in pushing the boundaries of what is possible in data centric computing, and the trends influencing the next co-evolutionary steps.

Bio

Dr. Loft has been involved with massively parallel computing since joining Thinking Machine Corporation as an Application Engineer in 1989. Throughout his career he has contributed to the understanding and effective use of parallelism as applied to grand challenge simulations. His algorithmic innovations dramatically improved the scalability of the atmospheric component of the Community Earth System Model, and were recognized with an honorable mention prize in the IEEE/ACM Gordon Bell competition at Supercomputing 2001. Rich is currently the Director of Technology Development Division in the Computational and Information Systems Laboratory at NCAR. TDD is charged with improving application scalability and performance, exploring the use of new computer technologies, and developing software to serve and analyze large or complex datasets. He also serves as NCAR’s representative to the eXtreme Science and Engineering Discovery Environment (XSEDE) Service Provider Forum (SPF) and oversees NCAR’s participation in the XSEDE project. Dr. Loft also leads the Outreach Services Group for the CISL computing laboratory at NCAR. The education of future computational scientists is an area he is passionate about, which is why he founded the Summer Internships in Parallel Computational Science, or SIParCS program in 2007.

Data Scientists vs. Data Engineers - Abstract

Data science is a team sport. Organizations often make the mistake of mixing and confusing team roles on a data science project - resulting in over-allocation of responsibilities assigned to data scientists. For example, data scientists are often tasked with the role of data engineer leading to a misallocation of human capital. Here the data scientist wastes precious time and energy finding, organizing, cleaning, sorting and moving data. The solution is adding data engineers to the data science team.

Data scientists should be spending their time and brainpower on applying data science and analytic results to critical business issues - helping an organization turn data into information - information into knowledge and insights - and valuable, actionable insights into better decision making and game changing strategies.

Data engineers are the designers, builders and managers of the big data infrastructure. They develop the architecture that helps analyze and process data in the way the organization needs it. And they make sure those systems are performing smoothly.

This presentation will address key issues, including:

What is big data and how is it being used?
How can strategic plans for big data analytics be generated?
How does big data change analytics architecture?

Bio

Michael Walker is a managing partner at Rose Business Technologies, a professional technology services and systems integration firm. He leads the Data Science Professional Practice at Rose. Mr. Walker received his undergraduate degree from the University of Colorado and earned a doctorate from Syracuse University. He speaks and writes frequently about data science and is writing a book on Data Science Strategy for Business. Learn more about the Rose Data Science Professional Practice at http://bit.ly/10TgVHG. Follow Mike on Twitter @Ironwalker76.

Join or login to comment.

  • Douglas H.

    In a previous career I was an exploration geophysicist. So I have some knowledge of the mathematical techniques for solving partial differential equations. What I did not know, until last night, was the sheer amount of data produced by the weather models and the problems that it presents. Very interesting!

    July 24, 2013

  • Mark C.

    I really wanted to hear more about how NCAR was managing the flow of big data within their infrastructure. I hope that future talks like this can emphasize new/innovative ways to handle huge volumes of scientific data.

    July 24, 2013

  • Michael M.

    July 24, 2013

  • Atif Farid M.

    Excellent, however, none of my questions were answered.

    July 23, 2013

  • Michael W.

    Directions from ATLAS to The Sink (8 minute walk)

    Map: http://goo.gl/maps/Z9Mu3

    1.Head south on 18th St

    2.Turn right onto Euclid Ave

    3.Turn right onto Broadway

    4.Turn left onto College Ave

    5.Turn right onto 13th St

    The Sink will be on the left at[masked]th St, Boulder, CO 80302

    July 23, 2013

  • Michael W.

    Livestream Link: http://ustre.am/12gmh

    Start at 6:15pm MST / 8:15pm EST / 5:15pm PST


    embed code:


    <iframe src="http://www.ustream.tv/embed/15315877"; width="608" height="368" scrolling="no" frameborder="0" style="border: 0px none transparent;"></iframe><br /><a href="http://www.ustream.tv/everywhere"; style="padding: 2px 0px 4px; width: 400px; background: #ffffff; display: block; color: #000000; font-weight: normal; font-size: 10px; text-decoration: underline; text-align: center;" target="_blank">Live video from your iPhone using Ustream</a>

    July 23, 2013

  • Paul P.

    I have to attend remotely. Please send the link as noted.
    Thanks!!!
    paul

    July 23, 2013

  • Michael G.

    attending remotely

    July 23, 2013

  • Donna B.

    Via livestream
    (hoping to join in person, but not likely)

    July 23, 2013

  • Douglas H.

    By livestream

    July 22, 2013

  • Brian P.

    I cannot attend in person, and would appreciate a livestream link. great topic! <brian

    July 22, 2013

  • Michael M.

    An express RTD bus is available from Denver to the CU Boulder campus:

    Westbound: Depart Market & 16th on route BMX at 5:07pm, arrive at Broadway & Euclid (one stop before Boulder terminus) at 5:52pm.
    http://www3.rtd-denver.com/schedules/getSchedule.action?runboardId=133&routeId=B&routeType=12&direction=W-Bound&serviceType=3

    Eastbound: Depart Broadway & 16th on route BV at 8:38pm, arrive at Market & 16th 9:32pm.
    http://www3.rtd-denver.com/schedules/getSchedule.action?runboardId=133&routeId=B&routeType=12&direction=E-Bound&serviceType=3

    Eastbound after drinks at The Sink: Depart Broadway & 16th on route BV at 11:00pm, arrive at Market & 16th 11:59pm.

    July 21, 2013

  • Atif Farid M.

    I will be needing either Google Hangout or Skype details to join in as I am in Grand Forks.

    July 21, 2013

  • ken f.

    I'm very sorry to miss this one - have a commitment I can't get out of.

    July 17, 2013

  • Rocky M.

    Via livestream.

    July 17, 2013

  • Michael M.

    Directions from Denver:

    1. US-36 to Baseline exit (the exit after Foothills Parkway but before US-36 turns into 28th St.).

    2. At end of ramp, turn left to go west on Baseline

    3. Turn right to go north on Broadway

    4. Make second right onto 18th St. (comes after Regent but before Euclid)

    5. Curve around as you're forced onto Euclid St.

    6. Park in the large Visitor parking lot & underground garage on the right. ($4 to park)

    7. Continue walking in the same direction you were driving on 18th St. -- actually the portion of 18th St. that's closed to public traffic.

    8. The Atlas building is the third building on the left, with the Pekoe Sip House coffee shop on the ground floor.

    9. Auditorium #100 is immediately off the lobby of the Atlas building.

    July 12, 2013

  • Richard H.

    darn it!

    July 1, 2013

Our Sponsors

People in this
Meetup are also in:

Imagine having a community behind you

Get started Learn more
Henry

I decided to start Reno Motorcycle Riders Group because I wanted to be part of a group of people who enjoyed my passion... I was excited and nervous. Our group has grown by leaps and bounds. I never thought it would be this big.

Henry, started Reno Motorcycle Riders

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy