
Data Wrangling in SQL and Other Tools

In this sequel to John Tillinghast's March talk, Ryan Harvey will discuss tools for connecting to databases that support repeatable and reversible data wrangling and analysis, including:

• The basics of using a relational database and querying it with SQL

• Database pros and cons, and database selection (spoiler: the examples use PostgreSQL)

• Doing analysis directly within the database using PostgreSQL's native functionality

• Connecting R to databases for more manageable data storage and easier exploratory analysis

• Doing hybrid analysis between R and the database using PostgreSQL
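As a rough illustration of the first and third topics above (querying with SQL and doing analysis inside the database), here is a minimal sketch in Python. It is not taken from the talk materials: the `visits` table and its rows are made up, and SQLite from the Python standard library stands in for PostgreSQL so that the example runs without a database server.

```python
import sqlite3

# Assumption: SQLite stands in for PostgreSQL so no server is needed.
# The table name, columns, and rows are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE visits (city TEXT, visitors INTEGER)")
conn.executemany(
    "INSERT INTO visits (city, visitors) VALUES (?, ?)",
    [("Lanham", 12), ("Washington", 40), ("Lanham", 8)],
)

# Let the database do the aggregation instead of looping in client code.
totals = conn.execute(
    "SELECT city, SUM(visitors) AS total "
    "FROM visits GROUP BY city ORDER BY city"
).fetchall()

print(totals)
conn.close()
```

The same `SELECT ... GROUP BY` statement is standard SQL, so it should carry over to PostgreSQL largely unchanged; only the connection code would differ.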

Ryan will discuss why repeatable and reversible processes and scripting matter in enterprise settings and other production environments, and how good tools make them easier to achieve.
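One common way to make a scripted wrangling step reversible is to run it inside a database transaction and roll it back unless the result looks right. The sketch below shows that idea under stated assumptions: it is not from the talk, the `readings` table and the `apply_cleanup` helper are hypothetical, and SQLite again stands in for PostgreSQL.

```python
import sqlite3

def apply_cleanup(conn, dry_run):
    """Null out negative readings; undo the change when dry_run is True."""
    # The UPDATE implicitly opens a transaction in Python's sqlite3 module.
    conn.execute("UPDATE readings SET value = NULL WHERE value < 0")
    changed = conn.execute(
        "SELECT COUNT(*) FROM readings WHERE value IS NULL"
    ).fetchone()[0]
    if dry_run:
        conn.rollback()  # reversible: the table returns to its prior state
    else:
        conn.commit()    # repeatable: the same script gives the same result
    return changed

def count_nulls(conn):
    return conn.execute(
        "SELECT COUNT(*) FROM readings WHERE value IS NULL"
    ).fetchone()[0]

# Hypothetical sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (id INTEGER PRIMARY KEY, value REAL)")
conn.executemany(
    "INSERT INTO readings (value) VALUES (?)", [(1.5,), (-2.0,), (3.0,)]
)
conn.commit()

preview = apply_cleanup(conn, dry_run=True)   # inspect the effect, then undo
nulls_after_preview = count_nulls(conn)
applied = apply_cleanup(conn, dry_run=False)  # make the change permanent
nulls_after_apply = count_nulls(conn)
conn.close()
```

Because the whole step lives in a script, it can be rerun from the raw data at any time, which is the "repeatable" half of the argument.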

Our Teacher

Ryan Harvey is a local coder, datahead, project manager, wonk and dad. For work, he manages several government web apps at the Office of Management and Budget, in addition to doing data science, software architecture and application and database engineering for Kitchology, Inc. on the side. He's also a PhD candidate in the Applied Mathematics, Applied Statistics and Scientific Computing program at the University of Maryland's Norbert Wiener Center for Harmonic Analysis and Applications. Ryan lives in Lanham, MD, with his wife and two children and several pets.


  • Ryan B. H.

    If anyone has photos of this one, I'd appreciate them. Please feel free to post them here on meetup.com.

    August 4, 2014

  • Ryan B. H.

    Thanks to all for the positive feedback. It was fun to put together, and I look forward to the next one. And with respect to my GitHub repo, since I talked about reproducibility, it's only appropriate that I walk the walk and share my code and data. :-) I hope the content of the talk and the shared materials help you to learn something new or understand something better than you did before.

    2 · June 5, 2014

  • Dorothy S.

    I am new to R and I was so glad to learn how to incorporate SQL in R. Thanks for sharing via GitHub, I really appreciate it. I look forward to more SQL talks!

    1 · June 5, 2014

  • Chad L.

    Ryan did a great job of presenting. It was nice to see how to access Postgres from R and Python. Having access to his code on GitHub is a great help as well.

    1 · June 5, 2014

  • A former member

    Ryan did a fantastic job of explaining how to use SQL in pure SQL, R and Python. Much more than expected, and expertly delivered.

    3 · June 5, 2014

    • Ryan B. H.

      Thanks, Robert. I heard a few requests for more advanced SQL, so perhaps that's my next talk for DWDC?

      June 5, 2014

    • A former member

      We'll line it up!

      1 · June 5, 2014

  • A former member

    Thanks for the presentation. It is hard (so hard) to keep up with the various tools available and like many, I assume, I default to what I know. Given that, it is very helpful to have exposure to other options/tools/techniques. Ryan did a good job taking us through the basics. Thanks for posting everything to GitHub. Very kind of you.

    3 · June 5, 2014

  • Aaron

    Thanks for sharing the code!

    1 · June 4, 2014

  • Jing

    Thanks for sharing the code. The SQL code tonight was very helpful for me, especially the part with R!

    1 · June 4, 2014

  • Shamiso M

    Great!

    1 · June 4, 2014

  • Ryan B. H.

    For those interested, everything I'm presenting tonight is available on GitHub: https://github.com/nihonjinrxs/dwdc-june2014

    4 · June 4, 2014

  • A former member

    Just ordered pizza and drinks. See you all at Funger Hall tonight! Food arrives at 6:30pm.

    7 · June 4, 2014

  • Alain B.

    I have the privilege of working with Ryan. I am cutting another meeting short to make sure I will be there.

    June 3, 2014

    • Ryan B. H.

      Thanks, Alain. Looking forward to seeing you there!

      June 3, 2014
