addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwchatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgoogleimageimagesinstagramlinklocation-pinmagnifying-glassmailminusmoremuplabelShape 3 + Rectangle 1outlookpersonplusprice-ribbonImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruseryahoo

TomTom Traffic Data & Data Science at the Command Line

  • Apr 3, 2014 · 5:30 PM
  • This location is shown only to members

17:30-18:30 - Pizza & Drinks

18:30-19:30 - Data Science at the Command Line

Jeroen Janssens, Senior Data Scientist @ YPlan

The *nix command line, although invented decades ago, is an amazing environment for doing data science. By combining small, yet powerful, command-line tools we can really explore our data and quickly hack together prototypes. The recent addition of tools such as GNU Parallel, jq, and, Drake, further enables us to be more productive and more efficient data scientists. Installing these command-line tools and setting up an efficient environment is, unfortunately, not straightforward.

In the first part of this talk I will present a new open-source project called the Data Science Toolbox, which is a virtual environment that allows you to get started doing data science in minutes. It comes with commonly used software for data science and allows for easy installation of additional tools. Because the Data Science Toolbox runs on top of VirtualBox, it can be installed not only on Linux, but also on Mac OS X and Microsoft Windows.

Once you have a solid environment, it is worthwhile to further customize it to your own needs. In the second part of the talk I will explain how to (1) make your environment more efficient and (2) create reusable command-line tools from one-off commands or from existing code in, for example, Python and R.

By the end of this talk you will have a solid understanding of how to leverage the power of the command line for your next data science project.

Jeroen Janssens is a senior data scientist at YPlan, tonight's going out app, where he's responsible for making event recommendations more personal. Jeroen holds a Ph.D. in Machine Learning from Tilburg University. He is authoring a book called "Data Science at the Command Line", which will be published by O'Reilly in summer 2014. Jeroen enjoys biking the Brooklyn Bridge, building tools, and blogging at http://jeroenjanssens.com. He can be found on Twitter @jeroenhjanssens.

19:30-19:45 - Break

19:45-20:45 - TomTom’s use of Traffic Data

Ralf-Peter Schäfer, Fellow & VP Traffic and Travel Information Product Unit @ TomTom

Every day, TomTom's navigation devices generate about 10 billion data points about the speed and position of cars. TomTom uses this data to provide real-time traffic information, calculate accurate travel times, and much more. Mr. Schäfer will discuss how the data collection process works, how to manage such a big amount of data, and how TomTom creates value from it by sophisticated analysis. 

20:45-21:30 - Drinks

Join or login to comment.

  • Jeroen J.

    For those who'd like to learn more about Data Science at the Command Line: Next Wednesday I'll be doing a two-hour webcast on this topic. You can read more about this free event and sign up at http://www.oreilly.com/pub/e/3115

    August 12, 2014

  • Uri S.

    Great meetup, can’t wait to see the slides online!

    Koen suggested I use this as the proverbial “soap box” to look for a couple of data scientists to start up a Dutch chapter of DataKind. That’s an American organization which connects Data Scientists with NGOs and other “good causes” to solve data problems for the common good. They’re looking for people to start op chapters elsewhere (see http://www.datakind.org/howitworks/datachapters/ ) , and I think that a Dutch chapter would be a great idea. Let me know if you’re interested!

    1 · April 3, 2014

    • Gosia W.

      Hi Uri, we would be interested too. We are a company that analyzes data and builds tool to analyze data.

      April 9, 2014

    • Uri S.

      A short update: we've started a new meetup dedicated to connect Mission Driven Organisations and Data Scientists. First meetup in September! Please see http://www.meetup.com...­ for more info. And thanks to all the DataMission co-founders and to Koen!

      July 21, 2014

  • Elena L.

    First talk was not up to the expectations. I would prefer to see more complex examples. Second talk was really good!

    April 7, 2014

    • Jeroen J.

      I'm sorry to hear that via this way. I do hope that I have given everybody plenty of opportunity to ask questions (and for more complex examples) during and after the presentation. Feel free to contact me if you still have any questions.

      April 7, 2014

    • Elena L.

      If you would provide me some links for some complex examples, I would be really-really grateful! And thank you for the talk. It's just that I have expected something more from the examples side:)

      April 8, 2014

  • Marcel van der K.

    Big turn-out. The Data Science toolbox part was too much focussed on programming/data rookies and could have shown more about actual applications. The TomTom part was not high-paced, but it didn't inspire or trigger new ideas.

    April 5, 2014

  • Michael N.

    Very interesting talks with immediate application potential! Will certainly join in more often.

    April 4, 2014

  • Koen H.

    Jeroen's slides are now online: http://www.slideshare.net/datasciencenl/dsatcl

    1 · April 4, 2014

  • Koen H.

    I'll post the links to the slides here as soon as they become available.

    April 4, 2014

  • puja

    Very inspiring session! Where can we grab the presentations from?
    Cheers, Puja

    April 4, 2014

  • Coen J.

    1. Interesting talk for beginning Data Scientists. For myself, I would have preferred a demonstration of more elaborate functions of the data science toolkit.
    2. TomTom Traffic was a very nice presentation, both graphically and content-wise.

    1 · April 4, 2014

  • Koen H.

    Please upload photos if you have them.

    April 4, 2014

  • Bram B.

    Vriend van Jeroen

    April 3, 2014

  • Freddy H.

    Regretfully, I also have to cancel my planned visit to this evening.

    April 3, 2014

  • Frank I.

    Helaas een griepje opgelopen, ik kan er niet bij zijn. Veel plezier.

    April 2, 2014

  • Ward W.

    Looking forward to it!

    March 28, 2014

  • Kelly

    Very excited to join this meetup!
    Also, my company is looking for an Intern to help with Onsite Search, if anyone is interested. Pass Along!

    https://www.dropbox.com/s/940sb7vursolps2/Interns-Onsite%20Search%20EU.pdf

    March 10, 2014

  • Koen H.

    I changed the order of the speakers.

    March 10, 2014

  • Koen H.

    hehe

    March 7, 2014

  • Szabolcs B.

    Thanks Koen, I updated my priors accordingly. ;)

    March 7, 2014

  • Koen H.

    @Szabolcs I will look into it, but it might be hard because of the book that Jeroen is writing for O'Reilly. Speakers are also known to be more candid without a camera. I will officially announce the meetup in the next few days.

    March 7, 2014

  • Geoffrey S.

    Awesome topic again this time... looking forward to it

    March 7, 2014

  • Szabolcs B.

    Is there any chance of recording the event, and sharing it somehow?

    2 · March 7, 2014

Our Sponsors

  • Uber

    Venue/Food/Drinks

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy