- Data Science NL Meetup meets Beer & Tech
• 5:30 PM: Walk-in with food and drinks • 6:30 PM: Introduction by Jeroen Janssens (Data Science Workshops) • 6:35 PM: Welcome by Ciaran Jetten (Heineken) • 6:45 PM: Talk 1 by Marjolein Peters (CBS) • 7:15 PM: Talk 2 by Elien van Riet (Reviewscan) • 7:45 PM: Book Raffle • 7:50 PM: Talk 3 by Django van Amstel (Heineken) • 8:20 PM: Community Announcements • 8:30 PM: Drinks • 9:30 PM: End The abstracts and speaker bio's are located at https://www.datascienceworkshops.com/blog/data-science-nl-meetup-meets-beer-and-tech/
- 12th Data Science NL Meetup
• 5:30 PM: Walk-in with food and drinks • 6:30 PM: Introduction by Jeroen Janssens • 6:35 PM: Welcome by Cognizant Digital Studio • 6:40 PM: Talk 1: "What is Real Programming?" by Felienne Hermans • 7:15 PM: Book Raffle 1 • 7:20 PM: Talk 2: "Thoughts on doing Agile Data Science" by Edwin Thoen • 7:55 PM: Book Raffle 2 • 8:00 PM: Talk 3: "Prescriptive Analytics: Methodologies and Challenges" by Sophia Katrenko • 8:35 PM: Community Announcements • 8:45 PM: Drinks • 9:30 PM: End • Talk 1: What is Real Programming? Is Excel a programming language? That question has kept my mind occupied since I wrote a PhD thesis on spreadsheets as code. And is CSS programming? My friend Lara Schenk asked this question on Twitter twice and got interesting answers… So, what is a programming language? What do people think, and what arguments do they use? After a lot of debate, I thought it was time for a scientific experiment and collected data from (to date) over 600 developers. In this talk I will talk about the results which I analyze with Python (which is of course the one programming language to rule them all) and discuss the implications of this study. • About Felienne Hermans Felienne is associate professor at the Leiden Institute of Advanced Computer Science at Leiden University, where she heads the PERL research group, focused on programming education. On Saturdays she teaches children programming in a local community centre. Felienne was also one of the founders of the Joy of Coding conference, with a similar goal, which she organized for 6 years. Since 2016, she has been a host at SE radio, one of the most popular software engineering podcasts on the web. When she is not coding, blogging or teaching, she is probably knitting, running or playing a (board)game. • Talk 2: Thoughts on doing Agile Data Science During the past decade, Agile has become the leading philosophy for many software engineering teams. Agile is a set of values and principles that promote quick delivery and short-cycled improvements to products. In this talk I explore what it means to apply the Agile philosophy to the development of data science products, such as machine learning models and the production of automated research reports. How can we, as data scientists, have continuous delivery, keep the business involved throughout the process, and integrate software development with the explorative nature of our work? • About Edwin Thoen With a degree in Applied Statistics he started to work on data science projects in business six years ago. First as a consultant, then at Rabobank and currently at Funda. Within the data science spectrum his main interests are interactive data analysis, solving machine learning problems with statistical models, and maintaining an effective workflow. Edwin is the author and maintainer of the padr R package and contributes to other open source projects on a regular basis. He lives in Oegstgeest with his wife and two boys, where he enjoys family life, playing tennis, and cooking. • Talk 3: Prescriptive Analytics: Methodologies and Challenges Prescriptive analytics has received substantial attention in the recent years and exemplifies a shift from descriptive and predictive analytics to actionable insights. In this talk, I will review existing solutions in prescriptive analytics and introduce Cognizant's Learning Evolutionary AI Framework that enables business decision making by discovering optimal strategies. • About Sophia Katrenko Sophia Katrenko is Senior Manager Data Science at Cognizant who has worked in advertising, research, finance, publishing, and continues exploring Data Science & Analytics in other sectors, preferably with a measurable impact on society. Prior to joining Cognizant, she had led content enrichment and commercial bibliometrics projects at Elsevier serving global markets. Sophia holds a PhD in Computer Science from the University of Amsterdam on the topic of information extraction.
- 11th Data Science NL Meetup
The previous meetup at Picnic was such a success that we might as well organise another one! On July 25, TomTom will be hosting the 11th Data Science NL Meetup. Join us for an evening of engaging talks, great people, delicious food, two exciting book raffles, and interesting announcements from the community. Many thanks to TomTom for sponsoring and hosting and to O'Reilly for sponsoring the book raffle. Cheers, Jeroen Programme • 5:30 PM: Walk-in with food and drinks • 6:30 PM: Introduction by Jeroen Janssens • 6:35 PM: Welcome by TomTom • 6:40 PM: Talk 1: "Ideas that Constrain Artificial Stupidity" by Vincent Warmerdam • 7:15 PM: Book Raffle part 1 • 7:20 PM: Talk 2: "Kids just want to have fun. Affect detection in an online learning platform" by Roger Smeets • 7:55 PM: Book Raffle part 2 • 8:00 PM: Talk 3: "Learning from messy sensor data" by Zeinab Bakhtiarinoodeh • 8:35 PM: Community Announcements • 8:45 PM: Drinks • 9:30 PM: End • Talk 1: Ideas that Constrain Artificial Stupidity: Freedom sounds like a good idea but there's a reason why you'd want a fence near a ravine and a speed limit on a car. One might even call it common sense. The problem is that machine learning models don't typically have that and the results can be disastrous. This talk is about celebrating constraints on models in order to improve their applications. • About Vincent (http://bit.ly/2XK7wWt): Vincent is an algorithm person who works for GoDataDriven, blogs at koaning.io and has been organising a bunch of conferences/meetups in the last few years. AskHimAnything[tm]. • Talk 2: Kids just want to have fun. Affect detection in an online learning platform: Squla is an online educational platform for children aged 4-12. Since Squla's mission is to promote fun learning, an important user engagement metric is enjoyment. Timely and automatic detection of (a lack of) enjoyment can be helpful in providing appropriate interventions to keep users engaged. Building models to facilitate the detection of affect (i.e. emotional states) in this context presents two challenges: First, an instrument has to be developed that can be used to identify instances of (lack of) enjoyment while children are playing. Second, this instrument has to be properly understood by children. To this end, we have developed and validated an emoticon-based self-report instrument to derive ground-truth labels of four emotions: Joy, frustration, confusion, and boredom. Training a number of different classifiers for automated affect detection yields promising results, in particular for detecting joy and frustration. • About Roger (http://bit.ly/2YY1luq): Roger Smeets (1981) is the head of data science at Squla. He obtained a PhD in Economics from the Radboud University Nijmegen in 2009. He spent the next eight years conducting econometric research in public policy (CPB Netherlands Bureau for Economic Policy Analysis) and academia (Rutgers University, USA). After returning to the Netherlands, he joined Squla in 2017. He currently lives in Amsterdam with his wife and three children. • Talk 3: Learning from messy sensor data: In this talk we will discuss how the information created by the multiple sensors, which are integrated in modern smartphones can assist human in a smart way in different domains. We will review Machine Learning techniques and methods to detect events from smartphone sensor data. • About Zeinab (http://bit.ly/32uHfdB): Zeinab Bakhtiarinoodeh is a senior data scientist at TomTom, with a strong background in mathematics and computer science. In 2017, she obtained her Ph.D. in Computer Science and Logic from LORIA, Université de Lorraine in France. At TomTom she leverages computer science, machine learning, and mathematical modelling to turn data into a story, and into new fascinating features for the users of TomTom products. She previously worked at EyeOn, where she improved their forecast of new products using machine learning techniques.
- 10th Data Science NL Meetup
We're thrilled to announce that, after a three-year hiatus, we're back with another event. On June 20, the 10th Data Science NL Meetup will take place at Picnic in Amsterdam. Many thanks to Picnic for sponsoring and hosting, to O'Reilly for sponsoring the book raffle, and to AIgents for co-organising. Below you'll find the details, the programme, the talk abstracts, and the speaker biographies. Cheers, Jeroen Visit https://aigents.co to discover more events for Data Scientists. Program • 5:30 PM: Walk-in with food and drinks • 6:30 PM: Introduction by Jeroen Janssens • 6:35 PM: Community Announcements • 6:40 PM: Talk 1: "Data Science in the Humanities" by Marieke van Erp • 7:15 PM: Book Raffle 1: "Data Science from Scratch" by Joel Grus • 7:20 PM: Talk 2: "Searching to be Entertained" by Daan Odijk • 7:55 PM: Book Raffle 2: "Python for Data Analysis" by Wes McKinney • 8:00 PM: Talk 3: "AI at Your Service: Supporting Customer Success With NLP Techniques" by Sharon Gieske • 8:30 PM: Drinks • 9:30 PM: End • Talk 1: Data Science in the Humanities: Humanities is a broad research field that covers (but is not limited to) history, literature studies, linguistics and ethnology. Digital methods are becoming more common in day-to-day humanities practice, but these methods don’t always translate seamlessly to the humanities domain. In this talk, I will present use cases from the Dutch Royal Academy’s Humanities Cluster that show the potential as well as frictions in the application of digital analysis methods in this domain. • About Marieke (https://www.linkedin.com/in/marieke-van-erp-b448701/): Marieke van Erp is a researcher and team leader of the Digital Humanities Lab at the Royal Netherlands Academy of Arts and Sciences Humanities Cluster in Amsterdam, the Netherlands. Her research is focused on applying natural language processing in semantic web applications with a particular interest in digital humanities. She previously worked on the European NewsReader project, which was aimed at building structured indexes of events from large volumes of financial news and the CLARIAH project, a large Dutch project to develop infrastructure for humanities research. • Talk 2: Searching to be Entertained: RTL, the largest commercial broadcaster in a declining Dutch TV market, is making a transition from a traditional TV company to a consumer-focused media company. Daan will share how we are using data science and AI to help our users find the right content for them, ranging from the 1M daily visitors on our news website to the over 2B yearly video plays we have, most of these on our rapidly growing video-on-demand platform Videoland. • About Daan (https://www.linkedin.com/in/dodijk/): Daan Odijk is the lead data scientist at RTL. In 2016, he obtained his PhD on search algorithms for news. Subsequently, he joined journalism start-up Blendle, leading the personalization team. At RTL, Daan leads data scientists and engineers, delivering data-powered products across RTL, including personalization for RTLNieuws and Videoland. • Talk 3: AI at Your Service: Supporting Customer Success With NLP Techniques Picnic is the world's fastest growing online supermarket with a focus on accessible and personal customer service. With growth at this scale, support agents must be able to handle an ever-increasing volume and diversity of customer feedback. In this talk, Sharon will share how Picnic uses AI to reduce the workload of support agents without compromising its customer experience. • About Sharon (https://www.linkedin.com/in/sharon-gieske-75772328/): Sharon Gieske is a data scientist at Picnic, with a strong background in Artificial Intelligence and Natural Language Processing. Her fascination lies with the diverse challenges that user-generated text brings. In her recent work, she leverages machine learning to support customer success in faster resolution of customer feedback.
- Data Science NL Meetup @ Uber Engineering
IMPORTANT: you need to RSVP or re-RSVP here (https://www.eventbrite.com/e/data-science-nl-meetup-at-uber-tickets-24509871686) on Eventbrite if you want to attend! We moved to a bigger location to accommodate the demand. 18:00 Food / Drinks 18:30 Talk I - Stephen t' Siobbel - PM 3D Maps and Highly Automated Driving at TomTom TomTom is working on the production of maps for highly automated driving. Building and maintaining these maps requires an enormous amount of data, software, and algorithms. This presentation will offer some insight into TomTom’s latest efforts in this area. Stephen joined the independent map provider Tele Atlas in 1996 as a researcher. In the early 2000’s he initiated the deployment of mobile mapping technology. Before and after the acquisition of Tele Atlas by TomTom he has worked as a product manager in the area of maps for advanced driver assistance systems and highly automated driving. 19:00 Talk II - Kevin Novak, Head of Data Science Platform at Uber Kevin currently leads Uber's data science efforts in machine learning and experimentation research, data modeling, and anomaly detection. Since joining Uber in July 2011, Kevin's contributed to Uber's engineering and scientific work as an engineer, scientist, and leader, including being responsible for the invention and implementation of Uber's first dynamic pricing algorithms, supervising development and theory behind the UberPOOL and uberEATS products, and building Uber's embedded data science organization from a team of 1 to a team of 50+. 19:30 Speaker Panel Q&A 20:00 Drinks 21:00 End
- Data Science NL Meetup @ Philips Research
18:00-19:00 - Pizza, drinks, and a data visualization demo. 19:00-20:00 - Data Science at Philips Research • Welcome and Introduction - Milan Petkovic, Department Head Data Science, Philips Research • People-centric Innovation for Smart Cities - Alexandre Sinitsyn , Senior Scientist, Philips Research • Visualizing for Big Data Healthcare and Lighting Solutions - Jeanne de Bont, Senior Design Consultant, Philips Design 20:00-20:15 - Break 20:15-21:00 - Lightning talks • Catching the unicorn: building data science teams - Duncan Ross, Director of data science, Teradata • Applying topological data analysis to healthcare & life sciences - Damir Herman, Senior data scientist, Ayasdi 21:00-21:30 - Drinks Please bring a valid ID.
- Philips Data Analytics Workshops
Philips Research would like to invite a number of people from the Data Science NL Meetup community to create new ideas for data-driven value propositions based on Philips-specific use cases. There will be 3 parallel workshop sessions: • Healthcare • Consumer Lifestyle(e.g. smart home) • Lighting(e.g. smart cities, connected lighting) RSVP does not guarantee attendance. You’ll be asked two questions when you RSVP. After the RSVP closes we'll make a selection of 10-20 members to attend the workshops. We'll let you know an exact address and other details once you're selected. Dinner is included for the workshop participants. After the dinner you can also attend the Data Science NL Meetup @ Philips (http://www.meetup.com/DataScienceNL/events/199545442/). It follows a more regular format and is created as a separate event on this website. Please bring a valid ID when you're selected to attend this event.
- TomTom Traffic Data & Data Science at the Command Line
17:30-18:30 - Pizza & Drinks 18:30-19:30 - Data Science at the Command Line Jeroen Janssens, Senior Data Scientist @ YPlan The *nix command line, although invented decades ago, is an amazing environment for doing data science. By combining small, yet powerful, command-line tools we can really explore our data and quickly hack together prototypes. The recent addition of tools such as GNU Parallel, jq, and, Drake, further enables us to be more productive and more efficient data scientists. Installing these command-line tools and setting up an efficient environment is, unfortunately, not straightforward. In the first part of this talk I will present a new open-source project called the Data Science Toolbox, which is a virtual environment that allows you to get started doing data science in minutes. It comes with commonly used software for data science and allows for easy installation of additional tools. Because the Data Science Toolbox runs on top of VirtualBox, it can be installed not only on Linux, but also on Mac OS X and Microsoft Windows. Once you have a solid environment, it is worthwhile to further customize it to your own needs. In the second part of the talk I will explain how to (1) make your environment more efficient and (2) create reusable command-line tools from one-off commands or from existing code in, for example, Python and R. By the end of this talk you will have a solid understanding of how to leverage the power of the command line for your next data science project. Jeroen Janssens is a senior data scientist at YPlan, tonight's going out app, where he's responsible for making event recommendations more personal. Jeroen holds a Ph.D. in Machine Learning from Tilburg University. He is authoring a book called "Data Science at the Command Line", which will be published by O'Reilly in summer 2014. Jeroen enjoys biking the Brooklyn Bridge, building tools, and blogging at http://jeroenjanssens.com (http://jeroenjanssens.com/). He can be found on Twitter @jeroenhjanssens. 19:30-19:45 - Break 19:45-20:45 - TomTom’s use of Traffic Data Ralf-Peter Schäfer, Fellow & VP Traffic and Travel Information Product Unit @ TomTom Every day, TomTom's navigation devices generate about 10 billion data points about the speed and position of cars. TomTom uses this data to provide real-time traffic information, calculate accurate travel times, and much more. Mr. Schäfer will discuss how the data collection process works, how to manage such a big amount of data, and how TomTom creates value from it by sophisticated analysis. 20:45-21:30 - Drinks
- Predictive models for bees, and R on the iPhone.
Attn: the location changed to a nearby, bigger venue (http://www.uu.nl/university/library/EN/contact/university_library/Pages/default2.aspx). The meetup will be in the Booth hall at the University Library Uithof (http://www.uu.nl/university/library/EN/contact/university_library/Pages/default2.aspx). The indoor parking garage (http://www.uu.nl/university/library/EN/contact/university_library/Parkeren/Pages/default.aspx) next door has free parking from 6pm on. 17:30-18:30 - Pizza & Drinks 18:30-19:15 - Prof. Dr. Richard Gill (http://www.math.leidenuniv.nl/~gill/) - Professor of Statistics at Leiden University R on the iPhone and tracing criminals using mobile phone data Prof. Dr. Richard Gill will first talk about experiences of using R on a smartphone. From there he will go on to something completely different: statistical analysis of mobile phone call data records in order to trace criminals through "co-location analysis". While discussing these topics, he will probably expose the audience to his opinions about the emergent field of Data Science. 19:15-19:30 - Break 19:30-20:15 - Prof. Dr. Koos Biesmeijer (https://science.naturalis.nl/en/people/scientists/koos-biesmeijer/) - Scientific Director at the Naturalis Biodiversity Center Biodiversity science in the digital age The archetypal image of a biodiversity researcher is the person chasing a butterfly with a net. Thanks to such activities we now have huge archives of biodiversity information. Naturalis, for example, houses 37 million specimens and the Global Biodiversity Information Facility (GBIF) database contains >500 million data records. Therefore, even though people still chase butterflies, data analysis is becoming more and more important. Particularly also in the molecular age with sequencers spitting out too much data. In his presentation, Prof. Dr. Koos Biesmeijer will show how Naturalis uses data and collections to answer important questions such as: are we still losing biodiversity? Do we have sufficient bees and other insects to pollinate our apples? Also, bringing the general public into science, in citizen science projects, is becoming important and Naturalis uses their collections to develop identification tools so that people can become involved. Recent developments in biodiversity science are fast, yet many more improvements can be made if there is more collaboration with other Data Science fields, e.g. borrowing algorithms from search, computer vision, etc. 20:15-21:00 - Drinks
- Data Science @ Shell
17:30 Arrival 17:30 - 18:45 Drinks/Sandwiches /Snacks 17:45 – 18:15 Group 1 (35 people) to visit the Virtual Reality Center at Shell 18:15 – 18:45 Group 2 (35 people) to visit the Virtual Reality Center at Shell. 18:45 Start of presentations Welcome and introduction to Shell by Johan Krebbers, VP Architecture/Group IT architect at Shell. Stijn Bierman form Shell Statistics & Chemometrics will present a number of examples of Data Science projects in the Oil & Gas industry, ranging from customer ordering patterns to process chemometrics, and inspection strategies for pipework integrity. Leo Berendschot, GM Shell Smart Platforms, will talk about estimating flow rates from producing wells using data-driven models in real-time. 20:30 - 21:00 Drinks The VR Center at Shell is the largest i-scope facility you can see in The Netherlands. Make sure you arrive early if you want to see it. Attendance requires you to RSVP for the event, answer the two questions that you'll see when you RSVP, and bring a valid ID. Without proper registration and a valid ID, you won't be able to attend this meetup.