#13 Introduction to PyData (English talk)

Toulouse Data Science

As a Data Scientist you are often reading or writing code in R and/or Python for your day-to-day data analysis tasks. But why do Data Scientists love coding in Python?

Python has an extremely rich and healthy ecosystem of data science tools (known as the PyData ecosystem). Python's popularity for data science is largely due to the strength of its core libraries and its high productivity for prototyping and building small, reusable systems. Unfortunately, this ecosystem can be a bit confusing for those new to Python, and even for experienced programmers moving to Python for its excellent data analysis capabilities.

The PyData ecosystem often confuses us too, so Peadar Coyle, Senior Data Scientist at Channel 4, will present a detailed look, with examples, at some of the cool tools out there.

He will touch on pure Python, NumPy, Pandas, Blaze, xray, bcolz, Dask, and Spark, with a focus on the use-cases for each one. What do you do when your data doesn't fit in memory? When do you need a functional programming approach? When do you need compression? Where does Dask fit into all of this? When do you need Spark? Peadar gives us all the answers ;-)
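
To give a flavour of the "data doesn't fit in memory" question, here is a minimal pure-Python sketch. It is not taken from Peadar's talk: the helper names `iter_chunks` and `streaming_mean`, the `minutes` column and the tiny in-memory sample are all made up for illustration. It streams rows through generators and keeps only one small chunk in memory at a time, which is roughly the idea that chunked and out-of-core tools build on.

```python
import csv
import io
from itertools import islice


def iter_chunks(rows, size):
    """Yield successive lists of at most `size` items from any iterable."""
    rows = iter(rows)
    while True:
        chunk = list(islice(rows, size))
        if not chunk:
            return
        yield chunk


def streaming_mean(rows, column, chunk_size=2):
    """Compute the mean of `column` while holding one chunk in memory."""
    total = 0.0
    count = 0
    for chunk in iter_chunks(rows, chunk_size):
        total += sum(float(row[column]) for row in chunk)
        count += len(chunk)
    return total / count if count else float("nan")


# Hypothetical stand-in for a large CSV file on disk.
sample = io.StringIO("user,minutes\na,10\nb,20\nc,30\nd,40\ne,50\n")
reader = csv.DictReader(sample)  # reads rows lazily, one at a time
mean_minutes = streaming_mean(reader, "minutes")
print(mean_minutes)
```

With a real file you would pass an open file handle to `csv.DictReader` instead of the `io.StringIO` sample; the chunk size is a tuning knob, not a recommendation.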

Bio: Peadar Coyle (@Springcoil) is a data scientist and author specialising in applying robust statistical or machine learning models to big/medium/small data to extract business value such as new revenue streams or business process optimisation. He joined the excellent Channel 4 team in early April, as a Senior Data Scientist working on recommendations and customer segmentation.
He has experience with Machine Learning, Bayesian Statistics, and other cool stuff. He's worked on analytics projects at Amazon.com and Vodafone, and at startups like Import.io and JobToday. His recent book “Interviews with Data Scientists” is available at https://leanpub.com/interviewswithdatascientists


- 18:30 - Welcome and TDS news

- 19:00 - Peadar Coyle Talk

- 20:15 - Strata Conf London pass draw

- 20:20 - Networking time, food and beverages for data geeks :-)

Many thanks to our sponsors:

A special thanks to Toulouse Business School (http://www.tbs-education.fr/en) for hosting our data community.

O'Reilly Strata supports our community with conference discounts, books and goodies.

Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing are shaping the course of business and society. It brings together the world’s best data scientists and business leaders to share hard-won knowledge and innovations in technology and strategy. Check out the impressive program and make plans to join Strata + Hadoop World in London, 31 May-3 June 2016. Save 20% on most passes with discount code UGTDS20. http://www.oreilly.com/pub/cpc/5510


Meetups may be filmed and the audience photographed throughout the event. By attending these meetups, you authorise the publication of photos on our Toulouse Data Science Meetup site. This authorisation does not cover the use of your image for advertising purposes.