39th meetup



NOTE: a valid photo ID is required



Note: Please use your full real names when signing up, otherwise you may be refused entry!

As always, there'll be free beer and pizza, generously provided by our host AHL.

We are still experimenting with issuing tickets via a lottery - if you want to be in with a chance of a place - sign up for the waitlist! The lottery will be run approx 1 week before the meetup, and we will re-run the lottery to fill any spaces that free up or use the waitlist towards the time of the event.


Amit Kumar on Data Pipelining with Luigi

This talk is an introduction to Luigi, which is a Python package that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, handling failures, command line integration, and much more.

Dhruv Ghulati on Automated fact checking - technical approaches and challenges

This talk is about:

• History of fact checking

• The current state of the art

• Who is working on this?

• What is next in this field/developments?

• What tools/fields in NLP are similar and what the implications of automated fact-checking tech are?


Lightning talks:

Parul Sethi on visual analysis of topic models

I will talk about how to interactively explore Topic models and it's entities: documents, topics and words for aiding the downstream NLP applications.

Chenfu Wang on pyexcel

What is the cost to add the support of a new excel file format to your production code? And how quickly could it be achieved? Using pyexcel, it is effortless add an additional file format, i.e. ods, xlsx. To add a branch new file format of tomorrow, it is significantly less risky as no code changes are required inside your production code. All you need is either an existing pyexcel plugin or a new one. In this talk, I will go through the architecture of pyexcel and its capabilities. I will also provide a top tip for the existing pyexcel users in dealing with excel file of significant size.



Doors open at 6.30 (get there early as you have to sign-in via AHL's security), talks start at 7 pm, beers from 9 pm in the bar. We normally have > 200 folks in the room so there's plenty of people to discuss data science questions with!

Please unRSVP in good time if you realise you can't make it. We're limited by building security on the number of attendees, so please free up your place for your fellow community members!

Follow @pydatalondon (https://twitter.com/pydatalondon) for updates and early announcements. See you on the 7th!

Attendees (260)