PyData November Meetup
Details
Dear all,
the PyData November Meetup is coming up. The main focus this time will be on data processing pipelines and task orchestration. So, instead of "what can you do with data?", which might be enough for one-off loads, we tackle the question "how do you get the data?" Especially for data that keeps coming in reguarly – dirty, and constantly changing as data (imho) always is.
Our main talks are:
• Steffen Wenz: Helping travelers make better hotel choices - 500 million times a month
• Philipp Pahl and Anne Matthies : Task orchestration without a director - using Amazon Simple Workflow with boto3 for ETL pipelines at scale. (World premiere of an open source python3-boto3-AWS simple workflow package)
Also, we removed the attendee limit. This is an experiment. It's frustrating to plan for X people and then only X - Y show up, and we can only guess Y. TrustYou, our sponsor, tries to provide drinks and snacks for all attendees, but it's hard to plan if we don't know how many of the RSVPs really show up. So please, if you know that you can't make it, please change your RSVP!
So please, if you know that you can't make it, please change your RSVP!
By removing the limit we risk that you only get half a cola and have to sit on each others lap... No risk, no fun :-)
After the two main talks, we want to discuss concepts and approaches. If you think that you could add a lightning talk: please let us know via the meetup comments.
Sorry for not having a summary and bio ready yet for Philipp's and my talk. We will update it here, but currently, we focus on the first release of our open source code...
Update your RSVP! We are excited about this meetup and look forward to seeing you!
Liebe Grüße, Greetings,
Anne
---
The event is sponsored by TrustYou (http://www.trustyou.com).
http://photos3.meetupstatic.com/photos/sponsor/d/3/4/c/iab120x90_1674092.jpeg
---
Steffen Wenz – Helping travelers make better hotel choices - 500 million times a month
TrustYou analyzes online hotel reviews to create a summary for every hotel in the world. What do travelers think of the service? Is this hotel suitable for business travelers? TrustYou data is integrated on countless websites (Trivago, Wego, Kayak), helping travelers make better choices. Try it out yourself on http://www.trust-score.com/
TrustYou runs almost exclusively on Python. Every week, we find 3 million new hotel reviews on the web, process them, analyze the text using Natural Language Processing, and update our database of 600,000 hotels. In this talk, Steffen will give insights into how Python is used at TrustYou to collect, analyze and visualize these large amounts of data.
Steffen is CTO at TrustYou. He joined the company in 2008 as a student, and worked on scaling TrustYou's NLP solutions to 20 languages in his master thesis. Today, he works with a team of 20 data scientists and software engineers to make an impact in the online travel industry.
---
Location, St. Oberholz Zehdenicker:
http://photos3.meetupstatic.com/photos/event/3/7/1/a/600_441014106.jpeg
