Python Data Science June Meetup

This is a past event

41 people went


Looking forward to seeing you all at our next meetup!

-- Dataset of the Month --
TMDB Box Office Prediction
We're going to make you an offer you can't refuse: a Kaggle competition!

In a world... where movies made an estimated $41.7 billion in 2018, the film industry is more popular than ever. But what movies make the most money at the box office? How much does a director matter? Or the budget? For some movies, it's "You had me at 'Hello.'" For others, the trailer falls short of expectations and you think "What we have here is a failure to communicate."

In this competition, you're presented with metadata on over 7,000 past films from The Movie Database to try and predict their overall worldwide box office revenue. Data points provided include cast, crew, plot keywords, budget, posters, release dates, languages, production companies, and countries. You can collect other publicly available data to use in your model predictions, but in the spirit of this competition, use only data that would have been available before a movie's release.

Join in, "make our day", and then "you've got to ask yourself one question: 'Do I feel lucky?'"

--Data Presentations--

We will have two open slots for our users to present their analysis techniques for this dataset. The presentation format is very open-ended so you can focus on just EDA, machine learning, or data visualization. The only requirement is that you use Python (ideally 3.4+) in a notebook format to present!

4 pm- Welcome
4:15 pm- Dataset Presentation - Justin Richie
4:30 pm- OPEN SLOT
5:00 pm- OPEN SLOT
5:30 pm- Networking

If you want to present, please email me at [masked]

Look forward to seeing everyone!