World Cup Football Analytics + Sampling and Monte Carlo in R
From open match data to the warehouse: World Cup Football Analytics
With Rabin Duran
Every few years attention swings to national teams, but the most granular analytics still start from structured match and event data. How
do you move from flat extracts to queryable tables analysts can use? And
when you care about both domestic leagues and international windows, how do you keep the questions aligned with the data (lineups and events vs.
results and goals alone)?
This talk walks through a football analytics pipeline built on open
StatsBomb-style data: reduced to Parquet, uploaded to Google Cloud
Storage, and loaded into BigQuery, with the ingestion and cloud steps
shown in R (arrow or readr, googleCloudStorageR, bigrquery). On top of
those tables, we outline a club vs. international comparison using
match-level metadata, limited to cases where the data actually supports
per-player, per-context contrasts.
A shorter second thread uses historical international results and goal
records from public CSVs for country- and tournament-level patterns.
There we show a small dashboard tuned to that international dataset. We
finish with hands-on exploration: a Looker Studio view on BigQuery, so
the audience can see how analysts browse the same tables the pipeline
publishes, and a peek at a simple model or prediction sketch.
Sampling and Monte Carlo in R
With Conor Morrison
This presentation will briefly review some concepts in statistics, and show how mathematically tricky problems can be solved, approximately, by sampling random variables in R and computing functions of these samples.
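To give a flavour of the idea, here is a minimal sketch in base R. It is our own illustration and not taken from the talk: it estimates the expected value of the larger of two independent standard normal variables by sampling, then compares the estimate with the known closed form 1/sqrt(pi). The variable names, the sample size, and the choice of problem are all assumptions made for this example.

```r
# Illustrative sketch only (not from the talk): Monte Carlo estimate of
# E[max(X, Y)] for independent X, Y ~ N(0, 1).
set.seed(42)                     # fix the seed so the run is reproducible

n <- 100000                      # number of simulated pairs (arbitrary choice)
x <- rnorm(n)                    # draws of X
y <- rnorm(n)                    # draws of Y

m <- pmax(x, y)                  # function of the samples: element-wise maximum
estimate  <- mean(m)             # sample mean approximates the expectation
std_error <- sd(m) / sqrt(n)     # Monte Carlo standard error of that mean

exact <- 1 / sqrt(pi)            # closed-form value for this particular problem

cat("Monte Carlo estimate:", round(estimate, 4), "\n")
cat("Standard error:      ", round(std_error, 4), "\n")
cat("Exact value:         ", round(exact, 4), "\n")
```

The same three steps (sample, compute a function of the samples, average) carry over to problems with no closed form, where the standard error is the main guide to how far the estimate can be trusted.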
**************************************************************
You are welcome to join us for some food and drinks at a nearby restaurant after we have finished.
**************************************************************
***
- If this is your first R-Ladies event, please take a moment to review our R-Ladies Global code of conduct: https://github.com/rladies/starter-kit/wiki/Code-of-Conduct
- Who is allowed to attend the event?
Everyone is welcome to join us at our events and to be a part of our mission to empower gender minorities with the skills and knowledge to code in R, one of the top programming languages in data science.
To fulfill this mission, we ask participants to respect the focus on encouraging and supporting women and non-binary people during our events.
See you there!
Jasmine and Melissa




