Details

From open match data to the warehouse: World Cup Football Analytics
With Rabin Duran

Every few years attention swings to national teams, but the most
granular analytics still start from structured match and event data.
How do you move from flat extracts to queryable tables that analysts
can use? And when you care about both domestic leagues and
international windows, how do you keep the questions aligned with the
data (lineups and events vs. results and goals alone)?

This talk walks through a football analytics pipeline built on open
StatsBomb-style data reduced to Parquet, uploaded to Google Cloud
Storage, and loaded into BigQuery, with the ingestion and cloud steps
shown in R (arrow or readr, googleCloudStorageR, bigrquery). On top of
those tables, we outline a club vs international comparison using
match-level metadata where the data actually supports per-player,
per-context contrasts.

A shorter second thread uses historical international results and goal
records from public CSVs for country and tournament-level patterns.
There we show a small dashboard tuned to that international dataset. We
finish with hands-on exploration: a Looker Studio view on BigQuery so
the audience can see how analysts browse the same tables the pipeline
publishes, followed by a peek at a simple model or prediction sketch.

Sampling and Monte Carlo in R
With Conor Morrison

This presentation will briefly review some concepts in statistics and show how mathematically tricky problems can be solved approximately by sampling random variables in R and computing functions of those samples.
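
As a taste of the idea, here is a minimal sketch (not taken from the talk itself) that estimates pi by sampling uniform random points and averaging a function of them. The seed, the sample size `n`, and the variable names are illustrative assumptions.

```r
# Monte Carlo sketch: estimate pi by sampling points uniformly in the unit square.
set.seed(42)   # fixed seed so the run is reproducible (arbitrary choice)
n <- 100000    # number of random points (illustrative choice)

x <- runif(n)
y <- runif(n)

# 1 if the point lands inside the quarter circle of radius 1, else 0
inside <- as.numeric(x^2 + y^2 <= 1)

# The quarter circle has area pi / 4, so the mean of the indicator estimates pi / 4
pi_hat <- 4 * mean(inside)

# Approximate standard error of the estimate, from the sample standard deviation
se <- 4 * sd(inside) / sqrt(n)

cat("estimate:", pi_hat, "standard error:", se, "\n")
```

The standard error shrinks in proportion to 1 / sqrt(n), so each extra digit of accuracy costs roughly 100 times more samples.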

**************************************************************
You are welcome to join us for some food and drinks at a nearby restaurant after we have finished.
**************************************************************

***

  1. If this is your first R-Ladies event, please take a moment to review our R-Ladies Global code of conduct: https://github.com/rladies/starter-kit/wiki/Code-of-Conduct
  2. Who is allowed to attend the event? Everyone is welcome to join us at our events and to be a part of our mission to empower gender minorities with the skills and knowledge to code in R, one of the top programming languages in data science.
  3. To fulfill this mission, we ask participants to respect a focus on encouraging and supporting women and non-binary people during our events.

See you there!
Jasmine and Melissa

Sponsors

R Consortium

Provide support to the organizations developing and using R software

Hothead Games

Venue

RStudio

Food

Microsoft

Venue
