[talk] Pydata Lausanne December 2023

![[talk] Pydata Lausanne December 2023](https://secure.meetupstatic.com/photos/event/1/4/7/1/highres_507065233.webp?w=750)
Details
Welcome to December sesssion of PyData Lausanne talk!
👇 👇👇
1st talk, by Oleg Nenashev
short bio: https://github.com/oleg-nenashev/oleg-nenashev/tree/main/speaking
API and Data Mocking for Python
Abstract:
When working in embedded and observability domains, I’ve used Python scripting to retrieve and pre-process data from external sources, and one of the issues I’ve seen is the difficulty to reliably test data pipelines against external services: API limits and pay-per-use costs, service outages, etc, etc. So, can we model (aka “mock”) the services to reliably test our data ingestion pipelines?. Sure we can!
In this talk I will show a few ways to build test services, databases and API providers with the help of Testcontainers and WireMock available on Python, thanks to container tech. Then, we will extend the approach by adding the generation of fake data with help of Faker libraries or Synthesized that can be used for both relational data and data sequences.
2nd talk, by Chris Hemmens
Model-X Knockoffs
Abstract:
Model-X Knockoffs is a technique proposed in 2018 by Emmanuel Candès, Professor of Mathematics and Statistics at Stanford, designed to improve feature selection in machine learning models by creating a set of statistical doppelgängers, adding them to the set of features, and then giving both sets to the model. If the model identifies a feature as important, but also its doppelgänger, it’s likely that the feature has been selected due to statistical noise rather than any predictive power.
--- --- --- --- --- --- --- --- --- ---
##
References
- WireMock and Python: https://wiremock.org/docs/solutions/python/
- Testcontainers for Python: https://testcontainers-python.readthedocs.io/
- Synthesized: https://www.synthesized.io/
- Testcontainers modules:
- WireMock: https://testcontainers.com/modules/wiremock/
- Synthesized: https://testcontainers.com/modules/synthesized/
- PostgreSQL: https://testcontainers.com/modules/postgresql/
--- --- --- --- --- --- --- --- --- ---
NOTES
- PLEAS RSVP: If your status changes, change your RSVP accordingly! This enables us to manage reservations.
- If you are on the waiting list: we will do our best to increase the number of seats, so stay tuned

[talk] Pydata Lausanne December 2023