Causal inference in machine learning with Columbia University

This is a past event

180 people went

Two Sigma

100 Avenue of the Americas · New York, NY

How to find us

Our entrance is on 6th Avenue (not Watts Street). The event will be on the 3rd floor.

Details

Causal inference from observational data is a vital problem, but it
comes with strong assumptions. Most methods assume that all confounders
are observed, that is, every variable that correlates with both the
causal variables (e.g., the treatment) and their effect (e.g., the
efficacy of the treatment). However, many scientific studies involve
multiple causes: different variables whose effects are simultaneously
of interest. We propose the deconfounder, an algorithm that combines
unsupervised machine learning and predictive model checking to perform
causal inference in multiple-cause settings. The deconfounder infers a
latent variable as a substitute for unobserved confounders and then
uses that substitute to perform causal inference. We develop theory
for when the deconfounder leads to unbiased causal estimates, and show
that it requires weaker assumptions than classical causal inference.
We analyze its performance in three types of studies: semi-simulated
data around smoking and lung cancer, semi-simulated data around
genome-wide association studies, and a real dataset about actors and
movie revenue. The deconfounder provides a checkable approach to
estimating close-to-truth causal effects.
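
To make the two steps in the abstract concrete (infer a substitute
confounder from the causes, then adjust for it), here is a minimal
sketch in Python on simulated data. It is an illustration only, not
the implementation from the paper: the data-generating process, the
use of plain PCA as the factor model, the linear outcome regression,
and all function names are assumptions made for this example. It also
skips the predictive model check mentioned above. Only the first two
causes are included in the outcome regression (the others have zero
true effect in this simulation), because a substitute that is a linear
function of the causes would be perfectly collinear with the full set
of causes.

```python
import numpy as np


def simulate(n=5000, m=20, gamma=2.0, seed=0):
    """Simulate m causes that share one unobserved confounder z (assumed setup)."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=n)                      # unobserved confounder
    loadings = np.linspace(0.5, 1.5, m)         # how strongly z drives each cause
    causes = np.outer(z, loadings) + rng.normal(size=(n, m))
    beta = np.zeros(m)
    beta[:2] = [1.0, -0.5]                      # only the first two causes matter
    outcome = causes @ beta + gamma * z + rng.normal(size=n)
    return causes, outcome, beta


def substitute_confounder(causes):
    """Step 1: fit a one-factor model (here PCA) and return the factor scores."""
    centered = causes - causes.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[0]                     # score on the first component


def ols(features, outcome):
    """Ordinary least squares with an intercept; returns the slopes only."""
    design = np.column_stack([np.ones(len(outcome)), features])
    coef, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coef[1:]


causes, outcome, beta = simulate()
studied = causes[:, :2]                         # the causes whose effects we estimate

# Naive estimate: ignores the confounder entirely.
naive = ols(studied, outcome)

# Step 2: adjust for the substitute confounder inferred from all causes.
z_hat = substitute_confounder(causes)
adjusted = ols(np.column_stack([studied, z_hat]), outcome)[:2]

print("true effects:    ", beta[:2])
print("naive estimate:  ", naive)
print("with substitute: ", adjusted)
```

In this simulated setting the naive coefficients absorb part of the
confounder's influence, while the adjusted ones should land close to
the true effects. Real data would call for a richer factor model and
the model check that this sketch omits.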

This is joint work with David Blei.

https://arxiv.org/abs/1805.06826

Event Schedule:

- Doors at 6:15 pm (there will be someone downstairs checking you in)
- Talk begins promptly at 7 pm with Q&A following
- Networking & Drinks!

Food & beverages will be available.

------- Sponsored by Comet.ml ---------
Comet.ml is doing for ML what GitHub did for software development. We allow data science teams to automagically track their datasets, code changes, experimentation history, and production models, creating efficiency, transparency, and reproducibility.
---------------------------------------------------

About our speaker:

Yixin Wang is a PhD candidate in the Statistics Department of Columbia
University, advised by Professor David Blei. Her research interests
lie in Bayesian statistics, machine learning, and causal inference.
She obtained her BSc in Mathematics and Computer Science from the Hong
Kong University of Science and Technology. Her research has received
several awards, including the ASA Biometrics student paper award, the
INFORMS data mining best paper award, and the ICSA conference young
researcher award.