Causal Inference with Modern Machine Learning Methods

Course Description

Across disciplines, causal inference is a cornerstone of science, engineering, economics, and public policy. In medicine, we would like to understand how a patient might have responded if we offered a different treatment. In engineering system design and optimization, we would like to understand how the system would behave if we made different design choices. In public policy, we are constantly asking if different taxes, laws, regulations, or programs might improve (or hurt) society at large. Correctly answering such questions can help us make more informed and better decisions. Data are frequently collected and analyzed in the process of seeking statistically quantified answers to these questions. In contemporary applications, these data are decidedly large-scale, complex, and high-dimensional. These call for an urgent need in designing modern statistical/machine learning methods for causal inference.

Recently, an exciting set of tools at the intersection of causal inference and machine learning has emerged to tackle these types of questions in these settings. This course is a doctoral-level introduction to these tools. Our emphasis is primarily on studying the statistical/machine learning tools to formally analyze these types of methods, with the goal of empowering students to have an in-depth understanding of cutting-edge research in this area and contribute their own new (theoretically justified) methods to the field.

The first 2.5 weeks of the course will focus on basic concepts and methods in causal inference at the level of the Imbens and Rubin (2015) book. For the rest of the semester, we will move to recent developments for causal inference using modern machine learning methods and the content will primarily be based on recent research papers.

Learning Objectives

Upon successful completion of this course, students will be able to:

Explain the core concepts and challenges in causal inference under the potential outcomes framework;
Apply the most recent developments of modern statistical/machine learning methods to some core causal inference problems;
Demonstrate and improve the ability to develop and justify the statistical/machine learning methods with mathematical rigor when applying/adapting them to causal inference questions.

Causal Inference Overview

August 23, 2025

Understanding SUTVA and ignorability, fundamental assumptions in causal inference, what they mean, and what happens when they are violated.

causal-basics

Causal Assumption: SUTVA and Ignorability

August 24, 2025

Understanding SUTVA and ignorability, fundamental assumptions in causal inference, what they mean, and what happens when they are violated.

causal-basics

Causal Inference vs Causal Estimation

August 25, 2025

A deep dive into confidence intervals—what they are, how to interpret them, some caveats, and oft-encountered issues in online experimentation.

experimentation ab-testing metrics

Violation of SUTVA in A/B Testing: network interference

August 26, 2025

A summary of the Lyft Engineering blog post 'Interference Across a Network' detailing how naive A/B testing can bias effect estimates in ridesharing.

experimentation

Switchback Experiments

August 27, 2025

An overview of switchback (time-split) experiments: what they are, why they are used to solve network interference, and their trade-offs.

experimentation

Synthetic Control and Experimentation Culture

August 28, 2025

When standard experiments fail: utilizing Synthetic Controls, managing experimentation culture, and understanding various treatment effects.

causal-inference ab-testing experimentation synthetic-control

Case Study: Causal Effect of ETA Reduction

August 29, 2025

A practical case study on measuring the causal effect of reducing Estimated Time of Arrival (ETA) in a ridesharing marketplace.

causal-inference case-study marketplace experimentation

Case Study: Causal Effect of ETA Reduction

August 31, 2025

A practical case study on measuring the causal effect of reducing Estimated Time of Arrival (ETA) in a ridesharing marketplace.

causal-inference case-study marketplace experimentation

Paper review: Statistical Challenges in Online Controlled Experiments

August 31, 2025

A review of Larsen et al. (2024) on the statistical landscape and challenges of A/B testing in large-scale online environments.

a-b-testing online-experiments sutva sequential-testing

Deriving the IPW Estimator: From One RCT to Infinite RCTs under Unconfoundedness

September 1, 2025

An intuitive derivation of the Inverse Probability Weighting (IPW) estimator from a single RCT to multiple RCTs and observational data.

ate-estimation ipw

FWL Theorem

September 25, 2025

Frisch-Waugh-Lovell theorem

fwl-theorem

Augmented IPW and Double Robustness

October 15, 2025

AIPW estimator, double robustness, and cross-fitting

causal-inference aipw double-robustness

Lecture 16: Canonical Gradient and Efficient Influence Curve

February 20, 2026

Notes by Rachael Phillips for PB HLTH 290, Spring 2019

causal-inference asymptotic-efficiency efficiency-theory

Paper review: Improving randomized controlled trial analysis via data-adaptive borrowing

February 20, 2026

A deep dive into how machine learning and adaptive lasso can enhance RCTs by selectively borrowing information from external controls.

causal-inference medical-statistics adaptive-lasso machine-learning

Case Study: Diagnosing and Addressing a Metric Drop

March 3, 2026

An end-to-end framework for investigating MAU drops, targeting at-risk users, and making data-driven 'ship' decisions.

a-b-testing case-study product-analytics metrics

Potential Outcome Framework and Matching Estimators

March 3, 2026

An introduction to the Neyman-Rubin causal model and matching methods

causal-inference matching potential-outcomes

Quick and Dirty Sample Size Calculation

March 3, 2026

experimentation ab-testing metrics

A/B Testing Metrics

March 22, 2026

A comprehensive guide to selecting and evaluating metrics in A/B testing and online experimentation.

experimentation

Summary: Challenges in Experimentation (Lyft)

March 23, 2026

A summary of the Lyft Engineering blog post 'Challenges in Experimentation' by John Kirn.

causal-inference ab-testing experimentation

Paper review: Performance Guarantees for Individualized Treatment Rules

April 26, 2026

A review of Qian and Murphy (2011) on formulating individualized treatment rules via conditional outcome maximization with performance guarantees.

policy-learning paper-review

Policy Learning

April 26, 2026

Empirical Welfare Maximization, Policy Evaluation, and Regret Bounds

policy-learning causal-inference optimal-treatment

Course Description

Learning Objectives

Related Posts

Causal Inference Overview

Causal Assumption: SUTVA and Ignorability

Causal Inference vs Causal Estimation

Violation of SUTVA in A/B Testing: network interference

Switchback Experiments

Synthetic Control and Experimentation Culture

Case Study: Causal Effect of ETA Reduction

Case Study: Causal Effect of ETA Reduction

Paper review: Statistical Challenges in Online Controlled Experiments

Deriving the IPW Estimator: From One RCT to Infinite RCTs under Unconfoundedness

FWL Theorem

Augmented IPW and Double Robustness

Lecture 16: Canonical Gradient and Efficient Influence Curve

Paper review: Improving randomized controlled trial analysis via data-adaptive borrowing

Case Study: Diagnosing and Addressing a Metric Drop

Potential Outcome Framework and Matching Estimators

Quick and Dirty Sample Size Calculation

A/B Testing Metrics

Summary: Challenges in Experimentation (Lyft)

Paper review: Performance Guarantees for Individualized Treatment Rules

Policy Learning

Project Updates