Causal Assumption: SUTVA and Ignorability

Understanding SUTVA and ignorability, fundamental assumptions in causal inference, what they mean, and what happens when they are violated.

  1. Individual treatment effect is estimable if we have parallel universes: two outcomes observed for each unit. we just subtract.
  2. SUTVA + RCT → estimable ATE.
  3. SUTVA + ignorability + overlap + well specified linear model → estimable ATE.

Rubin Potential Outcomes Framework and individual treatment effect

SUTVA, RCT, ATE

Examples of SUTVA Violations for “One Version”

2. No Interference

Examples of “Interference” (SUTVA Violation)

What Happens When SUTVA is Violated?

When SUTVA is violated, our estimates of the Average Treatment Effect (ATE) become biased. Specifically, the standard difference-in-means estimator $\hat{\tau} = \bar{Y}{treated} - \bar{Y}{control}$ no longer purely measures the effect of the treatment itself. It now also includes the “spillover” effects from other units’ assignments.

In many marketplaces (like Uber or Lyft), this bias is typically negative. If we treat some users and they consume all the available drivers, the control group users will have worse outcomes (longer wait times) than they would have in a pure control world. This makes the treatment look better than it actually is.

How to Handle SUTVA Violations (Interference)

In modern tech, interference is the most common and difficult SUTVA violation. To deal with it, data scientists use alternative experimental designs:

  1. Cluster Randomization: Instead of randomizing at the user level, we randomize at a higher level where interference is less likely (e.g., randomizing by city or by geographic neighborhood).
  2. Switchback (Time-Series) Experiments: We randomize the entire system between treatment and control states over different time windows.
  3. Graph-based Randomization: If we know the social or network connections between units, we can use graph-partitioning algorithms to ensure that treated units are mostly surrounded by other treated units (and same for control).

ATE by RCT

Ignorability: When Does Linear Regression Yield Causal Inference?

RCT is the gold standard

Observational studies

Ignorability

To make causal claims from observational data, we need the Ignorability Assumption (also known as “Unconfoundedness” or “Selection on Observables”):

Overlap

Well specified linear model

If ignorability and overlap hold, we can estimate the causal effect by adjusting for $X$. A common way is using linear regression:

$$ Y_i = \alpha + \tau W_i + \beta X_i + \epsilon_i $$

In this model, $\tau$ can be interpreted as the causal effect if the model correctly captures the relationship between $X$ and $Y$.

References