Confounders and Unconfoundedness

Notations and General Definitions

Some notations for observed data:

Some notations for things that are never observed:

Things that we want: (the expectations below, if not specified, are all approximated by i.i.d)

Average treatment/causal effect: The last line follows because the second-to-last line is the “approximation” we made by taking the sample average. It’s estimating the real expectation. But in reality it can be written either way.

Things we have: Similarly, we have: But they are actually different: Intuitively speaking, they are different because conditioning on or the other way around is restricting to the population receives the treatment. But the population receiving the treatment are, possibly, more likely to have higher potentials. For example, people at higher risk for flu (outcome) are more likely to choose to get a flu shot (treatment, meant to reduce the risk for flu). Also, this is comparing two different populations of people, whereas the true ATE is on same population.

They are equal if and only if This is saying, the potential outcomes are independent of treatment received. More plainly, we shall assign, if possible, treatment randomly, or at least independent of potential outcomesThis does not mean the assignment probability shall be equal in each group, for e.g., 0.2 of assigning a treatment to one of the five groups. In fact, as long as these probabilities are not affected by the potentials, then our assumption holds, even we always have 0.9 probability of giving treatment to one group/invidual . For example, the Federal Government consider allocating subsidies to fix water pollution to some states. Let the state’s water quality be . Then the state shall give these money no matter the state’s current water quality (this will become remain if untreated) is good or bad, and possible water quality improvement.

You probably think that’s not possible, from the perspective of policy makers. What’s more probably is there are some regions s.t. in a region of states with poor water quality, maybe , the government gives them an (somewhat) equally high amount of subsidy. And for another region, say , the government give them an equally low amount of subsidy. This is saying That is, in each group, the treatment assignment is random. These groups may or maybe observed. More often, these groups are called covariate because it affects the assignment and treatment, or confounders.

The condition (2) is often called unconfoundedness, no unmeasured confounders, or ignorability condition.

Other Causal Effects

Causal effects other than average treatment effect are used:

Confounder

Confounders are defined as variables that affect treatment and at the same time directly affect the outcome. Below are examples not the confounder:

In turns let’s have a confounder example:

Note that a set of confounders should not include any descendants of treatment. That is, it shall not block the front door path. A set of variables X is sufficient to control for confounding if:

Also this set of variables is not necessarily unique. For example:

Figure 2. Exmaple of backdoor path
Confounders and Unconfoundedness - Ruizhen Mai