Instrumental Variables (IV) Estimator

deals with the problem of endogeneity, i.e. correlation between a regressor and the error term (which breaks the exogeneity assumption among the Gauss-Markov/OLS Assumptions)
can be interpreted as a Two-Stage Least Squares (2SLS) Estimator
is a biased estimator in finite samples
is a consistent estimator as long as the instrument variable is good (correlated with the regressor, uncorrelated with the error)

IV Estimator - Intuition & Derivation

assume we have:

a univariate linear model:
- 𝑦_𝑖 = 𝜃₀ + 𝜃₁𝑥_𝑖 + 𝑒_𝑖
endogeneity/correlation between 𝑒_𝑖 and 𝑥_𝑖 exists:
- 𝐶𝑜𝑟(𝑒_𝑖,𝑥_𝑖) ≠ 0

intuitively, the least squares estimate 𝜃₁ˆ of true population parameter 𝜃₁ is the observed change in 𝑦 per unit change in 𝑥:

𝜃₁ˆ= 𝛥𝑦/𝛥𝑥
𝜃₁ˆ= (𝛥𝑦_𝑥 + 𝛥𝑦_𝑒)/𝛥𝑥 # because of endogeneity
𝜃₁ˆ= (𝛥𝑦_𝑥/𝛥𝑥) + (𝛥𝑦_𝑒/𝛥𝑥)
𝜃₁ˆ= 𝜃₁ + (𝛥𝑦_𝑒/𝛥𝑥) # population parameter 𝜃₁= (𝛥𝑦_𝑥/𝛥𝑥) by definition

PROBLEM: therefore, the least squares estimate 𝜃₁ˆ is a BIASED estimate of the true population parameter 𝜃₁ because of endogeneity: the extra term 𝛥𝑦_𝑒/𝛥𝑥 does not vanish, even in large samples

SOLUTION: introduce a third variable (instrumental variable) 𝑧_𝑖 such that:

𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) ≠ 0
𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) = 0

next we expand 𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) using the model:

𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) = 𝐶𝑜𝑣(𝑧_𝑖,𝜃₀ + 𝜃₁𝑥_𝑖 + 𝑒_𝑖)
𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) = 𝐶𝑜𝑣(𝑧_𝑖,𝜃₀) + 𝜃₁𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) + 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) # by properties of covariance
𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) = 0 + 𝜃₁𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) + 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) # covariance with constant equals 0
𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) = 𝜃₁𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) + 0 # by above statement 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) = 0

therefore, since 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) ≠ 0, we can divide through:

𝜃₁= 𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) / 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖)

therefore, the IV Estimate 𝜃₁ˆ of the true population parameter 𝜃₁ is defined as:

𝜃₁ˆ= 𝑆𝑎𝑚𝑝𝑙𝑒-𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) / 𝑆𝑎𝑚𝑝𝑙𝑒-𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖)
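
The covariance-ratio formula above can be checked with a small simulation. This is only a sketch under assumed settings: the data-generating process (an unobserved confounder u that enters both 𝑥_𝑖 and 𝑒_𝑖), the coefficient values and the helper names iv_estimate and ols_slope are all illustrative choices, not part of the original notes.

```python
import numpy as np

def iv_estimate(z, x, y):
    """IV slope: sample Cov(z, y) / sample Cov(z, x)."""
    return np.cov(z, y)[0, 1] / np.cov(z, x)[0, 1]

def ols_slope(x, y):
    """OLS slope: sample Cov(x, y) / sample Var(x)."""
    return np.cov(x, y)[0, 1] / np.var(x, ddof=1)

rng = np.random.default_rng(0)
n = 100_000
theta0, theta1 = 1.0, 2.0          # assumed "true" parameters

z = rng.normal(size=n)             # instrument: independent of e
u = rng.normal(size=n)             # unobserved confounder (assumption)
e = u + rng.normal(size=n)         # error term contains u
x = 0.8 * z + u + rng.normal(size=n)   # x depends on z and on u -> Cov(x, e) != 0
y = theta0 + theta1 * x + e

ols = ols_slope(x, y)
iv = iv_estimate(z, x, y)
print(f"true theta1 = {theta1}")
print(f"OLS slope   = {ols:.3f}  (pulled away from theta1 by Cov(x, e))")
print(f"IV slope    = {iv:.3f}")
```

In this setup the OLS slope settles near 𝜃₁ + 𝐶𝑜𝑣(𝑥_𝑖,𝑒_𝑖)/𝑉𝑎𝑟(𝑥_𝑖) rather than 𝜃₁, while the IV ratio stays close to 𝜃₁.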

IV Estimator - Bad/Weak/Good Instrument Variables

IV Type	Conditions of IV Type	IV Estimate is Unbiased	IV Estimate is Consistent
Good Instrument Variables	𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) ≠ 0 and 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) = 0	✘	✔
Bad Instrument Variables	𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) ≠ 0 (whether or not 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) = 0)	✘	✘
Weak Instrument Variables	𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) ≈ 0 and 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) = 0	✘	? (consistent in theory if 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) ≠ 0, but unreliable in finite samples)
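
To make the three rows of the table concrete, here is a hedged simulation sketch. The parameters strength (how strongly 𝑧_𝑖 drives 𝑥_𝑖) and leak (how much of the confounder enters 𝑧_𝑖), the helper draw_iv_estimate, and all numeric values are assumptions made for illustration. Medians and interquartile ranges are reported instead of means because the simple IV ratio can take extreme values when the denominator is near 0.

```python
import numpy as np

def iv_estimate(z, x, y):
    """IV slope: sample Cov(z, y) / sample Cov(z, x)."""
    return np.cov(z, y)[0, 1] / np.cov(z, x)[0, 1]

def draw_iv_estimate(strength, leak, n, rng, theta1=2.0):
    """Simulate one sample and return its IV estimate (illustrative DGP)."""
    u = rng.normal(size=n)                   # unobserved confounder
    z = rng.normal(size=n) + leak * u        # leak != 0 -> Cov(z, e) != 0 (bad)
    e = u + rng.normal(size=n)
    x = strength * z + u + rng.normal(size=n)  # small strength -> weak instrument
    y = 1.0 + theta1 * x + e
    return iv_estimate(z, x, y)

rng = np.random.default_rng(1)
n, reps = 2_000, 500
cases = {"good": (0.8, 0.0), "weak": (0.02, 0.0), "bad": (0.8, 0.5)}

results = {}
for name, (strength, leak) in cases.items():
    results[name] = np.array(
        [draw_iv_estimate(strength, leak, n, rng) for _ in range(reps)]
    )

for name, est in results.items():
    q25, q50, q75 = np.percentile(est, [25, 50, 75])
    print(f"{name:>4}: median = {q50:8.3f}, IQR = {q75 - q25:8.3f}")
```

Under these assumptions the good instrument centres tightly on 𝜃₁ = 2, the bad instrument centres on a wrong value, and the weak instrument produces a far wider spread of estimates.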

IV Estimator - Bias & Consistency

bias

an explanation of why the IV Estimate 𝜃₁ˆ of population parameter 𝜃₁ is biased

assume we have:

a univariate linear model that accurately models the population distribution:
- 𝑦_𝑖 = 𝜃₀ + 𝜃₁𝑥_𝑖 + 𝑒_𝑖
endogeneity/correlation exists between 𝑒_𝑖 and 𝑥_𝑖:
- 𝐶𝑜𝑟(𝑒_𝑖,𝑥_𝑖) ≠ 0

then the IV Estimate 𝜃₁ˆ of population parameter 𝜃₁ is biased, in other words:

𝐄[𝜃₁ˆ] ≠ 𝜃₁

from Instrumental Variable Estimate vs 2 Stage Least Squares Estimate we see that the IV Estimate coincides with the 2SLS Estimate when there is a single instrument. In 2SLS we have 2 stages of LS regression:

𝑥_𝑖 = 𝛿₀ + 𝛿₁𝑧_𝑖 + 𝜀_𝑖 # first-stage regression of the regressor on the instrument
𝑦_𝑖 = 𝜃₀ + 𝜃₁𝑥_𝑖 + 𝑒_𝑖 # original regression model

Assume we KNOW the values of {𝛿₀, 𝛿₁}. Thus:

𝑥_𝑖^{𝑡𝑟𝑢𝑒}= 𝛿₀ + 𝛿₁𝑧_𝑖

plug this into the original regression model:

𝑦_𝑖 = 𝜃₀ + 𝜃₁𝑥_𝑖^{𝑡𝑟𝑢𝑒} + 𝑒_𝑖

because the values of {𝛿₀, 𝛿₁} are KNOWN, 𝑥_𝑖^{𝑡𝑟𝑢𝑒} is a function of 𝑧_𝑖 only and contains none of 𝜀_𝑖. Since 𝑧_𝑖 is uncorrelated with 𝑒_𝑖 (condition of a good instrument), there is no correlation between 𝑒_𝑖 and 𝑥_𝑖^{𝑡𝑟𝑢𝑒}

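In reality, the values of {𝛿₀, 𝛿₁} are UNKNOWN and are estimated with {𝛿₀ˆ, 𝛿₁ˆ} from the first-stage regression. Thus the fitted values are:

𝑥̂_𝑖 = 𝛿₀ˆ + 𝛿₁ˆ𝑧_𝑖

plugging into the original regression model:

𝑦_𝑖 = 𝜃₀ + 𝜃₁𝑥̂_𝑖 + 𝑒_𝑖

because the values of {𝛿₀, 𝛿₁} are UNKNOWN, the estimates {𝛿₀ˆ, 𝛿₁ˆ} are computed from the sampled 𝑥_𝑖 and therefore depend on the sampled 𝜀_𝑖. Because of sampling error {𝛿₀ˆ, 𝛿₁ˆ} ≠ {𝛿₀, 𝛿₁} and 𝑥̂_𝑖 ≠ 𝑥_𝑖^{𝑡𝑟𝑢𝑒}, so 𝑥̂_𝑖 carries over part of 𝜀_𝑖. Since 𝜀_𝑖 is correlated with 𝑒_𝑖 (this is the source of the endogeneity of 𝑥_𝑖), in a finite sample there is some correlation between 𝑒_𝑖 and 𝑥̂_𝑖, which biases the estimate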
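
The two-stage view can be sketched in a few lines. The code below is an illustration under assumed settings (a deliberately small sample, a shared confounder u, made-up coefficients); it fits both stages with np.polyfit, compares the second-stage slope with the covariance-ratio IV estimate, and prints the in-sample covariance between the fitted values and the error.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 50                              # deliberately small so sampling error is visible
theta0, theta1 = 1.0, 2.0           # assumed "true" parameters

z = rng.normal(size=n)              # instrument
u = rng.normal(size=n)              # unobserved confounder (assumption)
eps = u + rng.normal(size=n)        # first-stage error, shares u with e
e = u + rng.normal(size=n)          # structural error
x = 0.5 + 0.8 * z + eps             # first-stage relation
y = theta0 + theta1 * x + e         # original regression model

# Stage 1: regress x on z, keep the fitted values x_hat.
d1_hat, d0_hat = np.polyfit(z, x, 1)    # polyfit returns [slope, intercept]
x_hat = d0_hat + d1_hat * z

# Stage 2: regress y on the fitted values.
two_sls, _ = np.polyfit(x_hat, y, 1)

# Covariance-ratio form of the IV estimate.
iv_ratio = np.cov(z, y)[0, 1] / np.cov(z, x)[0, 1]

print(f"2SLS slope        = {two_sls:.6f}")
print(f"Cov-ratio IV      = {iv_ratio:.6f}")
print(f"sample Cov(x_hat, e) = {np.cov(x_hat, e)[0, 1]:.4f}  (not exactly 0 in a finite sample)")
```

With one instrument the two slopes agree up to floating-point error, and the non-zero sample covariance between 𝑥̂_𝑖 and 𝑒_𝑖 is the finite-sample leakage described above.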

consistency

an explanation of why the IV Estimate 𝜃₁ˆ of population parameter 𝜃₁ is consistent

assume we have:

a univariate linear model that accurately models the population distribution:
- 𝑦_𝑖 = 𝜃₀ + 𝜃₁𝑥_𝑖 + 𝑒_𝑖
endogeneity/correlation exists between 𝑒_𝑖 and 𝑥_𝑖:
- 𝐶𝑜𝑟(𝑒_𝑖,𝑥_𝑖) ≠ 0

then the IV Estimate 𝜃₁ˆ of population parameter 𝜃₁ is consistent, in other words:

𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = 𝜃₁

PROOF

first let's take the definition of the IV Estimate:

𝜃₁ˆ= 𝑆𝑎𝑚𝑝𝑙𝑒-𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) / 𝑆𝑎𝑚𝑝𝑙𝑒-𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖)

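as 𝑛→∞ the sample covariances converge in probability to the population covariances (law of large numbers), and because 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) ≠ 0 their ratio converges as well, so we have: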

𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = 𝐶𝑜𝑣(𝑧_𝑖,𝑦_𝑖) / 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖)
𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = 𝐶𝑜𝑣(𝑧_𝑖,𝜃₀ + 𝜃₁𝑥_𝑖 + 𝑒_𝑖) / 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) # because 𝑦_𝑖 = 𝜃₀ + 𝜃₁𝑥_𝑖 + 𝑒_𝑖
𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = [𝐶𝑜𝑣(𝑧_𝑖,𝜃₀) + 𝜃₁𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) + 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖)] / 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) # by properties of covariance
𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = [0 + 𝜃₁𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) + 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖)] / 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) # covariance with a constant is 0
𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = [0 + 𝜃₁𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) + 0] / 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) # 𝐶𝑜𝑣(𝑧_𝑖,𝑒_𝑖) = 0 by condition of good instrumental variable
𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = 𝜃₁𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖) / 𝐶𝑜𝑣(𝑧_𝑖,𝑥_𝑖)
𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = 𝜃₁

hence, proved
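
The limit derived above can be illustrated numerically. This is a sketch, not a proof: the data-generating process, the seed and the sample sizes are assumptions chosen for the example, and the absolute error of a single draw need not shrink monotonically from one sample size to the next.

```python
import numpy as np

def iv_estimate(z, x, y):
    """IV slope: sample Cov(z, y) / sample Cov(z, x)."""
    return np.cov(z, y)[0, 1] / np.cov(z, x)[0, 1]

rng = np.random.default_rng(3)
theta0, theta1 = 1.0, 2.0           # assumed "true" parameters

errors = {}
for n in (100, 1_000, 10_000, 100_000, 1_000_000):
    z = rng.normal(size=n)          # good instrument: relevant and exogenous
    u = rng.normal(size=n)          # unobserved confounder (assumption)
    e = u + rng.normal(size=n)
    x = 0.8 * z + u + rng.normal(size=n)
    y = theta0 + theta1 * x + e
    est = iv_estimate(z, x, y)
    errors[n] = abs(est - theta1)
    print(f"n = {n:>9,}: IV estimate = {est:.4f}, |error| = {errors[n]:.4f}")
```

As n grows the estimate concentrates around 𝜃₁, which is what 𝑝𝑙𝑖𝑚_𝑛→∞𝜃₁ˆ = 𝜃₁ states.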
