Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf ·...

88
Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben Zamar Department of Statistics UBC IV March 12, 2014 1 / 33

Transcript of Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf ·...

Page 1: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

Instrumental-Variable

Ruben ZamarDepartment of Statistics UBC

March 12, 2014

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 1 / 33

Page 2: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE LINEAR REGRESSION MODEL

Ordinary linear regression model

y = 1α+ Xβ+ ε

Design Matrix

Xc =

x11- x̄1 x12- x̄2 · · · x1p- x̄px21- x̄1 x22- x̄2 · · · x2p- x̄px31- x̄1 x32- x̄2 · · · x3p- x̄p...

......

xn1- x̄1 xn2- x̄2 · · · xnp- x̄p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 2 / 33

Page 3: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE LINEAR REGRESSION MODEL

Ordinary linear regression model

y = 1α+ Xβ+ ε

Design Matrix

Xc =

x11- x̄1 x12- x̄2 · · · x1p- x̄px21- x̄1 x22- x̄2 · · · x2p- x̄px31- x̄1 x32- x̄2 · · · x3p- x̄p...

......

xn1- x̄1 xn2- x̄2 · · · xnp- x̄p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 2 / 33

Page 4: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE LS ESTIMATOR

Ordinary Least Squares Estimate

β̂LS =(X ′cXc

)−1 X ′cy , α̂LS = y − x′ β̂LS

Σ̂xx =1nX ′cXc (sample covariance matrix)

Σ̂xy =1nX ′cy

β̂LS= Σ̂−1xx Σ̂xy

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 3 / 33

Page 5: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE LS ESTIMATOR

Ordinary Least Squares Estimate

β̂LS =(X ′cXc

)−1 X ′cy , α̂LS = y − x′ β̂LS

Σ̂xx =1nX ′cXc (sample covariance matrix)

Σ̂xy =1nX ′cy

β̂LS= Σ̂−1xx Σ̂xy

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 3 / 33

Page 6: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE LS ESTIMATOR

Ordinary Least Squares Estimate

β̂LS =(X ′cXc

)−1 X ′cy , α̂LS = y − x′ β̂LS

Σ̂xx =1nX ′cXc (sample covariance matrix)

Σ̂xy =1nX ′cy

β̂LS= Σ̂−1xx Σ̂xy

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 3 / 33

Page 7: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

KEY ASSUMPTIONS

All covariates are exogenous

X and ε are independent

Errors have zero mean

E (ε) = 0

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 4 / 33

Page 8: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

KEY ASSUMPTIONS

All covariates are exogenous

X and ε are independent

Errors have zero mean

E (ε) = 0

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 4 / 33

Page 9: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS UNBIASED

E{

β̂LS

}= E

{E{

β̂LS |X}}

= E{(X ′cXc

)−1 X ′c E {y|X}}

= E{(X ′cXc

)−1 X ′c E {1α+ Xβ+ ε|X}}

= E

= 0︷ ︸︸ ︷(

X ′cXc)−1 X ′c1α+

= I︷ ︸︸ ︷(X ′cXc

)−1 X ′cXβ+(X ′cXc

)−1 X ′c= 0︷ ︸︸ ︷E {ε}

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 5 / 33

Page 10: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS UNBIASED

E{

β̂LS

}= E

{E{

β̂LS |X}}

= E{(X ′cXc

)−1 X ′c E {y|X}}

= E{(X ′cXc

)−1 X ′c E {1α+ Xβ+ ε|X}}

= E

= 0︷ ︸︸ ︷(

X ′cXc)−1 X ′c1α+

= I︷ ︸︸ ︷(X ′cXc

)−1 X ′cXβ+(X ′cXc

)−1 X ′c= 0︷ ︸︸ ︷E {ε}

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 5 / 33

Page 11: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS UNBIASED

E{

β̂LS

}= E

{E{

β̂LS |X}}

= E{(X ′cXc

)−1 X ′c E {y|X}}

= E{(X ′cXc

)−1 X ′c E {1α+ Xβ+ ε|X}}

= E

= 0︷ ︸︸ ︷(

X ′cXc)−1 X ′c1α+

= I︷ ︸︸ ︷(X ′cXc

)−1 X ′cXβ+(X ′cXc

)−1 X ′c= 0︷ ︸︸ ︷E {ε}

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 5 / 33

Page 12: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS UNBIASED

E{

β̂LS

}= E

{E{

β̂LS |X}}

= E{(X ′cXc

)−1 X ′c E {y|X}}

= E{(X ′cXc

)−1 X ′c E {1α+ Xβ+ ε|X}}

= E

= 0︷ ︸︸ ︷(

X ′cXc)−1 X ′c1α+

= I︷ ︸︸ ︷(X ′cXc

)−1 X ′cXβ+(X ′cXc

)−1 X ′c= 0︷ ︸︸ ︷E {ε}

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 5 / 33

Page 13: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS UNBIASED

E{

β̂LS

}= E

{E{

β̂LS |X}}

= E{(X ′cXc

)−1 X ′c E {y|X}}

= E{(X ′cXc

)−1 X ′c E {1α+ Xβ+ ε|X}}

= E

= 0︷ ︸︸ ︷(

X ′cXc)−1 X ′c1α+

= I︷ ︸︸ ︷(X ′cXc

)−1 X ′cXβ+(X ′cXc

)−1 X ′c= 0︷ ︸︸ ︷E {ε}

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 5 / 33

Page 14: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS CONSISTENT

β̂LS = Σ̂−1xx Σ̂xy −→ Σ−1xx Σxy , by LLN

Σ−1xx Σxy = Σ−1xx Cov (x,y)

= Σ−1xx Cov(x,α+ x′β+ ε

)

= Σ−1xx

Σxx β︷ ︸︸ ︷Cov

(x, x′β

)+ Σ−1xx

0︷ ︸︸ ︷Cov (x,ε)

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 6 / 33

Page 15: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS CONSISTENT

β̂LS = Σ̂−1xx Σ̂xy −→ Σ−1xx Σxy , by LLN

Σ−1xx Σxy = Σ−1xx Cov (x,y)

= Σ−1xx Cov(x,α+ x′β+ ε

)

= Σ−1xx

Σxx β︷ ︸︸ ︷Cov

(x, x′β

)+ Σ−1xx

0︷ ︸︸ ︷Cov (x,ε)

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 6 / 33

Page 16: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS CONSISTENT

β̂LS = Σ̂−1xx Σ̂xy −→ Σ−1xx Σxy , by LLN

Σ−1xx Σxy = Σ−1xx Cov (x,y)

= Σ−1xx Cov(x,α+ x′β+ ε

)

= Σ−1xx

Σxx β︷ ︸︸ ︷Cov

(x, x′β

)+ Σ−1xx

0︷ ︸︸ ︷Cov (x,ε)

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 6 / 33

Page 17: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS CONSISTENT

β̂LS = Σ̂−1xx Σ̂xy −→ Σ−1xx Σxy , by LLN

Σ−1xx Σxy = Σ−1xx Cov (x,y)

= Σ−1xx Cov(x,α+ x′β+ ε

)

= Σ−1xx

Σxx β︷ ︸︸ ︷Cov

(x, x′β

)+ Σ−1xx

0︷ ︸︸ ︷Cov (x,ε)

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 6 / 33

Page 18: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

LS IS CONSISTENT

β̂LS = Σ̂−1xx Σ̂xy −→ Σ−1xx Σxy , by LLN

Σ−1xx Σxy = Σ−1xx Cov (x,y)

= Σ−1xx Cov(x,α+ x′β+ ε

)

= Σ−1xx

Σxx β︷ ︸︸ ︷Cov

(x, x′β

)+ Σ−1xx

0︷ ︸︸ ︷Cov (x,ε)

= β

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 6 / 33

Page 19: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

Some covariates (called endogenous ) are correlated with the errorterm

This problem arises in several contexts:

Errors-in-variables: Some covariates are measured with errors

Omitted covariates: Some model covariates are correlated withunoserved predictors

Simultaneity: Some covariates simultaneously affect and areaffected by the response variable

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 7 / 33

Page 20: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

Some covariates (called endogenous ) are correlated with the errorterm

This problem arises in several contexts:

Errors-in-variables: Some covariates are measured with errors

Omitted covariates: Some model covariates are correlated withunoserved predictors

Simultaneity: Some covariates simultaneously affect and areaffected by the response variable

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 7 / 33

Page 21: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

Some covariates (called endogenous ) are correlated with the errorterm

This problem arises in several contexts:

Errors-in-variables: Some covariates are measured with errors

Omitted covariates: Some model covariates are correlated withunoserved predictors

Simultaneity: Some covariates simultaneously affect and areaffected by the response variable

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 7 / 33

Page 22: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

Some covariates (called endogenous ) are correlated with the errorterm

This problem arises in several contexts:

Errors-in-variables: Some covariates are measured with errors

Omitted covariates: Some model covariates are correlated withunoserved predictors

Simultaneity: Some covariates simultaneously affect and areaffected by the response variable

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 7 / 33

Page 23: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

Some covariates (called endogenous ) are correlated with the errorterm

This problem arises in several contexts:

Errors-in-variables: Some covariates are measured with errors

Omitted covariates: Some model covariates are correlated withunoserved predictors

Simultaneity: Some covariates simultaneously affect and areaffected by the response variable

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 7 / 33

Page 24: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

The (additive) asymptotic bias due to endogeneity is given by

AB = Σ−1xx Cov (x,ε)

=Cov (x , ε)Var (x)

(single linear regression)

= Corr (x , ε)sd (ε)sd (x)

(single linear regression)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 8 / 33

Page 25: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

The (additive) asymptotic bias due to endogeneity is given by

AB = Σ−1xx Cov (x,ε)

=Cov (x , ε)Var (x)

(single linear regression)

= Corr (x , ε)sd (ε)sd (x)

(single linear regression)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 8 / 33

Page 26: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ENDOGENEITY

The (additive) asymptotic bias due to endogeneity is given by

AB = Σ−1xx Cov (x,ε)

=Cov (x , ε)Var (x)

(single linear regression)

= Corr (x , ε)sd (ε)sd (x)

(single linear regression)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 8 / 33

Page 27: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ERRORS IN VARIABLES (EIV)

A simple example:

yi = 1+ 2vi + εi , i = 1, 2, ..., n

xi = vi + ei (error-in-variable)

To fix ideas suppose that

vi , εi and ei are independent

vi ∼ N (0, 1) , εi , ei ∼ N (0, 0.5)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 9 / 33

Page 28: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

ERRORS IN VARIABLES (EIV)

A simple example:

yi = 1+ 2vi + εi , i = 1, 2, ..., n

xi = vi + ei (error-in-variable)

To fix ideas suppose that

vi , εi and ei are independent

vi ∼ N (0, 1) , εi , ei ∼ N (0, 0.5)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 9 / 33

Page 29: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

EIV (continued)

yi = 1+ 2vi + εi

= 1+ 2 (xi − ei ) + εi

= 1+ 2xi + (εi − 2ei )= 1+ 2xi + ∆i

∆i = εi − 2ei

cov (xi ,∆i ) = −2× 0.5 = −1 (endogeneity)

Var (xi ) = 1.5

AB =cov (xi ,∆i )Var (xi )

= − 11.5

= −0.667

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 10 / 33

Page 30: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

EIV (continued)

yi = 1+ 2vi + εi

= 1+ 2 (xi − ei ) + εi

= 1+ 2xi + (εi − 2ei )= 1+ 2xi + ∆i

∆i = εi − 2ei

cov (xi ,∆i ) = −2× 0.5 = −1 (endogeneity)

Var (xi ) = 1.5

AB =cov (xi ,∆i )Var (xi )

= − 11.5

= −0.667

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 10 / 33

Page 31: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

EIV (continued)

yi = 1+ 2vi + εi

= 1+ 2 (xi − ei ) + εi

= 1+ 2xi + (εi − 2ei )= 1+ 2xi + ∆i

∆i = εi − 2ei

cov (xi ,∆i ) = −2× 0.5 = −1 (endogeneity)

Var (xi ) = 1.5

AB =cov (xi ,∆i )Var (xi )

= − 11.5

= −0.667

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 10 / 33

Page 32: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OMITTED VARIABLES

Suppose that the “true”model is

yi = α+ β1

included︷︸︸︷x1i + β2

omitted︷︸︸︷x2i +

independent errors︷︸︸︷εi

To fix ideas suppose that

Var(x1i ) = Var (x2i ) = 1, var (εi ) = 0.1

andCov (x1i , x2i ) = 0.4,

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 11 / 33

Page 33: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OMITTED VARIABLES

Suppose that the “true”model is

yi = α+ β1

included︷︸︸︷x1i + β2

omitted︷︸︸︷x2i +

independent errors︷︸︸︷εi

To fix ideas suppose that

Var(x1i ) = Var (x2i ) = 1, var (εi ) = 0.1

andCov (x1i , x2i ) = 0.4,

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 11 / 33

Page 34: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OMITTED VARIABLES (continued)

Suppose variable x2 is omitted and we fit the model

yi = α+ βx1i + ∆i

∆i = β2x2i + εi

cov (∆i , x1i ) = 0.4β2 (endogeneity)

AB =cov (x1i ,∆i )Var (x1i )

= 0.4β2

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 12 / 33

Page 35: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OMITTED VARIABLES (continued)

Suppose variable x2 is omitted and we fit the model

yi = α+ βx1i + ∆i

∆i = β2x2i + εi

cov (∆i , x1i ) = 0.4β2 (endogeneity)

AB =cov (x1i ,∆i )Var (x1i )

= 0.4β2

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 12 / 33

Page 36: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OMITTED VARIABLES (continued)

Suppose variable x2 is omitted and we fit the model

yi = α+ βx1i + ∆i

∆i = β2x2i + εi

cov (∆i , x1i ) = 0.4β2 (endogeneity)

AB =cov (x1i ,∆i )Var (x1i )

= 0.4β2

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 12 / 33

Page 37: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

SOME EXAMPLES OF ENDOGENEITY

Risk factors of coronary heart diseaseCarrol, Enciclopedy of Biostatistics, 2005: patients’s long-termblood pressure may be measured with error (errors-in-variables)

Effect of smoking on birth weightPermutt and Hebel, Biometrics, 1989: smoking by pregnantmothers may be correlated with unobserved socioeconomic factorswhich affect their infant birth weight (omitted covariates)

Atherosclerotic cardiovascular diseaseHolmes et al., PloS ONE, 2010: C-reactive protein (CPR) maycause atherogenesis or may be produced by inflammation withinatherosclerotic plaque. Thus CPR may be a cause and a result ofatherosclerosis (simultaneity)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 13 / 33

Page 38: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

SOME EXAMPLES OF ENDOGENEITY

Risk factors of coronary heart diseaseCarrol, Enciclopedy of Biostatistics, 2005: patients’s long-termblood pressure may be measured with error (errors-in-variables)

Effect of smoking on birth weightPermutt and Hebel, Biometrics, 1989: smoking by pregnantmothers may be correlated with unobserved socioeconomic factorswhich affect their infant birth weight (omitted covariates)

Atherosclerotic cardiovascular diseaseHolmes et al., PloS ONE, 2010: C-reactive protein (CPR) maycause atherogenesis or may be produced by inflammation withinatherosclerotic plaque. Thus CPR may be a cause and a result ofatherosclerosis (simultaneity)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 13 / 33

Page 39: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

SOME EXAMPLES OF ENDOGENEITY

Risk factors of coronary heart diseaseCarrol, Enciclopedy of Biostatistics, 2005: patients’s long-termblood pressure may be measured with error (errors-in-variables)

Effect of smoking on birth weightPermutt and Hebel, Biometrics, 1989: smoking by pregnantmothers may be correlated with unobserved socioeconomic factorswhich affect their infant birth weight (omitted covariates)

Atherosclerotic cardiovascular diseaseHolmes et al., PloS ONE, 2010: C-reactive protein (CPR) maycause atherogenesis or may be produced by inflammation withinatherosclerotic plaque. Thus CPR may be a cause and a result ofatherosclerosis (simultaneity)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 13 / 33

Page 40: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE METHOD OF

INSTRUMENTAL VARIABLES

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 14 / 33

Page 41: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OVERVIEW

First introduced in Appendix B of a book authored by Philip G.Wright, “TheTariff on Animal and Vegetable Oils”, published in1928.

Some people attribute the authorship of Appendix B to Philip’s eldestson, Sewal Wright

The method consists of using “instruments”(variables not includedin the model) to consistently estimate the regression coeffi cients ofendogenous covariatesVery useful to investigate possible causal relations in observationalstudies

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 15 / 33

Page 42: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OVERVIEW

First introduced in Appendix B of a book authored by Philip G.Wright, “TheTariff on Animal and Vegetable Oils”, published in1928.

Some people attribute the authorship of Appendix B to Philip’s eldestson, Sewal Wright

The method consists of using “instruments”(variables not includedin the model) to consistently estimate the regression coeffi cients ofendogenous covariatesVery useful to investigate possible causal relations in observationalstudies

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 15 / 33

Page 43: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OVERVIEW

First introduced in Appendix B of a book authored by Philip G.Wright, “TheTariff on Animal and Vegetable Oils”, published in1928.

Some people attribute the authorship of Appendix B to Philip’s eldestson, Sewal Wright

The method consists of using “instruments”(variables not includedin the model) to consistently estimate the regression coeffi cients ofendogenous covariates

Very useful to investigate possible causal relations in observationalstudies

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 15 / 33

Page 44: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OVERVIEW

First introduced in Appendix B of a book authored by Philip G.Wright, “TheTariff on Animal and Vegetable Oils”, published in1928.

Some people attribute the authorship of Appendix B to Philip’s eldestson, Sewal Wright

The method consists of using “instruments”(variables not includedin the model) to consistently estimate the regression coeffi cients ofendogenous covariatesVery useful to investigate possible causal relations in observationalstudies

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 15 / 33

Page 45: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

INSTRUMENTAL VARIABLES

“Instruments” are variables

zi =

z1iz2i...zqi

i = 1, 2, ...n, q ≥ p

zi is correlated with the endogenous covariateszi is uncorrelated with the error termrank of Cov (zi ) = qrank of Cov (zi , xi ) = p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 16 / 33

Page 46: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

INSTRUMENTAL VARIABLES

“Instruments” are variables

zi =

z1iz2i...zqi

i = 1, 2, ...n, q ≥ p

zi is correlated with the endogenous covariates

zi is uncorrelated with the error termrank of Cov (zi ) = qrank of Cov (zi , xi ) = p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 16 / 33

Page 47: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

INSTRUMENTAL VARIABLES

“Instruments” are variables

zi =

z1iz2i...zqi

i = 1, 2, ...n, q ≥ p

zi is correlated with the endogenous covariateszi is uncorrelated with the error term

rank of Cov (zi ) = qrank of Cov (zi , xi ) = p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 16 / 33

Page 48: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

INSTRUMENTAL VARIABLES

“Instruments” are variables

zi =

z1iz2i...zqi

i = 1, 2, ...n, q ≥ p

zi is correlated with the endogenous covariateszi is uncorrelated with the error termrank of Cov (zi ) = q

rank of Cov (zi , xi ) = p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 16 / 33

Page 49: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

INSTRUMENTAL VARIABLES

“Instruments” are variables

zi =

z1iz2i...zqi

i = 1, 2, ...n, q ≥ p

zi is correlated with the endogenous covariateszi is uncorrelated with the error termrank of Cov (zi ) = qrank of Cov (zi , xi ) = p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 16 / 33

Page 50: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE CENTERED INSTRUMENT MATRIX

Zc =

z11- z1 z12- z2 · · · z1p- z̄pz21- z̄1 z22- z̄2 · · · z2p- z̄pz31- z̄1 z32- z̄2 · · · z3p- z̄p...

......

zn1- z̄1 zn2- z̄2 · · · znp- z̄p

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 17 / 33

Page 51: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE METHOD

First, project the model covariates on the space generated by theinstruments

X̂c =

projection matrix︷ ︸︸ ︷[Zc(Z ′cZc

)−1 Z ′c] Xc = PzXc

X̂c represents the part of Xc uncorrelated with ε

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 18 / 33

Page 52: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE METHOD

First, project the model covariates on the space generated by theinstruments

X̂c =

projection matrix︷ ︸︸ ︷[Zc(Z ′cZc

)−1 Z ′c] Xc = PzXcX̂c represents the part of Xc uncorrelated with ε

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 18 / 33

Page 53: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE METHOD (CONTINUED)

Second, apply ordinary LS but using the design matrix X̂c instead ofXc :

β̂IV =(X̂ ′c X̂c

)−1X̂cy

=(X ′cPzXc

)−1 X ′cPzy=

(X ′cZc

(Z ′cZc

)−1 Z ′cXc)−1 X ′cZc (Z ′cZc )−1 Z ′cy=

(Σ̂xz Σ̂−1zz Σ̂zx

)−1Σ̂xz Σ̂−1zz Σ̂zy ⇐= KEY OBSERVATION

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 19 / 33

Page 54: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE METHOD (CONTINUED)

Second, apply ordinary LS but using the design matrix X̂c instead ofXc :

β̂IV =(X̂ ′c X̂c

)−1X̂cy

=(X ′cPzXc

)−1 X ′cPzy=

(X ′cZc

(Z ′cZc

)−1 Z ′cXc)−1 X ′cZc (Z ′cZc )−1 Z ′cy

=(

Σ̂xz Σ̂−1zz Σ̂zx)−1

Σ̂xz Σ̂−1zz Σ̂zy ⇐= KEY OBSERVATION

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 19 / 33

Page 55: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE METHOD (CONTINUED)

Second, apply ordinary LS but using the design matrix X̂c instead ofXc :

β̂IV =(X̂ ′c X̂c

)−1X̂cy

=(X ′cPzXc

)−1 X ′cPzy=

(X ′cZc

(Z ′cZc

)−1 Z ′cXc)−1 X ′cZc (Z ′cZc )−1 Z ′cy=

(Σ̂xz Σ̂−1zz Σ̂zx

)−1Σ̂xz Σ̂−1zz Σ̂zy

⇐= KEY OBSERVATION

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 19 / 33

Page 56: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

THE METHOD (CONTINUED)

Second, apply ordinary LS but using the design matrix X̂c instead ofXc :

β̂IV =(X̂ ′c X̂c

)−1X̂cy

=(X ′cPzXc

)−1 X ′cPzy=

(X ′cZc

(Z ′cZc

)−1 Z ′cXc)−1 X ′cZc (Z ′cZc )−1 Z ′cy=

(Σ̂xz Σ̂−1zz Σ̂zx

)−1Σ̂xz Σ̂−1zz Σ̂zy ⇐= KEY OBSERVATION

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 19 / 33

Page 57: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL PROPERTIES

β̂IV is consistent and asymptotically normal under mild regularityassumptions

But data may contain outliers in the

response variable yi

covariates xi

instruments zi

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 20 / 33

Page 58: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL PROPERTIES

β̂IV is consistent and asymptotically normal under mild regularityassumptions

But data may contain outliers in the

response variable yi

covariates xi

instruments zi

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 20 / 33

Page 59: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL PROPERTIES

β̂IV is not robust (sensitive to outliers in the response, covariatesand/or instruments)

β̂IV has unbounded influence function

β̂IV has zero breakdown point

β̂IV fails robustness-tests in simulations

β̂IV has poor performance in some real data examples due tooutliers.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 21 / 33

Page 60: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL PROPERTIES

β̂IV is not robust (sensitive to outliers in the response, covariatesand/or instruments)

β̂IV has unbounded influence function

β̂IV has zero breakdown point

β̂IV fails robustness-tests in simulations

β̂IV has poor performance in some real data examples due tooutliers.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 21 / 33

Page 61: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL PROPERTIES

β̂IV is not robust (sensitive to outliers in the response, covariatesand/or instruments)

β̂IV has unbounded influence function

β̂IV has zero breakdown point

β̂IV fails robustness-tests in simulations

β̂IV has poor performance in some real data examples due tooutliers.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 21 / 33

Page 62: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL PROPERTIES

β̂IV is not robust (sensitive to outliers in the response, covariatesand/or instruments)

β̂IV has unbounded influence function

β̂IV has zero breakdown point

β̂IV fails robustness-tests in simulations

β̂IV has poor performance in some real data examples due tooutliers.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 21 / 33

Page 63: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL PROPERTIES

β̂IV is not robust (sensitive to outliers in the response, covariatesand/or instruments)

β̂IV has unbounded influence function

β̂IV has zero breakdown point

β̂IV fails robustness-tests in simulations

β̂IV has poor performance in some real data examples due tooutliers.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 21 / 33

Page 64: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

APPLICATION OFINSTRUMENTAL VARIABLESIN MEDICAL RESEARCH

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 22 / 33

Page 65: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OVERVIEWTHE FRAMINGHAM HEART STUDY

Prior work suggests that high blood pressure can increase the size ofthe left atrial dimension(Vaziri et al., 1995; Milan et al., 2009; Benjamin et al., 1995).

We use IV to measure the effect of long-term systolic blood pressure(SBP) on left atrial size (LAD).

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 23 / 33

Page 66: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

OVERVIEWTHE FRAMINGHAM HEART STUDY

Prior work suggests that high blood pressure can increase the size ofthe left atrial dimension(Vaziri et al., 1995; Milan et al., 2009; Benjamin et al., 1995).

We use IV to measure the effect of long-term systolic blood pressure(SBP) on left atrial size (LAD).

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 23 / 33

Page 67: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

DATA

Our sample includes 164 male subjects from the original cohort whounderwent their 16th biennial examination and satisfy the inclusioncriteria employed by Vaziri et al. (1995):

patient doesn’t have a history of heart disease,patient is not taking cardiovascular medicationpatient has complete clinical data

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 24 / 33

Page 68: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

STATISTICAL MODEL

Following Vaziri et al. (1995), we consider the following model:

Variable Description Variable StatusLAD Left atrial dimension ResponseSBP16 long-term systolic Main covariate

blood pressure Endogenous(Examination # 16) (measured with error)

BMI Body mass index Control covariateExogenous

AGE Subject’s age Control covariateExogenous

SBP15 long-term systolic Instrumentblood pressure(Examination # 15)

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 25 / 33

Page 69: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

RESULTS

SBP BMI AGECOEFF. 0.011 3.195 0.023P-VALUE 0.684 <0.001 0.685

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 26 / 33

Page 70: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

APPLICATION OFINSTRUMENTAL VARIABLESIN LABOR ECONOMICS

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 27 / 33

Page 71: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

APPLICATION OF RIV TO LABOR UNIONS DATA

Our dataset contains 181 industries in the United States for the year2003.

Unionization rates are obtained from the Union Membership andCoverage Database

The fraction of male workers comes from the Census PopulationSurvey

The remaining variables are constructed using data from Compustat

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 28 / 33

Page 72: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

APPLICATION OF RIV TO LABOR UNIONS DATA

Our dataset contains 181 industries in the United States for the year2003.

Unionization rates are obtained from the Union Membership andCoverage Database

The fraction of male workers comes from the Census PopulationSurvey

The remaining variables are constructed using data from Compustat

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 28 / 33

Page 73: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

APPLICATION OF RIV TO LABOR UNIONS DATA

Our dataset contains 181 industries in the United States for the year2003.

Unionization rates are obtained from the Union Membership andCoverage Database

The fraction of male workers comes from the Census PopulationSurvey

The remaining variables are constructed using data from Compustat

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 28 / 33

Page 74: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

APPLICATION OF RIV TO LABOR UNIONS DATA

Our dataset contains 181 industries in the United States for the year2003.

Unionization rates are obtained from the Union Membership andCoverage Database

The fraction of male workers comes from the Census PopulationSurvey

The remaining variables are constructed using data from Compustat

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 28 / 33

Page 75: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

APPLICATION OF RIV TO LABOR UNIONS DATA

Our dataset contains 181 industries in the United States for the year2003.

Unionization rates are obtained from the Union Membership andCoverage Database

The fraction of male workers comes from the Census PopulationSurvey

The remaining variables are constructed using data from Compustat

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 28 / 33

Page 76: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

EFFECT OF LABOR UNIONS ON FIRMS’PROFITS

Variable Name Variable Description Variable StatusROA Accounting profits Response variable

scaled by assets

UNION Fraction of workers Endogenous, main covariatethat are unionized

GROWTH Sales growth rate Exogenous, control covariate

CAPEX Capital expenditures Exogenous, control covariatescaled by assets

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 29 / 33

Page 77: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

EFFECT OF LABOR UNIONS ON FIRMS’PROFITS

Variable Name Variable Description Variable StatusSIZE Fixed assets (log scale) Exogenous, control covariate

LEV Total debt scaled Exogenous, control covariateby assets

KL Capital-labor ratio Exogenous, control covariate

MALE Fraction of male workers Instrument

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 30 / 33

Page 78: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

UNION

Why is UNION endogenous?

There are two possible reasons

1. UNION and ROA are likely to be simultaneously determined

Industry higher performance may stimulate unionization to get a betterprofits shareIt is conjectured that higher unionization cause industry performance todecline

2. There may be unobserved industry features that determine performanceand are correlated with unionization

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 31 / 33

Page 79: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

UNION

Why is UNION endogenous?

There are two possible reasons

1. UNION and ROA are likely to be simultaneously determined

Industry higher performance may stimulate unionization to get a betterprofits shareIt is conjectured that higher unionization cause industry performance todecline

2. There may be unobserved industry features that determine performanceand are correlated with unionization

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 31 / 33

Page 80: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

UNION

Why is UNION endogenous?

There are two possible reasons

1. UNION and ROA are likely to be simultaneously determined

Industry higher performance may stimulate unionization to get a betterprofits shareIt is conjectured that higher unionization cause industry performance todecline

2. There may be unobserved industry features that determine performanceand are correlated with unionization

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 31 / 33

Page 81: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

UNION

Why is UNION endogenous?

There are two possible reasons

1. UNION and ROA are likely to be simultaneously determined

Industry higher performance may stimulate unionization to get a betterprofits share

It is conjectured that higher unionization cause industry performance todecline

2. There may be unobserved industry features that determine performanceand are correlated with unionization

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 31 / 33

Page 82: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

UNION

Why is UNION endogenous?

There are two possible reasons

1. UNION and ROA are likely to be simultaneously determined

Industry higher performance may stimulate unionization to get a betterprofits shareIt is conjectured that higher unionization cause industry performance todecline

2. There may be unobserved industry features that determine performanceand are correlated with unionization

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 31 / 33

Page 83: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

UNION

Why is UNION endogenous?

There are two possible reasons

1. UNION and ROA are likely to be simultaneously determined

Industry higher performance may stimulate unionization to get a betterprofits shareIt is conjectured that higher unionization cause industry performance todecline

2. There may be unobserved industry features that determine performanceand are correlated with unionization

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 31 / 33

Page 84: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

MALE

Why is MALE a reasonable instrument for UNION?

males have more permanent attachment to the labor market and tointernal job ladders⇒ males derive higher wage/non-wage benefits from unionizing⇒ males are more likely to become unionized.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 32 / 33

Page 85: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

MALE

Why is MALE a reasonable instrument for UNION?

males have more permanent attachment to the labor market and tointernal job ladders

⇒ males derive higher wage/non-wage benefits from unionizing⇒ males are more likely to become unionized.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 32 / 33

Page 86: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

MALE

Why is MALE a reasonable instrument for UNION?

males have more permanent attachment to the labor market and tointernal job ladders⇒ males derive higher wage/non-wage benefits from unionizing

⇒ males are more likely to become unionized.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 32 / 33

Page 87: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

MALE

Why is MALE a reasonable instrument for UNION?

males have more permanent attachment to the labor market and tointernal job ladders⇒ males derive higher wage/non-wage benefits from unionizing⇒ males are more likely to become unionized.

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 32 / 33

Page 88: Instrumental-Variable - University of British Columbiaruben/website/Santalo_School_2014_I.pdf · Instrumental-Variable Ruben Zamar Department of Statistics UBC March 12, 2014 Ruben

RESULTS

UNION GROWTH CAPEX SIZE LEV KL

coeff. -0.271 0.126 0.197 0.003 0.094 0.004p-value 0.128 0.013 0.000 0.312 0.005 0.313

Ruben Zamar Department of Statistics UBC () IV March 12, 2014 33 / 33