NABE EBA DECK 2

ikiki22i110i X...XXY ε+β++β+β+β=

εβ +=

=

XY

ni ,...,1

jji

i

X

Y β=∂∂

k22221

k11211

2

1

X...XX1

X...XX1

XY

Y

Y

=

=MMMM

62

Estimation of regression parameters:-- Least squares-- Maximum likelihood

1nxn

2

1

1x)1k(k

1

0

)1k(nxnk2n1n1nxn X...XX1Y

ε

εε

=ε

β

ββ

=β

+

+

MM

MMMM

Endogenous variables --Dependent variables, values of which are determined within the system

Exogenous variables --Variables determined outside the system butinfluence the system by affecting the values of the endogenous variables

63

of the endogenous variables

Structural parameters --Estimated using econometric techniques andrelevant data

Lagged endogenous variables

Lagged exogenous variables

Predetermined variables

(1) Degrees-of Freedom Problem

Available data imply do not include enough observations to allow adequate or reasonable estimates of the model

64

(2) Collinearity Problem

Tendency of the data to bunch or move together, e.g., in time-series data, the variables tend to exhibit the same trends, cyclical and secular, over time; interdependence among regressors in the model

(3) Serial Correlation Problem

Predominantly a problem with time-series data; underlying changes occur very slowly over time to the extent that the stochastic disturbance term represents conditions relevant to the model but not explicitly accounted for (such as omitted variables); serial correlation represents the

65

variables); serial correlation represents the dependence of stochastic disturbance terms in different periods.

(4) Structural Change Problem

Parameters may not be constant over time

(5) Errors-in-Measurement Problem

Data subject to various inaccuracies or biases; for example, GDP figures are revised periodically. Such changes necessitate data manipulations to make them comparable and

66

manipulations to make them comparable and

consistent over time.

• Structural specification versus forecasting

• Functional form

• Single-equation versus multi-equation specifications

• Time and effort to construct models

• Choice of econometric or time-series models

67

“It will be remembered that the 70 translators of the Septuagint were shut up in 70 separate rooms with the Hebrew text and brought out with them, when they emerged, 70 identical translations. Would the same miracle be vouchsafed if 70 multiple correlators were shut up with the same statistical material?”

-- John Maynard Keynes

� Model specification entails the expression of theor etical constructs in mathematical terms

� This phase of applied econometrics constitutes the model building activity

� In essence, model specification is the translation of theoretical constructs into mathematical/statistical forms.

68

constructs into mathematical/statistical forms.� Fundamental principles in model building

� The principle of parsimony (other things the same, simple models generally are preferable to complex models, especially in forecasting)

� The shrinkage principle (imposing restrictions eith er on estimated parameters or on forecasts often improves model performance)

� The KISS principle (Zellner, 2001), “Keep it Sophis tically Simple.”

� Often in applied econometrics, analysts have interest in estimating demand relationships, particularly for commodities

� Analysts may wish to estimate the demand for

69

� Analysts may wish to estimate the demand for cosmetic products, automobiles, various food products, or various beverages.

� The Demand Curve shows the theoretical relation between price and quantity demanded, holding all other factors constant.

� Axes: price is on y -axis, quantity on the x -axis

70

� Axes: price is on y -axis, quantity on the x -axis

� Example: Demand curve for Lipton tea,

Q=2500 – 500P

� Key question: How are these numbers obtained?

Average price per package

Demand Curve

Q = 2,500 – 500P

P = 5 – .002Q

5

4

3

71

2

1

500 1000 1500 2000 2500

packages of Lipton tea

� Movement along a given demand curve reflects a change in price and quantity of the commodity in question.

� Answer the following question: What happens to the quantity demanded if the price changes (exogenously) holding all other factors constant?

72

holding all other factors constant?

� Law of demand:

� The demand curve is negatively sloped.

� An increase (a decrease) in own price leads to a decrease (an increase) in quantity demanded.

� A very important empirical finding in economics.

� Prices of other products

� Income

� Advertising (positive information)

� Food Recalls or Food Scares (negative

73

� Food Recalls or Food Scares (negative information)

� Health and nutrition factors

� Lifestyle factors

� Tastes and preferences

1. Q = a-bP2. Q = a0 – a1P + a2I + a3A + a4PS

own-price effect income effect advertising price of substitute product

74

� The coefficients a 0, a1, a2, a3, and a 4 are labeled the demand parameters.

� We expect certain signs and magnitudes of the demand parameters according to economic theory.

� Different versions of the econometric model for applied analysis are possible.

own-price effect(-)

income effect(+)

advertising effect

(+)

price of substitute product(+)

� The starting point then rests on the development of a linear regression model.

� Philosophical differences exist among statisticians and applied econometric analysts.

� Attention centered on signs and magnitudes of the coefficients inherent in the model.

75

coefficients inherent in the model.

� Model specifications in applied econometrics are non-unique.

� Model specifications provide the blueprint for action.

� The effect of unionization on earnings hypothesis:� Unionization increases real wages

lnw = 0.28lnQ + 0.77Z + 0.84lnP + 0.228U- 0.16UZ(0.056) (0.28) (0.32) (0.094) (1.10)

-1.94UlnP + CONSTANT(0.99)

� R2= 0.891

76

� R2= 0.891� W = relative wages (average hourly compensation in

unionized industries relative to compensation in nonunionized industries)

� Q = relative value of output� Z = unemployment rate� P = measure of expected inflation� U = measure of the extent of union membership� UZ, UlnP = interaction variables

� Specification based on change rather than levels, e .g., estimation of a demand equation for new automobiles in the United States

85.0R

PS827.0)m/p(234.0S507.0I106.0115.0A

2)261.0()088.0()086.0()011.0(

=

−∆−∆−∆+=∆

77

� A = annual retail sales of new passenger automobile s� change in annual retail sales� change in real disposable income� change in the stock of passenger automobiles� change in average retail price deflated by the

average duration of automobile loans� PS= dummy variable to take account of years of sever e

production shortages

=∆A

=∆I

=∆S

=∆ )m/p(

Special type of algebraic model, namely stochastic (as opposed to deterministic)

� linear or nonlinear in parameters

� inherently linear versus nonlinear models

78

� inherently linear versus nonlinear models

� additive stochastic disturbance term that plays� the role of a chance medium

In general, each equation of an econometric model, other than definitions, equilibrium conditions, and identities, is assumed to contain an additive disturbance term.

An inherently linear model is a model which is linear in parameters.

βββ

ε

++=

=

kk

k

XXXgXXXgyF

XXXFY

),...,,(),...,,()(

),,...,,(

212221110

21

79

Polynomial model, exponential model, reciprocal model, semi-logarithmic model, interaction model

εσσσσ

εσ

εβ

σσσ

+++++=

=

+++

kk

k

kkk

XXXY

XXXYge

XXXg

k

ln...lnlnlnln

)exp(....,.

),...,,(...

22110

210

21

21

Show that by appropriate transformations of variables (e.g. logarithms, reciprocals) the following nonlinear models relating y to the explanatory variables x1 and x2 and the stochastic disturbance term u can be converted to models that are linear in parameters

u22

11 exAxY ββ=1.

(Constant elasticity)

80

122110

22110

)uxx(Y

)uxxexp(Y

−+β+β+β=

+β+β+β=

(Constant elasticity)

2.(exponential)

3.(reciprocal)

Stochastic, a random variable

Statistical distribution often normal

Captures:

81

Captures:

1. Omission of the influence of other variables

2. Measurement error

1. Model linear in parameters

2.

3.(a) Homoscedasticity(b) No serial or autocorrelation

0)(E =ε

IE T 2)( σεε =

82

(b) No serial or autocorrelation

4. X is a nx(k+1) matrix of fixed numbers

5. Rank of X = r(X) = k+1 < n

6.

[ ] [ ] 0EXXE TT =ε=ε⇒

),0(~ 2IN σε

1. Computational cost

2. Least squares

3. R2

4. Unbiasedness

5. Minimum variance

6. Minimum mean squared error

83

6. Minimum mean squared error

7. Asymptotic properties

8. Maximum likelihood

Choice of parameter estimates that maximize the probability of observing the data; requires assumptions of a statistical distribution

Often the normal distribution is the statistical distribution of choice

84

Residuals

Minimization of the sum of squared residuals

OLS estimator1T2OLS

T1T

)XX(s)ˆ(VAR

YX)XX(ˆ−

−

=β=β

Not sensitive to outliers

More generally these estimators are not sensitive to departures from assumptions

Minimize the sum of the absolute values of the

85

Minimize the sum of the absolute values of the residuals:

MAD (Mean Absolute Deviation)

LAR (Least Absolute Residuals)

LAE (Least Absolute Error)

1. Sum of squared residuals (SSE)

2. Regression sum of squares (SSR)

3. Corrected total sum of squares (SST)

( )2

2ˆ

∑

∑=

−=

i

ii

eSSE

YYSSE

( )2ˆ∑ −= YYSSR i

86

3. Corrected total sum of squares (SST)

4. SSE + SSR = SST

5. R2 = SSR/SST

6. F-statistics, t-statistics

7. Tests of hypotheses, construction of confidence intervals

( )2

∑ −= YYSST i

)1)(1/1(1

/1/

2

22

22

2

R

RR

RknnR

SSTSSESSTSSRR

≥

−−−−−=

−==

can be negative

If k>1, then

87

)/)1))((1/(()1/(

)/(

0...:

221,

210

kknRRknSSE

kSSRF

H

R

knk

k

−−−=−−

=

====

−−

βββ

can be negative

• N refers to the number of observations

• K refers to the number of parameters to be estimated, not counting the intercept

� is the “penalty factor”

MSEpT

T

T

e

pT

Ts

T

eMSE

T

tt

T

tt

−=

−=

=

∑

∑

=

=

1

2

2

1

2

pT

T

−

Useful way of discerning

alternative model specifications; for each specification,

calculate the associated model

selection statistics.

88

is the “penalty factor”

� Akaike Information Criterion (AIC)

� Schwarz or Bayesian Information Criterion(SIC) or (BIC)

� p is the number of parameters to be estimated� T is the number of observations

pT−

MSEeAIC T

p

=2

MSETBICSIC T

p

=)(

selection statistics. Then the smallest statistic yields the most appropriate

model specification.

Model 1 Model 2 Model 3

AIC 19.35 15.83 17.15

89

Which model to choose?

SIC 19.37 15.86 17.17

1. Ordinary regression coefficients

2. Beta or standardized regression coefficients

90

3. Elasticities

4. Partial correlation coefficients

kki*

2x

2i2*2

1x

1i1*1

Y

i

)XX(...

S

)XX(

S

)XX(

S

)YY(

−β++

−β+

−β=−

91

Interpretation: A beta coefficient of .25 means a 1 standard deviation change in the exogenous variable will lead to a .25 standard deviation chan ge in the dependent variable

Y

xjj

*j

xk

kki*k

S

Sˆˆ

S

)XX(...

β=β

−β++

Standardized regression coefficients

92

coefficientsInterpretation?

Elasticity (for a linear functional form):

� Interpretation: An elasticity of -.4 means a 1 percent change in the exogenous variable will lead to a -.4 percent change in the dependent variable.

Y

X

X

Y

Y

XE j

j

jjj

∂∂== β

93

� The elasticity corresponds to a measure of sensitivity of the percentage change in Y due to a one percent change in the explanatory variable Xj. This estimate of the elasticity of Y with respect to Xj is a function of , Xjt, and Yt. This estimate varies with the respective observations of Xjt, and Yt. Often, to come up with a single estimate of this elasticity, we employ the sample means.

jβ

VariableEstimated OLS

CoefficientMean Elasticity

PSHRIMP -0.03223 611.37 -2.8749

PFIN -0.02731 441.68 -1.7599

94

PSHELL1 0.01590 661.74 1.5351

ADSHRIMP -0.00101 46.33 -0.0068

ADFIN -0.01398 118.29 -0.2413

ADSHELL1 0.00848 11.49 0.0142

QSHRIMP ------ 6.85 ------

�Demand Estimation :�Dependent variable: Q, tile sales (in 1000 of cases)

�Right-hand side variables: tile price per case (p), income per capita I (in 1000 of $), and advertising expenditure A(in 1000 $) .

Q = ββββ0 + ββββ1P + ββββ2I + ββββ3A + uRegression:

95

� Interpretation :�β1 measures the effects of the tile price on the tile

consumption, holding all other factors fixed

�β2 represents the effects of income, holding all other factors fixed

�β3 represents the effects of advertising, holding all other factors fixed

(1) What is the impact of a price change on tile sales?

(2) What is the impact of a change in income on til e sales?

(3) What is the impact of a change in advertising

96

(3) What is the impact of a change in advertising expenditures on tile sales?

� Calculation of own-price elasticity?

� Calculation of income elasticity?

� Calculation of advertising elasticity?

� Suppose we wish to see whether or not the dependent variable and one independent variable are related after netting out the effect of any other independent variables in the model.

97

� Partial correlation coefficient between Yand Xj -- the effect of Xj on Y not accounted for by the other variables in the model.

Calculation:

(1) Eliminating the linear effect of the remaining X’s on Y

(2) Eliminate the linear effect of the remaining X’s on

(3) Let **

kk

1j1j1j1j22110j

kk

1j1j1j1j22110

XXX,YYY

Xˆ...

XˆXˆ...XˆXˆˆX

Xˆ...

XˆXˆ...XˆXˆˆY

−=−=

α++α+α++α+α+α=

α++α+α++α+α+α=

++−−

++−−

98

(3) LetCorr -partial correlation coefficient, -1,1

The partial correlation coefficient of .53 on varia ble implies that 28% (.53 2) of the variance in the dependent variable not accounted for by the other independent variables is accounted for by

jj*

j* XXX,YYY −=−=

)X,Y( *j

*

jX

jX

99

Circled entries are the squared partial correlation coefficients. The partial correlation coefficients are respectively:

PSHRIMP → -.7010 ADSHRIMP → -.0151

PFIN → -.2068 ADFIN → -.2932

PSHELL1 → .2272 ADSHELL1 → .0430

Variable Name

Estimated Coefficient

Partial Correlation

Standardized Coefficient

Elasticity at Means

PPAG -206.92 -0.796 -0.6017 -2.8388

PRAG 31.629 0.235 0.1309 0.4080

103

PRAG 31.629 0.235 0.1309 0.4080

TOTEXP 0.10728 0.737 0.6007 0.7090

CONSTANT 366.10

Goodness-of-fit Test (F-test)

Single Coefficients (t- or F-tests)

Linear Combinations of Coefficients (F-tests)

104

Nonlinear Combinations of Coefficients (χ2-tests)

Joint Tests of Several Regression Coefficients (F-tests)

Equality of Coefficients of Different Regressions (Chow Test) (F-test)

0 oneleast at : 0: 10 ≠= jHH ββ

ANOVA (Analysis of Variance) TableSource DF SS MS

(degrees of freedom) (sum of squares) (mean square)

105

1/

/1, −−

=−− knSSE

kSSRF knk

regressionresidualtotal 1

1/ˆ1

/ˆ 2

−

−−−−−

−

n

knSSEYXYYkn

kSSRYnYXk

TTT

TT

β

β

SSR → regression sum of squares

SSE → error sum of squares

N → # of observations

K → # of parameters to be estimated, not including the intercept

:

:

01

00

ββ

ββ

≠

=

H

H

j

j

106

A percent confidence interval for

)1( 100

)(ˆ

/)ˆ(

ˆ2/),1(

ˆ01

α

β

ββ

βα

β

−

±

−=

−−

−−

j

j

St

St

knj

jkn

jβ

� Estimation entails two classes, point and interval

� Point estimators yield a single numerical estimate

� But, the estimates we obtain usually do not have precisely the same value from sample to sample.

108

precisely the same value from sample to sample.

� Characterize estimates of structural parameters in terms of intervals, an explicit recognition that parameter estimates indeed are random variables.

� Construction of confidence intervals

� Another way to use classical statistical testing is to construct a confidence interval using the same critical value as was used for a two-sided test

� A (1 - αααα) % confidence interval is defined as

( ) percentile -1 theis where, ˆˆ

•± csec

αββ

109

� Depending on the sample size and the assumptions about the distribution of the error term, the critical value also can be constructed using the standard normal instead of the t distribution.

( )ondistributi ain

percentile 2

-1 theis where, ˆˆ

1−−

•±

kn

jj

t

csec ββ

111

� 95% confidence intervals for the OLS parameter esti mates

� t90, .025 = 1.987

� So, for PSHRIMP, parameter estimate = -.03223 and

standard error = .00346

� 95% CI = -.03223 ± .00346(1.987)

:

) :

01

000

kj

iii

kj

H

aH

γββ

γβγββ

≠+

==+ ∑(more generally,

112

)ˆ,ˆ(2)ˆ()ˆ(

)ˆ,ˆ(2)ˆ()ˆˆ(

)ˆˆ(

)ˆˆ(

201

kjkj

jijijiiia

kj

kjkn

COVVARVARS

COVaaVARaSS

t

kj

ii

ββββ

βββββ

γββ

ββ

β

++=

+=+

−+=

+

∑−− ∑ ∑ ∑

Reject H 0 : Test that the estimated coefficients of Pfin

and Pshell1 are jointly equal to zero.

Reject H 0 : Test that the estimated coefficients of Adfin and Adshell1 shell are jointly

equal to zero.

113

Reject H 0: Test that the estimated coefficients of

Pshrimp and Pshell1 are the same.

Reject H 0 : Test that the estimated coefficients of Pfin

and Pshell1 are the same.

Unrestricted Model (UR)

Restricted Model

ε+β++β+β+β= kk22110 X...XXY

0...:

...

10

22110

===

+++++=

+−

−−

H

XXXY

kqk

qkqk

ββεββββ

114

( )1/)1(

/

1/

/)(

2

22

1,

1,

−−−−=

−=−≥⇒

=−−

−=

≥

−−

−−

knR

qRRF

SSRSSRSSESSE

SSRSSR

SSTSST

knSSE

qSSESSEF

SSESSE

UR

RURknq

RURURR

RUR

RUR

UR

URRknq

URR

Tests involving the equality of coefficients of dif ferent regressions

Calculate

)1kN(df)N,...,1i(

X...XXY)1( ikiki22i110i

−−=ε+β++β+β+β=

)1kM(df)M,...,1J(

X...XXY)2( jkjkj22j110j

−−=

ε+σ++σ+σ+σ=

)22( −−++= kMNdfSSESSESSE

115

Calculate

Pooling Under

)22( 21 −−++= kMNdfSSESSESSEUR

kk2211000 ,...,,,:H σ=βσ=βσ=βσ=β

0H→

)22/(

1/)(

),...,11(

...

22,1

1121211101

−−++−=

+=+++++=

−−++ kMNSSE

kSSESSEF

SSE

MN

XXXY

UR

URRkMNk

R

kk εαααα

� Split the sample in two pieces at various break points.

� Reason for doing so due to the fact that the exact break point often is unknown.

� Basic test for structural change

� Make sure there exists a sufficient number of

117

� Make sure there exists a sufficient number of observations before and after a “potential” break point in order to conduct the Chow test.

� The break point specified on the Chow = (n) option is such that the first subsample consists of n – 1 observations and the second subsample starts from the n th observation and runs to the last observation of the entire sample.

� Sequential Chow Test:Split the sample in two pieces at every point

118

� Recursive Estimation:

Run a series of regressions by adding one observation per regression

N + M = 52K = 7

Sample 1 Sample 2

14 38 Reject H 0

119

K + 1

N + M – 2K – 2

14 38 Reject H 0

19 33 Reject H 0

24 28 Reject H 0

29 23 Reject H 0

N + K = 36K = 6

Sample 1 Sample 2

7 29Fail to

Dependent variable lnpcg

120

K + 1

N + M – 2K – 2

7 29Fail to Reject H 0

12 24 Reject H 0

21 15 Reject H 0

� RESET – Regression Specification Error Test:� Run additional regressions of the dependent variable on

the independent variables and on powers of included in the same regression

� The Reset Test is an F-test: Y

Adequacy of Model Specification

123

� The Reset Test is an F-test:� Test whether the coefficients on the variables are

(jointly) zero

Y

Y

Ramsey Reset Test indicates a problem

124

indicates a problem with model

specification

� Linear Functional Form

� Linear in Logs Functional Form

tktkt22t110t X...XXY ε+α++α+α+α=

Y

X iiαElasticity:

tktktltt vXbXbXbbY +++++= ln...lnlnln 2210

126

� Semi-Log Functional Form

� Semi-Log Functional Form

ibElasticity:

tktkt22t110t uXc...XcXccYln +++++=

ii XcElasticity:

tktkt22t110t wXlnd...XlndXlnddY +++++=Elasticity:

Y

di

� Box-Cox Transformation

� Linear Form

� Double Logarithmic Form (linear in logs functional

tkktk2t221t1100t )(X...)(X)(X)(y ε+λβ++λβ+λβ+β=λ

λ−λ /)1Z(Z transformed to

1... k210 =λ==λ=λ=λ

127

� Double Logarithmic Form (linear in logs functional form)

� Semi-Log Form

� Semi-Log Form

0... k210 =λ==λ=λ=λ

1...;0 k210 =λ==λ=λ=λ

0...;1 k210 =λ==λ=λ=λ

Linear Functional Form

128

Linear in Logs

Functional form

Double Logarithmic

129

Double Logarithmic

Functional Form

-0.3224 = own-price elasticity

0.1589 = cross-price elasticity

0.0651 = income elasticity

Semi-Logarithmic

Functional form

130

Semi-Logarithmic

Functional form

131

NABE EBA DECK 2

Documents

Transcript of NABE EBA DECK 2