Chapter 5 Multifactor Pricing Models 5.1Multifactor Pricing Models Motivation: Empirical evidences...

26
Chapter 5 Multifactor Pricing Models 5.1 Multifactor Pricing Models on: Empirical evidences show that the market β does no ly explain the cross section risk; More factors are ne the cross-sectional risk. -used factors: s of market portfolio; 149

Transcript of Chapter 5 Multifactor Pricing Models 5.1Multifactor Pricing Models Motivation: Empirical evidences...

Chapter 5

Multifactor Pricing Models

5.1 Multifactor Pricing Models

Motivation: Empirical evidences show that the market β does not

completely explain the cross section risk; More factors are needed to

explain the cross-sectional risk.

Commonly-used factors:

— proxies of market portfolio;149

150

— Firm characteristics: “size e ect”, “PE e ect”, “book-marketff ffratios”:

*** size e ect: di erence of returns on a portfolio with small capi-ff fftalization and high capitalization;

*** PE e ect: di erence of returns on a portfolio with small PE andff ffhigh PE;

— Macroeconomic and market variables: “maturity premium”

(yield spread between long and short rates), expected inflation,

unexpected inflation, industrial production growth, “default pre-

mium” (yield spread between high and low grade bonds).

Notation:

151

— R = (R1, · · · , RN )T is a vector of returns of N traded assets.

— f is the measurements (returns) related to K common factors.

The Models: The cross-sectional risk is modelled as

R = a + Bf + ε,

E(ε) = 0, Var(ε) = Σ.

For example, the return of the ith portfolio is

Rit = ai + bi1f1t + · · · + biKfKt + εit, t = 1, · · · , T.

Here, the factor sensitivity parameters (loadings) bij are an extension

of the market beta to the multi-factor model.

Statistical point of view: With more factors, the market portfo-

152

lio can be better approximated and pricing errors can be reduced.

Constraints: Financial economics theory puts theoretical constraints

on the intercepts ai. Here are some backgrounds.

Theoretical Background:

— derived by Ross (1976) using the Arbitrage Pricing Theory (APT). In the absence of arbitrage

in large economies, Ross (1976) showed that

µ = ER ≈ γ01 + BλK .

* γ0 is the model zero-beta parameter, equals to riskfree return if such an asset exists.

* λK is a vector of factor risk premia, e.g. λK = E(f − γ01) if tradable.

— derived by the Intertemporal CAPM (ICAPM) of Merton (1973), which shows

µ = γ01 + BλK .

— a generalization of CAPM.

153

Question:

— How to select factors?

— How to empirically test the above theory?

— How to use the multiple factor model to forecast the expected

return and volatility of a firm?

154

5.2 Estimation and Testing

Situations: We will mainly focus on the following 3 settings:

(a) Factors are portfolios of traded assets and γ0 is known.

(b) Factors are portfolios of traded assets and γ0 is unknown.

(c) Factors are not portfolios of traded assets (macroeconomic var).

Hypothesis: H0 : µ = γ01 + BλK, namely the intercepts

a = γ01 + B(Ef − γ01). (5.1)

MLR test: As in CAPM, the MLR test can be shown to have form

a

155

— d = # of restrictions imposed by the null hypothesis.

— the statistic has been scaled by (T − N/2 − K − 1) instead of T

in an hope to improve the finite sample performance;

— di erent situation gives di erent statistical models, which result inff ff

di erent d, Σ and Σff 0.

5.2.1 Portfolios as factors with a riskfree asset

Excess Returns:

Y t = Rt − rf,t1,

Xt = f t − rf,t1,

(Y t − Y¯ )(Xt − X)T (Xt − X)(Xt − X)T

a = Y¯ − BX,

156

where rf,t = risk-free rate at time t.

Models: Y t = a + BXt + εt.

The risk premium is λK = E(f − γ01). Hence, under the APT

and ICAPM, a = 0 by (5.1).

Hypothesis: H0 : a = 0.Note that

Cov(Y t, Xt) = BCov(Xt, Xt) = B = Cov(Y ⇒ t, Xt)var(Xt)−1.

and a = EY t − BEXt.

Substituting them by the empirical moments, we haveMLE:

B =T

t=1

¯T

t=1

¯ ¯−1

,

¯ εt = Y t − a − BXt, Σ = T −1T

t=1

εtεTt .

tXt ][

157

Remark: The coe cients affi i and (bi1, . . . bik) of the ith row of B

are just the regression coe cients obtained from the marginal models:ffiFit the multiple linear model for each given asset:

Yit = ai + bi1X1t + · · · + biKXKt + εit, t = 1, · · · , T.

This yields the residual vector εt = (ε1t, . . . , εN t) at each given time

period t, which captures the cross-section risk.

MLE under H0 : a = 0:

B0 = [T

t=1

Y TT

t=1

XtXTt ]−1,

Σ0 = T −1

T

t=1tεotεoT , εot = Y t − BXt.

¯ T −1[1 + X ΩK X]−1aT Σ−1a ∼0 FN,T −N −K .

158

Remark: The coe cients in Bffi 0 and residuals in εot can be obtained

via the marginal model: For each given i, fit without the intercept

term of the following model:

Yit = bi1X1t + · · · + bikXKt + εit.

a

Exact test: Let ΩK = Var(Xt) be the covariance structure of Kfactors and ΩK be its sample covariance. Then, the MLR test isequivalent to the Wald test

T1 =T − N − K

NH

This null distribution is exact rather than the approximate one if

ε N (∼ 0, Σ). This is an extension of the single-factor model.

159

Example 1. Fama and French (1993) consider the following factors

(K = 2, 3 and 5 factors):

1. di erence of returns between large and small capitalization;ff

2. di erence of returns between high and low book-to-market ratios;ff

3. CRSP value-weighted stock index;

4. a term structure factor (yield spread between long and short bonds);

5. a default risk (yield spread high and low grade bonds)

They use 25 stock portfolios and 7 bond portfolios, namely N = 32.

The stock portfolios are created using a two way sort based on the

market capitalization and book-to-equity ratio. The bond portfolios

160

include five US government and two corporate bound portfolios. The

period of study is 1963/07–1991/12, namely T = 354. The P-values

are summarized as follows based on the test statistic T1.

Table 5.1: Summary of testing results using the exact F-test

K 2 3 5

P-value 0.010 0.039 0.025

Fama and Fama find

— some improvement going from two factors to five factors;

— three factors are necessary when testing portfolio consisting only

of stocks;

— five factors when bond portfolio is included.

161

Remarks:∗

1. In the multi-factor model

Y t = a + BY Kt + εt,

when a linear combination of Y Kt forms the tangency portfolio, a = 0. Thus, multi-factor

model is an e ort to expand the model so that the tangency portfolio is better approximated.ff

2. When K is too large and T is too small, it is easy to overfit the model. Often, K ≈ 5.

3. For a subset of traded assets, one can also define the tangency portfolio within this subset. The

intercepts of regressing the returns of this subset of asset are 0. Thus, if the tested assets have

high weights in one of factors, its intercept will be zero (artifact of factor selection).

4. The Sharpe ratio increases as the subset of trade assets increase.

5.2.2 Portfolios as factors without a riskfree asset.

162

This is an extension of the Black version of CAPM.

Model: Rt = a + BRKt + ε.

— Rt is the return of N assets —RKt is the returns of K factors,

which are a traded portfolio.

MLE: The same as the last section, as the models are identical. The

marginal model technique continues to apply.

Null hypothesis: ERt − 1γ0 = B(ERKt − 1γ0).

Constrained Model: Under the null hypothesis,

Rt = 1γ0 + B(RKt − 1γ0) + εt

= (1 − B1)γ0 + BRKt + εt.

Constrained MLE: Given γ0, the situation is the same as 5.2.1

T Σ−1(1−B01) −1 (1−B01)T Σ−1(R−B0RK) ,

163

and B0 and Σ0 can be easily be obtained. Now, given the estimated

B0 and Σ0, the coe cient γffi 0 is determined by

γ0 = (1−B01) 0 0

¯ ¯

Algorithm: Starting from unconstrained model, iterate the above

equations once, and obtain a one-step estimate. This is as good as

the fully iterative procedure, since B and Σ are root-T consistent.

5.2.3 Macroeconomic variables as factors∗

Example: innovations in GNP, changes in bond yields, unanticipated inflation. These factors are

observable, but not traded. So the risk premia is unknown.

Model: Rt = a + Bf Kt + εt.

164

Since the risk premia λK are not the same as Ef K −γ01, they are treated as unknown parameters.

From APT and ICAMP, we have under the null hypothesis

µ = a + BEf K = γ01 + BλK .

Constrained Model: Letting γ1 = λK − Ef K , we have

a = γ01 + Bγ1.

Comparison: K extra parameters under H0. The same method as that in 4.2.2 can be used to

obtain the MLE under H0.

MLE under the full model: The same as before, since f Kt is observable

MLR: The maximum likelihood ratio continues to apply.

λk = X = T −1

165

5.3 Estimation of risk premia and expected returns

¯

Expected returns: µ = γ01 + BλK.

The risk premia for three situations can be estimated as follows.

The estimated covariance matrix is also attached.

Trade portfolios as factors with riskless rate rf :

T

Xt, Var(λK) = T −1ΩK.

λK = T −1

t=1

Trade portfolios as factors without rf :

T

t=1RKt − γ01, Var(λK) = T −1ΩK + Var(γ0)11T .

166

λK = T −1

Macroeconomic var. as factors∗: Since λK = Ef K + γ1,T

t=1f Kt + γ1.

5.4 Applications of multifactor models

Two important applications: Forecasting the return of a firm

and to estimate the covariance matrix of assets. For simplicity, we

consider the model in §5.2.1.

Forecast return: For any asset, its expected return is

r¯f + β1(¯rF,1 − r¯f ) + · · · + βK(¯rF,K − r¯f ),

where r¯f is the average riskfree rate, r¯F,i is the average return of the

167

factor, and βi is the associated beta with the i-th factor.

Covariance matrix: Covariance matrix is important for asset allo-

cation and portifolio and risk management. However, large covariance

matrices are hard to be estimated with good accuracy. From the mul-

tifactor model,

var(Y t) = var(Rt) = Bvar(Xt)BT + var(ε).

Idea: If we assume that the factors capture the cross-section risk, we

may assume that var(ε) is a diagnonal matrix. Note that var(Xt) is

a low-dimensional matrix (e.g. 3 × 3 if we use a three factor model).

Estimation. Based on the multiple regression, we can obtain an

estimate of B and residual variance and hence var(ε). The low-

168

dimensional matrix var(Xt) can be estimated by the sample covari-

ance matrix.

5.5 Selection of Factors

Two approaches:

— Economics Theory:

(a) Characteristics of Firms: “Size e ect”, “ PE”, “book-marketffratio”, “ PS”.

(b) macroeconomic and financial market variables: maturity pre-

mium, expected unexpected inflation, industrial production

growth, default premium.

= B B + Σ, B = ΩK B.

169

— Statistical Methods: Motivated by the APT, build factors from a

comprehensive set of asset returns (much larger than the test sets),

e.g. factor analysis, principal analysis

5.5.1 Factor analysis∗

Factor analysis: Assume that K factors account for all covariance of asset returns and hence Σ

is diagonal — strict factor structure, imposed also by Ross in his development of APT.

Model: Rt = a + Bf t + εt, Var(εt|f t) = Σ.

⇒ Var(Rt) = BT ΩK B + Σ,

∗T ∗

ΩK = Var(f t),1

∗ 2

Identifiability: B can be identified only up to a K × K nonsingular matrix. B ∗ is unique up to

an orthogonal transform.

MLE: If εt is normal, then B ∗ and Σ can be estimated from Rt, t = 1, . . . , T .

we have a = ERt and a = R. By generalized least-squares,

¯f t = (B Σ−1B)−1B Σ−1(Rt − R)

≡ W (Rt − R).

170

Estimation of factors: WOLG, Ef t = 0. From

Rt = a + Bf t + εt,

¯

T T

¯

Renormalization: (optional) Factor j is taken as

fjt =n

i=1

wjiRit/n

i=1

wji.

This di ers from the above f only by a linear scale. These factors can be used directly to fit aff

multi-factor model.

Interpretations: The weights wj is the same as solving

wj

s.t.wTj bk = 0, ∀ k = j, wTj bj = 1,

i.e. minimizing the residual variance, subject to orthogonality constraint.

171

Proof: Let ej be the K-dimensional unit vector with one at the jth position. Then, the side

condition can be written asT

B wj = ej.

By the Lagrange multiplier method, we minimize

T

Taking derivative and setting it to zero

Σwj − Bλ = 0 or wj = Σ−1Bλ.

Now, using the constraintT T

T

T

T T

The converse follow obvious from the above derivation:

TB wj = ej.

172

5.5.2

5.6

Principal component analysis∗

Empirical results

Lehmann and Modest (1988): (Statistical method)

— daily data to build factors using factor analysis

— weekly interval for testing

Connor and Korajczyk (1988):

— use principal component on the sample covariance of Rt to

obtain factors

— account for January e ects (intercept of January is 0) and non Jan.ff

173

Test intervals:

— CK works with monthly data

— CK and LM are based on tests from four five-year periods, aggre-

gated together.

Results:

CK and LM find little sensitivity to increasing the number of factors

beyond five.

174

Table 5.2: Summary of results for tests of exact factor pricing using zero-intercept F-test

Time period

64:01-83:12

63:01-82:12

63:01-82:12

63:01-82:12

Portfolio characteristic

market value of equity

market value of equity

dividend yield

own variance

Study

CK

CK

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

LM

N

10

10

5

5

5

20

20

20

5

5

5

20

20

20

5

5

5

20

20

20

K

5

10

5

10

15

5

10

15

5

10

15

5

10

15

5

10

15

5

10

15

p-value

0.002

0.002

**

**

**

0.11

0.14

0.42

0.17

0.18

0.17

0.94

0.97

0.98

0.29

0.57

0.55

0.83

0.97

0.98

** Less than 0.001.