Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation...

10
Pricing Kernels Pricing kernels or stochastic discount factors (SDFs) (see Stochastic Discount Factors) are used to rep- resent valuation operators in dynamic stochastic economies. A kernel is a commonly used mathe- matical term for representing an operator. The term stochastic discount factor (SDF) extends concepts from economics and finance to include adjustments for risk. As we will see, there is a close connection between the two terms. The terms pricing kernel and SDF are often used interchangeably. After deriving convenient representations for prices, we provide several examples of stochastic discount factors and discuss econometric methods for estimating and testing asset pricing models that restrict stochastic discount factors. Representing Prices We follow Ross [49] and Harrison and Kreps [34] by exploring the implications of no arbitrage in fric- tionless markets to obtain convenient representations of pricing operators. We build these operators as mappings that assign prices that trade in competi- tive markets to payoffs on portfolios of assets. The payoffs specify how much a numeraire good is pro- vided in alternative states of the world. To write these operators, we invoke the law of one price, which stip- ulates that any two assets with the same payoff must necessarily have the same price. This law is typi- cally implied by but is weaker than the principle of no arbitrage (see Fundamental Theorem of Asset Pricing). Formally, the principle of no arbitrage stip- ulates that nonnegative payoffs that are positive with positive (conditional) probability command a strictly positive price. To see why the principle implies the law of one price, suppose that there is such a non- negative portfolio payoff and call this portfolio a. If two portfolios, say b and c have the same payoff but the price of portfolio b is less than that of portfolio c, then an arbitrage opportunity exists. This can be seen by taking a long position on portfolio b, a short posi- tion on portfolio c, and using the proceeds to purchase portfolio a. This newly constructed portfolio has zero price but a positive payoff, violating the principle of no arbitrage. Given that we can assign prices to payoffs, let π t,t +1 be the date t valuation operator for payoffs (consumption claims) at date t + 1. This operator assigns prices or values to portfolio payoffs p t +1 in a space P t +1 suitably restricted. We extend Debreu’s [17] notion of a valuation functional to a valuation operator to allow for portfolio prices to depend on date t information. With this in mind, let F t be the sigma algebra used to depict the information available to all investors at date t . We use a representation of π t,t +1 of the following form: π t,t +1 (p t +1 ) = E (s t +1 p t +1 |F t ) (1) for some positive random variable s t +1 with proba- bility one and any admissible payoff p t +1 P t +1 on a portfolio of assets. Definition 1 The positive random variable s t +1 is the pricing kernel of the valuation operator π t ,t +1 , provided that representation (1) is valid. If a kernel representation exists, the valuation oper- ator π t,t +1 necessarily satisfies the principle of no arbitrage. How does one construct a representation of this type? There are alternative ways to achieve this. First, suppose that (i) P t +1 consists of all random variables that are F t +1 measurable and have finite conditional (on F t ) second moments, (ii) π t,t +1 is linear condi- tioned on F t , (iii) π t,t +1 also satisfies a conditional continuity restriction, and (iv) the principle of no arbitrage is satisfied. The existence of a kernel s t +1 follows from a conditional version of the Riesz rep- resentation theorem [30]. Second, suppose that (i) P t +1 consists of all bounded random variables that are F t +1 measurable, (ii) π t,t +1 is linear conditioned on F t , (iii) E(π t,t +1 ) induces an absolutely continuous measure on F t +1 , and (iv) the principle of no arbi- trage is satisfied. We can apply the Radon–Nikodym theorem to justify the existence of a kernel. In both these cases, security markets are complete in that pay- offs of any indicator function for events in F t +1 are included in the domain of the operator. a Kernels can be constructed for other mathematical restrictions on asset payoffs and prices as well. As featured by Harrison and Kreps [34] and Hansen and Richard [30], suppose that P t is not so richly specified. Under the (conditional) Hilbert space formulation and appropriate restrictions on P t , we may still apply a (conditional) version of the Riesz

Transcript of Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation...

Page 1: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

Pricing Kernels

Pricing kernels or stochastic discount factors (SDFs)(see Stochastic Discount Factors) are used to rep-resent valuation operators in dynamic stochasticeconomies. A kernel is a commonly used mathe-matical term for representing an operator. The termstochastic discount factor (SDF) extends conceptsfrom economics and finance to include adjustmentsfor risk. As we will see, there is a close connectionbetween the two terms. The terms pricing kernel andSDF are often used interchangeably.

After deriving convenient representations forprices, we provide several examples of stochasticdiscount factors and discuss econometric methodsfor estimating and testing asset pricing models thatrestrict stochastic discount factors.

Representing Prices

We follow Ross [49] and Harrison and Kreps [34]by exploring the implications of no arbitrage in fric-tionless markets to obtain convenient representationsof pricing operators. We build these operators asmappings that assign prices that trade in competi-tive markets to payoffs on portfolios of assets. Thepayoffs specify how much a numeraire good is pro-vided in alternative states of the world. To write theseoperators, we invoke the law of one price, which stip-ulates that any two assets with the same payoff mustnecessarily have the same price. This law is typi-cally implied by but is weaker than the principle ofno arbitrage (see Fundamental Theorem of AssetPricing). Formally, the principle of no arbitrage stip-ulates that nonnegative payoffs that are positive withpositive (conditional) probability command a strictlypositive price. To see why the principle implies thelaw of one price, suppose that there is such a non-negative portfolio payoff and call this portfolio a. Iftwo portfolios, say b and c have the same payoff butthe price of portfolio b is less than that of portfolio c,then an arbitrage opportunity exists. This can be seenby taking a long position on portfolio b, a short posi-tion on portfolio c, and using the proceeds to purchaseportfolio a. This newly constructed portfolio has zeroprice but a positive payoff, violating the principle ofno arbitrage.

Given that we can assign prices to payoffs, letπt,t+1 be the date t valuation operator for payoffs(consumption claims) at date t + 1. This operatorassigns prices or values to portfolio payoffs pt+1 ina space Pt+1 suitably restricted. We extend Debreu’s[17] notion of a valuation functional to a valuationoperator to allow for portfolio prices to depend ondate t information. With this in mind, let Ft be thesigma algebra used to depict the information availableto all investors at date t . We use a representation ofπt,t+1 of the following form:

πt,t+1(pt+1) = E (st+1pt+1|Ft ) (1)

for some positive random variable st+1 with proba-bility one and any admissible payoff pt+1 ∈ Pt+1 ona portfolio of assets.

Definition 1 The positive random variable st+1 isthe pricing kernel of the valuation operator πt ,t+1,provided that representation (1) is valid.

If a kernel representation exists, the valuation oper-ator πt,t+1 necessarily satisfies the principle of noarbitrage.

How does one construct a representation of thistype? There are alternative ways to achieve this. First,suppose that (i) Pt+1 consists of all random variablesthat are Ft+1 measurable and have finite conditional(on Ft ) second moments, (ii) πt,t+1 is linear condi-tioned on Ft , (iii) πt,t+1 also satisfies a conditionalcontinuity restriction, and (iv) the principle of noarbitrage is satisfied. The existence of a kernel st+1

follows from a conditional version of the Riesz rep-resentation theorem [30]. Second, suppose that (i)Pt+1 consists of all bounded random variables that areFt+1 measurable, (ii) πt,t+1 is linear conditioned onFt , (iii) E(πt,t+1) induces an absolutely continuousmeasure on Ft+1, and (iv) the principle of no arbi-trage is satisfied. We can apply the Radon–Nikodymtheorem to justify the existence of a kernel. In boththese cases, security markets are complete in that pay-offs of any indicator function for events in Ft+1 areincluded in the domain of the operator.a Kernels canbe constructed for other mathematical restrictions onasset payoffs and prices as well.

As featured by Harrison and Kreps [34] andHansen and Richard [30], suppose that Pt is not sorichly specified. Under the (conditional) Hilbert spaceformulation and appropriate restrictions on Pt , wemay still apply a (conditional) version of the Riesz

Page 2: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

2 Pricing Kernels

representation theorem to represent

πt,t+1(pt+1) = E (qt+1pt+1|Ft ) (2)

for some qt+1 ∈ Pt+1 and all pt+1 ∈ Pt+1. Evenif the principle of no arbitrage is satisfied, thereis no guarantee that the resulting qt+1 is positivewith probability 1, however. Provided, however,that we can extend Pt+1 to a larger space thatincludes indicator functions for events in Ft+1 whilepreserving the principle of no arbitrage, when thereexists a pricing kernel st+1 for πt,t+1. However, thepricing kernel may not be unique.

Stochastic Discounting

The random variable st+1 is also called the one-periodstochastic discount factor. Its multiperiod counterpartis as follows.

Definition 2 The stochastic discount process {St+1 :t = 1, 2, . . .} is

St+1 =t+1∏j=1

sj (3)

where sj is the pricing kernel used to represent thevaluation operator between dates j − 1 and j .

Thus, the t + 1 period SDF compounds the cor-responding one-period SDFs. The compounding isjustified by the law of iterated values when trading isallowed at intermediate dates. As a consequence,

π0,t+1(pt+1) = E (St+1pt+1|F0) (4)

gives the date zero price of a security that pays pt+1

in the numeraire at date t + 1. An SDF discountsthe future in a manner that depends on futureoutcomes. This outcome dependence is included inorder to make adjustments for risk. However, suchadjustments are unnecessary for a discount bondbecause such a bond has a payoff that is equal toone independent of the state realized at date t + 1.The price of a date t + 1 discount bond is obtainedfrom formula (4) by letting pt+1 = 1 and hence isgiven by E

[St+1|F0

].

More generally, the date τ price of pt+1 is givenas

πτ,t+1(pt+1) = E

[(St+1

)pt+1|Fτ

](5)

for τ ≤ t . Thus, the ratio St+1

Sτis the pricing kernel for

the valuation operator πτ,t+1.

Risk-neutral Probabilities

Given a one-period pricing kernel, we build the so-called risk-neutral probability measure recursively asfollows. First, rewrite the one-period pricing operatoras

πt,t+1(pt+1) = E (st+1|Ft ) E (pt+1|Ft ) (6)

where

E (pt+1|Ft ) = E

([st+1

E (st+1|Ft )

]pt+1|Ft

)(7)

Thus, one-period pricing is conveniently summarizedby a one-period discount factor for a riskless payoff,E (st+1|Ft ), and a change of the conditional proba-bility measure implied by the conditional expectationoperator E (·|Ft ). Risk adjustments are now absorbedinto the change of probability measure.

Using this construction repeatedly, we decomposethe date t + 1 component of the SDF process:

St+1 = St+1Mt+1 (8)

where

St+1 =t+1∏

j=1

E(sj |Fj−1

) (9)

is the discount factor constructed from the sequenceof one-period riskless discount factors and

Mt+1 =t+1∏j=1

sj

E(sj |Fj−1

) = St+1

St+1(10)

is the positive martingale adapted to {Ft : t =0, 1, . . .} with expectation unity. For each date t , Mt

can be used to assign · probabilities to events inFt . Since E (Mt+1|Ft ) = Mt , the date t + 1 assign-ment of probabilities to events in Ft+1 is compatiblewith the date t assignment to events in Ft ⊂ Ft+1.This follows from the law of iterated expectations. Ineffect, the law of iterated expectations enforces thelaw of iterated values.

Applying factorization (8), we have an alternativeway to represent πτ,t+1:

πτ,t+1(pt+1) = E[St+1pt+1|Ft

](11)

Page 3: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

Pricing Kernels 3

Pricing is reduced to riskless discounting and adistorted (risk-neutral) conditional expectation. Theinsight provided in [34, 49] is that dynamic assetpricing in the absence of arbitrage is captured bythe existence of a positive (with probability one)martingale that can be used to represent prices inconjunction with a sequence of one-period risklessdiscount factors. Moreover, it justifies an arguablymild modification of the “efficient market hypothe-sis” that states that discounted prices should behaveas martingales with the appropriate cash-flow adjust-ment. While Rubinstein [50] and Lucas [42] hadclearly shown that efficiency should not preclude riskcompensation, the notion of equivalent martingalemeasures reconciles the points of view under greatergenerality. The martingale property and associated“risk-neutral pricing” are recovered for some distor-tion of the historical probability measure that encap-sulates risk compensation. This distortion preserves“equivalence” (the two probability measures agreeabout which events in Ft are assigned probabilityzero measure for each finite t) by ensuring the exis-tence of a strictly positive SDF.

The concept of equivalent martingale measure(for each t) has been tremendously influential inderivative asset pricing. The existence of such ameasure allows risk-neutral pricing of all contingentclaims that are attainable because their payoff canbe perfectly duplicated by self-financing strategies.Basic results from probability theory are directlyexploitable in characterizing asset pricing in anarbitrage-free environment. More details can be foundin Econometrics of Option Pricing.

Although we have developed this discussionfor a discrete-time environment, there are well-known continuous-time extensions. In the case ofcontinuous-time Brownian motion information struc-tures, the change of measure has a particularly simplestructure. In accordance to the Girsanov theorem,the Brownian motion under the original probabil-ity measure becomes a Brownian motion with driftunder the risk-neutral measure. These drifts absorbthe adjustments for exposure to Brownian motionrisk.

Investors’ Preferences

Economic models show how exposure to risk ispriced and what determines riskless rates of inter-est. As featured by Ross [49], the risk exposure that

is priced is the risk that cannot be diversified byaveraging through the construction of portfolios oftraded securities. Typical examples include macroe-conomic shocks. Empirical macroeconomics seeks toidentify macroeconomic shocks to quantify responsesof macroeconomic aggregates to those shocks. Assetpricing models assign prices to exposure of cashflows to the identified shocks. The risk prices areencoded in SDF processes and hence are implicit inthe risk-neutral probability measures used in financialengineering.

SDFs implied by specific economic models oftenreflect investor preferences. Subjective rates of dis-count and intertemporal elasticities appear in formu-las for risk-free interest rates, and investors’ aversionto risk appears in formulas for the prices assigned toalternative risk exposures. Sometimes the reflectionof investor preferences is direct and sometimes it isaltered by the presence of market frictions. In whatfollows, we illustrate briefly some of the SDFs thathave been derived in the literature.

Power Utility

When there are no market frictions and markets arecomplete, investors’ preferences can be subsumedinto a utility function of a representative agent. Inwhat follows, suppose that the representative investorhas discounted time-separable utility with a constantelasticity of substitution (CES) 1/ρ:

St+1

St

= exp(−δ)

(Ct+1

Ct

)−ρ

(12)

This is the SDF for the power utility specification ofthe model of Rubinstein [50] and Lucas [42].b

Recursive Utility

Following Epstein and Zin [18], Kreps and Porteus[41] and Weil [53], preferences are specified recur-sively using continuation values for hypothetical con-sumption processes. Using a double CES recursion,the resulting SDF is

St+1

St

= exp(−δ)

(Ct+1

Ct

)−ρ [Ut+1

Rt (Ut+1)

]ρ−γ

(13)

where γ > 0, ρ > 0, Ut is the continuation valueassociated with current and future consumption, and

Rt (Ut+1).= (

E[(Ut+1)

1−γ |Ft

])1/(1−γ )(14)

Page 4: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

4 Pricing Kernels

is the risk-adjusted continuation value. The parameterρ continues to govern the elasticity of substitution,whereas the parameter γ alters the risk preferences.See [25, 31, 52] for a discussion of the use ofcontinuation values in representing the SDF and see[11, 25] for discussions of empirical implications.When ρ = 1, the recursive utility model coincideswith a model in which investors have a concern aboutrobustness as in [6].

The recursive utility model is just one of avariety of ways of altering investors’ preferences.For instance, Constantinides [15] and Heaton [36]explore implications of introducing intertemporalcomplementarities in the form of “habit persistence”.

Consumption Externalities and Reference Utility

Abel [1], Campbell and Cochrane [12] and Menzlyet al. [45] develop asset pricing implications ofmodels in which there are consumption externalitiesin models with stochastic consumption growth. Theseexternalities can depend on a social stock of pastconsumptions. The implied one-period SDF in thesemodels has the following form:

St+1

St

= exp(−δ)

(Ct+1

Ct

)−ρφ(Ht)

φ(H0)(15)

The social stock of consumption is built as a pos-sibly nonlinear function of current and past socialconsumption or innovations to social consumption.The construction of this stock differs across the vari-ous specifications. The process {Ht : t = 0, 1, . . .} isthe ratio of the consumption to the social stock andφ is an appropriately specified function of this ratio.In a related approach, Garcia et al. [21] considerinvestors’ preferences in consumption as being eval-uated relative to a reference level that is determinedexternally. The SDF is

St+1

St

= exp(−δ)

(Ct+1

Ct

)−ρ (Ht

H0

(16)

where the process {Ht : t = 0, 1, . . .} is the ratio ofthe consumption to the reference level. In effect, thesocial externality in these models induces a pref-erence shock or wedge to the SDF specification,which explicitly depends on the equilibrium aggre-gate consumption process. Finally, Bakshi and Chen[8] develop implications for a model in which relativewealth enters the utility function of investors.

Incomplete Markets

Suppose that individuals face private shocks thatthey cannot insure against but that they can writecomplete contracts over aggregate shocks. Let Ft+1

denote the date t + 1 sigma algebra generated byaggregate shocks available up until date t + 1. Thisconditioning information set determines the type ofrisk exposure that can be traded as of date t + 1.Under this partial risk sharing and power utility, theSDF for pricing aggregate uncertainty is

St+1

St

= exp(−δ)

E

[(C

j

t+1)−ρ |Ft+1

]E

[(C

jt )−ρ |Ft

]

= exp(−δ)

(Ca

t+1

Cat

)−ρ

×E

[(C

j

t+1/Cat+1)

−ρ |Ft+1

]E

[(C

jt /Ca

t )−ρ |Ft

] (17)

In this example and those that follow, Cj

t+1 is con-sumption for individual j and Ca

t+1 is aggregateconsumption. Characterization (17) of the SDF dis-plays the pricing implications of limited risk sharingin security markets. It is satisfied, for instance, in themodel of Constantinides and Duffie [16].

Private Information

Suppose that individuals have private informationabout labor productivity that is conditionally inde-pendent, given aggregate information, leisure enterspreferences in a manner that is additively separableand consumption allocations are Pareto optimal giventhe private information. As shown by Kocherlakotaand Pistaferri [40], the SDF follows from the “inverseEuler equation” of [40, 47],

St+1

St

= exp(−δ)

E

[(C

jt )ρ |Ft

]E

[(C

j

t+1)ρ |Ft+1

]

= exp(−δ)

(Ca

t+1

Cat

)−ρ

× E

[(C

jt /Ca

t )ρ |Ft

]E

[(C

j

t+1/Cat+1)

ρ |Ft+1

] (18)

Page 5: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

Pricing Kernels 5

where Ft is generated by the public information. Asemphasized by Rogerson [47], this is a model witha form of “savings constraints”. While the stochasticdiscount factor for the incomplete information modelis expressed in terms of the (−ρ)th moments of thecross-sectional distributions of consumption in adja-cent time periods Kocherlakota and Pistaferri [40]show that in the private information model it is theρth moments of these same distributions.

Solvency Constraints

Luttmer [43], He and Modest [35], and Cochrane andHansen [14] study asset pricing implications in mod-els with limits imposed on the state contingent debtthat is allowed. Alvarez and Jermann [4] motivatesuch constraints by appealing to limited commit-ment as in [38, 39]. When investors default, theyare punished by excluding participation in asset mar-kets in the future. Chien and Lustig [13] explore theconsequences of alternative (out of equilibrium) pun-ishments. Following [43], the SDF in the presence ofsolvency constraints and power utility is

St+1

St

= exp(−δ)

(min

j

Cj

t+1

Cjt

)−ρ

(19)

and in particular,

St+1

St

≥ exp(−δ)

(Ca

t+1

Cat

)−ρ

(20)

Thus, the consumer with the smallest realized growthrate in consumption has a zero Lagrange multiplieron his or her solvency constraint, and hence theintertemporal marginal rate of substitution for thisperson is equal to the SDF. For the other consumers,the binding constraint prevents them shifting con-sumption from the future to the current period.

Long-term Risk

The SDF process assigns prices to risk exposures atalternative investment horizons. To study pricing overthese horizons, Alvarez and Jermann [5], Hansen andScheinkman [32], and Hansen [24] use a Markovstructure and apply factorizations of the followingform:

St+1 = exp(−ηt)Mt+1f (X0)

f (Xt+1)(21)

where {Mt : t = 0, 1, . . .} is a multiplicative martin-gale, η is a positive number, {Xt : t = 0, 1, . . .} is anunderlying Markov process, and f is a positive func-tion of the Markov state. The martingale is used as aconvenient change of probability, one that is distinctfrom the “risk-neutral” measure described previously.Using this change of measure, asset prices can bedepicted as

π0,t+1 (pt+1) = exp [−η(t + 1)]

× E

[pt+1

f (Xt+1)|X0

]f (X0) (22)

where · is the conditional expectation that is built withthe martingale {Mt : t ≥ 0} in (21). The additionaldiscounting is now constant and simple expectationscan now be computed by exploiting the Markov struc-ture. Hansen and Scheinkman [32] give necessary andsufficient conditions for the martingale in factoriza-tion (21) to imply a change of probability measurewith stable stochastic dynamics.c While Alvarez andJermann [5] use this factorization to investigate thelong-term links between the bond market and themacroeconomy, Hansen and Scheinkman [32] andHansen [24] extend this factorization to study thevaluation of cash flows that grow stochastically overtime. As argued by Hansen and Scheinkman [32]and Hansen [24], these more general factorizationsare valuable for the study of risk-return trade-offs forlong investment horizons.

As argued by Bansal and Lehmann [9], manyalterations to the power utility model in the sectionPower Utility can be represented as

S∗t+1

S∗t

=(

St+1

St

) [f (Xt+1)

f (Xt)

](23)

for a positive function f . Transient components inasset pricing models are included to produce short-term alterations in asset prices and are expressed asthe ratio of a function of the Markov state in adjacentdates. As shown by Bansal and Lehmann [9], thisrepresentation arises in models with habit persistence,or as shown in [24] the same is true for a limitingversion of the recursive utility model. Combiningequation (23) with equation (21) gives

S∗t+1 = exp(−η)Mt+1

f ∗(X0)

f ∗(Xt+1)(24)

where f ∗ = f /f.

Page 6: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

6 Pricing Kernels

Inferring Stochastic Discount Factorsfrom Data

Typically a finite number of asset payoffs are usedin econometric practice. In addition, the informationused in an econometric investigation may be lessthan that used by investors. With this in mind, letYt+1 denote an n-dimensional vector of asset payoffsobserved by the econometrician such that

E(|Yt+1|2|Gt

)< ∞ (25)

with E(Yt+1Yt+1

′|Gt

)nonsingular with probability 1

and Gt ⊂ Ft . Let Qt denote the corresponding pricevector that is measurable with respect to Gt , implyingthat

E (st+1Yt+1|Gt ) = Qt (26)

where st+1 = St+1/St . We may construct a counter-part to a (one-period) SDF by forming

p∗t+1 = Yt+1

′ [E (Yt+1Yt+1

′|Gt

)]−1Qt (27)

Note thatE

(p∗

t+1Yt+1

∣∣Gt ) = Qt (28)

suggesting that we could just replace st+1 in equa-tion (26) with p∗

t+1. We refer to p∗t+1 as a counterpart

to an SDF because we have not restricted p∗t+1 to be

positive.This construction is a special case of the represen-

tation of the prices implied by a conditional versionof the Riesz representation theorem [30]. Since p∗

t+1is not guaranteed to be positive, if we used it to assignprices to derivative claims (nonlinear functions ofYt+1), we might induce arbitrage opportunities. Nev-ertheless, provided that st+1 has a finite conditionalsecond moment,

E[(

st+1 − p∗t+1

)Yt+1|Gt

] = 0 (29)

This orthogonality indicates that p∗t+1 is the con-

ditional least-squares projection of st+1 onto Yt+1.Although a limited set of asset price data will notreveal st+1, the data can provide information aboutthe date t + 1 kernel for pricing over a unit timeinterval.

Suppose that Yt+1 contains a conditionally risklesspayoff. Then

E (st+1|Gt ) = E(p∗

t+1|Gt

)(30)

By a standard least-squares argument, the conditionalvolatility of st+1 must be at least as large as theconditional volatility of p∗

t+1. There are a variety ofother restrictions that can be derived. For instance,see [9, 28, 51].d

This construction has a direct extension to thecase in which a complete set of contracts can bewritten over the derivative claims. Let Ht+1 be theset of all payoffs that have finite second momentsconditioned on Gt and are of the form ht+1 = φ(Yt+1)

for some Borel measurable function φ. Then, we mayobtain a kernel representation for pricing claims withpayoffs in Ht+1 by applying the Riesz representationtheorem:

πt(ht+1) = E(h∗

t+1ht+1|Gt

)(31)

for some h∗t+1 in Ht+1. Then

E[(

st+1 − h∗t+1

)ht+1|Gt

) = 0 (32)

which shows that the conditional least-squares projec-tion of st+1 onto Ht+1 is h∗

t+1. In this case, h∗t+1 will

be positive. Thus, this richer collection of observedtradable assets implies a more refined characterizationof st+1 including the possibility that st+1 = h∗

t+1, pro-vided Ht+1 coincides with the full collection of one-period payoffs that could be traded by investors. Theestimation procedures of [3, 48] can be interpreted asestimating h∗

t+1 projected on the return information.

Linear Asset Pricing Models

The long history of linear beta pricing can be fruit-fully revisited within the SDF framework. Supposethat

Qt = E[(λt · zt+1 + αt ) Yt+1|Gt

](33)

for some vector λt and some scalar αt that are Gt

measurable. Then

E[Yt+1|Gt

] = α∗t Qt + λ∗

t · cov(Yt+1, zt+1|Gt ) (34)

where

α∗t = 1

αt + λt · E(zt+1|Gt )

λ∗t = − λtα

∗t (35)

Page 7: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

Pricing Kernels 7

which produces the familiar result that risk compen-sation expressed in terms of the conditional meandiscrepancy: E

[Yt+1|Gt

] − α∗t Qt depends on the

conditional covariances with the factors. When theentries of zt+1 are standardized to a have conditionalvariances equal to unity, the entries of λ∗

t become theconditional regression coefficients, the “betas” of theasset payoffs onto the alternative observable factors.When zt+1 is the scalar market return, this specifica-tion gives the familiar CAPM from empirical finance.When zt+1 is augmented to include the payoff on aportfolio designed to capture risk associated with size(market capitalization) and the payoff on a portfoliodesigned to capture risk associated with book to mar-ket equity, this linear specification gives a conditionalversion of the [19] three-factor model designed toexplain cross-sectional differences in expected returns(see Factor Models).

If the factors are among the asset payoffs and aconditional (on Gt ) riskless payoff is included in Yt+1,then

p∗t+1 = λt · zt+1 + αt (36)

and thus λt · zt+1 + αt is the conditional least-squaresregression of st+1 onto Yt+1.

For estimation and inference, consider the spe-cial case in which Gt is degenerate and the vec-tor Yt+1 consists of excess returns (Qt is a vectorof zeros). Suppose that the data generation processfor {(zt+1, Yt+1)} is independent and identically dis-tributed (i.i.d.) and multivariate normal. Then αt andλt are time invariant. The coefficient vectors can beefficiently estimated by maximum likelihood and thepricing restrictions tested by likelihood ratio statis-tics [22]. As MacKinlay and Richardson [44] pointout, it is important to relax the i.i.d. normal assump-tion in many applications. In contrast with the para-metric maximum likelihood approach, generalizedmethod of moments (GMM) provides an econometricframework that allows conditional heteroskedasticityand temporal dependence [23]. Moreover, the GMMapproach offers the important advantage to providea unified framework for the testing of conditionallinear beta pricing models. See [37] for an exten-sive discussion of the application of GMM to linearfactor models. We will have more to say about esti-mation when Gt is not degenerate in our subsequentdiscussion.

Suppose that the conditional linear pricing modelis misspecified. Hansen and Jagannathan [29] show

that choosing (λt , αt ) to minimize the maximumpricing error of payoffs with E(|pt+1|2|Gt ) = 1 isequivalent to solving the least-squares problem:e

minλt ,αt ,vt+1,E((vt+1)2|Gt )<∞

E[(λt · zt+1 + αt − vt+1)

2|Gt

]subject to Qt = E (vt+1Yt+1|Gt ) (37)

This latter problem finds a random variable vt+1

that is close to λt · zt+1 + αt , allowing for departuresfrom pricing formula (33) where vt+1 is required torepresent prices correctly. For a fixed (λt , αt ), the“concentrated” objective is

[E[λt · zt+1 + αt )Yt+1|Gt ] − Qt

]′

× [E

(Yt+1Yt+1

′|Gt

)]−1

× [E[λt · zt+1 + αt )Yt+1|Gt ] − Qt

](38)

which is a quadratic form of the vector of pric-ing error. The random vector (λt , αt ) is chosen tominimize this pricing error. In the case of a cor-rect specification, this minimization results in a zeroobjective; otherwise, it provides a measure of modelmisspecification.

Estimating Parametric Models

In examples of SDFs like those given in the sectionInvestors’ Preferences, there are typically unknownparameters to estimate. This parametric structureoften permits the identification of the SDF even withlimited data on security market payoffs and prices.To explore this approach, we introduce a parameterθ that governs say investors’ preferences. It is worthnoting that inference about θ is a semiparametricstatistical problem since we avoid the specificationof the law of motion of asset payoffs and pricesalong with the determinants of SDFs. In what follows,we sketch the main statistical issues in the followingsimplified context.

Suppose that st+1 = φt+1(θo) is a parameterizedmodel of an SDF that satisfies

E[φt+1(θo)Yt+1|Gt

] − Qt = 0 (39)

where φt+1 can depend on observed data but theparameter vector θo is unknown. This conditional

Page 8: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

8 Pricing Kernels

moment restriction implies a corresponding uncon-ditional moment condition:

E[φt+1(θo)Yt+1 − Qt

] = 0 (40)

As emphasized by Hansen and Richard [30], con-ditioning information can be brought in through theback door by scaling payoffs and their correspondingprices by random variables that are Gt measurable.Hansen and Singleton [33] describe how to use theunconditional moment condition to construct a GMMestimator of the parameter vector θo with propertiescharacterized by Hansen [23] (see also GeneralizedMethod of Moments (GMM)).As an alternative, the parameterized family of modelsmight be misspecified. Then the approach of [29]described previously could be used, whereby anestimator of θo is obtained by minimizing

[1

N

N∑t=1

φt+1(θ)Yt+1 − Qt

][1

N

N∑t=1

Yt+1Yt+1′]−1

×[

1

N

N∑t=1

φt+1(θ)Yt+1 − Qt

](41)

with respect to θ . This differs from a GMM formu-lation because the weighting matrix in the quadraticform does not depend on the SDF, but only on thesecond moment matrix of the payoff vector Yt+1. Thisapproach suffers from a loss of statistical efficiencyin estimation when the model is correctly specified,but it facilitates comparisons across models becausethe choice of an SDF does not alter how the overallmagnitude of the pricing errors is measured.As in [26, 29], the analysis can be modified toincorporate the restriction that pricing errors shouldalso be small for payoffs on derivative claims. Thiscan be formalized by computing the time seriesapproximation to the least-squares distance betweena candidate, but perhaps misspecified, SDF from thefamily of strictly positive random variables zt+1 thatsolve the pricing restriction:

E(zt+1Yt+1 − Qt) = 0 (42)

See [25] for more discussion of these alternativeapproaches for estimation.So far we have described econometric methods thatreduce conditional moment restrictions (33) and (40)

to their unconditional counterparts. From a statisti-cal perspective, it is more challenging to work withthe original conditional moment restriction. Alongthis vein, Ai and Chen [2] and Antoine et al. [7]develop nonparametric methods to estimate condi-tional moment restrictions used to represent pricingrestrictions. In particular, Antoine et al. [7] developa conditional counterpart to the continuously updatedGMM estimator introduced by Hansen et al. [27]in which the weighting matrix in GMM objectiveexplicitly depends on the unknown parameter to beestimated.f These same methods can be adapted toexplicitly take account of conditioning informationwhile allowing for misspecification as in [29].

Acknowledgment

Jarda Borovicka provided suggestions that were veryhelpful for this article.

End Notes

a.See [46] for a dynamic extension using this approach.b.See [10] for a continuous-time counterpart.c.Although Hansen and Scheinkman [32] use a continuous-time formulation, discrete-time counterparts to their analy-sis are straightforward to obtain.d.These authors do not explicitly use conditioning informa-tion. In contrast, Gallant et al. [20] estimate conditionalmoment restrictions using a flexible parameterization forthe dynamic evolution of the data.e.Hansen and Jagannathan [29] abstract from condition-ing information, but what follows is a straightforwardextension.f.The continuously updated estimator is also closely relatedto the “minimum entropy” and “empirical likelihood”approaches (see Entropy-based Estimation).

References

[1] Abel, A.B. (1990). Asset prices under habit formationand catching up with the Joneses, American EconomicReview 80(2), 38–42.

[2] Ai, C. & Chen, X. (2003). Efficient estimation ofmodels with conditional moment restrictions containingunknown functions, Econometrica 71(6), 1795–1843.

[3] Ait-Sahalia, Y. & Lo, A.W. (1998). Nonparametricestimation of state-price densities implicit in financialasset prices, Journal of Finance 53(2), 499–547.

[4] Alvarez, F. & Jermann, U.J. (2000). Efficiency, equilib-rium, and asset pricing with risk of default, Economet-rica 68(4), 775–798.

Page 9: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

Pricing Kernels 9

[5] Alvarez, F. & Jermann, U.J. (2005). Using asset prices tomeasure the persistence of the marginal utility of wealth,Econometrica 73(6), 1977–2016.

[6] Anderson, E.W., Hansen, L.P. & Sargent, T.J. (2003).A quartet of semigroups for model specification, robust-ness, prices of risk, and model detection, Journal of theEuropean Economic Association 1, 69–123.

[7] Antoine, B., Bonnal, H. & Renault, E. (2007). On theefficient use of the informational content of estimatingequations: implied probabilities and euclidean empiricallikelihood, Journal of Econometrics 138(2), 461–487.

[8] Bakshi, G.S. & Chen, Z. (1996). The spirit of capitalismand stock-market prices, American Economic Review86(1), 133–157.

[9] Bansal, R. & Lehmann, B.N. (1997). Growth-optimalportfolio restrictions on asset pricing models, Macroe-conomic Dynamics 1(2), 333–354.

[10] Breeden, D.T. (1979). An intertemporal asset pric-ing model with stochastic consumption and investmentopportunities, Journal of Financial Economics 7(3),265–296.

[11] Campbell, J.Y. (2003). Consumption-based asset pric-ing, in G.M. Constantinides, M. Harris & R.M. Stulz,eds, Handbook of the Economics of Finance, Elsevier,Chapter 13, Vol. 1, pp. 803–887.

[12] Campbell, J.Y. & Cochrane, J.H. (1999). By forceof habit, Journal of Political Economy 107(2),205–251.

[13] Chien, Y.L. & Lustig, H.N. (2008). The market price ofaggregate risk and the wealth distribution, forthcomingin the Review of Financial Studies.

[14] Cochrane, J.H. & Hansen, L.P. (1992). Asset pric-ing explorations for macroeconomics, NBER Macroeco-nomics Annual 7, 115–165.

[15] Constantinides, G.M. (1990). Habit formation: a resolu-tion of the equity premium puzzle, Journal of PoliticalEconomy 98(3), 519–543.

[16] Constantinides, G.M. & Duffie, D. (1996). Asset pric-ing with heterogeneous consumers, Journal of PoliticalEconomy 104(2), 219–240.

[17] Debreu, G. (1954). Valuation equilibrium and Paretooptimum, Proceedings of the National Academy of Sci-ences 40(7), 588–592.

[18] Epstein, L. & Zin, S. (1989). Substitution, risk aversionand the temporal behavior of consumption and assetreturns: A theoretical framework, Econometrica 57(4),937–969.

[19] Fama, E.F. & French, K.R. (1992). The cross-section ofexpected stock returns, Journal of Finance 47, 427–465.

[20] Gallant, A.R., Hansen, L.P. & Tauchen, G. (1990).Using conditional moments of asset payoffs to infer thevolatility of intertemporal marginal rates of substitution,Journal of Econometrics 45(1–2), 141–179.

[21] Garcia, R., Renault, E. & Semenov, A. (2006). Dis-entangling risk aversion and intertemporal substitutionthrough a reference utility level, Finance Research Let-ters 3, 181–193.

[22] Gibbons, M.R., Ross, S.A. & Shanken, J. (1989). Atest of the efficiency of a given portfolio, Econometrica57(5), 1121–1152.

[23] Hansen, L.P. (1982). Large sample properties of gen-eralized method of moments estimators, Econometrica50(4), 1029–1054.

[24] Hansen, L.P. (2008). Modeling the long run: valuationin dynamic stochastic economies, Fisher-Schultz Lectureat the 2006 Meetings of the Econometric Society, Vienna,Austria, 2006.

[25] Hansen, L.P., Heaton, J. Lee, J. & Roussanov, N. (2007).Handbook of Econometrics, Intertemporal substitutionand risk aversion, in J.J. Heckman & E.E. Leamer, eds,Handbook of Econometrics, Elsevier, Chapter 61, Vol. 6.

[26] Hansen, L.P., Heaton, J. & Luttmer, E.G.J. (1995).Econometric evaluation of asset pricing models, Reviewof Financial Studies 8(2), 237–274.

[27] Hansen, L.P., Heaton, J. & Yaron, A. (1996). Finite-sample properties of some alternative gmm estima-tors, Journal of Business and Economic Statistics 14(3),262–280.

[28] Hansen, L.P. & Jagannathan, R. (1991). Implications ofsecurity market data for models of dynamic economies,Journal of Political Economy 99(2), 225–262.

[29] Hansen, L.P. & Jagannathan, R. (1997). Assessingspecification errors in stochastic discount factor models,Journal of Finance 52(2), 557–590.

[30] Hansen, L.P. & Richard, S.F. (1987). The role of con-ditioning information in deducing testable restrictionsimplied by dynamic asset pricing models, Econometrica50(3), 587–614.

[31] Hansen, L.P., Sargent, T.J. & Tallarini, T.D. Jr. (1999).Robust permanent income and pricing, The Review ofEconomic Studies 66(4), 873–907.

[32] Hansen, L.P. & Scheinkman, J.A. (2009). Long-termrisk: an operator approach, Econometrica 77(1),177–234.

[33] Hansen, L.P. & Singleton, K.J. (1982). Generalizedinstrumental variables estimation of nonlinear rationalexpectations models, Econometrica 50(5), 1269–1286.

[34] Harrison, J.M. & Kreps, D.M. (1979). Martingales andarbitrage in multiperiod securities markets, Journal ofEconomic Theory 20, 381–408.

[35] He, H. & Modest, D.M. (1995). Market frictions andconsumption-based asset pricing, Journal of PoliticalEconomy 103(1), 94–117.

[36] Heaton, J. (1995). An empirical investigation of assetpricing with temporally dependent preference specifica-tions, Econometrica 63(3), 681–717.

[37] Jagannathan, R., Soulakis, G. & Wang, Z. (2009). Theanalysis of the cross section of security returns, inforthcoming in the Handbook of Financial Econometrics,Y. Ait-Sahalia & L.P. Hansen, Elsevier.

[38] Kehoe, T.J. & Levine, D.K. (1993). Debt-constrainedasset markets, Review of Economic Studies 60(4),865–888.

Page 10: Pricing Kernels π t - Becker Friedman Institute · PDF file2 Pricing Kernels representation ... justified by the law of iterated values when trading is ... mild modification of

10 Pricing Kernels

[39] Kocherlakota, N.R. (1996). Implications of efficientrisk sharing without commitment, Review of EconomicStudies 63(4), 595–609.

[40] Kocherlakota, N. & Pistaferri, L. (2009). Asset pricingimplications of Pareto optimality with private informa-tion, Journal of Political Economy 117(3), 555–590.

[41] Kreps, D.M. & Porteus, E.L. (1978). Temporal resolutionof uncertainty and dynamic choice, Econometrica 46(1),185–200.

[42] Lucas, R.E. (1978). Asset prices in an exchange econ-omy, Econometrica 46(6), 1429–1445.

[43] Luttmer, E.G.J. (1996). Asset pricing in economies withfrictions, Econometrica 64(6), 1439–1467.

[44] MacKinlay, A.C. & Richardson, M.P. (1991). Usinggeneralized method of moments to test mean-varianceefficiency, Journal of Finance 46, 511–527.

[45] Menzly, L., Santos, T. & Veronesi, P. (2004). Under-standing predictability, Journal of Political Economy112(1), 1–47.

[46] Rogers, L.C.G. (1998). The origins of risk-neutral pric-ing and the Black-Scholes formula, in Risk Managementand Analysis, C.O. Alexander, ed., Wiley, New York,Chapter 2, pp. 81–94.

[47] Rogerson, W.P. (1985). The first-order approachto principal-agent problems, Econometrica 53(6),1357–1367.

[48] Rosenberg, J.V. & Engle, R.F. (2002). Empirical pric-ing kernels, Journal of Financial Economics 64(3),341–372.

[49] Ross, S.A. (1976). The arbitrage theory of capital assetpricing, Journal of Economic Theory 13, 341–360.

[50] Rubinstein, M. (1976). The valuation of uncertainincome streams and the valuation of options, Bell Jour-nal 7, 407–425.

[51] Snow, K. (1991). Diagnosing asset pricing using thedistribution of asset returns, Journal of Finance 46(3),955–983.

[52] Tallarini, T.D. (2000). Risk-sensitive real businesscycles, Journal of Monetary Economics 45(3), 507–532.

[53] Weil, P. (1990). Nonexpected utility in macroeconomics,The Quarterly Journal of Economics 105(1), 29–42.

Related Articles

Arrow–Debreu Prices; Complete Markets; FixedMix Strategy; Fundamental Theorem of Asset Pri-cing; Stochastic Discount Factors.

LARS PETER HANSEN & ERIC RENAULT