Mean Reversion with a VarianceThreshold - Marco Cuturi · 2016-10-04 · Mean Reversion with a...

1
Mean Reversion with a VarianceThreshold Marco Cuturi 1 , Alexandre d’Aspremont 2 1 Kyoto University (Japan), 2 CNRS - Ecole Polytechnique (France) This Poster in 1 Minute Classic Cointegration Problem: Given a multivariate time series x t R n , find α such that y T x t is stationary. Our formulation: Find cointegrated relationship such that y T x t also has fast mean-reversion and sufficient variance. Motivation: Makes a lot of sense in financial applica- tions. We expect it can also be applied to other fields, such as anomaly detection. Approach: Formulate natural criteria that take into account both mean-reversion and variance. Optimization: These criteria are not convex. We approx- imate them (and solve them exactly in some cases) using semidefinite programming and the S -lemma. Experiments: We illustrate that, on stock volatility data, Mean-reversion statistical arbitrage opportunities. Sufficient variance lower transaction costs. 1. Mean-Reversion & Cointegration Loose Definition of Mean Reversion: Tendency of a stochastic process to revert (pull back) to its mean. Mean-reversion = Statistical Arbitrage Opportunity Which assets are mean-reverting? Stationary processes are mean-reverting, Arbitraging stationary assets is therefore desirable. In practice, very few assets are stationary. Those who are tend to revert to their means very slowly. However, combining assets can result in stationarity: "pair-trades" when n =2, "baskets" when n 3. Finding weights y such that y T x t is stationary = cointegration theory (econometrics, VAR modeling) 2. Stationarity is not enough Problem 1: slow mean-reversion is bad. Slow mean-reversion smaller expected arbitrage. Problem 2: small variance is bad. Large variance larger arbitrage expected per trade. Both problems lead to more leverage = higher risk. Both issues are not addressed by classic cointegration methods, which focus exclusively on stationarity. When n 1, when estimating y from finite samples, small variance can also mean overfitting. 3. Criteria •A k = E[x t x T t+k ],k 0 when finite. When x =(x 1 ,..., x T ) and each x t R n , A k def = 1 T - i - 1 T -k X t=1 ˜ x t ˜ x T t+k , ˜ x t def = x t - 1 T T X t=1 x t . To quantify mean-reversion, three different proxies: 1. Portmanteau (Ljung and Box, 1978) φ p (y ) def = por p (y T x t )= 1 p p X i=1 y T A i y y T A 0 y 2 , (norm of autocorrelogram) 2. Crossing Stats (Kedem and Yakowitz, 1994) Number of times y T x t crosses its mean is a decreasing function of y T A 1 y assuming y T A k y 0,k> 1. 3. Predictability (Box and Tiao, 1977) Suppose x t x t-1 + ε t , where ˆ x t-1 is a predictor of x t ; ε t i.i.d. Gaussian (0, Σ). n=1: E[x 2 t ]= Ex 2 t-1 ]+ E[ε 2 t ], thus 1= ˆ σ 2 σ 2 + Σ σ 2 , Box and Tiao measure the predictability of x t by the ratio λ def = ˆ σ 2 σ 2 . n>1: Consider the process (y T x t ) t with y R n . We can measure the predicability of y T x t as λ(y ) def = y T ˆ A 0 y y T A 0 y , where ˆ A 0 and A 0 are covariance matrices of x t and ˆ x t-1 . To Quantify variance var(y T x t )= y T A 0 y > ν. Mean Reversion with Variance Threshold minimize p i=1 ( y T A i y ) 2 subject to y T A 0 y ν ky k 2 =1, (P1) minimize y T A 1 y + μ p k=2 ( y T A k y ) 2 subject to y T A 0 y ν ky k 2 =1, (P2) minimize y T My subject to y T A 0 y ν ky k 2 =1, (P3) 4. SDP Relaxations SDP Formulation Writing Y = yy T , Brickman (1961): when n 3, (y T Ay, y T By ): y R n , ky k 2 =1 = {(Tr(AY ), Tr(BY )) : Y S n , Tr Y =1,Y 0} minimize p i=1 Tr(A i Y ) 2 subject to Tr(BY ) ν Tr(Y )=1,Y 0, (SDP1) minimize Tr(A 1 Y )+ μ p i=2 Tr(A i Y ) 2 subject to Tr(BY ) ν Tr(Y )=1,Y 0 (SDP2) minimize Tr(MY ) subject to Tr(BY ) ν Tr(Y )=1,Y 0, (SDP3) Exact solutions when p =1, approximation (randomiza- tion, leading eigenvector) when p> 1. 5. Experiments Data: implied volatility data for 217 stocks. Sample Trade Episode: using our approach and OLS Results: 20 time windows, results on most

Transcript of Mean Reversion with a VarianceThreshold - Marco Cuturi · 2016-10-04 · Mean Reversion with a...

Page 1: Mean Reversion with a VarianceThreshold - Marco Cuturi · 2016-10-04 · Mean Reversion with a VarianceThreshold Marco Cuturi1, Alexandre d’Aspremont2 1 Kyoto University (Japan),

Mean Reversion with a Variance ThresholdMarco Cuturi1, Alexandre d’Aspremont2

1 Kyoto University (Japan), 2 CNRS - Ecole Polytechnique (France)

This Poster in 1 MinuteClassic Cointegration Problem: Given a multivariatetime series xt ∈ Rn, find α such that yTxt is stationary.

Our formulation: Find cointegrated relationship suchthat yTxt also has fast mean-reversion and sufficientvariance.

Motivation: Makes a lot of sense in financial applica-tions. We expect it can also be applied to other fields,such as anomaly detection.

Approach: Formulate natural criteria that take intoaccount both mean-reversion and variance.

Optimization: These criteria are not convex. We approx-imate them (and solve them exactly in some cases) usingsemidefinite programming and the S-lemma.

Experiments: We illustrate that, on stock volatility data,Mean-reversion→ statistical arbitrage opportunities.Sufficient variance→ lower transaction costs.

1. Mean-Reversion & CointegrationLoose Definition of Mean Reversion: Tendency of astochastic process to revert (pull back) to its mean.

Mean-reversion = Statistical Arbitrage Opportunity

Which assets are mean-reverting?

• Stationary processes are mean-reverting,

• Arbitraging stationary assets is therefore desirable.

In practice, very few assets are stationary. Those who aretend to revert to their means very slowly.

However, combining assets can result in stationarity:"pair-trades" when n = 2, "baskets" when n ≥ 3.

Finding weights y such that yTxt is stationary=

cointegration theory (econometrics, VAR modeling)

2. Stationarity is not enough

Problem 1: slow mean-reversion is bad.

Slow mean-reversion→ smaller expected arbitrage.

Problem 2: small variance is bad.

Large variance→ larger arbitrage expected per trade.

Both problems lead to more leverage = higher risk.

Both issues are not addressed by classic cointegrationmethods, which focus exclusively on stationarity.

When n � 1, when estimating y from finite samples, small variancecan also mean overfitting.

3. Criteria• Ak = E[xtx

Tt+k], k ≥ 0 when finite.

• When x = (x1, . . . , xT ) and each xt ∈ Rn,

Akdef=

1

T − i− 1

T−k∑t=1

xtxTt+k, xt

def= xt −

1

T

T∑t=1

xt.

To quantify mean-reversion, three different proxies:

1. Portmanteau (Ljung and Box, 1978)

φp(y)def= porp(y

T xt) =1

p

p∑i=1

(yTAiyyTA0y

)2

,

(norm of autocorrelogram)

2. Crossing Stats (Kedem and Yakowitz, 1994)Number of times yTxt crosses its mean is a decreasingfunction of yTA1y assuming yTAky ≈ 0, k > 1.

3. Predictability (Box and Tiao, 1977) Suppose

xt = xt−1 + εt,

where xt−1 is a predictor of xt; εt i.i.d. Gaussian (0,Σ).

n=1: E[x2t ] = E[x2

t−1] + E[ε2t ], thus 1 = σ2

σ2 + Σσ2 ,

Box and Tiao measure the predictability of xt by the ratio

λdef=σ2

σ2.

n>1: Consider the process (yTxt)t with y ∈ Rn. We canmeasure the predicability of yTxt as

λ(y)def=yT A0y

yTA0y,

where A0 and A0 are covariance matrices of xt and xt−1.

To Quantify variance

var(yTxt) = yTA0y > ν.

Mean Reversion with Variance Threshold

minimize∑pi=1

(yTAiy

)2subject to yTA0y ≥ ν

‖y‖2 = 1,

(P1)

minimize yTA1y + µ∑pk=2

(yTAky

)2subject to yTA0y ≥ ν

‖y‖2 = 1,

(P2)

minimize yTMysubject to yTA0y ≥ ν

‖y‖2 = 1,(P3)

4. SDP RelaxationsSDP Formulation Writing Y = yyT ,

Brickman (1961): when n ≥ 3,{(yTAy, yTBy) : y ∈ Rn, ‖y‖2 = 1

}=

{(Tr(AY ),Tr(BY )) : Y ∈ Sn, TrY = 1, Y � 0}

minimize∑pi=1 Tr(AiY )2

subject to Tr(BY ) ≥ νTr(Y ) = 1, Y � 0,

(SDP1)

minimize Tr(A1Y ) + µ∑pi=2 Tr(AiY )2

subject to Tr(BY ) ≥ νTr(Y ) = 1, Y � 0

(SDP2)

minimize Tr(MY )subject to Tr(BY ) ≥ ν

Tr(Y ) = 1, Y � 0,(SDP3)

Exact solutions when p = 1, approximation (randomiza-tion, leading eigenvector) when p > 1.

5. ExperimentsData: implied volatility data for 217 stocks.Sample Trade Episode: using our approach and OLS

Results: 20 time windows, results on most

ReferencesReferencesBarvinok, A. (2002). A course in convexity. American Mathemat-

ical Society.

Box, G. and Tiao, G. (1977). A canonical analysis of multipletime series. Biometrika, 64(2):355–365.

Brickman, L. (1961). On the field of values of a matrix. Proceed-ings of the American Mathematical Society, pages 61–66.

Kedem, B. and Yakowitz, S. (1994). Time series analysis by higherorder crossings. IEEE press Piscataway, NJ.

Ljung, G. and Box, G. (1978). On a measure of lack of fit in timeseries models. Biometrika, 65(2):297–303.