Download - Mean Reversion with a VarianceThreshold - Marco Cuturi · 2016-10-04 · Mean Reversion with a VarianceThreshold Marco Cuturi1, Alexandre d’Aspremont2 1 Kyoto University (Japan),

Transcript
Page 1: Mean Reversion with a VarianceThreshold - Marco Cuturi · 2016-10-04 · Mean Reversion with a VarianceThreshold Marco Cuturi1, Alexandre d’Aspremont2 1 Kyoto University (Japan),

Mean Reversion with a Variance ThresholdMarco Cuturi1, Alexandre d’Aspremont2

1 Kyoto University (Japan), 2 CNRS - Ecole Polytechnique (France)

This Poster in 1 MinuteClassic Cointegration Problem: Given a multivariatetime series xt ∈ Rn, find α such that yTxt is stationary.

Our formulation: Find cointegrated relationship suchthat yTxt also has fast mean-reversion and sufficientvariance.

Motivation: Makes a lot of sense in financial applica-tions. We expect it can also be applied to other fields,such as anomaly detection.

Approach: Formulate natural criteria that take intoaccount both mean-reversion and variance.

Optimization: These criteria are not convex. We approx-imate them (and solve them exactly in some cases) usingsemidefinite programming and the S-lemma.

Experiments: We illustrate that, on stock volatility data,Mean-reversion→ statistical arbitrage opportunities.Sufficient variance→ lower transaction costs.

1. Mean-Reversion & CointegrationLoose Definition of Mean Reversion: Tendency of astochastic process to revert (pull back) to its mean.

Mean-reversion = Statistical Arbitrage Opportunity

Which assets are mean-reverting?

• Stationary processes are mean-reverting,

• Arbitraging stationary assets is therefore desirable.

In practice, very few assets are stationary. Those who aretend to revert to their means very slowly.

However, combining assets can result in stationarity:"pair-trades" when n = 2, "baskets" when n ≥ 3.

Finding weights y such that yTxt is stationary=

cointegration theory (econometrics, VAR modeling)

2. Stationarity is not enough

Problem 1: slow mean-reversion is bad.

Slow mean-reversion→ smaller expected arbitrage.

Problem 2: small variance is bad.

Large variance→ larger arbitrage expected per trade.

Both problems lead to more leverage = higher risk.

Both issues are not addressed by classic cointegrationmethods, which focus exclusively on stationarity.

When n � 1, when estimating y from finite samples, small variancecan also mean overfitting.

3. Criteria• Ak = E[xtx

Tt+k], k ≥ 0 when finite.

• When x = (x1, . . . , xT ) and each xt ∈ Rn,

Akdef=

1

T − i− 1

T−k∑t=1

xtxTt+k, xt

def= xt −

1

T

T∑t=1

xt.

To quantify mean-reversion, three different proxies:

1. Portmanteau (Ljung and Box, 1978)

φp(y)def= porp(y

T xt) =1

p

p∑i=1

(yTAiyyTA0y

)2

,

(norm of autocorrelogram)

2. Crossing Stats (Kedem and Yakowitz, 1994)Number of times yTxt crosses its mean is a decreasingfunction of yTA1y assuming yTAky ≈ 0, k > 1.

3. Predictability (Box and Tiao, 1977) Suppose

xt = xt−1 + εt,

where xt−1 is a predictor of xt; εt i.i.d. Gaussian (0,Σ).

n=1: E[x2t ] = E[x2

t−1] + E[ε2t ], thus 1 = σ2

σ2 + Σσ2 ,

Box and Tiao measure the predictability of xt by the ratio

λdef=σ2

σ2.

n>1: Consider the process (yTxt)t with y ∈ Rn. We canmeasure the predicability of yTxt as

λ(y)def=yT A0y

yTA0y,

where A0 and A0 are covariance matrices of xt and xt−1.

To Quantify variance

var(yTxt) = yTA0y > ν.

Mean Reversion with Variance Threshold

minimize∑pi=1

(yTAiy

)2subject to yTA0y ≥ ν

‖y‖2 = 1,

(P1)

minimize yTA1y + µ∑pk=2

(yTAky

)2subject to yTA0y ≥ ν

‖y‖2 = 1,

(P2)

minimize yTMysubject to yTA0y ≥ ν

‖y‖2 = 1,(P3)

4. SDP RelaxationsSDP Formulation Writing Y = yyT ,

Brickman (1961): when n ≥ 3,{(yTAy, yTBy) : y ∈ Rn, ‖y‖2 = 1

}=

{(Tr(AY ),Tr(BY )) : Y ∈ Sn, TrY = 1, Y � 0}

minimize∑pi=1 Tr(AiY )2

subject to Tr(BY ) ≥ νTr(Y ) = 1, Y � 0,

(SDP1)

minimize Tr(A1Y ) + µ∑pi=2 Tr(AiY )2

subject to Tr(BY ) ≥ νTr(Y ) = 1, Y � 0

(SDP2)

minimize Tr(MY )subject to Tr(BY ) ≥ ν

Tr(Y ) = 1, Y � 0,(SDP3)

Exact solutions when p = 1, approximation (randomiza-tion, leading eigenvector) when p > 1.

5. ExperimentsData: implied volatility data for 217 stocks.Sample Trade Episode: using our approach and OLS

Results: 20 time windows, results on most

ReferencesReferencesBarvinok, A. (2002). A course in convexity. American Mathemat-

ical Society.

Box, G. and Tiao, G. (1977). A canonical analysis of multipletime series. Biometrika, 64(2):355–365.

Brickman, L. (1961). On the field of values of a matrix. Proceed-ings of the American Mathematical Society, pages 61–66.

Kedem, B. and Yakowitz, S. (1994). Time series analysis by higherorder crossings. IEEE press Piscataway, NJ.

Ljung, G. and Box, G. (1978). On a measure of lack of fit in timeseries models. Biometrika, 65(2):297–303.