Limiting Spectral Distribution of Large Sample Covariance ... fileLimiting Spectral Distribution of...

Limiting Spectral Distribution of Large SampleCovariance Matrices

Marwa Banna

PhD student at Universite Paris-Est Marne-La-Vallee under the direction ofFlorence Merlevede and Emmanuel Rio

Young Women in ProbabilityBonn, Germany

26 May 2014

Introduction and Motivation

• Let X1, . . . ,Xn ∈ RN be i.i.d centered random vectors withcovariance matrix Σ = E(X1XT

1 ) = . . . = E(XnXTn ).

• The sample covariance matrix Bn is defined by

n∑k=1

XkXTk =

n , Xn = (X1 . . .Xn) ∈MN×n(R)

• E(Bn) = Σ

• For fixed N, the strong law of large numbers implies

limn→+∞

Bn = limn→+∞

n∑k=1

XkXTk = Σ a.s.

• What happens once N := Nn →∞ as n→∞ ?

1 ) = . . . = E(XnXTn ).

n∑k=1

XkXTk =

n , Xn = (X1 . . .Xn) ∈MN×n(R)

• E(Bn) = Σ

limn→+∞

Bn = limn→+∞

n∑k=1

XkXTk = Σ a.s.

1 ) = . . . = E(XnXTn ).

n∑k=1

XkXTk =

n , Xn = (X1 . . .Xn) ∈MN×n(R)

• E(Bn) = Σ

limn→+∞

Bn = limn→+∞

n∑k=1

XkXTk = Σ a.s.

1 ) = . . . = E(XnXTn ).

n∑k=1

XkXTk =

n , Xn = (X1 . . .Xn) ∈MN×n(R)

• E(Bn) = Σ

limn→+∞

Bn = limn→+∞

n∑k=1

XkXTk = Σ a.s.

Marcenko-Pastur Theorem

• Let (Xij)i ,j>1 be a family of i.i.d. random variables such thatE(X11) = 0 and Var(X11) = σ2.

• Let Bn = 1n

∑nk=1 XkX

Tk = 1

nXnXTn where

Xn = (X1 . . .Xn) =

X11 X12 . . . X1n...

......

XN1 XN2 . . . XNn

• The empirical spectral measure of Bn is defined by

µBn =1

N∑k=1

where λ1, . . . , λN are the eigenvalues of Bn.

• We suppose that cn := Nn −−−−→n→+∞

c ∈ (0,∞).

• Let Bn = 1n

∑nk=1 XkX

Tk = 1

nXnXTn where

Xn = (X1 . . .Xn) =

X11 X12 . . . X1n...

......

XN1 XN2 . . . XNn

µBn =1

N∑k=1

c ∈ (0,∞).

• Let Bn = 1n

∑nk=1 XkX

Tk = 1

nXnXTn where

Xn = (X1 . . .Xn) =

X11 X12 . . . X1n...

......

XN1 XN2 . . . XNn

µBn =1

N∑k=1

c ∈ (0,∞).

TheoremFor any continuous and bounded function f : R→ R,∫

f dµBn −−−−→n→+∞

∫f dµc a.s.

where µc is the Marcenko-Pastur law(1− 1

δ0 +1

2πcσ2x

√(b − x)(x − a)1[a,b](x)dx

with .+ := max(0, .), a = σ2(1−√

c)2 and b = σ2(1 +√

The Stieltjes Transform

The Stieltjes transform SG : C+ → C of a measure ν on R isdefined by

Sν(z) :=

x − zdν(x)

• |Sν(z)| 6 1/Im(z) and Im(Sν(z)) > 0

• The function Sν is analytic over C+ and characterizes ν

ν([a, b]) = limy↓0

aImSν(x + iy) dx

• For a sequence of measures (νn)n on R, we have(νn

L−−−→n→∞

ν)⇔(∀z ∈ C+, Sνn(z) −−−→

n→∞Sν(z)

Sν(z) :=

x − zdν(x)

• |Sν(z)| 6 1/Im(z) and Im(Sν(z)) > 0

ν([a, b]) = limy↓0

aImSν(x + iy) dx

L−−−→n→∞

ν)⇔(∀z ∈ C+, Sνn(z) −−−→

n→∞Sν(z)

Sν(z) :=

x − zdν(x)

• |Sν(z)| 6 1/Im(z) and Im(Sν(z)) > 0

ν([a, b]) = limy↓0

aImSν(x + iy) dx

L−−−→n→∞

ν)⇔(∀z ∈ C+, Sνn(z) −−−→

n→∞Sν(z)

Sν(z) :=

x − zdν(x)

• |Sν(z)| 6 1/Im(z) and Im(Sν(z)) > 0

ν([a, b]) = limy↓0

aImSν(x + iy) dx

L−−−→n→∞

ν)⇔(∀z ∈ C+, Sνn(z) −−−→

n→∞Sν(z)

Non linear Stationary Process• X = (Xk)k∈Z is said to be stationary if ∀k ∈ Z and ∀` ∈ N,

(Xk , . . . ,Xk+`)D∼ (X0, . . . ,X`)

• Let g : RZ → R be a measurable function such that for anyk ∈ Z,

Xk = g(ξk) with ξk = (. . . , εk−1, εk)

is well-defined, E(Xk) = 0 and ‖Xk‖2 <∞.• For i = 1, . . . , n, let (X i

k)k∈Z be n independent copies of(Xk)k∈Z

X 11 X 2

1 . . . X n1

Xn =...

......

X 1N X 2

N . . . X nN

X1 X2 Xn

• Bn = 1nXnXT

n = 1n

∑nk=1 XkX

Non linear Stationary Process

• Let (εk)k∈Z be a sequence of i.i.d. random variables

is well-defined, E(Xk) = 0 and ‖Xk‖2 <∞.

• For i = 1, . . . , n, let (X ik)k∈Z be n independent copies of

(Xk)k∈Z

X 11 X 2

1 . . . X n1

Xn =...

......

X 1N X 2

N . . . X nN

X1 X2 Xn

• Bn = 1nXnXT

n = 1n

∑nk=1 XkX

(Xk)k∈Z

X 11 X 2

1 . . . X n1

Xn =...

......

X 1N X 2

N . . . X nN

X1 X2 Xn

• Bn = 1nXnXT

n = 1n

∑nk=1 XkX

(Xk)k∈Z

X 11 X 2

1 . . . X n1

Xn =...

......

X 1N X 2

N . . . X nN

X1 X2 Xn

• Bn = 1nXnXT

n = 1n

∑nk=1 XkX

(Xk)k∈Z X 11 X 2

1 . . . X n1

Xn =...

......

X 1N X 2

N . . . X nN

X1 X2 Xn

• Bn = 1nXnXT

n = 1n

∑nk=1 XkX

(Xk)k∈Z X 11 X 2

1 . . . X n1

Xn =...

......

X 1N X 2

N . . . X nN

X1 X2 Xn

• Bn = 1nXnXT

n = 1n

∑nk=1 XkX

Theorem: Banna and Merlevede (2013)

Suppose that limn→∞ N/n = c ∈ (0,∞) and∑k≥0

‖P0(Xk)‖2 <∞ where P0(Xk) = E(Xk |ξ0)− E(Xk |ξ−1)

Then with probability one µBnconverges weakly to a non-randomprobability measure whose Stieltjes transform S = S(z) satisfies theequation

z = − 1

∫ 2π

S +(2πf (λ)

)−1 dλ ,

where S(z) := −(1− c)/z + cS(z) and f (·) is the spectral density of(Xk)k∈Z.

f (x) =1

Cov (X0,Xk)e ixk , x ∈ R

Theorem: Banna and Merlevede (2013)

Suppose that limn→∞ N/n = c ∈ (0,∞) and∑k≥0

‖P0(Xk)‖2 <∞ where P0(Xk) = E(Xk |ξ0)− E(Xk |ξ−1)

Then with probability one µBnconverges weakly to a non-randomprobability measure whose Stieltjes transform S = S(z) satisfies theequation

z = − 1

∫ 2π

S +(2πf (λ)

)−1 dλ ,

where S(z) := −(1− c)/z + cS(z) and f (·) is the spectral density of(Xk)k∈Z.

f (x) =1

Cov (X0,Xk)e ixk , x ∈ R

Applications

• Xk =∑

i≥0 aiεk−i where (εi )i∈Z is a sequence of i.i.d.centered random variables then

E(Xk |ξ0) =∑i≥k

aiεk−i and E(Xk |ξ−1) =∑

i≥k+1

aiεk−i

Thus P0(Xk) = akε0.

The condition∑

k≥0 ‖P0(Xk)‖2 <∞ issatisfied once ∑

|ak | <∞ and ‖ε0‖2 <∞.

• Functions of linear processes (ex: Riesz-Raikov sums)

• ARCH models

Applications

• Xk =∑

i≥k+1

aiεk−i

Thus P0(Xk) = akε0. The condition∑

|ak | <∞ and ‖ε0‖2 <∞.

• ARCH models

Applications

• Xk =∑

i≥k+1

aiεk−i

Thus P0(Xk) = akε0. The condition∑

|ak | <∞ and ‖ε0‖2 <∞.

• ARCH models

Strategy of ProofOur aim: limn→∞ SµBn (z) = S(z), ∀z ∈ C+.

1. SµBn (z)− E(SµBn (z)

)→ 0 a.s.

Concentration inequality

2. E(SµBn (z)

)−E(SµGn (z)

)→0 where Gn = 1

∑nk=1 ZkZ

Tk Z 1

1 Z 21 . . . Zn

Zn =...

......

Z 1N Z 2

N . . . ZnN

Z1 Z2 Zn

((Z ik)k∈Z)1≤i≤n are n independent copies of (Zk)k∈Z such

that for any i , j ∈ Z,

Cov(Zi ,Zj) = Cov(Xi ,Xj)

3. E(SµGn (z)

)− S(z)→ 0

)→ 0 a.s.

2. E(SµBn (z)

)−E(SµGn (z)

)→0 where Gn = 1

∑nk=1 ZkZ

Tk Z 1

1 Z 21 . . . Zn

Zn =...

......

Z 1N Z 2

N . . . ZnN

Z1 Z2 Zn

3. E(SµGn (z)

)− S(z)→ 0

)→ 0 a.s.

2. E(SµBn (z)

)−E(SµGn (z)

)→0 where Gn = 1

∑nk=1 ZkZ

Tk Z 1

1 Z 21 . . . Zn

Zn =...

......

Z 1N Z 2

N . . . ZnN

Z1 Z2 Zn

3. E(SµGn (z)

)− S(z)→ 0

)→ 0 a.s. Concentration inequality

2. E(SµBn (z)

)−E(SµGn (z)

)→0 where Gn = 1

∑nk=1 ZkZ

Tk Z 1

1 Z 21 . . . Zn

Zn =...

......

Z 1N Z 2

N . . . ZnN

Z1 Z2 Zn

3. E(SµGn (z)

)− S(z)→ 0

2. E(SµBn (z)

)−E(SµGn (z)

)→0 where Gn = 1

∑nk=1 ZkZ

Tk Z 1

1 Z 21 . . . Zn

Zn =...

......

Z 1N Z 2

N . . . ZnN

Z1 Z2 Zn

3. E(SµGn (z)

)− S(z)→ 0 Silverstein (1995)

2. E(SµBn (z)

)−E(SµGn (z)

)→0 where Gn = 1

∑nk=1 ZkZ

Tk Z 1

1 Z 21 . . . Zn

Zn =...

......

Z 1N Z 2

N . . . ZnN

Z1 Z2 Zn

Technic of blocks associated with the Lindeberg method

3. E(SµGn (z)

)− S(z)→ 0 Silverstein (1995)

Banna, M., Merlevede, F. (2013). Limiting spectral distributionof large sample covariance matrices associated with a class ofstationary processes. Journal of Theoretical Probability, 1-39.

Chatterjee, S. (2006). A generalization of the Lindebergprinciple. Ann. Probab. 34, 2061-2076.

Marcenko, V. and Pastur, L. (1967). Distribution ofeigenvalues for some sets of random matrices. Mat. Sb. 72,507-536.

10/ 11

Thank you for your attention!

11/ 11

Limiting Spectral Distribution of Large Sample Covariance ... fileLimiting Spectral Distribution of...

Documents

Transcript of Limiting Spectral Distribution of Large Sample Covariance ... fileLimiting Spectral Distribution of...

The Spectral Method

Learning Spectral Graph Segmentation

Spectral Analysis

Spectral theory of automorphic forms

Sparse Covariance Selection via Robust Maximum Likelihood ...

Improved Cheeger’s Inequality: Analysis of Spectral ...Improved Cheeger’s Inequality: Analysis of Spectral Partitioning Algorithms through Higher Order Spectral Gap Tsz Chiu Kwok

Beam Parameters, Spectral Brighness, Harmonics and Wiggler ...attwood/srms/2007/Lec11.pdfCh05_F30_32VG.ai K = 1 • Narrow spectral lines • High spectral brightness • Partial coherence

Continuous Probability Distributions Exponential, Erlang ...€¦ · •The Erlang distribution is a generalization of the exponential distribution. •The exponential distribution

Sparse Covariance Selection using Semideﬁnite Programmingaspremon/PDF/SIAM06am.pdfSparse Covariance Selection using Semideﬁnite Programming A. d’Aspremont ORFE, Princeton University

Inverse-covariance matrix of linear operators for quantum

Spectral Estimation 2

input u y F Δ H x - Portland State University · Actual (sampling) Linearized (EKF) UT sigma points true mean UT mean and covariance weighted sample mean mean UT covariance covariance

Moment Distribution

Lecture 32 Analysis of Covariance II - Purdue Universityghobbs/STAT_512/Lecture_Notes/ANO… · Lecture 32 Analysis of Covariance II STAT 512 Spring 2011 Background Reading KNNL:

Biostatistics - Normal Distribution

Lecture 32 Analysis of Covariance II - Purdue Universityghobbs/STAT_512/Lecture_Notes/... · Lecture 32 Analysis of Covariance II STAT 512 ... • Factor 2 is gender of the person

Metabolos: Composing through a Spectral Scope

Regularized M-estimators of the Covariance Matrix

VECTOR AUTOREGRESSIVE COVARIANCE MATRIX ESTIMATIONecon.lse.ac.uk/staff/wdenhaan/papers/asym35.pdf · VECTOR AUTOREGRESSIVE COVARIANCE MATRIX ESTIMATION ... (PSD) kernels. The VAR

Cannabinoid Distribution