Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

31
Data-Driven Spectral Decomposition and Forecasting of Ergodic Dynamical Systems Dimitris Giannakis Courant Institute of Mathematical Sciences, NYU IPAM Workshop on Uncertainty Quantification for Multiscale Stochastic Systems and Applications January 21, 2016

Transcript of Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Page 1: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Data-Driven Spectral Decomposition andForecasting of Ergodic Dynamical Systems

Dimitris GiannakisCourant Institute of Mathematical Sciences, NYU

IPAM Workshop on Uncertainty Quantification for MultiscaleStochastic Systems and Applications

January 21, 2016

Page 2: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Setting & objectives

M

F

xi = F (ai )

x0 = F (a0)

F (M)

ai = Φtia0

a0

Ergodic dynamical system (M,M, Φt , µ) observed through avector-valued function F : M 7→ Rn

Given time-ordered observations x0, . . . , xN−1 with xi = F (ai ), we seekto perform

1 Dimension reduction with timescale separation and invariance underchanges of observation modality

2 Nonparametric forecasting of observables on M with deterministic orstatistical initial data

Page 3: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Outline

1 Representation of Koopman operators in a data-driven orthonormal basis

2 Time-change techniques

3 Modes of organized tropical convection

AcknowledgmentsTyrus Berry, John Harlim, Joanna Slawinska, Jane Zhao

Page 4: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

State- and observable-centric viewpoints

• State space viewpoint

In data space, we observe the manifoldF (M) and the vector field

V |x =dx

dtwith x = F (Φta)

• Operator-theoretic viewpoint (Mezic et al. 2004, 2005, 2012, . . . )Associated with the dynamical system is a group of unitary operators Ut

on L2(M, µ) s.t.Ut f (a) = f (Φta)

The generator v of Ut gives the directional derivative of functionsalong the dynamical flow

vf (a) = limt→0

f (Φta)− f (a)

t, V = DF v

Page 5: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Spectral characterization of ergodicity and mixing

A dynamical system (M,M, Φt , µ) is called

• Ergodic if all Φt-invariant sets have either zero or full measure

Spectral characterization: 0 is a simple eigenvalue of v correspondingto a constant eigenfunction

• Weak-mixing if for all A,B ∈M we have

limt→∞

1

t|µ(Φt(A) ∩ B)− µ(A)µ(B)| = 0

Spectral characterization: 0 is the only eigenvalue of v and thiseigenvalue is simple

Page 6: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Systems with pure point spectra

L2(M, µ) has an orthonormal basis consisting of eigenfunctions of v

v(z) = λz , λ = iω, ω ∈ R, |z | = 1

The eigenvalues and eigenfunctions form a group

v(zz) = (λ+ λ)zz , v(z) = λz

• Such systems are metrically isomorphic to translations on compactAbelian groups equipped with the Haar measure

The canonical phase spaces for diffeomorphisms of smooth manifolds aretori; constructions on other manifolds are available but havediscontinuous eigenfunctions (Anosov & Katok 1970)

Page 7: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Dimension reduction for systems with pure point spectra

The group of eigenvalues for M = Tm is generated by m rationallyindependent frequencies Ωi ∈ R with corresponding eigenfunctions ζi

ωk1···km =m∑j=1

kjΩj , zk1···km =m∏j=1

ζkjj , kj ∈ Z

Dimension reduction map. π : M 7→ Cm with

π(a) = (π1(a), . . . , πm(a)) = (ζ1(a), . . . ζm(a))

• The πi are independent of observation modality

• v is projectible under πi , and the system evolves as a simple harmonicoscillator in the image space

dζi (Φta)

dt= v(ζi )(Φta) = iΩiζi (Φta)

Page 8: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Vector field decomposition

Define the vector fields vi : L2(M, µ) 7→ L2(M, µ) through their action onthe eigenfunctions:

vi (ζk11 · · · ζ

kii · · · ζ

kmm ) = ikiΩiζ

k11 · · · ζ

kii · · · ζ

kmm

The vi are linearly independent, nowhere vanishing, mutuallycommuting vector fields

v =m∑i=1

vi , [vi , vj ] = 0

• Due to their vanishing commutator, the vi can be thought of asdynamically independent components

• These vector fields can be realized in data space through thepushforward map DF : TM 7→ TRn

Vi = DF (vi ) = vi (F ) =∑k

Akv(zk), Ak = 〈zk ,F 〉

Page 9: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Data-driven basis

• Start from a variable-bandwidth kernel (Berry & Harlim 2014),Kε : M ×M 7→ R+:

Kε(ai , aj) = exp

(− ‖xi − xj‖2

εσε(xi )−1/mσε(xj)−1/m

),

m = dimM, xi = F (ai ), σε(xi ) =1

N(πε)m/2

N−1∑j=0

e−‖xi−xj‖2/ε

• Apply the diffusion maps normalization (Coifman & Lafon 2006, Berry& Sauer 2015):

qε(ai ) =1

N

N−1∑j=0

Kε(ai , aj), K ′ε(ai , aj) =Kε(ai , aj)

qε(aj)

dε(ai ) =1

NK ′ε(ai , aj), pε(ai , aj) =

K ′ε(ai , aj)

dε(ai )

Page 10: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Data-driven basis

• pε induces an averaging operator on L2(M, µ) for the sampling measure

µ = N−1∑N−1

i=0 δai :

Pεf (b) =

∫M

pε(b, a)f (a) d µ(a) =1

N

N−1∑j=0

pε(b, aj)f (aj)

• By ergodicity, as N →∞, Pεf (b) converges µ-a.s. to Pεf (b), where

Pεf (b) =

∫M

pε(y , x)f (x)dµ(x)

is an averaging operator on L2(M, µ).

Page 11: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Data-driven basis

Uniformly on M (Coifman & Lafon 2006),

Pεf (a) = f (a) + ε∆f (a) + O(ε2),

where ∆ is the Laplace-Beltrami operator associated with theRiemannian metric h = σ2/dg

• Effect of variable-bandwidth kernel is a conformal transformation suchthat the Riemannian measure is equal to the invariant measure

• The eigenfunctions φ0, φ1, . . . of ∆ are orthogonal on L2(M, µ)

• The rescaled eigenfunctions ϕi = φi/η1/2i corresponding to eigenvalue

ηi are orthogonal on H1(M, h)∫M

gradh ϕi · gradh ϕj dµ = δij

• In practice, we approximate (ηi , φi , ϕi ) by solving for

Pεφi = (1− εηi )φi , ϕi = φi/η1/2i

Page 12: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Eigenvalue problem for the Koopman generator

v(z) = λz

• In dimension m ≥ 2 the eigenvalues form a dense set on the imaginaryline

• We eliminate highly rough eigenfunctions by solving the eigenvalueproblem for Lε = v + ε∆.

Continuous problem. Find z ∈ H1(M, h) and λ ∈ C s.t.

〈ψ, v(z)〉+ ε〈gradh ψ, gradh z〉 = λ〈ψ, z〉, ∀ψ ∈ H1(M, h)

Discrete approximation. Set Hl = spanϕ0, . . . , ϕl−1 ⊆ L2(M, µ).Find z ∈ Hl and λ ∈ C s.t.

〈ψ, v(z)〉µ + 〈gradhψ, gradhz〉µ = λ〈ψ, z〉µ, ∀ψ ∈ Hl .

Page 13: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Eigenvalue problem for the Koopman generator

〈ψ, v(z)〉µ + 〈gradhψ, gradhz〉µ = λ〈ψ, z〉µ,

z =l−1∑i=0

ci ϕi , ψ =l−1∑i=0

wi ϕi

• v is a finite-difference approximation of v , e.g.,

〈ψ, v(z)〉µ =l−1∑i,j=0

wicj

∫M

ϕi v(ϕj) d µ

=l−1∑i,j=0

wicj

[1

N

N−2∑k=1

ϕi (ak)ϕj(ak+1)− ϕj(ak−1)

2 δt

]

• By construction of the ϕi basis,

〈gradhψ, gradhz〉µ =l−1∑i,j=0

wicjδij

• Scheme remains well-conditioned at large spectral order l

Page 14: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Variable-speed flow on T2

x1

−1.5

−1

−0.5

0

0.5

1

1.5

t

x3

0 5 10 15 20 25−0.5

0

0.5

v =2∑

µ=1

vµ∂

∂θµ

v1 = 1 + β cos θ1

v2 = ω(1− β sin θ2)

ω =√

30, β =√

1/2

Page 15: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Results for variable-speed flow on T2

Page 16: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Results for variable-speed flow on T2

Page 17: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Forecasting densities and expectation values

The adjoint U∗t on L2(M, µ) governs the evolution of probability densitiesrelative to µ

ρ0 =∑k

ck(0)zk , ck(0) = 〈zk , ρ0〉, k = (k1, . . . , km)

ρt = U∗t ρ0 =∑k

ck(t)zk , ck(t) =∑k

e−iωk tck(0)

The time-dependent expectation value of an observable f is

Et f =∑k

fkc∗k (t), fk = 〈zk , f 〉

• By computing Et f and Et f2 the method keeps track of both the

forecast mean and the forecast uncertainty

• The forecast accuracy depends on the bandwidth of f in the zk basis

Page 18: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Nonparametric forecasting of the variable-speed system on T2

!0

"1 / #0 0.5 1 1.5 2

"2/#

0

0.5

1

1.5

2

0

20

40

60

80

100

Initial distribution has circular Gaussiandensity relative to µ with mean (π, π) andvariance (30−1, 30−1)

Forecasts with the nonparametric model arein good agreement with ensemble forecastswith the perfect model for both the meanand uncertainty

mean

x 1

−1.5

−1

−0.5

0

0.5

1ensemblemodel

standard deviation

0

0.2

0.4

0.6

0.8

t

x 3

0 2 4 6 8 10 12−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

t0 2 4 6 8 10 12

0

0.02

0.04

0.06

0.08

0.1

0.12

Page 19: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Time change and mixing

Given a flow Φt with generator v and a positive function f , construct theflow Ψt with generator w = f −1v

Ψt has the same orbits as Φt , but evolves at different speed and hasinvariant measure ν with dν = f dµ

• For any ergodic Φt there exists a time change s.t. Ψt is mixing(Kochergin 1973)

0 10 20 30 40 50 60 70 80 90 1000

5

10

15

20

25

30

t

f−1

Example on T3 (Fayad 2002). Φt is anirrational flow, and

f (θ1, θ2, θ3) = 1

+ Re∞∑k=1

∑|l|≤k

e−k

k

(eikθ

1

+ eikθ2)e ilz

Page 20: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Regularization by time change

Construct an orthonormal basis for L2(M, ν) with dν = ‖v‖ dµ using thekernel

Kε(ai , aj) = exp

(−‖xi − xj‖2‖V |xi‖1/m‖V |xj‖1/m

εσε(xi )−1/mσε(xj)−1/m

)

Solve the eigenvalue problem

w(z) = iωz , w =v

‖v‖, z ∈ L2(M, ν)

• In the reduced coordinates π(a) = z(a) ∈ C the system evolves as anoscillator with variable frequency ω‖v‖

Make the vector field decomposition

v =m∑i=1

vi , vi = ‖v‖wi , w =m∑i=1

wi , [wi ,wj ] = 0

• In general, the vi are non-commuting vector fields

Page 21: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Flow on T2 with fixed points (Oxtoby 1953)x1

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.40.6

0.8

1

t

x3

20 25 30 35 40 45

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

The system

v =2∑

µ=1

vµ∂

∂θµ

v1 = α(1− cos(θ1 − θ2))

+ (1− α)(1− cosθ2)

v2 = α(1− cos(θ1 − θ2))

has a fixed point at θ1 = θ2 = 0, andpreserves the Haar measure

Page 22: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Vector field decomposition for the fixed-point system

V

e1//

e 2//

0 0.5 1 1.5 20

0.5

1

1.5

2V1

e1//0 0.5 1 1.5 2

V2

e1//0 0.5 1 1.5 2

Page 23: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Vector field decomposition for the fixed-point system

V

e1//

e 2//

0 0.5 1 1.5 20

0.5

1

1.5

2V1

e1//0 0.5 1 1.5 2

V2

e1//0 0.5 1 1.5 2

Page 24: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Nonparametric forecasting!0

"1 / #0 0.5 1 1.5 2

"2/#

0

0.5

1

1.5

2

0

20

40

60

80

100

120

140

160

180

mean

x 1

−1

−0.5

0

0.5

1standard deviation

0

0.2

0.4

0.6

0.8

1ensemblemodel

t

x 3

0 2 4 6 8 10−1

−0.5

0

0.5

1

t0 2 4 6 8 10

0

0.2

0.4

0.6

0.8

1

Page 25: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Analysis of organized tropical convection

• Brightness temperature, Tb, is the blackbody emission temperaturereceived from Earth by satellites

• Deep convective clouds become cold, and present as a negative Tb

anomaly against the Earth’s surface

• We analyze Tb data from the CLAUS archive averaged over the latitudes15S–15N

• Sampling is 8× daily at 0.5 resolution for the period 1983–2006

Page 26: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Delay-coordinate embeddings

• The raw Tb(t) timeseries is highly non-Markovian

• To recover information lost in partial observations, we performdelay-coordinate mapping (Packard et al. 1980, Sauer et al. 1991)

x(t) = (Tb(t),Tb(t − δt), . . . ,Tb(t − (q − 1) δt))

• This procedure significantly improves the quality of the diffusioneigenfunction basis (G. & Majda 2012, Berry et al. 2013)

• We use q = 512, equivalent to 64 days (intraseasonal timescale)

Page 27: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Koopman eigenfunctions

Multiple timescales are resolved including:

• (a) The annual cycle.

• (b, c) Intraseasonal oscillations.

• (d, e) Convectively coupled equatorial waves (CCEWs).

Page 28: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Spatiotemporal reconstructions

• (b, f) Annual cycle.

• (c) Madden-Julian oscillation.

• (d,h) Westward-propagating CCEWs.

• (g) Boreal summer intraseasonal oscillation (Indian Monsoon).

Page 29: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Effects of partial observations

• The computed Koopman eigenfunctions have amplitude modulationswhich are not consistent with the skew-symmetry of v

• Despite delay-coordinate mapping, it is unrealistic to expect that we haverecovered the full attractor of the climate system

• Instead of the full generator, it is more likely that we are approximatingan operator of the form

v = ΠvΠ,

where Π is a projector to the subspace of the full L2 space on theattractor spanned by the diffusion eigenfunctions obtained from Tb

• It is plausible that an effective description of v is through anonautonomous advection-diffusion process

• Consistent with stochastic oscillator models for the MJO (Chen et al.2014)

Page 30: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

Summary

• The spectral properties of Koopman operators have attractive propertiesfor dimension reduction and nonparametric forecasting of dynamicalsystems

• These operators can be approximated from time-ordered data with no apriori knowledge of the equations of motion using kernel methods

• In systems with pure point spectra, the eigenfunctions of the Koopmangroup lead to a decomposition of the dynamics into a collection ofindependent harmonic oscillators

• Time change extends the applicability of Koopman eigendecompositiontechniques to certain classes of mixing systems, which are nowdecomposed into coupled oscillators with time-dependent frequencies

Page 31: Data-Driven Spectral Decomposition and Forecasting of Ergodic ...

References

• Giannakis, D. (2015). Data-driven spectral decomposition andforecasting of ergodic dynamical systems. arXiv:1507.02338

• Berry, T., D. Giannakis, J. Harlim (2015). Nonparametric forecasting oflow-dimensional dynamical systems. Phys. Rev. E, 91, 032915

• Giannakis, D., J. Slawinska, Z. Zhao (2015). Spatiotemporal featureextraction with data-driven Koopman operators. J. Mach. Learn. Res.Proceedings, 44, 103–115