Download - 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo [email protected] October 27, 2015 ...

Transcript
Page 1: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe Algorithm&

Alternating Direction Method of Multipliers

Ives [email protected]

October 27, 2015

Page 2: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Where were we?

Page 3: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Previous episode. . .Proximal-gradient methods

minimizex∈X

f(x) + ϕ(x)

I f : X → R convex with Lipschitz-continuous gradient

I ϕ : X → R ∪ {+∞} convex and simple (i.e., proximable)

proxαϕ : X → X (∀α > 0)

proxαϕ(x) := arg minx∈X

{αϕ(x) +

1

2‖x− x‖22

}

xk+1 := proxαkϕ

(xk − αk∇f(xk)

)

Page 4: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Proximal-gradient methodsGood news

I αk ≡ α ∈ (0, 2/L)⇒ f(xk) +ϕ(xk)−min {f + ϕ} ≤ O(1/k)

I Acceleration gives O(1/k2)

I Generalize projected gradient methods, where

ϕ(x) = δC(x) :=

{0 if x ∈ C+∞ if x /∈ C

Page 5: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Proximal-gradient methodsBad news

I Some sets C can be tough to project onto but you canminimize linear functions in them

Frank-Wolfe Algorithm/Conditional Gradient Method

I Dealing with ϕ(x) = φ(Ax) ain’t easy even when φ is simple

Alternating Direction Method of Multipliers (ADMM)

Page 6: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Proximal-gradient methodsBad news

I Some sets C can be tough to project onto but you canminimize linear functions in them

Frank-Wolfe Algorithm/Conditional Gradient Method

I Dealing with ϕ(x) = φ(Ax) ain’t easy even when φ is simple

Alternating Direction Method of Multipliers (ADMM)

Page 7: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Proximal-gradient methodsBad news

I Some sets C can be tough to project onto but you canminimize linear functions in them

Frank-Wolfe Algorithm/Conditional Gradient Method

I Dealing with ϕ(x) = φ(Ax) ain’t easy even when φ is simple

Alternating Direction Method of Multipliers (ADMM)

Page 8: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe Algorithm

Page 9: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemMatrix completion

? ? 2 ?

1 ? ? 3

1 2 2 3

? 6 6 9

3 ? ? 9

1 ? 2 ?

? ? 6 9

Page 10: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemMatrix completion

0 0 2 0

1 0 0 3

1 2 2 3

0 6 6 9

3 0 0 9

1 0 2 0

0 0 6 9

Page 11: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemMatrix completion

rank

0 0 2 0

1 0 0 3

1 2 2 3

0 6 6 9

3 0 0 9

1 0 2 0

0 0 6 9

= 4

Page 12: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemMatrix completion

rank

1 2 2 3

1 2 2 3

1 2 2 3

3 6 6 9

3 6 6 9

1 2 2 3

3 6 6 9

= 1

Page 13: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemMatrix completion with nuclear-norm lasso

minimizeX∈Rn1×n2

1

2

m∑k=1

(Xik,jk − bk)2 subject to ‖σ(X)‖1 ≤ τ

Page 14: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemMatrix completion with nuclear-norm lasso

minimizeX∈Rn1×n2

1

2

m∑k=1

(Xik,jk − bk)2 subject to ‖σ(X)‖1 ≤ τ

Page 15: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemMatrix completion with nuclear-norm lasso

minimizeX∈Rn1×n2

1

2

m∑k=1

(Xik,jk − bk)2 subject to ‖X‖1 ≤ τ

I ‖X‖1 = ‖σ(X)‖1 =min{n1,n2}∑

i=1σi(X)

I Projection onto {X | ‖X‖1 ≤ τ} potentially requires full SVD

I Linear minimization requires only one SVD triplet!

Page 16: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Model problem

minimizex∈C

f(x)

I f : Rn → R is convex and continuously differentiableI C ⊂ Rn is convex and compact (i.e., closed and bounded)

I we can minimize linear functions over C, i.e., ∀c ∈ Rn

find x ∈ arg minx∈C

〈c, x〉

Page 17: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmFrank and Wolfe (1956)

x0 ∈ C

xk+1 ∈ arg minx∈C

{f(xk) +

⟨∇f(xk), x− xk

⟩}

xk+1 = (1− αk)xk+αkxk+1, αk :=2

k + 2

Approximation similar to projected gradient, but no quadratic term!

Page 18: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmFrank and Wolfe (1956)

x0 ∈ C

xk+1 ∈ arg minx∈C

{f(xk) +

⟨∇f(xk), x− xk

⟩}

xk+1 = (1− αk)xk+αkxk+1, αk :=2

k + 2

Approximation similar to projected gradient, but no quadratic term!

Page 19: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmVisualizing the iterates

Page 20: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmVisualizing the iterates

Page 21: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmVisualizing the iterates

Page 22: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmVisualizing the iterates

Page 23: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmCurvature constant

`f (y;x) := f(y)− f(x)− 〈∇f(x), y − x〉

Cf := maxx,x∈Cα∈[0,1]

y=(1−α)x+αx

2

α2`f (y;x)

Page 24: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmCurvature constant (example)

f(x) =1

2‖x‖22

`f (y;x) =1

2‖y − x‖22

Cf = maxx,x∈C

‖x− x‖22 = (diam C)2

If ∇f is L-Lipschitz, then Cf 6 L(diam C)2

Page 25: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmCurvature constant (example)

f(x) =1

2‖x‖22

`f (y;x) =1

2‖y − x‖22

Cf = maxx,x∈C

‖x− x‖22 = (diam C)2

If ∇f is L-Lipschitz, then Cf 6 L(diam C)2

Page 26: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmApproximate subproblem minimizers

xk+1 ∈{x ∈ C

∣∣∣∣ `f (x;xk) ≤ minx∈C

`f (x;xk) +1

2δαkCf

}

Page 27: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmExact line-search

αk ∈ arg minα∈[0,1]

f(

(1− α)xk + αxk+1)

Page 28: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmFully-corrective reoptimization

xk+1 ∈ arg minx∈conv{x0,x1,...,xk+1}

f(x)

Page 29: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmPrimal-convergence

Theorem (Jaggi, 2013)

f(xk)− infCf ≤

2Cfk + 2

(1 + δ)

Page 30: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmLower bound on primal convergence

Theorem (Canon and Cullum, 1968)

There are instances with strongly convex objectives for which theoriginal FWA generates sequences with the following behavior:for all C, ε > 0 there are infinitely many k such that

f(xk)− infCf ≥ C

k1+ε

Page 31: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe algorithmFaster variants

I Linear convergence can be obtained in certain cases if“away/drop steps” are used; see (GueLat and Marcotte, 1986)and (Lacoste-Julien and Jaggi, 2014)

I For smooth f and strongly convex C, a simple variant hascomplexity O(1/k2) (Garber and Hazan, 2015)

Page 32: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Alternating DirectionMethod of Multipliers

Page 33: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemTotal-variation image denoising

Page 34: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemTotal-variation image denoising

Page 35: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemTotal-variation image denoising

Page 36: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemTotal-variation image denoising

Page 37: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemTotal-variation image denoising

minimizef

1

2

∫(f − fη)2 + λ

∫‖∇f‖2

Page 38: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemTotal-variation image denoising

minimizex

1

2‖x− xη‖22 + λ‖Dx‖1

Page 39: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

A motivating problemTotal-variation image denoising

minimizex,y

1

2‖x− xη‖22 + λ‖y‖1 subject to Dx− y = 0

Page 40: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Equality-constrained optimizationFirst-order optimality conditions

minimizex

f(x) subject to h(x) = 0

Necessary for x to be a minimizer:

∇f(x) +∇h(x)z = 0

h(x) = 0

Page 41: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Equality-constrained optimizationQuadratic penalization

xk+1 ∈ arg minx

{f(x) +

ρk2‖h(x)‖22

}

∇f(xk+1) +∇h(xk+1)[ρkh(xk+1)] = 0

I Need h(xk+1)→ 0 and ρk → +∞ for ρkh(xk+1)→ z 6= 0

Page 42: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Equality-constrained optimizationLagrangian minimization

xk+1 ∈ arg minx

{f(x) +

⟨zk, h(x)

⟩}zk+1 = zk + αkh(xk+1)

∇f(xk+1) +∇h(xk+1)zk = 0

I (Super)gradient ascent on concave dual

I Stability issues when argmin has multiple points at solution

Page 43: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Equality-constrained optimizationMethod of Multipliers/Augmented Lagrangian

I MM ≈ Lagrangian Minimization + Quadratic Penalization

xk+1 ∈ arg minx

{f(x) +

⟨zk, h(x)

⟩+ρk2‖h(x)‖22

}zk+1 = zk + ρkh(xk+1)

∇f(xk+1) +∇h(xk+1)zk+1 = 0

I Will work once ρk sufficiently large (no need for ρk → +∞)

I Computing xk+1 can be tough

Page 44: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Model problem

minimizex∈X

f(x) + λφ(Ax)

I A : X → Y linear

I φ : Y → R ∪ {+∞} convex and proximable

I f : X → R such that one can solve:

minimizex∈X

f(x) +1

2‖b−Ax‖22

Page 45: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Model problemMethod of Multipliers

minimizex,y

f(x) + λφ(y) subject to Ax− y = 0

(xk+1, yk+1) ∈ arg minx,y

{f(x) + λφ(y) +

ρk2

∥∥∥∥Ax− y +zk

ρk

∥∥∥∥22

}zk+1 = zk + ρk(Ax

k+1 − yk+1)

I Still tricky joint minimization over x and y

I Alternate!

Page 46: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Model problemAlternating Direction Method of Multipliers

xk+1 ∈ arg minx

{f(x) +

ρk2

∥∥∥∥Ax− yk +zk

ρk

∥∥∥∥22

}

yk+1 = arg miny

{λφ(y) +

ρk2

∥∥∥∥Axk+1 − y +zk

ρk

∥∥∥∥22

}

= proxρ−1k λφ

[Axk+1 +

zk

ρk

]

zk+1 = zk + ρk(Axk+1 − yk+1)

Page 47: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Model problemAlternating Direction Method of multipliers

I Simpler iterations when ρk ≡ ρ (defining zk := zk/ρ)

xk+1 ∈ arg minx

{f(x) +

ρ

2

∥∥∥Ax− yk + zk∥∥∥22

}

yk+1 = proxρ−1λφ

[Axk+1 + zk

]zk+1 = zk + (Axk+1 − yk+1)

Page 48: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMTotal-variation denoising

Page 49: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMTotal-variation denoising

Page 50: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMTotal-variation denoising

Page 51: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMConvergence

I Function values decrease as O(1/k) (He and Yuan, 2012)

I Linear convergence if f or φ is strongly convex and undercertain conditions on A (Deng and Yin, 2012)

Page 52: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Other problems suitable for ADMMIn case I haven’t bored you out of your mind. . .

Page 53: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMWhat if f is only proximable?

minimizex

f(x) + λφ(Ax)

minimizex1,x2,y

f(x1) + λφ(y)

subject to

Ax2 − y = 0

x1 − x2 = 0

Page 54: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMWhat if f is only proximable?

minimizex1,x2,y

f(x1) + λφ(y)

subject to

Ax2 − y = 0

x1 − x2 = 0

xk+11 = proxρ−1f

[xk2 − zk2

]yk+1 = proxρ−1λφ

[Axk2 + zk1

]xk+12 = (I +A∗A)−1(xk+1

1 + zk2 +A∗(yk+1 − zk1 ))

zk+11 = zk1 + (Axk+1

2 − yk+1)

zk+12 = zk2 + (xk+1

1 − xk+12 )

Page 55: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMSum of proximable functions

minimizex

m∑i=1

fi(x)

xk+1i = proxρ−1fi [x

k − zki ]

xk+1=1

m

m∑i=1

(xk+1i + zki )

zk+1i = zki + (xk+1

i − xk+1)

Distributed consensus

Page 56: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMSum of proximable functions

minimizex,x1,...,xm

m∑i=1

fi(xi) subject to xi−x = 0, i = 1, . . . ,m

xk+1i = proxρ−1fi [x

k − zki ]

xk+1=1

m

m∑i=1

(xk+1i + zki )

zk+1i = zki + (xk+1

i − xk+1)

Distributed consensus

Page 57: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMSum of proximable functions

minimizex,x1,...,xm

m∑i=1

fi(xi) subject to xi−x = 0, i = 1, . . . ,m

xk+1i = proxρ−1fi [x

k − zki ]

xk+1=1

m

m∑i=1

(xk+1i + zki )

zk+1i = zki + (xk+1

i − xk+1)

Distributed consensus

Page 58: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMSum of proximable functions

minimizex,x1,...,xm

m∑i=1

fi(xi) subject to xi−x = 0, i = 1, . . . ,m

xk+1i = proxρ−1fi [x

k − zki ]

xk+1=1

m

m∑i=1

(xk+1i + zki )

zk+1i = zki + (xk+1

i − xk+1)

Distributed consensus

Page 59: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMRegularized sum of proximable functions

minimizex

m∑i=1

fi(x) + λϕ(x)

xk+1i = proxρ−1fi [x

k − zki ]

xk+1= prox(mρ)−1λϕ

[1

m

m∑i=1

(xk+1i + zki )

]zk+1i = zki + (xk+1

i − xk+1)

Distributed consensus

Page 60: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMRegularized sum of proximable functions

minimizex,x1,...,xm

m∑i=1

fi(xi) + λϕ(x) subject to xi − x = 0, ∀i

xk+1i = proxρ−1fi [x

k − zki ]

xk+1= prox(mρ)−1λϕ

[1

m

m∑i=1

(xk+1i + zki )

]zk+1i = zki + (xk+1

i − xk+1)

Distributed consensus

Page 61: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

ADMMRegularized sum of proximable functions

minimizex,x1,...,xm

m∑i=1

fi(xi) + λϕ(x) subject to xi − x = 0, ∀i

xk+1i = proxρ−1fi [x

k − zki ]

xk+1= prox(mρ)−1λϕ

[1

m

m∑i=1

(xk+1i + zki )

]zk+1i = zki + (xk+1

i − xk+1)

Distributed consensus

Page 62: 2pt Frank-Wolfe Algorithm & Alternating Direction Method ... · Frank-Wolfe Algorithm & Alternating Direction Method of Multipliers Ives Mac^edo ijamj@cs.ubc.ca October 27, 2015 ...

Frank-Wolfe Algorithm&

Alternating Direction Method of Multipliers

Ives [email protected]

October 27, 2015