Lecture10 outilier l0_svdd

11
Lecture 10: Robust outlier detection with L0-SVDD Stéphane Canu [email protected] Sao Paulo 2014 February 28, 2014

Transcript of Lecture10 outilier l0_svdd

Page 1: Lecture10 outilier l0_svdd

Lecture 10: Robust outlier detection with L0-SVDD

Stéphane [email protected]

Sao Paulo 2014

February 28, 2014

Page 2: Lecture10 outilier l0_svdd

Roadmap

1 Robust outlier detection with L0-SVDDL0 SVDD

4 iterations of Adaptive L0 SVDD

Page 3: Lecture10 outilier l0_svdd

Recall SVDD

minR,c,ξ

R + Cn∑

i=1

ξi

with ‖xi − c‖2 ≤ R + ξi , i = 1, . . . , nand ξi ≥ 0, i = 1, . . . , n

(1)

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 3 / 11

Page 4: Lecture10 outilier l0_svdd

SVDD + outlier

C =1/16 C =1/8 C =1/4 C = 1/2 (¾')

Figure: Example of SVDD solutions with different C values, m = 0 (red) andm = 5 (magenta). The circled data points represent support vectors for both m.

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 4 / 11

Page 5: Lecture10 outilier l0_svdd

The L0 norm

‖ξ‖0 ≤ t

min

c∈IRp ,R∈IR,ξ∈IRnR + C‖ξ‖0

with ‖xi − c‖2 ≤ R+ξiξi ≥ 0 i = 1, n

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 5 / 11

Page 6: Lecture10 outilier l0_svdd

L0 relaxations

p normexponenetialpiecewise linearlog

min

c∈IRp ,R∈IR,ξ∈IRnR + C

n∑i=1

log(γ + ξi )

with ‖xi − c‖2 ≤ R+ξiξi ≥ 0 i = 1, n .

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 6 / 11

Page 7: Lecture10 outilier l0_svdd

DC programing

log(γ + t) = f (t)− g(t) with f (t) = t and g(t) = t − log(γ + t),

both functions f and g being convex. The DC framework consists inminimizing iteratively (R plus a sum of) the following convex term:

f (ξ)− g ′(ξ)ξ = ξ −(

1− 1γ + ξold

)ξ =

ξ

γ + ξold ,

where ξoldi denotes the solution at the previous iteration.

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 7 / 11

Page 8: Lecture10 outilier l0_svdd

The DC idea applied to our L0 SVDD approximation consists in building asequence of solutions of the following adaptive SVDD:

minc∈IRp ,R∈IR,ξ∈IRn

R + Cn∑

i=1

wiξi

with ‖xi − c‖2 ≤ R+ξiξi ≥ 0 i = 1, n

with wi =1

γ + ξoldi.

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 8 / 11

Page 9: Lecture10 outilier l0_svdd

Stationary conditions of the KKT give: c =∑n

i=1 αixi and∑n

i=1 αi = 1where the αi are the Lagrange multipliers associated with the inequalityconstraints ‖xi − c‖2 ≤ R+ξi . The dual of this problem is{

minα∈IRn

α>XX>α− α>diag(XX>)

with∑n

i=1 αi = 1 0 ≤ αi ≤ Cwi i = 1, n(2)

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 9 / 11

Page 10: Lecture10 outilier l0_svdd

Algorithm 1 L0 SVDD for the linear kernel

Data: X , y , C , γResult: R , c , ξ , αwi = 1; i = 1, nwhile not converged do

(α, λ)← solve_QP(X ,C ,w) % solve problem (2)c ← X>αR ← λ+ c>cξi ← max(0, ‖xi − c‖2 − R) i = 1, nwi ← 1/(γ + ξi ) i = 1, n

end

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 10 / 11

Page 11: Lecture10 outilier l0_svdd

Bibliography

Stéphane Canu (INSA Rouen - LITIS) February 28, 2014 11 / 11