[IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) -...

7
3URMHFWHG 5RXJK )X]]\ F0HDQV &OXVWHULQJ &KDUX 3XUL 1DYHHQ .XPDU 'HSDUWPHQW RI &RPSXWHU 6FLHQFH 8QLYHUVLW\ RI 'HOKL 'HOKL ,QGLD FSXULFVGX#JPDLOFRP $EVWUDFW²7KH FRQYHQWLRQDO URXJK VHW EDVHG IHDWXUH VHOHFWLRQ WHFKQLTXHV ¿QG WKH UHOHYDQW IHDWXUHV IRU WKH HQWLUH GDWD VHW +RZHYHU GLIIHUHQW VHWV RI GLPHQVLRQV PD\ EH UHOHYDQW IRU GLIIHUHQW FOXVWHUV 7KLV SDSHU LQWURGXFHV D QRYHO 3URMHFWHG 5RXJK )X]]\ FPHDQV FOXVWHULQJ DOJRULWKP 35)&0 ZKLFK HPSOR\V URXJK VHWV WR PRGHO XQFHUWDLQW\ LQ GDWD DQG IX]]\ VHW WKHRU\ WR FRPSXWH WKH ZHLJKWV RI GLPHQVLRQV DSSOLFDEOH WR LQGLYLGXDO FOXVWHUV :H GLVFXVV WKH FRQYHUJHQFH RI WKH SURSRVHG DOJRULWKP DQG SUHVHQW WKH UHVXOWV RI DSSO\LQJ WKH SURSRVHG DSSURDFK WR VHYHUDO 8&, GDWD VHWV WR GHPRQVWUDWH WKDW LW VFRUHV RYHU LWV FRPSHWLWRUV LQ WHUPV RI VHYHUDO TXDOLW\ DQG YDOLGLW\ PHDVXUHV .H\ZRUGV&RQYHUJHQFH IX]]\ FOXVWHULQJ SURMHFWHG FOXVWHU LQJ URXJK VHWV YDOLGLW\ PHDVXUHV , , 1752'8&7,21 &OXVWHULQJ LV D WHFKQLTXH WKDW DLPV WR JURXS GDWD REMHFWV VXFK WKDW LQWUDFOXVWHU VLPLODULW\ LV KLJK ZKLOH LQWHUFOXVWHU VLPLODULW\ LV ORZ $ FOXVWHULQJ DOJRULWKP PDSV D VHW RI GDWD REMHFWV X = {x 1 ,x 2 , ..., x n } WR D VHW C = {1, 2, ..., k} RI k QXPEHUV ZKHUH HDFK HOHPHQW RI WKH VHW C LGHQWL¿HV D KRPRJHQHRXV JURXS RI REMHFWV 9DULRXV DSSOLFDWLRQ DUHDV RI FOXVWHULQJ LQFOXGH LPDJH SURFHVVLQJ SDWWHUQ UHFRJQLWLRQ DQDO\VLV RI PLFURDUUD\ GDWD LQ ELRLQIRUPDWLFV VSDWLDO DQDO\ VLV VRFLDO QHWZRUN DQDO\VLV LQWUXVLRQ GHWHFWLRQ LQ QHWZRUNV GRFXPHQW FODVVL¿FDWLRQ VSDP ¿OWHULQJ DQG PDUNHW UHVHDUFK >@ >@ >@ 9DULRXV FODVVLFDO FOXVWHULQJ DOJRULWKPV VXFK DV WKH NPHDQV DOJRULWKP DQG LWV YDULDQWV KLHUDUFKLFDO DJ JORPHUDWLYH FOXVWHULQJ DQG JUDSKWKHRUHWLF PHWKRGV KDYH EHHQ SURSRVHG LQ OLWHUDWXUH >@ >@ ,Q KLJK GLPHQVLRQDO GDWD VHWV FOXVWHUV DUH RIWHQ KLGGHQ LQ VSHFL¿F VXEVSDFHV RI WKH RULJLQDO IHDWXUH VSDFH LH RQO\ D VXEVHW RI GLPHQVLRQV LV UHOHYDQW IRU HDFK FOXVWHU >@ >@ $V WKH WUDGLWLRQDO FOXVWHULQJ DOJRULWKPV FRPSXWH WKH GLVWDQFH LQ IXOO GLPHQVLRQDO VSDFH >@ >@ WKH\ IDLO LQ LGHQWLI\LQJ KLGGHQ UHODWLRQVKLSV RI WKH XQGHUO\LQJ VWUXFWXUH IRU KLJK GLPHQVLRQDO GDWD GXH WR WKH UHDVRQ WKDW WKH QHDUHVW QHLJKERU RI D SDWWHUQ PD\ EH QHDUO\ DV GLVWDQW DV WKH IDUWKHVW QHLJKERU )RU H[DPSOH WH[W GDWDEDVHV KDYH ODUJH QXPEHU RI IHDWXUHV ZRUGV PRVW RI ZKLFK DUH QRW UHOHYDQW WR WKH FOXVWHULQJ WDVN 7KHVH LUUHOHYDQW IHDWXUHV PDVN WKH FRQWULEXWLRQ RI UHOHYDQW IHDWXUHV DQG KHQFH GHJUDGH WKH SHUIRUPDQFH RI FOXVWHULQJ DOJRULWKP >@ 7R FRSH ZLWK WKH SUREOHP RI KLJK GLPHQVLRQDO IHDWXUH VSDFHV IHDWXUH VHOHFWLRQ DQG IHDWXUH UHGXFWLRQ WHFKQLTXHV KDYH EHHQ SURSRVHG LQ WKH OLWHUDWXUH >@ 7KH\ FDQ EH XVHG DV D SUHSURFHVVLQJ VWHS IRU UHGXFLQJ GLPHQVLRQDOLW\ UHPRYLQJ LUUHOHYDQW GDWD LPSURYLQJ OHDUQLQJ DFFXUDF\ DQG HQKDQFLQJ RXWSXW FRPSUHKHQVLELOLW\ >@ )HDWXUH VHOHFWLRQ LV D SURFHVV RI SLFNLQJ D VXEVHW RI DWWULEXWHV IURP WKH VHW RI RULJLQDO DWWULEXWHV EDVHG RQ VRPH RSWLPDOLW\ FULWHULRQ )HDWXUH UHGXFWLRQ WUDQVIRUPV WKH IHDWXUH VSDFH WR D VPDOOHU GLPHQVLRQDO VSDFH E\ FRPELQLQJ WKH VHW RI IHDWXUHV DW KDQG $OWKRXJK IHDWXUH VHOHFWLRQ WHFKQLTXHV SURMHFW WKH ZKROH IHDWXUH VSDFH WR D ORZHU GLPHQVLRQDO VXEVSDFH VR WKDW FOXVWHU VWUXFWXUHV EHFRPH DSSDUHQW >@ >@ >@ >@ WKHVH WHFKQLTXHV DUH QRW HIIHFWLYH LQ ¿QGLQJ FOXVWHUV LQ YDU\LQJ VXEVSDFHV )HDWXUH UHGXFWLRQ WHFKQLTXHV VXFK DV SULQFLSDO FRPSRQHQW DQDO\VLV 3&$ VXIIHU IURP XVDELOLW\ SUREOHP DV LW EHFRPHV KDUG WR LQWHUSUHW WKH UHVXOWV LQWXLWLYHO\ >@ +HQFH WKHUH LV D QHHG IRU PRUH JHQHUDOL]HG WHFKQLTXHV WKDW FDQ EH XVHG WR REWDLQ PHDQLQJIXO FOXVWHUV LQ GLIIHUHQW VXEVSDFHV ,Q WKLV SDSHU ZH SURSRVH D QRYHO DGDSWDWLRQ RI URXJK IX]]\ FPHDQV DOJRULWKP IRU KLJK GLPHQVLRQDO GDWD E\ PRGLI\LQJ LWV REMHFWLYH IXQFWLRQ 7KH SURSRVHG DOJRULWKP DXWRPDWLFDOO\ GHWHFWV WKH UHOHYDQW FOXVWHU GLPHQVLRQV RI WKH KLJK GLPHQVLRQDO GDWD VHW 7KH DVVLJQPHQW RI ZHLJKWV WR DWWULEXWHV EHLQJ VSHFL¿F WR HDFK FOXVWHU DQ HI¿FLHQW SURMHFWHG FOXVWHULQJ VFKHPH LV JHQHUDWHG :H KDYH DOVR GLVFXVVHG WKH FRQYHUJHQFH RI WKH SURSRVHG DOJRULWKP 7KH UHPDLQGHU RI WKLV SDSHU LV RUJDQLVHG DV IROORZV LQ VHFWLRQ RQ UHODWHG ZRUN ZH GHVFULEH KRZ FODVVLFDO FOXVWHULQJ PHWKRGV KDYH EHHQ DGDSWHG WR VXLW WKH UHTXULPHQWV RI KLJK GLPHQVLRQDO GDWD LQ VHFWLRQ ZH H[WHQG WKH URXJK IX]]\ FPHDQV DOJR ULWKP IRU SURMHFWHG FOXVWHULQJ DQG GLVFXVV WKH FRQYHUJHQFH RI 3URMHFWHG 5RXJK )X]]\ FPHDQV 35)&0 DOJRULWKP LQ VHFWLRQ ZH SUHVHQW WKH UHVXOWV RI DSSO\LQJ 35)&0 DOJRULWKP RQ VHYHUDO 8&, GDWD VHWV DQG ¿QDOO\ VHFWLRQ FRQWDLQV FRQFOXVLRQV DQG VXJJHVWLRQV IRU IXWXUH ZRUN ,, 5(/$7(' :25. 5RXJK VHW WKHRU\ SURYLGHV D PHWKRGRORJ\ IRU DGGUHVVLQJ WKH SUREOHP RI UHOHYDQW IHDWXUH VHOHFWLRQ E\ VHOHFWLQJ D VHW RI LQIRUPDWLRQ ULFK IHDWXUHV LQ D GDWDVHW ZLWKRXW WUDQV IRUPLQJ WKH GDWD DW WKH VDPH WLPH DWWHPSWLQJ WR PLQLPL]H LQIRUPDWLRQ ORVV GXULQJ WKH VHOHFWLRQ SURFHVV >@ 7KXV LW UHWDLQV WKH VHPDQWLFV RI WKH RULJLQDO GDWD DQG UHTXLUHV QR 530 978-1-4577-1676-8/11/$26.00 c 2011 IEEE

Transcript of [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) -...

Page 1: [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) - Cordoba, Spain (2011.11.22-2011.11.24)] 2011 11th International Conference on Intelligent

X = {x1, x2, ..., xn} C = {1, 2, ..., k}k C

530978-1-4577-1676-8/11/$26.00 c©2011 IEEE

Page 2: [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) - Cordoba, Spain (2011.11.22-2011.11.24)] 2011 11th International Conference on Intelligent

kk

2d

2011 11th International Conference on Intelligent Systems Design and Applications 531

Page 3: [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) - Cordoba, Spain (2011.11.22-2011.11.24)] 2011 11th International Conference on Intelligent

X = {x1, x2, ..., xn}d k

X kZ = {z1, z2, ..., zk}

X k × nU = [μij ] μij

jth ith 1 ≤i ≤ k, 1 ≤ j ≤ n

k∑i=1

μij = 1, 1 ≤ j ≤ n

μij ∈ [0, 1], 1 ≤ j ≤ n, 1 ≤ i ≤ k

aVa a : U → Va a ∈ A

DS (U, A∪{d}) d /∈ AA

B ⊆ AB

INDIS(B)INDIS(B) = {(x1, x2) ∈ U2|a(x1) = a(x2) ∀a ∈ B}

x1 x2

B (x1, x2) ∈ INDIS(B)[x]B

X ⊆ U BB − lower BX = {x|[x]B ⊆ X}

B − upper BX = {x|[x]B ∩ X = φ}B − boundary region BX −BX

BUi BUi BUi − BUi

ith Ui

ith ωi

ith W = [ωir]k×d ωir

rth ith

d∑r=1

ωir = 1, 1 ≤ i ≤ k,

ωir ∈ [0, 1] , 1 ≤ i ≤ k, 1 ≤ r ≤ d

JPRFCM

k

JPRFCM

⎧⎪⎨⎪⎩

aA + bB BUi = φ ∧ BUi − BUi = φ

A BUi = φ ∧ BUi − BUi = φ

B otherwise.

A =∑

xj∈BUi

∑ki=1

∑dr=1 μα

ijωβird

2ijr

∑xj∈(BUi−BUi)

∑ki=1

∑dr=1 μα

ijωβird

2ijr

a b

d2ijr = (xjr − zir)

2

ith jth

rth α ∈ (1,∞) , β ∈ (1,∞)

μij ωir

μij ωir

μij = 1

/k∑

l=1

[∑dr=1(ωir)

βd2ijr∑d

r=1(ωlr)βd2ljr

]1/(α−1)

532 2011 11th International Conference on Intelligent Systems Design and Applications

Page 4: [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) - Cordoba, Spain (2011.11.22-2011.11.24)] 2011 11th International Conference on Intelligent

ωir = 1

/d∑

l′=1

[ ∑nj=1(μij)

αd2ijr∑n

j=1(μij)αd2ijl′

]1/(β−1)

zir =⎧⎪⎪⎪⎪⎪⎪⎪⎪⎨⎪⎪⎪⎪⎪⎪⎪⎪⎩

∑xj∈(BUi−BUi)

μαijxjr∑

xj∈(BUi−BUi)μα

ij

BUi �= φ ∧ BUi − BUi = φ

a ×

∑xj∈BUi

μαijxjr∑

xj∈BUiμα

ij

+b ×

∑xj∈(BUi−BUi)

μαijxjr∑

xj∈(BUi−BUi)μα

ij

BUi �= φ ∧ BUi − BUi �= φ∑xj∈BUi

μαijxjr∑

xj∈BUiμα

ij

, otherwise.

a ≈ 1

a b o < a < b < 1 a + b = 1

zi, 1 ≤ i ≤ k, k

μij k nμij μij

μij μij ε xj ∈

BUi xj ∈ BUi xj

xj ∈ BUi

μij kωir k d

zi

ε10−3

ε

a = .85 b = .15

k

α βm

α βα β

2011 11th International Conference on Intelligent Systems Design and Applications 533

Page 5: [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) - Cordoba, Spain (2011.11.22-2011.11.24)] 2011 11th International Conference on Intelligent

mm

α β

r =∑c

i=1 si/n si

i n

xj cl

l = arg max1≤i≤k μij

534 2011 11th International Conference on Intelligent Systems Design and Applications

Page 6: [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) - Cordoba, Spain (2011.11.22-2011.11.24)] 2011 11th International Conference on Intelligent

PC = 1/n∑k

i=1

∑nj=1 μ2

ij

1/k

CE = −1/n∑k

i=1

∑nj=1 μij log(μij)

μij 1/k

2011 11th International Conference on Intelligent Systems Design and Applications 535

Page 7: [IEEE 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA) - Cordoba, Spain (2011.11.22-2011.11.24)] 2011 11th International Conference on Intelligent

o o

u

u

o

536 2011 11th International Conference on Intelligent Systems Design and Applications