Mining Sequential Patterns and Tree Patterns to Detect Erroneous Sentences
-
Author
stelios-karabasakis -
Category
Documents
-
view
812 -
download
1
Embed Size (px)
description
Transcript of Mining Sequential Patterns and Tree Patterns to Detect Erroneous Sentences
- 1. Authored by Guihua Sun Gao Cong Xiaohua Liu Chin-Yew Lin Ming Zhou Microsoft Research Asia Mining Sequential Patterns and Tree Patterns to Detect Erroneous Sentences
2.
3.
- ( .. N-grams)
-
- N
4.
- ,
- .
LabeledTree Patterns LabeledSequential Patterns 5. Labeled Tree Patterns (LTP)
- Thispresentationbeboring.
Labeled Tree Pattern( tp ) : Tree pattern ( tp.t ) : Class Label ( tp.c ) (Ccorrect, Eerroneous)DT NN VB JJ . NP ADJP VP S DT NN VB JJ . NP ADJP VP S VB NP VP S E 6. Labeled Sequential Patterns (LSP)
- ;
-
- part-of-speech (POS) tagger
-
-
- function words ( , , , , ..)
-
-
-
- time words ( ..yesterday, during, afterward.)
-
- sequential patterns
-
- function word + POS tag,
Thispresentationissoboredom. This NN VBZ so NN < so, NN > E Labeled Sequential Pattern ( sp ) : Sequential pattern ( sp.s ) : Class Label ( sp.c ) NN so 7. patterns
- ( support):
- ( confidence):
- pattern
-
- confidence
-
- patterns confidence , support.
- support( minsup ) pattern .
8. Top-k Patterns
- :
-
- top-k LTP( TkLTP) top-k LSP (TkLSP) .
- (, ):
-
- pattern p sup(p) > minsup
-
- k patterns .
-
- patterns, confidence pattern .
-
- patterns 1, patterns 2 ,...
-
-
- pattern confidence, .
-
9.
-
- Nave Bayes ( NB )
-
- Support Vector Machines ( SVM )
- : Pattern Based Classification ( PBC )
-
- k
-
-
- 1:main classifier
-
-
-
- 2.. k: stand-by classifiers
-
-
- top-k patterns k ,
-
-
- , default.
-
10. : Top-3 Pattern Based Classifier Tree Patterns CL1 Sequential Patterns CL2 CL3 default main classifier stand-by classifiers 4 sp3 3 sp2 3 5 4 sp4 1 sp1 4 5 sp5 5 sp6 6 6 tp5 6 tp4 2 tp3 1 3 2 tp2 2 1 tp1 S3 S2 S1 4 sp3 3 sp2 3 5 4 sp4 1 sp1 4 5 sp5 5 sp6 6 6 tp5 6 tp4 2 tp3 1 3 2 tp2 2 1 tp1 S3 S2 S1 4 sp3 3 sp2 3 5 4 sp4 1 sp1 4 5 sp5 5 sp6 6 6 tp5 6 tp4 2 tp3 1 3 2 tp2 2 1 tp1 S3 S2 S1 4 sp3 3 sp2 3 5 4 sp4 1 sp1 4 5 sp5 5 sp6 6 6 tp5 6 tp4 2 tp3 1 3 2 tp2 2 1 tp1 S3 S2 S1 4 sp3 3 sp2 3 5 4 sp4 1 sp1 4 5 sp5 5 sp6 6 6 tp5 6 tp4 2 tp3 1 3 2 tp2 2 1 tp1 S3 S2 S1 4 sp3 3 sp2 3 5 4 sp4 1 sp1 4 5 sp5 5 sp6 6 6 tp5 6 tp4 2 tp3 1 3 2 tp2 2 1 tp1 S3 S2 S1 4 sp3 3 sp2 3 5 4 sp4 1 sp1 4 5 sp5 5 sp6 6 6 tp5 6 tp4 2 tp3 1 3 2 tp2 2 1 tp1 S3 S2 S1 11.
- PBC SVM , .
- PBC , SVN .
12.