ΔΙΠΛΩΜΑΤΙΚΗ ΕΡΓΑΣΙΑ ΜΕΤΑΠΤΥΧΙΑΚΟΥ « Επιςτημη και...
description
Transcript of ΔΙΠΛΩΜΑΤΙΚΗ ΕΡΓΑΣΙΑ ΜΕΤΑΠΤΥΧΙΑΚΟΥ « Επιςτημη και...
-
O MA A.M 478 I. XATZY / &
-
O MA* Spam
-
O MA* RIPPER SpamHaus http://www.spamhaus .org/, ORDB -http://www.ordb.org, mail-abuse -http://www.mailabuse.org Vipuls Razor (http://razor.sourceforge.net) spam BrightMail (http://www.brightmail.com) BrightMail , (Domain restriction)SpamAssassin (http://spamassassin.taint.org) , . , , , , on-line , mail-abuse.org ordb.org. Disposable E-mail Addresses DEA spam , Spamex (http://www.spamex.com), Emailias (http://www.emailias.com),SneakeMail (http://www.sneakemail.com)
-
O MA* HTML tagsX HTML tags / javascriptAccented -
-
O MA* , .
: spam ( - legitimate). : . : . emails . . . .
-
O MA*
-
. . LING-SPAM
. ENRON
. SPAMASSASIN
. spam : 1001 . spam .. easy_ham : 5051 legitimate . legitimate spam .. hard_ham : 500 legitimate . spam .. easy_ham_2 : 1400 legitimate . .. spam_2 : 1397 spam . . 9349, spam 35%.
O MA*
619446 . 158 .
. , . 1 , 200399
757
61.63%
13% Spam
2893 .
2412 .
481 spam
-
. - O M* . ( ).
. , >, Sender wrote, - - - - - original message - - - -, .
. , Subject, Sender, To, From, Cc, Importance, . H ED,-ING, -ION, -IONS, ( Porter)
s, a, of, the, an, and, or, while, at
O M
-
Ling-SpamMultiLayerPerceptron WEKA : MultilayerPerceptron -L 0.3 -M 0.2 -N 300 -V 0 -S 0 -E 20 -H aConfusion Matrixes
O MA*Enron-SpamSpam Assasin
Vector F: 750 Type: BooleanVector F: 750 Type: CountLegitimateSpamLegitimateSpam240392390221346876405
LegitimateType: BooleanType: CountRecall99,63%99,09%Precision99,46%96,92%Fallout2,70%15,80%Accuracy99,24%96,61%Error0,76%3,39%
SpamType: BooleanType: CountRecall97,30%84,20%Precision98,11%94,85%Fallout0,37%0,91%Accuracy99,24%96,61%Error0,76%3,39%
Average Precision98,79%95,88%Accuracy99,24%96,61%Error0,76%3,39%
-
- :
, . . . HTML . , spammers , DNS-based IP spammers, (spam-for-hire sites), (MTAs) (mail relays). . /
O MA*
-
O MA
**