Research on illegal E-mails recognition based on VSM and Statistical Decision Tree

This paper introduces an algorithm based on VSM algorithm and statistical decision tree (SDT) to recognize illegal e-mails. The vector space model is simple and easy to operate. At first, the vector space model (VSM ) can filter some specific words which are often used in illegal e-mails. Then, SDT...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ke-Jian Wang, Xian-Zhong Han, Tao Guo
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper introduces an algorithm based on VSM algorithm and statistical decision tree (SDT) to recognize illegal e-mails. The vector space model is simple and easy to operate. At first, the vector space model (VSM ) can filter some specific words which are often used in illegal e-mails. Then, SDT can judge illegal e-mails by Semanteme analyze. After the two steps, the illegal e-mails can also be easily identified and the recognition rate of illegal E-mails has been improved by basic experiments. Theoretical analysis and basic experiments shows that the illegal emails can be recognized effectively with VSM and SDT algorithm.
ISSN:2158-5695
DOI:10.1109/ICWAPR.2008.4635828