Latent semantic analysis and keyword extraction for phishing classification

Bibliographic Details
Main Authors:	L'Huillier, Gaston; Hevia, Alejandro; Weber, Richard; Rios, Sebastian
Format:	Conference Proceeding
Language:	English
Subjects:	Algorithm design and analysis; Data mining; Feature extraction; Latent Semantic Analysis; Linear discriminant analysis; Logistics; Machine learning; Machine learning algorithms; Phishing detection; Support vector machine classification; Support vector machines; Text mining

Description
Summary:	Phishing email fraud has been considered one of the main cyber-threats in recent years. Its development has been closely related to social engineering techniques, in which different fraud strategies are used to deceive a naïve email user. In this work, a latent semantic analysis and text mining methodology is proposed for the characterisation of such strategies and their subsequent classification using supervised learning algorithms. The results show that the feature set obtained in this work is competitive with previous phishing feature extraction methodologies, achieving promising results across different benchmark machine learning classification techniques.
DOI:	10.1109/ISI.2010.5484762