An Improved Feature Extraction Approach for Web Anomaly Detection Based on Semantic Structure

Anomaly-based Web application firewalls (WAFs) are vital for providing early reactions to novel Web attacks. In recent years, various machine learning, deep learning, and transfer learning-based anomaly detection approaches have been developed to protect against Web attacks. Most of them directly tr...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Security and communication networks 2021-02, Vol.2021, p.1-11
Hauptverfasser:	Cheng, Zishuai, Cui, Baojiang, Qi, Tao, Yang, Wenchuan, Fu, Junsong
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Anomalies Applications programs Behavior Datasets Feature extraction Firewalls Intrusion detection systems Knowledge Machine learning Methods Natural language processing Semantics Social research URLs
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Anomaly-based Web application firewalls (WAFs) are vital for providing early reactions to novel Web attacks. In recent years, various machine learning, deep learning, and transfer learning-based anomaly detection approaches have been developed to protect against Web attacks. Most of them directly treat the request URL as a general string that consists of letters and roughly use natural language processing (NLP) methods (i.e., Word2Vec and Doc2Vec) or domain knowledge to extract features. In this paper, we proposed an improved feature extraction approach which leveraged the advantage of the semantic structure of URLs. Semantic structure is an inherent interpretative property of the URL that identifies the function and vulnerability of each part in the URL. The evaluations on CSIC-2020 show that our feature extraction method has better performance than conventional feature extraction routine by more than average dramatic 5% improvement in accuracy, recall, and F1-score.
ISSN:	1939-0114 1939-0122
DOI:	10.1155/2021/6661124