A data reduction strategy and its application on scan and backscatter detection using rule-based classifiers

•A novel data reduction strategy is presented.•Multi measure strategy is used to reduce the number of features.•Training data is significantly reduced, without greatly affecting the IDS accuracy.•Boost up the detection process speed.•Does not require large computational resources to process a huge a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2018-04, Vol.95, p.272-279
Hauptverfasser: Herrera-Semenets, Vitali, Andrés Pérez-García, Osvaldo, Hernández-León, Raudel, van den Berg, Jan, Doerr, Christian
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A novel data reduction strategy is presented.•Multi measure strategy is used to reduce the number of features.•Training data is significantly reduced, without greatly affecting the IDS accuracy.•Boost up the detection process speed.•Does not require large computational resources to process a huge amount of data. In the last few years, the telecommunications scenario has experienced an increase in the volume of information generated, as well as in the execution of malicious activities. In order to complement Intrusion Detection Systems (IDSs), data mining techniques have begun to play a fundamental role in data analysis. On the other hand, the presence of useless information and the amount of data generated by telecommunication services (leading to a huge dimensional problem), can affect the performance of traditional IDSs. In this sense, a data preprocessing strategy is necessary to reduce data, but reducing data without affecting the accuracy of IDSs represents a challenge. In this paper, we propose a new data preprocessing strategy which reduces the number of features and instances in the training collection without greatly affecting the achieved accuracy of IDSs. Finally, our proposal is evaluated using four different rule-based classifiers, which are tested on real scan and backscatter data collected by a network telescope.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2017.11.041