An effective genetic algorithm-based feature selection method for intrusion detection systems

Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information relat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers & security 2021-11, Vol.110, p.102448, Article 102448
Hauptverfasser: Halim, Zahid, Yousaf, Muhammad Nadeem, Waqas, Muhammad, Sulaiman, Muhammad, Abbas, Ghulam, Hussain, Masroor, Ahmad, Iftekhar, Hanif, Muhammad
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 102448
container_title Computers & security
container_volume 110
creator Halim, Zahid
Yousaf, Muhammad Nadeem
Waqas, Muhammad
Sulaiman, Muhammad
Abbas, Ghulam
Hussain, Masroor
Ahmad, Iftekhar
Hanif, Muhammad
description Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.
doi_str_mv 10.1016/j.cose.2021.102448
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2582437984</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167404821002728</els_id><sourcerecordid>2582437984</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AVcF1x3zaNMU3AyDLxhwo0sJbXIzkzJtxiQd8N-b0lm7unDuOffxIXRP8Ipgwh-7lXIBVhRTkgRaFOICLYioaM4pFpdokUxVXuBCXKObEDqMScWFWKDv9ZCBMaCiPUG2gwGiVVlz2Dlv477P2yaAzgw0cfSQBThMTjdkPcS9Sw3nMztEP4ZJ1BDP7fAbIvThFl2Z5hDg7lyX6Ovl-XPzlm8_Xt83622uGBUx17iqMa8oYUUFLa6NajU2ijJDQeiCm9IAMFaLkmiCObQlbzg0ZUO1IowatkQP89yjdz8jhCg7N_ohrZS0FLRgVS2K5KKzS3kXggcjj972jf-VBMsJo-zkhFFOGOWMMYWe5hCk-08WvAzKwqBAW5-eldrZ_-J_QIR9AQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2582437984</pqid></control><display><type>article</type><title>An effective genetic algorithm-based feature selection method for intrusion detection systems</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Halim, Zahid ; Yousaf, Muhammad Nadeem ; Waqas, Muhammad ; Sulaiman, Muhammad ; Abbas, Ghulam ; Hussain, Masroor ; Ahmad, Iftekhar ; Hanif, Muhammad</creator><creatorcontrib>Halim, Zahid ; Yousaf, Muhammad Nadeem ; Waqas, Muhammad ; Sulaiman, Muhammad ; Abbas, Ghulam ; Hussain, Masroor ; Ahmad, Iftekhar ; Hanif, Muhammad</creatorcontrib><description>Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.</description><identifier>ISSN: 0167-4048</identifier><identifier>EISSN: 1872-6208</identifier><identifier>DOI: 10.1016/j.cose.2021.102448</identifier><language>eng</language><publisher>Amsterdam: Elsevier Ltd</publisher><subject>Communications traffic ; Cybersecurity ; Data analysis ; Domains ; Feature selection ; Firewalls ; Genetic algorithm ; Genetic algorithms ; Intrusion detection ; Intrusion detection systems ; Machine learning ; System effectiveness</subject><ispartof>Computers &amp; security, 2021-11, Vol.110, p.102448, Article 102448</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier Sequoia S.A. Nov 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</citedby><cites>FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0167404821002728$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Halim, Zahid</creatorcontrib><creatorcontrib>Yousaf, Muhammad Nadeem</creatorcontrib><creatorcontrib>Waqas, Muhammad</creatorcontrib><creatorcontrib>Sulaiman, Muhammad</creatorcontrib><creatorcontrib>Abbas, Ghulam</creatorcontrib><creatorcontrib>Hussain, Masroor</creatorcontrib><creatorcontrib>Ahmad, Iftekhar</creatorcontrib><creatorcontrib>Hanif, Muhammad</creatorcontrib><title>An effective genetic algorithm-based feature selection method for intrusion detection systems</title><title>Computers &amp; security</title><description>Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.</description><subject>Communications traffic</subject><subject>Cybersecurity</subject><subject>Data analysis</subject><subject>Domains</subject><subject>Feature selection</subject><subject>Firewalls</subject><subject>Genetic algorithm</subject><subject>Genetic algorithms</subject><subject>Intrusion detection</subject><subject>Intrusion detection systems</subject><subject>Machine learning</subject><subject>System effectiveness</subject><issn>0167-4048</issn><issn>1872-6208</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AVcF1x3zaNMU3AyDLxhwo0sJbXIzkzJtxiQd8N-b0lm7unDuOffxIXRP8Ipgwh-7lXIBVhRTkgRaFOICLYioaM4pFpdokUxVXuBCXKObEDqMScWFWKDv9ZCBMaCiPUG2gwGiVVlz2Dlv477P2yaAzgw0cfSQBThMTjdkPcS9Sw3nMztEP4ZJ1BDP7fAbIvThFl2Z5hDg7lyX6Ovl-XPzlm8_Xt83622uGBUx17iqMa8oYUUFLa6NajU2ijJDQeiCm9IAMFaLkmiCObQlbzg0ZUO1IowatkQP89yjdz8jhCg7N_ohrZS0FLRgVS2K5KKzS3kXggcjj972jf-VBMsJo-zkhFFOGOWMMYWe5hCk-08WvAzKwqBAW5-eldrZ_-J_QIR9AQ</recordid><startdate>202111</startdate><enddate>202111</enddate><creator>Halim, Zahid</creator><creator>Yousaf, Muhammad Nadeem</creator><creator>Waqas, Muhammad</creator><creator>Sulaiman, Muhammad</creator><creator>Abbas, Ghulam</creator><creator>Hussain, Masroor</creator><creator>Ahmad, Iftekhar</creator><creator>Hanif, Muhammad</creator><general>Elsevier Ltd</general><general>Elsevier Sequoia S.A</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>K7.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202111</creationdate><title>An effective genetic algorithm-based feature selection method for intrusion detection systems</title><author>Halim, Zahid ; Yousaf, Muhammad Nadeem ; Waqas, Muhammad ; Sulaiman, Muhammad ; Abbas, Ghulam ; Hussain, Masroor ; Ahmad, Iftekhar ; Hanif, Muhammad</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Communications traffic</topic><topic>Cybersecurity</topic><topic>Data analysis</topic><topic>Domains</topic><topic>Feature selection</topic><topic>Firewalls</topic><topic>Genetic algorithm</topic><topic>Genetic algorithms</topic><topic>Intrusion detection</topic><topic>Intrusion detection systems</topic><topic>Machine learning</topic><topic>System effectiveness</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Halim, Zahid</creatorcontrib><creatorcontrib>Yousaf, Muhammad Nadeem</creatorcontrib><creatorcontrib>Waqas, Muhammad</creatorcontrib><creatorcontrib>Sulaiman, Muhammad</creatorcontrib><creatorcontrib>Abbas, Ghulam</creatorcontrib><creatorcontrib>Hussain, Masroor</creatorcontrib><creatorcontrib>Ahmad, Iftekhar</creatorcontrib><creatorcontrib>Hanif, Muhammad</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Criminal Justice (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computers &amp; security</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Halim, Zahid</au><au>Yousaf, Muhammad Nadeem</au><au>Waqas, Muhammad</au><au>Sulaiman, Muhammad</au><au>Abbas, Ghulam</au><au>Hussain, Masroor</au><au>Ahmad, Iftekhar</au><au>Hanif, Muhammad</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An effective genetic algorithm-based feature selection method for intrusion detection systems</atitle><jtitle>Computers &amp; security</jtitle><date>2021-11</date><risdate>2021</risdate><volume>110</volume><spage>102448</spage><pages>102448-</pages><artnum>102448</artnum><issn>0167-4048</issn><eissn>1872-6208</eissn><abstract>Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.</abstract><cop>Amsterdam</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.cose.2021.102448</doi></addata></record>
fulltext fulltext
identifier ISSN: 0167-4048
ispartof Computers & security, 2021-11, Vol.110, p.102448, Article 102448
issn 0167-4048
1872-6208
language eng
recordid cdi_proquest_journals_2582437984
source Elsevier ScienceDirect Journals Complete
subjects Communications traffic
Cybersecurity
Data analysis
Domains
Feature selection
Firewalls
Genetic algorithm
Genetic algorithms
Intrusion detection
Intrusion detection systems
Machine learning
System effectiveness
title An effective genetic algorithm-based feature selection method for intrusion detection systems
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T15%3A34%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20effective%20genetic%20algorithm-based%20feature%20selection%20method%20for%20intrusion%20detection%20systems&rft.jtitle=Computers%20&%20security&rft.au=Halim,%20Zahid&rft.date=2021-11&rft.volume=110&rft.spage=102448&rft.pages=102448-&rft.artnum=102448&rft.issn=0167-4048&rft.eissn=1872-6208&rft_id=info:doi/10.1016/j.cose.2021.102448&rft_dat=%3Cproquest_cross%3E2582437984%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2582437984&rft_id=info:pmid/&rft_els_id=S0167404821002728&rfr_iscdi=true