An effective genetic algorithm-based feature selection method for intrusion detection systems
Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information relat...
Gespeichert in:
Veröffentlicht in: | Computers & security 2021-11, Vol.110, p.102448, Article 102448 |
---|---|
Hauptverfasser: | , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 102448 |
container_title | Computers & security |
container_volume | 110 |
creator | Halim, Zahid Yousaf, Muhammad Nadeem Waqas, Muhammad Sulaiman, Muhammad Abbas, Ghulam Hussain, Masroor Ahmad, Iftekhar Hanif, Muhammad |
description | Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%. |
doi_str_mv | 10.1016/j.cose.2021.102448 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2582437984</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167404821002728</els_id><sourcerecordid>2582437984</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AVcF1x3zaNMU3AyDLxhwo0sJbXIzkzJtxiQd8N-b0lm7unDuOffxIXRP8Ipgwh-7lXIBVhRTkgRaFOICLYioaM4pFpdokUxVXuBCXKObEDqMScWFWKDv9ZCBMaCiPUG2gwGiVVlz2Dlv477P2yaAzgw0cfSQBThMTjdkPcS9Sw3nMztEP4ZJ1BDP7fAbIvThFl2Z5hDg7lyX6Ovl-XPzlm8_Xt83622uGBUx17iqMa8oYUUFLa6NajU2ijJDQeiCm9IAMFaLkmiCObQlbzg0ZUO1IowatkQP89yjdz8jhCg7N_ohrZS0FLRgVS2K5KKzS3kXggcjj972jf-VBMsJo-zkhFFOGOWMMYWe5hCk-08WvAzKwqBAW5-eldrZ_-J_QIR9AQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2582437984</pqid></control><display><type>article</type><title>An effective genetic algorithm-based feature selection method for intrusion detection systems</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Halim, Zahid ; Yousaf, Muhammad Nadeem ; Waqas, Muhammad ; Sulaiman, Muhammad ; Abbas, Ghulam ; Hussain, Masroor ; Ahmad, Iftekhar ; Hanif, Muhammad</creator><creatorcontrib>Halim, Zahid ; Yousaf, Muhammad Nadeem ; Waqas, Muhammad ; Sulaiman, Muhammad ; Abbas, Ghulam ; Hussain, Masroor ; Ahmad, Iftekhar ; Hanif, Muhammad</creatorcontrib><description>Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.</description><identifier>ISSN: 0167-4048</identifier><identifier>EISSN: 1872-6208</identifier><identifier>DOI: 10.1016/j.cose.2021.102448</identifier><language>eng</language><publisher>Amsterdam: Elsevier Ltd</publisher><subject>Communications traffic ; Cybersecurity ; Data analysis ; Domains ; Feature selection ; Firewalls ; Genetic algorithm ; Genetic algorithms ; Intrusion detection ; Intrusion detection systems ; Machine learning ; System effectiveness</subject><ispartof>Computers & security, 2021-11, Vol.110, p.102448, Article 102448</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier Sequoia S.A. Nov 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</citedby><cites>FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0167404821002728$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Halim, Zahid</creatorcontrib><creatorcontrib>Yousaf, Muhammad Nadeem</creatorcontrib><creatorcontrib>Waqas, Muhammad</creatorcontrib><creatorcontrib>Sulaiman, Muhammad</creatorcontrib><creatorcontrib>Abbas, Ghulam</creatorcontrib><creatorcontrib>Hussain, Masroor</creatorcontrib><creatorcontrib>Ahmad, Iftekhar</creatorcontrib><creatorcontrib>Hanif, Muhammad</creatorcontrib><title>An effective genetic algorithm-based feature selection method for intrusion detection systems</title><title>Computers & security</title><description>Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.</description><subject>Communications traffic</subject><subject>Cybersecurity</subject><subject>Data analysis</subject><subject>Domains</subject><subject>Feature selection</subject><subject>Firewalls</subject><subject>Genetic algorithm</subject><subject>Genetic algorithms</subject><subject>Intrusion detection</subject><subject>Intrusion detection systems</subject><subject>Machine learning</subject><subject>System effectiveness</subject><issn>0167-4048</issn><issn>1872-6208</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AVcF1x3zaNMU3AyDLxhwo0sJbXIzkzJtxiQd8N-b0lm7unDuOffxIXRP8Ipgwh-7lXIBVhRTkgRaFOICLYioaM4pFpdokUxVXuBCXKObEDqMScWFWKDv9ZCBMaCiPUG2gwGiVVlz2Dlv477P2yaAzgw0cfSQBThMTjdkPcS9Sw3nMztEP4ZJ1BDP7fAbIvThFl2Z5hDg7lyX6Ovl-XPzlm8_Xt83622uGBUx17iqMa8oYUUFLa6NajU2ijJDQeiCm9IAMFaLkmiCObQlbzg0ZUO1IowatkQP89yjdz8jhCg7N_ohrZS0FLRgVS2K5KKzS3kXggcjj972jf-VBMsJo-zkhFFOGOWMMYWe5hCk-08WvAzKwqBAW5-eldrZ_-J_QIR9AQ</recordid><startdate>202111</startdate><enddate>202111</enddate><creator>Halim, Zahid</creator><creator>Yousaf, Muhammad Nadeem</creator><creator>Waqas, Muhammad</creator><creator>Sulaiman, Muhammad</creator><creator>Abbas, Ghulam</creator><creator>Hussain, Masroor</creator><creator>Ahmad, Iftekhar</creator><creator>Hanif, Muhammad</creator><general>Elsevier Ltd</general><general>Elsevier Sequoia S.A</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>K7.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202111</creationdate><title>An effective genetic algorithm-based feature selection method for intrusion detection systems</title><author>Halim, Zahid ; Yousaf, Muhammad Nadeem ; Waqas, Muhammad ; Sulaiman, Muhammad ; Abbas, Ghulam ; Hussain, Masroor ; Ahmad, Iftekhar ; Hanif, Muhammad</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-d07906721347eb09fcbd0fc23f2e8d46f5fee339851d106eb56a6ea5a2dc132f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Communications traffic</topic><topic>Cybersecurity</topic><topic>Data analysis</topic><topic>Domains</topic><topic>Feature selection</topic><topic>Firewalls</topic><topic>Genetic algorithm</topic><topic>Genetic algorithms</topic><topic>Intrusion detection</topic><topic>Intrusion detection systems</topic><topic>Machine learning</topic><topic>System effectiveness</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Halim, Zahid</creatorcontrib><creatorcontrib>Yousaf, Muhammad Nadeem</creatorcontrib><creatorcontrib>Waqas, Muhammad</creatorcontrib><creatorcontrib>Sulaiman, Muhammad</creatorcontrib><creatorcontrib>Abbas, Ghulam</creatorcontrib><creatorcontrib>Hussain, Masroor</creatorcontrib><creatorcontrib>Ahmad, Iftekhar</creatorcontrib><creatorcontrib>Hanif, Muhammad</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Criminal Justice (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computers & security</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Halim, Zahid</au><au>Yousaf, Muhammad Nadeem</au><au>Waqas, Muhammad</au><au>Sulaiman, Muhammad</au><au>Abbas, Ghulam</au><au>Hussain, Masroor</au><au>Ahmad, Iftekhar</au><au>Hanif, Muhammad</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An effective genetic algorithm-based feature selection method for intrusion detection systems</atitle><jtitle>Computers & security</jtitle><date>2021-11</date><risdate>2021</risdate><volume>110</volume><spage>102448</spage><pages>102448-</pages><artnum>102448</artnum><issn>0167-4048</issn><eissn>1872-6208</eissn><abstract>Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.</abstract><cop>Amsterdam</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.cose.2021.102448</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0167-4048 |
ispartof | Computers & security, 2021-11, Vol.110, p.102448, Article 102448 |
issn | 0167-4048 1872-6208 |
language | eng |
recordid | cdi_proquest_journals_2582437984 |
source | Elsevier ScienceDirect Journals Complete |
subjects | Communications traffic Cybersecurity Data analysis Domains Feature selection Firewalls Genetic algorithm Genetic algorithms Intrusion detection Intrusion detection systems Machine learning System effectiveness |
title | An effective genetic algorithm-based feature selection method for intrusion detection systems |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T15%3A34%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20effective%20genetic%20algorithm-based%20feature%20selection%20method%20for%20intrusion%20detection%20systems&rft.jtitle=Computers%20&%20security&rft.au=Halim,%20Zahid&rft.date=2021-11&rft.volume=110&rft.spage=102448&rft.pages=102448-&rft.artnum=102448&rft.issn=0167-4048&rft.eissn=1872-6208&rft_id=info:doi/10.1016/j.cose.2021.102448&rft_dat=%3Cproquest_cross%3E2582437984%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2582437984&rft_id=info:pmid/&rft_els_id=S0167404821002728&rfr_iscdi=true |