Android malware detection based on image-based features and machine learning techniques

In this paper, a malware classification model has been proposed for detecting malware samples in the Android environment. The proposed model is based on converting some files from the source of the Android applications into grayscale images. Some image-based local features and global features, inclu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SN applied sciences 2020-07, Vol.2 (7), p.1299, Article 1299
Hauptverfasser: Ünver, Halil Murat, Bakour, Khaled
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 7
container_start_page 1299
container_title SN applied sciences
container_volume 2
creator Ünver, Halil Murat
Bakour, Khaled
description In this paper, a malware classification model has been proposed for detecting malware samples in the Android environment. The proposed model is based on converting some files from the source of the Android applications into grayscale images. Some image-based local features and global features, including four different types of local features and three different types of global features, have been extracted from the constructed grayscale image datasets and used for training the proposed model. To the best of our knowledge, this type of features is used for the first time in the Android malware detection domain. Moreover, the bag of visual words algorithm has been used to construct one feature vector from the descriptors of the local feature extracted from each image. The extracted local and global features have been used for training multiple machine learning classifiers including Random forest, k-nearest neighbors, Decision Tree, Bagging, AdaBoost and Gradient Boost. The proposed method obtained a very high classification accuracy reached 98.75% with a typical computational time does not exceed 0.018 s for each sample. The results of the proposed model outperformed the results of all compared state-of-art models in term of both classification accuracy and computational time.
doi_str_mv 10.1007/s42452-020-3132-2
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2788424586</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2788424586</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-45ef0d2092d48f86e544320decf645ba863a94d702a1f6abe62eb31c5463b0a83</originalsourceid><addsrcrecordid>eNp1kE1LAzEQhoMoWGp_gLeA52gy-djdYyl-QcGL4jFkN7PtljZbky3ivzfLip48zQzM8w7zEHIt-K3gvLhLCpQGxoEzKSQwOCMz0CCZrApx_tsbeUkWKe0451BUUpVyRt6Xwce-8_Tg9p8uIvU4YDN0faC1S-hpbrqD2yCbxhbdcIqYqAsj02y7gHSPLoYubGhGt6H7OGG6Ihet2ydc_NQ5eXu4f109sfXL4_NquWaN1NXAlMaWe-AVeFW2pUGtlATusWmN0rUrjXSV8gUHJ1rjajSAtRSNVkbW3JVyTm6m3GPsx7uD3fWnGPJJC0VZjmJyxpyIaauJfUoRW3uM-av4ZQW3o0I7KbRZoR0VWsgMTEzKu2GD8S_5f-gbh6lzhA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2788424586</pqid></control><display><type>article</type><title>Android malware detection based on image-based features and machine learning techniques</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Ünver, Halil Murat ; Bakour, Khaled</creator><creatorcontrib>Ünver, Halil Murat ; Bakour, Khaled</creatorcontrib><description>In this paper, a malware classification model has been proposed for detecting malware samples in the Android environment. The proposed model is based on converting some files from the source of the Android applications into grayscale images. Some image-based local features and global features, including four different types of local features and three different types of global features, have been extracted from the constructed grayscale image datasets and used for training the proposed model. To the best of our knowledge, this type of features is used for the first time in the Android malware detection domain. Moreover, the bag of visual words algorithm has been used to construct one feature vector from the descriptors of the local feature extracted from each image. The extracted local and global features have been used for training multiple machine learning classifiers including Random forest, k-nearest neighbors, Decision Tree, Bagging, AdaBoost and Gradient Boost. The proposed method obtained a very high classification accuracy reached 98.75% with a typical computational time does not exceed 0.018 s for each sample. The results of the proposed model outperformed the results of all compared state-of-art models in term of both classification accuracy and computational time.</description><identifier>ISSN: 2523-3963</identifier><identifier>EISSN: 2523-3971</identifier><identifier>DOI: 10.1007/s42452-020-3132-2</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Accuracy ; Algorithms ; Applied and Technical Physics ; Archives &amp; records ; Behavior ; Cellular telephones ; Chemistry/Food Science ; Classification ; Computational efficiency ; Computer applications ; Computing time ; Datasets ; Decision trees ; Digitization ; Earth Sciences ; Engineering ; Engineering: Application of Machine Learning in Engineering ; Environment ; Feature extraction ; Gray scale ; Learning algorithms ; Machine learning ; Malware ; Materials Science ; Neural networks ; Operating systems ; Research Article ; Smartphones ; Training</subject><ispartof>SN applied sciences, 2020-07, Vol.2 (7), p.1299, Article 1299</ispartof><rights>Springer Nature Switzerland AG 2020</rights><rights>Springer Nature Switzerland AG 2020.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c359t-45ef0d2092d48f86e544320decf645ba863a94d702a1f6abe62eb31c5463b0a83</citedby><cites>FETCH-LOGICAL-c359t-45ef0d2092d48f86e544320decf645ba863a94d702a1f6abe62eb31c5463b0a83</cites><orcidid>0000-0003-3327-2822</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Ünver, Halil Murat</creatorcontrib><creatorcontrib>Bakour, Khaled</creatorcontrib><title>Android malware detection based on image-based features and machine learning techniques</title><title>SN applied sciences</title><addtitle>SN Appl. Sci</addtitle><description>In this paper, a malware classification model has been proposed for detecting malware samples in the Android environment. The proposed model is based on converting some files from the source of the Android applications into grayscale images. Some image-based local features and global features, including four different types of local features and three different types of global features, have been extracted from the constructed grayscale image datasets and used for training the proposed model. To the best of our knowledge, this type of features is used for the first time in the Android malware detection domain. Moreover, the bag of visual words algorithm has been used to construct one feature vector from the descriptors of the local feature extracted from each image. The extracted local and global features have been used for training multiple machine learning classifiers including Random forest, k-nearest neighbors, Decision Tree, Bagging, AdaBoost and Gradient Boost. The proposed method obtained a very high classification accuracy reached 98.75% with a typical computational time does not exceed 0.018 s for each sample. The results of the proposed model outperformed the results of all compared state-of-art models in term of both classification accuracy and computational time.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Applied and Technical Physics</subject><subject>Archives &amp; records</subject><subject>Behavior</subject><subject>Cellular telephones</subject><subject>Chemistry/Food Science</subject><subject>Classification</subject><subject>Computational efficiency</subject><subject>Computer applications</subject><subject>Computing time</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Digitization</subject><subject>Earth Sciences</subject><subject>Engineering</subject><subject>Engineering: Application of Machine Learning in Engineering</subject><subject>Environment</subject><subject>Feature extraction</subject><subject>Gray scale</subject><subject>Learning algorithms</subject><subject>Machine learning</subject><subject>Malware</subject><subject>Materials Science</subject><subject>Neural networks</subject><subject>Operating systems</subject><subject>Research Article</subject><subject>Smartphones</subject><subject>Training</subject><issn>2523-3963</issn><issn>2523-3971</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp1kE1LAzEQhoMoWGp_gLeA52gy-djdYyl-QcGL4jFkN7PtljZbky3ivzfLip48zQzM8w7zEHIt-K3gvLhLCpQGxoEzKSQwOCMz0CCZrApx_tsbeUkWKe0451BUUpVyRt6Xwce-8_Tg9p8uIvU4YDN0faC1S-hpbrqD2yCbxhbdcIqYqAsj02y7gHSPLoYubGhGt6H7OGG6Ihet2ydc_NQ5eXu4f109sfXL4_NquWaN1NXAlMaWe-AVeFW2pUGtlATusWmN0rUrjXSV8gUHJ1rjajSAtRSNVkbW3JVyTm6m3GPsx7uD3fWnGPJJC0VZjmJyxpyIaauJfUoRW3uM-av4ZQW3o0I7KbRZoR0VWsgMTEzKu2GD8S_5f-gbh6lzhA</recordid><startdate>20200701</startdate><enddate>20200701</enddate><creator>Ünver, Halil Murat</creator><creator>Bakour, Khaled</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-3327-2822</orcidid></search><sort><creationdate>20200701</creationdate><title>Android malware detection based on image-based features and machine learning techniques</title><author>Ünver, Halil Murat ; Bakour, Khaled</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-45ef0d2092d48f86e544320decf645ba863a94d702a1f6abe62eb31c5463b0a83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Applied and Technical Physics</topic><topic>Archives &amp; records</topic><topic>Behavior</topic><topic>Cellular telephones</topic><topic>Chemistry/Food Science</topic><topic>Classification</topic><topic>Computational efficiency</topic><topic>Computer applications</topic><topic>Computing time</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Digitization</topic><topic>Earth Sciences</topic><topic>Engineering</topic><topic>Engineering: Application of Machine Learning in Engineering</topic><topic>Environment</topic><topic>Feature extraction</topic><topic>Gray scale</topic><topic>Learning algorithms</topic><topic>Machine learning</topic><topic>Malware</topic><topic>Materials Science</topic><topic>Neural networks</topic><topic>Operating systems</topic><topic>Research Article</topic><topic>Smartphones</topic><topic>Training</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ünver, Halil Murat</creatorcontrib><creatorcontrib>Bakour, Khaled</creatorcontrib><collection>CrossRef</collection><jtitle>SN applied sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ünver, Halil Murat</au><au>Bakour, Khaled</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Android malware detection based on image-based features and machine learning techniques</atitle><jtitle>SN applied sciences</jtitle><stitle>SN Appl. Sci</stitle><date>2020-07-01</date><risdate>2020</risdate><volume>2</volume><issue>7</issue><spage>1299</spage><pages>1299-</pages><artnum>1299</artnum><issn>2523-3963</issn><eissn>2523-3971</eissn><abstract>In this paper, a malware classification model has been proposed for detecting malware samples in the Android environment. The proposed model is based on converting some files from the source of the Android applications into grayscale images. Some image-based local features and global features, including four different types of local features and three different types of global features, have been extracted from the constructed grayscale image datasets and used for training the proposed model. To the best of our knowledge, this type of features is used for the first time in the Android malware detection domain. Moreover, the bag of visual words algorithm has been used to construct one feature vector from the descriptors of the local feature extracted from each image. The extracted local and global features have been used for training multiple machine learning classifiers including Random forest, k-nearest neighbors, Decision Tree, Bagging, AdaBoost and Gradient Boost. The proposed method obtained a very high classification accuracy reached 98.75% with a typical computational time does not exceed 0.018 s for each sample. The results of the proposed model outperformed the results of all compared state-of-art models in term of both classification accuracy and computational time.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><doi>10.1007/s42452-020-3132-2</doi><orcidid>https://orcid.org/0000-0003-3327-2822</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2523-3963
ispartof SN applied sciences, 2020-07, Vol.2 (7), p.1299, Article 1299
issn 2523-3963
2523-3971
language eng
recordid cdi_proquest_journals_2788424586
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Accuracy
Algorithms
Applied and Technical Physics
Archives & records
Behavior
Cellular telephones
Chemistry/Food Science
Classification
Computational efficiency
Computer applications
Computing time
Datasets
Decision trees
Digitization
Earth Sciences
Engineering
Engineering: Application of Machine Learning in Engineering
Environment
Feature extraction
Gray scale
Learning algorithms
Machine learning
Malware
Materials Science
Neural networks
Operating systems
Research Article
Smartphones
Training
title Android malware detection based on image-based features and machine learning techniques
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T17%3A23%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Android%20malware%20detection%20based%20on%20image-based%20features%20and%20machine%20learning%20techniques&rft.jtitle=SN%20applied%20sciences&rft.au=%C3%9Cnver,%20Halil%20Murat&rft.date=2020-07-01&rft.volume=2&rft.issue=7&rft.spage=1299&rft.pages=1299-&rft.artnum=1299&rft.issn=2523-3963&rft.eissn=2523-3971&rft_id=info:doi/10.1007/s42452-020-3132-2&rft_dat=%3Cproquest_cross%3E2788424586%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2788424586&rft_id=info:pmid/&rfr_iscdi=true