Prediction of Number of Software Defects based on SMOTE

Prediction of software defects is an effective way to improve system quality, and it is a key factor affecting the efficiency of defect detection and repair in software components. The purpose of this study is to improve the effectiveness of component defect prediction in the following two ways: for...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of performability engineering 2021, Vol.17 (1), p.123
Hauptverfasser: Guoqiang, Xie, Shiyi, Xie, Xiaohong, Peng, Zhao, Li
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page 123
container_title International journal of performability engineering
container_volume 17
creator Guoqiang, Xie
Shiyi, Xie
Xiaohong, Peng
Zhao, Li
description Prediction of software defects is an effective way to improve system quality, and it is a key factor affecting the efficiency of defect detection and repair in software components. The purpose of this study is to improve the effectiveness of component defect prediction in the following two ways: for the imbalance of training data in defect prediction and the insufficient support of single regression in predicting the number of defects in components First, this study proposed to adopt SMOTE to construct a balanced sample dataset and oversample the defective components in the unbalanced sample dataset to take into account the proportion of different types of samples and improve the accuracy of prediction; second, this study proposes a method of multi-step prediction for the number of defects that supports regression after classification, and the method applies support vector machines to classify components and filter out non-defective components in the classification results, applies regression to establish a component defect number prediction model to effectively implement the multi-step prediction of component defect number, and further improves the accuracy of prediction. The evaluation of the experiment was completed on open-source datasets. The results show that the accuracy of multi-step prediction is better than the prediction by regression alone, and multi-step prediction has higher overall efficiency and applicability.
doi_str_mv 10.23940/ijpe.21.01.p12.123134
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2488256192</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2488256192</sourcerecordid><originalsourceid>FETCH-LOGICAL-c198t-491e40ae942d9357ca4eb2382344b71dd5afd656bc4e00803687b5f6e83aab43</originalsourceid><addsrcrecordid>eNotkFFLwzAUhfOg4Jz-BSn43Jp7k6bJo8zphOmE7T0k7Q10uLUmHeK_t7M-nfPwcQ58jN0BL1AYyR_afU8FQsGh6AELQAFCXrAZN5XIQYC-Ytcp7TlXWCHMWPURqWnroe2OWRey99PBUzy3bReGbxcpe6JA9ZAy7xI12Yht3za75Q27DO4z0e1_ztnueblbrPL15uV18bjOazB6yKUBktyRkdgYUVa1k-RRaBRS-gqapnShUaXytSTONRdKV74MirRwzksxZ_fTbB-7rxOlwe67UzyOjxal1lgqMDhSaqLq2KUUKdg-tgcXfyxw--fFnr1YBMvBjl7s5EX8AqT7V7Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2488256192</pqid></control><display><type>article</type><title>Prediction of Number of Software Defects based on SMOTE</title><source>Alma/SFX Local Collection</source><creator>Guoqiang, Xie ; Shiyi, Xie ; Xiaohong, Peng ; Zhao, Li</creator><creatorcontrib>Guoqiang, Xie ; Shiyi, Xie ; Xiaohong, Peng ; Zhao, Li</creatorcontrib><description>Prediction of software defects is an effective way to improve system quality, and it is a key factor affecting the efficiency of defect detection and repair in software components. The purpose of this study is to improve the effectiveness of component defect prediction in the following two ways: for the imbalance of training data in defect prediction and the insufficient support of single regression in predicting the number of defects in components First, this study proposed to adopt SMOTE to construct a balanced sample dataset and oversample the defective components in the unbalanced sample dataset to take into account the proportion of different types of samples and improve the accuracy of prediction; second, this study proposes a method of multi-step prediction for the number of defects that supports regression after classification, and the method applies support vector machines to classify components and filter out non-defective components in the classification results, applies regression to establish a component defect number prediction model to effectively implement the multi-step prediction of component defect number, and further improves the accuracy of prediction. The evaluation of the experiment was completed on open-source datasets. The results show that the accuracy of multi-step prediction is better than the prediction by regression alone, and multi-step prediction has higher overall efficiency and applicability.</description><identifier>ISSN: 0973-1318</identifier><identifier>DOI: 10.23940/ijpe.21.01.p12.123134</identifier><language>eng</language><publisher>Jaipur: RAMS Consultants</publisher><subject>Classification ; Datasets ; Defects ; Prediction models ; Predictions ; Regression ; Software ; Source code ; Support vector machines</subject><ispartof>International journal of performability engineering, 2021, Vol.17 (1), p.123</ispartof><rights>Copyright RAMS Consultants Jan 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c198t-491e40ae942d9357ca4eb2382344b71dd5afd656bc4e00803687b5f6e83aab43</citedby><cites>FETCH-LOGICAL-c198t-491e40ae942d9357ca4eb2382344b71dd5afd656bc4e00803687b5f6e83aab43</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4024,27923,27924,27925</link.rule.ids></links><search><creatorcontrib>Guoqiang, Xie</creatorcontrib><creatorcontrib>Shiyi, Xie</creatorcontrib><creatorcontrib>Xiaohong, Peng</creatorcontrib><creatorcontrib>Zhao, Li</creatorcontrib><title>Prediction of Number of Software Defects based on SMOTE</title><title>International journal of performability engineering</title><description>Prediction of software defects is an effective way to improve system quality, and it is a key factor affecting the efficiency of defect detection and repair in software components. The purpose of this study is to improve the effectiveness of component defect prediction in the following two ways: for the imbalance of training data in defect prediction and the insufficient support of single regression in predicting the number of defects in components First, this study proposed to adopt SMOTE to construct a balanced sample dataset and oversample the defective components in the unbalanced sample dataset to take into account the proportion of different types of samples and improve the accuracy of prediction; second, this study proposes a method of multi-step prediction for the number of defects that supports regression after classification, and the method applies support vector machines to classify components and filter out non-defective components in the classification results, applies regression to establish a component defect number prediction model to effectively implement the multi-step prediction of component defect number, and further improves the accuracy of prediction. The evaluation of the experiment was completed on open-source datasets. The results show that the accuracy of multi-step prediction is better than the prediction by regression alone, and multi-step prediction has higher overall efficiency and applicability.</description><subject>Classification</subject><subject>Datasets</subject><subject>Defects</subject><subject>Prediction models</subject><subject>Predictions</subject><subject>Regression</subject><subject>Software</subject><subject>Source code</subject><subject>Support vector machines</subject><issn>0973-1318</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNotkFFLwzAUhfOg4Jz-BSn43Jp7k6bJo8zphOmE7T0k7Q10uLUmHeK_t7M-nfPwcQ58jN0BL1AYyR_afU8FQsGh6AELQAFCXrAZN5XIQYC-Ytcp7TlXWCHMWPURqWnroe2OWRey99PBUzy3bReGbxcpe6JA9ZAy7xI12Yht3za75Q27DO4z0e1_ztnueblbrPL15uV18bjOazB6yKUBktyRkdgYUVa1k-RRaBRS-gqapnShUaXytSTONRdKV74MirRwzksxZ_fTbB-7rxOlwe67UzyOjxal1lgqMDhSaqLq2KUUKdg-tgcXfyxw--fFnr1YBMvBjl7s5EX8AqT7V7Q</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Guoqiang, Xie</creator><creator>Shiyi, Xie</creator><creator>Xiaohong, Peng</creator><creator>Zhao, Li</creator><general>RAMS Consultants</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7TA</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>JG9</scope></search><sort><creationdate>2021</creationdate><title>Prediction of Number of Software Defects based on SMOTE</title><author>Guoqiang, Xie ; Shiyi, Xie ; Xiaohong, Peng ; Zhao, Li</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c198t-491e40ae942d9357ca4eb2382344b71dd5afd656bc4e00803687b5f6e83aab43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Classification</topic><topic>Datasets</topic><topic>Defects</topic><topic>Prediction models</topic><topic>Predictions</topic><topic>Regression</topic><topic>Software</topic><topic>Source code</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Guoqiang, Xie</creatorcontrib><creatorcontrib>Shiyi, Xie</creatorcontrib><creatorcontrib>Xiaohong, Peng</creatorcontrib><creatorcontrib>Zhao, Li</creatorcontrib><collection>CrossRef</collection><collection>Materials Business File</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Materials Research Database</collection><jtitle>International journal of performability engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Guoqiang, Xie</au><au>Shiyi, Xie</au><au>Xiaohong, Peng</au><au>Zhao, Li</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of Number of Software Defects based on SMOTE</atitle><jtitle>International journal of performability engineering</jtitle><date>2021</date><risdate>2021</risdate><volume>17</volume><issue>1</issue><spage>123</spage><pages>123-</pages><issn>0973-1318</issn><abstract>Prediction of software defects is an effective way to improve system quality, and it is a key factor affecting the efficiency of defect detection and repair in software components. The purpose of this study is to improve the effectiveness of component defect prediction in the following two ways: for the imbalance of training data in defect prediction and the insufficient support of single regression in predicting the number of defects in components First, this study proposed to adopt SMOTE to construct a balanced sample dataset and oversample the defective components in the unbalanced sample dataset to take into account the proportion of different types of samples and improve the accuracy of prediction; second, this study proposes a method of multi-step prediction for the number of defects that supports regression after classification, and the method applies support vector machines to classify components and filter out non-defective components in the classification results, applies regression to establish a component defect number prediction model to effectively implement the multi-step prediction of component defect number, and further improves the accuracy of prediction. The evaluation of the experiment was completed on open-source datasets. The results show that the accuracy of multi-step prediction is better than the prediction by regression alone, and multi-step prediction has higher overall efficiency and applicability.</abstract><cop>Jaipur</cop><pub>RAMS Consultants</pub><doi>10.23940/ijpe.21.01.p12.123134</doi></addata></record>
fulltext fulltext
identifier ISSN: 0973-1318
ispartof International journal of performability engineering, 2021, Vol.17 (1), p.123
issn 0973-1318
language eng
recordid cdi_proquest_journals_2488256192
source Alma/SFX Local Collection
subjects Classification
Datasets
Defects
Prediction models
Predictions
Regression
Software
Source code
Support vector machines
title Prediction of Number of Software Defects based on SMOTE
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T04%3A30%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20Number%20of%20Software%20Defects%20based%20on%20SMOTE&rft.jtitle=International%20journal%20of%20performability%20engineering&rft.au=Guoqiang,%20Xie&rft.date=2021&rft.volume=17&rft.issue=1&rft.spage=123&rft.pages=123-&rft.issn=0973-1318&rft_id=info:doi/10.23940/ijpe.21.01.p12.123134&rft_dat=%3Cproquest_cross%3E2488256192%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2488256192&rft_id=info:pmid/&rfr_iscdi=true