Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model
Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL...
Gespeichert in:
Veröffentlicht in: | IEEE access 2020, Vol.8, p.207858-207870 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 207870 |
---|---|
container_issue | |
container_start_page | 207858 |
container_title | IEEE access |
container_volume | 8 |
creator | Ju, Xiaolin Qian, Jie Chen, Zhihua Zhao, Chunyu Qian, Junyan |
description | Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques. |
doi_str_mv | 10.1109/ACCESS.2020.3037235 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2020_3037235</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9253643</ieee_id><doaj_id>oai_doaj_org_article_b93307bb844f4edda843847b824ecfba</doaj_id><sourcerecordid>2465442983</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</originalsourceid><addsrcrecordid>eNpNkVFLwzAUhYsoOHS_YC8FnzvTJG0T3-bYdNAhOH0OSXszMusyk3aiv95sHcO8JDnc79wDJ4pGKRqnKeL3k-l0tlqNMcJoTBApMMkuogFOc56QjOSX_97X0dD7DQqHBSkrBtHHsmscnZcP8UxrqFqzh3guu6aNS1vJxvzK1thtbHU829umO35WVrff0kH8KD3UcVCCSSClM7KFAK6Nb00Vv8LagfcHZGlraG6jKy0bD8PTfRO9z2dv0-ekfHlaTCdlUlHE2kQVhIVsNWCeUVAYsRxQlUsiEfCMpDgNkkaUK1TnlGklNXClM4wgqxDm5CZa9L61lRuxc-ZTuh9hpRFHwbq1kC4EbEAoTggqlGKUagp1LRkljBaKYQpVcA5ed73XztmvDnwrNrZz2xBfYJpnlGLOSJgi_VTlrPcO9HlrisShJNGXJA4liVNJgRr1lAGAM8FxKIoS8geJz42J</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2465442983</pqid></control><display><type>article</type><title>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>IEEE Xplore Open Access Journals</source><creator>Ju, Xiaolin ; Qian, Jie ; Chen, Zhihua ; Zhao, Chunyu ; Qian, Junyan</creator><creatorcontrib>Ju, Xiaolin ; Qian, Jie ; Chen, Zhihua ; Zhao, Chunyu ; Qian, Junyan</creatorcontrib><description>Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2020.3037235</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Analytical models ; Bivariate analysis ; Debugging ; Empirical analysis ; Evolution ; fault localization ; Fault location ; Localization ; logistic regression analysis ; Logistics ; Mathematical model ; Measurement ; Multivariate analysis ; Object oriented modeling ; Predictive models ; Regression analysis ; Regression models ; Software ; Software testing ; Tracking systems</subject><ispartof>IEEE access, 2020, Vol.8, p.207858-207870</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</citedby><cites>FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</cites><orcidid>0000-0003-2579-5359</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9253643$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,860,2096,4010,27610,27900,27901,27902,54908</link.rule.ids></links><search><creatorcontrib>Ju, Xiaolin</creatorcontrib><creatorcontrib>Qian, Jie</creatorcontrib><creatorcontrib>Chen, Zhihua</creatorcontrib><creatorcontrib>Zhao, Chunyu</creatorcontrib><creatorcontrib>Qian, Junyan</creatorcontrib><title>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</title><title>IEEE access</title><addtitle>Access</addtitle><description>Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques.</description><subject>Analytical models</subject><subject>Bivariate analysis</subject><subject>Debugging</subject><subject>Empirical analysis</subject><subject>Evolution</subject><subject>fault localization</subject><subject>Fault location</subject><subject>Localization</subject><subject>logistic regression analysis</subject><subject>Logistics</subject><subject>Mathematical model</subject><subject>Measurement</subject><subject>Multivariate analysis</subject><subject>Object oriented modeling</subject><subject>Predictive models</subject><subject>Regression analysis</subject><subject>Regression models</subject><subject>Software</subject><subject>Software testing</subject><subject>Tracking systems</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNkVFLwzAUhYsoOHS_YC8FnzvTJG0T3-bYdNAhOH0OSXszMusyk3aiv95sHcO8JDnc79wDJ4pGKRqnKeL3k-l0tlqNMcJoTBApMMkuogFOc56QjOSX_97X0dD7DQqHBSkrBtHHsmscnZcP8UxrqFqzh3guu6aNS1vJxvzK1thtbHU829umO35WVrff0kH8KD3UcVCCSSClM7KFAK6Nb00Vv8LagfcHZGlraG6jKy0bD8PTfRO9z2dv0-ekfHlaTCdlUlHE2kQVhIVsNWCeUVAYsRxQlUsiEfCMpDgNkkaUK1TnlGklNXClM4wgqxDm5CZa9L61lRuxc-ZTuh9hpRFHwbq1kC4EbEAoTggqlGKUagp1LRkljBaKYQpVcA5ed73XztmvDnwrNrZz2xBfYJpnlGLOSJgi_VTlrPcO9HlrisShJNGXJA4liVNJgRr1lAGAM8FxKIoS8geJz42J</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Ju, Xiaolin</creator><creator>Qian, Jie</creator><creator>Chen, Zhihua</creator><creator>Zhao, Chunyu</creator><creator>Qian, Junyan</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-2579-5359</orcidid></search><sort><creationdate>2020</creationdate><title>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</title><author>Ju, Xiaolin ; Qian, Jie ; Chen, Zhihua ; Zhao, Chunyu ; Qian, Junyan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Analytical models</topic><topic>Bivariate analysis</topic><topic>Debugging</topic><topic>Empirical analysis</topic><topic>Evolution</topic><topic>fault localization</topic><topic>Fault location</topic><topic>Localization</topic><topic>logistic regression analysis</topic><topic>Logistics</topic><topic>Mathematical model</topic><topic>Measurement</topic><topic>Multivariate analysis</topic><topic>Object oriented modeling</topic><topic>Predictive models</topic><topic>Regression analysis</topic><topic>Regression models</topic><topic>Software</topic><topic>Software testing</topic><topic>Tracking systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ju, Xiaolin</creatorcontrib><creatorcontrib>Qian, Jie</creatorcontrib><creatorcontrib>Chen, Zhihua</creatorcontrib><creatorcontrib>Zhao, Chunyu</creatorcontrib><creatorcontrib>Qian, Junyan</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ju, Xiaolin</au><au>Qian, Jie</au><au>Chen, Zhihua</au><au>Zhao, Chunyu</au><au>Qian, Junyan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2020</date><risdate>2020</risdate><volume>8</volume><spage>207858</spage><epage>207870</epage><pages>207858-207870</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2020.3037235</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0003-2579-5359</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2020, Vol.8, p.207858-207870 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_crossref_primary_10_1109_ACCESS_2020_3037235 |
source | DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; IEEE Xplore Open Access Journals |
subjects | Analytical models Bivariate analysis Debugging Empirical analysis Evolution fault localization Fault location Localization logistic regression analysis Logistics Mathematical model Measurement Multivariate analysis Object oriented modeling Predictive models Regression analysis Regression models Software Software testing Tracking systems |
title | Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T08%3A12%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Mulr4FL:%20Effective%20Fault%20Localization%20of%20Evolution%20Software%20Based%20on%20Multivariate%20Logistic%20Regression%20Model&rft.jtitle=IEEE%20access&rft.au=Ju,%20Xiaolin&rft.date=2020&rft.volume=8&rft.spage=207858&rft.epage=207870&rft.pages=207858-207870&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2020.3037235&rft_dat=%3Cproquest_cross%3E2465442983%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2465442983&rft_id=info:pmid/&rft_ieee_id=9253643&rft_doaj_id=oai_doaj_org_article_b93307bb844f4edda843847b824ecfba&rfr_iscdi=true |