Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model

Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2020, Vol.8, p.207858-207870
Hauptverfasser: Ju, Xiaolin, Qian, Jie, Chen, Zhihua, Zhao, Chunyu, Qian, Junyan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 207870
container_issue
container_start_page 207858
container_title IEEE access
container_volume 8
creator Ju, Xiaolin
Qian, Jie
Chen, Zhihua
Zhao, Chunyu
Qian, Junyan
description Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques.
doi_str_mv 10.1109/ACCESS.2020.3037235
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2020_3037235</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9253643</ieee_id><doaj_id>oai_doaj_org_article_b93307bb844f4edda843847b824ecfba</doaj_id><sourcerecordid>2465442983</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</originalsourceid><addsrcrecordid>eNpNkVFLwzAUhYsoOHS_YC8FnzvTJG0T3-bYdNAhOH0OSXszMusyk3aiv95sHcO8JDnc79wDJ4pGKRqnKeL3k-l0tlqNMcJoTBApMMkuogFOc56QjOSX_97X0dD7DQqHBSkrBtHHsmscnZcP8UxrqFqzh3guu6aNS1vJxvzK1thtbHU829umO35WVrff0kH8KD3UcVCCSSClM7KFAK6Nb00Vv8LagfcHZGlraG6jKy0bD8PTfRO9z2dv0-ekfHlaTCdlUlHE2kQVhIVsNWCeUVAYsRxQlUsiEfCMpDgNkkaUK1TnlGklNXClM4wgqxDm5CZa9L61lRuxc-ZTuh9hpRFHwbq1kC4EbEAoTggqlGKUagp1LRkljBaKYQpVcA5ed73XztmvDnwrNrZz2xBfYJpnlGLOSJgi_VTlrPcO9HlrisShJNGXJA4liVNJgRr1lAGAM8FxKIoS8geJz42J</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2465442983</pqid></control><display><type>article</type><title>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>IEEE Xplore Open Access Journals</source><creator>Ju, Xiaolin ; Qian, Jie ; Chen, Zhihua ; Zhao, Chunyu ; Qian, Junyan</creator><creatorcontrib>Ju, Xiaolin ; Qian, Jie ; Chen, Zhihua ; Zhao, Chunyu ; Qian, Junyan</creatorcontrib><description>Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2020.3037235</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Analytical models ; Bivariate analysis ; Debugging ; Empirical analysis ; Evolution ; fault localization ; Fault location ; Localization ; logistic regression analysis ; Logistics ; Mathematical model ; Measurement ; Multivariate analysis ; Object oriented modeling ; Predictive models ; Regression analysis ; Regression models ; Software ; Software testing ; Tracking systems</subject><ispartof>IEEE access, 2020, Vol.8, p.207858-207870</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</citedby><cites>FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</cites><orcidid>0000-0003-2579-5359</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9253643$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,860,2096,4010,27610,27900,27901,27902,54908</link.rule.ids></links><search><creatorcontrib>Ju, Xiaolin</creatorcontrib><creatorcontrib>Qian, Jie</creatorcontrib><creatorcontrib>Chen, Zhihua</creatorcontrib><creatorcontrib>Zhao, Chunyu</creatorcontrib><creatorcontrib>Qian, Junyan</creatorcontrib><title>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</title><title>IEEE access</title><addtitle>Access</addtitle><description>Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques.</description><subject>Analytical models</subject><subject>Bivariate analysis</subject><subject>Debugging</subject><subject>Empirical analysis</subject><subject>Evolution</subject><subject>fault localization</subject><subject>Fault location</subject><subject>Localization</subject><subject>logistic regression analysis</subject><subject>Logistics</subject><subject>Mathematical model</subject><subject>Measurement</subject><subject>Multivariate analysis</subject><subject>Object oriented modeling</subject><subject>Predictive models</subject><subject>Regression analysis</subject><subject>Regression models</subject><subject>Software</subject><subject>Software testing</subject><subject>Tracking systems</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNkVFLwzAUhYsoOHS_YC8FnzvTJG0T3-bYdNAhOH0OSXszMusyk3aiv95sHcO8JDnc79wDJ4pGKRqnKeL3k-l0tlqNMcJoTBApMMkuogFOc56QjOSX_97X0dD7DQqHBSkrBtHHsmscnZcP8UxrqFqzh3guu6aNS1vJxvzK1thtbHU829umO35WVrff0kH8KD3UcVCCSSClM7KFAK6Nb00Vv8LagfcHZGlraG6jKy0bD8PTfRO9z2dv0-ekfHlaTCdlUlHE2kQVhIVsNWCeUVAYsRxQlUsiEfCMpDgNkkaUK1TnlGklNXClM4wgqxDm5CZa9L61lRuxc-ZTuh9hpRFHwbq1kC4EbEAoTggqlGKUagp1LRkljBaKYQpVcA5ed73XztmvDnwrNrZz2xBfYJpnlGLOSJgi_VTlrPcO9HlrisShJNGXJA4liVNJgRr1lAGAM8FxKIoS8geJz42J</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Ju, Xiaolin</creator><creator>Qian, Jie</creator><creator>Chen, Zhihua</creator><creator>Zhao, Chunyu</creator><creator>Qian, Junyan</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-2579-5359</orcidid></search><sort><creationdate>2020</creationdate><title>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</title><author>Ju, Xiaolin ; Qian, Jie ; Chen, Zhihua ; Zhao, Chunyu ; Qian, Junyan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-b738957de2954eb2086e0c6a3a0e953121208f049b0d648fbafe9bf520e5c0293</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Analytical models</topic><topic>Bivariate analysis</topic><topic>Debugging</topic><topic>Empirical analysis</topic><topic>Evolution</topic><topic>fault localization</topic><topic>Fault location</topic><topic>Localization</topic><topic>logistic regression analysis</topic><topic>Logistics</topic><topic>Mathematical model</topic><topic>Measurement</topic><topic>Multivariate analysis</topic><topic>Object oriented modeling</topic><topic>Predictive models</topic><topic>Regression analysis</topic><topic>Regression models</topic><topic>Software</topic><topic>Software testing</topic><topic>Tracking systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ju, Xiaolin</creatorcontrib><creatorcontrib>Qian, Jie</creatorcontrib><creatorcontrib>Chen, Zhihua</creatorcontrib><creatorcontrib>Zhao, Chunyu</creatorcontrib><creatorcontrib>Qian, Junyan</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Xplore Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ju, Xiaolin</au><au>Qian, Jie</au><au>Chen, Zhihua</au><au>Zhao, Chunyu</au><au>Qian, Junyan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2020</date><risdate>2020</risdate><volume>8</volume><spage>207858</spage><epage>207870</epage><pages>207858-207870</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Fault localization is indeed tedious and costly work during software maintenance. Studies indicate that combining both structural features and behavior characteristics of programs can be beneficial for improving the efficiency of fault locating. In this paper, we proposed a framework, called Mulr4FL, for fault localization using a multivariate logistic regression model that combined both static and dynamic features collected from the program under debugging. Firstly, the hybrid metrics data set, with both program structural features and behavior characteristics combined, is constructed by static program analyzing and dynamically tracing that runs with a designed metrics set. Meanwhile, the fault information of the legacy program is also obtained from the bug tracking system. Secondly, Bivariate logistic analysis is performed to filter the metrics that are significantly related to faults, and then with the selected metrics and their measurements, a multivariate logistic regression model was constructed and trained. Finally, based on the trained logistic model, we conduct the multivariate logistic analysis on the features of the evolved software and predict the buggy class methods. An empirical study was conducted based on a set of benchmarks that are used widely in program debugging research. The results indicate that the Mulr4FL can significantly improve the effectiveness of locating faults in contrast to 5 baseline techniques.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2020.3037235</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0003-2579-5359</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2020, Vol.8, p.207858-207870
issn 2169-3536
2169-3536
language eng
recordid cdi_crossref_primary_10_1109_ACCESS_2020_3037235
source DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; IEEE Xplore Open Access Journals
subjects Analytical models
Bivariate analysis
Debugging
Empirical analysis
Evolution
fault localization
Fault location
Localization
logistic regression analysis
Logistics
Mathematical model
Measurement
Multivariate analysis
Object oriented modeling
Predictive models
Regression analysis
Regression models
Software
Software testing
Tracking systems
title Mulr4FL: Effective Fault Localization of Evolution Software Based on Multivariate Logistic Regression Model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T08%3A12%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Mulr4FL:%20Effective%20Fault%20Localization%20of%20Evolution%20Software%20Based%20on%20Multivariate%20Logistic%20Regression%20Model&rft.jtitle=IEEE%20access&rft.au=Ju,%20Xiaolin&rft.date=2020&rft.volume=8&rft.spage=207858&rft.epage=207870&rft.pages=207858-207870&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2020.3037235&rft_dat=%3Cproquest_cross%3E2465442983%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2465442983&rft_id=info:pmid/&rft_ieee_id=9253643&rft_doaj_id=oai_doaj_org_article_b93307bb844f4edda843847b824ecfba&rfr_iscdi=true