Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps

In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to...

Detailed description

Bibliographic details
Published in: arXiv.org 2024-08
Main authors: Wang, Dingbang; Zhao, Yu; Feng, Sidong; Zhang, Zhaoxu; Halfond, William G J; Chen, Chunyang; Sun, Xiaoxia; Shi, Jiangfan; Yu, Tingting
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Wang, Dingbang
Zhao, Yu
Feng, Sidong
Zhang, Zhaoxu
Halfond, William G J
Chen, Chunyang
Sun, Xiaoxia
Shi, Jiangfan
Yu, Tingting
description In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. Additionally, ReBL outperformed three existing tools in both success rate and speed.
doi_str_mv 10.48550/arxiv.2407.05165
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2407_05165</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3077527090</sourcerecordid><originalsourceid>FETCH-LOGICAL-a520-2dae26867bef2de5eb0088ff6b19d5e4525ea203e0e9f9e5628b39f758d5da33</originalsourceid><addsrcrecordid>eNotj81Kw0AURgdBsNQ-gCsHXKfe3MnNJMtYrRUKggouw6RzR1PbTJwkRd_e_rj6NoePc4S4imGaZERwa8JPvZtiAnoKFKd0JkaoVBxlCeKFmHTdGgAw1UikRmIxZ7aVWX1F96HecSOLofdb07OV759-w_Ju-JAv3PrQHyZ4O6z62jfS-SCLxgZfW1m0bXcpzp3ZdDz537F4nT-8zRbR8vnxaVYsI0MIEVrDmGaprtihZeIKIMucS6s4t8QJIbFBUAycu5wpxaxSudOUWbJGqbG4Pr0eK8s21FsTfstDbXms3RM3J2Lv-j1w15drP4Rmr1Qq0JpQQw7qD5hkV1s</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3077527090</pqid></control><display><type>article</type><title>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</title><source>Freely Accessible Journals</source><source>arXiv.org</source><creator>Wang, Dingbang ; Zhao, Yu ; Feng, Sidong ; Zhang, Zhaoxu ; Halfond, William G J ; Chen, Chunyang ; Sun, Xiaoxia ; Shi, Jiangfan ; Yu, Tingting</creator><creatorcontrib>Wang, Dingbang ; Zhao, Yu ; Feng, Sidong ; Zhang, Zhaoxu ; Halfond, William G J ; Chen, Chunyang ; Sun, Xiaoxia ; Shi, Jiangfan ; Yu, Tingting</creatorcontrib><description>In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. 
In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. Additionally, ReBL outperformed three existing tools in both success rate and speed.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2407.05165</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computer Science - Software Engineering ; Debugging ; Feedback ; Large language models ; Software development</subject><ispartof>arXiv.org, 2024-08</ispartof><rights>2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,785,886,27930</link.rule.ids><backlink>$$Uhttps://doi.org/10.1145/3650212.3680341$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2407.05165$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Dingbang</creatorcontrib><creatorcontrib>Zhao, Yu</creatorcontrib><creatorcontrib>Feng, Sidong</creatorcontrib><creatorcontrib>Zhang, Zhaoxu</creatorcontrib><creatorcontrib>Halfond, William G J</creatorcontrib><creatorcontrib>Chen, Chunyang</creatorcontrib><creatorcontrib>Sun, Xiaoxia</creatorcontrib><creatorcontrib>Shi, 
Jiangfan</creatorcontrib><creatorcontrib>Yu, Tingting</creatorcontrib><title>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</title><title>arXiv.org</title><description>In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. 
Additionally, ReBL outperformed three existing tools in both success rate and speed.</description><subject>Computer Science - Software Engineering</subject><subject>Debugging</subject><subject>Feedback</subject><subject>Large language models</subject><subject>Software development</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotj81Kw0AURgdBsNQ-gCsHXKfe3MnNJMtYrRUKggouw6RzR1PbTJwkRd_e_rj6NoePc4S4imGaZERwa8JPvZtiAnoKFKd0JkaoVBxlCeKFmHTdGgAw1UikRmIxZ7aVWX1F96HecSOLofdb07OV759-w_Ju-JAv3PrQHyZ4O6z62jfS-SCLxgZfW1m0bXcpzp3ZdDz537F4nT-8zRbR8vnxaVYsI0MIEVrDmGaprtihZeIKIMucS6s4t8QJIbFBUAycu5wpxaxSudOUWbJGqbG4Pr0eK8s21FsTfstDbXms3RM3J2Lv-j1w15drP4Rmr1Qq0JpQQw7qD5hkV1s</recordid><startdate>20240828</startdate><enddate>20240828</enddate><creator>Wang, Dingbang</creator><creator>Zhao, Yu</creator><creator>Feng, Sidong</creator><creator>Zhang, Zhaoxu</creator><creator>Halfond, William G J</creator><creator>Chen, Chunyang</creator><creator>Sun, Xiaoxia</creator><creator>Shi, Jiangfan</creator><creator>Yu, Tingting</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240828</creationdate><title>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</title><author>Wang, Dingbang ; Zhao, Yu ; Feng, Sidong ; Zhang, Zhaoxu ; Halfond, 
William G J ; Chen, Chunyang ; Sun, Xiaoxia ; Shi, Jiangfan ; Yu, Tingting</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a520-2dae26867bef2de5eb0088ff6b19d5e4525ea203e0e9f9e5628b39f758d5da33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Software Engineering</topic><topic>Debugging</topic><topic>Feedback</topic><topic>Large language models</topic><topic>Software development</topic><toplevel>online_resources</toplevel><creatorcontrib>Wang, Dingbang</creatorcontrib><creatorcontrib>Zhao, Yu</creatorcontrib><creatorcontrib>Feng, Sidong</creatorcontrib><creatorcontrib>Zhang, Zhaoxu</creatorcontrib><creatorcontrib>Halfond, William G J</creatorcontrib><creatorcontrib>Chen, Chunyang</creatorcontrib><creatorcontrib>Sun, Xiaoxia</creatorcontrib><creatorcontrib>Shi, Jiangfan</creatorcontrib><creatorcontrib>Yu, Tingting</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering 
Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Dingbang</au><au>Zhao, Yu</au><au>Feng, Sidong</au><au>Zhang, Zhaoxu</au><au>Halfond, William G J</au><au>Chen, Chunyang</au><au>Sun, Xiaoxia</au><au>Shi, Jiangfan</au><au>Yu, Tingting</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</atitle><jtitle>arXiv.org</jtitle><date>2024-08-28</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. Additionally, ReBL outperformed three existing tools in both success rate and speed.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2407.05165</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-08
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2407_05165
source Freely Accessible Journals; arXiv.org
subjects Computer Science - Software Engineering
Debugging
Feedback
Large language models
Software development
title Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T13%3A08%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Feedback-Driven%20Automated%20Whole%20Bug%20Report%20Reproduction%20for%20Android%20Apps&rft.jtitle=arXiv.org&rft.au=Wang,%20Dingbang&rft.date=2024-08-28&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2407.05165&rft_dat=%3Cproquest_arxiv%3E3077527090%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3077527090&rft_id=info:pmid/&rfr_iscdi=true