Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps

In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2024-08
Hauptverfasser:	Wang, Dingbang, Zhao, Yu, Feng, Sidong, Zhang, Zhaoxu, Halfond, William G J, Chen, Chunyang, Sun, Xiaoxia, Shi, Jiangfan, Yu, Tingting
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Software Engineering Debugging Feedback Large language models Software development
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Wang, Dingbang Zhao, Yu Feng, Sidong Zhang, Zhaoxu Halfond, William G J Chen, Chunyang Sun, Xiaoxia Shi, Jiangfan Yu, Tingting
description	In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. Additionally, ReBL outperformed three existing tools in both success rate and speed.
doi_str_mv	10.48550/arxiv.2407.05165
format	Article
fullrecord	<record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2407_05165</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3077527090</sourcerecordid><originalsourceid>FETCH-LOGICAL-a520-2dae26867bef2de5eb0088ff6b19d5e4525ea203e0e9f9e5628b39f758d5da33</originalsourceid><addsrcrecordid>eNotj81Kw0AURgdBsNQ-gCsHXKfe3MnNJMtYrRUKggouw6RzR1PbTJwkRd_e_rj6NoePc4S4imGaZERwa8JPvZtiAnoKFKd0JkaoVBxlCeKFmHTdGgAw1UikRmIxZ7aVWX1F96HecSOLofdb07OV759-w_Ju-JAv3PrQHyZ4O6z62jfS-SCLxgZfW1m0bXcpzp3ZdDz537F4nT-8zRbR8vnxaVYsI0MIEVrDmGaprtihZeIKIMucS6s4t8QJIbFBUAycu5wpxaxSudOUWbJGqbG4Pr0eK8s21FsTfstDbXms3RM3J2Lv-j1w15drP4Rmr1Qq0JpQQw7qD5hkV1s</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3077527090</pqid></control><display><type>article</type><title>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</title><source>Freely Accessible Journals</source><source>arXiv.org</source><creator>Wang, Dingbang ; Zhao, Yu ; Feng, Sidong ; Zhang, Zhaoxu ; Halfond, William G J ; Chen, Chunyang ; Sun, Xiaoxia ; Shi, Jiangfan ; Yu, Tingting</creator><creatorcontrib>Wang, Dingbang ; Zhao, Yu ; Feng, Sidong ; Zhang, Zhaoxu ; Halfond, William G J ; Chen, Chunyang ; Sun, Xiaoxia ; Shi, Jiangfan ; Yu, Tingting</creatorcontrib><description>In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. Additionally, ReBL outperformed three existing tools in both success rate and speed.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2407.05165</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computer Science - Software Engineering ; Debugging ; Feedback ; Large language models ; Software development</subject><ispartof>arXiv.org, 2024-08</ispartof><rights>2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,785,886,27930</link.rule.ids><backlink>$$Uhttps://doi.org/10.1145/3650212.3680341$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2407.05165$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Dingbang</creatorcontrib><creatorcontrib>Zhao, Yu</creatorcontrib><creatorcontrib>Feng, Sidong</creatorcontrib><creatorcontrib>Zhang, Zhaoxu</creatorcontrib><creatorcontrib>Halfond, William G J</creatorcontrib><creatorcontrib>Chen, Chunyang</creatorcontrib><creatorcontrib>Sun, Xiaoxia</creatorcontrib><creatorcontrib>Shi, Jiangfan</creatorcontrib><creatorcontrib>Yu, Tingting</creatorcontrib><title>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</title><title>arXiv.org</title><description>In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. Additionally, ReBL outperformed three existing tools in both success rate and speed.</description><subject>Computer Science - Software Engineering</subject><subject>Debugging</subject><subject>Feedback</subject><subject>Large language models</subject><subject>Software development</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotj81Kw0AURgdBsNQ-gCsHXKfe3MnNJMtYrRUKggouw6RzR1PbTJwkRd_e_rj6NoePc4S4imGaZERwa8JPvZtiAnoKFKd0JkaoVBxlCeKFmHTdGgAw1UikRmIxZ7aVWX1F96HecSOLofdb07OV759-w_Ju-JAv3PrQHyZ4O6z62jfS-SCLxgZfW1m0bXcpzp3ZdDz537F4nT-8zRbR8vnxaVYsI0MIEVrDmGaprtihZeIKIMucS6s4t8QJIbFBUAycu5wpxaxSudOUWbJGqbG4Pr0eK8s21FsTfstDbXms3RM3J2Lv-j1w15drP4Rmr1Qq0JpQQw7qD5hkV1s</recordid><startdate>20240828</startdate><enddate>20240828</enddate><creator>Wang, Dingbang</creator><creator>Zhao, Yu</creator><creator>Feng, Sidong</creator><creator>Zhang, Zhaoxu</creator><creator>Halfond, William G J</creator><creator>Chen, Chunyang</creator><creator>Sun, Xiaoxia</creator><creator>Shi, Jiangfan</creator><creator>Yu, Tingting</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240828</creationdate><title>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</title><author>Wang, Dingbang ; Zhao, Yu ; Feng, Sidong ; Zhang, Zhaoxu ; Halfond, William G J ; Chen, Chunyang ; Sun, Xiaoxia ; Shi, Jiangfan ; Yu, Tingting</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a520-2dae26867bef2de5eb0088ff6b19d5e4525ea203e0e9f9e5628b39f758d5da33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Software Engineering</topic><topic>Debugging</topic><topic>Feedback</topic><topic>Large language models</topic><topic>Software development</topic><toplevel>online_resources</toplevel><creatorcontrib>Wang, Dingbang</creatorcontrib><creatorcontrib>Zhao, Yu</creatorcontrib><creatorcontrib>Feng, Sidong</creatorcontrib><creatorcontrib>Zhang, Zhaoxu</creatorcontrib><creatorcontrib>Halfond, William G J</creatorcontrib><creatorcontrib>Chen, Chunyang</creatorcontrib><creatorcontrib>Sun, Xiaoxia</creatorcontrib><creatorcontrib>Shi, Jiangfan</creatorcontrib><creatorcontrib>Yu, Tingting</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Dingbang</au><au>Zhao, Yu</au><au>Feng, Sidong</au><au>Zhang, Zhaoxu</au><au>Halfond, William G J</au><au>Chen, Chunyang</au><au>Sun, Xiaoxia</au><au>Shi, Jiangfan</au><au>Yu, Tingting</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps</atitle><jtitle>arXiv.org</jtitle><date>2024-08-28</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model (LLM), to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative prompts to enhance GPT's contextual reasoning. This approach is more flexible and context-aware than the traditional step-by-step entity matching approach, resulting in improved accuracy and effectiveness. In addition to handling crash reports, ReBL has the capability of handling non-crash functional bug reports. Our evaluation of 96 Android bug reports (73 crash and 23 non-crash) demonstrates that ReBL successfully reproduced 90.63% of these reports, averaging only 74.98 seconds per bug report. Additionally, ReBL outperformed three existing tools in both success rate and speed.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2407.05165</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-08
issn	2331-8422
language	eng
recordid	cdi_arxiv_primary_2407_05165
source	Freely Accessible Journals; arXiv.org
subjects	Computer Science - Software Engineering Debugging Feedback Large language models Software development
title	Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T13%3A08%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Feedback-Driven%20Automated%20Whole%20Bug%20Report%20Reproduction%20for%20Android%20Apps&rft.jtitle=arXiv.org&rft.au=Wang,%20Dingbang&rft.date=2024-08-28&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2407.05165&rft_dat=%3Cproquest_arxiv%3E3077527090%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3077527090&rft_id=info:pmid/&rfr_iscdi=true