ReBucket: A method for clustering duplicate crash reports based on call stack similarity

Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buc...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Yingnong Dang, Rongxin Wu, Hongyu Zhang, Dongmei Zhang, Nobel, P.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1093
container_issue
container_start_page 1084
container_title
container_volume
creator Yingnong Dang
Rongxin Wu
Hongyu Zhang
Dongmei Zhang
Nobel, P.
description Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. On average, the F-measure obtained by ReBucket is about 0.88.
doi_str_mv 10.1109/ICSE.2012.6227111
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6227111</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6227111</ieee_id><sourcerecordid>6227111</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-8966645b772f633084da031300028653c6abcb671541810bef2d086a4e206a243</originalsourceid><addsrcrecordid>eNo1kMtOAjEUhustEZEHMG76AoPnnE4v4w6JFxISE2XhjnQ6HakMDGnLgreXRFz9iy_58uVn7A5hjAjVw2z6-TwmQBorIo2IZ-wGS6UFgtJwzgYopSmQSF6wUaXNP1Pqkg2ANBSSpL5mo5R-AABRlxWZAfv68E97t_b5kU_4xudV3_C2j9x1-5R9DNtv3ux3XXA2e-6iTSse_a6POfHaJt_wfsud7TqesnVrnsImdDaGfLhlV63tkh-ddsgWL8-L6Vsxf3-dTSfzIlSQC1MdC0tZa02tEgJM2VgQKI6JZJQUTtna1UqjLNEg1L6lBoyypSdQlkoxZPd_2uC9X-5i2Nh4WJ4-Er_yzFTq</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>ReBucket: A method for clustering duplicate crash reports based on call stack similarity</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, P.</creator><creatorcontrib>Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, P.</creatorcontrib><description>Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. On average, the F-measure obtained by ReBucket is about 0.88.</description><identifier>ISSN: 0270-5257</identifier><identifier>ISBN: 9781467310666</identifier><identifier>ISBN: 1467310662</identifier><identifier>EISSN: 1558-1225</identifier><identifier>EISBN: 1467310670</identifier><identifier>EISBN: 1467310654</identifier><identifier>EISBN: 9781467310673</identifier><identifier>EISBN: 9781467310659</identifier><identifier>DOI: 10.1109/ICSE.2012.6227111</identifier><language>eng</language><publisher>IEEE</publisher><subject>call stack trace ; clustering ; Computer bugs ; crash reports ; duplicate crash report detection ; Equations ; Mathematical model ; Measurement ; Software ; Training ; WER</subject><ispartof>2012 34th International Conference on Software Engineering (ICSE), 2012, p.1084-1093</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6227111$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2051,27904,54899</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6227111$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Yingnong Dang</creatorcontrib><creatorcontrib>Rongxin Wu</creatorcontrib><creatorcontrib>Hongyu Zhang</creatorcontrib><creatorcontrib>Dongmei Zhang</creatorcontrib><creatorcontrib>Nobel, P.</creatorcontrib><title>ReBucket: A method for clustering duplicate crash reports based on call stack similarity</title><title>2012 34th International Conference on Software Engineering (ICSE)</title><addtitle>ICSE</addtitle><description>Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. On average, the F-measure obtained by ReBucket is about 0.88.</description><subject>call stack trace</subject><subject>clustering</subject><subject>Computer bugs</subject><subject>crash reports</subject><subject>duplicate crash report detection</subject><subject>Equations</subject><subject>Mathematical model</subject><subject>Measurement</subject><subject>Software</subject><subject>Training</subject><subject>WER</subject><issn>0270-5257</issn><issn>1558-1225</issn><isbn>9781467310666</isbn><isbn>1467310662</isbn><isbn>1467310670</isbn><isbn>1467310654</isbn><isbn>9781467310673</isbn><isbn>9781467310659</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1kMtOAjEUhustEZEHMG76AoPnnE4v4w6JFxISE2XhjnQ6HakMDGnLgreXRFz9iy_58uVn7A5hjAjVw2z6-TwmQBorIo2IZ-wGS6UFgtJwzgYopSmQSF6wUaXNP1Pqkg2ANBSSpL5mo5R-AABRlxWZAfv68E97t_b5kU_4xudV3_C2j9x1-5R9DNtv3ux3XXA2e-6iTSse_a6POfHaJt_wfsud7TqesnVrnsImdDaGfLhlV63tkh-ddsgWL8-L6Vsxf3-dTSfzIlSQC1MdC0tZa02tEgJM2VgQKI6JZJQUTtna1UqjLNEg1L6lBoyypSdQlkoxZPd_2uC9X-5i2Nh4WJ4-Er_yzFTq</recordid><startdate>201206</startdate><enddate>201206</enddate><creator>Yingnong Dang</creator><creator>Rongxin Wu</creator><creator>Hongyu Zhang</creator><creator>Dongmei Zhang</creator><creator>Nobel, P.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201206</creationdate><title>ReBucket: A method for clustering duplicate crash reports based on call stack similarity</title><author>Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-8966645b772f633084da031300028653c6abcb671541810bef2d086a4e206a243</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><topic>call stack trace</topic><topic>clustering</topic><topic>Computer bugs</topic><topic>crash reports</topic><topic>duplicate crash report detection</topic><topic>Equations</topic><topic>Mathematical model</topic><topic>Measurement</topic><topic>Software</topic><topic>Training</topic><topic>WER</topic><toplevel>online_resources</toplevel><creatorcontrib>Yingnong Dang</creatorcontrib><creatorcontrib>Rongxin Wu</creatorcontrib><creatorcontrib>Hongyu Zhang</creatorcontrib><creatorcontrib>Dongmei Zhang</creatorcontrib><creatorcontrib>Nobel, P.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yingnong Dang</au><au>Rongxin Wu</au><au>Hongyu Zhang</au><au>Dongmei Zhang</au><au>Nobel, P.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>ReBucket: A method for clustering duplicate crash reports based on call stack similarity</atitle><btitle>2012 34th International Conference on Software Engineering (ICSE)</btitle><stitle>ICSE</stitle><date>2012-06</date><risdate>2012</risdate><spage>1084</spage><epage>1093</epage><pages>1084-1093</pages><issn>0270-5257</issn><eissn>1558-1225</eissn><isbn>9781467310666</isbn><isbn>1467310662</isbn><eisbn>1467310670</eisbn><eisbn>1467310654</eisbn><eisbn>9781467310673</eisbn><eisbn>9781467310659</eisbn><abstract>Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. On average, the F-measure obtained by ReBucket is about 0.88.</abstract><pub>IEEE</pub><doi>10.1109/ICSE.2012.6227111</doi><tpages>10</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0270-5257
ispartof 2012 34th International Conference on Software Engineering (ICSE), 2012, p.1084-1093
issn 0270-5257
1558-1225
language eng
recordid cdi_ieee_primary_6227111
source IEEE Electronic Library (IEL) Conference Proceedings
subjects call stack trace
clustering
Computer bugs
crash reports
duplicate crash report detection
Equations
Mathematical model
Measurement
Software
Training
WER
title ReBucket: A method for clustering duplicate crash reports based on call stack similarity
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T02%3A41%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=ReBucket:%20A%20method%20for%20clustering%20duplicate%20crash%20reports%20based%20on%20call%20stack%20similarity&rft.btitle=2012%2034th%20International%20Conference%20on%20Software%20Engineering%20(ICSE)&rft.au=Yingnong%20Dang&rft.date=2012-06&rft.spage=1084&rft.epage=1093&rft.pages=1084-1093&rft.issn=0270-5257&rft.eissn=1558-1225&rft.isbn=9781467310666&rft.isbn_list=1467310662&rft_id=info:doi/10.1109/ICSE.2012.6227111&rft_dat=%3Cieee_6IE%3E6227111%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1467310670&rft.eisbn_list=1467310654&rft.eisbn_list=9781467310673&rft.eisbn_list=9781467310659&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6227111&rfr_iscdi=true