ReBucket: A method for clustering duplicate crash reports based on call stack similarity
Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buc...
Main authors: | Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, P. |
---|---|
Format: | Conference proceeding |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | 1093 |
---|---|
container_issue | |
container_start_page | 1084 |
container_title | |
container_volume | |
creator | Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, P. |
description | Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. On average, the F-measure obtained by ReBucket is about 0.88. |
doi_str_mv | 10.1109/ICSE.2012.6227111 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6227111</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6227111</ieee_id><sourcerecordid>6227111</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-8966645b772f633084da031300028653c6abcb671541810bef2d086a4e206a243</originalsourceid><addsrcrecordid>eNo1kMtOAjEUhustEZEHMG76AoPnnE4v4w6JFxISE2XhjnQ6HakMDGnLgreXRFz9iy_58uVn7A5hjAjVw2z6-TwmQBorIo2IZ-wGS6UFgtJwzgYopSmQSF6wUaXNP1Pqkg2ANBSSpL5mo5R-AABRlxWZAfv68E97t_b5kU_4xudV3_C2j9x1-5R9DNtv3ux3XXA2e-6iTSse_a6POfHaJt_wfsud7TqesnVrnsImdDaGfLhlV63tkh-ddsgWL8-L6Vsxf3-dTSfzIlSQC1MdC0tZa02tEgJM2VgQKI6JZJQUTtna1UqjLNEg1L6lBoyypSdQlkoxZPd_2uC9X-5i2Nh4WJ4-Er_yzFTq</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>ReBucket: A method for clustering duplicate crash reports based on call stack similarity</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, P.</creator><creatorcontrib>Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, P.</creatorcontrib><description>Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. 
ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. On average, the F-measure obtained by ReBucket is about 0.88.</description><identifier>ISSN: 0270-5257</identifier><identifier>ISBN: 9781467310666</identifier><identifier>ISBN: 1467310662</identifier><identifier>EISSN: 1558-1225</identifier><identifier>EISBN: 1467310670</identifier><identifier>EISBN: 1467310654</identifier><identifier>EISBN: 9781467310673</identifier><identifier>EISBN: 9781467310659</identifier><identifier>DOI: 10.1109/ICSE.2012.6227111</identifier><language>eng</language><publisher>IEEE</publisher><subject>call stack trace ; clustering ; Computer bugs ; crash reports ; duplicate crash report detection ; Equations ; Mathematical model ; Measurement ; Software ; Training ; WER</subject><ispartof>2012 34th International Conference on Software Engineering (ICSE), 2012, p.1084-1093</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6227111$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2051,27904,54899</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6227111$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Yingnong Dang</creatorcontrib><creatorcontrib>Rongxin Wu</creatorcontrib><creatorcontrib>Hongyu Zhang</creatorcontrib><creatorcontrib>Dongmei Zhang</creatorcontrib><creatorcontrib>Nobel, P.</creatorcontrib><title>ReBucket: A method for clustering duplicate crash reports based on call stack 
similarity</title><title>2012 34th International Conference on Software Engineering (ICSE)</title><addtitle>ICSE</addtitle><description>Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. 
On average, the F-measure obtained by ReBucket is about 0.88.</description><subject>call stack trace</subject><subject>clustering</subject><subject>Computer bugs</subject><subject>crash reports</subject><subject>duplicate crash report detection</subject><subject>Equations</subject><subject>Mathematical model</subject><subject>Measurement</subject><subject>Software</subject><subject>Training</subject><subject>WER</subject><issn>0270-5257</issn><issn>1558-1225</issn><isbn>9781467310666</isbn><isbn>1467310662</isbn><isbn>1467310670</isbn><isbn>1467310654</isbn><isbn>9781467310673</isbn><isbn>9781467310659</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1kMtOAjEUhustEZEHMG76AoPnnE4v4w6JFxISE2XhjnQ6HakMDGnLgreXRFz9iy_58uVn7A5hjAjVw2z6-TwmQBorIo2IZ-wGS6UFgtJwzgYopSmQSF6wUaXNP1Pqkg2ANBSSpL5mo5R-AABRlxWZAfv68E97t_b5kU_4xudV3_C2j9x1-5R9DNtv3ux3XXA2e-6iTSse_a6POfHaJt_wfsud7TqesnVrnsImdDaGfLhlV63tkh-ddsgWL8-L6Vsxf3-dTSfzIlSQC1MdC0tZa02tEgJM2VgQKI6JZJQUTtna1UqjLNEg1L6lBoyypSdQlkoxZPd_2uC9X-5i2Nh4WJ4-Er_yzFTq</recordid><startdate>201206</startdate><enddate>201206</enddate><creator>Yingnong Dang</creator><creator>Rongxin Wu</creator><creator>Hongyu Zhang</creator><creator>Dongmei Zhang</creator><creator>Nobel, P.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201206</creationdate><title>ReBucket: A method for clustering duplicate crash reports based on call stack similarity</title><author>Yingnong Dang ; Rongxin Wu ; Hongyu Zhang ; Dongmei Zhang ; Nobel, 
P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-8966645b772f633084da031300028653c6abcb671541810bef2d086a4e206a243</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><topic>call stack trace</topic><topic>clustering</topic><topic>Computer bugs</topic><topic>crash reports</topic><topic>duplicate crash report detection</topic><topic>Equations</topic><topic>Mathematical model</topic><topic>Measurement</topic><topic>Software</topic><topic>Training</topic><topic>WER</topic><toplevel>online_resources</toplevel><creatorcontrib>Yingnong Dang</creatorcontrib><creatorcontrib>Rongxin Wu</creatorcontrib><creatorcontrib>Hongyu Zhang</creatorcontrib><creatorcontrib>Dongmei Zhang</creatorcontrib><creatorcontrib>Nobel, P.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yingnong Dang</au><au>Rongxin Wu</au><au>Hongyu Zhang</au><au>Dongmei Zhang</au><au>Nobel, P.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>ReBucket: A method for clustering duplicate crash reports based on call stack similarity</atitle><btitle>2012 34th International Conference on Software Engineering 
(ICSE)</btitle><stitle>ICSE</stitle><date>2012-06</date><risdate>2012</risdate><spage>1084</spage><epage>1093</epage><pages>1084-1093</pages><issn>0270-5257</issn><eissn>1558-1225</eissn><isbn>9781467310666</isbn><isbn>1467310662</isbn><eisbn>1467310670</eisbn><eisbn>1467310654</eisbn><eisbn>9781467310673</eisbn><eisbn>9781467310659</eisbn><abstract>Software often crashes. Once a crash happens, a crash report could be sent to software developers for investigation upon user permission. To facilitate efficient handling of crashes, crash reports received by Microsoft's Windows Error Reporting (WER) system are organized into a set of "buckets". Each bucket contains duplicate crash reports that are deemed as manifestations of the same bug. The bucket information is important for prioritizing efforts to resolve crashing bugs. To improve the accuracy of bucketing, we propose ReBucket, a method for clustering crash reports based on call stack matching. ReBucket measures the similarities of call stacks in crash reports and then assigns the reports to appropriate buckets based on the similarity values. We evaluate ReBucket using crash data collected from five widely-used Microsoft products. The results show that ReBucket achieves better overall performance than the existing methods. On average, the F-measure obtained by ReBucket is about 0.88.</abstract><pub>IEEE</pub><doi>10.1109/ICSE.2012.6227111</doi><tpages>10</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0270-5257 |
ispartof | 2012 34th International Conference on Software Engineering (ICSE), 2012, p.1084-1093 |
issn | 0270-5257 ; 1558-1225 |
language | eng |
recordid | cdi_ieee_primary_6227111 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | call stack trace ; clustering ; Computer bugs ; crash reports ; duplicate crash report detection ; Equations ; Mathematical model ; Measurement ; Software ; Training ; WER |
title | ReBucket: A method for clustering duplicate crash reports based on call stack similarity |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T02%3A41%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=ReBucket:%20A%20method%20for%20clustering%20duplicate%20crash%20reports%20based%20on%20call%20stack%20similarity&rft.btitle=2012%2034th%20International%20Conference%20on%20Software%20Engineering%20(ICSE)&rft.au=Yingnong%20Dang&rft.date=2012-06&rft.spage=1084&rft.epage=1093&rft.pages=1084-1093&rft.issn=0270-5257&rft.eissn=1558-1225&rft.isbn=9781467310666&rft.isbn_list=1467310662&rft_id=info:doi/10.1109/ICSE.2012.6227111&rft_dat=%3Cieee_6IE%3E6227111%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1467310670&rft.eisbn_list=1467310654&rft.eisbn_list=9781467310673&rft.eisbn_list=9781467310659&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6227111&rfr_iscdi=true |
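
The description in this record says only that ReBucket measures the similarity of call stacks and assigns crash reports to buckets based on the similarity values; it does not give the similarity measure or the clustering procedure. The sketch below illustrates the general idea under stated assumptions and is not the authors' method: it uses a longest-common-subsequence ratio as a stand-in similarity and a simple greedy threshold rule for bucketing. All names (`similarity`, `bucket_reports`, `threshold`) and the sample frames are hypothetical.

```python
from typing import List

Stack = List[str]  # a call stack as a list of frame names, top frame first


def lcs_length(a: Stack, b: Stack) -> int:
    """Length of the longest common subsequence of two frame lists."""
    prev = [0] * (len(b) + 1)
    for frame_a in a:
        curr = [0]
        for j, frame_b in enumerate(b, start=1):
            if frame_a == frame_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(a: Stack, b: Stack) -> float:
    """Stand-in similarity (assumption): LCS length over the longer stack."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


def bucket_reports(stacks: List[Stack], threshold: float = 0.6) -> List[List[int]]:
    """Greedy bucketing (assumption): put each report into the first bucket
    whose first member is similar enough, otherwise open a new bucket."""
    buckets: List[List[int]] = []
    for i, stack in enumerate(stacks):
        for bucket in buckets:
            if similarity(stack, stacks[bucket[0]]) >= threshold:
                bucket.append(i)
                break
        else:
            buckets.append([i])
    return buckets


if __name__ == "__main__":
    reports = [
        ["memcpy", "copy_buffer", "render", "main"],
        ["memcpy", "copy_buffer", "draw", "render", "main"],
        ["free", "close_file", "shutdown", "main"],
    ]
    print(bucket_reports(reports))
```

The threshold value of 0.6 is arbitrary and chosen only for the example; in practice such a parameter would need to be tuned on labeled crash data.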