Mining authorship characteristics in bug repositories
Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixin...
Gespeichert in:
Veröffentlicht in: | Science China. Information sciences 2017, Vol.60 (1), p.96-111, Article 012107 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 111 |
---|---|
container_issue | 1 |
container_start_page | 96 |
container_title | Science China. Information sciences |
container_volume | 60 |
creator | Jiang, He Zhang, Jingxuan Ma, Hongjing Nazar, Najam Ren, Zhilei |
description | Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However,no in-depth investigation has been conducted over the authorship characteristics. In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection(NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization(BRS). Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a stateof-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks. |
doi_str_mv | 10.1007/s11432-014-0372-y |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2918636767</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>671177090</cqvip_id><sourcerecordid>2918636767</sourcerecordid><originalsourceid>FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</originalsourceid><addsrcrecordid>eNp9kD9PwzAQxS0EEhX0A7BFMBvu4tiOR1TxTypiAYnNSh0ncQVxaidDvz2uUsHGLXfD-72ne4RcIdwigLyLiAXLKWBBgcmc7k_IAkuhKCpUp-kWsqCSsc9zsoxxC2kYg1yWC8JfXe_6NqumsfMhdm7ITFeFyow2uDg6EzPXZ5upzYIdfHSjD87GS3LWVF_RLo_7gnw8Pryvnun67elldb-mhikcKTLFQZZ1haawxUblqhalyLktqkYpzgVyRAEgyhrKhtkNclbXyvBGgJCNZBfkZvYdgt9NNo5666fQp0idq_QhE1IcVDirTPAxBtvoIbjvKuw1gj4UpOeCdCpIHwrS-8TkMxOTtm9t-HP-D7o-BnW-b3eJ-00SElFKUMB-AI17csk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918636767</pqid></control><display><type>article</type><title>Mining authorship characteristics in bug repositories</title><source>ProQuest Central UK/Ireland</source><source>Alma/SFX Local Collection</source><source>SpringerLink Journals - AutoHoldings</source><source>ProQuest Central</source><creator>Jiang, He ; Zhang, Jingxuan ; Ma, Hongjing ; Nazar, Najam ; Ren, Zhilei</creator><creatorcontrib>Jiang, He ; Zhang, Jingxuan ; Ma, Hongjing ; Nazar, Najam ; Ren, Zhilei</creatorcontrib><description>Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However,no in-depth investigation has been conducted over the authorship characteristics. In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection(NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization(BRS). Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a stateof-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks.</description><identifier>ISSN: 1674-733X</identifier><identifier>EISSN: 1869-1919</identifier><identifier>DOI: 10.1007/s11432-014-0372-y</identifier><language>eng</language><publisher>Beijing: Science China Press</publisher><subject>Authorship ; Computer Science ; Debugging ; Information Systems and Communication Service ; Maintenance ; N-gram模型 ; Research Paper ; Software ; 作者 ; 开放源代码 ; 挖掘 ; 特征和 ; 维护软件 ; 缺陷 ; 错误报告</subject><ispartof>Science China. Information sciences, 2017, Vol.60 (1), p.96-111, Article 012107</ispartof><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2016</rights><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2016.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</citedby><cites>FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/84009A/84009A.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11432-014-0372-y$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2918636767?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,4024,21388,27923,27924,27925,33744,41488,42557,43805,51319,64385,64389,72469</link.rule.ids></links><search><creatorcontrib>Jiang, He</creatorcontrib><creatorcontrib>Zhang, Jingxuan</creatorcontrib><creatorcontrib>Ma, Hongjing</creatorcontrib><creatorcontrib>Nazar, Najam</creatorcontrib><creatorcontrib>Ren, Zhilei</creatorcontrib><title>Mining authorship characteristics in bug repositories</title><title>Science China. Information sciences</title><addtitle>Sci. China Inf. Sci</addtitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><description>Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However,no in-depth investigation has been conducted over the authorship characteristics. In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection(NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization(BRS). Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a stateof-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks.</description><subject>Authorship</subject><subject>Computer Science</subject><subject>Debugging</subject><subject>Information Systems and Communication Service</subject><subject>Maintenance</subject><subject>N-gram模型</subject><subject>Research Paper</subject><subject>Software</subject><subject>作者</subject><subject>开放源代码</subject><subject>挖掘</subject><subject>特征和</subject><subject>维护软件</subject><subject>缺陷</subject><subject>错误报告</subject><issn>1674-733X</issn><issn>1869-1919</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kD9PwzAQxS0EEhX0A7BFMBvu4tiOR1TxTypiAYnNSh0ncQVxaidDvz2uUsHGLXfD-72ne4RcIdwigLyLiAXLKWBBgcmc7k_IAkuhKCpUp-kWsqCSsc9zsoxxC2kYg1yWC8JfXe_6NqumsfMhdm7ITFeFyow2uDg6EzPXZ5upzYIdfHSjD87GS3LWVF_RLo_7gnw8Pryvnun67elldb-mhikcKTLFQZZ1haawxUblqhalyLktqkYpzgVyRAEgyhrKhtkNclbXyvBGgJCNZBfkZvYdgt9NNo5666fQp0idq_QhE1IcVDirTPAxBtvoIbjvKuw1gj4UpOeCdCpIHwrS-8TkMxOTtm9t-HP-D7o-BnW-b3eJ-00SElFKUMB-AI17csk</recordid><startdate>2017</startdate><enddate>2017</enddate><creator>Jiang, He</creator><creator>Zhang, Jingxuan</creator><creator>Ma, Hongjing</creator><creator>Nazar, Najam</creator><creator>Ren, Zhilei</creator><general>Science China Press</general><general>Springer Nature B.V</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W92</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>2017</creationdate><title>Mining authorship characteristics in bug repositories</title><author>Jiang, He ; Zhang, Jingxuan ; Ma, Hongjing ; Nazar, Najam ; Ren, Zhilei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Authorship</topic><topic>Computer Science</topic><topic>Debugging</topic><topic>Information Systems and Communication Service</topic><topic>Maintenance</topic><topic>N-gram模型</topic><topic>Research Paper</topic><topic>Software</topic><topic>作者</topic><topic>开放源代码</topic><topic>挖掘</topic><topic>特征和</topic><topic>维护软件</topic><topic>缺陷</topic><topic>错误报告</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jiang, He</creatorcontrib><creatorcontrib>Zhang, Jingxuan</creatorcontrib><creatorcontrib>Ma, Hongjing</creatorcontrib><creatorcontrib>Nazar, Najam</creatorcontrib><creatorcontrib>Ren, Zhilei</creatorcontrib><collection>中文科技期刊数据库</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>中文科技期刊数据库-7.0平台</collection><collection>中文科技期刊数据库-工程技术</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Science China. Information sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jiang, He</au><au>Zhang, Jingxuan</au><au>Ma, Hongjing</au><au>Nazar, Najam</au><au>Ren, Zhilei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Mining authorship characteristics in bug repositories</atitle><jtitle>Science China. Information sciences</jtitle><stitle>Sci. China Inf. Sci</stitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><date>2017</date><risdate>2017</risdate><volume>60</volume><issue>1</issue><spage>96</spage><epage>111</epage><pages>96-111</pages><artnum>012107</artnum><issn>1674-733X</issn><eissn>1869-1919</eissn><abstract>Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However,no in-depth investigation has been conducted over the authorship characteristics. In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection(NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization(BRS). Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a stateof-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks.</abstract><cop>Beijing</cop><pub>Science China Press</pub><doi>10.1007/s11432-014-0372-y</doi><tpages>16</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1674-733X |
ispartof | Science China. Information sciences, 2017, Vol.60 (1), p.96-111, Article 012107 |
issn | 1674-733X 1869-1919 |
language | eng |
recordid | cdi_proquest_journals_2918636767 |
source | ProQuest Central UK/Ireland; Alma/SFX Local Collection; SpringerLink Journals - AutoHoldings; ProQuest Central |
subjects | Authorship Computer Science Debugging Information Systems and Communication Service Maintenance N-gram模型 Research Paper Software 作者 开放源代码 挖掘 特征和 维护软件 缺陷 错误报告 |
title | Mining authorship characteristics in bug repositories |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T09%3A01%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Mining%20authorship%20characteristics%20in%20bug%20repositories&rft.jtitle=Science%20China.%20Information%20sciences&rft.au=Jiang,%20He&rft.date=2017&rft.volume=60&rft.issue=1&rft.spage=96&rft.epage=111&rft.pages=96-111&rft.artnum=012107&rft.issn=1674-733X&rft.eissn=1869-1919&rft_id=info:doi/10.1007/s11432-014-0372-y&rft_dat=%3Cproquest_cross%3E2918636767%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2918636767&rft_id=info:pmid/&rft_cqvip_id=671177090&rfr_iscdi=true |