Mining authorship characteristics in bug repositories

Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixin...

Bibliographic details
Published in: Science China. Information sciences 2017, Vol.60 (1), p.96-111, Article 012107
Main authors: Jiang, He; Zhang, Jingxuan; Ma, Hongjing; Nazar, Najam; Ren, Zhilei
Format: Article
Language: English
Subjects:
Online access: Full text
container_end_page 111
container_issue 1
container_start_page 96
container_title Science China. Information sciences
container_volume 60
creator Jiang, He
Zhang, Jingxuan
Ma, Hongjing
Nazar, Najam
Ren, Zhilei
description Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However, no in-depth investigation has been conducted over the authorship characteristics. In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection (NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization (BRS). Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a state-of-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks.
doi_str_mv 10.1007/s11432-014-0372-y
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2918636767</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cqvip_id>671177090</cqvip_id><sourcerecordid>2918636767</sourcerecordid><originalsourceid>FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</originalsourceid><addsrcrecordid>eNp9kD9PwzAQxS0EEhX0A7BFMBvu4tiOR1TxTypiAYnNSh0ncQVxaidDvz2uUsHGLXfD-72ne4RcIdwigLyLiAXLKWBBgcmc7k_IAkuhKCpUp-kWsqCSsc9zsoxxC2kYg1yWC8JfXe_6NqumsfMhdm7ITFeFyow2uDg6EzPXZ5upzYIdfHSjD87GS3LWVF_RLo_7gnw8Pryvnun67elldb-mhikcKTLFQZZ1haawxUblqhalyLktqkYpzgVyRAEgyhrKhtkNclbXyvBGgJCNZBfkZvYdgt9NNo5666fQp0idq_QhE1IcVDirTPAxBtvoIbjvKuw1gj4UpOeCdCpIHwrS-8TkMxOTtm9t-HP-D7o-BnW-b3eJ-00SElFKUMB-AI17csk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2918636767</pqid></control><display><type>article</type><title>Mining authorship characteristics in bug repositories</title><source>ProQuest Central UK/Ireland</source><source>Alma/SFX Local Collection</source><source>SpringerLink Journals - AutoHoldings</source><source>ProQuest Central</source><creator>Jiang, He ; Zhang, Jingxuan ; Ma, Hongjing ; Nazar, Najam ; Ren, Zhilei</creator><creatorcontrib>Jiang, He ; Zhang, Jingxuan ; Ma, Hongjing ; Nazar, Najam ; Ren, Zhilei</creatorcontrib><description>Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However,no in-depth investigation has been conducted over the authorship characteristics. 
In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection(NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization(BRS). Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a stateof-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks.</description><identifier>ISSN: 1674-733X</identifier><identifier>EISSN: 1869-1919</identifier><identifier>DOI: 10.1007/s11432-014-0372-y</identifier><language>eng</language><publisher>Beijing: Science China Press</publisher><subject>Authorship ; Computer Science ; Debugging ; Information Systems and Communication Service ; Maintenance ; N-gram模型 ; Research Paper ; Software ; 作者 ; 开放源代码 ; 挖掘 ; 特征和 ; 维护软件 ; 缺陷 ; 错误报告</subject><ispartof>Science China. 
Information sciences, 2017, Vol.60 (1), p.96-111, Article 012107</ispartof><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2016</rights><rights>Science China Press and Springer-Verlag Berlin Heidelberg 2016.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</citedby><cites>FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://image.cqvip.com/vip1000/qk/84009A/84009A.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11432-014-0372-y$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2918636767?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,4024,21388,27923,27924,27925,33744,41488,42557,43805,51319,64385,64389,72469</link.rule.ids></links><search><creatorcontrib>Jiang, He</creatorcontrib><creatorcontrib>Zhang, Jingxuan</creatorcontrib><creatorcontrib>Ma, Hongjing</creatorcontrib><creatorcontrib>Nazar, Najam</creatorcontrib><creatorcontrib>Ren, Zhilei</creatorcontrib><title>Mining authorship characteristics in bug repositories</title><title>Science China. Information sciences</title><addtitle>Sci. China Inf. Sci</addtitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><description>Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However,no in-depth investigation has been conducted over the authorship characteristics. 
In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection(NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization(BRS). Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a stateof-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks.</description><subject>Authorship</subject><subject>Computer Science</subject><subject>Debugging</subject><subject>Information Systems and Communication Service</subject><subject>Maintenance</subject><subject>N-gram模型</subject><subject>Research 
Paper</subject><subject>Software</subject><subject>作者</subject><subject>开放源代码</subject><subject>挖掘</subject><subject>特征和</subject><subject>维护软件</subject><subject>缺陷</subject><subject>错误报告</subject><issn>1674-733X</issn><issn>1869-1919</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kD9PwzAQxS0EEhX0A7BFMBvu4tiOR1TxTypiAYnNSh0ncQVxaidDvz2uUsHGLXfD-72ne4RcIdwigLyLiAXLKWBBgcmc7k_IAkuhKCpUp-kWsqCSsc9zsoxxC2kYg1yWC8JfXe_6NqumsfMhdm7ITFeFyow2uDg6EzPXZ5upzYIdfHSjD87GS3LWVF_RLo_7gnw8Pryvnun67elldb-mhikcKTLFQZZ1haawxUblqhalyLktqkYpzgVyRAEgyhrKhtkNclbXyvBGgJCNZBfkZvYdgt9NNo5666fQp0idq_QhE1IcVDirTPAxBtvoIbjvKuw1gj4UpOeCdCpIHwrS-8TkMxOTtm9t-HP-D7o-BnW-b3eJ-00SElFKUMB-AI17csk</recordid><startdate>2017</startdate><enddate>2017</enddate><creator>Jiang, He</creator><creator>Zhang, Jingxuan</creator><creator>Ma, Hongjing</creator><creator>Nazar, Najam</creator><creator>Ren, Zhilei</creator><general>Science China Press</general><general>Springer Nature B.V</general><scope>2RA</scope><scope>92L</scope><scope>CQIGP</scope><scope>W92</scope><scope>~WA</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope></search><sort><creationdate>2017</creationdate><title>Mining authorship characteristics in bug repositories</title><author>Jiang, He ; Zhang, Jingxuan ; Ma, Hongjing ; Nazar, Najam ; Ren, 
Zhilei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c391t-1395078da1c4e4b929d68625e4af99556151160068d08f3eb153dd9c5f6067f73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Authorship</topic><topic>Computer Science</topic><topic>Debugging</topic><topic>Information Systems and Communication Service</topic><topic>Maintenance</topic><topic>N-gram模型</topic><topic>Research Paper</topic><topic>Software</topic><topic>作者</topic><topic>开放源代码</topic><topic>挖掘</topic><topic>特征和</topic><topic>维护软件</topic><topic>缺陷</topic><topic>错误报告</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jiang, He</creatorcontrib><creatorcontrib>Zhang, Jingxuan</creatorcontrib><creatorcontrib>Ma, Hongjing</creatorcontrib><creatorcontrib>Nazar, Najam</creatorcontrib><creatorcontrib>Ren, Zhilei</creatorcontrib><collection>中文科技期刊数据库</collection><collection>中文科技期刊数据库-CALIS站点</collection><collection>中文科技期刊数据库-7.0平台</collection><collection>中文科技期刊数据库-工程技术</collection><collection>中文科技期刊数据库- 镜像站点</collection><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace 
Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Science China. Information sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jiang, He</au><au>Zhang, Jingxuan</au><au>Ma, Hongjing</au><au>Nazar, Najam</au><au>Ren, Zhilei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Mining authorship characteristics in bug repositories</atitle><jtitle>Science China. Information sciences</jtitle><stitle>Sci. China Inf. Sci</stitle><addtitle>SCIENCE CHINA Information Sciences</addtitle><date>2017</date><risdate>2017</risdate><volume>60</volume><issue>1</issue><spage>96</spage><epage>111</epage><pages>96-111</pages><artnum>012107</artnum><issn>1674-733X</issn><eissn>1869-1919</eissn><abstract>Bug reports are widely employed to facilitate software tasks in software maintenance. Since bug reports are contributed by people, the authorship characteristics of contributors may heavily impact the performance of resolving software tasks. Poorly written bug reports may delay developers when fixing bugs. However,no in-depth investigation has been conducted over the authorship characteristics. In this study, we first leverage byte-level N-grams to model the authorship characteristics and employ Normalized Simplified Profile Intersection(NSPI) to identify the similarity of the authorship characteristics. Then, we investigate a series of properties related to contributors’ authorship characteristics, including the evolvement over time and the variation among distinct products in open source projects. Moreover, we show how to leverage the authorship characteristics to facilitate a well-known task in software maintenance, namely Bug Report Summarization(BRS). 
Experiments on open source projects validate that incorporating the authorship characteristics can effectively improve a stateof-the-art method in BRS. Our findings suggest that contributors should retain stable authorship characteristics and the authorship characteristics can assist in resolving software tasks.</abstract><cop>Beijing</cop><pub>Science China Press</pub><doi>10.1007/s11432-014-0372-y</doi><tpages>16</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1674-733X
ispartof Science China. Information sciences, 2017, Vol.60 (1), p.96-111, Article 012107
issn 1674-733X
1869-1919
language eng
recordid cdi_proquest_journals_2918636767
source ProQuest Central UK/Ireland; Alma/SFX Local Collection; SpringerLink Journals - AutoHoldings; ProQuest Central
subjects Authorship
Computer Science
Debugging
Information Systems and Communication Service
Maintenance
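N-gram模型 (N-gram model)
Research Paper
Software
作者 (Authors)
开放源代码 (Open source code)
挖掘 (Mining)
特征和 (Characteristics)
维护软件 (Software maintenance)
缺陷 (Defects)
错误报告 (Bug reports)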
title Mining authorship characteristics in bug repositories
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T09%3A01%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Mining%20authorship%20characteristics%20in%20bug%20repositories&rft.jtitle=Science%20China.%20Information%20sciences&rft.au=Jiang,%20He&rft.date=2017&rft.volume=60&rft.issue=1&rft.spage=96&rft.epage=111&rft.pages=96-111&rft.artnum=012107&rft.issn=1674-733X&rft.eissn=1869-1919&rft_id=info:doi/10.1007/s11432-014-0372-y&rft_dat=%3Cproquest_cross%3E2918636767%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2918636767&rft_id=info:pmid/&rft_cqvip_id=671177090&rfr_iscdi=true
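
The description above mentions modeling authorship with byte-level N-grams and comparing profiles with Normalized Simplified Profile Intersection (NSPI). The record does not give the formulas, so the following is only a minimal sketch under stated assumptions: a profile is taken to be the set of the most frequent byte N-grams of a text, and the intersection size is normalized by the smaller profile size. The function names, the defaults `n=3` and `size=500`, and the normalization itself are illustrative choices, not the authors' definitions.

```python
from collections import Counter


def ngram_profile(text: str, n: int = 3, size: int = 500) -> set[bytes]:
    """Return the `size` most frequent byte-level n-grams of `text`.

    Assumption: n and size are illustrative defaults, not values from the paper.
    """
    data = text.encode("utf-8")
    # Slide a window of n bytes over the encoded text and count each n-gram.
    counts = Counter(data[i:i + n] for i in range(len(data) - n + 1))
    return {gram for gram, _ in counts.most_common(size)}


def normalized_spi(profile_a: set[bytes], profile_b: set[bytes]) -> float:
    """Size of the profile intersection, scaled to the range [0, 1].

    Assumption: normalization divides by the smaller profile size; the
    paper may normalize differently.
    """
    if not profile_a or not profile_b:
        return 0.0
    return len(profile_a & profile_b) / min(len(profile_a), len(profile_b))
```

Under these assumptions, two texts with identical N-gram profiles score 1.0 and two texts with no N-gram in common score 0.0, so the value can be read as a rough similarity between two contributors' writing.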