Potential and limitations of the ISBSG dataset in enhancing software engineering research: A mapping review

The International Software Benchmarking Standards Group (ISBSG) maintains a software development repository with over 6000 software projects. This dataset makes it possible to estimate a project’s size, effort, duration, and cost. The aim of this study was to determine how and to what extent, ISBSG...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information and software technology 2014-06, Vol.56 (6), p.527-544
Hauptverfasser: Fernández-Diego, Marta, González-Ladrón-de-Guevara, Fernando
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 544
container_issue 6
container_start_page 527
container_title Information and software technology
container_volume 56
creator Fernández-Diego, Marta
González-Ladrón-de-Guevara, Fernando
description The International Software Benchmarking Standards Group (ISBSG) maintains a software development repository with over 6000 software projects. This dataset makes it possible to estimate a project’s size, effort, duration, and cost. The aim of this study was to determine how and to what extent, ISBSG has been used by researchers from 2000, when the first papers were published, until June of 2012. A systematic mapping review was used as the research method, which was applied to over 129 papers obtained after the filtering process. The papers were published in 19 journals and 40 conferences. Thirty-five percent of the papers published between years 2000 and 2011 have received at least one citation in journals and only five papers have received six or more citations. Effort variable is the focus of 70.5% of the papers, 22.5% center their research in a variable different from effort and 7% do not consider any target variable. Additionally, in as many as 70.5% of papers, effort estimation is the research topic, followed by dataset properties (36.4%). The more frequent methods are Regression (61.2%), Machine Learning (35.7%), and Estimation by Analogy (22.5%). ISBSG is used as the only support in 55% of the papers while the remaining papers use complementary datasets. The ISBSG release 10 is used most frequently with 32 references. Finally, some benefits and drawbacks of the usage of ISBSG have been highlighted. This work presents a snapshot of the existing usage of ISBSG in software development research. ISBSG offers a wealth of information regarding practices from a wide range of organizations, applications, and development types, which constitutes its main potential. However, a data preparation process is required before any analysis. Lastly, the potential of ISBSG to develop new research is also outlined.
doi_str_mv 10.1016/j.infsof.2014.01.003
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1531005632</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950584914000147</els_id><sourcerecordid>3258888401</sourcerecordid><originalsourceid>FETCH-LOGICAL-c479t-874b427bde6ae1eb5c77bf9322f2d7d7bdb89b495eedc6174e56e0c5a4f02fc93</originalsourceid><addsrcrecordid>eNp9kU1v1DAQhi1EJZa2_6AHS1y4JIwdO044IJUKSqVKIBXOluOMu16y9mJ7W_Hv8SqcOHAa6Z1nPl9Crhi0DFj_btf64HJ0LQcmWmAtQPeCbNiguqYHLl-SDYwSGjmI8RV5nfMOgCnoYEN-fosFQ_FmoSbMdPF7X0zxMWQaHS1bpHcPHx9u6WyKyVioDxTD1gTrwyOtI8uzSVilRx8Q00lMmNEku31Pr-neHA6r9uTx-YKcObNkvPwbz8mPz5--33xp7r_e3t1c3zdWqLE0gxKT4GqasTfIcJJWqcmNHeeOz2quiWkYJzFKxNn2TAmUPYKVRjjgzo7dOXm79j2k-OuIuei9zxaXxQSMx6yZ7BiA7Dte0Tf_oLt4TKFuVykGauBC9JUSK2VTzDmh04fk9yb91gz0yQG906sD-uSABqarA7Xsw1qG9dj6gKSz9Rgszj6hLXqO_v8N_gA6npHz</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1510782446</pqid></control><display><type>article</type><title>Potential and limitations of the ISBSG dataset in enhancing software engineering research: A mapping review</title><source>Elsevier ScienceDirect Journals</source><creator>Fernández-Diego, Marta ; González-Ladrón-de-Guevara, Fernando</creator><creatorcontrib>Fernández-Diego, Marta ; González-Ladrón-de-Guevara, Fernando</creatorcontrib><description>The International Software Benchmarking Standards Group (ISBSG) maintains a software development repository with over 6000 software projects. This dataset makes it possible to estimate a project’s size, effort, duration, and cost. The aim of this study was to determine how and to what extent, ISBSG has been used by researchers from 2000, when the first papers were published, until June of 2012. A systematic mapping review was used as the research method, which was applied to over 129 papers obtained after the filtering process. The papers were published in 19 journals and 40 conferences. Thirty-five percent of the papers published between years 2000 and 2011 have received at least one citation in journals and only five papers have received six or more citations. Effort variable is the focus of 70.5% of the papers, 22.5% center their research in a variable different from effort and 7% do not consider any target variable. Additionally, in as many as 70.5% of papers, effort estimation is the research topic, followed by dataset properties (36.4%). The more frequent methods are Regression (61.2%), Machine Learning (35.7%), and Estimation by Analogy (22.5%). ISBSG is used as the only support in 55% of the papers while the remaining papers use complementary datasets. The ISBSG release 10 is used most frequently with 32 references. Finally, some benefits and drawbacks of the usage of ISBSG have been highlighted. This work presents a snapshot of the existing usage of ISBSG in software development research. ISBSG offers a wealth of information regarding practices from a wide range of organizations, applications, and development types, which constitutes its main potential. However, a data preparation process is required before any analysis. Lastly, the potential of ISBSG to develop new research is also outlined.</description><identifier>ISSN: 0950-5849</identifier><identifier>EISSN: 1873-6025</identifier><identifier>DOI: 10.1016/j.infsof.2014.01.003</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Benchmarking ; Benchmarks ; Computer programs ; International standards ; ISBSG ; Mapping ; Research methodology ; Research methods ; Software ; Software cost prediction ; Software effort estimation ; Software engineering ; Studies ; Systematic mapping study ; Systems development</subject><ispartof>Information and software technology, 2014-06, Vol.56 (6), p.527-544</ispartof><rights>2014 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Jun 2014</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c479t-874b427bde6ae1eb5c77bf9322f2d7d7bdb89b495eedc6174e56e0c5a4f02fc93</citedby><cites>FETCH-LOGICAL-c479t-874b427bde6ae1eb5c77bf9322f2d7d7bdb89b495eedc6174e56e0c5a4f02fc93</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0950584914000147$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Fernández-Diego, Marta</creatorcontrib><creatorcontrib>González-Ladrón-de-Guevara, Fernando</creatorcontrib><title>Potential and limitations of the ISBSG dataset in enhancing software engineering research: A mapping review</title><title>Information and software technology</title><description>The International Software Benchmarking Standards Group (ISBSG) maintains a software development repository with over 6000 software projects. This dataset makes it possible to estimate a project’s size, effort, duration, and cost. The aim of this study was to determine how and to what extent, ISBSG has been used by researchers from 2000, when the first papers were published, until June of 2012. A systematic mapping review was used as the research method, which was applied to over 129 papers obtained after the filtering process. The papers were published in 19 journals and 40 conferences. Thirty-five percent of the papers published between years 2000 and 2011 have received at least one citation in journals and only five papers have received six or more citations. Effort variable is the focus of 70.5% of the papers, 22.5% center their research in a variable different from effort and 7% do not consider any target variable. Additionally, in as many as 70.5% of papers, effort estimation is the research topic, followed by dataset properties (36.4%). The more frequent methods are Regression (61.2%), Machine Learning (35.7%), and Estimation by Analogy (22.5%). ISBSG is used as the only support in 55% of the papers while the remaining papers use complementary datasets. The ISBSG release 10 is used most frequently with 32 references. Finally, some benefits and drawbacks of the usage of ISBSG have been highlighted. This work presents a snapshot of the existing usage of ISBSG in software development research. ISBSG offers a wealth of information regarding practices from a wide range of organizations, applications, and development types, which constitutes its main potential. However, a data preparation process is required before any analysis. Lastly, the potential of ISBSG to develop new research is also outlined.</description><subject>Benchmarking</subject><subject>Benchmarks</subject><subject>Computer programs</subject><subject>International standards</subject><subject>ISBSG</subject><subject>Mapping</subject><subject>Research methodology</subject><subject>Research methods</subject><subject>Software</subject><subject>Software cost prediction</subject><subject>Software effort estimation</subject><subject>Software engineering</subject><subject>Studies</subject><subject>Systematic mapping study</subject><subject>Systems development</subject><issn>0950-5849</issn><issn>1873-6025</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><recordid>eNp9kU1v1DAQhi1EJZa2_6AHS1y4JIwdO044IJUKSqVKIBXOluOMu16y9mJ7W_Hv8SqcOHAa6Z1nPl9Crhi0DFj_btf64HJ0LQcmWmAtQPeCbNiguqYHLl-SDYwSGjmI8RV5nfMOgCnoYEN-fosFQ_FmoSbMdPF7X0zxMWQaHS1bpHcPHx9u6WyKyVioDxTD1gTrwyOtI8uzSVilRx8Q00lMmNEku31Pr-neHA6r9uTx-YKcObNkvPwbz8mPz5--33xp7r_e3t1c3zdWqLE0gxKT4GqasTfIcJJWqcmNHeeOz2quiWkYJzFKxNn2TAmUPYKVRjjgzo7dOXm79j2k-OuIuei9zxaXxQSMx6yZ7BiA7Dte0Tf_oLt4TKFuVykGauBC9JUSK2VTzDmh04fk9yb91gz0yQG906sD-uSABqarA7Xsw1qG9dj6gKSz9Rgszj6hLXqO_v8N_gA6npHz</recordid><startdate>20140601</startdate><enddate>20140601</enddate><creator>Fernández-Diego, Marta</creator><creator>González-Ladrón-de-Guevara, Fernando</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20140601</creationdate><title>Potential and limitations of the ISBSG dataset in enhancing software engineering research: A mapping review</title><author>Fernández-Diego, Marta ; González-Ladrón-de-Guevara, Fernando</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c479t-874b427bde6ae1eb5c77bf9322f2d7d7bdb89b495eedc6174e56e0c5a4f02fc93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Benchmarking</topic><topic>Benchmarks</topic><topic>Computer programs</topic><topic>International standards</topic><topic>ISBSG</topic><topic>Mapping</topic><topic>Research methodology</topic><topic>Research methods</topic><topic>Software</topic><topic>Software cost prediction</topic><topic>Software effort estimation</topic><topic>Software engineering</topic><topic>Studies</topic><topic>Systematic mapping study</topic><topic>Systems development</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fernández-Diego, Marta</creatorcontrib><creatorcontrib>González-Ladrón-de-Guevara, Fernando</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Information and software technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fernández-Diego, Marta</au><au>González-Ladrón-de-Guevara, Fernando</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Potential and limitations of the ISBSG dataset in enhancing software engineering research: A mapping review</atitle><jtitle>Information and software technology</jtitle><date>2014-06-01</date><risdate>2014</risdate><volume>56</volume><issue>6</issue><spage>527</spage><epage>544</epage><pages>527-544</pages><issn>0950-5849</issn><eissn>1873-6025</eissn><abstract>The International Software Benchmarking Standards Group (ISBSG) maintains a software development repository with over 6000 software projects. This dataset makes it possible to estimate a project’s size, effort, duration, and cost. The aim of this study was to determine how and to what extent, ISBSG has been used by researchers from 2000, when the first papers were published, until June of 2012. A systematic mapping review was used as the research method, which was applied to over 129 papers obtained after the filtering process. The papers were published in 19 journals and 40 conferences. Thirty-five percent of the papers published between years 2000 and 2011 have received at least one citation in journals and only five papers have received six or more citations. Effort variable is the focus of 70.5% of the papers, 22.5% center their research in a variable different from effort and 7% do not consider any target variable. Additionally, in as many as 70.5% of papers, effort estimation is the research topic, followed by dataset properties (36.4%). The more frequent methods are Regression (61.2%), Machine Learning (35.7%), and Estimation by Analogy (22.5%). ISBSG is used as the only support in 55% of the papers while the remaining papers use complementary datasets. The ISBSG release 10 is used most frequently with 32 references. Finally, some benefits and drawbacks of the usage of ISBSG have been highlighted. This work presents a snapshot of the existing usage of ISBSG in software development research. ISBSG offers a wealth of information regarding practices from a wide range of organizations, applications, and development types, which constitutes its main potential. However, a data preparation process is required before any analysis. Lastly, the potential of ISBSG to develop new research is also outlined.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.infsof.2014.01.003</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0950-5849
ispartof Information and software technology, 2014-06, Vol.56 (6), p.527-544
issn 0950-5849
1873-6025
language eng
recordid cdi_proquest_miscellaneous_1531005632
source Elsevier ScienceDirect Journals
subjects Benchmarking
Benchmarks
Computer programs
International standards
ISBSG
Mapping
Research methodology
Research methods
Software
Software cost prediction
Software effort estimation
Software engineering
Studies
Systematic mapping study
Systems development
title Potential and limitations of the ISBSG dataset in enhancing software engineering research: A mapping review
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T22%3A50%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Potential%20and%20limitations%20of%20the%20ISBSG%20dataset%20in%20enhancing%20software%20engineering%20research:%20A%20mapping%20review&rft.jtitle=Information%20and%20software%20technology&rft.au=Fern%C3%A1ndez-Diego,%20Marta&rft.date=2014-06-01&rft.volume=56&rft.issue=6&rft.spage=527&rft.epage=544&rft.pages=527-544&rft.issn=0950-5849&rft.eissn=1873-6025&rft_id=info:doi/10.1016/j.infsof.2014.01.003&rft_dat=%3Cproquest_cross%3E3258888401%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1510782446&rft_id=info:pmid/&rft_els_id=S0950584914000147&rfr_iscdi=true