Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling

The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2022, Vol.10, p.74638-74654
Hauptverfasser: Gurcan, Fatih, Dalveren, Gonca Gokce Menekse, Cagiltay, Nergiz Ercil, Soylu, Ahmet
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 74654
container_issue
container_start_page 74638
container_title IEEE access
container_volume 10
creator Gurcan, Fatih
Dalveren, Gonca Gokce Menekse
Cagiltay, Nergiz Ercil
Soylu, Ahmet
description The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.
doi_str_mv 10.1109/ACCESS.2022.3190632
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2691875765</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9828025</ieee_id><doaj_id>oai_doaj_org_article_5ccbbb9b0d0e423f881eb1141a5053fc</doaj_id><sourcerecordid>2691875765</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</originalsourceid><addsrcrecordid>eNpNUU1rGzEQXUoLDWl-QS6CnO1qJK0-jsF1m4BLS-2chT5mHRlXcqQNpf--62wIncsM7817M_C67hroEoCaz7er1Xq7XTLK2JKDoZKzd90FA2kWvOfy_X_zx-6qtQOdSk9Qry66py84YhhT3pONGzGPZFdOKTTiciS7ijk2kjLZlmH84yqSdd6njFjPgl_Y0NXwSLYpByRgNCUP7cz8rMU7n46pjSnMjuR7iXicyE_dh8EdG1699svu4et6t7pbbH58u1_dbhZBUD0uxCAYVSwwqYUKHijnIgqQqJSO0oOJECOPShgRglaSGQ4eUCg0FBz3_LK7n31jcQd7qum3q39tccm-AKXuravTe0e0fQjee-NppCgYH7QG9AACXE97PoTJ62b2OtXy9IxttIfyXPP0vmXSgFa9kv20xeetUEtrFYe3q0DtOSo7R2XPUdnXqCbV9axKiPimMJppynr-D97OjkQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2691875765</pqid></control><display><type>article</type><title>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Gurcan, Fatih ; Dalveren, Gonca Gokce Menekse ; Cagiltay, Nergiz Ercil ; Soylu, Ahmet</creator><creatorcontrib>Gurcan, Fatih ; Dalveren, Gonca Gokce Menekse ; Cagiltay, Nergiz Ercil ; Soylu, Ahmet</creatorcontrib><description>The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2022.3190632</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Age ; Bibliometrics ; Corpus creation ; Engineering research ; Licenses ; Modelling ; Optimization ; research trends and topics ; Software ; Software development ; Software engineering ; Systematics ; Text mining ; topic model ; Trends</subject><ispartof>IEEE access, 2022, Vol.10, p.74638-74654</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</citedby><cites>FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</cites><orcidid>0000-0002-8649-1909 ; 0000-0001-6034-4137 ; 0000-0003-0875-9276 ; 0000-0001-9915-6686</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9828025$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2102,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Gurcan, Fatih</creatorcontrib><creatorcontrib>Dalveren, Gonca Gokce Menekse</creatorcontrib><creatorcontrib>Cagiltay, Nergiz Ercil</creatorcontrib><creatorcontrib>Soylu, Ahmet</creatorcontrib><title>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</title><title>IEEE access</title><addtitle>Access</addtitle><description>The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.</description><subject>Age</subject><subject>Bibliometrics</subject><subject>Corpus creation</subject><subject>Engineering research</subject><subject>Licenses</subject><subject>Modelling</subject><subject>Optimization</subject><subject>research trends and topics</subject><subject>Software</subject><subject>Software development</subject><subject>Software engineering</subject><subject>Systematics</subject><subject>Text mining</subject><subject>topic model</subject><subject>Trends</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUU1rGzEQXUoLDWl-QS6CnO1qJK0-jsF1m4BLS-2chT5mHRlXcqQNpf--62wIncsM7817M_C67hroEoCaz7er1Xq7XTLK2JKDoZKzd90FA2kWvOfy_X_zx-6qtQOdSk9Qry66py84YhhT3pONGzGPZFdOKTTiciS7ijk2kjLZlmH84yqSdd6njFjPgl_Y0NXwSLYpByRgNCUP7cz8rMU7n46pjSnMjuR7iXicyE_dh8EdG1699svu4et6t7pbbH58u1_dbhZBUD0uxCAYVSwwqYUKHijnIgqQqJSO0oOJECOPShgRglaSGQ4eUCg0FBz3_LK7n31jcQd7qum3q39tccm-AKXuravTe0e0fQjee-NppCgYH7QG9AACXE97PoTJ62b2OtXy9IxttIfyXPP0vmXSgFa9kv20xeetUEtrFYe3q0DtOSo7R2XPUdnXqCbV9axKiPimMJppynr-D97OjkQ</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Gurcan, Fatih</creator><creator>Dalveren, Gonca Gokce Menekse</creator><creator>Cagiltay, Nergiz Ercil</creator><creator>Soylu, Ahmet</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-8649-1909</orcidid><orcidid>https://orcid.org/0000-0001-6034-4137</orcidid><orcidid>https://orcid.org/0000-0003-0875-9276</orcidid><orcidid>https://orcid.org/0000-0001-9915-6686</orcidid></search><sort><creationdate>2022</creationdate><title>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</title><author>Gurcan, Fatih ; Dalveren, Gonca Gokce Menekse ; Cagiltay, Nergiz Ercil ; Soylu, Ahmet</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Age</topic><topic>Bibliometrics</topic><topic>Corpus creation</topic><topic>Engineering research</topic><topic>Licenses</topic><topic>Modelling</topic><topic>Optimization</topic><topic>research trends and topics</topic><topic>Software</topic><topic>Software development</topic><topic>Software engineering</topic><topic>Systematics</topic><topic>Text mining</topic><topic>topic model</topic><topic>Trends</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gurcan, Fatih</creatorcontrib><creatorcontrib>Dalveren, Gonca Gokce Menekse</creatorcontrib><creatorcontrib>Cagiltay, Nergiz Ercil</creatorcontrib><creatorcontrib>Soylu, Ahmet</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gurcan, Fatih</au><au>Dalveren, Gonca Gokce Menekse</au><au>Cagiltay, Nergiz Ercil</au><au>Soylu, Ahmet</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2022</date><risdate>2022</risdate><volume>10</volume><spage>74638</spage><epage>74654</epage><pages>74638-74654</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2022.3190632</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-8649-1909</orcidid><orcidid>https://orcid.org/0000-0001-6034-4137</orcidid><orcidid>https://orcid.org/0000-0003-0875-9276</orcidid><orcidid>https://orcid.org/0000-0001-9915-6686</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2022, Vol.10, p.74638-74654
issn 2169-3536
2169-3536
language eng
recordid cdi_proquest_journals_2691875765
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals
subjects Age
Bibliometrics
Corpus creation
Engineering research
Licenses
Modelling
Optimization
research trends and topics
Software
Software development
Software engineering
Systematics
Text mining
topic model
Trends
title Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T12%3A40%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Detecting%20Latent%20Topics%20and%20Trends%20in%20Software%20Engineering%20Research%20Since%201980%20Using%20Probabilistic%20Topic%20Modeling&rft.jtitle=IEEE%20access&rft.au=Gurcan,%20Fatih&rft.date=2022&rft.volume=10&rft.spage=74638&rft.epage=74654&rft.pages=74638-74654&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2022.3190632&rft_dat=%3Cproquest_cross%3E2691875765%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2691875765&rft_id=info:pmid/&rft_ieee_id=9828025&rft_doaj_id=oai_doaj_org_article_5ccbbb9b0d0e423f881eb1141a5053fc&rfr_iscdi=true