Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling
The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however,...
Gespeichert in:
Veröffentlicht in: | IEEE access 2022, Vol.10, p.74638-74654 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 74654 |
---|---|
container_issue | |
container_start_page | 74638 |
container_title | IEEE access |
container_volume | 10 |
creator | Gurcan, Fatih Dalveren, Gonca Gokce Menekse Cagiltay, Nergiz Ercil Soylu, Ahmet |
description | The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age. |
doi_str_mv | 10.1109/ACCESS.2022.3190632 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2691875765</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9828025</ieee_id><doaj_id>oai_doaj_org_article_5ccbbb9b0d0e423f881eb1141a5053fc</doaj_id><sourcerecordid>2691875765</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</originalsourceid><addsrcrecordid>eNpNUU1rGzEQXUoLDWl-QS6CnO1qJK0-jsF1m4BLS-2chT5mHRlXcqQNpf--62wIncsM7817M_C67hroEoCaz7er1Xq7XTLK2JKDoZKzd90FA2kWvOfy_X_zx-6qtQOdSk9Qry66py84YhhT3pONGzGPZFdOKTTiciS7ijk2kjLZlmH84yqSdd6njFjPgl_Y0NXwSLYpByRgNCUP7cz8rMU7n46pjSnMjuR7iXicyE_dh8EdG1699svu4et6t7pbbH58u1_dbhZBUD0uxCAYVSwwqYUKHijnIgqQqJSO0oOJECOPShgRglaSGQ4eUCg0FBz3_LK7n31jcQd7qum3q39tccm-AKXuravTe0e0fQjee-NppCgYH7QG9AACXE97PoTJ62b2OtXy9IxttIfyXPP0vmXSgFa9kv20xeetUEtrFYe3q0DtOSo7R2XPUdnXqCbV9axKiPimMJppynr-D97OjkQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2691875765</pqid></control><display><type>article</type><title>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Gurcan, Fatih ; Dalveren, Gonca Gokce Menekse ; Cagiltay, Nergiz Ercil ; Soylu, Ahmet</creator><creatorcontrib>Gurcan, Fatih ; Dalveren, Gonca Gokce Menekse ; Cagiltay, Nergiz Ercil ; Soylu, Ahmet</creatorcontrib><description>The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2022.3190632</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Age ; Bibliometrics ; Corpus creation ; Engineering research ; Licenses ; Modelling ; Optimization ; research trends and topics ; Software ; Software development ; Software engineering ; Systematics ; Text mining ; topic model ; Trends</subject><ispartof>IEEE access, 2022, Vol.10, p.74638-74654</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</citedby><cites>FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</cites><orcidid>0000-0002-8649-1909 ; 0000-0001-6034-4137 ; 0000-0003-0875-9276 ; 0000-0001-9915-6686</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9828025$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2102,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Gurcan, Fatih</creatorcontrib><creatorcontrib>Dalveren, Gonca Gokce Menekse</creatorcontrib><creatorcontrib>Cagiltay, Nergiz Ercil</creatorcontrib><creatorcontrib>Soylu, Ahmet</creatorcontrib><title>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</title><title>IEEE access</title><addtitle>Access</addtitle><description>The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.</description><subject>Age</subject><subject>Bibliometrics</subject><subject>Corpus creation</subject><subject>Engineering research</subject><subject>Licenses</subject><subject>Modelling</subject><subject>Optimization</subject><subject>research trends and topics</subject><subject>Software</subject><subject>Software development</subject><subject>Software engineering</subject><subject>Systematics</subject><subject>Text mining</subject><subject>topic model</subject><subject>Trends</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUU1rGzEQXUoLDWl-QS6CnO1qJK0-jsF1m4BLS-2chT5mHRlXcqQNpf--62wIncsM7817M_C67hroEoCaz7er1Xq7XTLK2JKDoZKzd90FA2kWvOfy_X_zx-6qtQOdSk9Qry66py84YhhT3pONGzGPZFdOKTTiciS7ijk2kjLZlmH84yqSdd6njFjPgl_Y0NXwSLYpByRgNCUP7cz8rMU7n46pjSnMjuR7iXicyE_dh8EdG1699svu4et6t7pbbH58u1_dbhZBUD0uxCAYVSwwqYUKHijnIgqQqJSO0oOJECOPShgRglaSGQ4eUCg0FBz3_LK7n31jcQd7qum3q39tccm-AKXuravTe0e0fQjee-NppCgYH7QG9AACXE97PoTJ62b2OtXy9IxttIfyXPP0vmXSgFa9kv20xeetUEtrFYe3q0DtOSo7R2XPUdnXqCbV9axKiPimMJppynr-D97OjkQ</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Gurcan, Fatih</creator><creator>Dalveren, Gonca Gokce Menekse</creator><creator>Cagiltay, Nergiz Ercil</creator><creator>Soylu, Ahmet</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-8649-1909</orcidid><orcidid>https://orcid.org/0000-0001-6034-4137</orcidid><orcidid>https://orcid.org/0000-0003-0875-9276</orcidid><orcidid>https://orcid.org/0000-0001-9915-6686</orcidid></search><sort><creationdate>2022</creationdate><title>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</title><author>Gurcan, Fatih ; Dalveren, Gonca Gokce Menekse ; Cagiltay, Nergiz Ercil ; Soylu, Ahmet</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-4f42072c26847cb10334d416e778d6b19d1dd3d7494cc8762931b1e47e901a3b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Age</topic><topic>Bibliometrics</topic><topic>Corpus creation</topic><topic>Engineering research</topic><topic>Licenses</topic><topic>Modelling</topic><topic>Optimization</topic><topic>research trends and topics</topic><topic>Software</topic><topic>Software development</topic><topic>Software engineering</topic><topic>Systematics</topic><topic>Text mining</topic><topic>topic model</topic><topic>Trends</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gurcan, Fatih</creatorcontrib><creatorcontrib>Dalveren, Gonca Gokce Menekse</creatorcontrib><creatorcontrib>Cagiltay, Nergiz Ercil</creatorcontrib><creatorcontrib>Soylu, Ahmet</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gurcan, Fatih</au><au>Dalveren, Gonca Gokce Menekse</au><au>Cagiltay, Nergiz Ercil</au><au>Soylu, Ahmet</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2022</date><risdate>2022</risdate><volume>10</volume><spage>74638</spage><epage>74654</epage><pages>74638-74654</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>The landscape of software engineering research has changed significantly from one year to the next in line with industrial needs and trends. Therefore, today's research literature on software engineering has a rich and multidisciplinary content that includes a large number of studies; however, not many of them demonstrate a holistic view of the field. From this perspective, this study aimed to reveal a holistic view that reflects topics, trends, and trajectories in software engineering research by analyzing the majority of domain-specific articles published over the last 40 years. This study first presents an objective and systematic method for corpus creation through major publication sources in the field. A corpus was then created using this method, which includes 44 domain-specific conferences and journals and 57,174 articles published between 1980 and 2019. Next, this corpus was analyzed using an automated text-mining methodology based on a probabilistic topic-modeling approach. As a result of this analysis, 24 main topics were found. In addition, topical trends in the field were revealed. Finally, three main developmental stages of the field were identified as: the programming age, the software development age, and the software optimization age.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2022.3190632</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-8649-1909</orcidid><orcidid>https://orcid.org/0000-0001-6034-4137</orcidid><orcidid>https://orcid.org/0000-0003-0875-9276</orcidid><orcidid>https://orcid.org/0000-0001-9915-6686</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2022, Vol.10, p.74638-74654 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_proquest_journals_2691875765 |
source | IEEE Open Access Journals; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals |
subjects | Age Bibliometrics Corpus creation Engineering research Licenses Modelling Optimization research trends and topics Software Software development Software engineering Systematics Text mining topic model Trends |
title | Detecting Latent Topics and Trends in Software Engineering Research Since 1980 Using Probabilistic Topic Modeling |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T12%3A40%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Detecting%20Latent%20Topics%20and%20Trends%20in%20Software%20Engineering%20Research%20Since%201980%20Using%20Probabilistic%20Topic%20Modeling&rft.jtitle=IEEE%20access&rft.au=Gurcan,%20Fatih&rft.date=2022&rft.volume=10&rft.spage=74638&rft.epage=74654&rft.pages=74638-74654&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2022.3190632&rft_dat=%3Cproquest_cross%3E2691875765%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2691875765&rft_id=info:pmid/&rft_ieee_id=9828025&rft_doaj_id=oai_doaj_org_article_5ccbbb9b0d0e423f881eb1141a5053fc&rfr_iscdi=true |