Topic Modeling: Perspectives From a Literature Review
Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models...
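Published in: | IEEE access 2023, Vol.11, p.4066-4078 |
---|---|
Main authors: | A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha |
Format: | Article |
Language: | eng |
Subjects: | Bibliographies ; Bibliometrics ; Codes ; Data mining ; Data models ; Empirical analysis ; Literature review ; Literature reviews ; Machine learning ; Modelling ; Natural language processing ; Scientometrics ; Social networks ; Systematics ; topic modeling ; Unstructured data |
Online access: | Full text |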
container_end_page | 4078 |
---|---|
container_issue | |
container_start_page | 4066 |
container_title | IEEE access |
container_volume | 11 |
creator | A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha |
description | Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews. |
doi_str_mv | 10.1109/ACCESS.2022.3232939 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2022_3232939</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10002352</ieee_id><doaj_id>oai_doaj_org_article_9afd3e2133234a41bd4c25bdce0ebaa6</doaj_id><sourcerecordid>2766634163</sourcerecordid><originalsourceid>FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</originalsourceid><addsrcrecordid>eNpNkFFPwjAUhRejiQT5BfqwxGewvd3K5htZUEkwGsHn5q67JSVAZzcw_nuLI4a-tDk959ybL4puORtxzvKHSVFMF4sRMICRAAG5yC-iHnCZD0Uq5OXZ-zoaNM2ahZMFKR33onTpaqvjV1fRxu5Wj_E7-aYm3doDNfGTd9sY47ltyWO79xR_0MHS9010ZXDT0OB096PPp-myeBnO355nxWQ-1AnL26GhCkBmWgBkXOdojDZ5ChqRkixFrMrMmDx8IwUZ0CQMUKYMhTRSEhP9aNb1Vg7XqvZ2i_5HObTqT3B-pdC3Vm9IhfZKEHARECSY8LJKNKRlpYlRiShD133XVXv3taemVWu397uwvoKxlFIkXIrgEp1Le9c0nsz_VM7UEbfqcKsjbnXCHVJ3XcoS0VmCMRApiF_Ub3s9</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2766634163</pqid></control><display><type>article</type><title>Topic Modeling: Perspectives From a Literature Review</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha</creator><creatorcontrib>A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha</creatorcontrib><description>Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. 
The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2022.3232939</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Bibliographies ; Bibliometrics ; Codes ; Data mining ; Data models ; Empirical analysis ; Literature review ; Literature reviews ; Machine learning ; Modelling ; Natural language processing ; Scientometrics ; Social networks ; Systematics ; topic modeling ; Unstructured data</subject><ispartof>IEEE access, 2023, Vol.11, p.4066-4078</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</citedby><cites>FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</cites><orcidid>0000-0002-4385-4474 ; 0000-0003-4357-4402 ; 0000-0003-1720-8476</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10002352$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,860,2096,4010,27610,27900,27901,27902,54908</link.rule.ids></links><search><creatorcontrib>A., Andres M. Grisales</creatorcontrib><creatorcontrib>Robledo, Sebastian</creatorcontrib><creatorcontrib>Zuluaga, Martha</creatorcontrib><title>Topic Modeling: Perspectives From a Literature Review</title><title>IEEE access</title><addtitle>Access</addtitle><description>Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. 
The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.</description><subject>Bibliographies</subject><subject>Bibliometrics</subject><subject>Codes</subject><subject>Data mining</subject><subject>Data models</subject><subject>Empirical analysis</subject><subject>Literature review</subject><subject>Literature reviews</subject><subject>Machine learning</subject><subject>Modelling</subject><subject>Natural language processing</subject><subject>Scientometrics</subject><subject>Social networks</subject><subject>Systematics</subject><subject>topic modeling</subject><subject>Unstructured data</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNkFFPwjAUhRejiQT5BfqwxGewvd3K5htZUEkwGsHn5q67JSVAZzcw_nuLI4a-tDk959ybL4puORtxzvKHSVFMF4sRMICRAAG5yC-iHnCZD0Uq5OXZ-zoaNM2ahZMFKR33onTpaqvjV1fRxu5Wj_E7-aYm3doDNfGTd9sY47ltyWO79xR_0MHS9010ZXDT0OB096PPp-myeBnO355nxWQ-1AnL26GhCkBmWgBkXOdojDZ5ChqRkixFrMrMmDx8IwUZ0CQMUKYMhTRSEhP9aNb1Vg7XqvZ2i_5HObTqT3B-pdC3Vm9IhfZKEHARECSY8LJKNKRlpYlRiShD133XVXv3taemVWu397uwvoKxlFIkXIrgEp1Le9c0nsz_VM7UEbfqcKsjbnXCHVJ3XcoS0VmCMRApiF_Ub3s9</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>A., Andres M. 
Grisales</creator><creator>Robledo, Sebastian</creator><creator>Zuluaga, Martha</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-4385-4474</orcidid><orcidid>https://orcid.org/0000-0003-4357-4402</orcidid><orcidid>https://orcid.org/0000-0003-1720-8476</orcidid></search><sort><creationdate>2023</creationdate><title>Topic Modeling: Perspectives From a Literature Review</title><author>A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Bibliographies</topic><topic>Bibliometrics</topic><topic>Codes</topic><topic>Data mining</topic><topic>Data models</topic><topic>Empirical analysis</topic><topic>Literature review</topic><topic>Literature reviews</topic><topic>Machine learning</topic><topic>Modelling</topic><topic>Natural language processing</topic><topic>Scientometrics</topic><topic>Social networks</topic><topic>Systematics</topic><topic>topic modeling</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>A., Andres M. 
Grisales</creatorcontrib><creatorcontrib>Robledo, Sebastian</creatorcontrib><creatorcontrib>Zuluaga, Martha</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>A., Andres M. Grisales</au><au>Robledo, Sebastian</au><au>Zuluaga, Martha</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Topic Modeling: Perspectives From a Literature Review</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2023</date><risdate>2023</risdate><volume>11</volume><spage>4066</spage><epage>4078</epage><pages>4066-4078</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. 
However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2022.3232939</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-4385-4474</orcidid><orcidid>https://orcid.org/0000-0003-4357-4402</orcidid><orcidid>https://orcid.org/0000-0003-1720-8476</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2023, Vol.11, p.4066-4078 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_crossref_primary_10_1109_ACCESS_2022_3232939 |
source | IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals |
subjects | Bibliographies ; Bibliometrics ; Codes ; Data mining ; Data models ; Empirical analysis ; Literature review ; Literature reviews ; Machine learning ; Modelling ; Natural language processing ; Scientometrics ; Social networks ; Systematics ; topic modeling ; Unstructured data |
title | Topic Modeling: Perspectives From a Literature Review |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T21%3A29%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Topic%20Modeling:%20Perspectives%20From%20a%20Literature%20Review&rft.jtitle=IEEE%20access&rft.au=A.,%20Andres%20M.%20Grisales&rft.date=2023&rft.volume=11&rft.spage=4066&rft.epage=4078&rft.pages=4066-4078&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2022.3232939&rft_dat=%3Cproquest_cross%3E2766634163%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2766634163&rft_id=info:pmid/&rft_ieee_id=10002352&rft_doaj_id=oai_doaj_org_article_9afd3e2133234a41bd4c25bdce0ebaa6&rfr_iscdi=true |