Topic Modeling: Perspectives From a Literature Review

Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2023, Vol.11, p.4066-4078
Hauptverfasser: A., Andres M. Grisales, Robledo, Sebastian, Zuluaga, Martha
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 4078
container_issue
container_start_page 4066
container_title IEEE access
container_volume 11
creator A., Andres M. Grisales
Robledo, Sebastian
Zuluaga, Martha
description Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.
doi_str_mv 10.1109/ACCESS.2022.3232939
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2022_3232939</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10002352</ieee_id><doaj_id>oai_doaj_org_article_9afd3e2133234a41bd4c25bdce0ebaa6</doaj_id><sourcerecordid>2766634163</sourcerecordid><originalsourceid>FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</originalsourceid><addsrcrecordid>eNpNkFFPwjAUhRejiQT5BfqwxGewvd3K5htZUEkwGsHn5q67JSVAZzcw_nuLI4a-tDk959ybL4puORtxzvKHSVFMF4sRMICRAAG5yC-iHnCZD0Uq5OXZ-zoaNM2ahZMFKR33onTpaqvjV1fRxu5Wj_E7-aYm3doDNfGTd9sY47ltyWO79xR_0MHS9010ZXDT0OB096PPp-myeBnO355nxWQ-1AnL26GhCkBmWgBkXOdojDZ5ChqRkixFrMrMmDx8IwUZ0CQMUKYMhTRSEhP9aNb1Vg7XqvZ2i_5HObTqT3B-pdC3Vm9IhfZKEHARECSY8LJKNKRlpYlRiShD133XVXv3taemVWu397uwvoKxlFIkXIrgEp1Le9c0nsz_VM7UEbfqcKsjbnXCHVJ3XcoS0VmCMRApiF_Ub3s9</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2766634163</pqid></control><display><type>article</type><title>Topic Modeling: Perspectives From a Literature Review</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha</creator><creatorcontrib>A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha</creatorcontrib><description>Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2022.3232939</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Bibliographies ; Bibliometrics ; Codes ; Data mining ; Data models ; Empirical analysis ; Literature review ; Literature reviews ; Machine learning ; Modelling ; Natural language processing ; Scientometrics ; Social networks ; Systematics ; topic modeling ; Unstructured data</subject><ispartof>IEEE access, 2023, Vol.11, p.4066-4078</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</citedby><cites>FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</cites><orcidid>0000-0002-4385-4474 ; 0000-0003-4357-4402 ; 0000-0003-1720-8476</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10002352$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,860,2096,4010,27610,27900,27901,27902,54908</link.rule.ids></links><search><creatorcontrib>A., Andres M. Grisales</creatorcontrib><creatorcontrib>Robledo, Sebastian</creatorcontrib><creatorcontrib>Zuluaga, Martha</creatorcontrib><title>Topic Modeling: Perspectives From a Literature Review</title><title>IEEE access</title><addtitle>Access</addtitle><description>Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.</description><subject>Bibliographies</subject><subject>Bibliometrics</subject><subject>Codes</subject><subject>Data mining</subject><subject>Data models</subject><subject>Empirical analysis</subject><subject>Literature review</subject><subject>Literature reviews</subject><subject>Machine learning</subject><subject>Modelling</subject><subject>Natural language processing</subject><subject>Scientometrics</subject><subject>Social networks</subject><subject>Systematics</subject><subject>topic modeling</subject><subject>Unstructured data</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNkFFPwjAUhRejiQT5BfqwxGewvd3K5htZUEkwGsHn5q67JSVAZzcw_nuLI4a-tDk959ybL4puORtxzvKHSVFMF4sRMICRAAG5yC-iHnCZD0Uq5OXZ-zoaNM2ahZMFKR33onTpaqvjV1fRxu5Wj_E7-aYm3doDNfGTd9sY47ltyWO79xR_0MHS9010ZXDT0OB096PPp-myeBnO355nxWQ-1AnL26GhCkBmWgBkXOdojDZ5ChqRkixFrMrMmDx8IwUZ0CQMUKYMhTRSEhP9aNb1Vg7XqvZ2i_5HObTqT3B-pdC3Vm9IhfZKEHARECSY8LJKNKRlpYlRiShD133XVXv3taemVWu397uwvoKxlFIkXIrgEp1Le9c0nsz_VM7UEbfqcKsjbnXCHVJ3XcoS0VmCMRApiF_Ub3s9</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>A., Andres M. Grisales</creator><creator>Robledo, Sebastian</creator><creator>Zuluaga, Martha</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-4385-4474</orcidid><orcidid>https://orcid.org/0000-0003-4357-4402</orcidid><orcidid>https://orcid.org/0000-0003-1720-8476</orcidid></search><sort><creationdate>2023</creationdate><title>Topic Modeling: Perspectives From a Literature Review</title><author>A., Andres M. Grisales ; Robledo, Sebastian ; Zuluaga, Martha</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c409t-fed2268c32281c9affcf952caae485aadb8ff9c32ae9522af402a650a36f66e03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Bibliographies</topic><topic>Bibliometrics</topic><topic>Codes</topic><topic>Data mining</topic><topic>Data models</topic><topic>Empirical analysis</topic><topic>Literature review</topic><topic>Literature reviews</topic><topic>Machine learning</topic><topic>Modelling</topic><topic>Natural language processing</topic><topic>Scientometrics</topic><topic>Social networks</topic><topic>Systematics</topic><topic>topic modeling</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>A., Andres M. Grisales</creatorcontrib><creatorcontrib>Robledo, Sebastian</creatorcontrib><creatorcontrib>Zuluaga, Martha</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>A., Andres M. Grisales</au><au>Robledo, Sebastian</au><au>Zuluaga, Martha</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Topic Modeling: Perspectives From a Literature Review</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2023</date><risdate>2023</risdate><volume>11</volume><spage>4066</spage><epage>4078</epage><pages>4066-4078</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Topic modeling is a Natural Language Processing technique that has gained popularity over the last ten years, with applications in multiple fields of knowledge. However, there is insufficient empirical evidence to show how this field of study has developed over the years, as well as the main models that have been applied in different contexts. The objective of this paper is to analyze the evolution of the topic modeling technique, the main areas in which it has been applied, and the models that are recommended for specific types of data. The methodology applied is based on bibliometric analysis. First, we searched the Web of Science and the Scopus databases. We then used scientometric techniques and a Tree of Science methodology, which allowed us to analyze the search results from the perspectives of classics, structure, and trends. The results show that the USA and China are among the most productive countries in this field and the applications have been mainly in the identification of sub-topics in short texts, such as social networks and blogs. The main conclusion of this work is that topic modeling is a versatile technique that can complement systematic literature reviews and that has been well-received in different academic and research contexts. The results of this study will help researchers and academics to recognize the importance of these techniques for reviewing large volumes of unstructured information, such as research articles, and in general, for systematic literature reviews.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2022.3232939</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-4385-4474</orcidid><orcidid>https://orcid.org/0000-0003-4357-4402</orcidid><orcidid>https://orcid.org/0000-0003-1720-8476</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2023, Vol.11, p.4066-4078
issn 2169-3536
2169-3536
language eng
recordid cdi_crossref_primary_10_1109_ACCESS_2022_3232939
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Bibliographies
Bibliometrics
Codes
Data mining
Data models
Empirical analysis
Literature review
Literature reviews
Machine learning
Modelling
Natural language processing
Scientometrics
Social networks
Systematics
topic modeling
Unstructured data
title Topic Modeling: Perspectives From a Literature Review
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T21%3A29%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Topic%20Modeling:%20Perspectives%20From%20a%20Literature%20Review&rft.jtitle=IEEE%20access&rft.au=A.,%20Andres%20M.%20Grisales&rft.date=2023&rft.volume=11&rft.spage=4066&rft.epage=4078&rft.pages=4066-4078&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2022.3232939&rft_dat=%3Cproquest_cross%3E2766634163%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2766634163&rft_id=info:pmid/&rft_ieee_id=10002352&rft_doaj_id=oai_doaj_org_article_9afd3e2133234a41bd4c25bdce0ebaa6&rfr_iscdi=true