An overview of topic modeling and its current applications in bioinformatics

Background With the rapid accumulation of biological datasets, machine learning methods designed to automate data analysis are urgently needed. In recent years, so-called topic models that originated from the field of natural language processing have been receiving much attention in bioinformatics b...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SpringerPlus 2016-09, Vol.5 (1), p.1608-1608, Article 1608
Hauptverfasser: Liu, Lin, Tang, Lin, Dong, Wen, Yao, Shaowen, Zhou, Wei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1608
container_issue 1
container_start_page 1608
container_title SpringerPlus
container_volume 5
creator Liu, Lin
Tang, Lin
Dong, Wen
Yao, Shaowen
Zhou, Wei
description Background With the rapid accumulation of biological datasets, machine learning methods designed to automate data analysis are urgently needed. In recent years, so-called topic models that originated from the field of natural language processing have been receiving much attention in bioinformatics because of their interpretability. Our aim was to review the application and development of topic models for bioinformatics. Description This paper starts with the description of a topic model, with a focus on the understanding of topic modeling. A general outline is provided on how to build an application in a topic model and how to develop a topic model. Meanwhile, the literature on application of topic models to biological data was searched and analyzed in depth. According to the types of models and the analogy between the concept of document-topic-word and a biological object (as well as the tasks of a topic model), we categorized the related studies and provided an outlook on the use of topic models for the development of bioinformatics applications. Conclusion Topic modeling is a useful method (in contrast to the traditional means of data reduction in bioinformatics) and enhances researchers’ ability to interpret biological information. Nevertheless, due to the lack of topic models optimized for specific biological data, the studies on topic modeling in biological data still have a long and challenging road ahead. We believe that topic models are a promising method for various applications in bioinformatics research.
doi_str_mv 10.1186/s40064-016-3252-8
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5028368</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1822468423</sourcerecordid><originalsourceid>FETCH-LOGICAL-c536t-80e8447ce2d7086d1d59befa4f07e95420ca196ad5e8fad49c03a508e18184173</originalsourceid><addsrcrecordid>eNp1kUFr3DAQhUVJaEKSH5BLEfSSi9uRLMnyJRBC2gQWemnOQiuPtwq25Ej2lv77aNkkbAvVRWLm05s3PEIuGXxhTKuvWQAoUQFTVc0lr_QHcspZW1dMAzs6eJ-Qi5yfoBzVMNHAR3LCGyU50-yUrG4CjVtMW4-_aezpHCfv6Bg7HHzYUBs66udM3ZIShpnaaRq8s7OPIVMf6NpHH_qYxlJy-Zwc93bIePF6n5HHb3c_b--r1Y_vD7c3q8rJWs2VBtRCNA5514BWHetku8beih4abKXg4Cxrle0k6t52onVQWwkai2MtWFOfkeu97rSsR-xccZbsYKbkR5v-mGi9-bsT_C-ziVsjgeta6SJw9SqQ4vOCeTajzw6HwQaMSzZMcy6UFrwu6Od_0Ke4pFDWK5TQLYDkO0dsT7kUc07Yv5thYHZxmX1cpsRldnGZnYlPh1u8_3gLpwB8D-TSChtMB6P_q_oCXMOgQA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1848900527</pqid></control><display><type>article</type><title>An overview of topic modeling and its current applications in bioinformatics</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Liu, Lin ; Tang, Lin ; Dong, Wen ; Yao, Shaowen ; Zhou, Wei</creator><creatorcontrib>Liu, Lin ; Tang, Lin ; Dong, Wen ; Yao, Shaowen ; Zhou, Wei</creatorcontrib><description>Background With the rapid accumulation of biological datasets, machine learning methods designed to automate data analysis are urgently needed. In recent years, so-called topic models that originated from the field of natural language processing have been receiving much attention in bioinformatics because of their interpretability. Our aim was to review the application and development of topic models for bioinformatics. Description This paper starts with the description of a topic model, with a focus on the understanding of topic modeling. A general outline is provided on how to build an application in a topic model and how to develop a topic model. Meanwhile, the literature on application of topic models to biological data was searched and analyzed in depth. According to the types of models and the analogy between the concept of document-topic-word and a biological object (as well as the tasks of a topic model), we categorized the related studies and provided an outlook on the use of topic models for the development of bioinformatics applications. Conclusion Topic modeling is a useful method (in contrast to the traditional means of data reduction in bioinformatics) and enhances researchers’ ability to interpret biological information. Nevertheless, due to the lack of topic models optimized for specific biological data, the studies on topic modeling in biological data still have a long and challenging road ahead. We believe that topic models are a promising method for various applications in bioinformatics research.</description><identifier>ISSN: 2193-1801</identifier><identifier>EISSN: 2193-1801</identifier><identifier>DOI: 10.1186/s40064-016-3252-8</identifier><identifier>PMID: 27652181</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Bioinformatics ; Biomedical and Life Sciences ; Humanities and Social Sciences ; multidisciplinary ; Review ; Science ; Science (multidisciplinary)</subject><ispartof>SpringerPlus, 2016-09, Vol.5 (1), p.1608-1608, Article 1608</ispartof><rights>The Author(s) 2016</rights><rights>SpringerPlus is a copyright of Springer, 2016.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c536t-80e8447ce2d7086d1d59befa4f07e95420ca196ad5e8fad49c03a508e18184173</citedby><cites>FETCH-LOGICAL-c536t-80e8447ce2d7086d1d59befa4f07e95420ca196ad5e8fad49c03a508e18184173</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5028368/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5028368/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,313,314,727,780,784,792,885,27913,27915,27916,41111,42180,51567,53782,53784</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27652181$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Liu, Lin</creatorcontrib><creatorcontrib>Tang, Lin</creatorcontrib><creatorcontrib>Dong, Wen</creatorcontrib><creatorcontrib>Yao, Shaowen</creatorcontrib><creatorcontrib>Zhou, Wei</creatorcontrib><title>An overview of topic modeling and its current applications in bioinformatics</title><title>SpringerPlus</title><addtitle>SpringerPlus</addtitle><addtitle>Springerplus</addtitle><description>Background With the rapid accumulation of biological datasets, machine learning methods designed to automate data analysis are urgently needed. In recent years, so-called topic models that originated from the field of natural language processing have been receiving much attention in bioinformatics because of their interpretability. Our aim was to review the application and development of topic models for bioinformatics. Description This paper starts with the description of a topic model, with a focus on the understanding of topic modeling. A general outline is provided on how to build an application in a topic model and how to develop a topic model. Meanwhile, the literature on application of topic models to biological data was searched and analyzed in depth. According to the types of models and the analogy between the concept of document-topic-word and a biological object (as well as the tasks of a topic model), we categorized the related studies and provided an outlook on the use of topic models for the development of bioinformatics applications. Conclusion Topic modeling is a useful method (in contrast to the traditional means of data reduction in bioinformatics) and enhances researchers’ ability to interpret biological information. Nevertheless, due to the lack of topic models optimized for specific biological data, the studies on topic modeling in biological data still have a long and challenging road ahead. We believe that topic models are a promising method for various applications in bioinformatics research.</description><subject>Bioinformatics</subject><subject>Biomedical and Life Sciences</subject><subject>Humanities and Social Sciences</subject><subject>multidisciplinary</subject><subject>Review</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><issn>2193-1801</issn><issn>2193-1801</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1kUFr3DAQhUVJaEKSH5BLEfSSi9uRLMnyJRBC2gQWemnOQiuPtwq25Ej2lv77aNkkbAvVRWLm05s3PEIuGXxhTKuvWQAoUQFTVc0lr_QHcspZW1dMAzs6eJ-Qi5yfoBzVMNHAR3LCGyU50-yUrG4CjVtMW4-_aezpHCfv6Bg7HHzYUBs66udM3ZIShpnaaRq8s7OPIVMf6NpHH_qYxlJy-Zwc93bIePF6n5HHb3c_b--r1Y_vD7c3q8rJWs2VBtRCNA5514BWHetku8beih4abKXg4Cxrle0k6t52onVQWwkai2MtWFOfkeu97rSsR-xccZbsYKbkR5v-mGi9-bsT_C-ziVsjgeta6SJw9SqQ4vOCeTajzw6HwQaMSzZMcy6UFrwu6Od_0Ke4pFDWK5TQLYDkO0dsT7kUc07Yv5thYHZxmX1cpsRldnGZnYlPh1u8_3gLpwB8D-TSChtMB6P_q_oCXMOgQA</recordid><startdate>20160920</startdate><enddate>20160920</enddate><creator>Liu, Lin</creator><creator>Tang, Lin</creator><creator>Dong, Wen</creator><creator>Yao, Shaowen</creator><creator>Zhou, Wei</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X2</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FK</scope><scope>ABJCF</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>KB.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M7P</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PATMY</scope><scope>PCBAR</scope><scope>PDBOC</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20160920</creationdate><title>An overview of topic modeling and its current applications in bioinformatics</title><author>Liu, Lin ; Tang, Lin ; Dong, Wen ; Yao, Shaowen ; Zhou, Wei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c536t-80e8447ce2d7086d1d59befa4f07e95420ca196ad5e8fad49c03a508e18184173</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Bioinformatics</topic><topic>Biomedical and Life Sciences</topic><topic>Humanities and Social Sciences</topic><topic>multidisciplinary</topic><topic>Review</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Liu, Lin</creatorcontrib><creatorcontrib>Tang, Lin</creatorcontrib><creatorcontrib>Dong, Wen</creatorcontrib><creatorcontrib>Yao, Shaowen</creatorcontrib><creatorcontrib>Zhou, Wei</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Agricultural Science Collection</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Earth, Atmospheric &amp; Aquatic Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Materials Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agricultural Science Database</collection><collection>Biological Science Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Environmental Science Database</collection><collection>Earth, Atmospheric &amp; Aquatic Science Database</collection><collection>Materials Science Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>Environmental Science Collection</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>SpringerPlus</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Liu, Lin</au><au>Tang, Lin</au><au>Dong, Wen</au><au>Yao, Shaowen</au><au>Zhou, Wei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An overview of topic modeling and its current applications in bioinformatics</atitle><jtitle>SpringerPlus</jtitle><stitle>SpringerPlus</stitle><addtitle>Springerplus</addtitle><date>2016-09-20</date><risdate>2016</risdate><volume>5</volume><issue>1</issue><spage>1608</spage><epage>1608</epage><pages>1608-1608</pages><artnum>1608</artnum><issn>2193-1801</issn><eissn>2193-1801</eissn><abstract>Background With the rapid accumulation of biological datasets, machine learning methods designed to automate data analysis are urgently needed. In recent years, so-called topic models that originated from the field of natural language processing have been receiving much attention in bioinformatics because of their interpretability. Our aim was to review the application and development of topic models for bioinformatics. Description This paper starts with the description of a topic model, with a focus on the understanding of topic modeling. A general outline is provided on how to build an application in a topic model and how to develop a topic model. Meanwhile, the literature on application of topic models to biological data was searched and analyzed in depth. According to the types of models and the analogy between the concept of document-topic-word and a biological object (as well as the tasks of a topic model), we categorized the related studies and provided an outlook on the use of topic models for the development of bioinformatics applications. Conclusion Topic modeling is a useful method (in contrast to the traditional means of data reduction in bioinformatics) and enhances researchers’ ability to interpret biological information. Nevertheless, due to the lack of topic models optimized for specific biological data, the studies on topic modeling in biological data still have a long and challenging road ahead. We believe that topic models are a promising method for various applications in bioinformatics research.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><pmid>27652181</pmid><doi>10.1186/s40064-016-3252-8</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2193-1801
ispartof SpringerPlus, 2016-09, Vol.5 (1), p.1608-1608, Article 1608
issn 2193-1801
2193-1801
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5028368
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central Open Access; Springer Nature OA Free Journals; PubMed Central; Free Full-Text Journals in Chemistry
subjects Bioinformatics
Biomedical and Life Sciences
Humanities and Social Sciences
multidisciplinary
Review
Science
Science (multidisciplinary)
title An overview of topic modeling and its current applications in bioinformatics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T04%3A42%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20overview%20of%20topic%20modeling%20and%20its%20current%20applications%20in%20bioinformatics&rft.jtitle=SpringerPlus&rft.au=Liu,%20Lin&rft.date=2016-09-20&rft.volume=5&rft.issue=1&rft.spage=1608&rft.epage=1608&rft.pages=1608-1608&rft.artnum=1608&rft.issn=2193-1801&rft.eissn=2193-1801&rft_id=info:doi/10.1186/s40064-016-3252-8&rft_dat=%3Cproquest_pubme%3E1822468423%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1848900527&rft_id=info:pmid/27652181&rfr_iscdi=true