Evaluating disease similarity based on gene network reconstruction and representation

Abstract Motivation Quantifying the associations between diseases is of great significance in increasing our understanding of disease biology, improving disease diagnosis, re-positioning and developing drugs. Therefore, in recent years, the research of disease similarity has received a lot of attent...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics 2021-10, Vol.37 (20), p.3579-3587
Hauptverfasser: Li, Yang, Keqi, Wang, Wang, Guohua
Format: Artikel
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3587
container_issue 20
container_start_page 3579
container_title Bioinformatics
container_volume 37
creator Li, Yang
Keqi, Wang
Wang, Guohua
description Abstract Motivation Quantifying the associations between diseases is of great significance in increasing our understanding of disease biology, improving disease diagnosis, re-positioning and developing drugs. Therefore, in recent years, the research of disease similarity has received a lot of attention in the field of bioinformatics. Previous work has shown that the combination of the ontology (such as disease ontology and gene ontology) and disease–gene interactions are worthy to be regarded to elucidate diseases and disease associations. However, most of them are either based on the overlap between disease-related gene sets or distance within the ontology’s hierarchy. The diseases in these methods are represented by discrete or sparse feature vectors, which cannot grasp the deep semantic information of diseases. Recently, deep representation learning has been widely studied and gradually applied to various fields of bioinformatics. Based on the hypothesis that disease representation depends on its related gene representations, we propose a disease representation model using two most representative gene resources HumanNet and Gene Ontology to construct a new gene network and learn gene (disease) representations. The similarity between two diseases is computed by the cosine similarity of their corresponding representations. Results We propose a novel approach to compute disease similarity, which integrates two important factors disease-related genes and gene ontology hierarchy to learn disease representation based on deep representation learning. Under the same experimental settings, the AUC value of our method is 0.8074, which improves the most competitive baseline method by 10.1%. The quantitative and qualitative experimental results show that our model can learn effective disease representations and improve the accuracy of disease similarity computation significantly. Availability and implementation The research shows that this method has certain applicability in the prediction of gene-related diseases, the migration of disease treatment methods, drug development and so on. Supplementary information Supplementary data are available at Bioinformatics online.
doi_str_mv 10.1093/bioinformatics/btab252
format Article
fullrecord <record><control><sourceid>oup_TOX</sourceid><recordid>TN_cdi_crossref_primary_10_1093_bioinformatics_btab252</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bioinformatics/btab252</oup_id><sourcerecordid>10.1093/bioinformatics/btab252</sourcerecordid><originalsourceid>FETCH-LOGICAL-c353t-b4adaa26928b01c730ce7a581d62e4028faad9489822b9f3759b47f3e93635073</originalsourceid><addsrcrecordid>eNqNkMtOwzAQRS0EoqXwC1V-INSvxPESVeUhVWJD19H4kcrQOJHtgPr3pEpBYsdqZu7cexcHoSXB9wRLtlKuc77pQgvJ6bhSCRQt6AWaE17inOJCXo47K0XOK8xm6CbGd4wLwjm_RjPGpKgEpnO023zCYRhL_D4zLlqINouudQcILh0zNd4m63y2t95m3qavLnxkwerOxxQGndz4A29GqQ82Wp_gJN2iqwYO0d6d5wLtHjdv6-d8-_r0sn7Y5poVLOWKgwGgpaSVwkQLhrUVUFTElNRyTKsGwEheyYpSJRsmCqm4aJiVrGQFFmyByqlXhy7GYJu6D66FcKwJrk-c6r-c6jOnMbicgv2gWmt-Yz9gRgOZDN3Q_7f0GwgEfWo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Evaluating disease similarity based on gene network reconstruction and representation</title><source>Oxford Journals Open Access Collection</source><creator>Li, Yang ; Keqi, Wang ; Wang, Guohua</creator><creatorcontrib>Li, Yang ; Keqi, Wang ; Wang, Guohua</creatorcontrib><description>Abstract Motivation Quantifying the associations between diseases is of great significance in increasing our understanding of disease biology, improving disease diagnosis, re-positioning and developing drugs. Therefore, in recent years, the research of disease similarity has received a lot of attention in the field of bioinformatics. Previous work has shown that the combination of the ontology (such as disease ontology and gene ontology) and disease–gene interactions are worthy to be regarded to elucidate diseases and disease associations. However, most of them are either based on the overlap between disease-related gene sets or distance within the ontology’s hierarchy. The diseases in these methods are represented by discrete or sparse feature vectors, which cannot grasp the deep semantic information of diseases. Recently, deep representation learning has been widely studied and gradually applied to various fields of bioinformatics. Based on the hypothesis that disease representation depends on its related gene representations, we propose a disease representation model using two most representative gene resources HumanNet and Gene Ontology to construct a new gene network and learn gene (disease) representations. The similarity between two diseases is computed by the cosine similarity of their corresponding representations. Results We propose a novel approach to compute disease similarity, which integrates two important factors disease-related genes and gene ontology hierarchy to learn disease representation based on deep representation learning. Under the same experimental settings, the AUC value of our method is 0.8074, which improves the most competitive baseline method by 10.1%. The quantitative and qualitative experimental results show that our model can learn effective disease representations and improve the accuracy of disease similarity computation significantly. Availability and implementation The research shows that this method has certain applicability in the prediction of gene-related diseases, the migration of disease treatment methods, drug development and so on. Supplementary information Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btab252</identifier><identifier>PMID: 33978702</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><ispartof>Bioinformatics, 2021-10, Vol.37 (20), p.3579-3587</ispartof><rights>The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2021</rights><rights>The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c353t-b4adaa26928b01c730ce7a581d62e4028faad9489822b9f3759b47f3e93635073</citedby><cites>FETCH-LOGICAL-c353t-b4adaa26928b01c730ce7a581d62e4028faad9489822b9f3759b47f3e93635073</cites><orcidid>0000-0002-0403-7287</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,1598,27901,27902</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bioinformatics/btab252$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33978702$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Li, Yang</creatorcontrib><creatorcontrib>Keqi, Wang</creatorcontrib><creatorcontrib>Wang, Guohua</creatorcontrib><title>Evaluating disease similarity based on gene network reconstruction and representation</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Abstract Motivation Quantifying the associations between diseases is of great significance in increasing our understanding of disease biology, improving disease diagnosis, re-positioning and developing drugs. Therefore, in recent years, the research of disease similarity has received a lot of attention in the field of bioinformatics. Previous work has shown that the combination of the ontology (such as disease ontology and gene ontology) and disease–gene interactions are worthy to be regarded to elucidate diseases and disease associations. However, most of them are either based on the overlap between disease-related gene sets or distance within the ontology’s hierarchy. The diseases in these methods are represented by discrete or sparse feature vectors, which cannot grasp the deep semantic information of diseases. Recently, deep representation learning has been widely studied and gradually applied to various fields of bioinformatics. Based on the hypothesis that disease representation depends on its related gene representations, we propose a disease representation model using two most representative gene resources HumanNet and Gene Ontology to construct a new gene network and learn gene (disease) representations. The similarity between two diseases is computed by the cosine similarity of their corresponding representations. Results We propose a novel approach to compute disease similarity, which integrates two important factors disease-related genes and gene ontology hierarchy to learn disease representation based on deep representation learning. Under the same experimental settings, the AUC value of our method is 0.8074, which improves the most competitive baseline method by 10.1%. The quantitative and qualitative experimental results show that our model can learn effective disease representations and improve the accuracy of disease similarity computation significantly. Availability and implementation The research shows that this method has certain applicability in the prediction of gene-related diseases, the migration of disease treatment methods, drug development and so on. Supplementary information Supplementary data are available at Bioinformatics online.</description><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqNkMtOwzAQRS0EoqXwC1V-INSvxPESVeUhVWJD19H4kcrQOJHtgPr3pEpBYsdqZu7cexcHoSXB9wRLtlKuc77pQgvJ6bhSCRQt6AWaE17inOJCXo47K0XOK8xm6CbGd4wLwjm_RjPGpKgEpnO023zCYRhL_D4zLlqINouudQcILh0zNd4m63y2t95m3qavLnxkwerOxxQGndz4A29GqQ82Wp_gJN2iqwYO0d6d5wLtHjdv6-d8-_r0sn7Y5poVLOWKgwGgpaSVwkQLhrUVUFTElNRyTKsGwEheyYpSJRsmCqm4aJiVrGQFFmyByqlXhy7GYJu6D66FcKwJrk-c6r-c6jOnMbicgv2gWmt-Yz9gRgOZDN3Q_7f0GwgEfWo</recordid><startdate>20211025</startdate><enddate>20211025</enddate><creator>Li, Yang</creator><creator>Keqi, Wang</creator><creator>Wang, Guohua</creator><general>Oxford University Press</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-0403-7287</orcidid></search><sort><creationdate>20211025</creationdate><title>Evaluating disease similarity based on gene network reconstruction and representation</title><author>Li, Yang ; Keqi, Wang ; Wang, Guohua</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c353t-b4adaa26928b01c730ce7a581d62e4028faad9489822b9f3759b47f3e93635073</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Yang</creatorcontrib><creatorcontrib>Keqi, Wang</creatorcontrib><creatorcontrib>Wang, Guohua</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Li, Yang</au><au>Keqi, Wang</au><au>Wang, Guohua</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Evaluating disease similarity based on gene network reconstruction and representation</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2021-10-25</date><risdate>2021</risdate><volume>37</volume><issue>20</issue><spage>3579</spage><epage>3587</epage><pages>3579-3587</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><abstract>Abstract Motivation Quantifying the associations between diseases is of great significance in increasing our understanding of disease biology, improving disease diagnosis, re-positioning and developing drugs. Therefore, in recent years, the research of disease similarity has received a lot of attention in the field of bioinformatics. Previous work has shown that the combination of the ontology (such as disease ontology and gene ontology) and disease–gene interactions are worthy to be regarded to elucidate diseases and disease associations. However, most of them are either based on the overlap between disease-related gene sets or distance within the ontology’s hierarchy. The diseases in these methods are represented by discrete or sparse feature vectors, which cannot grasp the deep semantic information of diseases. Recently, deep representation learning has been widely studied and gradually applied to various fields of bioinformatics. Based on the hypothesis that disease representation depends on its related gene representations, we propose a disease representation model using two most representative gene resources HumanNet and Gene Ontology to construct a new gene network and learn gene (disease) representations. The similarity between two diseases is computed by the cosine similarity of their corresponding representations. Results We propose a novel approach to compute disease similarity, which integrates two important factors disease-related genes and gene ontology hierarchy to learn disease representation based on deep representation learning. Under the same experimental settings, the AUC value of our method is 0.8074, which improves the most competitive baseline method by 10.1%. The quantitative and qualitative experimental results show that our model can learn effective disease representations and improve the accuracy of disease similarity computation significantly. Availability and implementation The research shows that this method has certain applicability in the prediction of gene-related diseases, the migration of disease treatment methods, drug development and so on. Supplementary information Supplementary data are available at Bioinformatics online.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>33978702</pmid><doi>10.1093/bioinformatics/btab252</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0002-0403-7287</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2021-10, Vol.37 (20), p.3579-3587
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_crossref_primary_10_1093_bioinformatics_btab252
source Oxford Journals Open Access Collection
title Evaluating disease similarity based on gene network reconstruction and representation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T04%3A17%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-oup_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Evaluating%20disease%20similarity%20based%20on%20gene%20network%20reconstruction%20and%20representation&rft.jtitle=Bioinformatics&rft.au=Li,%20Yang&rft.date=2021-10-25&rft.volume=37&rft.issue=20&rft.spage=3579&rft.epage=3587&rft.pages=3579-3587&rft.issn=1367-4803&rft.eissn=1460-2059&rft_id=info:doi/10.1093/bioinformatics/btab252&rft_dat=%3Coup_TOX%3E10.1093/bioinformatics/btab252%3C/oup_TOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/33978702&rft_oup_id=10.1093/bioinformatics/btab252&rfr_iscdi=true