Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model

Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Acta biotheoretica 2018-12, Vol.66 (4), p.315-331
Hauptverfasser: Le, Duc-Hau, Nguyen-Ngoc, Doanh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 331
container_issue 4
container_start_page 315
container_title Acta biotheoretica
container_volume 66
creator Le, Duc-Hau
Nguyen-Ngoc, Doanh
description Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.
doi_str_mv 10.1007/s10441-018-9325-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2032435168</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2032435168</sourcerecordid><originalsourceid>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</originalsourceid><addsrcrecordid>eNp1kU1rFTEUhoMo9lr9AW4k4MZNNB-TZGZZWq3FK4LWdcjHmSHl3uSazCjtrzfjrQqCq3DI8z455EXoOaOvGaX6TWW06xihrCeD4JLcPUAbJjUnvZD9Q7ShlDIiRcdP0JNab9o4KE0foxM-aEqVohu0XJRlwp_hkGucY04xTdjd4qs0w1TsvI4fUv6R8EWsYCuQS0iAbQp4DZJrWyaY8Vmt2Ue7CiqOCVv8BfaR1OUA5XsLBrwFW37JP-YAu6fo0Wh3FZ7dn6fo67u31-fvyfbT5dX52ZZ4oflMlBqpD4OgkgWmgwRHneYjGyXvpWXaBdeNlnHvmFOuC1p572n7FKuV48KKU_Tq6D2U_G2BOpt9rB52O5sgL9VwKngnJFN9Q1_-g97kpaS23UoxplQ3iEaxI-VLrrXAaA4l7m25NYyatRNz7MS0TszaiblrmRf35sXtIfxJ_C6hAfwI1HaVJih_n_6_9SdqO5fk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2031166493</pqid></control><display><type>article</type><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><source>MEDLINE</source><source>SpringerLink Journals - AutoHoldings</source><creator>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</creator><creatorcontrib>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</creatorcontrib><description>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</description><identifier>ISSN: 0001-5342</identifier><identifier>EISSN: 1572-8358</identifier><identifier>DOI: 10.1007/s10441-018-9325-z</identifier><identifier>PMID: 29700660</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Area Under Curve ; Artificial intelligence ; Chemical compounds ; Computational Biology - methods ; Computer applications ; Construction standards ; Disease ; Diseases ; Drug Repositioning ; Drug Therapy - methods ; Drugs ; Education ; Evolutionary Biology ; Gold ; Humans ; Learning algorithms ; Models, Statistical ; Pharmaceutical Preparations - chemistry ; Pharmacology - methods ; Pharmacy - methods ; Philosophy ; Philosophy of Biology ; Regular Article ; Reproducibility of Results ; Software ; Supervised Machine Learning</subject><ispartof>Acta biotheoretica, 2018-12, Vol.66 (4), p.315-331</ispartof><rights>Springer Science+Business Media B.V., part of Springer Nature 2018</rights><rights>Acta Biotheoretica is a copyright of Springer, (2018). All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</citedby><cites>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</cites><orcidid>0000-0002-4951-5916</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10441-018-9325-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10441-018-9325-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29700660$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Le, Duc-Hau</creatorcontrib><creatorcontrib>Nguyen-Ngoc, Doanh</creatorcontrib><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><title>Acta biotheoretica</title><addtitle>Acta Biotheor</addtitle><addtitle>Acta Biotheor</addtitle><description>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</description><subject>Area Under Curve</subject><subject>Artificial intelligence</subject><subject>Chemical compounds</subject><subject>Computational Biology - methods</subject><subject>Computer applications</subject><subject>Construction standards</subject><subject>Disease</subject><subject>Diseases</subject><subject>Drug Repositioning</subject><subject>Drug Therapy - methods</subject><subject>Drugs</subject><subject>Education</subject><subject>Evolutionary Biology</subject><subject>Gold</subject><subject>Humans</subject><subject>Learning algorithms</subject><subject>Models, Statistical</subject><subject>Pharmaceutical Preparations - chemistry</subject><subject>Pharmacology - methods</subject><subject>Pharmacy - methods</subject><subject>Philosophy</subject><subject>Philosophy of Biology</subject><subject>Regular Article</subject><subject>Reproducibility of Results</subject><subject>Software</subject><subject>Supervised Machine Learning</subject><issn>0001-5342</issn><issn>1572-8358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1kU1rFTEUhoMo9lr9AW4k4MZNNB-TZGZZWq3FK4LWdcjHmSHl3uSazCjtrzfjrQqCq3DI8z455EXoOaOvGaX6TWW06xihrCeD4JLcPUAbJjUnvZD9Q7ShlDIiRcdP0JNab9o4KE0foxM-aEqVohu0XJRlwp_hkGucY04xTdjd4qs0w1TsvI4fUv6R8EWsYCuQS0iAbQp4DZJrWyaY8Vmt2Ue7CiqOCVv8BfaR1OUA5XsLBrwFW37JP-YAu6fo0Wh3FZ7dn6fo67u31-fvyfbT5dX52ZZ4oflMlBqpD4OgkgWmgwRHneYjGyXvpWXaBdeNlnHvmFOuC1p572n7FKuV48KKU_Tq6D2U_G2BOpt9rB52O5sgL9VwKngnJFN9Q1_-g97kpaS23UoxplQ3iEaxI-VLrrXAaA4l7m25NYyatRNz7MS0TszaiblrmRf35sXtIfxJ_C6hAfwI1HaVJih_n_6_9SdqO5fk</recordid><startdate>20181201</startdate><enddate>20181201</enddate><creator>Le, Duc-Hau</creator><creator>Nguyen-Ngoc, Doanh</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QP</scope><scope>7QR</scope><scope>7T5</scope><scope>7TK</scope><scope>7TM</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4951-5916</orcidid></search><sort><creationdate>20181201</creationdate><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><author>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Area Under Curve</topic><topic>Artificial intelligence</topic><topic>Chemical compounds</topic><topic>Computational Biology - methods</topic><topic>Computer applications</topic><topic>Construction standards</topic><topic>Disease</topic><topic>Diseases</topic><topic>Drug Repositioning</topic><topic>Drug Therapy - methods</topic><topic>Drugs</topic><topic>Education</topic><topic>Evolutionary Biology</topic><topic>Gold</topic><topic>Humans</topic><topic>Learning algorithms</topic><topic>Models, Statistical</topic><topic>Pharmaceutical Preparations - chemistry</topic><topic>Pharmacology - methods</topic><topic>Pharmacy - methods</topic><topic>Philosophy</topic><topic>Philosophy of Biology</topic><topic>Regular Article</topic><topic>Reproducibility of Results</topic><topic>Software</topic><topic>Supervised Machine Learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Le, Duc-Hau</creatorcontrib><creatorcontrib>Nguyen-Ngoc, Doanh</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Immunology Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><jtitle>Acta biotheoretica</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Le, Duc-Hau</au><au>Nguyen-Ngoc, Doanh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</atitle><jtitle>Acta biotheoretica</jtitle><stitle>Acta Biotheor</stitle><addtitle>Acta Biotheor</addtitle><date>2018-12-01</date><risdate>2018</risdate><volume>66</volume><issue>4</issue><spage>315</spage><epage>331</epage><pages>315-331</pages><issn>0001-5342</issn><eissn>1572-8358</eissn><abstract>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><pmid>29700660</pmid><doi>10.1007/s10441-018-9325-z</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-4951-5916</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0001-5342
ispartof Acta biotheoretica, 2018-12, Vol.66 (4), p.315-331
issn 0001-5342
1572-8358
language eng
recordid cdi_proquest_miscellaneous_2032435168
source MEDLINE; SpringerLink Journals - AutoHoldings
subjects Area Under Curve
Artificial intelligence
Chemical compounds
Computational Biology - methods
Computer applications
Construction standards
Disease
Diseases
Drug Repositioning
Drug Therapy - methods
Drugs
Education
Evolutionary Biology
Gold
Humans
Learning algorithms
Models, Statistical
Pharmaceutical Preparations - chemistry
Pharmacology - methods
Pharmacy - methods
Philosophy
Philosophy of Biology
Regular Article
Reproducibility of Results
Software
Supervised Machine Learning
title Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T03%3A34%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Drug%20Repositioning%20by%20Integrating%20Known%20Disease-Gene%20and%20Drug-Target%20Associations%20in%20a%20Semi-supervised%20Learning%20Model&rft.jtitle=Acta%20biotheoretica&rft.au=Le,%20Duc-Hau&rft.date=2018-12-01&rft.volume=66&rft.issue=4&rft.spage=315&rft.epage=331&rft.pages=315-331&rft.issn=0001-5342&rft.eissn=1572-8358&rft_id=info:doi/10.1007/s10441-018-9325-z&rft_dat=%3Cproquest_cross%3E2032435168%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2031166493&rft_id=info:pmid/29700660&rfr_iscdi=true