Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model
Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the...
Gespeichert in:
Veröffentlicht in: | Acta biotheoretica 2018-12, Vol.66 (4), p.315-331 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 331 |
---|---|
container_issue | 4 |
container_start_page | 315 |
container_title | Acta biotheoretica |
container_volume | 66 |
creator | Le, Duc-Hau Nguyen-Ngoc, Doanh |
description | Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource. |
doi_str_mv | 10.1007/s10441-018-9325-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2032435168</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2032435168</sourcerecordid><originalsourceid>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</originalsourceid><addsrcrecordid>eNp1kU1rFTEUhoMo9lr9AW4k4MZNNB-TZGZZWq3FK4LWdcjHmSHl3uSazCjtrzfjrQqCq3DI8z455EXoOaOvGaX6TWW06xihrCeD4JLcPUAbJjUnvZD9Q7ShlDIiRcdP0JNab9o4KE0foxM-aEqVohu0XJRlwp_hkGucY04xTdjd4qs0w1TsvI4fUv6R8EWsYCuQS0iAbQp4DZJrWyaY8Vmt2Ue7CiqOCVv8BfaR1OUA5XsLBrwFW37JP-YAu6fo0Wh3FZ7dn6fo67u31-fvyfbT5dX52ZZ4oflMlBqpD4OgkgWmgwRHneYjGyXvpWXaBdeNlnHvmFOuC1p572n7FKuV48KKU_Tq6D2U_G2BOpt9rB52O5sgL9VwKngnJFN9Q1_-g97kpaS23UoxplQ3iEaxI-VLrrXAaA4l7m25NYyatRNz7MS0TszaiblrmRf35sXtIfxJ_C6hAfwI1HaVJih_n_6_9SdqO5fk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2031166493</pqid></control><display><type>article</type><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><source>MEDLINE</source><source>SpringerLink Journals - AutoHoldings</source><creator>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</creator><creatorcontrib>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</creatorcontrib><description>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</description><identifier>ISSN: 0001-5342</identifier><identifier>EISSN: 1572-8358</identifier><identifier>DOI: 10.1007/s10441-018-9325-z</identifier><identifier>PMID: 29700660</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Area Under Curve ; Artificial intelligence ; Chemical compounds ; Computational Biology - methods ; Computer applications ; Construction standards ; Disease ; Diseases ; Drug Repositioning ; Drug Therapy - methods ; Drugs ; Education ; Evolutionary Biology ; Gold ; Humans ; Learning algorithms ; Models, Statistical ; Pharmaceutical Preparations - chemistry ; Pharmacology - methods ; Pharmacy - methods ; Philosophy ; Philosophy of Biology ; Regular Article ; Reproducibility of Results ; Software ; Supervised Machine Learning</subject><ispartof>Acta biotheoretica, 2018-12, Vol.66 (4), p.315-331</ispartof><rights>Springer Science+Business Media B.V., part of Springer Nature 2018</rights><rights>Acta Biotheoretica is a copyright of Springer, (2018). All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</citedby><cites>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</cites><orcidid>0000-0002-4951-5916</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10441-018-9325-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10441-018-9325-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29700660$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Le, Duc-Hau</creatorcontrib><creatorcontrib>Nguyen-Ngoc, Doanh</creatorcontrib><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><title>Acta biotheoretica</title><addtitle>Acta Biotheor</addtitle><addtitle>Acta Biotheor</addtitle><description>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</description><subject>Area Under Curve</subject><subject>Artificial intelligence</subject><subject>Chemical compounds</subject><subject>Computational Biology - methods</subject><subject>Computer applications</subject><subject>Construction standards</subject><subject>Disease</subject><subject>Diseases</subject><subject>Drug Repositioning</subject><subject>Drug Therapy - methods</subject><subject>Drugs</subject><subject>Education</subject><subject>Evolutionary Biology</subject><subject>Gold</subject><subject>Humans</subject><subject>Learning algorithms</subject><subject>Models, Statistical</subject><subject>Pharmaceutical Preparations - chemistry</subject><subject>Pharmacology - methods</subject><subject>Pharmacy - methods</subject><subject>Philosophy</subject><subject>Philosophy of Biology</subject><subject>Regular Article</subject><subject>Reproducibility of Results</subject><subject>Software</subject><subject>Supervised Machine Learning</subject><issn>0001-5342</issn><issn>1572-8358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1kU1rFTEUhoMo9lr9AW4k4MZNNB-TZGZZWq3FK4LWdcjHmSHl3uSazCjtrzfjrQqCq3DI8z455EXoOaOvGaX6TWW06xihrCeD4JLcPUAbJjUnvZD9Q7ShlDIiRcdP0JNab9o4KE0foxM-aEqVohu0XJRlwp_hkGucY04xTdjd4qs0w1TsvI4fUv6R8EWsYCuQS0iAbQp4DZJrWyaY8Vmt2Ue7CiqOCVv8BfaR1OUA5XsLBrwFW37JP-YAu6fo0Wh3FZ7dn6fo67u31-fvyfbT5dX52ZZ4oflMlBqpD4OgkgWmgwRHneYjGyXvpWXaBdeNlnHvmFOuC1p572n7FKuV48KKU_Tq6D2U_G2BOpt9rB52O5sgL9VwKngnJFN9Q1_-g97kpaS23UoxplQ3iEaxI-VLrrXAaA4l7m25NYyatRNz7MS0TszaiblrmRf35sXtIfxJ_C6hAfwI1HaVJih_n_6_9SdqO5fk</recordid><startdate>20181201</startdate><enddate>20181201</enddate><creator>Le, Duc-Hau</creator><creator>Nguyen-Ngoc, Doanh</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QP</scope><scope>7QR</scope><scope>7T5</scope><scope>7TK</scope><scope>7TM</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4951-5916</orcidid></search><sort><creationdate>20181201</creationdate><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><author>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Area Under Curve</topic><topic>Artificial intelligence</topic><topic>Chemical compounds</topic><topic>Computational Biology - methods</topic><topic>Computer applications</topic><topic>Construction standards</topic><topic>Disease</topic><topic>Diseases</topic><topic>Drug Repositioning</topic><topic>Drug Therapy - methods</topic><topic>Drugs</topic><topic>Education</topic><topic>Evolutionary Biology</topic><topic>Gold</topic><topic>Humans</topic><topic>Learning algorithms</topic><topic>Models, Statistical</topic><topic>Pharmaceutical Preparations - chemistry</topic><topic>Pharmacology - methods</topic><topic>Pharmacy - methods</topic><topic>Philosophy</topic><topic>Philosophy of Biology</topic><topic>Regular Article</topic><topic>Reproducibility of Results</topic><topic>Software</topic><topic>Supervised Machine Learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Le, Duc-Hau</creatorcontrib><creatorcontrib>Nguyen-Ngoc, Doanh</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Immunology Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><jtitle>Acta biotheoretica</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Le, Duc-Hau</au><au>Nguyen-Ngoc, Doanh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</atitle><jtitle>Acta biotheoretica</jtitle><stitle>Acta Biotheor</stitle><addtitle>Acta Biotheor</addtitle><date>2018-12-01</date><risdate>2018</risdate><volume>66</volume><issue>4</issue><spage>315</spage><epage>331</epage><pages>315-331</pages><issn>0001-5342</issn><eissn>1572-8358</eissn><abstract>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><pmid>29700660</pmid><doi>10.1007/s10441-018-9325-z</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-4951-5916</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0001-5342 |
ispartof | Acta biotheoretica, 2018-12, Vol.66 (4), p.315-331 |
issn | 0001-5342 1572-8358 |
language | eng |
recordid | cdi_proquest_miscellaneous_2032435168 |
source | MEDLINE; SpringerLink Journals - AutoHoldings |
subjects | Area Under Curve Artificial intelligence Chemical compounds Computational Biology - methods Computer applications Construction standards Disease Diseases Drug Repositioning Drug Therapy - methods Drugs Education Evolutionary Biology Gold Humans Learning algorithms Models, Statistical Pharmaceutical Preparations - chemistry Pharmacology - methods Pharmacy - methods Philosophy Philosophy of Biology Regular Article Reproducibility of Results Software Supervised Machine Learning |
title | Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T03%3A34%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Drug%20Repositioning%20by%20Integrating%20Known%20Disease-Gene%20and%20Drug-Target%20Associations%20in%20a%20Semi-supervised%20Learning%20Model&rft.jtitle=Acta%20biotheoretica&rft.au=Le,%20Duc-Hau&rft.date=2018-12-01&rft.volume=66&rft.issue=4&rft.spage=315&rft.epage=331&rft.pages=315-331&rft.issn=0001-5342&rft.eissn=1572-8358&rft_id=info:doi/10.1007/s10441-018-9325-z&rft_dat=%3Cproquest_cross%3E2032435168%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2031166493&rft_id=info:pmid/29700660&rfr_iscdi=true |