Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model

Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Acta biotheoretica 2018-12, Vol.66 (4), p.315-331
Hauptverfasser:	Le, Duc-Hau, Nguyen-Ngoc, Doanh
Format:	Artikel
Sprache:	eng
Schlagworte:	Area Under Curve Artificial intelligence Chemical compounds Computational Biology - methods Computer applications Construction standards Disease Diseases Drug Repositioning Drug Therapy - methods Drugs Education Evolutionary Biology Gold Humans Learning algorithms Models, Statistical Pharmaceutical Preparations - chemistry Pharmacology - methods Pharmacy - methods Philosophy Philosophy of Biology Regular Article Reproducibility of Results Software Supervised Machine Learning
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	331
container_issue	4
container_start_page	315
container_title	Acta biotheoretica
container_volume	66
creator	Le, Duc-Hau Nguyen-Ngoc, Doanh
description	Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.
doi_str_mv	10.1007/s10441-018-9325-z
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2032435168</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2032435168</sourcerecordid><originalsourceid>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</originalsourceid><addsrcrecordid>eNp1kU1rFTEUhoMo9lr9AW4k4MZNNB-TZGZZWq3FK4LWdcjHmSHl3uSazCjtrzfjrQqCq3DI8z455EXoOaOvGaX6TWW06xihrCeD4JLcPUAbJjUnvZD9Q7ShlDIiRcdP0JNab9o4KE0foxM-aEqVohu0XJRlwp_hkGucY04xTdjd4qs0w1TsvI4fUv6R8EWsYCuQS0iAbQp4DZJrWyaY8Vmt2Ue7CiqOCVv8BfaR1OUA5XsLBrwFW37JP-YAu6fo0Wh3FZ7dn6fo67u31-fvyfbT5dX52ZZ4oflMlBqpD4OgkgWmgwRHneYjGyXvpWXaBdeNlnHvmFOuC1p572n7FKuV48KKU_Tq6D2U_G2BOpt9rB52O5sgL9VwKngnJFN9Q1_-g97kpaS23UoxplQ3iEaxI-VLrrXAaA4l7m25NYyatRNz7MS0TszaiblrmRf35sXtIfxJ_C6hAfwI1HaVJih_n_6_9SdqO5fk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2031166493</pqid></control><display><type>article</type><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><source>MEDLINE</source><source>SpringerLink Journals - AutoHoldings</source><creator>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</creator><creatorcontrib>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</creatorcontrib><description>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</description><identifier>ISSN: 0001-5342</identifier><identifier>EISSN: 1572-8358</identifier><identifier>DOI: 10.1007/s10441-018-9325-z</identifier><identifier>PMID: 29700660</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Area Under Curve ; Artificial intelligence ; Chemical compounds ; Computational Biology - methods ; Computer applications ; Construction standards ; Disease ; Diseases ; Drug Repositioning ; Drug Therapy - methods ; Drugs ; Education ; Evolutionary Biology ; Gold ; Humans ; Learning algorithms ; Models, Statistical ; Pharmaceutical Preparations - chemistry ; Pharmacology - methods ; Pharmacy - methods ; Philosophy ; Philosophy of Biology ; Regular Article ; Reproducibility of Results ; Software ; Supervised Machine Learning</subject><ispartof>Acta biotheoretica, 2018-12, Vol.66 (4), p.315-331</ispartof><rights>Springer Science+Business Media B.V., part of Springer Nature 2018</rights><rights>Acta Biotheoretica is a copyright of Springer, (2018). All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</citedby><cites>FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</cites><orcidid>0000-0002-4951-5916</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10441-018-9325-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10441-018-9325-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29700660$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Le, Duc-Hau</creatorcontrib><creatorcontrib>Nguyen-Ngoc, Doanh</creatorcontrib><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><title>Acta biotheoretica</title><addtitle>Acta Biotheor</addtitle><addtitle>Acta Biotheor</addtitle><description>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</description><subject>Area Under Curve</subject><subject>Artificial intelligence</subject><subject>Chemical compounds</subject><subject>Computational Biology - methods</subject><subject>Computer applications</subject><subject>Construction standards</subject><subject>Disease</subject><subject>Diseases</subject><subject>Drug Repositioning</subject><subject>Drug Therapy - methods</subject><subject>Drugs</subject><subject>Education</subject><subject>Evolutionary Biology</subject><subject>Gold</subject><subject>Humans</subject><subject>Learning algorithms</subject><subject>Models, Statistical</subject><subject>Pharmaceutical Preparations - chemistry</subject><subject>Pharmacology - methods</subject><subject>Pharmacy - methods</subject><subject>Philosophy</subject><subject>Philosophy of Biology</subject><subject>Regular Article</subject><subject>Reproducibility of Results</subject><subject>Software</subject><subject>Supervised Machine Learning</subject><issn>0001-5342</issn><issn>1572-8358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1kU1rFTEUhoMo9lr9AW4k4MZNNB-TZGZZWq3FK4LWdcjHmSHl3uSazCjtrzfjrQqCq3DI8z455EXoOaOvGaX6TWW06xihrCeD4JLcPUAbJjUnvZD9Q7ShlDIiRcdP0JNab9o4KE0foxM-aEqVohu0XJRlwp_hkGucY04xTdjd4qs0w1TsvI4fUv6R8EWsYCuQS0iAbQp4DZJrWyaY8Vmt2Ue7CiqOCVv8BfaR1OUA5XsLBrwFW37JP-YAu6fo0Wh3FZ7dn6fo67u31-fvyfbT5dX52ZZ4oflMlBqpD4OgkgWmgwRHneYjGyXvpWXaBdeNlnHvmFOuC1p572n7FKuV48KKU_Tq6D2U_G2BOpt9rB52O5sgL9VwKngnJFN9Q1_-g97kpaS23UoxplQ3iEaxI-VLrrXAaA4l7m25NYyatRNz7MS0TszaiblrmRf35sXtIfxJ_C6hAfwI1HaVJih_n_6_9SdqO5fk</recordid><startdate>20181201</startdate><enddate>20181201</enddate><creator>Le, Duc-Hau</creator><creator>Nguyen-Ngoc, Doanh</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QP</scope><scope>7QR</scope><scope>7T5</scope><scope>7TK</scope><scope>7TM</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4951-5916</orcidid></search><sort><creationdate>20181201</creationdate><title>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</title><author>Le, Duc-Hau ; Nguyen-Ngoc, Doanh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c372t-66f0cd93051d17d5eb0b72f1f5285a17bdb4fa12cb1b6b4d76ccc0104a76b23a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Area Under Curve</topic><topic>Artificial intelligence</topic><topic>Chemical compounds</topic><topic>Computational Biology - methods</topic><topic>Computer applications</topic><topic>Construction standards</topic><topic>Disease</topic><topic>Diseases</topic><topic>Drug Repositioning</topic><topic>Drug Therapy - methods</topic><topic>Drugs</topic><topic>Education</topic><topic>Evolutionary Biology</topic><topic>Gold</topic><topic>Humans</topic><topic>Learning algorithms</topic><topic>Models, Statistical</topic><topic>Pharmaceutical Preparations - chemistry</topic><topic>Pharmacology - methods</topic><topic>Pharmacy - methods</topic><topic>Philosophy</topic><topic>Philosophy of Biology</topic><topic>Regular Article</topic><topic>Reproducibility of Results</topic><topic>Software</topic><topic>Supervised Machine Learning</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Le, Duc-Hau</creatorcontrib><creatorcontrib>Nguyen-Ngoc, Doanh</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Immunology Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><jtitle>Acta biotheoretica</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Le, Duc-Hau</au><au>Nguyen-Ngoc, Doanh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model</atitle><jtitle>Acta biotheoretica</jtitle><stitle>Acta Biotheor</stitle><addtitle>Acta Biotheor</addtitle><date>2018-12-01</date><risdate>2018</risdate><volume>66</volume><issue>4</issue><spage>315</spage><epage>331</epage><pages>315-331</pages><issn>0001-5342</issn><eissn>1572-8358</eissn><abstract>Computational drug repositioning has been proven as a promising and efficient strategy for discovering new uses from existing drugs. To achieve this goal, a number of computational methods have been proposed, which are based on different data sources of drugs and diseases. These methods approach the problem using either machine learning- or network-based models with an assumption that similar drugs can be used for similar diseases to identify new indications of drugs. Therefore, similarities between drugs and between diseases are usually used as inputs. In addition, known drug-disease associations are also needed for the methods as prior information. It should be noted that those associations are still not well established due to the fact that many of marketed drugs have been withdrawn and this could affect the outcome of the methods. In this study, we propose a novel method named RLSDR (Regularized Least Square for Drug Repositioning) to find new uses of drugs. More specifically, it relies on a semi-supervised learning model, Regularized Least Square, thus it does not require definition of non-drug-disease associations as previously proposed machine learning-based methods. In addition, the similarity between drugs measured by chemical structures of drug compounds and the similarity between diseases which share phenotypes can be represented in a form of either similarity network or similarity matrix as inputs of the method. Moreover, instead of using a gold-standard set of known drug-disease associations, we construct an artificial set of the associations based on known disease-gene and drug-target associations. Experiment results demonstrate that RLSDR achieves better prediction performance on the artificial set of drug-disease associations than that on the gold-standard ones in terms of area under the Receiver Operating Characteristic (ROC) curve (AUC). In addition, it outperforms two representative network-based methods irrespective of the prior information of drug-disease associations. Novel indications for a number of drugs are also identified and validated by evidences from a different data resource.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><pmid>29700660</pmid><doi>10.1007/s10441-018-9325-z</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-4951-5916</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0001-5342
ispartof	Acta biotheoretica, 2018-12, Vol.66 (4), p.315-331
issn	0001-5342 1572-8358
language	eng
recordid	cdi_proquest_miscellaneous_2032435168
source	MEDLINE; SpringerLink Journals - AutoHoldings
subjects	Area Under Curve Artificial intelligence Chemical compounds Computational Biology - methods Computer applications Construction standards Disease Diseases Drug Repositioning Drug Therapy - methods Drugs Education Evolutionary Biology Gold Humans Learning algorithms Models, Statistical Pharmaceutical Preparations - chemistry Pharmacology - methods Pharmacy - methods Philosophy Philosophy of Biology Regular Article Reproducibility of Results Software Supervised Machine Learning
title	Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T03%3A34%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Drug%20Repositioning%20by%20Integrating%20Known%20Disease-Gene%20and%20Drug-Target%20Associations%20in%20a%20Semi-supervised%20Learning%20Model&rft.jtitle=Acta%20biotheoretica&rft.au=Le,%20Duc-Hau&rft.date=2018-12-01&rft.volume=66&rft.issue=4&rft.spage=315&rft.epage=331&rft.pages=315-331&rft.issn=0001-5342&rft.eissn=1572-8358&rft_id=info:doi/10.1007/s10441-018-9325-z&rft_dat=%3Cproquest_cross%3E2032435168%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2031166493&rft_id=info:pmid/29700660&rfr_iscdi=true