Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity

During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxic...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Scientific reports 2020-06, Vol.10 (1), p.9522, Article 9522
Hauptverfasser:	Gardiner, Laura-Jayne, Carrieri, Anna Paola, Wilshaw, Jenny, Checkley, Stephen, Pyzer-Knapp, Edward O., Krishna, Ritesh
Format:	Artikel
Sprache:	eng
Schlagworte:	631/114/1305 631/154 Animal models Animal research Animals Bayesian analysis Cell lines Clinical trials Drug development Drug-Related Side Effects and Adverse Reactions - genetics Gene expression Gene Expression Profiling Humanities and Social Sciences Humans Invasive animals Kidney diseases Learning algorithms Machine Learning Mathematical models Models, Theoretical multidisciplinary Quality Control Science Science (multidisciplinary) Toxicity Uncertainty
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	1
container_start_page	9522
container_title	Scientific reports
container_volume	10
creator	Gardiner, Laura-Jayne Carrieri, Anna Paola Wilshaw, Jenny Checkley, Stephen Pyzer-Knapp, Edward O. Krishna, Ritesh
description	During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.
doi_str_mv	10.1038/s41598-020-66481-0
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7293302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2412413714</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</originalsourceid><addsrcrecordid>eNp9kU1PAyEQhonRWFP9Ax4MiedVGNivi4kxfiUmXvRMKMu2NLtQga325F-XWj_qRSCBZN55ZpgXoWNKzihh1XngNK-rjADJioJXNCM76AAIzzNgALtb7xE6CmFO0sqh5rTeRyMGOWOE8AP0_hyMneLZ0EuLjcVLE73D0UsblDeL6HqNpZXdKpiAo8OTwXRNig8hvjofZyvcSzUzVuNOS2_XrN41ugu4dR4vvG6MisZZ7NrEMb3scOOHaUK9GWXi6hDttbIL-ujrHqPnm-unq7vs4fH2_uryIVO85DGb8BqozKGkLWGEN1BUalIUsiqgWm8FqikqpigjLS9pQankkjANLW3rvAE2Rhcb7mKY9LpR2qY_dmLhU0t-JZw04m_EmpmYuqUooU6jWgNOvwDevQw6RDF3g0-TCQI4TYeVlCcVbFTKuxC8bn8qUCLWvomNbyL5Jj59EyQlnWz39pPy7VISsI0gpJCdav9b-x_sBxxUpfI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2412413714</pqid></control><display><type>article</type><title>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</title><source>MEDLINE</source><source>Nature Free</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><source>Springer Nature OA Free Journals</source><creator>Gardiner, Laura-Jayne ; Carrieri, Anna Paola ; Wilshaw, Jenny ; Checkley, Stephen ; Pyzer-Knapp, Edward O. ; Krishna, Ritesh</creator><creatorcontrib>Gardiner, Laura-Jayne ; Carrieri, Anna Paola ; Wilshaw, Jenny ; Checkley, Stephen ; Pyzer-Knapp, Edward O. ; Krishna, Ritesh</creatorcontrib><description>During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-020-66481-0</identifier><identifier>PMID: 32533004</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/114/1305 ; 631/154 ; Animal models ; Animal research ; Animals ; Bayesian analysis ; Cell lines ; Clinical trials ; Drug development ; Drug-Related Side Effects and Adverse Reactions - genetics ; Gene expression ; Gene Expression Profiling ; Humanities and Social Sciences ; Humans ; Invasive animals ; Kidney diseases ; Learning algorithms ; Machine Learning ; Mathematical models ; Models, Theoretical ; multidisciplinary ; Quality Control ; Science ; Science (multidisciplinary) ; Toxicity ; Uncertainty</subject><ispartof>Scientific reports, 2020-06, Vol.10 (1), p.9522, Article 9522</ispartof><rights>The Author(s) 2020</rights><rights>The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</citedby><cites>FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293302/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293302/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,27901,27902,41096,42165,51551,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32533004$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Gardiner, Laura-Jayne</creatorcontrib><creatorcontrib>Carrieri, Anna Paola</creatorcontrib><creatorcontrib>Wilshaw, Jenny</creatorcontrib><creatorcontrib>Checkley, Stephen</creatorcontrib><creatorcontrib>Pyzer-Knapp, Edward O.</creatorcontrib><creatorcontrib>Krishna, Ritesh</creatorcontrib><title>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.</description><subject>631/114/1305</subject><subject>631/154</subject><subject>Animal models</subject><subject>Animal research</subject><subject>Animals</subject><subject>Bayesian analysis</subject><subject>Cell lines</subject><subject>Clinical trials</subject><subject>Drug development</subject><subject>Drug-Related Side Effects and Adverse Reactions - genetics</subject><subject>Gene expression</subject><subject>Gene Expression Profiling</subject><subject>Humanities and Social Sciences</subject><subject>Humans</subject><subject>Invasive animals</subject><subject>Kidney diseases</subject><subject>Learning algorithms</subject><subject>Machine Learning</subject><subject>Mathematical models</subject><subject>Models, Theoretical</subject><subject>multidisciplinary</subject><subject>Quality Control</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Toxicity</subject><subject>Uncertainty</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNp9kU1PAyEQhonRWFP9Ax4MiedVGNivi4kxfiUmXvRMKMu2NLtQga325F-XWj_qRSCBZN55ZpgXoWNKzihh1XngNK-rjADJioJXNCM76AAIzzNgALtb7xE6CmFO0sqh5rTeRyMGOWOE8AP0_hyMneLZ0EuLjcVLE73D0UsblDeL6HqNpZXdKpiAo8OTwXRNig8hvjofZyvcSzUzVuNOS2_XrN41ugu4dR4vvG6MisZZ7NrEMb3scOOHaUK9GWXi6hDttbIL-ujrHqPnm-unq7vs4fH2_uryIVO85DGb8BqozKGkLWGEN1BUalIUsiqgWm8FqikqpigjLS9pQankkjANLW3rvAE2Rhcb7mKY9LpR2qY_dmLhU0t-JZw04m_EmpmYuqUooU6jWgNOvwDevQw6RDF3g0-TCQI4TYeVlCcVbFTKuxC8bn8qUCLWvomNbyL5Jj59EyQlnWz39pPy7VISsI0gpJCdav9b-x_sBxxUpfI</recordid><startdate>20200612</startdate><enddate>20200612</enddate><creator>Gardiner, Laura-Jayne</creator><creator>Carrieri, Anna Paola</creator><creator>Wilshaw, Jenny</creator><creator>Checkley, Stephen</creator><creator>Pyzer-Knapp, Edward O.</creator><creator>Krishna, Ritesh</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>5PM</scope></search><sort><creationdate>20200612</creationdate><title>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</title><author>Gardiner, Laura-Jayne ; Carrieri, Anna Paola ; Wilshaw, Jenny ; Checkley, Stephen ; Pyzer-Knapp, Edward O. ; Krishna, Ritesh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>631/114/1305</topic><topic>631/154</topic><topic>Animal models</topic><topic>Animal research</topic><topic>Animals</topic><topic>Bayesian analysis</topic><topic>Cell lines</topic><topic>Clinical trials</topic><topic>Drug development</topic><topic>Drug-Related Side Effects and Adverse Reactions - genetics</topic><topic>Gene expression</topic><topic>Gene Expression Profiling</topic><topic>Humanities and Social Sciences</topic><topic>Humans</topic><topic>Invasive animals</topic><topic>Kidney diseases</topic><topic>Learning algorithms</topic><topic>Machine Learning</topic><topic>Mathematical models</topic><topic>Models, Theoretical</topic><topic>multidisciplinary</topic><topic>Quality Control</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Toxicity</topic><topic>Uncertainty</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gardiner, Laura-Jayne</creatorcontrib><creatorcontrib>Carrieri, Anna Paola</creatorcontrib><creatorcontrib>Wilshaw, Jenny</creatorcontrib><creatorcontrib>Checkley, Stephen</creatorcontrib><creatorcontrib>Pyzer-Knapp, Edward O.</creatorcontrib><creatorcontrib>Krishna, Ritesh</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gardiner, Laura-Jayne</au><au>Carrieri, Anna Paola</au><au>Wilshaw, Jenny</au><au>Checkley, Stephen</au><au>Pyzer-Knapp, Edward O.</au><au>Krishna, Ritesh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2020-06-12</date><risdate>2020</risdate><volume>10</volume><issue>1</issue><spage>9522</spage><pages>9522-</pages><artnum>9522</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>32533004</pmid><doi>10.1038/s41598-020-66481-0</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2045-2322
ispartof	Scientific reports, 2020-06, Vol.10 (1), p.9522, Article 9522
issn	2045-2322 2045-2322
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7293302
source	MEDLINE; Nature Free; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry; Springer Nature OA Free Journals
subjects	631/114/1305 631/154 Animal models Animal research Animals Bayesian analysis Cell lines Clinical trials Drug development Drug-Related Side Effects and Adverse Reactions - genetics Gene expression Gene Expression Profiling Humanities and Social Sciences Humans Invasive animals Kidney diseases Learning algorithms Machine Learning Mathematical models Models, Theoretical multidisciplinary Quality Control Science Science (multidisciplinary) Toxicity Uncertainty
title	Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T10%3A29%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Using%20human%20in%20vitro%20transcriptome%20analysis%20to%20build%20trustworthy%20machine%20learning%20models%20for%20prediction%20of%20animal%20drug%20toxicity&rft.jtitle=Scientific%20reports&rft.au=Gardiner,%20Laura-Jayne&rft.date=2020-06-12&rft.volume=10&rft.issue=1&rft.spage=9522&rft.pages=9522-&rft.artnum=9522&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-020-66481-0&rft_dat=%3Cproquest_pubme%3E2412413714%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2412413714&rft_id=info:pmid/32533004&rfr_iscdi=true