Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity

During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxic...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2020-06, Vol.10 (1), p.9522, Article 9522
Hauptverfasser: Gardiner, Laura-Jayne, Carrieri, Anna Paola, Wilshaw, Jenny, Checkley, Stephen, Pyzer-Knapp, Edward O., Krishna, Ritesh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page 9522
container_title Scientific reports
container_volume 10
creator Gardiner, Laura-Jayne
Carrieri, Anna Paola
Wilshaw, Jenny
Checkley, Stephen
Pyzer-Knapp, Edward O.
Krishna, Ritesh
description During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.
doi_str_mv 10.1038/s41598-020-66481-0
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7293302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2412413714</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</originalsourceid><addsrcrecordid>eNp9kU1PAyEQhonRWFP9Ax4MiedVGNivi4kxfiUmXvRMKMu2NLtQga325F-XWj_qRSCBZN55ZpgXoWNKzihh1XngNK-rjADJioJXNCM76AAIzzNgALtb7xE6CmFO0sqh5rTeRyMGOWOE8AP0_hyMneLZ0EuLjcVLE73D0UsblDeL6HqNpZXdKpiAo8OTwXRNig8hvjofZyvcSzUzVuNOS2_XrN41ugu4dR4vvG6MisZZ7NrEMb3scOOHaUK9GWXi6hDttbIL-ujrHqPnm-unq7vs4fH2_uryIVO85DGb8BqozKGkLWGEN1BUalIUsiqgWm8FqikqpigjLS9pQankkjANLW3rvAE2Rhcb7mKY9LpR2qY_dmLhU0t-JZw04m_EmpmYuqUooU6jWgNOvwDevQw6RDF3g0-TCQI4TYeVlCcVbFTKuxC8bn8qUCLWvomNbyL5Jj59EyQlnWz39pPy7VISsI0gpJCdav9b-x_sBxxUpfI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2412413714</pqid></control><display><type>article</type><title>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</title><source>MEDLINE</source><source>Nature Free</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><source>Springer Nature OA Free Journals</source><creator>Gardiner, Laura-Jayne ; Carrieri, Anna Paola ; Wilshaw, Jenny ; Checkley, Stephen ; Pyzer-Knapp, Edward O. ; Krishna, Ritesh</creator><creatorcontrib>Gardiner, Laura-Jayne ; Carrieri, Anna Paola ; Wilshaw, Jenny ; Checkley, Stephen ; Pyzer-Knapp, Edward O. ; Krishna, Ritesh</creatorcontrib><description>During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-020-66481-0</identifier><identifier>PMID: 32533004</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/114/1305 ; 631/154 ; Animal models ; Animal research ; Animals ; Bayesian analysis ; Cell lines ; Clinical trials ; Drug development ; Drug-Related Side Effects and Adverse Reactions - genetics ; Gene expression ; Gene Expression Profiling ; Humanities and Social Sciences ; Humans ; Invasive animals ; Kidney diseases ; Learning algorithms ; Machine Learning ; Mathematical models ; Models, Theoretical ; multidisciplinary ; Quality Control ; Science ; Science (multidisciplinary) ; Toxicity ; Uncertainty</subject><ispartof>Scientific reports, 2020-06, Vol.10 (1), p.9522, Article 9522</ispartof><rights>The Author(s) 2020</rights><rights>The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</citedby><cites>FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293302/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293302/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,27901,27902,41096,42165,51551,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32533004$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Gardiner, Laura-Jayne</creatorcontrib><creatorcontrib>Carrieri, Anna Paola</creatorcontrib><creatorcontrib>Wilshaw, Jenny</creatorcontrib><creatorcontrib>Checkley, Stephen</creatorcontrib><creatorcontrib>Pyzer-Knapp, Edward O.</creatorcontrib><creatorcontrib>Krishna, Ritesh</creatorcontrib><title>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.</description><subject>631/114/1305</subject><subject>631/154</subject><subject>Animal models</subject><subject>Animal research</subject><subject>Animals</subject><subject>Bayesian analysis</subject><subject>Cell lines</subject><subject>Clinical trials</subject><subject>Drug development</subject><subject>Drug-Related Side Effects and Adverse Reactions - genetics</subject><subject>Gene expression</subject><subject>Gene Expression Profiling</subject><subject>Humanities and Social Sciences</subject><subject>Humans</subject><subject>Invasive animals</subject><subject>Kidney diseases</subject><subject>Learning algorithms</subject><subject>Machine Learning</subject><subject>Mathematical models</subject><subject>Models, Theoretical</subject><subject>multidisciplinary</subject><subject>Quality Control</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Toxicity</subject><subject>Uncertainty</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNp9kU1PAyEQhonRWFP9Ax4MiedVGNivi4kxfiUmXvRMKMu2NLtQga325F-XWj_qRSCBZN55ZpgXoWNKzihh1XngNK-rjADJioJXNCM76AAIzzNgALtb7xE6CmFO0sqh5rTeRyMGOWOE8AP0_hyMneLZ0EuLjcVLE73D0UsblDeL6HqNpZXdKpiAo8OTwXRNig8hvjofZyvcSzUzVuNOS2_XrN41ugu4dR4vvG6MisZZ7NrEMb3scOOHaUK9GWXi6hDttbIL-ujrHqPnm-unq7vs4fH2_uryIVO85DGb8BqozKGkLWGEN1BUalIUsiqgWm8FqikqpigjLS9pQankkjANLW3rvAE2Rhcb7mKY9LpR2qY_dmLhU0t-JZw04m_EmpmYuqUooU6jWgNOvwDevQw6RDF3g0-TCQI4TYeVlCcVbFTKuxC8bn8qUCLWvomNbyL5Jj59EyQlnWz39pPy7VISsI0gpJCdav9b-x_sBxxUpfI</recordid><startdate>20200612</startdate><enddate>20200612</enddate><creator>Gardiner, Laura-Jayne</creator><creator>Carrieri, Anna Paola</creator><creator>Wilshaw, Jenny</creator><creator>Checkley, Stephen</creator><creator>Pyzer-Knapp, Edward O.</creator><creator>Krishna, Ritesh</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>5PM</scope></search><sort><creationdate>20200612</creationdate><title>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</title><author>Gardiner, Laura-Jayne ; Carrieri, Anna Paola ; Wilshaw, Jenny ; Checkley, Stephen ; Pyzer-Knapp, Edward O. ; Krishna, Ritesh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-b4921a5271f0304d268cb66a86282828c2cd683c130f471611a4a03e2f1f95d23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>631/114/1305</topic><topic>631/154</topic><topic>Animal models</topic><topic>Animal research</topic><topic>Animals</topic><topic>Bayesian analysis</topic><topic>Cell lines</topic><topic>Clinical trials</topic><topic>Drug development</topic><topic>Drug-Related Side Effects and Adverse Reactions - genetics</topic><topic>Gene expression</topic><topic>Gene Expression Profiling</topic><topic>Humanities and Social Sciences</topic><topic>Humans</topic><topic>Invasive animals</topic><topic>Kidney diseases</topic><topic>Learning algorithms</topic><topic>Machine Learning</topic><topic>Mathematical models</topic><topic>Models, Theoretical</topic><topic>multidisciplinary</topic><topic>Quality Control</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Toxicity</topic><topic>Uncertainty</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gardiner, Laura-Jayne</creatorcontrib><creatorcontrib>Carrieri, Anna Paola</creatorcontrib><creatorcontrib>Wilshaw, Jenny</creatorcontrib><creatorcontrib>Checkley, Stephen</creatorcontrib><creatorcontrib>Pyzer-Knapp, Edward O.</creatorcontrib><creatorcontrib>Krishna, Ritesh</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gardiner, Laura-Jayne</au><au>Carrieri, Anna Paola</au><au>Wilshaw, Jenny</au><au>Checkley, Stephen</au><au>Pyzer-Knapp, Edward O.</au><au>Krishna, Ritesh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2020-06-12</date><risdate>2020</risdate><volume>10</volume><issue>1</issue><spage>9522</spage><pages>9522-</pages><artnum>9522</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>During the development of new drugs or compounds there is a requirement for preclinical trials, commonly involving animal tests, to ascertain the safety of the compound prior to human trials. Machine learning techniques could provide an in-silico alternative to animal models for assessing drug toxicity, thus reducing expensive and invasive animal testing during clinical trials, for drugs that are most likely to fail safety tests. Here we present a machine learning model to predict kidney dysfunction, as a proxy for drug induced renal toxicity, in rats. To achieve this, we use inexpensive transcriptomic profiles derived from human cell lines after chemical compound treatment to train our models combined with compound chemical structure information. Genomics data due to its sparse, high-dimensional and noisy nature presents significant challenges in building trustworthy and transparent machine learning models. Here we address these issues by judiciously building feature sets from heterogenous sources and coupling them with measures of model uncertainty achieved through Gaussian Process based Bayesian models. We combine the use of insight into the feature-wise contributions to our predictions with the use of predictive uncertainties recovered from the Gaussian Process to improve the transparency and trustworthiness of the model.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>32533004</pmid><doi>10.1038/s41598-020-66481-0</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2045-2322
ispartof Scientific reports, 2020-06, Vol.10 (1), p.9522, Article 9522
issn 2045-2322
2045-2322
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7293302
source MEDLINE; Nature Free; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry; Springer Nature OA Free Journals
subjects 631/114/1305
631/154
Animal models
Animal research
Animals
Bayesian analysis
Cell lines
Clinical trials
Drug development
Drug-Related Side Effects and Adverse Reactions - genetics
Gene expression
Gene Expression Profiling
Humanities and Social Sciences
Humans
Invasive animals
Kidney diseases
Learning algorithms
Machine Learning
Mathematical models
Models, Theoretical
multidisciplinary
Quality Control
Science
Science (multidisciplinary)
Toxicity
Uncertainty
title Using human in vitro transcriptome analysis to build trustworthy machine learning models for prediction of animal drug toxicity
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T10%3A29%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Using%20human%20in%20vitro%20transcriptome%20analysis%20to%20build%20trustworthy%20machine%20learning%20models%20for%20prediction%20of%20animal%20drug%20toxicity&rft.jtitle=Scientific%20reports&rft.au=Gardiner,%20Laura-Jayne&rft.date=2020-06-12&rft.volume=10&rft.issue=1&rft.spage=9522&rft.pages=9522-&rft.artnum=9522&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-020-66481-0&rft_dat=%3Cproquest_pubme%3E2412413714%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2412413714&rft_id=info:pmid/32533004&rfr_iscdi=true