Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology

There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:PLoS computational biology 2021-07, Vol.17 (7), p.e1009135
Hauptverfasser: Green, Adrian J, Mohlenkamp, Martin J, Das, Jhuma, Chaudhari, Meenal, Truong, Lisa, Tanguay, Robyn L, Reif, David M
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 7
container_start_page e1009135
container_title PLoS computational biology
container_volume 17
creator Green, Adrian J
Mohlenkamp, Martin J
Das, Jhuma
Chaudhari, Meenal
Truong, Lisa
Tanguay, Robyn L
Reif, David M
description There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.
doi_str_mv 10.1371/journal.pcbi.1009135
format Article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2561943696</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A670983119</galeid><doaj_id>oai_doaj_org_article_5275e5c7bd414dfaa9a5ebbc012a0e38</doaj_id><sourcerecordid>A670983119</sourcerecordid><originalsourceid>FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</originalsourceid><addsrcrecordid>eNqVkttu1DAQhiMEoqXwBggicYXUXew4jpMbpKrisNIKJA7X1qw9ybpk7WA7S_sYvDHeblrtSnCBfGFr5vt_j8eTZc8pmVMm6JsrN3oL_XxQKzOnhDSU8QfZKeWczQTj9cOD80n2JIQrQtKxqR5nJ6wsaElEfZr9XuIWPXTGdvnadOtZXHs3duthjHlQHtHuMhoinOcaccgtjh76tMVfzv8I5zlYnStntYnGpXryDm0yjGaLOejkHcCbA0Ee3S4OVmE-eNRG3aLRXRvletfdPM0etdAHfDbtZ9n39---XX6cLT9_WFxeLGeqqmicQQUFEQqbUjSq1Q3UyEWral0VDAqKQBRSXrRtWwhsOMWKMS64qpKcEQ3sLHu59x16F-TUzSALXtGmZFVTJWKxJ7SDKzl4swF_Ix0YeRtwvpPgo1E9Sl4IjlyJlS5pqVuABjiuVorQAgiyOnm9nW4bVxvUCm1MbTwyPc5Ys5ad28qaEVoRkQxeTQbe_RwxxH-UPFEdpKqMbV0yUxsTlLyoBGlqRmmTqPlfqLQ0btIvWGxNih8JXh8JEhPxOnYwhiAXX7_8B_vpmC33rPIuBI_tfUMokbspv3uk3E25nKY8yV4cNvNedDfW7A_Lrv0S</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2561943696</pqid></control><display><type>article</type><title>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Public Library of Science (PLoS) Journals Open Access</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Green, Adrian J ; Mohlenkamp, Martin J ; Das, Jhuma ; Chaudhari, Meenal ; Truong, Lisa ; Tanguay, Robyn L ; Reif, David M</creator><contributor>Hatzimanikatis, Vassily</contributor><creatorcontrib>Green, Adrian J ; Mohlenkamp, Martin J ; Das, Jhuma ; Chaudhari, Meenal ; Truong, Lisa ; Tanguay, Robyn L ; Reif, David M ; Hatzimanikatis, Vassily</creatorcontrib><description>There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.</description><identifier>ISSN: 1553-7358</identifier><identifier>ISSN: 1553-734X</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1009135</identifier><identifier>PMID: 34214078</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Algorithms ; Analysis ; Animals ; Artificial intelligence ; Artificial neural networks ; Biocompatibility ; Biology and Life Sciences ; Chemicals ; Computational Biology ; Computer and Information Sciences ; Computer applications ; Embryo, Nonmammalian - drug effects ; Embryos ; Environmental law ; Environmental protection ; Generative adversarial networks ; Generators ; High-throughput screening ; High-throughput screening (Biochemical assaying) ; High-Throughput Screening Assays ; In vivo methods and tests ; Machine learning ; Medicine and Health Sciences ; Methods ; Models, Chemical ; Mortality ; Multilayer perceptrons ; Neural networks ; Neural Networks, Computer ; Physical Sciences ; Principal components analysis ; Research and Analysis Methods ; Screening ; Support vector machines ; Toxic substances ; Toxicity ; Toxicity Tests ; Toxicology ; United States ; Zebrafish</subject><ispartof>PLoS computational biology, 2021-07, Vol.17 (7), p.e1009135</ispartof><rights>COPYRIGHT 2021 Public Library of Science</rights><rights>2021 Green et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2021 Green et al 2021 Green et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</citedby><cites>FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</cites><orcidid>0000-0003-3548-3105 ; 0000-0001-7815-6767 ; 0000-0003-1268-7754 ; 0000-0001-6190-3682 ; 0000-0001-9016-7549 ; 0000-0003-1751-4617 ; 0000-0001-9429-1838</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8301607/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8301607/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,2102,2928,23866,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34214078$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Hatzimanikatis, Vassily</contributor><creatorcontrib>Green, Adrian J</creatorcontrib><creatorcontrib>Mohlenkamp, Martin J</creatorcontrib><creatorcontrib>Das, Jhuma</creatorcontrib><creatorcontrib>Chaudhari, Meenal</creatorcontrib><creatorcontrib>Truong, Lisa</creatorcontrib><creatorcontrib>Tanguay, Robyn L</creatorcontrib><creatorcontrib>Reif, David M</creatorcontrib><title>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</title><title>PLoS computational biology</title><addtitle>PLoS Comput Biol</addtitle><description>There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.</description><subject>Algorithms</subject><subject>Analysis</subject><subject>Animals</subject><subject>Artificial intelligence</subject><subject>Artificial neural networks</subject><subject>Biocompatibility</subject><subject>Biology and Life Sciences</subject><subject>Chemicals</subject><subject>Computational Biology</subject><subject>Computer and Information Sciences</subject><subject>Computer applications</subject><subject>Embryo, Nonmammalian - drug effects</subject><subject>Embryos</subject><subject>Environmental law</subject><subject>Environmental protection</subject><subject>Generative adversarial networks</subject><subject>Generators</subject><subject>High-throughput screening</subject><subject>High-throughput screening (Biochemical assaying)</subject><subject>High-Throughput Screening Assays</subject><subject>In vivo methods and tests</subject><subject>Machine learning</subject><subject>Medicine and Health Sciences</subject><subject>Methods</subject><subject>Models, Chemical</subject><subject>Mortality</subject><subject>Multilayer perceptrons</subject><subject>Neural networks</subject><subject>Neural Networks, Computer</subject><subject>Physical Sciences</subject><subject>Principal components analysis</subject><subject>Research and Analysis Methods</subject><subject>Screening</subject><subject>Support vector machines</subject><subject>Toxic substances</subject><subject>Toxicity</subject><subject>Toxicity Tests</subject><subject>Toxicology</subject><subject>United States</subject><subject>Zebrafish</subject><issn>1553-7358</issn><issn>1553-734X</issn><issn>1553-7358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>DOA</sourceid><recordid>eNqVkttu1DAQhiMEoqXwBggicYXUXew4jpMbpKrisNIKJA7X1qw9ybpk7WA7S_sYvDHeblrtSnCBfGFr5vt_j8eTZc8pmVMm6JsrN3oL_XxQKzOnhDSU8QfZKeWczQTj9cOD80n2JIQrQtKxqR5nJ6wsaElEfZr9XuIWPXTGdvnadOtZXHs3duthjHlQHtHuMhoinOcaccgtjh76tMVfzv8I5zlYnStntYnGpXryDm0yjGaLOejkHcCbA0Ee3S4OVmE-eNRG3aLRXRvletfdPM0etdAHfDbtZ9n39---XX6cLT9_WFxeLGeqqmicQQUFEQqbUjSq1Q3UyEWral0VDAqKQBRSXrRtWwhsOMWKMS64qpKcEQ3sLHu59x16F-TUzSALXtGmZFVTJWKxJ7SDKzl4swF_Ix0YeRtwvpPgo1E9Sl4IjlyJlS5pqVuABjiuVorQAgiyOnm9nW4bVxvUCm1MbTwyPc5Ys5ad28qaEVoRkQxeTQbe_RwxxH-UPFEdpKqMbV0yUxsTlLyoBGlqRmmTqPlfqLQ0btIvWGxNih8JXh8JEhPxOnYwhiAXX7_8B_vpmC33rPIuBI_tfUMokbspv3uk3E25nKY8yV4cNvNedDfW7A_Lrv0S</recordid><startdate>20210701</startdate><enddate>20210701</enddate><creator>Green, Adrian J</creator><creator>Mohlenkamp, Martin J</creator><creator>Das, Jhuma</creator><creator>Chaudhari, Meenal</creator><creator>Truong, Lisa</creator><creator>Tanguay, Robyn L</creator><creator>Reif, David M</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISN</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7QP</scope><scope>7TK</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>LK8</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>RC3</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-3548-3105</orcidid><orcidid>https://orcid.org/0000-0001-7815-6767</orcidid><orcidid>https://orcid.org/0000-0003-1268-7754</orcidid><orcidid>https://orcid.org/0000-0001-6190-3682</orcidid><orcidid>https://orcid.org/0000-0001-9016-7549</orcidid><orcidid>https://orcid.org/0000-0003-1751-4617</orcidid><orcidid>https://orcid.org/0000-0001-9429-1838</orcidid></search><sort><creationdate>20210701</creationdate><title>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</title><author>Green, Adrian J ; Mohlenkamp, Martin J ; Das, Jhuma ; Chaudhari, Meenal ; Truong, Lisa ; Tanguay, Robyn L ; Reif, David M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Analysis</topic><topic>Animals</topic><topic>Artificial intelligence</topic><topic>Artificial neural networks</topic><topic>Biocompatibility</topic><topic>Biology and Life Sciences</topic><topic>Chemicals</topic><topic>Computational Biology</topic><topic>Computer and Information Sciences</topic><topic>Computer applications</topic><topic>Embryo, Nonmammalian - drug effects</topic><topic>Embryos</topic><topic>Environmental law</topic><topic>Environmental protection</topic><topic>Generative adversarial networks</topic><topic>Generators</topic><topic>High-throughput screening</topic><topic>High-throughput screening (Biochemical assaying)</topic><topic>High-Throughput Screening Assays</topic><topic>In vivo methods and tests</topic><topic>Machine learning</topic><topic>Medicine and Health Sciences</topic><topic>Methods</topic><topic>Models, Chemical</topic><topic>Mortality</topic><topic>Multilayer perceptrons</topic><topic>Neural networks</topic><topic>Neural Networks, Computer</topic><topic>Physical Sciences</topic><topic>Principal components analysis</topic><topic>Research and Analysis Methods</topic><topic>Screening</topic><topic>Support vector machines</topic><topic>Toxic substances</topic><topic>Toxicity</topic><topic>Toxicity Tests</topic><topic>Toxicology</topic><topic>United States</topic><topic>Zebrafish</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Green, Adrian J</creatorcontrib><creatorcontrib>Mohlenkamp, Martin J</creatorcontrib><creatorcontrib>Das, Jhuma</creatorcontrib><creatorcontrib>Chaudhari, Meenal</creatorcontrib><creatorcontrib>Truong, Lisa</creatorcontrib><creatorcontrib>Tanguay, Robyn L</creatorcontrib><creatorcontrib>Reif, David M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Canada</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS computational biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Green, Adrian J</au><au>Mohlenkamp, Martin J</au><au>Das, Jhuma</au><au>Chaudhari, Meenal</au><au>Truong, Lisa</au><au>Tanguay, Robyn L</au><au>Reif, David M</au><au>Hatzimanikatis, Vassily</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</atitle><jtitle>PLoS computational biology</jtitle><addtitle>PLoS Comput Biol</addtitle><date>2021-07-01</date><risdate>2021</risdate><volume>17</volume><issue>7</issue><spage>e1009135</spage><pages>e1009135-</pages><issn>1553-7358</issn><issn>1553-734X</issn><eissn>1553-7358</eissn><abstract>There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>34214078</pmid><doi>10.1371/journal.pcbi.1009135</doi><orcidid>https://orcid.org/0000-0003-3548-3105</orcidid><orcidid>https://orcid.org/0000-0001-7815-6767</orcidid><orcidid>https://orcid.org/0000-0003-1268-7754</orcidid><orcidid>https://orcid.org/0000-0001-6190-3682</orcidid><orcidid>https://orcid.org/0000-0001-9016-7549</orcidid><orcidid>https://orcid.org/0000-0003-1751-4617</orcidid><orcidid>https://orcid.org/0000-0001-9429-1838</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1553-7358
ispartof PLoS computational biology, 2021-07, Vol.17 (7), p.e1009135
issn 1553-7358
1553-734X
1553-7358
language eng
recordid cdi_plos_journals_2561943696
source MEDLINE; DOAJ Directory of Open Access Journals; Public Library of Science (PLoS) Journals Open Access; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Algorithms
Analysis
Animals
Artificial intelligence
Artificial neural networks
Biocompatibility
Biology and Life Sciences
Chemicals
Computational Biology
Computer and Information Sciences
Computer applications
Embryo, Nonmammalian - drug effects
Embryos
Environmental law
Environmental protection
Generative adversarial networks
Generators
High-throughput screening
High-throughput screening (Biochemical assaying)
High-Throughput Screening Assays
In vivo methods and tests
Machine learning
Medicine and Health Sciences
Methods
Models, Chemical
Mortality
Multilayer perceptrons
Neural networks
Neural Networks, Computer
Physical Sciences
Principal components analysis
Research and Analysis Methods
Screening
Support vector machines
Toxic substances
Toxicity
Toxicity Tests
Toxicology
United States
Zebrafish
title Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T13%3A22%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Leveraging%20high-throughput%20screening%20data,%20deep%20neural%20networks,%20and%20conditional%20generative%20adversarial%20networks%20to%20advance%20predictive%20toxicology&rft.jtitle=PLoS%20computational%20biology&rft.au=Green,%20Adrian%20J&rft.date=2021-07-01&rft.volume=17&rft.issue=7&rft.spage=e1009135&rft.pages=e1009135-&rft.issn=1553-7358&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1009135&rft_dat=%3Cgale_plos_%3EA670983119%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2561943696&rft_id=info:pmid/34214078&rft_galeid=A670983119&rft_doaj_id=oai_doaj_org_article_5275e5c7bd414dfaa9a5ebbc012a0e38&rfr_iscdi=true