Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology
There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one...
Gespeichert in:
Veröffentlicht in: | PLoS computational biology 2021-07, Vol.17 (7), p.e1009135 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 7 |
container_start_page | e1009135 |
container_title | PLoS computational biology |
container_volume | 17 |
creator | Green, Adrian J Mohlenkamp, Martin J Das, Jhuma Chaudhari, Meenal Truong, Lisa Tanguay, Robyn L Reif, David M |
description | There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing. |
doi_str_mv | 10.1371/journal.pcbi.1009135 |
format | Article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2561943696</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A670983119</galeid><doaj_id>oai_doaj_org_article_5275e5c7bd414dfaa9a5ebbc012a0e38</doaj_id><sourcerecordid>A670983119</sourcerecordid><originalsourceid>FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</originalsourceid><addsrcrecordid>eNqVkttu1DAQhiMEoqXwBggicYXUXew4jpMbpKrisNIKJA7X1qw9ybpk7WA7S_sYvDHeblrtSnCBfGFr5vt_j8eTZc8pmVMm6JsrN3oL_XxQKzOnhDSU8QfZKeWczQTj9cOD80n2JIQrQtKxqR5nJ6wsaElEfZr9XuIWPXTGdvnadOtZXHs3duthjHlQHtHuMhoinOcaccgtjh76tMVfzv8I5zlYnStntYnGpXryDm0yjGaLOejkHcCbA0Ee3S4OVmE-eNRG3aLRXRvletfdPM0etdAHfDbtZ9n39---XX6cLT9_WFxeLGeqqmicQQUFEQqbUjSq1Q3UyEWral0VDAqKQBRSXrRtWwhsOMWKMS64qpKcEQ3sLHu59x16F-TUzSALXtGmZFVTJWKxJ7SDKzl4swF_Ix0YeRtwvpPgo1E9Sl4IjlyJlS5pqVuABjiuVorQAgiyOnm9nW4bVxvUCm1MbTwyPc5Ys5ad28qaEVoRkQxeTQbe_RwxxH-UPFEdpKqMbV0yUxsTlLyoBGlqRmmTqPlfqLQ0btIvWGxNih8JXh8JEhPxOnYwhiAXX7_8B_vpmC33rPIuBI_tfUMokbspv3uk3E25nKY8yV4cNvNedDfW7A_Lrv0S</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2561943696</pqid></control><display><type>article</type><title>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Public Library of Science (PLoS) Journals Open Access</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Green, Adrian J ; Mohlenkamp, Martin J ; Das, Jhuma ; Chaudhari, Meenal ; Truong, Lisa ; Tanguay, Robyn L ; Reif, David M</creator><contributor>Hatzimanikatis, Vassily</contributor><creatorcontrib>Green, Adrian J ; Mohlenkamp, Martin J ; Das, Jhuma ; Chaudhari, Meenal ; Truong, Lisa ; Tanguay, Robyn L ; Reif, David M ; Hatzimanikatis, Vassily</creatorcontrib><description>There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.</description><identifier>ISSN: 1553-7358</identifier><identifier>ISSN: 1553-734X</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1009135</identifier><identifier>PMID: 34214078</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Algorithms ; Analysis ; Animals ; Artificial intelligence ; Artificial neural networks ; Biocompatibility ; Biology and Life Sciences ; Chemicals ; Computational Biology ; Computer and Information Sciences ; Computer applications ; Embryo, Nonmammalian - drug effects ; Embryos ; Environmental law ; Environmental protection ; Generative adversarial networks ; Generators ; High-throughput screening ; High-throughput screening (Biochemical assaying) ; High-Throughput Screening Assays ; In vivo methods and tests ; Machine learning ; Medicine and Health Sciences ; Methods ; Models, Chemical ; Mortality ; Multilayer perceptrons ; Neural networks ; Neural Networks, Computer ; Physical Sciences ; Principal components analysis ; Research and Analysis Methods ; Screening ; Support vector machines ; Toxic substances ; Toxicity ; Toxicity Tests ; Toxicology ; United States ; Zebrafish</subject><ispartof>PLoS computational biology, 2021-07, Vol.17 (7), p.e1009135</ispartof><rights>COPYRIGHT 2021 Public Library of Science</rights><rights>2021 Green et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2021 Green et al 2021 Green et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</citedby><cites>FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</cites><orcidid>0000-0003-3548-3105 ; 0000-0001-7815-6767 ; 0000-0003-1268-7754 ; 0000-0001-6190-3682 ; 0000-0001-9016-7549 ; 0000-0003-1751-4617 ; 0000-0001-9429-1838</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8301607/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8301607/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,2102,2928,23866,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34214078$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Hatzimanikatis, Vassily</contributor><creatorcontrib>Green, Adrian J</creatorcontrib><creatorcontrib>Mohlenkamp, Martin J</creatorcontrib><creatorcontrib>Das, Jhuma</creatorcontrib><creatorcontrib>Chaudhari, Meenal</creatorcontrib><creatorcontrib>Truong, Lisa</creatorcontrib><creatorcontrib>Tanguay, Robyn L</creatorcontrib><creatorcontrib>Reif, David M</creatorcontrib><title>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</title><title>PLoS computational biology</title><addtitle>PLoS Comput Biol</addtitle><description>There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.</description><subject>Algorithms</subject><subject>Analysis</subject><subject>Animals</subject><subject>Artificial intelligence</subject><subject>Artificial neural networks</subject><subject>Biocompatibility</subject><subject>Biology and Life Sciences</subject><subject>Chemicals</subject><subject>Computational Biology</subject><subject>Computer and Information Sciences</subject><subject>Computer applications</subject><subject>Embryo, Nonmammalian - drug effects</subject><subject>Embryos</subject><subject>Environmental law</subject><subject>Environmental protection</subject><subject>Generative adversarial networks</subject><subject>Generators</subject><subject>High-throughput screening</subject><subject>High-throughput screening (Biochemical assaying)</subject><subject>High-Throughput Screening Assays</subject><subject>In vivo methods and tests</subject><subject>Machine learning</subject><subject>Medicine and Health Sciences</subject><subject>Methods</subject><subject>Models, Chemical</subject><subject>Mortality</subject><subject>Multilayer perceptrons</subject><subject>Neural networks</subject><subject>Neural Networks, Computer</subject><subject>Physical Sciences</subject><subject>Principal components analysis</subject><subject>Research and Analysis Methods</subject><subject>Screening</subject><subject>Support vector machines</subject><subject>Toxic substances</subject><subject>Toxicity</subject><subject>Toxicity Tests</subject><subject>Toxicology</subject><subject>United States</subject><subject>Zebrafish</subject><issn>1553-7358</issn><issn>1553-734X</issn><issn>1553-7358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>DOA</sourceid><recordid>eNqVkttu1DAQhiMEoqXwBggicYXUXew4jpMbpKrisNIKJA7X1qw9ybpk7WA7S_sYvDHeblrtSnCBfGFr5vt_j8eTZc8pmVMm6JsrN3oL_XxQKzOnhDSU8QfZKeWczQTj9cOD80n2JIQrQtKxqR5nJ6wsaElEfZr9XuIWPXTGdvnadOtZXHs3duthjHlQHtHuMhoinOcaccgtjh76tMVfzv8I5zlYnStntYnGpXryDm0yjGaLOejkHcCbA0Ee3S4OVmE-eNRG3aLRXRvletfdPM0etdAHfDbtZ9n39---XX6cLT9_WFxeLGeqqmicQQUFEQqbUjSq1Q3UyEWral0VDAqKQBRSXrRtWwhsOMWKMS64qpKcEQ3sLHu59x16F-TUzSALXtGmZFVTJWKxJ7SDKzl4swF_Ix0YeRtwvpPgo1E9Sl4IjlyJlS5pqVuABjiuVorQAgiyOnm9nW4bVxvUCm1MbTwyPc5Ys5ad28qaEVoRkQxeTQbe_RwxxH-UPFEdpKqMbV0yUxsTlLyoBGlqRmmTqPlfqLQ0btIvWGxNih8JXh8JEhPxOnYwhiAXX7_8B_vpmC33rPIuBI_tfUMokbspv3uk3E25nKY8yV4cNvNedDfW7A_Lrv0S</recordid><startdate>20210701</startdate><enddate>20210701</enddate><creator>Green, Adrian J</creator><creator>Mohlenkamp, Martin J</creator><creator>Das, Jhuma</creator><creator>Chaudhari, Meenal</creator><creator>Truong, Lisa</creator><creator>Tanguay, Robyn L</creator><creator>Reif, David M</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISN</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7QP</scope><scope>7TK</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>LK8</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>RC3</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-3548-3105</orcidid><orcidid>https://orcid.org/0000-0001-7815-6767</orcidid><orcidid>https://orcid.org/0000-0003-1268-7754</orcidid><orcidid>https://orcid.org/0000-0001-6190-3682</orcidid><orcidid>https://orcid.org/0000-0001-9016-7549</orcidid><orcidid>https://orcid.org/0000-0003-1751-4617</orcidid><orcidid>https://orcid.org/0000-0001-9429-1838</orcidid></search><sort><creationdate>20210701</creationdate><title>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</title><author>Green, Adrian J ; Mohlenkamp, Martin J ; Das, Jhuma ; Chaudhari, Meenal ; Truong, Lisa ; Tanguay, Robyn L ; Reif, David M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c661t-a6a207ce9479cfd9a8e57fc8d623a21ea0ce152fff27e951e633575c6c6630da3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Analysis</topic><topic>Animals</topic><topic>Artificial intelligence</topic><topic>Artificial neural networks</topic><topic>Biocompatibility</topic><topic>Biology and Life Sciences</topic><topic>Chemicals</topic><topic>Computational Biology</topic><topic>Computer and Information Sciences</topic><topic>Computer applications</topic><topic>Embryo, Nonmammalian - drug effects</topic><topic>Embryos</topic><topic>Environmental law</topic><topic>Environmental protection</topic><topic>Generative adversarial networks</topic><topic>Generators</topic><topic>High-throughput screening</topic><topic>High-throughput screening (Biochemical assaying)</topic><topic>High-Throughput Screening Assays</topic><topic>In vivo methods and tests</topic><topic>Machine learning</topic><topic>Medicine and Health Sciences</topic><topic>Methods</topic><topic>Models, Chemical</topic><topic>Mortality</topic><topic>Multilayer perceptrons</topic><topic>Neural networks</topic><topic>Neural Networks, Computer</topic><topic>Physical Sciences</topic><topic>Principal components analysis</topic><topic>Research and Analysis Methods</topic><topic>Screening</topic><topic>Support vector machines</topic><topic>Toxic substances</topic><topic>Toxicity</topic><topic>Toxicity Tests</topic><topic>Toxicology</topic><topic>United States</topic><topic>Zebrafish</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Green, Adrian J</creatorcontrib><creatorcontrib>Mohlenkamp, Martin J</creatorcontrib><creatorcontrib>Das, Jhuma</creatorcontrib><creatorcontrib>Chaudhari, Meenal</creatorcontrib><creatorcontrib>Truong, Lisa</creatorcontrib><creatorcontrib>Tanguay, Robyn L</creatorcontrib><creatorcontrib>Reif, David M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Canada</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS computational biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Green, Adrian J</au><au>Mohlenkamp, Martin J</au><au>Das, Jhuma</au><au>Chaudhari, Meenal</au><au>Truong, Lisa</au><au>Tanguay, Robyn L</au><au>Reif, David M</au><au>Hatzimanikatis, Vassily</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology</atitle><jtitle>PLoS computational biology</jtitle><addtitle>PLoS Comput Biol</addtitle><date>2021-07-01</date><risdate>2021</risdate><volume>17</volume><issue>7</issue><spage>e1009135</spage><pages>e1009135-</pages><issn>1553-7358</issn><issn>1553-734X</issn><eissn>1553-7358</eissn><abstract>There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>34214078</pmid><doi>10.1371/journal.pcbi.1009135</doi><orcidid>https://orcid.org/0000-0003-3548-3105</orcidid><orcidid>https://orcid.org/0000-0001-7815-6767</orcidid><orcidid>https://orcid.org/0000-0003-1268-7754</orcidid><orcidid>https://orcid.org/0000-0001-6190-3682</orcidid><orcidid>https://orcid.org/0000-0001-9016-7549</orcidid><orcidid>https://orcid.org/0000-0003-1751-4617</orcidid><orcidid>https://orcid.org/0000-0001-9429-1838</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1553-7358 |
ispartof | PLoS computational biology, 2021-07, Vol.17 (7), p.e1009135 |
issn | 1553-7358 1553-734X 1553-7358 |
language | eng |
recordid | cdi_plos_journals_2561943696 |
source | MEDLINE; DOAJ Directory of Open Access Journals; Public Library of Science (PLoS) Journals Open Access; EZB-FREE-00999 freely available EZB journals; PubMed Central |
subjects | Algorithms Analysis Animals Artificial intelligence Artificial neural networks Biocompatibility Biology and Life Sciences Chemicals Computational Biology Computer and Information Sciences Computer applications Embryo, Nonmammalian - drug effects Embryos Environmental law Environmental protection Generative adversarial networks Generators High-throughput screening High-throughput screening (Biochemical assaying) High-Throughput Screening Assays In vivo methods and tests Machine learning Medicine and Health Sciences Methods Models, Chemical Mortality Multilayer perceptrons Neural networks Neural Networks, Computer Physical Sciences Principal components analysis Research and Analysis Methods Screening Support vector machines Toxic substances Toxicity Toxicity Tests Toxicology United States Zebrafish |
title | Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T13%3A22%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Leveraging%20high-throughput%20screening%20data,%20deep%20neural%20networks,%20and%20conditional%20generative%20adversarial%20networks%20to%20advance%20predictive%20toxicology&rft.jtitle=PLoS%20computational%20biology&rft.au=Green,%20Adrian%20J&rft.date=2021-07-01&rft.volume=17&rft.issue=7&rft.spage=e1009135&rft.pages=e1009135-&rft.issn=1553-7358&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1009135&rft_dat=%3Cgale_plos_%3EA670983119%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2561943696&rft_id=info:pmid/34214078&rft_galeid=A670983119&rft_doaj_id=oai_doaj_org_article_5275e5c7bd414dfaa9a5ebbc012a0e38&rfr_iscdi=true |