In silico Genotoxicity Prediction by Similarity Search and Machine Learning Algorithm: Optimization and Validation of the Method for High Energetic Materials

The European regulation REACh (Registration, Evaluation, Authorization, and restriction of Chemicals) has placed responsibility on the industry to manage the risk from chemicals since 2006. In order to ensure a high level of protection of human health and environment, toxicity prediction methods are...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Propellants, explosives, pyrotechnics explosives, pyrotechnics, 2023-03, Vol.48 (4), p.n/a
Hauptverfasser: Fournier, Mailys, Vroland, Christophe, Megy, Simon, Aguero, Stéphanie, Chemelle, Julie‐Anne, Defoort, Brigitte, Jacob, Guy, Terreux, Raphaël
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 4
container_start_page
container_title Propellants, explosives, pyrotechnics
container_volume 48
creator Fournier, Mailys
Vroland, Christophe
Megy, Simon
Aguero, Stéphanie
Chemelle, Julie‐Anne
Defoort, Brigitte
Jacob, Guy
Terreux, Raphaël
description The European regulation REACh (Registration, Evaluation, Authorization, and restriction of Chemicals) has placed responsibility on the industry to manage the risk from chemicals since 2006. In order to ensure a high level of protection of human health and environment, toxicity prediction methods are now a widely used tool for regulatory decision making and selection of leads in new substances design. These in silico methods are an alternative to traditional in vitro and in vivo testing methods, which are laborious, time‐consuming, highly expensive, and even involve animal welfare issues. Many computational methods have been employed to predict the toxicity profile of substances, but they are mostly adapted to pharmaceutical molecules and not to High Energetic Materials (HEMs). In line with these restrictions, ArianeGroup set up a collaborative project with the French CNRS to develop optimized tools for the prediction of HEM properties, such as genotoxicity. Several in silico methods can be used to predict the properties of molecules, such as QSAR, Local QSAR or Machine Learning. We already demonstrated that using Local QSAR allows for better predictions with a good reliability [1]. We therefore developed a genotoxicity prediction tool based on the structural similarity search coupled with a supervised machine learning algorithm. This tool is composed of 3 predictive models: the Ames test, the Chromosomal Aberration test and the Mouse Lymphoma Assay. The aim of this paper is to evaluate the performance of these models to predict the genotoxicity of HEMs. We also present the methodology we applied to build these models and to optimize their performances. The dimensional reduction of the training set and the hyperparameters tuning of the different algorithms showed a performance acceleration and a significant reduction of the overfitting, which caused a decline in the generalization capacity of the predictive models. The performance of the predictive models was evaluated on a test set of HEMs and compared to the results of other prediction softwares.
doi_str_mv 10.1002/prep.202200259
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2795908448</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2795908448</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3179-84e1f09785301c5989e5f116166df3abf8dc4f8a7a325d3da04528b62f12ad2f3</originalsourceid><addsrcrecordid>eNqFkU9PwjAYhxujiYhePTfxDLbdyjpvxCCQQCCiXpfSP6xktLMrUfwuflc7MXr09PZtn-fXww-Aa4z6GCFyW3tV9wkiJC40PwEdTAnupYhlp6CDsnhOMKbn4KJptghFBeEO-Jxa2JjKCAfHyrrg3o0w4QCXXkkjgnEWrg9wZXam4r59WCnuRQm5lXDORWmsgrN4ZY3dwGG1cREqd3dwUYfofPDvhBZ-4ZWRx9VpGEoF5yqUTkLtPJyYTQlHVvmNCkbE4KC84VVzCc50HOrqZ3bB88Po6X7Smy3G0_vhrCcSnOU9liqsUZ4xmiAsaM5yRTXGAzwYSJ3wtWZSpJrxjCeEykRylFLC1gOiMeGS6KQLbo65tXeve9WEYuv23sYvC5LlNEcsTVmk-kdKeNc0Xumi9mbH_aHAqGgrKNoKit8KopAfhTdTqcM_dLF8HC3_3C_nlo0h</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2795908448</pqid></control><display><type>article</type><title>In silico Genotoxicity Prediction by Similarity Search and Machine Learning Algorithm: Optimization and Validation of the Method for High Energetic Materials</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Fournier, Mailys ; Vroland, Christophe ; Megy, Simon ; Aguero, Stéphanie ; Chemelle, Julie‐Anne ; Defoort, Brigitte ; Jacob, Guy ; Terreux, Raphaël</creator><creatorcontrib>Fournier, Mailys ; Vroland, Christophe ; Megy, Simon ; Aguero, Stéphanie ; Chemelle, Julie‐Anne ; Defoort, Brigitte ; Jacob, Guy ; Terreux, Raphaël</creatorcontrib><description>The European regulation REACh (Registration, Evaluation, Authorization, and restriction of Chemicals) has placed responsibility on the industry to manage the risk from chemicals since 2006. In order to ensure a high level of protection of human health and environment, toxicity prediction methods are now a widely used tool for regulatory decision making and selection of leads in new substances design. These in silico methods are an alternative to traditional in vitro and in vivo testing methods, which are laborious, time‐consuming, highly expensive, and even involve animal welfare issues. Many computational methods have been employed to predict the toxicity profile of substances, but they are mostly adapted to pharmaceutical molecules and not to High Energetic Materials (HEMs). In line with these restrictions, ArianeGroup set up a collaborative project with the French CNRS to develop optimized tools for the prediction of HEM properties, such as genotoxicity. Several in silico methods can be used to predict the properties of molecules, such as QSAR, Local QSAR or Machine Learning. We already demonstrated that using Local QSAR allows for better predictions with a good reliability [1]. We therefore developed a genotoxicity prediction tool based on the structural similarity search coupled with a supervised machine learning algorithm. This tool is composed of 3 predictive models: the Ames test, the Chromosomal Aberration test and the Mouse Lymphoma Assay. The aim of this paper is to evaluate the performance of these models to predict the genotoxicity of HEMs. We also present the methodology we applied to build these models and to optimize their performances. The dimensional reduction of the training set and the hyperparameters tuning of the different algorithms showed a performance acceleration and a significant reduction of the overfitting, which caused a decline in the generalization capacity of the predictive models. The performance of the predictive models was evaluated on a test set of HEMs and compared to the results of other prediction softwares.</description><identifier>ISSN: 0721-3115</identifier><identifier>EISSN: 1521-4087</identifier><identifier>DOI: 10.1002/prep.202200259</identifier><language>eng</language><publisher>Weinheim: Wiley Subscription Services, Inc</publisher><subject>Algorithms ; Ames test ; Animal welfare ; Biocompatibility ; Biomedical materials ; Decision making ; Energetic materials ; Genotoxicity ; High Energetic Materials ; In vivo methods and tests ; Machine Learning ; Optimization ; Performance evaluation ; Performance prediction ; prediction ; Prediction models ; Reduction ; Risk management ; Similarity ; Supervised learning ; Toxicity</subject><ispartof>Propellants, explosives, pyrotechnics, 2023-03, Vol.48 (4), p.n/a</ispartof><rights>2023 Wiley‐VCH GmbH</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3179-84e1f09785301c5989e5f116166df3abf8dc4f8a7a325d3da04528b62f12ad2f3</citedby><cites>FETCH-LOGICAL-c3179-84e1f09785301c5989e5f116166df3abf8dc4f8a7a325d3da04528b62f12ad2f3</cites><orcidid>0000-0001-7003-8571</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fprep.202200259$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fprep.202200259$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,776,780,1411,27901,27902,45550,45551</link.rule.ids></links><search><creatorcontrib>Fournier, Mailys</creatorcontrib><creatorcontrib>Vroland, Christophe</creatorcontrib><creatorcontrib>Megy, Simon</creatorcontrib><creatorcontrib>Aguero, Stéphanie</creatorcontrib><creatorcontrib>Chemelle, Julie‐Anne</creatorcontrib><creatorcontrib>Defoort, Brigitte</creatorcontrib><creatorcontrib>Jacob, Guy</creatorcontrib><creatorcontrib>Terreux, Raphaël</creatorcontrib><title>In silico Genotoxicity Prediction by Similarity Search and Machine Learning Algorithm: Optimization and Validation of the Method for High Energetic Materials</title><title>Propellants, explosives, pyrotechnics</title><description>The European regulation REACh (Registration, Evaluation, Authorization, and restriction of Chemicals) has placed responsibility on the industry to manage the risk from chemicals since 2006. In order to ensure a high level of protection of human health and environment, toxicity prediction methods are now a widely used tool for regulatory decision making and selection of leads in new substances design. These in silico methods are an alternative to traditional in vitro and in vivo testing methods, which are laborious, time‐consuming, highly expensive, and even involve animal welfare issues. Many computational methods have been employed to predict the toxicity profile of substances, but they are mostly adapted to pharmaceutical molecules and not to High Energetic Materials (HEMs). In line with these restrictions, ArianeGroup set up a collaborative project with the French CNRS to develop optimized tools for the prediction of HEM properties, such as genotoxicity. Several in silico methods can be used to predict the properties of molecules, such as QSAR, Local QSAR or Machine Learning. We already demonstrated that using Local QSAR allows for better predictions with a good reliability [1]. We therefore developed a genotoxicity prediction tool based on the structural similarity search coupled with a supervised machine learning algorithm. This tool is composed of 3 predictive models: the Ames test, the Chromosomal Aberration test and the Mouse Lymphoma Assay. The aim of this paper is to evaluate the performance of these models to predict the genotoxicity of HEMs. We also present the methodology we applied to build these models and to optimize their performances. The dimensional reduction of the training set and the hyperparameters tuning of the different algorithms showed a performance acceleration and a significant reduction of the overfitting, which caused a decline in the generalization capacity of the predictive models. The performance of the predictive models was evaluated on a test set of HEMs and compared to the results of other prediction softwares.</description><subject>Algorithms</subject><subject>Ames test</subject><subject>Animal welfare</subject><subject>Biocompatibility</subject><subject>Biomedical materials</subject><subject>Decision making</subject><subject>Energetic materials</subject><subject>Genotoxicity</subject><subject>High Energetic Materials</subject><subject>In vivo methods and tests</subject><subject>Machine Learning</subject><subject>Optimization</subject><subject>Performance evaluation</subject><subject>Performance prediction</subject><subject>prediction</subject><subject>Prediction models</subject><subject>Reduction</subject><subject>Risk management</subject><subject>Similarity</subject><subject>Supervised learning</subject><subject>Toxicity</subject><issn>0721-3115</issn><issn>1521-4087</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNqFkU9PwjAYhxujiYhePTfxDLbdyjpvxCCQQCCiXpfSP6xktLMrUfwuflc7MXr09PZtn-fXww-Aa4z6GCFyW3tV9wkiJC40PwEdTAnupYhlp6CDsnhOMKbn4KJptghFBeEO-Jxa2JjKCAfHyrrg3o0w4QCXXkkjgnEWrg9wZXam4r59WCnuRQm5lXDORWmsgrN4ZY3dwGG1cREqd3dwUYfofPDvhBZ-4ZWRx9VpGEoF5yqUTkLtPJyYTQlHVvmNCkbE4KC84VVzCc50HOrqZ3bB88Po6X7Smy3G0_vhrCcSnOU9liqsUZ4xmiAsaM5yRTXGAzwYSJ3wtWZSpJrxjCeEykRylFLC1gOiMeGS6KQLbo65tXeve9WEYuv23sYvC5LlNEcsTVmk-kdKeNc0Xumi9mbH_aHAqGgrKNoKit8KopAfhTdTqcM_dLF8HC3_3C_nlo0h</recordid><startdate>202303</startdate><enddate>202303</enddate><creator>Fournier, Mailys</creator><creator>Vroland, Christophe</creator><creator>Megy, Simon</creator><creator>Aguero, Stéphanie</creator><creator>Chemelle, Julie‐Anne</creator><creator>Defoort, Brigitte</creator><creator>Jacob, Guy</creator><creator>Terreux, Raphaël</creator><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>H8D</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0001-7003-8571</orcidid></search><sort><creationdate>202303</creationdate><title>In silico Genotoxicity Prediction by Similarity Search and Machine Learning Algorithm: Optimization and Validation of the Method for High Energetic Materials</title><author>Fournier, Mailys ; Vroland, Christophe ; Megy, Simon ; Aguero, Stéphanie ; Chemelle, Julie‐Anne ; Defoort, Brigitte ; Jacob, Guy ; Terreux, Raphaël</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3179-84e1f09785301c5989e5f116166df3abf8dc4f8a7a325d3da04528b62f12ad2f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Ames test</topic><topic>Animal welfare</topic><topic>Biocompatibility</topic><topic>Biomedical materials</topic><topic>Decision making</topic><topic>Energetic materials</topic><topic>Genotoxicity</topic><topic>High Energetic Materials</topic><topic>In vivo methods and tests</topic><topic>Machine Learning</topic><topic>Optimization</topic><topic>Performance evaluation</topic><topic>Performance prediction</topic><topic>prediction</topic><topic>Prediction models</topic><topic>Reduction</topic><topic>Risk management</topic><topic>Similarity</topic><topic>Supervised learning</topic><topic>Toxicity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fournier, Mailys</creatorcontrib><creatorcontrib>Vroland, Christophe</creatorcontrib><creatorcontrib>Megy, Simon</creatorcontrib><creatorcontrib>Aguero, Stéphanie</creatorcontrib><creatorcontrib>Chemelle, Julie‐Anne</creatorcontrib><creatorcontrib>Defoort, Brigitte</creatorcontrib><creatorcontrib>Jacob, Guy</creatorcontrib><creatorcontrib>Terreux, Raphaël</creatorcontrib><collection>CrossRef</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>Propellants, explosives, pyrotechnics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fournier, Mailys</au><au>Vroland, Christophe</au><au>Megy, Simon</au><au>Aguero, Stéphanie</au><au>Chemelle, Julie‐Anne</au><au>Defoort, Brigitte</au><au>Jacob, Guy</au><au>Terreux, Raphaël</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>In silico Genotoxicity Prediction by Similarity Search and Machine Learning Algorithm: Optimization and Validation of the Method for High Energetic Materials</atitle><jtitle>Propellants, explosives, pyrotechnics</jtitle><date>2023-03</date><risdate>2023</risdate><volume>48</volume><issue>4</issue><epage>n/a</epage><issn>0721-3115</issn><eissn>1521-4087</eissn><abstract>The European regulation REACh (Registration, Evaluation, Authorization, and restriction of Chemicals) has placed responsibility on the industry to manage the risk from chemicals since 2006. In order to ensure a high level of protection of human health and environment, toxicity prediction methods are now a widely used tool for regulatory decision making and selection of leads in new substances design. These in silico methods are an alternative to traditional in vitro and in vivo testing methods, which are laborious, time‐consuming, highly expensive, and even involve animal welfare issues. Many computational methods have been employed to predict the toxicity profile of substances, but they are mostly adapted to pharmaceutical molecules and not to High Energetic Materials (HEMs). In line with these restrictions, ArianeGroup set up a collaborative project with the French CNRS to develop optimized tools for the prediction of HEM properties, such as genotoxicity. Several in silico methods can be used to predict the properties of molecules, such as QSAR, Local QSAR or Machine Learning. We already demonstrated that using Local QSAR allows for better predictions with a good reliability [1]. We therefore developed a genotoxicity prediction tool based on the structural similarity search coupled with a supervised machine learning algorithm. This tool is composed of 3 predictive models: the Ames test, the Chromosomal Aberration test and the Mouse Lymphoma Assay. The aim of this paper is to evaluate the performance of these models to predict the genotoxicity of HEMs. We also present the methodology we applied to build these models and to optimize their performances. The dimensional reduction of the training set and the hyperparameters tuning of the different algorithms showed a performance acceleration and a significant reduction of the overfitting, which caused a decline in the generalization capacity of the predictive models. The performance of the predictive models was evaluated on a test set of HEMs and compared to the results of other prediction softwares.</abstract><cop>Weinheim</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1002/prep.202200259</doi><tpages>18</tpages><orcidid>https://orcid.org/0000-0001-7003-8571</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0721-3115
ispartof Propellants, explosives, pyrotechnics, 2023-03, Vol.48 (4), p.n/a
issn 0721-3115
1521-4087
language eng
recordid cdi_proquest_journals_2795908448
source Wiley Online Library Journals Frontfile Complete
subjects Algorithms
Ames test
Animal welfare
Biocompatibility
Biomedical materials
Decision making
Energetic materials
Genotoxicity
High Energetic Materials
In vivo methods and tests
Machine Learning
Optimization
Performance evaluation
Performance prediction
prediction
Prediction models
Reduction
Risk management
Similarity
Supervised learning
Toxicity
title In silico Genotoxicity Prediction by Similarity Search and Machine Learning Algorithm: Optimization and Validation of the Method for High Energetic Materials
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T02%3A37%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=In%20silico%20Genotoxicity%20Prediction%20by%20Similarity%20Search%20and%20Machine%20Learning%20Algorithm:%20Optimization%20and%20Validation%20of%20the%20Method%20for%20High%20Energetic%20Materials&rft.jtitle=Propellants,%20explosives,%20pyrotechnics&rft.au=Fournier,%20Mailys&rft.date=2023-03&rft.volume=48&rft.issue=4&rft.epage=n/a&rft.issn=0721-3115&rft.eissn=1521-4087&rft_id=info:doi/10.1002/prep.202200259&rft_dat=%3Cproquest_cross%3E2795908448%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2795908448&rft_id=info:pmid/&rfr_iscdi=true