Molecular Screening and Toxicity Estimation of 260,000 Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) through Machine Learning

Perfluoroalkyl and polyfluoroalkyl substances (PFASs) are a class of chemicals widely used in industrial applications due to their exceptional properties and stability. However, they do not readily degrade in the environment and are linked to contamination and adverse health effects in humans and wi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of chemical information and modeling 2022-10, Vol.62 (19), p.4569-4578
Hauptverfasser: Lai, Thanh T., Kuntz, David, Wilson, Angela K.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 4578
container_issue 19
container_start_page 4569
container_title Journal of chemical information and modeling
container_volume 62
creator Lai, Thanh T.
Kuntz, David
Wilson, Angela K.
description Perfluoroalkyl and polyfluoroalkyl substances (PFASs) are a class of chemicals widely used in industrial applications due to their exceptional properties and stability. However, they do not readily degrade in the environment and are linked to contamination and adverse health effects in humans and wildlife. To find alternatives for the most commonly used PFAS molecules that maintain their desirable chemical properties but are not adverse to biological lifeforms, a novel approach based upon machine learning is utilized. The machine learning model is trained on an existing set of PFAS molecules to generate over 260,000 novel PFAS molecules, which we dub PFAS-AI-Gen. Using molecular descriptors with known relationships to toxicity and industrial suitability followed by molecular docking and molecular dynamics simulations, this set of molecules is screened. In this manner, increasingly complex calculations are performed only for candidate molecules that are most likely to yield the desired properties of low binding affinity toward two selected protein receptors, the human pregnane x receptor (hPXR) and peroxisome proliferator-activated receptor γ (PPAR-γ), and high industrial suitability, defined by critical micelle concentration (CMC). The selection criteria of low binding affinity and high industrial suitability are relative to the popular PFAS alternative GenX. hPXR and PPAR-γ are selected as they are PFAS targets and facilitate a variety of functions, such as drug metabolism and glucose regulation, respectively. Through this approach, 22 promising new PFAS substitutes that may warrant experimental investigation are identified. This integrated approach of molecular screening and toxicity estimation may be applicable to other chemical classes.
doi_str_mv 10.1021/acs.jcim.2c00374
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2718632056</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2718632056</sourcerecordid><originalsourceid>FETCH-LOGICAL-a271t-4a298ee209642105f449a7df5a0eddc0edf74cda9bbca04d7ab72647bf2d2c043</originalsourceid><addsrcrecordid>eNp1kV1LwzAUhoMoOKf3Xga8mbDONE0_cjnGpsKGg03wrpym6daZNTNpwf4A_7fZhyCCN0k4ed73HM6L0K1PBj6h_gMIO9iIcjuggpAgZmeo44eMezwib-c_75BHl-jK2o1DAh7RDvqaaSVFo8DghTBSVmW1wlDleKk_S1HWLR7butxCXeoK6wLTiPQJIXguTaEabTSo91YdFHOt2t-1RZPZGiohLe7NJ8OFvcf12uhmtcYzEOuykngqwew7XqOLApSVN6e7i14n4-XoyZu-PD6PhlMPaOzXHgPKEykp4RGjPgkLxjjEeRECkXku3FHETOTAs0wAYXkMWUwjFmcFzd1aWNBFvaPvzuiPRto63ZZWSKWgkrqxqeuSRAElYeTQuz_oRjemctM5igZJ6CecOoocKWG0tUYW6c64bZk29Um6zyV1uaT7XNJTLk7SP0oOPz-e_-LfWmiSfQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2723851892</pqid></control><display><type>article</type><title>Molecular Screening and Toxicity Estimation of 260,000 Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) through Machine Learning</title><source>ACS Publications</source><creator>Lai, Thanh T. ; Kuntz, David ; Wilson, Angela K.</creator><creatorcontrib>Lai, Thanh T. ; Kuntz, David ; Wilson, Angela K.</creatorcontrib><description>Perfluoroalkyl and polyfluoroalkyl substances (PFASs) are a class of chemicals widely used in industrial applications due to their exceptional properties and stability. However, they do not readily degrade in the environment and are linked to contamination and adverse health effects in humans and wildlife. To find alternatives for the most commonly used PFAS molecules that maintain their desirable chemical properties but are not adverse to biological lifeforms, a novel approach based upon machine learning is utilized. The machine learning model is trained on an existing set of PFAS molecules to generate over 260,000 novel PFAS molecules, which we dub PFAS-AI-Gen. Using molecular descriptors with known relationships to toxicity and industrial suitability followed by molecular docking and molecular dynamics simulations, this set of molecules is screened. In this manner, increasingly complex calculations are performed only for candidate molecules that are most likely to yield the desired properties of low binding affinity toward two selected protein receptors, the human pregnane x receptor (hPXR) and peroxisome proliferator-activated receptor γ (PPAR-γ), and high industrial suitability, defined by critical micelle concentration (CMC). The selection criteria of low binding affinity and high industrial suitability are relative to the popular PFAS alternative GenX. hPXR and PPAR-γ are selected as they are PFAS targets and facilitate a variety of functions, such as drug metabolism and glucose regulation, respectively. Through this approach, 22 promising new PFAS substitutes that may warrant experimental investigation are identified. This integrated approach of molecular screening and toxicity estimation may be applicable to other chemical classes.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/acs.jcim.2c00374</identifier><language>eng</language><publisher>Washington: American Chemical Society</publisher><subject>Affinity ; Binding ; Chemical properties ; Industrial applications ; Machine learning ; Machine Learning and Deep Learning ; Micelles ; Molecular docking ; Molecular dynamics ; Perfluoroalkyl &amp; polyfluoroalkyl substances ; Receptors ; Screening ; Toxicity</subject><ispartof>Journal of chemical information and modeling, 2022-10, Vol.62 (19), p.4569-4578</ispartof><rights>2022 American Chemical Society</rights><rights>Copyright American Chemical Society Oct 10, 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a271t-4a298ee209642105f449a7df5a0eddc0edf74cda9bbca04d7ab72647bf2d2c043</citedby><cites>FETCH-LOGICAL-a271t-4a298ee209642105f449a7df5a0eddc0edf74cda9bbca04d7ab72647bf2d2c043</cites><orcidid>0000-0001-9500-1628</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/acs.jcim.2c00374$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/acs.jcim.2c00374$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,777,781,2752,27057,27905,27906,56719,56769</link.rule.ids></links><search><creatorcontrib>Lai, Thanh T.</creatorcontrib><creatorcontrib>Kuntz, David</creatorcontrib><creatorcontrib>Wilson, Angela K.</creatorcontrib><title>Molecular Screening and Toxicity Estimation of 260,000 Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) through Machine Learning</title><title>Journal of chemical information and modeling</title><addtitle>J. Chem. Inf. Model</addtitle><description>Perfluoroalkyl and polyfluoroalkyl substances (PFASs) are a class of chemicals widely used in industrial applications due to their exceptional properties and stability. However, they do not readily degrade in the environment and are linked to contamination and adverse health effects in humans and wildlife. To find alternatives for the most commonly used PFAS molecules that maintain their desirable chemical properties but are not adverse to biological lifeforms, a novel approach based upon machine learning is utilized. The machine learning model is trained on an existing set of PFAS molecules to generate over 260,000 novel PFAS molecules, which we dub PFAS-AI-Gen. Using molecular descriptors with known relationships to toxicity and industrial suitability followed by molecular docking and molecular dynamics simulations, this set of molecules is screened. In this manner, increasingly complex calculations are performed only for candidate molecules that are most likely to yield the desired properties of low binding affinity toward two selected protein receptors, the human pregnane x receptor (hPXR) and peroxisome proliferator-activated receptor γ (PPAR-γ), and high industrial suitability, defined by critical micelle concentration (CMC). The selection criteria of low binding affinity and high industrial suitability are relative to the popular PFAS alternative GenX. hPXR and PPAR-γ are selected as they are PFAS targets and facilitate a variety of functions, such as drug metabolism and glucose regulation, respectively. Through this approach, 22 promising new PFAS substitutes that may warrant experimental investigation are identified. This integrated approach of molecular screening and toxicity estimation may be applicable to other chemical classes.</description><subject>Affinity</subject><subject>Binding</subject><subject>Chemical properties</subject><subject>Industrial applications</subject><subject>Machine learning</subject><subject>Machine Learning and Deep Learning</subject><subject>Micelles</subject><subject>Molecular docking</subject><subject>Molecular dynamics</subject><subject>Perfluoroalkyl &amp; polyfluoroalkyl substances</subject><subject>Receptors</subject><subject>Screening</subject><subject>Toxicity</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp1kV1LwzAUhoMoOKf3Xga8mbDONE0_cjnGpsKGg03wrpym6daZNTNpwf4A_7fZhyCCN0k4ed73HM6L0K1PBj6h_gMIO9iIcjuggpAgZmeo44eMezwib-c_75BHl-jK2o1DAh7RDvqaaSVFo8DghTBSVmW1wlDleKk_S1HWLR7butxCXeoK6wLTiPQJIXguTaEabTSo91YdFHOt2t-1RZPZGiohLe7NJ8OFvcf12uhmtcYzEOuykngqwew7XqOLApSVN6e7i14n4-XoyZu-PD6PhlMPaOzXHgPKEykp4RGjPgkLxjjEeRECkXku3FHETOTAs0wAYXkMWUwjFmcFzd1aWNBFvaPvzuiPRto63ZZWSKWgkrqxqeuSRAElYeTQuz_oRjemctM5igZJ6CecOoocKWG0tUYW6c64bZk29Um6zyV1uaT7XNJTLk7SP0oOPz-e_-LfWmiSfQ</recordid><startdate>20221010</startdate><enddate>20221010</enddate><creator>Lai, Thanh T.</creator><creator>Kuntz, David</creator><creator>Wilson, Angela K.</creator><general>American Chemical Society</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9500-1628</orcidid></search><sort><creationdate>20221010</creationdate><title>Molecular Screening and Toxicity Estimation of 260,000 Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) through Machine Learning</title><author>Lai, Thanh T. ; Kuntz, David ; Wilson, Angela K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a271t-4a298ee209642105f449a7df5a0eddc0edf74cda9bbca04d7ab72647bf2d2c043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Affinity</topic><topic>Binding</topic><topic>Chemical properties</topic><topic>Industrial applications</topic><topic>Machine learning</topic><topic>Machine Learning and Deep Learning</topic><topic>Micelles</topic><topic>Molecular docking</topic><topic>Molecular dynamics</topic><topic>Perfluoroalkyl &amp; polyfluoroalkyl substances</topic><topic>Receptors</topic><topic>Screening</topic><topic>Toxicity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lai, Thanh T.</creatorcontrib><creatorcontrib>Kuntz, David</creatorcontrib><creatorcontrib>Wilson, Angela K.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lai, Thanh T.</au><au>Kuntz, David</au><au>Wilson, Angela K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Molecular Screening and Toxicity Estimation of 260,000 Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) through Machine Learning</atitle><jtitle>Journal of chemical information and modeling</jtitle><addtitle>J. Chem. Inf. Model</addtitle><date>2022-10-10</date><risdate>2022</risdate><volume>62</volume><issue>19</issue><spage>4569</spage><epage>4578</epage><pages>4569-4578</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>Perfluoroalkyl and polyfluoroalkyl substances (PFASs) are a class of chemicals widely used in industrial applications due to their exceptional properties and stability. However, they do not readily degrade in the environment and are linked to contamination and adverse health effects in humans and wildlife. To find alternatives for the most commonly used PFAS molecules that maintain their desirable chemical properties but are not adverse to biological lifeforms, a novel approach based upon machine learning is utilized. The machine learning model is trained on an existing set of PFAS molecules to generate over 260,000 novel PFAS molecules, which we dub PFAS-AI-Gen. Using molecular descriptors with known relationships to toxicity and industrial suitability followed by molecular docking and molecular dynamics simulations, this set of molecules is screened. In this manner, increasingly complex calculations are performed only for candidate molecules that are most likely to yield the desired properties of low binding affinity toward two selected protein receptors, the human pregnane x receptor (hPXR) and peroxisome proliferator-activated receptor γ (PPAR-γ), and high industrial suitability, defined by critical micelle concentration (CMC). The selection criteria of low binding affinity and high industrial suitability are relative to the popular PFAS alternative GenX. hPXR and PPAR-γ are selected as they are PFAS targets and facilitate a variety of functions, such as drug metabolism and glucose regulation, respectively. Through this approach, 22 promising new PFAS substitutes that may warrant experimental investigation are identified. This integrated approach of molecular screening and toxicity estimation may be applicable to other chemical classes.</abstract><cop>Washington</cop><pub>American Chemical Society</pub><doi>10.1021/acs.jcim.2c00374</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-9500-1628</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1549-9596
ispartof Journal of chemical information and modeling, 2022-10, Vol.62 (19), p.4569-4578
issn 1549-9596
1549-960X
language eng
recordid cdi_proquest_miscellaneous_2718632056
source ACS Publications
subjects Affinity
Binding
Chemical properties
Industrial applications
Machine learning
Machine Learning and Deep Learning
Micelles
Molecular docking
Molecular dynamics
Perfluoroalkyl & polyfluoroalkyl substances
Receptors
Screening
Toxicity
title Molecular Screening and Toxicity Estimation of 260,000 Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) through Machine Learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T19%3A57%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Molecular%20Screening%20and%20Toxicity%20Estimation%20of%20260,000%20Perfluoroalkyl%20and%20Polyfluoroalkyl%20Substances%20(PFASs)%20through%20Machine%20Learning&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Lai,%20Thanh%20T.&rft.date=2022-10-10&rft.volume=62&rft.issue=19&rft.spage=4569&rft.epage=4578&rft.pages=4569-4578&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/acs.jcim.2c00374&rft_dat=%3Cproquest_cross%3E2718632056%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2723851892&rft_id=info:pmid/&rfr_iscdi=true