Computational Predictions of Nonclinical Pharmacokinetics at the Drug Design Stage

Although computational predictions of pharmacokinetics (PK) are desirable at the drug design stage, existing approaches are often limited by prediction accuracy and human interpretability. Using a discovery data set of mouse and rat PK studies at Roche (9,685 unique compounds), we performed a proof-...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of chemical information and modeling 2023-01, Vol.63 (2), p.442-458
Hauptverfasser: Stoyanova, Raya, Katzberger, Paul Maximilian, Komissarov, Leonid, Khadhraoui, Aous, Sach-Peltason, Lisa, Groebke Zbinden, Katrin, Schindler, Torsten, Manevski, Nenad
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 458
container_issue 2
container_start_page 442
container_title Journal of chemical information and modeling
container_volume 63
creator Stoyanova, Raya
Katzberger, Paul Maximilian
Komissarov, Leonid
Khadhraoui, Aous
Sach-Peltason, Lisa
Groebke Zbinden, Katrin
Schindler, Torsten
Manevski, Nenad
description Although computational predictions of pharmacokinetics (PK) are desirable at the drug design stage, existing approaches are often limited by prediction accuracy and human interpretability. Using a discovery data set of mouse and rat PK studies at Roche (9,685 unique compounds), we performed a proof-of-concept study to predict key PK properties from chemical structure alone, including plasma clearance (CLp), volume of distribution at steady-state (Vss), and oral bioavailability (F). Ten machine learning (ML) models were evaluated, including Single-Task, Multitask, and transfer learning approaches (i.e., pretraining with in vitro data). In addition to prediction accuracy, we emphasized human interpretability of outcomes, especially the quantification of uncertainty, applicability domains, and explanations of predictions in terms of molecular features. Results show that intravenous (IV) PK properties (CLp and Vss) can be predicted with good precision (average absolute fold error, AAFE of 1.96–2.84 depending on data split) and low bias (average fold error, AFE of 0.98–1.36), with AutoGluon, Gaussian Process Regressor (GP), and ChemProp displaying the best performance. Driven by higher complexity of oral PK studies, predictions of F were more challenging, with the best AAFE values of 2.35–2.60 and higher overprediction bias (AFE of 1.45–1.62). Multi-Task approaches and pretraining of ChemProp neural networks with in vitro data showed similar precision to Single-Task models but helped reduce the bias and increase correlations between observations and predictions. A combination of GP-computed prediction variance, molecular clustering, and dimensionality-reduction provided valuable quantitative insights into prediction uncertainty and applicability domains. SHAPley Additive exPlanations (SHAPs) highlighted molecular features contributing to prediction outcomes of Vss, providing explanations that could aid drug design. Combined results show that computational predictions of PK are feasible at the drug design stage, with several ML technologies converging to successfully leverage historical PK data sets. Further studies are needed to unlock the full potential of this approach, especially with respect to data set sizes and quality, transfer learning between in vitro and in vivo data sets, model-independent quantification of uncertainty, and explainability of predictions.
doi_str_mv 10.1021/acs.jcim.2c01134
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2769263768</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2769263768</sourcerecordid><originalsourceid>FETCH-LOGICAL-a364t-b5724a22afc84edd0241416cf961caf2bd1450dbc8fef84b95c20675fb51d6a83</originalsourceid><addsrcrecordid>eNp1kM1PAyEQxYnRWK3ePRkSr24FFtjlaFq_kkaNH4k3wrLQUrtLBfbgf2_Xtt48zUzmvTeZHwBnGI0wIvhK6ThaaNeMiEYY53QPHGFGRSY4-tjf9UzwATiOcYFQngtODsEg50ywApVH4GXsm1WXVHK-VUv4HEztdD9E6C189K1eutbpfjVXoVHaf7rWJKcjVAmmuYGT0M3gxEQ3a-FrUjNzAg6sWkZzuq1D8H578za-z6ZPdw_j62mmck5TVrGCUEWIsrqkpq4RoZhirq3gWCtLqhpThupKl9bYklaCaYJ4wWzFcM1VmQ_BxSZ3FfxXZ2KSC9-F9RdRkoILwvOC9yq0UengYwzGylVwjQrfEiPZQ5RriLKHKLcQ15bzbXBXNab-M-yorQWXG8GvdXf037wfByB-Wg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2769263768</pqid></control><display><type>article</type><title>Computational Predictions of Nonclinical Pharmacokinetics at the Drug Design Stage</title><source>MEDLINE</source><source>American Chemical Society Journals</source><creator>Stoyanova, Raya ; Katzberger, Paul Maximilian ; Komissarov, Leonid ; Khadhraoui, Aous ; Sach-Peltason, Lisa ; Groebke Zbinden, Katrin ; Schindler, Torsten ; Manevski, Nenad</creator><creatorcontrib>Stoyanova, Raya ; Katzberger, Paul Maximilian ; Komissarov, Leonid ; Khadhraoui, Aous ; Sach-Peltason, Lisa ; Groebke Zbinden, Katrin ; Schindler, Torsten ; Manevski, Nenad</creatorcontrib><description>Although computational predictions of pharmacokinetics (PK) are desirable at the drug design stage, existing approaches are often limited by prediction accuracy and human interpretability. Using a discovery data set of mouse and rat PK studies at Roche (9,685 unique compounds), we performed a proof-of-concept study to predict key PK properties from chemical structure alone, including plasma clearance (CLp), volume of distribution at steady-state (Vss), and oral bioavailability (F). Ten machine learning (ML) models were evaluated, including Single-Task, Multitask, and transfer learning approaches (i.e., pretraining with in vitro data). In addition to prediction accuracy, we emphasized human interpretability of outcomes, especially the quantification of uncertainty, applicability domains, and explanations of predictions in terms of molecular features. Results show that intravenous (IV) PK properties (CLp and Vss) can be predicted with good precision (average absolute fold error, AAFE of 1.96–2.84 depending on data split) and low bias (average fold error, AFE of 0.98–1.36), with AutoGluon, Gaussian Process Regressor (GP), and ChemProp displaying the best performance. Driven by higher complexity of oral PK studies, predictions of F were more challenging, with the best AAFE values of 2.35–2.60 and higher overprediction bias (AFE of 1.45–1.62). Multi-Task approaches and pretraining of ChemProp neural networks with in vitro data showed similar precision to Single-Task models but helped reduce the bias and increase correlations between observations and predictions. A combination of GP-computed prediction variance, molecular clustering, and dimensionality-reduction provided valuable quantitative insights into prediction uncertainty and applicability domains. SHAPley Additive exPlanations (SHAPs) highlighted molecular features contributing to prediction outcomes of Vss, providing explanations that could aid drug design. Combined results show that computational predictions of PK are feasible at the drug design stage, with several ML technologies converging to successfully leverage historical PK data sets. Further studies are needed to unlock the full potential of this approach, especially with respect to data set sizes and quality, transfer learning between in vitro and in vivo data sets, model-independent quantification of uncertainty, and explainability of predictions.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/acs.jcim.2c01134</identifier><identifier>PMID: 36595708</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>Accuracy ; Administration, Intravenous ; Animals ; Bias ; Bioavailability ; Biological Availability ; Clustering ; Datasets ; Drug Design ; Gaussian process ; Humans ; Machine learning ; Machine Learning and Deep Learning ; Models, Biological ; Neural networks ; Neural Networks, Computer ; Pharmaceutical Preparations ; Pharmacokinetics ; Pharmacology ; Rats ; Uncertainty</subject><ispartof>Journal of chemical information and modeling, 2023-01, Vol.63 (2), p.442-458</ispartof><rights>2023 American Chemical Society</rights><rights>Copyright American Chemical Society Jan 23, 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a364t-b5724a22afc84edd0241416cf961caf2bd1450dbc8fef84b95c20675fb51d6a83</citedby><cites>FETCH-LOGICAL-a364t-b5724a22afc84edd0241416cf961caf2bd1450dbc8fef84b95c20675fb51d6a83</cites><orcidid>0000-0001-8387-3023</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/acs.jcim.2c01134$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/acs.jcim.2c01134$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,776,780,2752,27053,27901,27902,56713,56763</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36595708$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Stoyanova, Raya</creatorcontrib><creatorcontrib>Katzberger, Paul Maximilian</creatorcontrib><creatorcontrib>Komissarov, Leonid</creatorcontrib><creatorcontrib>Khadhraoui, Aous</creatorcontrib><creatorcontrib>Sach-Peltason, Lisa</creatorcontrib><creatorcontrib>Groebke Zbinden, Katrin</creatorcontrib><creatorcontrib>Schindler, Torsten</creatorcontrib><creatorcontrib>Manevski, Nenad</creatorcontrib><title>Computational Predictions of Nonclinical Pharmacokinetics at the Drug Design Stage</title><title>Journal of chemical information and modeling</title><addtitle>J. Chem. Inf. Model</addtitle><description>Although computational predictions of pharmacokinetics (PK) are desirable at the drug design stage, existing approaches are often limited by prediction accuracy and human interpretability. Using a discovery data set of mouse and rat PK studies at Roche (9,685 unique compounds), we performed a proof-of-concept study to predict key PK properties from chemical structure alone, including plasma clearance (CLp), volume of distribution at steady-state (Vss), and oral bioavailability (F). Ten machine learning (ML) models were evaluated, including Single-Task, Multitask, and transfer learning approaches (i.e., pretraining with in vitro data). In addition to prediction accuracy, we emphasized human interpretability of outcomes, especially the quantification of uncertainty, applicability domains, and explanations of predictions in terms of molecular features. Results show that intravenous (IV) PK properties (CLp and Vss) can be predicted with good precision (average absolute fold error, AAFE of 1.96–2.84 depending on data split) and low bias (average fold error, AFE of 0.98–1.36), with AutoGluon, Gaussian Process Regressor (GP), and ChemProp displaying the best performance. Driven by higher complexity of oral PK studies, predictions of F were more challenging, with the best AAFE values of 2.35–2.60 and higher overprediction bias (AFE of 1.45–1.62). Multi-Task approaches and pretraining of ChemProp neural networks with in vitro data showed similar precision to Single-Task models but helped reduce the bias and increase correlations between observations and predictions. A combination of GP-computed prediction variance, molecular clustering, and dimensionality-reduction provided valuable quantitative insights into prediction uncertainty and applicability domains. SHAPley Additive exPlanations (SHAPs) highlighted molecular features contributing to prediction outcomes of Vss, providing explanations that could aid drug design. Combined results show that computational predictions of PK are feasible at the drug design stage, with several ML technologies converging to successfully leverage historical PK data sets. Further studies are needed to unlock the full potential of this approach, especially with respect to data set sizes and quality, transfer learning between in vitro and in vivo data sets, model-independent quantification of uncertainty, and explainability of predictions.</description><subject>Accuracy</subject><subject>Administration, Intravenous</subject><subject>Animals</subject><subject>Bias</subject><subject>Bioavailability</subject><subject>Biological Availability</subject><subject>Clustering</subject><subject>Datasets</subject><subject>Drug Design</subject><subject>Gaussian process</subject><subject>Humans</subject><subject>Machine learning</subject><subject>Machine Learning and Deep Learning</subject><subject>Models, Biological</subject><subject>Neural networks</subject><subject>Neural Networks, Computer</subject><subject>Pharmaceutical Preparations</subject><subject>Pharmacokinetics</subject><subject>Pharmacology</subject><subject>Rats</subject><subject>Uncertainty</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp1kM1PAyEQxYnRWK3ePRkSr24FFtjlaFq_kkaNH4k3wrLQUrtLBfbgf2_Xtt48zUzmvTeZHwBnGI0wIvhK6ThaaNeMiEYY53QPHGFGRSY4-tjf9UzwATiOcYFQngtODsEg50ywApVH4GXsm1WXVHK-VUv4HEztdD9E6C189K1eutbpfjVXoVHaf7rWJKcjVAmmuYGT0M3gxEQ3a-FrUjNzAg6sWkZzuq1D8H578za-z6ZPdw_j62mmck5TVrGCUEWIsrqkpq4RoZhirq3gWCtLqhpThupKl9bYklaCaYJ4wWzFcM1VmQ_BxSZ3FfxXZ2KSC9-F9RdRkoILwvOC9yq0UengYwzGylVwjQrfEiPZQ5RriLKHKLcQ15bzbXBXNab-M-yorQWXG8GvdXf037wfByB-Wg</recordid><startdate>20230123</startdate><enddate>20230123</enddate><creator>Stoyanova, Raya</creator><creator>Katzberger, Paul Maximilian</creator><creator>Komissarov, Leonid</creator><creator>Khadhraoui, Aous</creator><creator>Sach-Peltason, Lisa</creator><creator>Groebke Zbinden, Katrin</creator><creator>Schindler, Torsten</creator><creator>Manevski, Nenad</creator><general>American Chemical Society</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-8387-3023</orcidid></search><sort><creationdate>20230123</creationdate><title>Computational Predictions of Nonclinical Pharmacokinetics at the Drug Design Stage</title><author>Stoyanova, Raya ; Katzberger, Paul Maximilian ; Komissarov, Leonid ; Khadhraoui, Aous ; Sach-Peltason, Lisa ; Groebke Zbinden, Katrin ; Schindler, Torsten ; Manevski, Nenad</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a364t-b5724a22afc84edd0241416cf961caf2bd1450dbc8fef84b95c20675fb51d6a83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Administration, Intravenous</topic><topic>Animals</topic><topic>Bias</topic><topic>Bioavailability</topic><topic>Biological Availability</topic><topic>Clustering</topic><topic>Datasets</topic><topic>Drug Design</topic><topic>Gaussian process</topic><topic>Humans</topic><topic>Machine learning</topic><topic>Machine Learning and Deep Learning</topic><topic>Models, Biological</topic><topic>Neural networks</topic><topic>Neural Networks, Computer</topic><topic>Pharmaceutical Preparations</topic><topic>Pharmacokinetics</topic><topic>Pharmacology</topic><topic>Rats</topic><topic>Uncertainty</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Stoyanova, Raya</creatorcontrib><creatorcontrib>Katzberger, Paul Maximilian</creatorcontrib><creatorcontrib>Komissarov, Leonid</creatorcontrib><creatorcontrib>Khadhraoui, Aous</creatorcontrib><creatorcontrib>Sach-Peltason, Lisa</creatorcontrib><creatorcontrib>Groebke Zbinden, Katrin</creatorcontrib><creatorcontrib>Schindler, Torsten</creatorcontrib><creatorcontrib>Manevski, Nenad</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Stoyanova, Raya</au><au>Katzberger, Paul Maximilian</au><au>Komissarov, Leonid</au><au>Khadhraoui, Aous</au><au>Sach-Peltason, Lisa</au><au>Groebke Zbinden, Katrin</au><au>Schindler, Torsten</au><au>Manevski, Nenad</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Computational Predictions of Nonclinical Pharmacokinetics at the Drug Design Stage</atitle><jtitle>Journal of chemical information and modeling</jtitle><addtitle>J. Chem. Inf. Model</addtitle><date>2023-01-23</date><risdate>2023</risdate><volume>63</volume><issue>2</issue><spage>442</spage><epage>458</epage><pages>442-458</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>Although computational predictions of pharmacokinetics (PK) are desirable at the drug design stage, existing approaches are often limited by prediction accuracy and human interpretability. Using a discovery data set of mouse and rat PK studies at Roche (9,685 unique compounds), we performed a proof-of-concept study to predict key PK properties from chemical structure alone, including plasma clearance (CLp), volume of distribution at steady-state (Vss), and oral bioavailability (F). Ten machine learning (ML) models were evaluated, including Single-Task, Multitask, and transfer learning approaches (i.e., pretraining with in vitro data). In addition to prediction accuracy, we emphasized human interpretability of outcomes, especially the quantification of uncertainty, applicability domains, and explanations of predictions in terms of molecular features. Results show that intravenous (IV) PK properties (CLp and Vss) can be predicted with good precision (average absolute fold error, AAFE of 1.96–2.84 depending on data split) and low bias (average fold error, AFE of 0.98–1.36), with AutoGluon, Gaussian Process Regressor (GP), and ChemProp displaying the best performance. Driven by higher complexity of oral PK studies, predictions of F were more challenging, with the best AAFE values of 2.35–2.60 and higher overprediction bias (AFE of 1.45–1.62). Multi-Task approaches and pretraining of ChemProp neural networks with in vitro data showed similar precision to Single-Task models but helped reduce the bias and increase correlations between observations and predictions. A combination of GP-computed prediction variance, molecular clustering, and dimensionality-reduction provided valuable quantitative insights into prediction uncertainty and applicability domains. SHAPley Additive exPlanations (SHAPs) highlighted molecular features contributing to prediction outcomes of Vss, providing explanations that could aid drug design. Combined results show that computational predictions of PK are feasible at the drug design stage, with several ML technologies converging to successfully leverage historical PK data sets. Further studies are needed to unlock the full potential of this approach, especially with respect to data set sizes and quality, transfer learning between in vitro and in vivo data sets, model-independent quantification of uncertainty, and explainability of predictions.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>36595708</pmid><doi>10.1021/acs.jcim.2c01134</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0001-8387-3023</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1549-9596
ispartof Journal of chemical information and modeling, 2023-01, Vol.63 (2), p.442-458
issn 1549-9596
1549-960X
language eng
recordid cdi_proquest_journals_2769263768
source MEDLINE; American Chemical Society Journals
subjects Accuracy
Administration, Intravenous
Animals
Bias
Bioavailability
Biological Availability
Clustering
Datasets
Drug Design
Gaussian process
Humans
Machine learning
Machine Learning and Deep Learning
Models, Biological
Neural networks
Neural Networks, Computer
Pharmaceutical Preparations
Pharmacokinetics
Pharmacology
Rats
Uncertainty
title Computational Predictions of Nonclinical Pharmacokinetics at the Drug Design Stage
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T11%3A23%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Computational%20Predictions%20of%20Nonclinical%20Pharmacokinetics%20at%20the%20Drug%20Design%20Stage&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Stoyanova,%20Raya&rft.date=2023-01-23&rft.volume=63&rft.issue=2&rft.spage=442&rft.epage=458&rft.pages=442-458&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/acs.jcim.2c01134&rft_dat=%3Cproquest_cross%3E2769263768%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2769263768&rft_id=info:pmid/36595708&rfr_iscdi=true