Quantum–mechanical property prediction of solvated drug molecules: what have we learned from a decade of SAMPL blind prediction challenges?

Joint academic–industrial projects supporting drug discovery are frequently pursued to deploy and benchmark cutting-edge methodical developments from academia in a real-world industrial environment at different scales. The dimensionality of tasks ranges from small molecule physicochemical property a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of computer-aided molecular design 2021-04, Vol.35 (4), p.453-472
Hauptverfasser: Tielker, Nicolas, Eberlein, Lukas, Hessler, Gerhard, Schmidt, K. Friedemann, Güssregen, Stefan, Kast, Stefan M.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 472
container_issue 4
container_start_page 453
container_title Journal of computer-aided molecular design
container_volume 35
creator Tielker, Nicolas
Eberlein, Lukas
Hessler, Gerhard
Schmidt, K. Friedemann
Güssregen, Stefan
Kast, Stefan M.
description Joint academic–industrial projects supporting drug discovery are frequently pursued to deploy and benchmark cutting-edge methodical developments from academia in a real-world industrial environment at different scales. The dimensionality of tasks ranges from small molecule physicochemical property assessment over protein–ligand interaction up to statistical analyses of biological data. This way, method development and usability both benefit from insights gained at both ends, when predictiveness and readiness of novel approaches are confirmed, but the pharmaceutical drug makers get early access to novel tools for the quality of drug products and benefit of patients. Quantum–mechanical and simulation methods particularly fall into this group of methods, as they require skills and expense in their development but also significant resources in their application, thus are comparatively slowly dripping into the realm of industrial use. Nevertheless, these physics-based methods are becoming more and more useful. Starting with a general overview of these and in particular quantum–mechanical methods for drug discovery we review a decade-long and ongoing collaboration between Sanofi and the Kast group focused on the application of the embedded cluster reference interaction site model (EC-RISM), a solvation model for quantum chemistry, to study small molecule chemistry in the context of joint participation in several SAMPL (Statistical Assessment of Modeling of Proteins and Ligands) blind prediction challenges. Starting with early application to tautomer equilibria in water (SAMPL2) the methodology was further developed to allow for challenge contributions related to predictions of distribution coefficients (SAMPL5) and acidity constants (SAMPL6) over the years. Particular emphasis is put on a frequently overlooked aspect of measuring the quality of models, namely the retrospective analysis of earlier datasets and predictions in light of more recent and advanced developments. We therefore demonstrate the performance of the current methodical state of the art as developed and optimized for the SAMPL6 p K a and octanol–water log P challenges when re-applied to the earlier SAMPL5 cyclohexane-water log D and SAMPL2 tautomer equilibria datasets. Systematic improvement is not consistently found throughout despite the similarity of the problem class, i.e. protonation reactions and phase distribution. Hence, it is possible to learn about hidden bias in model assessment, as r
doi_str_mv 10.1007/s10822-020-00347-5
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8018924</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2452498717</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-3f02edd8cc409ed4febfbce311a51908b39b72ce398f94585cbff3ef654da8ce3</originalsourceid><addsrcrecordid>eNp9Ub1uFDEQthCIHIEXoECWaGgW_Ju1KUBRRADpECBAorO89vhuI-_6Yu9elC4vQMUb8iQ4XAiBgsqj-X7GMx9CDyl5SglpnxVKFGMNYaQhhIu2kbfQgsqWN0JLehstiK7QgRRf99C9Uk5IFekDchftcV4rLtUCffs423Gahx8X3wdwazv2zka8yWkDeTqvBfjeTX0acQq4pLi1E3js87zCQ4rg5gjlOT5b2wmv7RbwGeAINo-VFHIasMUenPVwKf90-O7DEnexH_1N4zo1RhhXUF7eR3eCjQUeXL376Mvxq89Hb5rl-9dvjw6XjROtmBoeCAPvlXOCaPAiQBc6B5xSK6kmquO6a1ltaBW0kEq6LgQOoZ7CW1X7--jFznczdwN4B-OUbTSb3A82n5tke_M3MvZrs0pbowhVmolq8OTKIKfTGcpkhr44iNGOkOZimJBMaNXStlIf_0M9SXMe63qGSaIIY7pllcV2LJdTKRnC9WcoMZdpm13apqZtfqVtZBU9urnGteR3vJXAd4RSoXrh_Gf2f2x_Au1tulY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2508022972</pqid></control><display><type>article</type><title>Quantum–mechanical property prediction of solvated drug molecules: what have we learned from a decade of SAMPL blind prediction challenges?</title><source>MEDLINE</source><source>SpringerLink Journals - AutoHoldings</source><creator>Tielker, Nicolas ; Eberlein, Lukas ; Hessler, Gerhard ; Schmidt, K. Friedemann ; Güssregen, Stefan ; Kast, Stefan M.</creator><creatorcontrib>Tielker, Nicolas ; Eberlein, Lukas ; Hessler, Gerhard ; Schmidt, K. Friedemann ; Güssregen, Stefan ; Kast, Stefan M.</creatorcontrib><description>Joint academic–industrial projects supporting drug discovery are frequently pursued to deploy and benchmark cutting-edge methodical developments from academia in a real-world industrial environment at different scales. The dimensionality of tasks ranges from small molecule physicochemical property assessment over protein–ligand interaction up to statistical analyses of biological data. This way, method development and usability both benefit from insights gained at both ends, when predictiveness and readiness of novel approaches are confirmed, but the pharmaceutical drug makers get early access to novel tools for the quality of drug products and benefit of patients. Quantum–mechanical and simulation methods particularly fall into this group of methods, as they require skills and expense in their development but also significant resources in their application, thus are comparatively slowly dripping into the realm of industrial use. Nevertheless, these physics-based methods are becoming more and more useful. Starting with a general overview of these and in particular quantum–mechanical methods for drug discovery we review a decade-long and ongoing collaboration between Sanofi and the Kast group focused on the application of the embedded cluster reference interaction site model (EC-RISM), a solvation model for quantum chemistry, to study small molecule chemistry in the context of joint participation in several SAMPL (Statistical Assessment of Modeling of Proteins and Ligands) blind prediction challenges. Starting with early application to tautomer equilibria in water (SAMPL2) the methodology was further developed to allow for challenge contributions related to predictions of distribution coefficients (SAMPL5) and acidity constants (SAMPL6) over the years. Particular emphasis is put on a frequently overlooked aspect of measuring the quality of models, namely the retrospective analysis of earlier datasets and predictions in light of more recent and advanced developments. We therefore demonstrate the performance of the current methodical state of the art as developed and optimized for the SAMPL6 p K a and octanol–water log P challenges when re-applied to the earlier SAMPL5 cyclohexane-water log D and SAMPL2 tautomer equilibria datasets. Systematic improvement is not consistently found throughout despite the similarity of the problem class, i.e. protonation reactions and phase distribution. Hence, it is possible to learn about hidden bias in model assessment, as results derived from more elaborate methods do not necessarily improve quantitative agreement. This indicates the role of chance or coincidence for model development on the one hand which allows for the identification of systematic error and opportunities toward improvement and reveals possible sources of experimental uncertainty on the other. These insights are particularly useful for further academia–industry collaborations, as both partners are then enabled to optimize both the computational and experimental settings for data generation.</description><identifier>ISSN: 0920-654X</identifier><identifier>EISSN: 1573-4951</identifier><identifier>DOI: 10.1007/s10822-020-00347-5</identifier><identifier>PMID: 33079358</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Acidity ; Animal Anatomy ; Chemistry ; Chemistry and Materials Science ; Computer Applications in Chemistry ; Computer Simulation ; Cyclohexane ; Cyclohexanes - chemistry ; Datasets ; Drug Discovery ; Histology ; Industrial applications ; Ligands ; Models, Chemical ; Morphology ; Octanol ; Pharmaceutical Preparations - chemistry ; Phase distribution ; Physical Chemistry ; Physicochemical properties ; Proteins ; Protonation ; Quantum chemistry ; Quantum Theory ; Solubility ; Solvation ; Solvents - chemistry ; Statistical analysis ; Systematic errors ; Tautomers ; Thermodynamics ; Water - chemistry</subject><ispartof>Journal of computer-aided molecular design, 2021-04, Vol.35 (4), p.453-472</ispartof><rights>The Author(s) 2020</rights><rights>The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-3f02edd8cc409ed4febfbce311a51908b39b72ce398f94585cbff3ef654da8ce3</citedby><cites>FETCH-LOGICAL-c474t-3f02edd8cc409ed4febfbce311a51908b39b72ce398f94585cbff3ef654da8ce3</cites><orcidid>0000-0001-7346-7064 ; 0000-0002-1868-9614</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10822-020-00347-5$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10822-020-00347-5$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>230,314,780,784,885,27923,27924,41487,42556,51318</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33079358$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Tielker, Nicolas</creatorcontrib><creatorcontrib>Eberlein, Lukas</creatorcontrib><creatorcontrib>Hessler, Gerhard</creatorcontrib><creatorcontrib>Schmidt, K. Friedemann</creatorcontrib><creatorcontrib>Güssregen, Stefan</creatorcontrib><creatorcontrib>Kast, Stefan M.</creatorcontrib><title>Quantum–mechanical property prediction of solvated drug molecules: what have we learned from a decade of SAMPL blind prediction challenges?</title><title>Journal of computer-aided molecular design</title><addtitle>J Comput Aided Mol Des</addtitle><addtitle>J Comput Aided Mol Des</addtitle><description>Joint academic–industrial projects supporting drug discovery are frequently pursued to deploy and benchmark cutting-edge methodical developments from academia in a real-world industrial environment at different scales. The dimensionality of tasks ranges from small molecule physicochemical property assessment over protein–ligand interaction up to statistical analyses of biological data. This way, method development and usability both benefit from insights gained at both ends, when predictiveness and readiness of novel approaches are confirmed, but the pharmaceutical drug makers get early access to novel tools for the quality of drug products and benefit of patients. Quantum–mechanical and simulation methods particularly fall into this group of methods, as they require skills and expense in their development but also significant resources in their application, thus are comparatively slowly dripping into the realm of industrial use. Nevertheless, these physics-based methods are becoming more and more useful. Starting with a general overview of these and in particular quantum–mechanical methods for drug discovery we review a decade-long and ongoing collaboration between Sanofi and the Kast group focused on the application of the embedded cluster reference interaction site model (EC-RISM), a solvation model for quantum chemistry, to study small molecule chemistry in the context of joint participation in several SAMPL (Statistical Assessment of Modeling of Proteins and Ligands) blind prediction challenges. Starting with early application to tautomer equilibria in water (SAMPL2) the methodology was further developed to allow for challenge contributions related to predictions of distribution coefficients (SAMPL5) and acidity constants (SAMPL6) over the years. Particular emphasis is put on a frequently overlooked aspect of measuring the quality of models, namely the retrospective analysis of earlier datasets and predictions in light of more recent and advanced developments. We therefore demonstrate the performance of the current methodical state of the art as developed and optimized for the SAMPL6 p K a and octanol–water log P challenges when re-applied to the earlier SAMPL5 cyclohexane-water log D and SAMPL2 tautomer equilibria datasets. Systematic improvement is not consistently found throughout despite the similarity of the problem class, i.e. protonation reactions and phase distribution. Hence, it is possible to learn about hidden bias in model assessment, as results derived from more elaborate methods do not necessarily improve quantitative agreement. This indicates the role of chance or coincidence for model development on the one hand which allows for the identification of systematic error and opportunities toward improvement and reveals possible sources of experimental uncertainty on the other. These insights are particularly useful for further academia–industry collaborations, as both partners are then enabled to optimize both the computational and experimental settings for data generation.</description><subject>Acidity</subject><subject>Animal Anatomy</subject><subject>Chemistry</subject><subject>Chemistry and Materials Science</subject><subject>Computer Applications in Chemistry</subject><subject>Computer Simulation</subject><subject>Cyclohexane</subject><subject>Cyclohexanes - chemistry</subject><subject>Datasets</subject><subject>Drug Discovery</subject><subject>Histology</subject><subject>Industrial applications</subject><subject>Ligands</subject><subject>Models, Chemical</subject><subject>Morphology</subject><subject>Octanol</subject><subject>Pharmaceutical Preparations - chemistry</subject><subject>Phase distribution</subject><subject>Physical Chemistry</subject><subject>Physicochemical properties</subject><subject>Proteins</subject><subject>Protonation</subject><subject>Quantum chemistry</subject><subject>Quantum Theory</subject><subject>Solubility</subject><subject>Solvation</subject><subject>Solvents - chemistry</subject><subject>Statistical analysis</subject><subject>Systematic errors</subject><subject>Tautomers</subject><subject>Thermodynamics</subject><subject>Water - chemistry</subject><issn>0920-654X</issn><issn>1573-4951</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9Ub1uFDEQthCIHIEXoECWaGgW_Ju1KUBRRADpECBAorO89vhuI-_6Yu9elC4vQMUb8iQ4XAiBgsqj-X7GMx9CDyl5SglpnxVKFGMNYaQhhIu2kbfQgsqWN0JLehstiK7QgRRf99C9Uk5IFekDchftcV4rLtUCffs423Gahx8X3wdwazv2zka8yWkDeTqvBfjeTX0acQq4pLi1E3js87zCQ4rg5gjlOT5b2wmv7RbwGeAINo-VFHIasMUenPVwKf90-O7DEnexH_1N4zo1RhhXUF7eR3eCjQUeXL376Mvxq89Hb5rl-9dvjw6XjROtmBoeCAPvlXOCaPAiQBc6B5xSK6kmquO6a1ltaBW0kEq6LgQOoZ7CW1X7--jFznczdwN4B-OUbTSb3A82n5tke_M3MvZrs0pbowhVmolq8OTKIKfTGcpkhr44iNGOkOZimJBMaNXStlIf_0M9SXMe63qGSaIIY7pllcV2LJdTKRnC9WcoMZdpm13apqZtfqVtZBU9urnGteR3vJXAd4RSoXrh_Gf2f2x_Au1tulY</recordid><startdate>20210401</startdate><enddate>20210401</enddate><creator>Tielker, Nicolas</creator><creator>Eberlein, Lukas</creator><creator>Hessler, Gerhard</creator><creator>Schmidt, K. Friedemann</creator><creator>Güssregen, Stefan</creator><creator>Kast, Stefan M.</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>88I</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>KB.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>P5Z</scope><scope>P62</scope><scope>PCBAR</scope><scope>PDBOC</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-7346-7064</orcidid><orcidid>https://orcid.org/0000-0002-1868-9614</orcidid></search><sort><creationdate>20210401</creationdate><title>Quantum–mechanical property prediction of solvated drug molecules: what have we learned from a decade of SAMPL blind prediction challenges?</title><author>Tielker, Nicolas ; Eberlein, Lukas ; Hessler, Gerhard ; Schmidt, K. Friedemann ; Güssregen, Stefan ; Kast, Stefan M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-3f02edd8cc409ed4febfbce311a51908b39b72ce398f94585cbff3ef654da8ce3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Acidity</topic><topic>Animal Anatomy</topic><topic>Chemistry</topic><topic>Chemistry and Materials Science</topic><topic>Computer Applications in Chemistry</topic><topic>Computer Simulation</topic><topic>Cyclohexane</topic><topic>Cyclohexanes - chemistry</topic><topic>Datasets</topic><topic>Drug Discovery</topic><topic>Histology</topic><topic>Industrial applications</topic><topic>Ligands</topic><topic>Models, Chemical</topic><topic>Morphology</topic><topic>Octanol</topic><topic>Pharmaceutical Preparations - chemistry</topic><topic>Phase distribution</topic><topic>Physical Chemistry</topic><topic>Physicochemical properties</topic><topic>Proteins</topic><topic>Protonation</topic><topic>Quantum chemistry</topic><topic>Quantum Theory</topic><topic>Solubility</topic><topic>Solvation</topic><topic>Solvents - chemistry</topic><topic>Statistical analysis</topic><topic>Systematic errors</topic><topic>Tautomers</topic><topic>Thermodynamics</topic><topic>Water - chemistry</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tielker, Nicolas</creatorcontrib><creatorcontrib>Eberlein, Lukas</creatorcontrib><creatorcontrib>Hessler, Gerhard</creatorcontrib><creatorcontrib>Schmidt, K. Friedemann</creatorcontrib><creatorcontrib>Güssregen, Stefan</creatorcontrib><creatorcontrib>Kast, Stefan M.</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Earth, Atmospheric &amp; Aquatic Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Earth, Atmospheric &amp; Aquatic Science Database</collection><collection>Materials Science Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of computer-aided molecular design</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tielker, Nicolas</au><au>Eberlein, Lukas</au><au>Hessler, Gerhard</au><au>Schmidt, K. Friedemann</au><au>Güssregen, Stefan</au><au>Kast, Stefan M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Quantum–mechanical property prediction of solvated drug molecules: what have we learned from a decade of SAMPL blind prediction challenges?</atitle><jtitle>Journal of computer-aided molecular design</jtitle><stitle>J Comput Aided Mol Des</stitle><addtitle>J Comput Aided Mol Des</addtitle><date>2021-04-01</date><risdate>2021</risdate><volume>35</volume><issue>4</issue><spage>453</spage><epage>472</epage><pages>453-472</pages><issn>0920-654X</issn><eissn>1573-4951</eissn><abstract>Joint academic–industrial projects supporting drug discovery are frequently pursued to deploy and benchmark cutting-edge methodical developments from academia in a real-world industrial environment at different scales. The dimensionality of tasks ranges from small molecule physicochemical property assessment over protein–ligand interaction up to statistical analyses of biological data. This way, method development and usability both benefit from insights gained at both ends, when predictiveness and readiness of novel approaches are confirmed, but the pharmaceutical drug makers get early access to novel tools for the quality of drug products and benefit of patients. Quantum–mechanical and simulation methods particularly fall into this group of methods, as they require skills and expense in their development but also significant resources in their application, thus are comparatively slowly dripping into the realm of industrial use. Nevertheless, these physics-based methods are becoming more and more useful. Starting with a general overview of these and in particular quantum–mechanical methods for drug discovery we review a decade-long and ongoing collaboration between Sanofi and the Kast group focused on the application of the embedded cluster reference interaction site model (EC-RISM), a solvation model for quantum chemistry, to study small molecule chemistry in the context of joint participation in several SAMPL (Statistical Assessment of Modeling of Proteins and Ligands) blind prediction challenges. Starting with early application to tautomer equilibria in water (SAMPL2) the methodology was further developed to allow for challenge contributions related to predictions of distribution coefficients (SAMPL5) and acidity constants (SAMPL6) over the years. Particular emphasis is put on a frequently overlooked aspect of measuring the quality of models, namely the retrospective analysis of earlier datasets and predictions in light of more recent and advanced developments. We therefore demonstrate the performance of the current methodical state of the art as developed and optimized for the SAMPL6 p K a and octanol–water log P challenges when re-applied to the earlier SAMPL5 cyclohexane-water log D and SAMPL2 tautomer equilibria datasets. Systematic improvement is not consistently found throughout despite the similarity of the problem class, i.e. protonation reactions and phase distribution. Hence, it is possible to learn about hidden bias in model assessment, as results derived from more elaborate methods do not necessarily improve quantitative agreement. This indicates the role of chance or coincidence for model development on the one hand which allows for the identification of systematic error and opportunities toward improvement and reveals possible sources of experimental uncertainty on the other. These insights are particularly useful for further academia–industry collaborations, as both partners are then enabled to optimize both the computational and experimental settings for data generation.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><pmid>33079358</pmid><doi>10.1007/s10822-020-00347-5</doi><tpages>20</tpages><orcidid>https://orcid.org/0000-0001-7346-7064</orcidid><orcidid>https://orcid.org/0000-0002-1868-9614</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0920-654X
ispartof Journal of computer-aided molecular design, 2021-04, Vol.35 (4), p.453-472
issn 0920-654X
1573-4951
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8018924
source MEDLINE; SpringerLink Journals - AutoHoldings
subjects Acidity
Animal Anatomy
Chemistry
Chemistry and Materials Science
Computer Applications in Chemistry
Computer Simulation
Cyclohexane
Cyclohexanes - chemistry
Datasets
Drug Discovery
Histology
Industrial applications
Ligands
Models, Chemical
Morphology
Octanol
Pharmaceutical Preparations - chemistry
Phase distribution
Physical Chemistry
Physicochemical properties
Proteins
Protonation
Quantum chemistry
Quantum Theory
Solubility
Solvation
Solvents - chemistry
Statistical analysis
Systematic errors
Tautomers
Thermodynamics
Water - chemistry
title Quantum–mechanical property prediction of solvated drug molecules: what have we learned from a decade of SAMPL blind prediction challenges?
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T22%3A03%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Quantum%E2%80%93mechanical%20property%20prediction%20of%20solvated%20drug%20molecules:%20what%20have%20we%20learned%20from%20a%20decade%20of%20SAMPL%20blind%20prediction%20challenges?&rft.jtitle=Journal%20of%20computer-aided%20molecular%20design&rft.au=Tielker,%20Nicolas&rft.date=2021-04-01&rft.volume=35&rft.issue=4&rft.spage=453&rft.epage=472&rft.pages=453-472&rft.issn=0920-654X&rft.eissn=1573-4951&rft_id=info:doi/10.1007/s10822-020-00347-5&rft_dat=%3Cproquest_pubme%3E2452498717%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2508022972&rft_id=info:pmid/33079358&rfr_iscdi=true