Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark

As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both sta...

Detailed description

Bibliographic details
Published in: Journal of chemical information and modeling 2015-01, Vol.55 (1), p.63-71
Main authors: Fourches, Denis; Politi, Regina; Tropsha, Alexander
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 71
container_issue 1
container_start_page 63
container_title Journal of chemical information and modeling
container_volume 55
creator Fourches, Denis
Politi, Regina
Tropsha, Alexander
description As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses.
doi_str_mv 10.1021/ci500519w
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1652431539</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3574261961</sourcerecordid><originalsourceid>FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</originalsourceid><addsrcrecordid>eNpl0ctKAzEUBuAgiveFLyABEXQxNveZLGu9QlHxAu6GNJ7UaDtTk5lK395oq4iukpCPP-fkILRDyREljHasl4RIqt-X0DqVQmdakcfl773Uag1txPhCCOdasVW0xqRkNKd8HdX3Jgyhye4mYL3zFl-Zxk-hcwK2nuGbOgLujUyM6Q4CvhxPQj2FiJtnwF1r22DsDNcO9_3QVE_41lSvvhpiX32J3l33FjNCOT6Gyj6PTXjdQivOjCJsL9ZN9HB2et-7yPrX55e9bj8zgugmU9QyN9Ayt1wXymkJ0giZF5ALXRRUCMGVEUzkhimSW5drZyiYgXZCs3Tkm-hgnpsKfmshNuXYRwujkamgbmNJlWSCU8l1ont_6EvdhipVl1SKY6rgKqnDubKhjjGAKyfBp45mJSXl5xTKnykku7tIbAdjePqR39-ewP4cGBt_vfYv6AOPFYtU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1649226836</pqid></control><display><type>article</type><title>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</title><source>MEDLINE</source><source>ACS Publications</source><creator>Fourches, Denis ; Politi, Regina ; Tropsha, Alexander</creator><creatorcontrib>Fourches, Denis ; Politi, Regina ; Tropsha, Alexander</creatorcontrib><description>As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. 
To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). 
This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/ci500519w</identifier><identifier>PMID: 25521713</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>Benchmarking ; Binding sites ; Computational Biology - methods ; Correlation analysis ; Databases, Chemical ; Ligands ; Models, Chemical ; Models, Theoretical ; Molecular Docking Simulation - methods ; Molecules ; Proteins ; Proteins - chemistry ; Proteins - metabolism ; Reproducibility of Results ; User-Computer Interface ; Workflow</subject><ispartof>Journal of chemical information and modeling, 2015-01, Vol.55 (1), p.63-71</ispartof><rights>Copyright © 2014 American Chemical Society</rights><rights>Copyright American Chemical Society Jan 26, 2015</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</citedby><cites>FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/ci500519w$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/ci500519w$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,778,782,2754,27063,27911,27912,56725,56775</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/25521713$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Fourches, Denis</creatorcontrib><creatorcontrib>Politi, Regina</creatorcontrib><creatorcontrib>Tropsha, 
Alexander</creatorcontrib><title>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</title><title>Journal of chemical information and modeling</title><addtitle>J. Chem. Inf. Model</addtitle><description>As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). 
This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses.</description><subject>Benchmarking</subject><subject>Binding sites</subject><subject>Computational Biology - methods</subject><subject>Correlation analysis</subject><subject>Databases, Chemical</subject><subject>Ligands</subject><subject>Models, Chemical</subject><subject>Models, Theoretical</subject><subject>Molecular Docking Simulation - methods</subject><subject>Molecules</subject><subject>Proteins</subject><subject>Proteins - chemistry</subject><subject>Proteins - metabolism</subject><subject>Reproducibility of Results</subject><subject>User-Computer Interface</subject><subject>Workflow</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpl0ctKAzEUBuAgiveFLyABEXQxNveZLGu9QlHxAu6GNJ7UaDtTk5lK395oq4iukpCPP-fkILRDyREljHasl4RIqt-X0DqVQmdakcfl773Uag1txPhCCOdasVW0xqRkNKd8HdX3Jgyhye4mYL3zFl-Zxk-hcwK2nuGbOgLujUyM6Q4CvhxPQj2FiJtnwF1r22DsDNcO9_3QVE_41lSvvhpiX32J3l33FjNCOT6Gyj6PTXjdQivOjCJsL9ZN9HB2et-7yPrX55e9bj8zgugmU9QyN9Ayt1wXymkJ0giZF5ALXRRUCMGVEUzkhimSW5drZyiYgXZCs3Tkm-hgnpsKfmshNuXYRwujkamgbmNJlWSCU8l1ont_6EvdhipVl1SKY6rgKqnDubKhjjGAKyfBp45mJSXl5xTKnykku7tIbAdjePqR39-ewP4cGBt_vfYv6AOPFYtU</recordid><startdate>20150126</startdate><enddate>20150126</enddate><creator>Fourches, Denis</creator><creator>Politi, Regina</creator><creator>Tropsha, Alexander</creator><general>American Chemical 
Society</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope></search><sort><creationdate>20150126</creationdate><title>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</title><author>Fourches, Denis ; Politi, Regina ; Tropsha, Alexander</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Benchmarking</topic><topic>Binding sites</topic><topic>Computational Biology - methods</topic><topic>Correlation analysis</topic><topic>Databases, Chemical</topic><topic>Ligands</topic><topic>Models, Chemical</topic><topic>Models, Theoretical</topic><topic>Molecular Docking Simulation - methods</topic><topic>Molecules</topic><topic>Proteins</topic><topic>Proteins - chemistry</topic><topic>Proteins - metabolism</topic><topic>Reproducibility of Results</topic><topic>User-Computer Interface</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fourches, Denis</creatorcontrib><creatorcontrib>Politi, Regina</creatorcontrib><creatorcontrib>Tropsha, Alexander</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials 
Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fourches, Denis</au><au>Politi, Regina</au><au>Tropsha, Alexander</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</atitle><jtitle>Journal of chemical information and modeling</jtitle><addtitle>J. Chem. Inf. Model</addtitle><date>2015-01-26</date><risdate>2015</risdate><volume>55</volume><issue>1</issue><spage>63</spage><epage>71</epage><pages>63-71</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. 
To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>25521713</pmid><doi>10.1021/ci500519w</doi><tpages>9</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1549-9596
ispartof Journal of chemical information and modeling, 2015-01, Vol.55 (1), p.63-71
issn 1549-9596
1549-960X
language eng
recordid cdi_proquest_miscellaneous_1652431539
source MEDLINE; ACS Publications
subjects Benchmarking
Binding sites
Computational Biology - methods
Correlation analysis
Databases, Chemical
Ligands
Models, Chemical
Models, Theoretical
Molecular Docking Simulation - methods
Molecules
Proteins
Proteins - chemistry
Proteins - metabolism
Reproducibility of Results
User-Computer Interface
Workflow
title Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T19%3A29%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Target-Specific%20Native/Decoy%20Pose%20Classifier%20Improves%20the%20Accuracy%20of%20Ligand%20Ranking%20in%20the%20CSAR%202013%20Benchmark&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Fourches,%20Denis&rft.date=2015-01-26&rft.volume=55&rft.issue=1&rft.spage=63&rft.epage=71&rft.pages=63-71&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/ci500519w&rft_dat=%3Cproquest_cross%3E3574261961%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1649226836&rft_id=info:pmid/25521713&rfr_iscdi=true
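
The filter-and-rerank workflow summarized in the description field can be illustrated with a minimal sketch. It is not the authors' implementation: the Pose class, its field names, the "lower score is better" convention, the inclusive reading of the 2 Å threshold, and the fallback for ligands whose poses are all predicted as decoys are assumptions made for illustration only. The classifier output is taken as given here; training the random forest on MCT-Tess descriptors is outside the scope of the sketch.

```python
from dataclasses import dataclass

RMSD_THRESHOLD = 2.0  # angstroms, the threshold stated in the abstract


@dataclass
class Pose:
    """Hypothetical container for one docked pose (names are assumptions)."""
    ligand: str
    score: float                # docking score; assumed: lower is better
    predicted_nativelike: bool  # output of the pose classifier


def label_pose(rmsd: float) -> str:
    """Label a training pose by its RMSD to the known native pose.

    Assumption: the threshold is inclusive; the abstract does not say.
    """
    return "nativelike" if rmsd <= RMSD_THRESHOLD else "decoy"


def best_score_per_ligand(poses):
    """Drop poses predicted as decoys, then keep the best score per ligand.

    Assumption: if every pose of a ligand is predicted as a decoy, fall back
    to its best unfiltered score so that the ligand can still be ranked.
    """
    filtered, unfiltered = {}, {}
    for p in poses:
        unfiltered[p.ligand] = min(p.score, unfiltered.get(p.ligand, p.score))
        if p.predicted_nativelike:
            filtered[p.ligand] = min(p.score, filtered.get(p.ligand, p.score))
    return {lig: filtered.get(lig, unfiltered[lig]) for lig in unfiltered}


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the two rank vectors."""
    rx, ry = ranks(x), ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Toy data (invented, not from the paper): ligand A's best raw score
# belongs to a pose that the classifier flags as a decoy.
toy_poses = [
    Pose("A", -9.0, False),
    Pose("A", -7.5, True),
    Pose("B", -8.0, True),
    Pose("C", -6.0, False),
]
toy_scores = best_score_per_ligand(toy_poses)
```

In this toy case, filtering changes the ligand order: without the classifier A would rank first on its decoy pose, whereas after filtering B ranks ahead of A. The resulting per-ligand scores would then be compared with experimental affinities through spearman.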