Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark
As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both sta...
Gespeichert in:
Veröffentlicht in: | Journal of chemical information and modeling 2015-01, Vol.55 (1), p.63-71 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 71 |
---|---|
container_issue | 1 |
container_start_page | 63 |
container_title | Journal of chemical information and modeling |
container_volume | 55 |
creator | Fourches, Denis Politi, Regina Tropsha, Alexander |
description | As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses. |
doi_str_mv | 10.1021/ci500519w |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1652431539</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3574261961</sourcerecordid><originalsourceid>FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</originalsourceid><addsrcrecordid>eNpl0ctKAzEUBuAgiveFLyABEXQxNveZLGu9QlHxAu6GNJ7UaDtTk5lK395oq4iukpCPP-fkILRDyREljHasl4RIqt-X0DqVQmdakcfl773Uag1txPhCCOdasVW0xqRkNKd8HdX3Jgyhye4mYL3zFl-Zxk-hcwK2nuGbOgLujUyM6Q4CvhxPQj2FiJtnwF1r22DsDNcO9_3QVE_41lSvvhpiX32J3l33FjNCOT6Gyj6PTXjdQivOjCJsL9ZN9HB2et-7yPrX55e9bj8zgugmU9QyN9Ayt1wXymkJ0giZF5ALXRRUCMGVEUzkhimSW5drZyiYgXZCs3Tkm-hgnpsKfmshNuXYRwujkamgbmNJlWSCU8l1ont_6EvdhipVl1SKY6rgKqnDubKhjjGAKyfBp45mJSXl5xTKnykku7tIbAdjePqR39-ewP4cGBt_vfYv6AOPFYtU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1649226836</pqid></control><display><type>article</type><title>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</title><source>MEDLINE</source><source>ACS Publications</source><creator>Fourches, Denis ; Politi, Regina ; Tropsha, Alexander</creator><creatorcontrib>Fourches, Denis ; Politi, Regina ; Tropsha, Alexander</creatorcontrib><description>As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/ci500519w</identifier><identifier>PMID: 25521713</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>Benchmarking ; Binding sites ; Computational Biology - methods ; Correlation analysis ; Databases, Chemical ; Ligands ; Models, Chemical ; Models, Theoretical ; Molecular Docking Simulation - methods ; Molecules ; Proteins ; Proteins - chemistry ; Proteins - metabolism ; Reproducibility of Results ; User-Computer Interface ; Workflow</subject><ispartof>Journal of chemical information and modeling, 2015-01, Vol.55 (1), p.63-71</ispartof><rights>Copyright © 2014 American Chemical Society</rights><rights>Copyright American Chemical Society Jan 26, 2015</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</citedby><cites>FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/ci500519w$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/ci500519w$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,778,782,2754,27063,27911,27912,56725,56775</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/25521713$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Fourches, Denis</creatorcontrib><creatorcontrib>Politi, Regina</creatorcontrib><creatorcontrib>Tropsha, Alexander</creatorcontrib><title>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</title><title>Journal of chemical information and modeling</title><addtitle>J. Chem. Inf. Model</addtitle><description>As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses.</description><subject>Benchmarking</subject><subject>Binding sites</subject><subject>Computational Biology - methods</subject><subject>Correlation analysis</subject><subject>Databases, Chemical</subject><subject>Ligands</subject><subject>Models, Chemical</subject><subject>Models, Theoretical</subject><subject>Molecular Docking Simulation - methods</subject><subject>Molecules</subject><subject>Proteins</subject><subject>Proteins - chemistry</subject><subject>Proteins - metabolism</subject><subject>Reproducibility of Results</subject><subject>User-Computer Interface</subject><subject>Workflow</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpl0ctKAzEUBuAgiveFLyABEXQxNveZLGu9QlHxAu6GNJ7UaDtTk5lK395oq4iukpCPP-fkILRDyREljHasl4RIqt-X0DqVQmdakcfl773Uag1txPhCCOdasVW0xqRkNKd8HdX3Jgyhye4mYL3zFl-Zxk-hcwK2nuGbOgLujUyM6Q4CvhxPQj2FiJtnwF1r22DsDNcO9_3QVE_41lSvvhpiX32J3l33FjNCOT6Gyj6PTXjdQivOjCJsL9ZN9HB2et-7yPrX55e9bj8zgugmU9QyN9Ayt1wXymkJ0giZF5ALXRRUCMGVEUzkhimSW5drZyiYgXZCs3Tkm-hgnpsKfmshNuXYRwujkamgbmNJlWSCU8l1ont_6EvdhipVl1SKY6rgKqnDubKhjjGAKyfBp45mJSXl5xTKnykku7tIbAdjePqR39-ewP4cGBt_vfYv6AOPFYtU</recordid><startdate>20150126</startdate><enddate>20150126</enddate><creator>Fourches, Denis</creator><creator>Politi, Regina</creator><creator>Tropsha, Alexander</creator><general>American Chemical Society</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope></search><sort><creationdate>20150126</creationdate><title>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</title><author>Fourches, Denis ; Politi, Regina ; Tropsha, Alexander</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a409t-61c2fb957c3986f95e5a4578e74988144436a4247a2607cf79fa1eab9f492cf73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Benchmarking</topic><topic>Binding sites</topic><topic>Computational Biology - methods</topic><topic>Correlation analysis</topic><topic>Databases, Chemical</topic><topic>Ligands</topic><topic>Models, Chemical</topic><topic>Models, Theoretical</topic><topic>Molecular Docking Simulation - methods</topic><topic>Molecules</topic><topic>Proteins</topic><topic>Proteins - chemistry</topic><topic>Proteins - metabolism</topic><topic>Reproducibility of Results</topic><topic>User-Computer Interface</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fourches, Denis</creatorcontrib><creatorcontrib>Politi, Regina</creatorcontrib><creatorcontrib>Tropsha, Alexander</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fourches, Denis</au><au>Politi, Regina</au><au>Tropsha, Alexander</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark</atitle><jtitle>Journal of chemical information and modeling</jtitle><addtitle>J. Chem. Inf. Model</addtitle><date>2015-01-26</date><risdate>2015</risdate><volume>55</volume><issue>1</issue><spage>63</spage><epage>71</epage><pages>63-71</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>As part of the CSAR 2013 benchmark exercise, we have implemented a hybrid docking and scoring workflow to rank 10 steroid ligands of an engineered digoxigenin-binding protein. Schrödinger’s Glide docking software was used to generate poses for each steroid ligand and rank them according to both standard docking precision (SP) and extra docking precision (XP) scoring functions. The unique component of our approach was the use of a target-specific pose classifier trained to discriminate nativelike from decoy poses. To build the classifier, a single cognate ligand with a known native pose (PDB code 4J8T) was docked multiple times into its target protein, and the generated poses were divided into two classes (nativelike and decoy) using a root-mean-square deviation threshold of 2 Å. All of the poses were characterized by the MCT-Tess descriptors of the protein–ligand interface, and random forest (RF) models were trained to discriminate the two classes of poses on the basis of their descriptors. The consensus pose classifier was then applied to the Glide-generated poses of each CSAR ligand in order to filter out those poses predicted as decoys and rerank the remaining ones using both XP and SP scoring functions. The best-scoring pose for each ligand following this filtering step was used for final ligand ranking. Overall, the ranking accuracy for the 10 ligands evaluated by the Spearman correlation coefficient was 0.64 for SP and 0.52 for XP but reached 0.75 for SP/RF consensus scoring (ranked third in the CSAR 2013 benchmark exercise). This study reconfirms that target-specific pose scoring models are capable of enhancing the reliability of structure-based molecular docking by discarding decoy poses.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>25521713</pmid><doi>10.1021/ci500519w</doi><tpages>9</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1549-9596 |
ispartof | Journal of chemical information and modeling, 2015-01, Vol.55 (1), p.63-71 |
issn | 1549-9596 1549-960X |
language | eng |
recordid | cdi_proquest_miscellaneous_1652431539 |
source | MEDLINE; ACS Publications |
subjects | Benchmarking Binding sites Computational Biology - methods Correlation analysis Databases, Chemical Ligands Models, Chemical Models, Theoretical Molecular Docking Simulation - methods Molecules Proteins Proteins - chemistry Proteins - metabolism Reproducibility of Results User-Computer Interface Workflow |
title | Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T19%3A29%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Target-Specific%20Native/Decoy%20Pose%20Classifier%20Improves%20the%20Accuracy%20of%20Ligand%20Ranking%20in%20the%20CSAR%202013%20Benchmark&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Fourches,%20Denis&rft.date=2015-01-26&rft.volume=55&rft.issue=1&rft.spage=63&rft.epage=71&rft.pages=63-71&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/ci500519w&rft_dat=%3Cproquest_cross%3E3574261961%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1649226836&rft_id=info:pmid/25521713&rfr_iscdi=true |