SyGMa: Combining Expert Knowledge and Empirical Scoring in the Prediction of Metabolites
Predictions of potential metabolites based on chemical structure are becoming increasingly important in drug discovery to guide medicinal chemistry efforts that address metabolic issues and to support experimental metabolite screening and identification. Herein we present a novel rule‐based method,...
Gespeichert in:
Veröffentlicht in: | ChemMedChem 2008-05, Vol.3 (5), p.821-832 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 832 |
---|---|
container_issue | 5 |
container_start_page | 821 |
container_title | ChemMedChem |
container_volume | 3 |
creator | Ridder, Lars Wagener, Markus |
description | Predictions of potential metabolites based on chemical structure are becoming increasingly important in drug discovery to guide medicinal chemistry efforts that address metabolic issues and to support experimental metabolite screening and identification. Herein we present a novel rule‐based method, SyGMa (Systematic Generation of potential Metabolites), to predict the potential metabolites of a given parent structure. A set of reaction rules covering a broad range of phase 1 and phase 2 metabolism has been derived from metabolic reactions reported in the Metabolite Database to occur in humans. An empirical probability score is assigned to each rule representing the fraction of correctly predicted metabolites in the training database. This score is used to refine the rules and to rank predicted metabolites. The current rule set of SyGMa covers approximately 70 % of biotransformation reactions observed in humans. Evaluation of the rule‐based predictions demonstrated a significant enrichment of true metabolites in the top of the ranking list: while in total, 68 % of all observed metabolites in an independent test set were reproduced by SyGMa, a large part, 30 % of the observed metabolites, were identified among the top three predictions. From a subset of cytochrome P450 specific metabolites, 84 % were reproduced overall, with 66 % in the top three predicted phase 1 metabolites. A similarity analysis of the reactions present in the database was performed to obtain an overview of the metabolic reactions predicted by SyGMa and to support ongoing efforts to extend the rules. Specific examples demonstrate the use of SyGMa in experimental metabolite identification and the application of SyGMa to suggest chemical modifications that improve the metabolic stability of compounds.
A dataset of 6187 metabolic reactions reported to occur in man has been used to develop a rule‐based method that systematically predicts and ranks potential metabolites of a given parent compound. The graphic shows a projection of the training set with various types of correctly predicted metabolic reactions represented by different colors. |
doi_str_mv | 10.1002/cmdc.200700312 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_70739188</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>70739188</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3812-1a1e682b40353f1f0a263ad9bc89751b3694ed63d498d638631d5e0739ff78463</originalsourceid><addsrcrecordid>eNqFkLFz0zAUh3UcPVoCa0dOE5tTPcu2ZDbOSUNpAxwB2k0nS89FrW2lknNN_nuSSy6wMb03fL9v-Ag5BzYGxtIL01kzThkTjHFIX5AzkAVLBEjx8viL8pS8jvGBsSyTIF-RU5AcQGT5GblbbGZz_YFWvqtd7_p7Ol0vMQz0uvfPLdp7pLq3dNotXXBGt3RhfNhhrqfDb6TfAlpnBud76hs6x0HXvnUDxjfkpNFtxLeHOyI_L6c_qk_JzdfZVfXxJjFcQpqABixkWmeM57yBhum04NqWtZGlyKHmRZmhLbjNSrk9suBgc2SCl00jZFbwEXm_9y6Df1phHFTnosG21T36VVRix4KUW3C8B03wMQZs1DK4ToeNAqZ2LdWupTq23A7eHcyrukP7Fz_E2wLlHnh2LW7-o1PVfFL9K0_2WxcHXB-3OjyqQnCRq9svM3UHOfv-efFLTfgfzaGONA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>70739188</pqid></control><display><type>article</type><title>SyGMa: Combining Expert Knowledge and Empirical Scoring in the Prediction of Metabolites</title><source>MEDLINE</source><source>Wiley Online Library All Journals</source><creator>Ridder, Lars ; Wagener, Markus</creator><creatorcontrib>Ridder, Lars ; Wagener, Markus</creatorcontrib><description>Predictions of potential metabolites based on chemical structure are becoming increasingly important in drug discovery to guide medicinal chemistry efforts that address metabolic issues and to support experimental metabolite screening and identification. Herein we present a novel rule‐based method, SyGMa (Systematic Generation of potential Metabolites), to predict the potential metabolites of a given parent structure. A set of reaction rules covering a broad range of phase 1 and phase 2 metabolism has been derived from metabolic reactions reported in the Metabolite Database to occur in humans. An empirical probability score is assigned to each rule representing the fraction of correctly predicted metabolites in the training database. This score is used to refine the rules and to rank predicted metabolites. The current rule set of SyGMa covers approximately 70 % of biotransformation reactions observed in humans. Evaluation of the rule‐based predictions demonstrated a significant enrichment of true metabolites in the top of the ranking list: while in total, 68 % of all observed metabolites in an independent test set were reproduced by SyGMa, a large part, 30 % of the observed metabolites, were identified among the top three predictions. From a subset of cytochrome P450 specific metabolites, 84 % were reproduced overall, with 66 % in the top three predicted phase 1 metabolites. A similarity analysis of the reactions present in the database was performed to obtain an overview of the metabolic reactions predicted by SyGMa and to support ongoing efforts to extend the rules. Specific examples demonstrate the use of SyGMa in experimental metabolite identification and the application of SyGMa to suggest chemical modifications that improve the metabolic stability of compounds.
A dataset of 6187 metabolic reactions reported to occur in man has been used to develop a rule‐based method that systematically predicts and ranks potential metabolites of a given parent compound. The graphic shows a projection of the training set with various types of correctly predicted metabolic reactions represented by different colors.</description><identifier>ISSN: 1860-7179</identifier><identifier>EISSN: 1860-7187</identifier><identifier>DOI: 10.1002/cmdc.200700312</identifier><identifier>PMID: 18311745</identifier><language>eng</language><publisher>Weinheim: WILEY-VCH Verlag</publisher><subject>Animals ; Chemistry, Pharmaceutical ; computer chemistry ; Cytochrome P-450 Enzyme System - physiology ; Databases as Topic ; Drug Design ; Humans ; Metabolism ; metabolite prediction ; Probability ; Rats ; reaction fingerprints ; Software ; Species Specificity ; Structure-Activity Relationship</subject><ispartof>ChemMedChem, 2008-05, Vol.3 (5), p.821-832</ispartof><rights>Copyright © 2008 WILEY‐VCH Verlag GmbH & Co. KGaA, Weinheim</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3812-1a1e682b40353f1f0a263ad9bc89751b3694ed63d498d638631d5e0739ff78463</citedby><cites>FETCH-LOGICAL-c3812-1a1e682b40353f1f0a263ad9bc89751b3694ed63d498d638631d5e0739ff78463</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fcmdc.200700312$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fcmdc.200700312$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1417,27923,27924,45573,45574</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/18311745$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ridder, Lars</creatorcontrib><creatorcontrib>Wagener, Markus</creatorcontrib><title>SyGMa: Combining Expert Knowledge and Empirical Scoring in the Prediction of Metabolites</title><title>ChemMedChem</title><addtitle>ChemMedChem</addtitle><description>Predictions of potential metabolites based on chemical structure are becoming increasingly important in drug discovery to guide medicinal chemistry efforts that address metabolic issues and to support experimental metabolite screening and identification. Herein we present a novel rule‐based method, SyGMa (Systematic Generation of potential Metabolites), to predict the potential metabolites of a given parent structure. A set of reaction rules covering a broad range of phase 1 and phase 2 metabolism has been derived from metabolic reactions reported in the Metabolite Database to occur in humans. An empirical probability score is assigned to each rule representing the fraction of correctly predicted metabolites in the training database. This score is used to refine the rules and to rank predicted metabolites. The current rule set of SyGMa covers approximately 70 % of biotransformation reactions observed in humans. Evaluation of the rule‐based predictions demonstrated a significant enrichment of true metabolites in the top of the ranking list: while in total, 68 % of all observed metabolites in an independent test set were reproduced by SyGMa, a large part, 30 % of the observed metabolites, were identified among the top three predictions. From a subset of cytochrome P450 specific metabolites, 84 % were reproduced overall, with 66 % in the top three predicted phase 1 metabolites. A similarity analysis of the reactions present in the database was performed to obtain an overview of the metabolic reactions predicted by SyGMa and to support ongoing efforts to extend the rules. Specific examples demonstrate the use of SyGMa in experimental metabolite identification and the application of SyGMa to suggest chemical modifications that improve the metabolic stability of compounds.
A dataset of 6187 metabolic reactions reported to occur in man has been used to develop a rule‐based method that systematically predicts and ranks potential metabolites of a given parent compound. The graphic shows a projection of the training set with various types of correctly predicted metabolic reactions represented by different colors.</description><subject>Animals</subject><subject>Chemistry, Pharmaceutical</subject><subject>computer chemistry</subject><subject>Cytochrome P-450 Enzyme System - physiology</subject><subject>Databases as Topic</subject><subject>Drug Design</subject><subject>Humans</subject><subject>Metabolism</subject><subject>metabolite prediction</subject><subject>Probability</subject><subject>Rats</subject><subject>reaction fingerprints</subject><subject>Software</subject><subject>Species Specificity</subject><subject>Structure-Activity Relationship</subject><issn>1860-7179</issn><issn>1860-7187</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2008</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkLFz0zAUh3UcPVoCa0dOE5tTPcu2ZDbOSUNpAxwB2k0nS89FrW2lknNN_nuSSy6wMb03fL9v-Ag5BzYGxtIL01kzThkTjHFIX5AzkAVLBEjx8viL8pS8jvGBsSyTIF-RU5AcQGT5GblbbGZz_YFWvqtd7_p7Ol0vMQz0uvfPLdp7pLq3dNotXXBGt3RhfNhhrqfDb6TfAlpnBud76hs6x0HXvnUDxjfkpNFtxLeHOyI_L6c_qk_JzdfZVfXxJjFcQpqABixkWmeM57yBhum04NqWtZGlyKHmRZmhLbjNSrk9suBgc2SCl00jZFbwEXm_9y6Df1phHFTnosG21T36VVRix4KUW3C8B03wMQZs1DK4ToeNAqZ2LdWupTq23A7eHcyrukP7Fz_E2wLlHnh2LW7-o1PVfFL9K0_2WxcHXB-3OjyqQnCRq9svM3UHOfv-efFLTfgfzaGONA</recordid><startdate>20080519</startdate><enddate>20080519</enddate><creator>Ridder, Lars</creator><creator>Wagener, Markus</creator><general>WILEY-VCH Verlag</general><general>WILEY‐VCH Verlag</general><scope>BSCLL</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20080519</creationdate><title>SyGMa: Combining Expert Knowledge and Empirical Scoring in the Prediction of Metabolites</title><author>Ridder, Lars ; Wagener, Markus</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3812-1a1e682b40353f1f0a263ad9bc89751b3694ed63d498d638631d5e0739ff78463</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Animals</topic><topic>Chemistry, Pharmaceutical</topic><topic>computer chemistry</topic><topic>Cytochrome P-450 Enzyme System - physiology</topic><topic>Databases as Topic</topic><topic>Drug Design</topic><topic>Humans</topic><topic>Metabolism</topic><topic>metabolite prediction</topic><topic>Probability</topic><topic>Rats</topic><topic>reaction fingerprints</topic><topic>Software</topic><topic>Species Specificity</topic><topic>Structure-Activity Relationship</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ridder, Lars</creatorcontrib><creatorcontrib>Wagener, Markus</creatorcontrib><collection>Istex</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>ChemMedChem</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ridder, Lars</au><au>Wagener, Markus</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SyGMa: Combining Expert Knowledge and Empirical Scoring in the Prediction of Metabolites</atitle><jtitle>ChemMedChem</jtitle><addtitle>ChemMedChem</addtitle><date>2008-05-19</date><risdate>2008</risdate><volume>3</volume><issue>5</issue><spage>821</spage><epage>832</epage><pages>821-832</pages><issn>1860-7179</issn><eissn>1860-7187</eissn><abstract>Predictions of potential metabolites based on chemical structure are becoming increasingly important in drug discovery to guide medicinal chemistry efforts that address metabolic issues and to support experimental metabolite screening and identification. Herein we present a novel rule‐based method, SyGMa (Systematic Generation of potential Metabolites), to predict the potential metabolites of a given parent structure. A set of reaction rules covering a broad range of phase 1 and phase 2 metabolism has been derived from metabolic reactions reported in the Metabolite Database to occur in humans. An empirical probability score is assigned to each rule representing the fraction of correctly predicted metabolites in the training database. This score is used to refine the rules and to rank predicted metabolites. The current rule set of SyGMa covers approximately 70 % of biotransformation reactions observed in humans. Evaluation of the rule‐based predictions demonstrated a significant enrichment of true metabolites in the top of the ranking list: while in total, 68 % of all observed metabolites in an independent test set were reproduced by SyGMa, a large part, 30 % of the observed metabolites, were identified among the top three predictions. From a subset of cytochrome P450 specific metabolites, 84 % were reproduced overall, with 66 % in the top three predicted phase 1 metabolites. A similarity analysis of the reactions present in the database was performed to obtain an overview of the metabolic reactions predicted by SyGMa and to support ongoing efforts to extend the rules. Specific examples demonstrate the use of SyGMa in experimental metabolite identification and the application of SyGMa to suggest chemical modifications that improve the metabolic stability of compounds.
A dataset of 6187 metabolic reactions reported to occur in man has been used to develop a rule‐based method that systematically predicts and ranks potential metabolites of a given parent compound. The graphic shows a projection of the training set with various types of correctly predicted metabolic reactions represented by different colors.</abstract><cop>Weinheim</cop><pub>WILEY-VCH Verlag</pub><pmid>18311745</pmid><doi>10.1002/cmdc.200700312</doi><tpages>12</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1860-7179 |
ispartof | ChemMedChem, 2008-05, Vol.3 (5), p.821-832 |
issn | 1860-7179 1860-7187 |
language | eng |
recordid | cdi_proquest_miscellaneous_70739188 |
source | MEDLINE; Wiley Online Library All Journals |
subjects | Animals Chemistry, Pharmaceutical computer chemistry Cytochrome P-450 Enzyme System - physiology Databases as Topic Drug Design Humans Metabolism metabolite prediction Probability Rats reaction fingerprints Software Species Specificity Structure-Activity Relationship |
title | SyGMa: Combining Expert Knowledge and Empirical Scoring in the Prediction of Metabolites |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T22%3A51%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SyGMa:%20Combining%20Expert%20Knowledge%20and%20Empirical%20Scoring%20in%20the%20Prediction%20of%20Metabolites&rft.jtitle=ChemMedChem&rft.au=Ridder,%20Lars&rft.date=2008-05-19&rft.volume=3&rft.issue=5&rft.spage=821&rft.epage=832&rft.pages=821-832&rft.issn=1860-7179&rft.eissn=1860-7187&rft_id=info:doi/10.1002/cmdc.200700312&rft_dat=%3Cproquest_cross%3E70739188%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=70739188&rft_id=info:pmid/18311745&rfr_iscdi=true |