GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm
In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number...
Gespeichert in:
Veröffentlicht in: | Journal of chemical information and modeling 2010-09, Vol.50 (9), p.1644-1659 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1659 |
---|---|
container_issue | 9 |
container_start_page | 1644 |
container_title | Journal of chemical information and modeling |
container_volume | 50 |
creator | Pfeffer, Patrick Fober, Thomas Hüllermeier, Eyke Klebe, Gerhard |
description | In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 203 members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of a 253 building block was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays. |
doi_str_mv | 10.1021/ci9003305 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_755972416</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2150745531</sourcerecordid><originalsourceid>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</originalsourceid><addsrcrecordid>eNpl0UtLxDAQB_Agiu-DX0CCIOKhmqSPbL2VxV2FBcFV8Fam6WSNtM2apMJ-e7usD9BTBvLjP8MMISecXXEm-LUyOWNxzNItss_TJI_yjL1sf9dpnu2RA-_f1ibPxC7ZE0zmaSblPnHT4nFmFje0oJO-aVa06INtIWBNn6xtqLaOzvvKY6BzbFAFYztqNZ2BWyCdOFi02A1_S1Do6YcBCmuoo6KGZTAfSKfYYTCKFs3COhNe2yOyo6HxePz1HpLnye3T-C6aPUzvx8UsgjjhIYq1HmVcMWBcYSWHQtZcyjrBSo2UgAoqpuOEaahqpkQiAIQEZCKHulIa40NyscldOvveow9la7zCpoEObe9Lmaa5FAnPBnn2R77Z3nXDcGuUJqNM8AFdbpBy1nuHulw604JblZyV6zOUP2cY7OlXYF-1WP_I770P4HwDQPnfZv-DPgFR6I4o</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>755548621</pqid></control><display><type>article</type><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</title><source>MEDLINE</source><source>ACS Journals</source><creator>Pfeffer, Patrick ; Fober, Thomas ; Hüllermeier, Eyke ; Klebe, Gerhard</creator><creatorcontrib>Pfeffer, Patrick ; Fober, Thomas ; Hüllermeier, Eyke ; Klebe, Gerhard</creatorcontrib><description>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 203 members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of a 253 building block was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/ci9003305</identifier><identifier>PMID: 20795677</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>Algorithms ; Amino acids ; Automation ; Chemical compounds ; Enzymes ; Genetic algorithms ; Ligands ; Molecules ; Pharmaceutical Modeling ; Proteins - chemistry</subject><ispartof>Journal of chemical information and modeling, 2010-09, Vol.50 (9), p.1644-1659</ispartof><rights>Copyright © 2010 American Chemical Society</rights><rights>Copyright American Chemical Society Sep 27, 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</citedby><cites>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/ci9003305$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/ci9003305$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,780,784,2763,27075,27923,27924,56737,56787</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20795677$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Pfeffer, Patrick</creatorcontrib><creatorcontrib>Fober, Thomas</creatorcontrib><creatorcontrib>Hüllermeier, Eyke</creatorcontrib><creatorcontrib>Klebe, Gerhard</creatorcontrib><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</title><title>Journal of chemical information and modeling</title><addtitle>J. Chem. Inf. Model</addtitle><description>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 203 members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of a 253 building block was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</description><subject>Algorithms</subject><subject>Amino acids</subject><subject>Automation</subject><subject>Chemical compounds</subject><subject>Enzymes</subject><subject>Genetic algorithms</subject><subject>Ligands</subject><subject>Molecules</subject><subject>Pharmaceutical Modeling</subject><subject>Proteins - chemistry</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpl0UtLxDAQB_Agiu-DX0CCIOKhmqSPbL2VxV2FBcFV8Fam6WSNtM2apMJ-e7usD9BTBvLjP8MMISecXXEm-LUyOWNxzNItss_TJI_yjL1sf9dpnu2RA-_f1ibPxC7ZE0zmaSblPnHT4nFmFje0oJO-aVa06INtIWBNn6xtqLaOzvvKY6BzbFAFYztqNZ2BWyCdOFi02A1_S1Do6YcBCmuoo6KGZTAfSKfYYTCKFs3COhNe2yOyo6HxePz1HpLnye3T-C6aPUzvx8UsgjjhIYq1HmVcMWBcYSWHQtZcyjrBSo2UgAoqpuOEaahqpkQiAIQEZCKHulIa40NyscldOvveow9la7zCpoEObe9Lmaa5FAnPBnn2R77Z3nXDcGuUJqNM8AFdbpBy1nuHulw604JblZyV6zOUP2cY7OlXYF-1WP_I770P4HwDQPnfZv-DPgFR6I4o</recordid><startdate>20100927</startdate><enddate>20100927</enddate><creator>Pfeffer, Patrick</creator><creator>Fober, Thomas</creator><creator>Hüllermeier, Eyke</creator><creator>Klebe, Gerhard</creator><general>American Chemical Society</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope></search><sort><creationdate>20100927</creationdate><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</title><author>Pfeffer, Patrick ; Fober, Thomas ; Hüllermeier, Eyke ; Klebe, Gerhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithms</topic><topic>Amino acids</topic><topic>Automation</topic><topic>Chemical compounds</topic><topic>Enzymes</topic><topic>Genetic algorithms</topic><topic>Ligands</topic><topic>Molecules</topic><topic>Pharmaceutical Modeling</topic><topic>Proteins - chemistry</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pfeffer, Patrick</creatorcontrib><creatorcontrib>Fober, Thomas</creatorcontrib><creatorcontrib>Hüllermeier, Eyke</creatorcontrib><creatorcontrib>Klebe, Gerhard</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pfeffer, Patrick</au><au>Fober, Thomas</au><au>Hüllermeier, Eyke</au><au>Klebe, Gerhard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</atitle><jtitle>Journal of chemical information and modeling</jtitle><addtitle>J. Chem. Inf. Model</addtitle><date>2010-09-27</date><risdate>2010</risdate><volume>50</volume><issue>9</issue><spage>1644</spage><epage>1659</epage><pages>1644-1659</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 203 members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of a 253 building block was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>20795677</pmid><doi>10.1021/ci9003305</doi><tpages>16</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1549-9596 |
ispartof | Journal of chemical information and modeling, 2010-09, Vol.50 (9), p.1644-1659 |
issn | 1549-9596 1549-960X |
language | eng |
recordid | cdi_proquest_miscellaneous_755972416 |
source | MEDLINE; ACS Journals |
subjects | Algorithms Amino acids Automation Chemical compounds Enzymes Genetic algorithms Ligands Molecules Pharmaceutical Modeling Proteins - chemistry |
title | GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T15%3A15%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=GARLig:%20A%20Fully%20Automated%20Tool%20for%20Subset%20Selection%20of%20Large%20Fragment%20Spaces%20via%20a%20Self-Adaptive%20Genetic%20Algorithm&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Pfeffer,%20Patrick&rft.date=2010-09-27&rft.volume=50&rft.issue=9&rft.spage=1644&rft.epage=1659&rft.pages=1644-1659&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/ci9003305&rft_dat=%3Cproquest_cross%3E2150745531%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=755548621&rft_id=info:pmid/20795677&rfr_iscdi=true |