GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm

In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number...

Bibliographic details
Published in: Journal of chemical information and modeling 2010-09, Vol.50 (9), p.1644-1659
Main authors: Pfeffer, Patrick; Fober, Thomas; Hüllermeier, Eyke; Klebe, Gerhard
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 1659
container_issue 9
container_start_page 1644
container_title Journal of chemical information and modeling
container_volume 50
creator Pfeffer, Patrick
Fober, Thomas
Hüllermeier, Eyke
Klebe, Gerhard
description In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.
doi_str_mv 10.1021/ci9003305
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_755972416</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2150745531</sourcerecordid><originalsourceid>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</originalsourceid><addsrcrecordid>eNpl0UtLxDAQB_Agiu-DX0CCIOKhmqSPbL2VxV2FBcFV8Fam6WSNtM2apMJ-e7usD9BTBvLjP8MMISecXXEm-LUyOWNxzNItss_TJI_yjL1sf9dpnu2RA-_f1ibPxC7ZE0zmaSblPnHT4nFmFje0oJO-aVa06INtIWBNn6xtqLaOzvvKY6BzbFAFYztqNZ2BWyCdOFi02A1_S1Do6YcBCmuoo6KGZTAfSKfYYTCKFs3COhNe2yOyo6HxePz1HpLnye3T-C6aPUzvx8UsgjjhIYq1HmVcMWBcYSWHQtZcyjrBSo2UgAoqpuOEaahqpkQiAIQEZCKHulIa40NyscldOvveow9la7zCpoEObe9Lmaa5FAnPBnn2R77Z3nXDcGuUJqNM8AFdbpBy1nuHulw604JblZyV6zOUP2cY7OlXYF-1WP_I770P4HwDQPnfZv-DPgFR6I4o</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>755548621</pqid></control><display><type>article</type><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</title><source>MEDLINE</source><source>ACS Journals</source><creator>Pfeffer, Patrick ; Fober, Thomas ; Hüllermeier, Eyke ; Klebe, Gerhard</creator><creatorcontrib>Pfeffer, Patrick ; Fober, Thomas ; Hüllermeier, Eyke ; Klebe, Gerhard</creatorcontrib><description>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. 
GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. 
Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/ci9003305</identifier><identifier>PMID: 20795677</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>Algorithms ; Amino acids ; Automation ; Chemical compounds ; Enzymes ; Genetic algorithms ; Ligands ; Molecules ; Pharmaceutical Modeling ; Proteins - chemistry</subject><ispartof>Journal of chemical information and modeling, 2010-09, Vol.50 (9), p.1644-1659</ispartof><rights>Copyright © 2010 American Chemical Society</rights><rights>Copyright American Chemical Society Sep 27, 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</citedby><cites>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/ci9003305$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/ci9003305$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,780,784,2763,27075,27923,27924,56737,56787</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20795677$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Pfeffer, Patrick</creatorcontrib><creatorcontrib>Fober, Thomas</creatorcontrib><creatorcontrib>Hüllermeier, Eyke</creatorcontrib><creatorcontrib>Klebe, Gerhard</creatorcontrib><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a 
Self-Adaptive Genetic Algorithm</title><title>Journal of chemical information and modeling</title><addtitle>J. Chem. Inf. Model</addtitle><description>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. 
Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</description><subject>Algorithms</subject><subject>Amino acids</subject><subject>Automation</subject><subject>Chemical compounds</subject><subject>Enzymes</subject><subject>Genetic algorithms</subject><subject>Ligands</subject><subject>Molecules</subject><subject>Pharmaceutical Modeling</subject><subject>Proteins - chemistry</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpl0UtLxDAQB_Agiu-DX0CCIOKhmqSPbL2VxV2FBcFV8Fam6WSNtM2apMJ-e7usD9BTBvLjP8MMISecXXEm-LUyOWNxzNItss_TJI_yjL1sf9dpnu2RA-_f1ibPxC7ZE0zmaSblPnHT4nFmFje0oJO-aVa06INtIWBNn6xtqLaOzvvKY6BzbFAFYztqNZ2BWyCdOFi02A1_S1Do6YcBCmuoo6KGZTAfSKfYYTCKFs3COhNe2yOyo6HxePz1HpLnye3T-C6aPUzvx8UsgjjhIYq1HmVcMWBcYSWHQtZcyjrBSo2UgAoqpuOEaahqpkQiAIQEZCKHulIa40NyscldOvveow9la7zCpoEObe9Lmaa5FAnPBnn2R77Z3nXDcGuUJqNM8AFdbpBy1nuHulw604JblZyV6zOUP2cY7OlXYF-1WP_I770P4HwDQPnfZv-DPgFR6I4o</recordid><startdate>20100927</startdate><enddate>20100927</enddate><creator>Pfeffer, Patrick</creator><creator>Fober, Thomas</creator><creator>Hüllermeier, Eyke</creator><creator>Klebe, Gerhard</creator><general>American Chemical Society</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope></search><sort><creationdate>20100927</creationdate><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</title><author>Pfeffer, Patrick ; Fober, Thomas ; 
Hüllermeier, Eyke ; Klebe, Gerhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithms</topic><topic>Amino acids</topic><topic>Automation</topic><topic>Chemical compounds</topic><topic>Enzymes</topic><topic>Genetic algorithms</topic><topic>Ligands</topic><topic>Molecules</topic><topic>Pharmaceutical Modeling</topic><topic>Proteins - chemistry</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pfeffer, Patrick</creatorcontrib><creatorcontrib>Fober, Thomas</creatorcontrib><creatorcontrib>Hüllermeier, Eyke</creatorcontrib><creatorcontrib>Klebe, Gerhard</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pfeffer, Patrick</au><au>Fober, Thomas</au><au>Hüllermeier, 
Eyke</au><au>Klebe, Gerhard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</atitle><jtitle>Journal of chemical information and modeling</jtitle><addtitle>J. Chem. Inf. Model</addtitle><date>2010-09-27</date><risdate>2010</risdate><volume>50</volume><issue>9</issue><spage>1644</spage><epage>1659</epage><pages>1644-1659</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. 
In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>20795677</pmid><doi>10.1021/ci9003305</doi><tpages>16</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1549-9596
ispartof Journal of chemical information and modeling, 2010-09, Vol.50 (9), p.1644-1659
issn 1549-9596
1549-960X
language eng
recordid cdi_proquest_miscellaneous_755972416
source MEDLINE; ACS Journals
subjects Algorithms
Amino acids
Automation
Chemical compounds
Enzymes
Genetic algorithms
Ligands
Molecules
Pharmaceutical Modeling
Proteins - chemistry
title GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T15%3A15%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=GARLig:%20A%20Fully%20Automated%20Tool%20for%20Subset%20Selection%20of%20Large%20Fragment%20Spaces%20via%20a%20Self-Adaptive%20Genetic%20Algorithm&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Pfeffer,%20Patrick&rft.date=2010-09-27&rft.volume=50&rft.issue=9&rft.spage=1644&rft.epage=1659&rft.pages=1644-1659&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/ci9003305&rft_dat=%3Cproquest_cross%3E2150745531%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=755548621&rft_id=info:pmid/20795677&rfr_iscdi=true
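
The abstract describes a genetic algorithm that scores only a subset of a combinatorial library instead of enumerating it in full. The following is a minimal sketch of that general idea, not GARLig's actual implementation: `toy_score` is a hypothetical stand-in for a docking score such as those named in the abstract, and the operators (uniform crossover, tournament selection, a per-individual mutation rate perturbed log-normally as one common form of self-adaptation) and all parameter values are assumptions chosen for illustration.

```python
import math
import random


def toy_score(combo):
    # Hypothetical stand-in for a docking score (lower is better).
    # A real fitness function would dock the assembled ligand.
    target = (3, 7, 11)
    return sum((a - b) ** 2 for a, b in zip(combo, target))


def run_ga(n_blocks=20, n_positions=3, pop_size=30, generations=40, seed=0):
    rng = random.Random(seed)
    cache = {}  # each distinct combination is scored only once

    def fitness(combo):
        if combo not in cache:
            cache[combo] = toy_score(combo)
        return cache[combo]

    def random_individual():
        combo = tuple(rng.randrange(n_blocks) for _ in range(n_positions))
        return combo, 0.3  # (building-block indices, own mutation rate)

    def crossover(a, b):
        # Uniform crossover on the building blocks; average the rates.
        combo = tuple(rng.choice(pair) for pair in zip(a[0], b[0]))
        return combo, (a[1] + b[1]) / 2

    def mutate(ind):
        combo, rate = ind
        # Self-adaptation: the mutation rate itself is perturbed and inherited.
        rate = min(0.9, max(0.05, rate * math.exp(0.3 * rng.gauss(0, 1))))
        combo = tuple(
            rng.randrange(n_blocks) if rng.random() < rate else gene
            for gene in combo
        )
        return combo, rate

    def tournament(pop):
        return min(rng.sample(pop, 3), key=lambda ind: fitness(ind[0]))

    pop = [random_individual() for _ in range(pop_size)]
    for _ in range(generations):
        elite = min(pop, key=lambda ind: fitness(ind[0]))  # elitism
        children = [
            mutate(crossover(tournament(pop), tournament(pop)))
            for _ in range(pop_size - 1)
        ]
        pop = [elite] + children

    best = min(pop, key=lambda ind: fitness(ind[0]))
    return best[0], fitness(best[0]), len(cache), n_blocks ** n_positions


if __name__ == "__main__":
    best, score, evaluated, library_size = run_ga()
    print(f"best combination {best} with score {score}")
    print(f"scored {evaluated} of {library_size} library members")
```

With the default settings the full library has 20³ = 8000 members, while the run can score at most 30 × 41 = 1230 distinct combinations, which illustrates why such a search processes only a fraction of the enumerated space.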