GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm

In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number...

Detailed description

Bibliographic details
Published in:	Journal of chemical information and modeling 2010-09, Vol.50 (9), p.1644-1659
Main authors:	Pfeffer, Patrick; Fober, Thomas; Hüllermeier, Eyke; Klebe, Gerhard
Format:	Article
Language:	eng
Subjects:	Algorithms; Amino acids; Automation; Chemical compounds; Enzymes; Genetic algorithms; Ligands; Molecules; Pharmaceutical Modeling; Proteins - chemistry
Online access:	Full text

container_end_page	1659
container_issue	9
container_start_page	1644
container_title	Journal of chemical information and modeling
container_volume	50
creator	Pfeffer, Patrick; Fober, Thomas; Hüllermeier, Eyke; Klebe, Gerhard
description	In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.
doi_str_mv	10.1021/ci9003305
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_755972416</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2150745531</sourcerecordid><originalsourceid>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</originalsourceid><addsrcrecordid>eNpl0UtLxDAQB_Agiu-DX0CCIOKhmqSPbL2VxV2FBcFV8Fam6WSNtM2apMJ-e7usD9BTBvLjP8MMISecXXEm-LUyOWNxzNItss_TJI_yjL1sf9dpnu2RA-_f1ibPxC7ZE0zmaSblPnHT4nFmFje0oJO-aVa06INtIWBNn6xtqLaOzvvKY6BzbFAFYztqNZ2BWyCdOFi02A1_S1Do6YcBCmuoo6KGZTAfSKfYYTCKFs3COhNe2yOyo6HxePz1HpLnye3T-C6aPUzvx8UsgjjhIYq1HmVcMWBcYSWHQtZcyjrBSo2UgAoqpuOEaahqpkQiAIQEZCKHulIa40NyscldOvveow9la7zCpoEObe9Lmaa5FAnPBnn2R77Z3nXDcGuUJqNM8AFdbpBy1nuHulw604JblZyV6zOUP2cY7OlXYF-1WP_I770P4HwDQPnfZv-DPgFR6I4o</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>755548621</pqid></control><display><type>article</type><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</title><source>MEDLINE</source><source>ACS Journals</source><creator>Pfeffer, Patrick ; Fober, Thomas ; Hüllermeier, Eyke ; Klebe, Gerhard</creator><creatorcontrib>Pfeffer, Patrick ; Fober, Thomas ; Hüllermeier, Eyke ; Klebe, Gerhard</creatorcontrib><description>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. 
GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. 
Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/ci9003305</identifier><identifier>PMID: 20795677</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>Algorithms ; Amino acids ; Automation ; Chemical compounds ; Enzymes ; Genetic algorithms ; Ligands ; Molecules ; Pharmaceutical Modeling ; Proteins - chemistry</subject><ispartof>Journal of chemical information and modeling, 2010-09, Vol.50 (9), p.1644-1659</ispartof><rights>Copyright © 2010 American Chemical Society</rights><rights>Copyright American Chemical Society Sep 27, 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</citedby><cites>FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/ci9003305$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/ci9003305$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,780,784,2763,27075,27923,27924,56737,56787</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20795677$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Pfeffer, Patrick</creatorcontrib><creatorcontrib>Fober, Thomas</creatorcontrib><creatorcontrib>Hüllermeier, Eyke</creatorcontrib><creatorcontrib>Klebe, Gerhard</creatorcontrib><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a 
Self-Adaptive Genetic Algorithm</title><title>Journal of chemical information and modeling</title><addtitle>J. Chem. Inf. Model</addtitle><description>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. 
Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</description><subject>Algorithms</subject><subject>Amino acids</subject><subject>Automation</subject><subject>Chemical compounds</subject><subject>Enzymes</subject><subject>Genetic algorithms</subject><subject>Ligands</subject><subject>Molecules</subject><subject>Pharmaceutical Modeling</subject><subject>Proteins - chemistry</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpl0UtLxDAQB_Agiu-DX0CCIOKhmqSPbL2VxV2FBcFV8Fam6WSNtM2apMJ-e7usD9BTBvLjP8MMISecXXEm-LUyOWNxzNItss_TJI_yjL1sf9dpnu2RA-_f1ibPxC7ZE0zmaSblPnHT4nFmFje0oJO-aVa06INtIWBNn6xtqLaOzvvKY6BzbFAFYztqNZ2BWyCdOFi02A1_S1Do6YcBCmuoo6KGZTAfSKfYYTCKFs3COhNe2yOyo6HxePz1HpLnye3T-C6aPUzvx8UsgjjhIYq1HmVcMWBcYSWHQtZcyjrBSo2UgAoqpuOEaahqpkQiAIQEZCKHulIa40NyscldOvveow9la7zCpoEObe9Lmaa5FAnPBnn2R77Z3nXDcGuUJqNM8AFdbpBy1nuHulw604JblZyV6zOUP2cY7OlXYF-1WP_I770P4HwDQPnfZv-DPgFR6I4o</recordid><startdate>20100927</startdate><enddate>20100927</enddate><creator>Pfeffer, Patrick</creator><creator>Fober, Thomas</creator><creator>Hüllermeier, Eyke</creator><creator>Klebe, Gerhard</creator><general>American Chemical Society</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope></search><sort><creationdate>20100927</creationdate><title>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</title><author>Pfeffer, Patrick ; Fober, Thomas ; 
Hüllermeier, Eyke ; Klebe, Gerhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a341t-3ff861c0a01ceb7c0a7d177d4ebc8c2abab0f340fabd0c242aa27ae029adbcfe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithms</topic><topic>Amino acids</topic><topic>Automation</topic><topic>Chemical compounds</topic><topic>Enzymes</topic><topic>Genetic algorithms</topic><topic>Ligands</topic><topic>Molecules</topic><topic>Pharmaceutical Modeling</topic><topic>Proteins - chemistry</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pfeffer, Patrick</creatorcontrib><creatorcontrib>Fober, Thomas</creatorcontrib><creatorcontrib>Hüllermeier, Eyke</creatorcontrib><creatorcontrib>Klebe, Gerhard</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pfeffer, Patrick</au><au>Fober, Thomas</au><au>Hüllermeier, 
Eyke</au><au>Klebe, Gerhard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm</atitle><jtitle>Journal of chemical information and modeling</jtitle><addtitle>J. Chem. Inf. Model</addtitle><date>2010-09-27</date><risdate>2010</risdate><volume>50</volume><issue>9</issue><spage>1644</spage><epage>1659</epage><pages>1644-1659</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>In combinatorial chemistry, molecules are assembled according to combinatorial principles by linking suitable reagents or decorating a given scaffold with appropriate substituents from a large chemical space of starting materials. Often the number of possible combinations greatly exceeds the number feasible to handle by an in-depth in silico approach or even more if it should be experimentally synthesized. Therefore, powerful tools to efficiently enumerate large chemical spaces are required. They can be provided by genetic algorithms, which mimic Darwinian evolution. GARLig (genetic algorithm using reagents to compose ligands) has been developed to perform subset selection in large chemical compound spaces subject to target-specific 3D-scoring criteria. GARLig uses different scoring schemes, such as AutoDock4 Score, GOLDScore, and DrugScoreCSD, as fitness functions. Its genetic parameters have been optimized to characterize combinatorial libraries with respect to the binding to various targets of pharmaceutical interest. A large tripeptide library of 20³ members has been used to profile amino acid frequencies in putative substrates for trypsin, thrombin, factor Xa, and plasmin. A peptidomimetic scaffold assembled from a selection of 25³ building blocks was used to test the performance of the evolutionary algorithm in suggesting potent inhibitors of the enzyme cathepsin D. 
In a final case study, our program was used to characterize and rank a combinatorial drug-like library comprising 33 750 potential thrombin inhibitors. These case studies demonstrate that GARLig finds experimentally confirmed potent leads by processing a significantly smaller subset of the fully enumerated combinatorial library. Furthermore, the profiles of amino acids computed by the genetic algorithm match the observed amino acid frequencies found by screening peptide libraries in substrate cleavage assays.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>20795677</pmid><doi>10.1021/ci9003305</doi><tpages>16</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1549-9596
ispartof	Journal of chemical information and modeling, 2010-09, Vol.50 (9), p.1644-1659
issn	1549-9596 1549-960X
language	eng
recordid	cdi_proquest_miscellaneous_755972416
source	MEDLINE; ACS Journals
subjects	Algorithms; Amino acids; Automation; Chemical compounds; Enzymes; Genetic algorithms; Ligands; Molecules; Pharmaceutical Modeling; Proteins - chemistry
title	GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T15%3A15%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=GARLig:%20A%20Fully%20Automated%20Tool%20for%20Subset%20Selection%20of%20Large%20Fragment%20Spaces%20via%20a%20Self-Adaptive%20Genetic%20Algorithm&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Pfeffer,%20Patrick&rft.date=2010-09-27&rft.volume=50&rft.issue=9&rft.spage=1644&rft.epage=1659&rft.pages=1644-1659&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/ci9003305&rft_dat=%3Cproquest_cross%3E2150745531%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=755548621&rft_id=info:pmid/20795677&rfr_iscdi=true