Similarity searching in large combinatorial chemistry spaces

We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of computer-aided molecular design 2001-06, Vol.15 (6), p.497-520
Hauptverfasser: Rarey, M, Stahl, M
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 520
container_issue 6
container_start_page 497
container_title Journal of computer-aided molecular design
container_volume 15
creator Rarey, M
Stahl, M
description We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.
doi_str_mv 10.1023/A:1011144622059
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_71067979</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>71067979</sourcerecordid><originalsourceid>FETCH-LOGICAL-c305t-658dd84d458bfca50fd3671f6c124a6ec73b33a590b86c081a38c86a295934733</originalsourceid><addsrcrecordid>eNpdkEtLAzEUhYMotlbX7mRw4W705p2Im1J8QcGFCu6GTCbTpszLZGbRf2_AunF14fCdw8dF6BLDLQZC75b3GDDGjAlCgOsjNMdc0pxpjo_RHDSBXHD2NUNnMe4AQGoBp2iWGpoTQufo4d23vjHBj_ssOhPs1nebzHdZyjYus31b-s6MffCmyezWtT6OIaGDsS6eo5PaNNFdHO4CfT49fqxe8vXb8-tquc4tBT4mA1VVilWMq7K2hkNdUSFxLSwmzAhnJS0pNVxDqYQFhQ1VVglDNNeUSUoX6OZ3dwj99-TiWCQN65rGdK6fYiExCKmlTuD1P3DXT6FLboWkimkMFCfo6gBNZeuqYgi-NWFf_D2F_gCKLGI_</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>738491031</pqid></control><display><type>article</type><title>Similarity searching in large combinatorial chemistry spaces</title><source>MEDLINE</source><source>SpringerNature Journals</source><creator>Rarey, M ; Stahl, M</creator><creatorcontrib>Rarey, M ; Stahl, M</creatorcontrib><description>We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.</description><identifier>ISSN: 0920-654X</identifier><identifier>EISSN: 1573-4951</identifier><identifier>DOI: 10.1023/A:1011144622059</identifier><identifier>PMID: 11495223</identifier><language>eng</language><publisher>Netherlands: Springer Nature B.V</publisher><subject>Algorithms ; Angiotensin II - antagonists &amp; inhibitors ; Chemistry ; Combinatorial Chemistry Techniques ; Cyclooxygenase 2 ; Cyclooxygenase 2 Inhibitors ; Cyclooxygenase Inhibitors - chemistry ; Cyclooxygenase Inhibitors - pharmacology ; Dopamine Agonists - chemistry ; Dopamine Agonists - pharmacology ; Dynamic programming ; Histamine H1 Antagonists - chemistry ; Isoenzymes - drug effects ; Models, Molecular ; Prostaglandin-Endoperoxide Synthases - drug effects ; Receptors, Dopamine D2 - drug effects ; Receptors, Dopamine D4 ; Serine Proteinase Inhibitors - chemistry ; Studies</subject><ispartof>Journal of computer-aided molecular design, 2001-06, Vol.15 (6), p.497-520</ispartof><rights>Kluwer Academic Publishers 2001</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c305t-658dd84d458bfca50fd3671f6c124a6ec73b33a590b86c081a38c86a295934733</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,782,786,27931,27932</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/11495223$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Rarey, M</creatorcontrib><creatorcontrib>Stahl, M</creatorcontrib><title>Similarity searching in large combinatorial chemistry spaces</title><title>Journal of computer-aided molecular design</title><addtitle>J Comput Aided Mol Des</addtitle><description>We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.</description><subject>Algorithms</subject><subject>Angiotensin II - antagonists &amp; inhibitors</subject><subject>Chemistry</subject><subject>Combinatorial Chemistry Techniques</subject><subject>Cyclooxygenase 2</subject><subject>Cyclooxygenase 2 Inhibitors</subject><subject>Cyclooxygenase Inhibitors - chemistry</subject><subject>Cyclooxygenase Inhibitors - pharmacology</subject><subject>Dopamine Agonists - chemistry</subject><subject>Dopamine Agonists - pharmacology</subject><subject>Dynamic programming</subject><subject>Histamine H1 Antagonists - chemistry</subject><subject>Isoenzymes - drug effects</subject><subject>Models, Molecular</subject><subject>Prostaglandin-Endoperoxide Synthases - drug effects</subject><subject>Receptors, Dopamine D2 - drug effects</subject><subject>Receptors, Dopamine D4</subject><subject>Serine Proteinase Inhibitors - chemistry</subject><subject>Studies</subject><issn>0920-654X</issn><issn>1573-4951</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2001</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNpdkEtLAzEUhYMotlbX7mRw4W705p2Im1J8QcGFCu6GTCbTpszLZGbRf2_AunF14fCdw8dF6BLDLQZC75b3GDDGjAlCgOsjNMdc0pxpjo_RHDSBXHD2NUNnMe4AQGoBp2iWGpoTQufo4d23vjHBj_ssOhPs1nebzHdZyjYus31b-s6MffCmyezWtT6OIaGDsS6eo5PaNNFdHO4CfT49fqxe8vXb8-tquc4tBT4mA1VVilWMq7K2hkNdUSFxLSwmzAhnJS0pNVxDqYQFhQ1VVglDNNeUSUoX6OZ3dwj99-TiWCQN65rGdK6fYiExCKmlTuD1P3DXT6FLboWkimkMFCfo6gBNZeuqYgi-NWFf_D2F_gCKLGI_</recordid><startdate>200106</startdate><enddate>200106</enddate><creator>Rarey, M</creator><creator>Stahl, M</creator><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>3V.</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>88I</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>KB.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>P5Z</scope><scope>P62</scope><scope>PCBAR</scope><scope>PDBOC</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope></search><sort><creationdate>200106</creationdate><title>Similarity searching in large combinatorial chemistry spaces</title><author>Rarey, M ; Stahl, M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c305t-658dd84d458bfca50fd3671f6c124a6ec73b33a590b86c081a38c86a295934733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2001</creationdate><topic>Algorithms</topic><topic>Angiotensin II - antagonists &amp; inhibitors</topic><topic>Chemistry</topic><topic>Combinatorial Chemistry Techniques</topic><topic>Cyclooxygenase 2</topic><topic>Cyclooxygenase 2 Inhibitors</topic><topic>Cyclooxygenase Inhibitors - chemistry</topic><topic>Cyclooxygenase Inhibitors - pharmacology</topic><topic>Dopamine Agonists - chemistry</topic><topic>Dopamine Agonists - pharmacology</topic><topic>Dynamic programming</topic><topic>Histamine H1 Antagonists - chemistry</topic><topic>Isoenzymes - drug effects</topic><topic>Models, Molecular</topic><topic>Prostaglandin-Endoperoxide Synthases - drug effects</topic><topic>Receptors, Dopamine D2 - drug effects</topic><topic>Receptors, Dopamine D4</topic><topic>Serine Proteinase Inhibitors - chemistry</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rarey, M</creatorcontrib><creatorcontrib>Stahl, M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Earth, Atmospheric &amp; Aquatic Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Earth, Atmospheric &amp; Aquatic Science Database</collection><collection>Materials Science Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of computer-aided molecular design</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rarey, M</au><au>Stahl, M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Similarity searching in large combinatorial chemistry spaces</atitle><jtitle>Journal of computer-aided molecular design</jtitle><addtitle>J Comput Aided Mol Des</addtitle><date>2001-06</date><risdate>2001</risdate><volume>15</volume><issue>6</issue><spage>497</spage><epage>520</epage><pages>497-520</pages><issn>0920-654X</issn><eissn>1573-4951</eissn><abstract>We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.</abstract><cop>Netherlands</cop><pub>Springer Nature B.V</pub><pmid>11495223</pmid><doi>10.1023/A:1011144622059</doi><tpages>24</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0920-654X
ispartof Journal of computer-aided molecular design, 2001-06, Vol.15 (6), p.497-520
issn 0920-654X
1573-4951
language eng
recordid cdi_proquest_miscellaneous_71067979
source MEDLINE; SpringerNature Journals
subjects Algorithms
Angiotensin II - antagonists & inhibitors
Chemistry
Combinatorial Chemistry Techniques
Cyclooxygenase 2
Cyclooxygenase 2 Inhibitors
Cyclooxygenase Inhibitors - chemistry
Cyclooxygenase Inhibitors - pharmacology
Dopamine Agonists - chemistry
Dopamine Agonists - pharmacology
Dynamic programming
Histamine H1 Antagonists - chemistry
Isoenzymes - drug effects
Models, Molecular
Prostaglandin-Endoperoxide Synthases - drug effects
Receptors, Dopamine D2 - drug effects
Receptors, Dopamine D4
Serine Proteinase Inhibitors - chemistry
Studies
title Similarity searching in large combinatorial chemistry spaces
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-04T02%3A03%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Similarity%20searching%20in%20large%20combinatorial%20chemistry%20spaces&rft.jtitle=Journal%20of%20computer-aided%20molecular%20design&rft.au=Rarey,%20M&rft.date=2001-06&rft.volume=15&rft.issue=6&rft.spage=497&rft.epage=520&rft.pages=497-520&rft.issn=0920-654X&rft.eissn=1573-4951&rft_id=info:doi/10.1023/A:1011144622059&rft_dat=%3Cproquest_pubme%3E71067979%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=738491031&rft_id=info:pmid/11495223&rfr_iscdi=true