Similarity searching in large combinatorial chemistry spaces
We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature...
Gespeichert in:
Veröffentlicht in: | Journal of computer-aided molecular design 2001-06, Vol.15 (6), p.497-520 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 520 |
---|---|
container_issue | 6 |
container_start_page | 497 |
container_title | Journal of computer-aided molecular design |
container_volume | 15 |
creator | Rarey, M Stahl, M |
description | We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes. |
doi_str_mv | 10.1023/A:1011144622059 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_71067979</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>71067979</sourcerecordid><originalsourceid>FETCH-LOGICAL-c305t-658dd84d458bfca50fd3671f6c124a6ec73b33a590b86c081a38c86a295934733</originalsourceid><addsrcrecordid>eNpdkEtLAzEUhYMotlbX7mRw4W705p2Im1J8QcGFCu6GTCbTpszLZGbRf2_AunF14fCdw8dF6BLDLQZC75b3GDDGjAlCgOsjNMdc0pxpjo_RHDSBXHD2NUNnMe4AQGoBp2iWGpoTQufo4d23vjHBj_ssOhPs1nebzHdZyjYus31b-s6MffCmyezWtT6OIaGDsS6eo5PaNNFdHO4CfT49fqxe8vXb8-tquc4tBT4mA1VVilWMq7K2hkNdUSFxLSwmzAhnJS0pNVxDqYQFhQ1VVglDNNeUSUoX6OZ3dwj99-TiWCQN65rGdK6fYiExCKmlTuD1P3DXT6FLboWkimkMFCfo6gBNZeuqYgi-NWFf_D2F_gCKLGI_</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>738491031</pqid></control><display><type>article</type><title>Similarity searching in large combinatorial chemistry spaces</title><source>MEDLINE</source><source>SpringerNature Journals</source><creator>Rarey, M ; Stahl, M</creator><creatorcontrib>Rarey, M ; Stahl, M</creatorcontrib><description>We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.</description><identifier>ISSN: 0920-654X</identifier><identifier>EISSN: 1573-4951</identifier><identifier>DOI: 10.1023/A:1011144622059</identifier><identifier>PMID: 11495223</identifier><language>eng</language><publisher>Netherlands: Springer Nature B.V</publisher><subject>Algorithms ; Angiotensin II - antagonists & inhibitors ; Chemistry ; Combinatorial Chemistry Techniques ; Cyclooxygenase 2 ; Cyclooxygenase 2 Inhibitors ; Cyclooxygenase Inhibitors - chemistry ; Cyclooxygenase Inhibitors - pharmacology ; Dopamine Agonists - chemistry ; Dopamine Agonists - pharmacology ; Dynamic programming ; Histamine H1 Antagonists - chemistry ; Isoenzymes - drug effects ; Models, Molecular ; Prostaglandin-Endoperoxide Synthases - drug effects ; Receptors, Dopamine D2 - drug effects ; Receptors, Dopamine D4 ; Serine Proteinase Inhibitors - chemistry ; Studies</subject><ispartof>Journal of computer-aided molecular design, 2001-06, Vol.15 (6), p.497-520</ispartof><rights>Kluwer Academic Publishers 2001</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c305t-658dd84d458bfca50fd3671f6c124a6ec73b33a590b86c081a38c86a295934733</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,782,786,27931,27932</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/11495223$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Rarey, M</creatorcontrib><creatorcontrib>Stahl, M</creatorcontrib><title>Similarity searching in large combinatorial chemistry spaces</title><title>Journal of computer-aided molecular design</title><addtitle>J Comput Aided Mol Des</addtitle><description>We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.</description><subject>Algorithms</subject><subject>Angiotensin II - antagonists & inhibitors</subject><subject>Chemistry</subject><subject>Combinatorial Chemistry Techniques</subject><subject>Cyclooxygenase 2</subject><subject>Cyclooxygenase 2 Inhibitors</subject><subject>Cyclooxygenase Inhibitors - chemistry</subject><subject>Cyclooxygenase Inhibitors - pharmacology</subject><subject>Dopamine Agonists - chemistry</subject><subject>Dopamine Agonists - pharmacology</subject><subject>Dynamic programming</subject><subject>Histamine H1 Antagonists - chemistry</subject><subject>Isoenzymes - drug effects</subject><subject>Models, Molecular</subject><subject>Prostaglandin-Endoperoxide Synthases - drug effects</subject><subject>Receptors, Dopamine D2 - drug effects</subject><subject>Receptors, Dopamine D4</subject><subject>Serine Proteinase Inhibitors - chemistry</subject><subject>Studies</subject><issn>0920-654X</issn><issn>1573-4951</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2001</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNpdkEtLAzEUhYMotlbX7mRw4W705p2Im1J8QcGFCu6GTCbTpszLZGbRf2_AunF14fCdw8dF6BLDLQZC75b3GDDGjAlCgOsjNMdc0pxpjo_RHDSBXHD2NUNnMe4AQGoBp2iWGpoTQufo4d23vjHBj_ssOhPs1nebzHdZyjYus31b-s6MffCmyezWtT6OIaGDsS6eo5PaNNFdHO4CfT49fqxe8vXb8-tquc4tBT4mA1VVilWMq7K2hkNdUSFxLSwmzAhnJS0pNVxDqYQFhQ1VVglDNNeUSUoX6OZ3dwj99-TiWCQN65rGdK6fYiExCKmlTuD1P3DXT6FLboWkimkMFCfo6gBNZeuqYgi-NWFf_D2F_gCKLGI_</recordid><startdate>200106</startdate><enddate>200106</enddate><creator>Rarey, M</creator><creator>Stahl, M</creator><general>Springer Nature B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>3V.</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>88I</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>BKSAR</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>KB.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>P5Z</scope><scope>P62</scope><scope>PCBAR</scope><scope>PDBOC</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope></search><sort><creationdate>200106</creationdate><title>Similarity searching in large combinatorial chemistry spaces</title><author>Rarey, M ; Stahl, M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c305t-658dd84d458bfca50fd3671f6c124a6ec73b33a590b86c081a38c86a295934733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2001</creationdate><topic>Algorithms</topic><topic>Angiotensin II - antagonists & inhibitors</topic><topic>Chemistry</topic><topic>Combinatorial Chemistry Techniques</topic><topic>Cyclooxygenase 2</topic><topic>Cyclooxygenase 2 Inhibitors</topic><topic>Cyclooxygenase Inhibitors - chemistry</topic><topic>Cyclooxygenase Inhibitors - pharmacology</topic><topic>Dopamine Agonists - chemistry</topic><topic>Dopamine Agonists - pharmacology</topic><topic>Dynamic programming</topic><topic>Histamine H1 Antagonists - chemistry</topic><topic>Isoenzymes - drug effects</topic><topic>Models, Molecular</topic><topic>Prostaglandin-Endoperoxide Synthases - drug effects</topic><topic>Receptors, Dopamine D2 - drug effects</topic><topic>Receptors, Dopamine D4</topic><topic>Serine Proteinase Inhibitors - chemistry</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rarey, M</creatorcontrib><creatorcontrib>Stahl, M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Earth, Atmospheric & Aquatic Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Earth, Atmospheric & Aquatic Science Database</collection><collection>Materials Science Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of computer-aided molecular design</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rarey, M</au><au>Stahl, M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Similarity searching in large combinatorial chemistry spaces</atitle><jtitle>Journal of computer-aided molecular design</jtitle><addtitle>J Comput Aided Mol Des</addtitle><date>2001-06</date><risdate>2001</risdate><volume>15</volume><issue>6</issue><spage>497</spage><epage>520</epage><pages>497-520</pages><issn>0920-654X</issn><eissn>1573-4951</eissn><abstract>We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17,000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.</abstract><cop>Netherlands</cop><pub>Springer Nature B.V</pub><pmid>11495223</pmid><doi>10.1023/A:1011144622059</doi><tpages>24</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0920-654X |
ispartof | Journal of computer-aided molecular design, 2001-06, Vol.15 (6), p.497-520 |
issn | 0920-654X 1573-4951 |
language | eng |
recordid | cdi_proquest_miscellaneous_71067979 |
source | MEDLINE; SpringerNature Journals |
subjects | Algorithms Angiotensin II - antagonists & inhibitors Chemistry Combinatorial Chemistry Techniques Cyclooxygenase 2 Cyclooxygenase 2 Inhibitors Cyclooxygenase Inhibitors - chemistry Cyclooxygenase Inhibitors - pharmacology Dopamine Agonists - chemistry Dopamine Agonists - pharmacology Dynamic programming Histamine H1 Antagonists - chemistry Isoenzymes - drug effects Models, Molecular Prostaglandin-Endoperoxide Synthases - drug effects Receptors, Dopamine D2 - drug effects Receptors, Dopamine D4 Serine Proteinase Inhibitors - chemistry Studies |
title | Similarity searching in large combinatorial chemistry spaces |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-04T02%3A03%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Similarity%20searching%20in%20large%20combinatorial%20chemistry%20spaces&rft.jtitle=Journal%20of%20computer-aided%20molecular%20design&rft.au=Rarey,%20M&rft.date=2001-06&rft.volume=15&rft.issue=6&rft.spage=497&rft.epage=520&rft.pages=497-520&rft.issn=0920-654X&rft.eissn=1573-4951&rft_id=info:doi/10.1023/A:1011144622059&rft_dat=%3Cproquest_pubme%3E71067979%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=738491031&rft_id=info:pmid/11495223&rfr_iscdi=true |