CAALIGN: a program for pairwise and multiple protein-structure alignment
Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solut...
Gespeichert in:
Veröffentlicht in: | Acta crystallographica. Section D, Biological crystallography. Biological crystallography., 2007-04, Vol.63 (4), p.514-525 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 525 |
---|---|
container_issue | 4 |
container_start_page | 514 |
container_title | Acta crystallographica. Section D, Biological crystallography. |
container_volume | 63 |
creator | Oldfield, T. J. |
description | Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment. |
doi_str_mv | 10.1107/S0907444907000844 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_70286212</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>70286212</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3868-1878ad3a9d42eb200801fe06ec060a3f8dc1c48e909b8c557dacfe68e844c203</originalsourceid><addsrcrecordid>eNqFkF1LwzAUhoMozq8f4I30yrvqSdI2qXdj6iaMKiiKVyFLT0e07WbSovv3RjZU8MKbJJDneTnnJeSYwhmlIM7vIQeRJEk4AUAmyRbZozzPY4BEbP96D8i-9y-BYYyLXTKgggvGU7FHJqPhcHozLi4iHS3dYu50E1ULFy21de_WY6TbMmr6urPLGr-IDm0b-871putd-K7tvG2w7Q7JTqVrj0eb-4A8XF89jCbx9HZ8MxpOY8NlJmMqhdQl13mZMJyxMDXQCiFDAxloXsnSUJNIzCGfSZOmotSmwkxi2M4w4AfkdB0bRnnr0Xeqsd5gXesWF71XApjMGGUBpGvQuIX3Diu1dLbRbqUoqK_21J_2gnOyCe9nDZY_xqauAMg18G5rXP2fqIbPl1dFSqUMarxWre_w41vV7lVlIT5VT8VYFY9F-phN79Q9_wTOZ4jk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>70286212</pqid></control><display><type>article</type><title>CAALIGN: a program for pairwise and multiple protein-structure alignment</title><source>MEDLINE</source><source>Wiley Journals</source><source>Alma/SFX Local Collection</source><creator>Oldfield, T. J.</creator><creatorcontrib>Oldfield, T. J.</creatorcontrib><description>Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.</description><identifier>ISSN: 1399-0047</identifier><identifier>ISSN: 0907-4449</identifier><identifier>EISSN: 1399-0047</identifier><identifier>DOI: 10.1107/S0907444907000844</identifier><identifier>PMID: 17372357</identifier><language>eng</language><publisher>5 Abbey Square, Chester, Cheshire CH1 2HU, England: Blackwell Publishing Ltd</publisher><subject>Algorithms ; Amino Acid Sequence ; CAALIGN ; Computational Biology ; data mining ; Databases, Protein ; Models, Molecular ; Molecular Sequence Data ; Proteins - chemistry ; Sequence Alignment ; similarity ; Software ; Structural Homology, Protein ; structure alignment ; superposition</subject><ispartof>Acta crystallographica. Section D, Biological crystallography., 2007-04, Vol.63 (4), p.514-525</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3868-1878ad3a9d42eb200801fe06ec060a3f8dc1c48e909b8c557dacfe68e844c203</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1107%2FS0907444907000844$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1107%2FS0907444907000844$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1417,27924,27925,45574,45575</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/17372357$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Oldfield, T. J.</creatorcontrib><title>CAALIGN: a program for pairwise and multiple protein-structure alignment</title><title>Acta crystallographica. Section D, Biological crystallography.</title><addtitle>Acta Cryst. D</addtitle><description>Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.</description><subject>Algorithms</subject><subject>Amino Acid Sequence</subject><subject>CAALIGN</subject><subject>Computational Biology</subject><subject>data mining</subject><subject>Databases, Protein</subject><subject>Models, Molecular</subject><subject>Molecular Sequence Data</subject><subject>Proteins - chemistry</subject><subject>Sequence Alignment</subject><subject>similarity</subject><subject>Software</subject><subject>Structural Homology, Protein</subject><subject>structure alignment</subject><subject>superposition</subject><issn>1399-0047</issn><issn>0907-4449</issn><issn>1399-0047</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2007</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkF1LwzAUhoMozq8f4I30yrvqSdI2qXdj6iaMKiiKVyFLT0e07WbSovv3RjZU8MKbJJDneTnnJeSYwhmlIM7vIQeRJEk4AUAmyRbZozzPY4BEbP96D8i-9y-BYYyLXTKgggvGU7FHJqPhcHozLi4iHS3dYu50E1ULFy21de_WY6TbMmr6urPLGr-IDm0b-871putd-K7tvG2w7Q7JTqVrj0eb-4A8XF89jCbx9HZ8MxpOY8NlJmMqhdQl13mZMJyxMDXQCiFDAxloXsnSUJNIzCGfSZOmotSmwkxi2M4w4AfkdB0bRnnr0Xeqsd5gXesWF71XApjMGGUBpGvQuIX3Diu1dLbRbqUoqK_21J_2gnOyCe9nDZY_xqauAMg18G5rXP2fqIbPl1dFSqUMarxWre_w41vV7lVlIT5VT8VYFY9F-phN79Q9_wTOZ4jk</recordid><startdate>200704</startdate><enddate>200704</enddate><creator>Oldfield, T. J.</creator><general>Blackwell Publishing Ltd</general><scope>BSCLL</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>200704</creationdate><title>CAALIGN: a program for pairwise and multiple protein-structure alignment</title><author>Oldfield, T. J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3868-1878ad3a9d42eb200801fe06ec060a3f8dc1c48e909b8c557dacfe68e844c203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2007</creationdate><topic>Algorithms</topic><topic>Amino Acid Sequence</topic><topic>CAALIGN</topic><topic>Computational Biology</topic><topic>data mining</topic><topic>Databases, Protein</topic><topic>Models, Molecular</topic><topic>Molecular Sequence Data</topic><topic>Proteins - chemistry</topic><topic>Sequence Alignment</topic><topic>similarity</topic><topic>Software</topic><topic>Structural Homology, Protein</topic><topic>structure alignment</topic><topic>superposition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Oldfield, T. J.</creatorcontrib><collection>Istex</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Acta crystallographica. Section D, Biological crystallography.</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oldfield, T. J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>CAALIGN: a program for pairwise and multiple protein-structure alignment</atitle><jtitle>Acta crystallographica. Section D, Biological crystallography.</jtitle><addtitle>Acta Cryst. D</addtitle><date>2007-04</date><risdate>2007</risdate><volume>63</volume><issue>4</issue><spage>514</spage><epage>525</epage><pages>514-525</pages><issn>1399-0047</issn><issn>0907-4449</issn><eissn>1399-0047</eissn><abstract>Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.</abstract><cop>5 Abbey Square, Chester, Cheshire CH1 2HU, England</cop><pub>Blackwell Publishing Ltd</pub><pmid>17372357</pmid><doi>10.1107/S0907444907000844</doi><tpages>12</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1399-0047 |
ispartof | Acta crystallographica. Section D, Biological crystallography., 2007-04, Vol.63 (4), p.514-525 |
issn | 1399-0047 0907-4449 1399-0047 |
language | eng |
recordid | cdi_proquest_miscellaneous_70286212 |
source | MEDLINE; Wiley Journals; Alma/SFX Local Collection |
subjects | Algorithms Amino Acid Sequence CAALIGN Computational Biology data mining Databases, Protein Models, Molecular Molecular Sequence Data Proteins - chemistry Sequence Alignment similarity Software Structural Homology, Protein structure alignment superposition |
title | CAALIGN: a program for pairwise and multiple protein-structure alignment |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T07%3A40%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=CAALIGN:%20a%20program%20for%20pairwise%20and%20multiple%20protein-structure%20alignment&rft.jtitle=Acta%20crystallographica.%20Section%20D,%20Biological%20crystallography.&rft.au=Oldfield,%20T.%20J.&rft.date=2007-04&rft.volume=63&rft.issue=4&rft.spage=514&rft.epage=525&rft.pages=514-525&rft.issn=1399-0047&rft.eissn=1399-0047&rft_id=info:doi/10.1107/S0907444907000844&rft_dat=%3Cproquest_cross%3E70286212%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=70286212&rft_id=info:pmid/17372357&rfr_iscdi=true |