CAALIGN: a program for pairwise and multiple protein-structure alignment

Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solut...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Acta crystallographica. Section D, Biological crystallography. Biological crystallography., 2007-04, Vol.63 (4), p.514-525
1. Verfasser:	Oldfield, T. J.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Amino Acid Sequence CAALIGN Computational Biology data mining Databases, Protein Models, Molecular Molecular Sequence Data Proteins - chemistry Sequence Alignment similarity Software Structural Homology, Protein structure alignment superposition
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	525
container_issue	4
container_start_page	514
container_title	Acta crystallographica. Section D, Biological crystallography.
container_volume	63
creator	Oldfield, T. J.
description	Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.
doi_str_mv	10.1107/S0907444907000844
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_70286212</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>70286212</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3868-1878ad3a9d42eb200801fe06ec060a3f8dc1c48e909b8c557dacfe68e844c203</originalsourceid><addsrcrecordid>eNqFkF1LwzAUhoMozq8f4I30yrvqSdI2qXdj6iaMKiiKVyFLT0e07WbSovv3RjZU8MKbJJDneTnnJeSYwhmlIM7vIQeRJEk4AUAmyRbZozzPY4BEbP96D8i-9y-BYYyLXTKgggvGU7FHJqPhcHozLi4iHS3dYu50E1ULFy21de_WY6TbMmr6urPLGr-IDm0b-871putd-K7tvG2w7Q7JTqVrj0eb-4A8XF89jCbx9HZ8MxpOY8NlJmMqhdQl13mZMJyxMDXQCiFDAxloXsnSUJNIzCGfSZOmotSmwkxi2M4w4AfkdB0bRnnr0Xeqsd5gXesWF71XApjMGGUBpGvQuIX3Diu1dLbRbqUoqK_21J_2gnOyCe9nDZY_xqauAMg18G5rXP2fqIbPl1dFSqUMarxWre_w41vV7lVlIT5VT8VYFY9F-phN79Q9_wTOZ4jk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>70286212</pqid></control><display><type>article</type><title>CAALIGN: a program for pairwise and multiple protein-structure alignment</title><source>MEDLINE</source><source>Wiley Journals</source><source>Alma/SFX Local Collection</source><creator>Oldfield, T. J.</creator><creatorcontrib>Oldfield, T. J.</creatorcontrib><description>Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.</description><identifier>ISSN: 1399-0047</identifier><identifier>ISSN: 0907-4449</identifier><identifier>EISSN: 1399-0047</identifier><identifier>DOI: 10.1107/S0907444907000844</identifier><identifier>PMID: 17372357</identifier><language>eng</language><publisher>5 Abbey Square, Chester, Cheshire CH1 2HU, England: Blackwell Publishing Ltd</publisher><subject>Algorithms ; Amino Acid Sequence ; CAALIGN ; Computational Biology ; data mining ; Databases, Protein ; Models, Molecular ; Molecular Sequence Data ; Proteins - chemistry ; Sequence Alignment ; similarity ; Software ; Structural Homology, Protein ; structure alignment ; superposition</subject><ispartof>Acta crystallographica. Section D, Biological crystallography., 2007-04, Vol.63 (4), p.514-525</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3868-1878ad3a9d42eb200801fe06ec060a3f8dc1c48e909b8c557dacfe68e844c203</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1107%2FS0907444907000844$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1107%2FS0907444907000844$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1417,27924,27925,45574,45575</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/17372357$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Oldfield, T. J.</creatorcontrib><title>CAALIGN: a program for pairwise and multiple protein-structure alignment</title><title>Acta crystallographica. Section D, Biological crystallography.</title><addtitle>Acta Cryst. D</addtitle><description>Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.</description><subject>Algorithms</subject><subject>Amino Acid Sequence</subject><subject>CAALIGN</subject><subject>Computational Biology</subject><subject>data mining</subject><subject>Databases, Protein</subject><subject>Models, Molecular</subject><subject>Molecular Sequence Data</subject><subject>Proteins - chemistry</subject><subject>Sequence Alignment</subject><subject>similarity</subject><subject>Software</subject><subject>Structural Homology, Protein</subject><subject>structure alignment</subject><subject>superposition</subject><issn>1399-0047</issn><issn>0907-4449</issn><issn>1399-0047</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2007</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkF1LwzAUhoMozq8f4I30yrvqSdI2qXdj6iaMKiiKVyFLT0e07WbSovv3RjZU8MKbJJDneTnnJeSYwhmlIM7vIQeRJEk4AUAmyRbZozzPY4BEbP96D8i-9y-BYYyLXTKgggvGU7FHJqPhcHozLi4iHS3dYu50E1ULFy21de_WY6TbMmr6urPLGr-IDm0b-871putd-K7tvG2w7Q7JTqVrj0eb-4A8XF89jCbx9HZ8MxpOY8NlJmMqhdQl13mZMJyxMDXQCiFDAxloXsnSUJNIzCGfSZOmotSmwkxi2M4w4AfkdB0bRnnr0Xeqsd5gXesWF71XApjMGGUBpGvQuIX3Diu1dLbRbqUoqK_21J_2gnOyCe9nDZY_xqauAMg18G5rXP2fqIbPl1dFSqUMarxWre_w41vV7lVlIT5VT8VYFY9F-phN79Q9_wTOZ4jk</recordid><startdate>200704</startdate><enddate>200704</enddate><creator>Oldfield, T. J.</creator><general>Blackwell Publishing Ltd</general><scope>BSCLL</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>200704</creationdate><title>CAALIGN: a program for pairwise and multiple protein-structure alignment</title><author>Oldfield, T. J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3868-1878ad3a9d42eb200801fe06ec060a3f8dc1c48e909b8c557dacfe68e844c203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2007</creationdate><topic>Algorithms</topic><topic>Amino Acid Sequence</topic><topic>CAALIGN</topic><topic>Computational Biology</topic><topic>data mining</topic><topic>Databases, Protein</topic><topic>Models, Molecular</topic><topic>Molecular Sequence Data</topic><topic>Proteins - chemistry</topic><topic>Sequence Alignment</topic><topic>similarity</topic><topic>Software</topic><topic>Structural Homology, Protein</topic><topic>structure alignment</topic><topic>superposition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Oldfield, T. J.</creatorcontrib><collection>Istex</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Acta crystallographica. Section D, Biological crystallography.</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oldfield, T. J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>CAALIGN: a program for pairwise and multiple protein-structure alignment</atitle><jtitle>Acta crystallographica. Section D, Biological crystallography.</jtitle><addtitle>Acta Cryst. D</addtitle><date>2007-04</date><risdate>2007</risdate><volume>63</volume><issue>4</issue><spage>514</spage><epage>525</epage><pages>514-525</pages><issn>1399-0047</issn><issn>0907-4449</issn><eissn>1399-0047</eissn><abstract>Coordinate superposition of proteins provides a structural basis to protein similarity and therefore complements the technique of sequence alignment. Methods that carry out structure alignment are faced with the problem of the large number of trials necessary to determine the optimal alignment solution. This article presents a method of carrying out rapid (subsecond) protein‐structure alignment between pairs of proteins based on a maximal Cα‐atom superposition. The algorithm can return alignments of 12 or more residues in length as multiple non‐overlapping solutions of alignment between a pair of proteins which are independent of the fold connectivity and secondary‐structure content. The algorithm is equally effective for all protein fold types and can align proteins containing no secondary‐structure elements such as is the case when searching for common turn structures in proteins. It has high sensitivity and returns the set of true positive results before any false positives as judged by SCOP classification. It can find alignments between topologically different folds and returns information about sequence alignment based on structure alignment. Additionally, this algorithm has been extended to carry out multiple structure alignment to determine common structures within groups of proteins, including the nondegenerate set of proteins in the PDB. The algorithm has been implemented within the program CAALIGN and this article presents results from pairwise structure alignment, multiple structure alignment and the generation of common structure fragments found within the PDB using multiple structure alignment.</abstract><cop>5 Abbey Square, Chester, Cheshire CH1 2HU, England</cop><pub>Blackwell Publishing Ltd</pub><pmid>17372357</pmid><doi>10.1107/S0907444907000844</doi><tpages>12</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1399-0047
ispartof	Acta crystallographica. Section D, Biological crystallography., 2007-04, Vol.63 (4), p.514-525
issn	1399-0047 0907-4449 1399-0047
language	eng
recordid	cdi_proquest_miscellaneous_70286212
source	MEDLINE; Wiley Journals; Alma/SFX Local Collection
subjects	Algorithms Amino Acid Sequence CAALIGN Computational Biology data mining Databases, Protein Models, Molecular Molecular Sequence Data Proteins - chemistry Sequence Alignment similarity Software Structural Homology, Protein structure alignment superposition
title	CAALIGN: a program for pairwise and multiple protein-structure alignment
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T07%3A40%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=CAALIGN:%20a%20program%20for%20pairwise%20and%20multiple%20protein-structure%20alignment&rft.jtitle=Acta%20crystallographica.%20Section%20D,%20Biological%20crystallography.&rft.au=Oldfield,%20T.%20J.&rft.date=2007-04&rft.volume=63&rft.issue=4&rft.spage=514&rft.epage=525&rft.pages=514-525&rft.issn=1399-0047&rft.eissn=1399-0047&rft_id=info:doi/10.1107/S0907444907000844&rft_dat=%3Cproquest_cross%3E70286212%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=70286212&rft_id=info:pmid/17372357&rfr_iscdi=true