MATCH-BOX: a fundamentally new algorithm for the simultaneous alignment of several protein sequences

Bibliographic Details
Published in:	Bioinformatics 1992-10, Vol.8 (5), p.501-509
Main Authors:	Depiereux, Eric, Feytmans, Ernest
Format:	Article
Language:	English
Subjects:	Algorithms; Amino Acid Sequence; Analytical, structural and metabolic biochemistry; Biological and medical sciences; Cluster Analysis; Fundamental and applied biological sciences. Psychology; General aspects, investigation methods; Proteins; Sequence Alignment; Software Design
Online Access:	Full text

Description
Summary:	Original algorithms for the simultaneous alignment of protein sequences are presented, including sequence clustering and within- or between-group multiple alignment. The method of matching similar regions is fundamentally new. Complete matches are formed by segments that are more similar than expected by chance, according to a given probability limit. Any classic or user-defined score matrix can be used to express the similarity between residues. The algorithm searches for complete matches common to all the sequences without performing pairwise alignment and regardless of gap weighting. An automatic screening delineates all the similar regions (boxes) that can be defined for a given maximal shift between the sequences. The shift can be large enough to allow any region of one sequence to be matched with any region of another. It can also be short and used to refine the alignment around anchor points. The algorithm provides the most likely optimal alignment and a comprehensive list of the alignment dilemmas. Both automatic and interactive operation are supported. Depending on the complexity of the problem, a final alignment is either obtained fully automatically or requires some interactive handling to discriminate between alternative pathways.
ISSN:	1367-4803 0266-7061 1460-2059
DOI:	10.1093/bioinformatics/8.5.501
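
Illustration:	The summary describes complete matches as ungapped segments, one per sequence, that are more similar than expected by chance under a probability limit, searched within a maximal shift and grouped into boxes. The Python sketch below is one possible reading of that idea and is not the authors' implementation. Everything in it is an assumption made for illustration: the function names (identity_score, score_threshold, complete_matches, merge_into_boxes), the toy identity score matrix, the exact chance distribution computed from pooled residue frequencies, the requirement that every pair of segments reaches the threshold, the shift measured relative to the first sequence, and the rule that merges consecutive matches into a box.

```python
from collections import Counter
from itertools import combinations, product


def identity_score(a, b):
    """Toy score matrix (assumption): 1 for identical residues, else 0."""
    return 1 if a == b else 0


def segment_score(x, y, score=identity_score):
    """Ungapped similarity of two equal-length segments."""
    return sum(score(a, b) for a, b in zip(x, y))


def score_threshold(seqs, width, p_limit, score=identity_score):
    """Smallest segment score whose chance probability is <= p_limit.

    Chance is modelled (assumption) as independent residue pairs drawn
    from the pooled residue frequencies of all sequences.
    """
    counts = Counter("".join(seqs))
    total = sum(counts.values())
    freq = {r: n / total for r, n in counts.items()}
    single = Counter()  # score distribution of one random residue pair
    for a, b in product(freq, repeat=2):
        single[score(a, b)] += freq[a] * freq[b]
    dist = {0: 1.0}  # convolve `width` times for a whole segment
    for _ in range(width):
        nxt = Counter()
        for s, p in dist.items():
            for t, q in single.items():
                nxt[s + t] += p * q
        dist = nxt
    threshold = max(dist) + 1  # unreachable unless the tail allows it
    tail = 0.0
    for s in sorted(dist, reverse=True):
        tail += dist[s]
        if tail > p_limit:
            break
        threshold = s
    return threshold


def complete_matches(seqs, width, max_shift, p_limit, score=identity_score):
    """Start positions (one per sequence) of segments that all match.

    Brute force over all shift combinations; cost grows exponentially
    with the number of sequences, so this is only for tiny inputs.
    """
    thr = score_threshold(seqs, width, p_limit, score)
    ref, others = seqs[0], seqs[1:]
    found = []
    for i in range(len(ref) - width + 1):
        ranges = [
            range(max(0, i - max_shift), min(len(s) - width, i + max_shift) + 1)
            for s in others
        ]
        for starts in product(*ranges):
            pos = (i,) + starts
            segs = [s[p:p + width] for s, p in zip(seqs, pos)]
            if all(segment_score(x, y, score) >= thr
                   for x, y in combinations(segs, 2)):
                found.append(pos)
    return found


def merge_into_boxes(matches, width):
    """Merge consecutive matches with the same shifts into (starts, length)."""
    by_shift = {}
    for m in matches:
        shift = tuple(p - m[0] for p in m)
        by_shift.setdefault(shift, []).append(m[0])
    boxes = []
    for shift, starts in by_shift.items():
        starts.sort()
        run_start = prev = starts[0]
        for i in starts[1:] + [None]:  # None closes the last run
            if i is None or i != prev + 1:
                box_starts = tuple(run_start + d for d in shift)
                boxes.append((box_starts, prev - run_start + width))
                run_start = i
            prev = i
    return sorted(boxes)


# Three made-up sequences sharing the motif WKDGH at different offsets.
seqs = ["ACWKDGHLMN", "PQRWKDGHST", "WKDGHEFIVY"]
matches = complete_matches(seqs, width=5, max_shift=3, p_limit=0.01)
print(matches)                       # [(2, 3, 0), (3, 4, 1), (4, 5, 2)]
print(merge_into_boxes(matches, 5))  # [((2, 3, 0), 7)]
```

With a large max_shift the search can pair any region of one sequence with any region of another; with a small one it only refines positions near an existing anchor, which mirrors the two uses of the shift mentioned in the summary. Replacing identity_score with a lookup into a classic or user-defined score matrix changes only the score argument.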