Clustering results on the 507 dataset with our implementation of GeneRAGE (Top Left), hierarchical clustering (Top Right), TribeMCL (Bottom Left) and our Spectral Clustering algorithm (Bottom Right)

Copyright information:Taken from "Spectral clustering of protein sequences"Nucleic Acids Research 2006;34(5):1571-1580.Published online 17 Mar 2006PMCID:PMC1409676.© The Author 2006. Published by Oxford University Press. All rights reserved The figures show only the top 30 most populated c...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Paccanaro, Alberto, Casbon, James A., Saqi, Mansoor A. S.
Format: Bild
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Copyright information:Taken from "Spectral clustering of protein sequences"Nucleic Acids Research 2006;34(5):1571-1580.Published online 17 Mar 2006PMCID:PMC1409676.© The Author 2006. Published by Oxford University Press. All rights reserved The figures show only the top 30 most populated clusters returned by each algorithm and 8 for the spectral clustering, since it returned only 8 clusters. Each row in the diagrams corresponds to a different cluster. Short (green) bars represent the assignment of each protein sequence to a cluster. Each protein has one of these bars in only one of the rows (clusters); the presence of the bar means that the protein is assigned to that cluster. Boundaries between super-families are shown by vertical thick (red) lines; boundaries between families within each super-family are shown by dotted (blue) lines. The dataset has 6 super-families, orderly from left to right: Globin-like (88), EF-hand (83), Cupredoxins (78), ()glycosidases (83), Thioredoxin-like (81), Membrane all-alpha (94).
DOI:10.6084/m9.figshare.53697