Transcription factor binding site identification using the self-organizing map

Motivation: The automatic identification of over-represented motifs present in a collection of sequences continues to be a challenging problem in computational biology. In this paper, we propose a self-organizing map of position weight matrices as an alternative method for motif discovery. The advan...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics 2005-05, Vol.21 (9), p.1807-1814
Hauptverfasser: Mahony, Shaun, Hendrix, David, Golden, Aaron, Smith, Terry J., Rokhsar, Daniel S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Motivation: The automatic identification of over-represented motifs present in a collection of sequences continues to be a challenging problem in computational biology. In this paper, we propose a self-organizing map of position weight matrices as an alternative method for motif discovery. The advantage of this approach is that it can be used to simultaneously characterize every feature present in the dataset, thus lessening the chance that weaker signals will be missed. Features identified are ranked in terms of over-representation relative to a background model. Results: We present an implementation of this approach, named SOMBRERO (self-organizing map for biological regulatory element recognition and ordering), which is capable of discovering multiple distinct motifs present in a single dataset. Demonstrated here are the advantages of our approach on various datasets and SOMBRERO's improved performance over two popular motif-finding programs, MEME and AlignACE. Availability: SOMBRERO is available free of charge from http://bioinf.nuigalway.ie/sombrero Contact: shaun.mahony@nuigalway.ie Supplementary information: http://bioinf.nuigalway.ie/sombrero/additional
ISSN:1367-4803
1460-2059
1367-4811
DOI:10.1093/bioinformatics/bti256