A non‐parametric model for transcription factor binding sites

We introduce a non‐parametric representation of transcription factor binding sites which can model arbitrary dependencies between positions. As two parameters are varied, this representation smoothly interpolates between the empirical distribution of binding sites and the standard position‐specific...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nucleic acids research 2003-10, Vol.31 (19), p.e116-e116
Hauptverfasser: King, Oliver D., Roth, Frederick P.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We introduce a non‐parametric representation of transcription factor binding sites which can model arbitrary dependencies between positions. As two parameters are varied, this representation smoothly interpolates between the empirical distribution of binding sites and the standard position‐specific scoring matrix (PSSM). In a test of generalization to unseen binding sites using 10‐fold cross‐validation on known binding sites for 95 TRANSFAC transcription factors, this representation outperforms PSSMs on between 65 and 89 of the 95 transcription factors, depending on the choice of the two adjustable parameters. We also discuss how the non‐ parametric representation may be incorporated into frameworks for finding binding sites given only a collection of unaligned promoter regions.
ISSN:0305-1048
1362-4962
1362-4962
DOI:10.1093/nar/gng117