Improving distance measures between genomic tracks with mutual proximity

An increasing number of genomic tracks such as DNA methylation, histone modifications or transcriptomes are being produced to annotate genomes with functional states. The comparison of such high dimensional vectors obtained under various experimental conditions requires the use of a distance or diss...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Briefings in bioinformatics 2021-11, Vol.22 (6)
Hauptverfasser: Haschka, Thomas, Morlot, Jean Baptiste, Carron, Leopold, Mozziconacci, Julien
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:An increasing number of genomic tracks such as DNA methylation, histone modifications or transcriptomes are being produced to annotate genomes with functional states. The comparison of such high dimensional vectors obtained under various experimental conditions requires the use of a distance or dissimilarity measure. Pearson, Cosine and $L_{p}$-norm distances are commonly used for both count and binary vectors. In this article, we highlight how enhancement methods such as the contrast increasing mutual proximity' (MP) or local scaling' improve common distance measures. We present a systematic approach to evaluate the performance of such enhanced distance measures in terms of separability of groups of experimental replicates to outline their effect. We show that the MP' applied on the various distance measures drastically increases performance. Depending on the type of epigenetic experiment, MP' coupled together with Pearson, Cosine, $L_1$, Yule or Jaccard distances proves to be highly efficient in discriminating epigenomic profiles.
ISSN:1467-5463
1477-4054
DOI:10.1093/bib/bbab266