Single cell RNA-seq data clustering using TF-IDF based methods
Published in: | BMC Genomics 2018-08, Vol.19 (Suppl 6), p.569-569, Article 569 |
---|---|
Main authors: | , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Abstract: | Single cell transcriptomics is critical for understanding cellular heterogeneity and identification of novel cell types. Leveraging the recent advances in single cell RNA sequencing (scRNA-Seq) technology requires novel unsupervised clustering algorithms that are robust to high levels of technical and biological noise and scale to datasets of millions of cells.
We present novel computational approaches for clustering scRNA-seq data based on the Term Frequency - Inverse Document Frequency (TF-IDF) transformation that has been successfully used in the field of text analysis.
Empirical experimental results show that TF-IDF methods consistently outperform commonly used scRNA-Seq clustering approaches. |
---|---|
ISSN: | 1471-2164 |
DOI: | 10.1186/s12864-018-4922-4 |
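
The abstract names the TF-IDF transformation without spelling it out. The following is a minimal sketch of the textbook text-analysis definition applied to a count matrix, treating each cell as a "document" and each gene as a "term". It is not taken from the article: the function name `tfidf_transform`, the toy matrix, and the choice of a plain `log(N / df)` inverse document frequency are all assumptions made for illustration, and the published methods may use a different variant.

```python
import numpy as np

def tfidf_transform(counts):
    """Textbook TF-IDF on a (n_cells, n_genes) matrix of non-negative counts.

    Hypothetical helper for illustration; not the article's implementation.
    """
    counts = np.asarray(counts, dtype=float)
    n_cells = counts.shape[0]

    # Term frequency: each gene's share of the cell's total counts.
    totals = counts.sum(axis=1, keepdims=True)
    tf = np.divide(counts, totals, out=np.zeros_like(counts), where=totals > 0)

    # Document frequency: number of cells in which each gene is detected.
    df = (counts > 0).sum(axis=0)

    # Inverse document frequency; genes detected in no cell keep idf = 0.
    idf = np.zeros(counts.shape[1])
    detected = df > 0
    idf[detected] = np.log(n_cells / df[detected])

    return tf * idf

# Toy data (made up): 3 cells x 3 genes.
counts = np.array([
    [10, 0, 5],
    [8, 2, 0],
    [12, 0, 0],
])
scores = tfidf_transform(counts)
```

In this toy matrix the first gene is detected in every cell, so its inverse document frequency is log(1) = 0 and its whole column of scores is zero, while genes detected in a single cell are up-weighted. A clustering step would then operate on `scores` rather than on the raw counts.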