A novel significance score for gene selection and ranking

When identifying differentially expressed (DE) genes from high-throughput gene expression measurements, we would like to take both statistical significance (such as P-value) and biological relevance (such as fold change) into consideration. In gene set enrichment analysis (GSEA), a score that can co...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Bioinformatics 2014-03, Vol.30 (6), p.801-807
Hauptverfasser:	Xiao, Yufei, Hsiao, Tzu-Hung, Suresh, Uthra, Chen, Hung-I Harry, Wu, Xiaowu, Wolf, Steven E, Chen, Yidong
Format:	Artikel
Sprache:	eng
Schlagworte:	Bioinformatics Biological Breast Neoplasms - genetics Breast Neoplasms - metabolism Cancer Databases, Genetic Gene Expression Gene Expression Profiling - methods Genes Humans Oligonucleotide Array Sequence Analysis - methods On-line systems Online Original Papers Receivers Receptors, Estrogen - metabolism ROC Curve
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	When identifying differentially expressed (DE) genes from high-throughput gene expression measurements, we would like to take both statistical significance (such as P-value) and biological relevance (such as fold change) into consideration. In gene set enrichment analysis (GSEA), a score that can combine fold change and P-value together is needed for better gene ranking. We defined a gene significance score π-value by combining expression fold change and statistical significance (P-value), and explored its statistical properties. When compared to various existing methods, π-value based approach is more robust in selecting DE genes, with the largest area under curve in its receiver operating characteristic curve. We applied π-value to GSEA and found it comparable to P-value and t-statistic based methods, with added protection against false discovery in certain situations. Finally, in a gene functional study of breast cancer profiles, we showed that using π-value helps elucidating otherwise overlooked important biological functions. http://gccri.uthscsa.edu/Pi_Value_Supplementary.asp xy@ieee.org, cheny8@uthscsa.edu Supplementary data are available at Bioinformatics online.
ISSN:	1367-4803 1367-4811 1460-2059
DOI:	10.1093/bioinformatics/btr671