Galgo: a bi-objective evolutionary meta-heuristic identifies robust transcriptomic classifiers associated with patient outcome across multiple cancer types
Abstract Motivation Statistical and machine-learning analyses of tumor transcriptomic profiles offer a powerful resource to gain deeper understanding of tumor subtypes and disease prognosis. Currently, prognostic gene-expression signatures do not exist for all cancer types, and most developed to dat...
Gespeichert in:
Veröffentlicht in: | Bioinformatics 2020-12, Vol.36 (20), p.5037-5044 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Abstract
Motivation
Statistical and machine-learning analyses of tumor transcriptomic profiles offer a powerful resource to gain deeper understanding of tumor subtypes and disease prognosis. Currently, prognostic gene-expression signatures do not exist for all cancer types, and most developed to date have been optimized for individual tumor types. In Galgo, we implement a bi-objective optimization approach that prioritizes gene signature cohesiveness and patient survival in parallel, which provides greater power to identify tumor transcriptomic phenotypes strongly associated with patient survival.
Results
To compare the predictive power of the signatures obtained by Galgo with previously studied subtyping methods, we used a meta-analytic approach testing a total of 35 large population-based transcriptomic biobanks of four different cancer types. Galgo-generated colorectal and lung adenocarcinoma signatures were stronger predictors of patient survival compared to published molecular classification schemes. One Galgo-generated breast cancer signature outperformed PAM50, AIMS, SCMGENE and IntClust subtyping predictors. In high-grade serous ovarian cancer, Galgo signatures obtained similar predictive power to a consensus classification method. In all cases, Galgo subtypes reflected enrichment of gene sets related to the hallmarks of the disease, which highlights the biological relevance of the partitions found.
Availability and implementation
The open-source R package is available on www.github.com/harpomaxx/galgo.
Supplementary information
Supplementary data are available at Bioinformatics online. |
---|---|
ISSN: | 1367-4803 1460-2059 1367-4811 |
DOI: | 10.1093/bioinformatics/btaa619 |