Statistical analysis of multiple regions-of-interest in multiplexed spatial proteomics data

Bibliographic details
Published in:	Briefings in bioinformatics 2024-09, Vol.25 (6)
Main authors:	Samorodnitsky, Sarah; Wu, Michael C
Format:	Article
Language:	English
Subjects:	Algorithms; Cancer; Carcinoma, Non-Small-Cell Lung - genetics; Carcinoma, Non-Small-Cell Lung - metabolism; Carcinoma, Non-Small-Cell Lung - pathology; Cell size; Cell survival; Colorectal carcinoma; Colorectal Neoplasms - genetics; Colorectal Neoplasms - metabolism; Colorectal Neoplasms - pathology; Data Interpretation, Statistical; Humans; Lung cancer; Lung Neoplasms - genetics; Lung Neoplasms - metabolism; Lung Neoplasms - pathology; Multiplexing; Neoplasms - genetics; Neoplasms - metabolism; Non-small cell lung carcinoma; Performance evaluation; Problem Solving Protocol; Proteomics; Proteomics - methods; Resampling; Small cell lung carcinoma; Spatial data; Statistical analysis; Summaries; Triple Negative Breast Neoplasms - genetics; Triple Negative Breast Neoplasms - metabolism; Triple Negative Breast Neoplasms - pathology; Tumors
Online access:	Full text

Description
Summary:	Multiplexed spatial proteomics reveals the spatial organization of cells in tumors, which is associated with important clinical outcomes such as survival and treatment response. This spatial organization is often summarized using spatial summary statistics, including Ripley’s K and Besag’s L. However, if multiple regions of the same tumor are imaged, it is unclear how to synthesize the relationship with a single patient-level endpoint. We evaluate extant approaches for accommodating multiple images within the context of associating summary statistics with outcomes. First, we consider averaging-based approaches wherein multiple summaries for a single sample are combined in a weighted mean. We then propose a novel class of ensemble testing approaches in which we simulate random weights used to aggregate summaries, test for an association with outcomes, and combine the $P$-values. We systematically evaluate the performance of these approaches via simulation and application to data from non-small cell lung cancer, colorectal cancer, and triple negative breast cancer. We find that the optimal strategy varies, but a simple weighted average of the summary statistics based on the number of cells in each image often offers the highest power and controls type I error effectively. When the size of the imaged regions varies, incorporating this variation into the weighted aggregation may yield additional power in cases where the varying size is informative. Ensemble testing (but not resampling) offered high power and type I error control across conditions in our simulated data sets.
ISSN:	1467-5463, 1477-4054
DOI:	10.1093/bib/bbae522
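
The two aggregation strategies named in the summary (a cell-count-weighted mean, and ensemble testing with random weights) can be illustrated with a minimal Python sketch. This is not the authors' implementation. Everything the summary does not state is an assumption made for illustration: the function names, a continuous patient-level outcome, a Pearson correlation test with Fisher's z approximation as the association test, exponential draws as the random weights, and the Cauchy combination rule for merging the P-values.

```python
import math
import random


def weighted_mean(values, weights):
    """Weighted mean of one patient's per-image summary statistics."""
    total = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total


def aggregate_by_cell_count(summaries, cell_counts):
    """One value per patient: image summaries weighted by cells per image."""
    return [weighted_mean(s, n) for s, n in zip(summaries, cell_counts)]


def pearson_p_value(x, y):
    """Two-sided p-value for zero correlation (Fisher z, normal approximation).

    Assumption: stands in for whatever outcome model is appropriate
    (e.g. a survival model); requires more than three patients.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    r = sxy / math.sqrt(sxx * syy)
    r = max(min(r, 0.999999), -0.999999)  # keep atanh finite
    z = math.atanh(r) * math.sqrt(n - 3)
    return math.erfc(abs(z) / math.sqrt(2))


def cauchy_combine(p_values):
    """Combine p-values with the Cauchy combination rule (assumed choice)."""
    clipped = [min(max(p, 1e-15), 1 - 1e-15) for p in p_values]
    t = sum(math.tan((0.5 - p) * math.pi) for p in clipped) / len(clipped)
    return 0.5 - math.atan(t) / math.pi


def ensemble_test(summaries, outcome, n_draws=200, seed=0):
    """Draw random weights, aggregate, test, and combine the p-values."""
    rng = random.Random(seed)
    p_values = []
    for _ in range(n_draws):
        aggregated = []
        for s in summaries:
            # Assumption: i.i.d. exponential weights, normalized inside
            # weighted_mean (equivalent to flat Dirichlet weights).
            w = [rng.expovariate(1.0) for _ in s]
            aggregated.append(weighted_mean(s, w))
        p_values.append(pearson_p_value(aggregated, outcome))
    return cauchy_combine(p_values)
```

Here `summaries` is assumed to hold one list per patient with one summary value per image (for example, Ripley's K evaluated at a fixed radius), `cell_counts` has the same shape, and `outcome` holds one number per patient. The cell-count-weighted values from `aggregate_by_cell_count` can be passed to `pearson_p_value` directly, while `ensemble_test` returns a single combined P-value.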