Contribution of low-level image statistics to EEG decoding of semantic content in multivariate and univariate models with feature optimization

•Optimized univariate models outperform multivariate models in EEG visual decoding.•Model performance is confounded by low-level image features.•Model optimization can increase the sensitivity of model towards this confound.•Low-level image feature confounds also appear when decoding of concept cate...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	NeuroImage (Orlando, Fla.) Fla.), 2024-06, Vol.293, p.120626-120626, Article 120626
Hauptverfasser:	Lützow Holm, Eric, Fernández Slezak, Diego, Tagliazucchi, Enzo
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Datasets Decoding EEG Electroencephalography Feature selection Frequency dependence Hypotheses Image statistics Medical imaging Neuroimaging Object recognition Optimization Paradigms Psychophysics Semantics Statistical analysis Statistics Visual stimuli
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•Optimized univariate models outperform multivariate models in EEG visual decoding.•Model performance is confounded by low-level image features.•Model optimization can increase the sensitivity of model towards this confound.•Low-level image feature confounds also appear when decoding of concept categories. Spatio-temporal patterns of evoked brain activity contain information that can be used to decode and categorize the semantic content of visual stimuli. However, this procedure can be biased by low-level image features independently of the semantic content present in the stimuli, prompting the need to understand the robustness of different models regarding these confounding factors. In this study, we trained machine learning models to distinguish between concepts included in the publicly available THINGS-EEG dataset using electroencephalography (EEG) data acquired during a rapid serial visual presentation paradigm. We investigated the contribution of low-level image features to decoding accuracy in a multivariate model, utilizing broadband data from all EEG channels. Additionally, we explored a univariate model obtained through data-driven feature selection applied to the spatial and frequency domains. While the univariate models exhibited better decoding accuracy, their predictions were less robust to the confounding effect of low-level image statistics. Notably, some of the models maintained their accuracy even after random replacement of the training dataset with semantically unrelated samples that presented similar low-level content. In conclusion, our findings suggest that model optimization impacts sensitivity to confounding factors, regardless of the resulting classification performance. Therefore, the choice of EEG features for semantic decoding should ideally be informed by criteria beyond classifier performance, such as the neurobiological mechanisms under study.
ISSN:	1053-8119 1095-9572
DOI:	10.1016/j.neuroimage.2024.120626