Robust preprocessing and model selection for spectral data

Bibliographic details
Published in: Journal of Chemometrics, 2012-06, Vol. 26 (6), p. 282-289
Main authors: Verboven, Sabine; Hubert, Mia; Goos, Peter
Format: Article
Language: English
Online access: Full text
Description
Summary: To calibrate spectral data, one typically starts with preprocessing the spectra and then applies a multivariate calibration method such as principal component regression or partial least squares regression. In the model selection step, the optimal number of latent variables is determined in order to minimize the prediction error. To protect the analysis against the harmful influence of possible outliers in the data, robust calibration methods have been developed. In this paper, we focus on the preprocessing and the model selection step. We propose several robust preprocessing methods as well as robust measures of the root mean squared error of prediction (RMSEP). To select the optimal preprocessing method, we summarize the results for the different RMSEP values by means of a desirability index, which is a concept from industrial quality control. These robust RMSEP values are also used to select the optimal number of latent variables. We illustrate our newly developed techniques through the analysis of a real data set containing near-infrared measurements of samples of animal feed. © 2012 John Wiley & Sons, Ltd.
ISSN: 0886-9383
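
Illustrative sketch (not part of the catalog record): the summary mentions robust RMSEP measures and a desirability index but does not give their definitions. The Python sketch below shows one plausible reading under stated assumptions. The trimmed RMSEP (dropping the largest squared prediction errors), the linear smaller-is-better desirability, the geometric-mean index, and all function names and parameters (`trimmed_rmsep`, `desirability_index`, `trim`, `low`, `high`) are hypothetical choices made here for illustration and are not confirmed to be the authors' definitions.

```python
import numpy as np


def rmsep(y_true, y_pred):
    """Classical root mean squared error of prediction."""
    res = np.asarray(y_true, float) - np.asarray(y_pred, float)
    return float(np.sqrt(np.mean(res ** 2)))


def trimmed_rmsep(y_true, y_pred, trim=0.1):
    """RMSEP after discarding the largest `trim` fraction of squared errors.

    Assumption: trimming is one simple way to make RMSEP robust;
    the paper may define its robust measures differently.
    """
    res = np.asarray(y_true, float) - np.asarray(y_pred, float)
    sq = np.sort(res ** 2)
    keep = len(sq) - int(np.floor(trim * len(sq)))
    return float(np.sqrt(np.mean(sq[:keep])))


def desirability_index(values, low, high):
    """Geometric mean of linear smaller-is-better desirabilities.

    Each value maps to 1 at or below `low` and to 0 at or above `high`
    (assumed thresholds, chosen by the analyst).
    """
    v = np.asarray(values, float)
    d = np.clip((high - v) / (high - low), 0.0, 1.0)
    return float(np.prod(d) ** (1.0 / len(d)))


def best_number_of_components(rmsep_by_k):
    """Return the number of latent variables with the smallest RMSEP.

    `rmsep_by_k` maps a candidate number of components to its RMSEP value.
    """
    return min(rmsep_by_k, key=rmsep_by_k.get)


if __name__ == "__main__":
    # Synthetic data: nine good predictions and one gross outlier.
    y_true = np.arange(1.0, 11.0)
    y_pred = y_true + 0.1
    y_pred[0] = 50.0
    print("classical RMSEP:", rmsep(y_true, y_pred))
    print("trimmed RMSEP:  ", trimmed_rmsep(y_true, y_pred, trim=0.1))
    print("desirability:   ", desirability_index([0.2, 0.4], low=0.0, high=1.0))
    print("best k:         ", best_number_of_components({1: 0.9, 2: 0.4, 3: 0.5}))
```

On the synthetic data, the single outlier dominates the classical RMSEP, while the trimmed version reflects the error of the remaining samples. This is the kind of sensitivity that robust measures are meant to avoid.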