A regularised logistic regression model with structured features for classification of geographical origin in olive oils

Geographical origin of extra virgin olive oil is a factor that consumers may take into account when making purchasing decisions. Oils that are labelled to be from regions famous for olive cultivation may be assumed to be of higher quality. However, difficulties in the authentication of the geographi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Chemometrics and intelligent laboratory systems 2023-06, Vol.237, p.104819, Article 104819
Hauptverfasser: Soh, Chin Gi, Zhu, Ying, Toh, Tin Lam
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Geographical origin of extra virgin olive oil is a factor that consumers may take into account when making purchasing decisions. Oils that are labelled to be from regions famous for olive cultivation may be assumed to be of higher quality. However, difficulties in the authentication of the geographical origin of olive oils arise due to the similarity in chemical compositions of the oils involved. Fourier-transform infrared (FTIR) spectroscopy has been found to be a viable technology for the classification of oil samples by geographical origin. However, classical methods involving dimension reduction before model fitting usually yield models that are more challenging to interpret. Sparse fused group lasso logistic regression (SFGL-LR) is used with FTIR spectroscopic data to discriminate between Greek and non-Greek organic extra-virgin olive oils. The prediction performance is also compared with that obtained by partial least squares linear discriminant analysis (PLS-LDA). While both methods give comparable good prediction performance, with more than 90% accuracy in classification, the SFGL-LR model demonstrates improvements in the interpretability of the model coefficients. •A regularised logistic regression model for classification using spectroscopic data•ADMM algorithm coupled with BFGS algorithm for solving the optimization problem•Accurately distinguish between Greek and non-Greek organic extra-virgin olive oils•Selected spectral features are interpretable regarding chemical functional groups
ISSN:0169-7439
1873-3239
DOI:10.1016/j.chemolab.2023.104819