Determination of the geographical origin of hazelnuts (Corylus avellana L.) by Near-Infrared spectroscopy (NIR) and a Low-Level Fusion with nuclear magnetic resonance (NMR)

[Display omitted] •Near-infrared spectroscopy has high potential in the determination of hazelnut origin.•Feature selection shows high relevance of proteins in classification.•Low-level data fusion of NIR and NMR outperforms the individual techniques. Fourier-transform near-infrared (FT-NIR) spectro...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Microchemical journal 2022-03, Vol.174, p.107066, Article 107066
Hauptverfasser: Shakiba, Navid, Gerdes, Annika, Holz, Nathalie, Wenck, Soeren, Bachmann, René, Schneider, Tobias, Seifert, Stephan, Fischer, Markus, Hackl, Thomas
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •Near-infrared spectroscopy has high potential in the determination of hazelnut origin.•Feature selection shows high relevance of proteins in classification.•Low-level data fusion of NIR and NMR outperforms the individual techniques. Fourier-transform near-infrared (FT-NIR) spectroscopy was used to determine the geographical origin of 233 hazelnut samples of various varieties from five different countries (Germany, France, Georgia, Italy, Turkey). The experimental determination of the geographical origin of hazelnuts is important, because there are usually large price differences between the producer countries and thus a risk of food fraud that should not be underestimated. The present work is a feasibility study using a low-cost method, as high-field NMR and UPLC-QTOF-MS have already been used for this question. Sample sets were split with repeated nested cross validation and an ensemble of discriminant classifiers with random subspaces was used to build the classification models. By using a preprocessing strategy consisting of multiplicative scatter correction, bucketing and the mean averaging of five measured spectra per sample, a test accuracy of 90.6 ± 3.9% was achieved, which rivals results obtained with much more expensive infrastructure. The application of the feature selection approach surrogate minimal depth showed that the successful classification is mainly caused by protein signals. In addition, a low-level data fusion of the NIR and NMR data was performed to assess how well the two methods complement each other. The data fusion was compared to a complementary approach, where the classification results based on the individual NIR and NMR models were jointly examined. The data fusion performed better than the individual methods with a test accuracy of 96.6 ± 2.8%. A comparison of the outliers in all classification models shows conspicuities in always the same samples, indicating that robust classification models are obtained.
ISSN:0026-265X
1095-9149
DOI:10.1016/j.microc.2021.107066