Coffee authentication via targeted metabolomics and machine learning: Unveiling origins and their discriminating biochemicals

Coffee is an export commodity that is prone to fraudulent practices. Therefore, this study presents a novel approach to authenticate coffee origins using targeted metabolomics with gas chromatography-tandem mass spectrometry (GC-MS/MS) and machine learning models. A total of 200 coffee samples from...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Food bioscience 2023-12, Vol.56, p.103122, Article 103122
Hauptverfasser: Aurum, Fawzan Sigma, Zaman, Muhammad Zukhrufuz, Purwanto, Edi, Praseptiangga, Danar, Nakano, Kohei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Coffee is an export commodity that is prone to fraudulent practices. Therefore, this study presents a novel approach to authenticate coffee origins using targeted metabolomics with gas chromatography-tandem mass spectrometry (GC-MS/MS) and machine learning models. A total of 200 coffee samples from different harvest years and areas from Indonesia were extracted using the derivatisation method and then analysed for their metabolite profiles. Several supervised machine-learning models were tested to classify coffee origins and discover their potential markers. The study found various metabolite features spanning diverse chemical classes, encompassing sugar alcohols, carbohydrates, amino acids, organic acids, fatty acids, and phenols. Random forest (RF) and partial least squares discriminant analysis (PLS-DA) were among the most accurate models in predicting the origin of coffee from several classes in the validation dataset. The accuracy of both models is in the range of 91%–100%. Furthermore, this study proposes a new strategy for determining “intersection features” as the set of features that are important and common to both RF and PLS-DA models, thereby providing a robust selection of coffee origin markers. Overall, the approach and findings of this study have far-reaching implications for coffee authentication. •Metabolomics coupled with machine learning approaches can predict the origin of coffee.•Random forest and PLS-DA show high accuracy in classifying the origin of coffee.•Important metabolite features were assigned to each coffee origin.
ISSN:2212-4292
2212-4306
DOI:10.1016/j.fbio.2023.103122