Data-driven pipeline modeling for predicting unknown protein adulteration in dairy products

To preemptively predict unknown protein adulterants in food and curb the incidence of food fraud at its origin, data-driven models were developed using three machine learning (ML) algorithms. Among these, the random forest (RF)-based model achieved optimal performance, achieving accuracies of 96.2 %...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Food chemistry 2024-12, Vol.471, p.142736, Article 142736
Hauptverfasser: Yang, Huihui, Wang, Yutang, Zhao, Jinyong, Li, Ping, Li, Zhixiang, Li, Long, Fan, Bei, Wang, Fengzhong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:To preemptively predict unknown protein adulterants in food and curb the incidence of food fraud at its origin, data-driven models were developed using three machine learning (ML) algorithms. Among these, the random forest (RF)-based model achieved optimal performance, achieving accuracies of 96.2 %, 95.1 %, and 88.0 % in identifying odorless, tasteless, and colorless adulterants, respectively. These optimal models are then applied to implement external prediction, ultimately predicting 51 potential adulterants. From these, two cost-effective candidates were selected for adulteration tests. While there was no significant sensory difference between adulterated and unadulterated milk powder, the protein content in the adulterated milk powder increased. This study offers a proactive strategy to combat food fraud effectively. •Develop a novel workflow for predicting potential protein adulterants.•Collection of four distinct types of datasets for food adulterant screening.•A total of 51 adulterants were predicted, and two substances were verified.
ISSN:0308-8146
1873-7072
1873-7072
DOI:10.1016/j.foodchem.2024.142736