Predicting the microalgae lipid profile obtained by supercritical fluid extraction using a machine learning model

In this study a Machine Learning model was employed to predict the lipid profile from supercritical fluid extraction (SFE) of microalgae sp. USBA-GBX-832 under different temperature (40, 50, 60°C), pressure (150, 250 bar), and ethanol flow (0.6, 0.9 mL min ) conditions. Six machine learning regressi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Frontiers in chemistry 2024-10, Vol.12, p.1480887
Hauptverfasser: Rangel Pinto, Juan David, Guerrero, Jose L, Rivera, Lorena, Parada-Pinilla, María Paula, Cala, Mónica P, López, Gina, González Barrios, Andrés Fernando
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this study a Machine Learning model was employed to predict the lipid profile from supercritical fluid extraction (SFE) of microalgae sp. USBA-GBX-832 under different temperature (40, 50, 60°C), pressure (150, 250 bar), and ethanol flow (0.6, 0.9 mL min ) conditions. Six machine learning regression models were trained using 33 independent variables: 29 from RD-Kit molecular descriptors, three from the extraction conditions, and the infinite dilution activity coefficient (IDAC). The lipidomic characterization analysis identified 139 features, annotating 89 lipids used as the entries of the model, primarily glycerophospholipids and glycerolipids. It was proposed a methodology for selecting the representative lipids from the lipidomic analysis using an unsupervised learning method, these results were compared with Tanimoto scores and IDAC calculations using COSMO-SAC-HB2 model. The models based on decision trees, particularly XGBoost, outperformed others (RMSE: 0.035, 0.095, 0.065 and coefficient of determination (R ): 0.971, 0.933, 0.946 for train, test and experimental validation, respectively), accurately predicting lipid profiles for unseen conditions. Machine Learning methods provide a cost-effective way to optimize SFE conditions and are applicable to other biological samples.
ISSN:2296-2646
2296-2646
DOI:10.3389/fchem.2024.1480887