Predictive capability of rough set machine learning in tetracycline adsorption using biochar

Machine learning algorithms investigate relationships in data to deliver useful outputs. However, past models required complete datasets as a prerequisite. In this study, rough set-based machine learning was applied using real-world incomplete datasets to generate a prediction model of biochar’s ads...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Carbon Research 2024-05, Vol.3 (1), Article 48
Hauptverfasser: Balasubramanian, Paramasivan, Prabhakar, Muhil Raj, Liu, Chong, Zhang, Pengyan, Li, Fayong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Machine learning algorithms investigate relationships in data to deliver useful outputs. However, past models required complete datasets as a prerequisite. In this study, rough set-based machine learning was applied using real-world incomplete datasets to generate a prediction model of biochar’s adsorption capacity based on key attributes. The predictive model consists of if–then rules classifying properties by fulfilling certain conditions. The rules generated from both complete and incomplete datasets exhibit high certainty and coverage, along with scientific coherence. Based on the complete dataset model, optimal pyrolysis conditions, biomass characteristics and adsorption conditions were identified to maximize tetracycline adsorption capacity (> 200 mg/g) by biochar. This study demonstrates the capabilities of rough set-based machine learning using incomplete practical real-world data without compromising key features. The approach can generate valid predictive models even with missing values in datasets. Overall, the preliminary results show promise for applying rough set machine learning to real-world, incomplete data for generating biomass and biochar predictive models. However, further refinement and testing are warranted before practical implementation. Highlights • It is the first explainable AI-based rough set model to study the tetracycline adsorption capacity of biochar. • Usage of an incomplete Practical dataset through RSML evaded the biasness due to imputations. • Higher accuracy and precision of incomplete Practical datasets revealed the uniqueness of the model. Graphical Abstract
ISSN:2731-6696
2731-6696
DOI:10.1007/s44246-024-00129-w