TPE-xgboost for explainable predictions of concrete compressive strength considering compositions, and mechanical and microstructure properties of testing samples

The use of machine learning (ML) for predicting concrete compressive strength (CCS) has shown promising and accurate results, making it a valuable tool in the field. However, efficient prediction requires not only a robust ML approach but also a comprehensive and well-curated dataset. This study add...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Construction & building materials 2024-12, Vol.457, p.139398, Article 139398
Hauptverfasser: Akber, Muhammad Zeshan, Anwar, Ghazanfar Ali, Chan, Wai-Kit, Lee, Hiu-Hung
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The use of machine learning (ML) for predicting concrete compressive strength (CCS) has shown promising and accurate results, making it a valuable tool in the field. However, efficient prediction requires not only a robust ML approach but also a comprehensive and well-curated dataset. This study addresses this challenge by investigating an extensive and integrated dataset of (1) concrete compositions (mixture proportions) and (2) testing conditions (mechanical and microstructure properties of concrete testing samples), containing 1525 observations relating to 39 parameters. On algorithms side, this research proposes novel tree-structured parzen estimator based extreme gradient boosting (TPE-xgboost) for accurate and confident CCS predictions. Moreover, SHapley Additive exPlanations (SHAP) analysis was performed to provide a comprehensive understanding of feature importance, dependencies, and interactions. Compared to prior research, this study demonstrates significant improvements in CCS prediction accuracy for separate investigations on concrete compositions (3.77 %) and mechanical and microstructure properties (28.57 %), while maintaining reasonable accuracy for the integrated dataset despite its complexity and sparseness. SHAP analysis reveals key influential factors, including age and water-to-binder ratio in concrete composition, and peak load and height of testing samples, as well as microstructural characteristics such as mean values of global autocorrelation length and its integral range. This research provides valuable insights into model optimization, predictive performance, and feature dependencies and interactions, contributing to the development of more accurate and reliable CCS prediction models. •Predicted compressive strength on enhanced dataset with 39 features.•Proposed xgboost for accurate and confident predictions on sparse dataset.•Used Tree-structured parzen estimator (TPE) approach for optimization of xgboost.•SHAP analysis reveals key influential features and interactions.
ISSN:0950-0618
DOI:10.1016/j.conbuildmat.2024.139398