Deriving equation from data via knowledge discovery and machine learning: A study of Young’s modulus of Ti-Nb alloys

[Display omitted] •A property-targeted machine learning regression model can discover knowledge inside materials datasets and facilitate the machine learning process.•A framework of unearthing inner relationships is summarized, where domain knowledge runs through and machine learning provides insigh...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computational materials science 2023-09, Vol.228, p.112349, Article 112349
Hauptverfasser: Zhang, Huiran, Liu, Xi, Zhang, Guangjie, Zhu, Yuquan, Li, Shengzhou, Qian, Quan, Dai, Dongbo, Che, Renchao, Xu, Tao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •A property-targeted machine learning regression model can discover knowledge inside materials datasets and facilitate the machine learning process.•A framework of unearthing inner relationships is summarized, where domain knowledge runs through and machine learning provides insights.•An accurate, interpretable, and rational equation for Young’s modulus of Ti-Nb alloys is found, analyzed, and validated based on a customized workflow (come from the proposed framework). Acquiring knowledge and assisting materials design from computing and experimental data is very interesting and important at the intersection of materials and data science. In this work, we construct a whole framework to find an easy-to-interpret equation by taking a study of Young’s modulus of Ti-Nb alloys as an example. Here, we used Young’s modulus-targeted regression model (decision tree) to find a potential rule for classifying the dataset. The explicit equation was constructed through machine learning (ML) and validated based on physical laws. The transferability of the equation stood out after comparing it with five ML models including support vector machine (SVR), linear regression (LR), k-nearest neighbor (KNN), decision tree (DT), and random forest (RF). The results prove that our method indeed helps us to discover the contained knowledge hidden in the data and to uncover the relationship between features and property.
ISSN:0927-0256
1879-0801
DOI:10.1016/j.commatsci.2023.112349