Efficient structure-informed featurization and property prediction of ordered, dilute, and random atomic structures

Structure-informed materials informatics is a rapidly evolving discipline of materials science relying on the featurization of atomic structures or configurations to construct vector, voxel, graph, graphlet, and other representations useful for machine learning prediction of properties, fingerprinti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computational materials science 2025-01, Vol.247 (C), p.113495, Article 113495
Hauptverfasser: Krajewski, Adam M., Siegel, Jonathan W., Liu, Zi-Kui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Structure-informed materials informatics is a rapidly evolving discipline of materials science relying on the featurization of atomic structures or configurations to construct vector, voxel, graph, graphlet, and other representations useful for machine learning prediction of properties, fingerprinting, and generative design. This work discusses how current featurizers typically perform redundant calculations and how their efficiency could be improved by considering (1) fundamentals of crystallographic (orbits) equivalency to optimize ordered structures and (2) representation-dependent equivalency to optimize dilute, doped, and defect structures with broken symmetry. It also discusses and contrasts ways of (3) approximating random solid solutions occupying arbitrary lattices under such representations. Efficiency improvements discussed in this work were implemented within ▪ or python toolset for Structure-Informed Property and Feature Engineering with Neural Networks developed by authors since 2019 and shown to increase performance from 2 to 10 times for typical inputs. Throughout this work, the authors explicitly discuss how these advances can be applied to different kinds of similar tools in the community. [Display omitted] •▪ is an open toolset for structure-informed property and feature engineering.•Modular build enables both easy extensions and integration into external libraries.•▪ featurizer multiplies machine learning throughput using symmetry-equivalency.•▪ enables further optimizations by considering representation-equivalency.•▪ _ ▪ considers chemistry and geometry to featurize solid solutions.
ISSN:0927-0256
DOI:10.1016/j.commatsci.2024.113495