Accurate band gap prediction based on an interpretable Δ-machine learning

Most materials science datasets are not so large that the accuracy of machine learning (ML) models is relatively limited if only simple features are used. Here, we constructed an interpretable ∆-machine learning (∆-ML) model to connect the hybrid functional HSE bandgap (EgHSE) with the PBE functiona...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Materials today communications 2022-12, Vol.33, p.104630, Article 104630
Hauptverfasser: Zhang, Lingyao, Su, Tianhao, Li, Musen, Jia, Fanhao, Hu, Shuobo, Zhang, Peihong, Ren, Wei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Most materials science datasets are not so large that the accuracy of machine learning (ML) models is relatively limited if only simple features are used. Here, we constructed an interpretable ∆-machine learning (∆-ML) model to connect the hybrid functional HSE bandgap (EgHSE) with the PBE functional bandgap (EgPBE). The former can reproduce the band gap comparable with experiments, but the computational cost is much more challenging. The training is based on our high-throughput calculations on a set of two-dimensional semiconductors. Four complex descriptors, all based on the EgPBE are constructed using the sure independence screening and sparsifying operator (SISSO) algorithm. Using these descriptors, the ∆-ML can accurately predict the EgHSE of test set with a determination coefficient (R2) of 0.96. The error satisfies a normal distribution with a mean of zero. We provide a direct functional relationship between input descriptors and target properties. We find that EgHSE and the 5/6th power of EgPBE show a significant linear correlation, which may guide rapid prediction of EgHSE from EgPBE for materials with a EgHSE greater than 0.22 eV. We also discussed the correlation between the atomic radius and the EgHSE. Our work will provide an effective and interpretable model to construct the optimal physical descriptors for ML prediction on bandgaps in screening massive new 2D materials research. [Display omitted] •Constructing an interpretable ∆-machine learning (∆-ML) model to connect the hybrid functional EgHSE with the EgPBE.•SISSO descriptor D3=EgPBE5/6 can predict the EgHSE of 2D-semiconductors using equation EgHSE = D3 × 1.55 + 0.22.•SISSO descriptor D1shows the atomic volume negatively correlated to EgHSE.
ISSN:2352-4928
2352-4928
DOI:10.1016/j.mtcomm.2022.104630