Can machine learning find extraordinary materials?
Published in: | Computational materials science 2020-03, Vol.174, p.109498, Article 109498 |
---|---|
Main authors: | , , , |
Format: | Article |
Language: | eng |
Keywords: | |
Online access: | Full text |
Abstract: |
• Machine learning can extrapolate to extraordinary materials within the AFLOW dataset.
• Treating extrapolation as a classification task can often improve results.
• Results suggest machine learning may be a useful screening tool.
One of the most common criticisms of machine learning is an assumed inability of models to extrapolate, i.e., to identify extraordinary materials with properties beyond those present in the training data set. This work uses properties calculated with density functional theory (bulk modulus, shear modulus, thermal conductivity, thermal expansion, band gap, and Debye temperature) to investigate whether machine learning is truly capable of predicting materials with properties that extend beyond previously seen values. We refer to these materials as extraordinary, meaning they represent the top 1% of values in the available data set. Interestingly, we show that even when machine learning is trained on a fraction of the bottom 99%, we can consistently identify 3/4 of the highest-performing compositions for all considered properties with a precision that is typically above 0.5. We explore model performance as the extrapolation distance is increased in various ways, including the introduction of a gap, the removal of certain elements, and the removal of certain structure types. Moreover, we investigate a few different modeling choices and demonstrate how a classification approach can identify an equivalent number of extraordinary compounds but with significantly fewer false positives than a regression approach. Finally, we discuss cautions and potential limitations in implementing such an approach to discover new record-breaking materials. |
---|---|
ISSN: | 0927-0256 1879-0801 |
DOI: | 10.1016/j.commatsci.2019.109498 |
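
The evaluation setup described in the abstract (train on part of the bottom 99%, then check whether the top 1% can be recovered) can be illustrated with a toy sketch. This is not the authors' pipeline: the synthetic data, the one-feature linear model, and the helper names `top_threshold`, `fit_line`, and `precision_recall` are all assumptions chosen only to keep the example self-contained.

```python
import random

def top_threshold(values, top_fraction=0.01):
    """Smallest value that still belongs to the top `top_fraction` of `values`."""
    ranked = sorted(values)
    n_top = max(1, int(round(len(ranked) * top_fraction)))
    return ranked[-n_top]

def fit_line(xs, ys):
    """Ordinary least squares for y = a*x + b (a stand-in for any regressor)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, mean_y - a * mean_x

def precision_recall(predicted, actual):
    """Precision and recall for boolean 'extraordinary' labels."""
    tp = sum(p and a for p, a in zip(predicted, actual))
    fp = sum(p and not a for p, a in zip(predicted, actual))
    fn = sum(a and not p for p, a in zip(predicted, actual))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

# Synthetic stand-in data (assumption): one descriptor x, one property y.
rng = random.Random(0)
xs = [rng.uniform(0.0, 10.0) for _ in range(2000)]
ys = [3.0 * x + rng.gauss(0.0, 1.0) for x in xs]

# Label the top 1% of property values as "extraordinary".
threshold = top_threshold(ys)
is_extraordinary = [y >= threshold for y in ys]

# Train only on a fraction of the ordinary (bottom 99%) samples.
ordinary = [i for i, flag in enumerate(is_extraordinary) if not flag]
rng.shuffle(ordinary)
train_idx = ordinary[: len(ordinary) // 2]
train_set = set(train_idx)
test_idx = [i for i in range(len(xs)) if i not in train_set]

a, b = fit_line([xs[i] for i in train_idx], [ys[i] for i in train_idx])

# Regression used as a screen: flag test samples predicted at or above the threshold.
predicted = [a * xs[i] + b >= threshold for i in test_idx]
actual = [is_extraordinary[i] for i in test_idx]
precision, recall = precision_recall(predicted, actual)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Because every training target lies below the threshold, any correctly flagged test sample is a genuine extrapolation. A classification variant would instead train a classifier on labels defined by a lower cut-off within the training data and apply it to the held-out set; that variant is not shown here.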