Deep-Learning-Based End-to-End Predictions of CO 2 Capture in Metal-Organic Frameworks

Metal-organic frameworks (MOFs) have become an active topic because of their excellent carbon capture and storage (CCS) properties. However, it is quite challenging to identify MOFs with superior performance within a massive combinatorial search space. To this end, we propose a deep-learning-based e...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of chemical information and modeling 2022-07, Vol.62 (14), p.3281-3290
Hauptverfasser: Lu, Cunxing, Wan, Xili, Ma, Xuhao, Guan, Xinjie, Zhu, Aichun
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Metal-organic frameworks (MOFs) have become an active topic because of their excellent carbon capture and storage (CCS) properties. However, it is quite challenging to identify MOFs with superior performance within a massive combinatorial search space. To this end, we propose a deep-learning-based end-to-end prediction model to rapidly and accurately predict the CO working capacity and CO /N selectivity of a given MOF under low-pressure conditions. Different from previous methods, our prediction model relies only on the data from the Crystallographic Information File (CIF) rather than handcrafted geometric descriptors and chemical descriptors. The model was developed, trained, and tested on a dataset of 342489 topologically diverse MOFs. Experimental results on the dataset show that the proposed model achieves high prediction performance, i.e., = 0.916 for predicting the CO working capacity and = 0.911 for predicting the CO /N selectivity. With regard to the identification of potential high-performing MOFs, 1020 of 1027 (top 3%) high-performance MOFs were recovered while screening only 12% of the entire dataset using our provided pretrained model, reducing the computation time by nearly an order of magnitude when the model was used to prescreen material prior to computationally intensive grand canonical Monte Carlo (GCMC) simulations while still capturing 99% of the high-performance MOFs. In the ab initio training task, the method can achieve = 0.85 with only 20% of the labeled data used for training and recover 995 of 1027 (top 3%) high-performance MOFs with only 12% of the entire dataset screened.
ISSN:1549-9596
1549-960X
DOI:10.1021/acs.jcim.2c00092