Analysis of Updated Literature Data up to 2019 on the Oxidative Coupling of Methane Using an Extrapolative Machine‐Learning Method to Identify Novel Catalysts

We have constructed and analyzed an updated dataset consisting of 4759 experimental datapoints for the oxidative coupling of methane (OCM) reaction based on literature data reported before 2020 (∼2019) using machine learning (ML) methods. Several ML methods, including random forest regression (RFR),...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ChemCatChem 2021-08, Vol.13 (16), p.3636-3655
Hauptverfasser: Mine, Shinya, Takao, Motoshi, Yamaguchi, Taichi, Toyao, Takashi, Maeno, Zen, Hakim Siddiki, S. M. A., Takakusagi, Satoru, Shimizu, Ken‐ichi, Takigawa, Ichigaku
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We have constructed and analyzed an updated dataset consisting of 4759 experimental datapoints for the oxidative coupling of methane (OCM) reaction based on literature data reported before 2020 (∼2019) using machine learning (ML) methods. Several ML methods, including random forest regression (RFR), extra trees regression (ETR), and gradient boosting regression with XGBoost (XGB), were used in conjunction with our proposed approach, in which elemental features are used as input representations rather than inputting the catalyst compositions directly. A recent research trend, namely, the extensive exploration of Mn/Na2WO4/SiO2 catalyst systems in recent years due to their high activity and durability, was clearly reflected in the dataset analysis. An ML model for the prediction of the reaction outcome (C2 yield) was successfully developed, and feature importance scores and SHapley Additive exPlanations (SHAP) values were calculated based on ETR and XGB, respectively, to identify the input variables with the greatest influence on the catalyst performance and observe how these important variables affect the C2 yield in the OCM. The discovery and optimization of catalytic processes using ML as a “surrogate” model were explored, and promising catalytic system candidates for the OCM reaction were identified. Notably, the developed ML model predicted catalysts containing elements that do not appear in the OCM dataset. This clearly demonstrates desirably high potential of our ML model to enable extrapolative predictions for ML‐aided future catalysis research. Machine learning: An updated dataset of the catalytic oxidative coupling of methane (OCM) consisting of nearly 5000 experimental datapoints has been compiled and analyzed using statistical methods based on machine learning (ML). A new explorative ML approach that considers elemental features as input representations (named as Sorted Weighted Elemental Descriptor (SWED) representation) rather than inputting the catalyst compositions directly was developed. This ML method allows predicting novel promising catalyst candidates that include elements unseen in the original dataset.
ISSN:1867-3880
1867-3899
DOI:10.1002/cctc.202100495