Predicting the Yield of Pd‐Catalyzed Buchwald–Hartwig Amination Using Machine Learning with Extended Molecular Fingerprints and Selected Physical Parameters
Machine learning has gained attention due to its ongoing advancements and diverse applications. Within the field of homogeneous catalysis, a prominent area of research in machine learning revolves around predicting reaction yield in Pd‐catalyzed Buchwald–Hartwig amination reactions. This study sough...
Gespeichert in:
Veröffentlicht in: | ChemistrySelect (Weinheim) 2024-09, Vol.9 (33), p.n/a |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Machine learning has gained attention due to its ongoing advancements and diverse applications. Within the field of homogeneous catalysis, a prominent area of research in machine learning revolves around predicting reaction yield in Pd‐catalyzed Buchwald–Hartwig amination reactions. This study sought to determine the optimal descriptors for representing the both structural and physical information associated with the reaction, particularly focusing on product details. To achieve this, we assessed the utilization of product extended molecular fingerprints (PEMF) and selected physical parameters (SPP). The utilization of a random forest model incorporating these descriptors yielded promising results in the prediction of reaction yields in Pd‐catalyzed Buchwald–Hartwig amination reactions. The model achieved an impressive R2 value of 0.943. Data preprocessing on PEMF and sorting preprocessing on physical parameters resulted in a significant reduction in data size to 259 bits PEMF+2 SPPs per prediction, much less than the two previous random forest models which utilized 480 physical parameters and 21,073 bits molecular fingerprints. Although establishing definitive correlations between SPPs and reaction yield presented challenges, our findings indicate that the presence of heavier atoms in the aryl halides may have a beneficial impact within the examined Pd‐catalyzed Buchwald–Hartwig amination reactions, as compared to their analogues.
This work focuses on balancing computation cost and model accuracy through Machine Learning for forecasting the reaction yield in Pd‐catalysed Buchwald–Hartwig amination reactions. The study evaluated the effectiveness of product extended molecular fingerprints (PEMF) and selected physical parameters (SPP) as critical features. The model achieved an R2 value of 0.943 using a smaller dataset. Further exploration of the reaction mechanisms reveals that the presence of heavier atoms in the aryl halides may have a beneficial impact within the examined Pd‐catalysed Buchwald–Hartwig amination reactions. |
---|---|
ISSN: | 2365-6549 2365-6549 |
DOI: | 10.1002/slct.202402529 |