Making machine learning a useful tool in the accelerated discovery of transition metal complexes

As machine learning (ML) has matured, it has opened a new frontier in theoretical and computational chemistry by offering the promise of simultaneous paradigm shifts in accuracy and efficiency. Nowhere is this advance more needed, but also more challenging to achieve, than in the discovery of open‐s...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Wiley interdisciplinary reviews. Computational molecular science 2020-01, Vol.10 (1), p.e1439-n/a
1. Verfasser: Kulik, Heather J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 1
container_start_page e1439
container_title Wiley interdisciplinary reviews. Computational molecular science
container_volume 10
creator Kulik, Heather J.
description As machine learning (ML) has matured, it has opened a new frontier in theoretical and computational chemistry by offering the promise of simultaneous paradigm shifts in accuracy and efficiency. Nowhere is this advance more needed, but also more challenging to achieve, than in the discovery of open‐shell transition metal complexes. Here, localized d or f electrons exhibit variable bonding that is challenging to capture even with the most computationally demanding methods. Thus, despite great promise, clear obstacles remain in constructing ML models that can supplement or even replace explicit electronic structure calculations. In this article, I outline the recent advances in building ML models in transition metal chemistry, including the ability to approach sub‐kcal/mol accuracy on a range of properties with tailored representations, to discover and enumerate complexes in large chemical spaces, and to reveal opportunities for design through analysis of feature importance. I discuss unique considerations that have been essential to enabling ML in open‐shell transition metal chemistry, including (a) the relationship of data set size/diversity, model complexity, and representation choice, (b) the importance of quantitative assessments of both theory and model domain of applicability, and (c) the need to enable autonomous generation of reliable, large data sets both for ML model training and in active learning or discovery contexts. Finally, I summarize the next steps toward making ML a mainstream tool in the accelerated discovery of transition metal complexes. This article is categorized under: Electronic Structure Theory > Density Functional Theory Software > Molecular Modeling Computer and Information Science > Chemoinformatics An outlook on the challenges and opportunities for making machine learning models a mainstream tool in computational chemistry to accelerate discovery in open‐shell transition metal chemistry.
doi_str_mv 10.1002/wcms.1439
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2322033046</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2322033046</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3329-8b60284e31b88781af5de9f6117cabd8dc213dd70582fb47d005d2aad6772c4d3</originalsourceid><addsrcrecordid>eNp1kD1PwzAQhi0EElXpwD-wxMSQ1h9J7Iyo4ktqxQCI0Tj2hbokcbFTSv89CUVs3HKnV8_dSQ9C55RMKSFstjNNnNKUF0doREVWJETK9PhvFvkpmsS4Jn2lBWWcjtDrUr-79g032qxcC7gGHdoh0HgbodrWuPO-xq7F3QqwNgZqCLoDi62Lxn9C2GNf4S7oNrrO-RY30OkaG99saviCeIZOKl1HmPz2MXq-uX6a3yWLh9v7-dUiMZyzIpFlTphMgdNSSiGprjILRZVTKowurbSGUW6tIJlkVZkKS0hmmdY2F4KZ1PIxujjc3QT_sYXYqbXfhrZ_qRhnjHBO0rynLg-UCT7GAJXaBNfosFeUqMGhGhyqwWHPzg7sztWw_x9UL_Pl48_GN0zmdFs</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2322033046</pqid></control><display><type>article</type><title>Making machine learning a useful tool in the accelerated discovery of transition metal complexes</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Kulik, Heather J.</creator><creatorcontrib>Kulik, Heather J.</creatorcontrib><description>As machine learning (ML) has matured, it has opened a new frontier in theoretical and computational chemistry by offering the promise of simultaneous paradigm shifts in accuracy and efficiency. Nowhere is this advance more needed, but also more challenging to achieve, than in the discovery of open‐shell transition metal complexes. Here, localized d or f electrons exhibit variable bonding that is challenging to capture even with the most computationally demanding methods. Thus, despite great promise, clear obstacles remain in constructing ML models that can supplement or even replace explicit electronic structure calculations. In this article, I outline the recent advances in building ML models in transition metal chemistry, including the ability to approach sub‐kcal/mol accuracy on a range of properties with tailored representations, to discover and enumerate complexes in large chemical spaces, and to reveal opportunities for design through analysis of feature importance. I discuss unique considerations that have been essential to enabling ML in open‐shell transition metal chemistry, including (a) the relationship of data set size/diversity, model complexity, and representation choice, (b) the importance of quantitative assessments of both theory and model domain of applicability, and (c) the need to enable autonomous generation of reliable, large data sets both for ML model training and in active learning or discovery contexts. Finally, I summarize the next steps toward making ML a mainstream tool in the accelerated discovery of transition metal complexes. This article is categorized under: Electronic Structure Theory &gt; Density Functional Theory Software &gt; Molecular Modeling Computer and Information Science &gt; Chemoinformatics An outlook on the challenges and opportunities for making machine learning models a mainstream tool in computational chemistry to accelerate discovery in open‐shell transition metal chemistry.</description><identifier>ISSN: 1759-0876</identifier><identifier>EISSN: 1759-0884</identifier><identifier>DOI: 10.1002/wcms.1439</identifier><language>eng</language><publisher>Hoboken, USA: Wiley Periodicals, Inc</publisher><subject>Accuracy ; Artificial intelligence ; Chemistry ; Computational chemistry ; Computer applications ; Coordination compounds ; Datasets ; Density functional theory ; Electronic structure ; high‐throughput screening ; inorganic chemistry ; Learning algorithms ; Machine learning ; Metal complexes ; Metals ; Molecular modelling ; Molecular structure ; Organic chemistry ; Representations ; Theories ; Theory ; Training ; Transition metal compounds</subject><ispartof>Wiley interdisciplinary reviews. Computational molecular science, 2020-01, Vol.10 (1), p.e1439-n/a</ispartof><rights>2019 Wiley Periodicals, Inc.</rights><rights>2020 Wiley Periodicals, Inc.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3329-8b60284e31b88781af5de9f6117cabd8dc213dd70582fb47d005d2aad6772c4d3</citedby><cites>FETCH-LOGICAL-c3329-8b60284e31b88781af5de9f6117cabd8dc213dd70582fb47d005d2aad6772c4d3</cites><orcidid>0000-0001-9342-0191</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fwcms.1439$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fwcms.1439$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,776,780,1411,27903,27904,45553,45554</link.rule.ids></links><search><creatorcontrib>Kulik, Heather J.</creatorcontrib><title>Making machine learning a useful tool in the accelerated discovery of transition metal complexes</title><title>Wiley interdisciplinary reviews. Computational molecular science</title><description>As machine learning (ML) has matured, it has opened a new frontier in theoretical and computational chemistry by offering the promise of simultaneous paradigm shifts in accuracy and efficiency. Nowhere is this advance more needed, but also more challenging to achieve, than in the discovery of open‐shell transition metal complexes. Here, localized d or f electrons exhibit variable bonding that is challenging to capture even with the most computationally demanding methods. Thus, despite great promise, clear obstacles remain in constructing ML models that can supplement or even replace explicit electronic structure calculations. In this article, I outline the recent advances in building ML models in transition metal chemistry, including the ability to approach sub‐kcal/mol accuracy on a range of properties with tailored representations, to discover and enumerate complexes in large chemical spaces, and to reveal opportunities for design through analysis of feature importance. I discuss unique considerations that have been essential to enabling ML in open‐shell transition metal chemistry, including (a) the relationship of data set size/diversity, model complexity, and representation choice, (b) the importance of quantitative assessments of both theory and model domain of applicability, and (c) the need to enable autonomous generation of reliable, large data sets both for ML model training and in active learning or discovery contexts. Finally, I summarize the next steps toward making ML a mainstream tool in the accelerated discovery of transition metal complexes. This article is categorized under: Electronic Structure Theory &gt; Density Functional Theory Software &gt; Molecular Modeling Computer and Information Science &gt; Chemoinformatics An outlook on the challenges and opportunities for making machine learning models a mainstream tool in computational chemistry to accelerate discovery in open‐shell transition metal chemistry.</description><subject>Accuracy</subject><subject>Artificial intelligence</subject><subject>Chemistry</subject><subject>Computational chemistry</subject><subject>Computer applications</subject><subject>Coordination compounds</subject><subject>Datasets</subject><subject>Density functional theory</subject><subject>Electronic structure</subject><subject>high‐throughput screening</subject><subject>inorganic chemistry</subject><subject>Learning algorithms</subject><subject>Machine learning</subject><subject>Metal complexes</subject><subject>Metals</subject><subject>Molecular modelling</subject><subject>Molecular structure</subject><subject>Organic chemistry</subject><subject>Representations</subject><subject>Theories</subject><subject>Theory</subject><subject>Training</subject><subject>Transition metal compounds</subject><issn>1759-0876</issn><issn>1759-0884</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp1kD1PwzAQhi0EElXpwD-wxMSQ1h9J7Iyo4ktqxQCI0Tj2hbokcbFTSv89CUVs3HKnV8_dSQ9C55RMKSFstjNNnNKUF0doREVWJETK9PhvFvkpmsS4Jn2lBWWcjtDrUr-79g032qxcC7gGHdoh0HgbodrWuPO-xq7F3QqwNgZqCLoDi62Lxn9C2GNf4S7oNrrO-RY30OkaG99saviCeIZOKl1HmPz2MXq-uX6a3yWLh9v7-dUiMZyzIpFlTphMgdNSSiGprjILRZVTKowurbSGUW6tIJlkVZkKS0hmmdY2F4KZ1PIxujjc3QT_sYXYqbXfhrZ_qRhnjHBO0rynLg-UCT7GAJXaBNfosFeUqMGhGhyqwWHPzg7sztWw_x9UL_Pl48_GN0zmdFs</recordid><startdate>202001</startdate><enddate>202001</enddate><creator>Kulik, Heather J.</creator><general>Wiley Periodicals, Inc</general><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QH</scope><scope>7TN</scope><scope>7UA</scope><scope>C1K</scope><scope>F1W</scope><scope>H96</scope><scope>JQ2</scope><scope>L.G</scope><orcidid>https://orcid.org/0000-0001-9342-0191</orcidid></search><sort><creationdate>202001</creationdate><title>Making machine learning a useful tool in the accelerated discovery of transition metal complexes</title><author>Kulik, Heather J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3329-8b60284e31b88781af5de9f6117cabd8dc213dd70582fb47d005d2aad6772c4d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Artificial intelligence</topic><topic>Chemistry</topic><topic>Computational chemistry</topic><topic>Computer applications</topic><topic>Coordination compounds</topic><topic>Datasets</topic><topic>Density functional theory</topic><topic>Electronic structure</topic><topic>high‐throughput screening</topic><topic>inorganic chemistry</topic><topic>Learning algorithms</topic><topic>Machine learning</topic><topic>Metal complexes</topic><topic>Metals</topic><topic>Molecular modelling</topic><topic>Molecular structure</topic><topic>Organic chemistry</topic><topic>Representations</topic><topic>Theories</topic><topic>Theory</topic><topic>Training</topic><topic>Transition metal compounds</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kulik, Heather J.</creatorcontrib><collection>CrossRef</collection><collection>Aqualine</collection><collection>Oceanic Abstracts</collection><collection>Water Resources Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy &amp; Non-Living Resources</collection><collection>ProQuest Computer Science Collection</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><jtitle>Wiley interdisciplinary reviews. Computational molecular science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kulik, Heather J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Making machine learning a useful tool in the accelerated discovery of transition metal complexes</atitle><jtitle>Wiley interdisciplinary reviews. Computational molecular science</jtitle><date>2020-01</date><risdate>2020</risdate><volume>10</volume><issue>1</issue><spage>e1439</spage><epage>n/a</epage><pages>e1439-n/a</pages><issn>1759-0876</issn><eissn>1759-0884</eissn><abstract>As machine learning (ML) has matured, it has opened a new frontier in theoretical and computational chemistry by offering the promise of simultaneous paradigm shifts in accuracy and efficiency. Nowhere is this advance more needed, but also more challenging to achieve, than in the discovery of open‐shell transition metal complexes. Here, localized d or f electrons exhibit variable bonding that is challenging to capture even with the most computationally demanding methods. Thus, despite great promise, clear obstacles remain in constructing ML models that can supplement or even replace explicit electronic structure calculations. In this article, I outline the recent advances in building ML models in transition metal chemistry, including the ability to approach sub‐kcal/mol accuracy on a range of properties with tailored representations, to discover and enumerate complexes in large chemical spaces, and to reveal opportunities for design through analysis of feature importance. I discuss unique considerations that have been essential to enabling ML in open‐shell transition metal chemistry, including (a) the relationship of data set size/diversity, model complexity, and representation choice, (b) the importance of quantitative assessments of both theory and model domain of applicability, and (c) the need to enable autonomous generation of reliable, large data sets both for ML model training and in active learning or discovery contexts. Finally, I summarize the next steps toward making ML a mainstream tool in the accelerated discovery of transition metal complexes. This article is categorized under: Electronic Structure Theory &gt; Density Functional Theory Software &gt; Molecular Modeling Computer and Information Science &gt; Chemoinformatics An outlook on the challenges and opportunities for making machine learning models a mainstream tool in computational chemistry to accelerate discovery in open‐shell transition metal chemistry.</abstract><cop>Hoboken, USA</cop><pub>Wiley Periodicals, Inc</pub><doi>10.1002/wcms.1439</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-9342-0191</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1759-0876
ispartof Wiley interdisciplinary reviews. Computational molecular science, 2020-01, Vol.10 (1), p.e1439-n/a
issn 1759-0876
1759-0884
language eng
recordid cdi_proquest_journals_2322033046
source Wiley Online Library Journals Frontfile Complete
subjects Accuracy
Artificial intelligence
Chemistry
Computational chemistry
Computer applications
Coordination compounds
Datasets
Density functional theory
Electronic structure
high‐throughput screening
inorganic chemistry
Learning algorithms
Machine learning
Metal complexes
Metals
Molecular modelling
Molecular structure
Organic chemistry
Representations
Theories
Theory
Training
Transition metal compounds
title Making machine learning a useful tool in the accelerated discovery of transition metal complexes
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T01%3A41%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Making%20machine%20learning%20a%20useful%20tool%20in%20the%20accelerated%20discovery%20of%20transition%20metal%20complexes&rft.jtitle=Wiley%20interdisciplinary%20reviews.%20Computational%20molecular%20science&rft.au=Kulik,%20Heather%20J.&rft.date=2020-01&rft.volume=10&rft.issue=1&rft.spage=e1439&rft.epage=n/a&rft.pages=e1439-n/a&rft.issn=1759-0876&rft.eissn=1759-0884&rft_id=info:doi/10.1002/wcms.1439&rft_dat=%3Cproquest_cross%3E2322033046%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2322033046&rft_id=info:pmid/&rfr_iscdi=true