Explainable Machine Learning for Hydrogen Diffusion in Metals and Random Binary Alloys

Hydrogen diffusion in metals and alloys plays an important role in the discovery of new materials for fuel cell and energy storage technology. While analytic models use hand-selected features that have clear physical ties to hydrogen diffusion, they often lack accuracy when making quantitative predi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-10
Hauptverfasser: Lu, Grace M, Witman, Matthew, Agarwal, Sapan, Stavila, Vitalie, Trinkle, Dallas R
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Lu, Grace M
Witman, Matthew
Agarwal, Sapan
Stavila, Vitalie
Trinkle, Dallas R
description Hydrogen diffusion in metals and alloys plays an important role in the discovery of new materials for fuel cell and energy storage technology. While analytic models use hand-selected features that have clear physical ties to hydrogen diffusion, they often lack accuracy when making quantitative predictions. Machine learning models are capable of making accurate predictions, but their inner workings are obscured, rendering it unclear which physical features are truly important. To develop interpretable machine learning models to predict the activation energies of hydrogen diffusion in metals and random binary alloys, we create a database for physical and chemical properties of the species and use it to fit six machine learning models. Our models achieve root-mean-squared-errors between 98-119 meV on the testing data and accurately predict that elemental Ru has a large activation energy, while elemental Cr and Fe have small activation energies. By analyzing the feature importances of these fitted models, we identify relevant physical properties for predicting hydrogen diffusivity. While metrics for measuring the individual feature importances for machine learning models exist, correlations between the features lead to disagreement between models and limit the conclusions that can be drawn. Instead grouped feature importances, formed by combining the features via their correlations, agree across the six models and reveal that the two groups containing the packing factor and electronic specific heat are particularly significant for predicting hydrogen diffusion in metals and random binary alloys. This framework allows us to interpret machine learning models and enables rapid screening of new materials with the desired rates of hydrogen diffusion.
doi_str_mv 10.48550/arxiv.2308.07823
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2308_07823</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2851479825</sourcerecordid><originalsourceid>FETCH-LOGICAL-a525-6f34c03842172a3b9282f3e69366d4f8502b0dbfe74be73e439d6ffc15f524273</originalsourceid><addsrcrecordid>eNotj8FOAjEURRsTEwnyAa5s4nqw81477SwRUUwgJoa4nXSYFkuGFjuMgb-3gpt3F-_m5hxC7nI25koI9qjj0f2MAZkaM6kAr8gAEPNMcYAbMuq6LWMMCglC4IB8zo77Vjuv69bQpV5_OW_owujond9QGyKdn5oYNsbTZ2dt37ngqfN0aQ667aj2Df1IJ-zoUxqJJzpp23Dqbsm1TX8z-s8hWb3MVtN5tnh_fZtOFpkWILLCIl8zTGS5BI11CQosmqLEomi4VYJBzZraGslrI9FwLJvC2nUurAAOEofk_jJ7lq720e0SQ_UnX53lU-Ph0tjH8N2b7lBtQx99YqpAiZzLUoHAXzgtW8o</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2851479825</pqid></control><display><type>article</type><title>Explainable Machine Learning for Hydrogen Diffusion in Metals and Random Binary Alloys</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Lu, Grace M ; Witman, Matthew ; Agarwal, Sapan ; Stavila, Vitalie ; Trinkle, Dallas R</creator><creatorcontrib>Lu, Grace M ; Witman, Matthew ; Agarwal, Sapan ; Stavila, Vitalie ; Trinkle, Dallas R</creatorcontrib><description>Hydrogen diffusion in metals and alloys plays an important role in the discovery of new materials for fuel cell and energy storage technology. While analytic models use hand-selected features that have clear physical ties to hydrogen diffusion, they often lack accuracy when making quantitative predictions. Machine learning models are capable of making accurate predictions, but their inner workings are obscured, rendering it unclear which physical features are truly important. To develop interpretable machine learning models to predict the activation energies of hydrogen diffusion in metals and random binary alloys, we create a database for physical and chemical properties of the species and use it to fit six machine learning models. Our models achieve root-mean-squared-errors between 98-119 meV on the testing data and accurately predict that elemental Ru has a large activation energy, while elemental Cr and Fe have small activation energies. By analyzing the feature importances of these fitted models, we identify relevant physical properties for predicting hydrogen diffusivity. While metrics for measuring the individual feature importances for machine learning models exist, correlations between the features lead to disagreement between models and limit the conclusions that can be drawn. Instead grouped feature importances, formed by combining the features via their correlations, agree across the six models and reveal that the two groups containing the packing factor and electronic specific heat are particularly significant for predicting hydrogen diffusion in metals and random binary alloys. This framework allows us to interpret machine learning models and enables rapid screening of new materials with the desired rates of hydrogen diffusion.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2308.07823</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Activation energy ; Binary alloys ; Chemical properties ; Diffusion rate ; Energy storage ; Fuel cells ; Hydrogen ; Machine learning ; Mathematical models ; Physical properties ; Physics - Materials Science ; Technology assessment</subject><ispartof>arXiv.org, 2023-10</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27923</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2308.07823$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1103/PhysRevMaterials.7.105402$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Lu, Grace M</creatorcontrib><creatorcontrib>Witman, Matthew</creatorcontrib><creatorcontrib>Agarwal, Sapan</creatorcontrib><creatorcontrib>Stavila, Vitalie</creatorcontrib><creatorcontrib>Trinkle, Dallas R</creatorcontrib><title>Explainable Machine Learning for Hydrogen Diffusion in Metals and Random Binary Alloys</title><title>arXiv.org</title><description>Hydrogen diffusion in metals and alloys plays an important role in the discovery of new materials for fuel cell and energy storage technology. While analytic models use hand-selected features that have clear physical ties to hydrogen diffusion, they often lack accuracy when making quantitative predictions. Machine learning models are capable of making accurate predictions, but their inner workings are obscured, rendering it unclear which physical features are truly important. To develop interpretable machine learning models to predict the activation energies of hydrogen diffusion in metals and random binary alloys, we create a database for physical and chemical properties of the species and use it to fit six machine learning models. Our models achieve root-mean-squared-errors between 98-119 meV on the testing data and accurately predict that elemental Ru has a large activation energy, while elemental Cr and Fe have small activation energies. By analyzing the feature importances of these fitted models, we identify relevant physical properties for predicting hydrogen diffusivity. While metrics for measuring the individual feature importances for machine learning models exist, correlations between the features lead to disagreement between models and limit the conclusions that can be drawn. Instead grouped feature importances, formed by combining the features via their correlations, agree across the six models and reveal that the two groups containing the packing factor and electronic specific heat are particularly significant for predicting hydrogen diffusion in metals and random binary alloys. This framework allows us to interpret machine learning models and enables rapid screening of new materials with the desired rates of hydrogen diffusion.</description><subject>Activation energy</subject><subject>Binary alloys</subject><subject>Chemical properties</subject><subject>Diffusion rate</subject><subject>Energy storage</subject><subject>Fuel cells</subject><subject>Hydrogen</subject><subject>Machine learning</subject><subject>Mathematical models</subject><subject>Physical properties</subject><subject>Physics - Materials Science</subject><subject>Technology assessment</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotj8FOAjEURRsTEwnyAa5s4nqw81477SwRUUwgJoa4nXSYFkuGFjuMgb-3gpt3F-_m5hxC7nI25koI9qjj0f2MAZkaM6kAr8gAEPNMcYAbMuq6LWMMCglC4IB8zo77Vjuv69bQpV5_OW_owujond9QGyKdn5oYNsbTZ2dt37ngqfN0aQ667aj2Df1IJ-zoUxqJJzpp23Dqbsm1TX8z-s8hWb3MVtN5tnh_fZtOFpkWILLCIl8zTGS5BI11CQosmqLEomi4VYJBzZraGslrI9FwLJvC2nUurAAOEofk_jJ7lq720e0SQ_UnX53lU-Ph0tjH8N2b7lBtQx99YqpAiZzLUoHAXzgtW8o</recordid><startdate>20231026</startdate><enddate>20231026</enddate><creator>Lu, Grace M</creator><creator>Witman, Matthew</creator><creator>Agarwal, Sapan</creator><creator>Stavila, Vitalie</creator><creator>Trinkle, Dallas R</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>GOX</scope></search><sort><creationdate>20231026</creationdate><title>Explainable Machine Learning for Hydrogen Diffusion in Metals and Random Binary Alloys</title><author>Lu, Grace M ; Witman, Matthew ; Agarwal, Sapan ; Stavila, Vitalie ; Trinkle, Dallas R</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a525-6f34c03842172a3b9282f3e69366d4f8502b0dbfe74be73e439d6ffc15f524273</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Activation energy</topic><topic>Binary alloys</topic><topic>Chemical properties</topic><topic>Diffusion rate</topic><topic>Energy storage</topic><topic>Fuel cells</topic><topic>Hydrogen</topic><topic>Machine learning</topic><topic>Mathematical models</topic><topic>Physical properties</topic><topic>Physics - Materials Science</topic><topic>Technology assessment</topic><toplevel>online_resources</toplevel><creatorcontrib>Lu, Grace M</creatorcontrib><creatorcontrib>Witman, Matthew</creatorcontrib><creatorcontrib>Agarwal, Sapan</creatorcontrib><creatorcontrib>Stavila, Vitalie</creatorcontrib><creatorcontrib>Trinkle, Dallas R</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lu, Grace M</au><au>Witman, Matthew</au><au>Agarwal, Sapan</au><au>Stavila, Vitalie</au><au>Trinkle, Dallas R</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Explainable Machine Learning for Hydrogen Diffusion in Metals and Random Binary Alloys</atitle><jtitle>arXiv.org</jtitle><date>2023-10-26</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Hydrogen diffusion in metals and alloys plays an important role in the discovery of new materials for fuel cell and energy storage technology. While analytic models use hand-selected features that have clear physical ties to hydrogen diffusion, they often lack accuracy when making quantitative predictions. Machine learning models are capable of making accurate predictions, but their inner workings are obscured, rendering it unclear which physical features are truly important. To develop interpretable machine learning models to predict the activation energies of hydrogen diffusion in metals and random binary alloys, we create a database for physical and chemical properties of the species and use it to fit six machine learning models. Our models achieve root-mean-squared-errors between 98-119 meV on the testing data and accurately predict that elemental Ru has a large activation energy, while elemental Cr and Fe have small activation energies. By analyzing the feature importances of these fitted models, we identify relevant physical properties for predicting hydrogen diffusivity. While metrics for measuring the individual feature importances for machine learning models exist, correlations between the features lead to disagreement between models and limit the conclusions that can be drawn. Instead grouped feature importances, formed by combining the features via their correlations, agree across the six models and reveal that the two groups containing the packing factor and electronic specific heat are particularly significant for predicting hydrogen diffusion in metals and random binary alloys. This framework allows us to interpret machine learning models and enables rapid screening of new materials with the desired rates of hydrogen diffusion.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2308.07823</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-10
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2308_07823
source arXiv.org; Free E- Journals
subjects Activation energy
Binary alloys
Chemical properties
Diffusion rate
Energy storage
Fuel cells
Hydrogen
Machine learning
Mathematical models
Physical properties
Physics - Materials Science
Technology assessment
title Explainable Machine Learning for Hydrogen Diffusion in Metals and Random Binary Alloys
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T12%3A42%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Explainable%20Machine%20Learning%20for%20Hydrogen%20Diffusion%20in%20Metals%20and%20Random%20Binary%20Alloys&rft.jtitle=arXiv.org&rft.au=Lu,%20Grace%20M&rft.date=2023-10-26&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2308.07823&rft_dat=%3Cproquest_arxiv%3E2851479825%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2851479825&rft_id=info:pmid/&rfr_iscdi=true