Assessing Performance and Limitations of a Device‐Level Machine Learning Approach for Perovskite Solar Cells with an Application to Hole Transport Materials

Machine learning models have become widespread in materials science research. An open‐access and community‐driven database containing over 40 000 perovskite photovoltaic devices has been recently published. This resource enables the application of predictive data‐driven models to correlate device st...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Solar RRL 2023-10, Vol.7 (20)
Hauptverfasser: Sala, Simone, Quadrivi, Eleonora, Biagini, Paolo, Po’, Riccardo
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 20
container_start_page
container_title Solar RRL
container_volume 7
creator Sala, Simone
Quadrivi, Eleonora
Biagini, Paolo
Po’, Riccardo
description Machine learning models have become widespread in materials science research. An open‐access and community‐driven database containing over 40 000 perovskite photovoltaic devices has been recently published. This resource enables the application of predictive data‐driven models to correlate device structure with photovoltaic performance, whereas the literature usually focuses on specific device layers. Herein, the concept of device‐level performance prediction is explored using gradient‐boosted regression trees as the core algorithm and Shapley values analysis to interpret and rationalize the results. The main pitfalls and conceptual limitations of the approach are discussed and correlated with the database structure and dimension, by comparing the performance of different choices of descriptors and dataset size. Evidence suggests that the additional features introduced herein, in particular chemical descriptors of perovskite additives, can boost regression performance at a device level. A specific model is finally trained to predict the performance of unseen devices and tested on experimental data from the literature. This task is found to be particularly challenging, as the ability of the model to generalize to a new chemical space is limited by several factors, including the amount and the quality of available data.
doi_str_mv 10.1002/solr.202300490
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1002_solr_202300490</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1002_solr_202300490</sourcerecordid><originalsourceid>FETCH-LOGICAL-c194t-43bcbf9a93aed92bbe4e0499265a5d72dbd9f6d4bb360fa49253b270998c811b3</originalsourceid><addsrcrecordid>eNpN0EtOwzAQBmALgUQF3bL2BVL8ahIvq_IoUhBIFIldZDsTanDjyLaK2HEETsDhOAkJIMTqH41mvsWP0AklM0oIO43ehRkjjBMiJNlDE8bzIqOyfNj_Nx-iaYxPZHgQoihzOkEfixghRts94lsIrQ9b1RnAqmtwZbc2qWR9F7FvscJnsLMGPt_eK9iBw9fKbGwHuAIVuhFY9H3wwxIPzKj5XXy2CfCddyrgJTgX8YtNm0Efb5013zpOHq-8A7wOqou9D2mgEwSrXDxGB-0QMP3NI3R_cb5errLq5vJquagyQ6VImeDa6FYqyRU0kmkNAoYeJMvnat4UrNGNbPNGaM1z0ioh2ZxrVhApS1NSqvkRmv24JvgYA7R1H-xWhdeaknosuB4Lrv8K5l9pFHOQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Assessing Performance and Limitations of a Device‐Level Machine Learning Approach for Perovskite Solar Cells with an Application to Hole Transport Materials</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Sala, Simone ; Quadrivi, Eleonora ; Biagini, Paolo ; Po’, Riccardo</creator><creatorcontrib>Sala, Simone ; Quadrivi, Eleonora ; Biagini, Paolo ; Po’, Riccardo</creatorcontrib><description>Machine learning models have become widespread in materials science research. An open‐access and community‐driven database containing over 40 000 perovskite photovoltaic devices has been recently published. This resource enables the application of predictive data‐driven models to correlate device structure with photovoltaic performance, whereas the literature usually focuses on specific device layers. Herein, the concept of device‐level performance prediction is explored using gradient‐boosted regression trees as the core algorithm and Shapley values analysis to interpret and rationalize the results. The main pitfalls and conceptual limitations of the approach are discussed and correlated with the database structure and dimension, by comparing the performance of different choices of descriptors and dataset size. Evidence suggests that the additional features introduced herein, in particular chemical descriptors of perovskite additives, can boost regression performance at a device level. A specific model is finally trained to predict the performance of unseen devices and tested on experimental data from the literature. This task is found to be particularly challenging, as the ability of the model to generalize to a new chemical space is limited by several factors, including the amount and the quality of available data.</description><identifier>ISSN: 2367-198X</identifier><identifier>EISSN: 2367-198X</identifier><identifier>DOI: 10.1002/solr.202300490</identifier><language>eng</language><ispartof>Solar RRL, 2023-10, Vol.7 (20)</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c194t-43bcbf9a93aed92bbe4e0499265a5d72dbd9f6d4bb360fa49253b270998c811b3</cites><orcidid>0000-0003-1059-161X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Sala, Simone</creatorcontrib><creatorcontrib>Quadrivi, Eleonora</creatorcontrib><creatorcontrib>Biagini, Paolo</creatorcontrib><creatorcontrib>Po’, Riccardo</creatorcontrib><title>Assessing Performance and Limitations of a Device‐Level Machine Learning Approach for Perovskite Solar Cells with an Application to Hole Transport Materials</title><title>Solar RRL</title><description>Machine learning models have become widespread in materials science research. An open‐access and community‐driven database containing over 40 000 perovskite photovoltaic devices has been recently published. This resource enables the application of predictive data‐driven models to correlate device structure with photovoltaic performance, whereas the literature usually focuses on specific device layers. Herein, the concept of device‐level performance prediction is explored using gradient‐boosted regression trees as the core algorithm and Shapley values analysis to interpret and rationalize the results. The main pitfalls and conceptual limitations of the approach are discussed and correlated with the database structure and dimension, by comparing the performance of different choices of descriptors and dataset size. Evidence suggests that the additional features introduced herein, in particular chemical descriptors of perovskite additives, can boost regression performance at a device level. A specific model is finally trained to predict the performance of unseen devices and tested on experimental data from the literature. This task is found to be particularly challenging, as the ability of the model to generalize to a new chemical space is limited by several factors, including the amount and the quality of available data.</description><issn>2367-198X</issn><issn>2367-198X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpN0EtOwzAQBmALgUQF3bL2BVL8ahIvq_IoUhBIFIldZDsTanDjyLaK2HEETsDhOAkJIMTqH41mvsWP0AklM0oIO43ehRkjjBMiJNlDE8bzIqOyfNj_Nx-iaYxPZHgQoihzOkEfixghRts94lsIrQ9b1RnAqmtwZbc2qWR9F7FvscJnsLMGPt_eK9iBw9fKbGwHuAIVuhFY9H3wwxIPzKj5XXy2CfCddyrgJTgX8YtNm0Efb5013zpOHq-8A7wOqou9D2mgEwSrXDxGB-0QMP3NI3R_cb5errLq5vJquagyQ6VImeDa6FYqyRU0kmkNAoYeJMvnat4UrNGNbPNGaM1z0ioh2ZxrVhApS1NSqvkRmv24JvgYA7R1H-xWhdeaknosuB4Lrv8K5l9pFHOQ</recordid><startdate>202310</startdate><enddate>202310</enddate><creator>Sala, Simone</creator><creator>Quadrivi, Eleonora</creator><creator>Biagini, Paolo</creator><creator>Po’, Riccardo</creator><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-1059-161X</orcidid></search><sort><creationdate>202310</creationdate><title>Assessing Performance and Limitations of a Device‐Level Machine Learning Approach for Perovskite Solar Cells with an Application to Hole Transport Materials</title><author>Sala, Simone ; Quadrivi, Eleonora ; Biagini, Paolo ; Po’, Riccardo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c194t-43bcbf9a93aed92bbe4e0499265a5d72dbd9f6d4bb360fa49253b270998c811b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sala, Simone</creatorcontrib><creatorcontrib>Quadrivi, Eleonora</creatorcontrib><creatorcontrib>Biagini, Paolo</creatorcontrib><creatorcontrib>Po’, Riccardo</creatorcontrib><collection>CrossRef</collection><jtitle>Solar RRL</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sala, Simone</au><au>Quadrivi, Eleonora</au><au>Biagini, Paolo</au><au>Po’, Riccardo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessing Performance and Limitations of a Device‐Level Machine Learning Approach for Perovskite Solar Cells with an Application to Hole Transport Materials</atitle><jtitle>Solar RRL</jtitle><date>2023-10</date><risdate>2023</risdate><volume>7</volume><issue>20</issue><issn>2367-198X</issn><eissn>2367-198X</eissn><abstract>Machine learning models have become widespread in materials science research. An open‐access and community‐driven database containing over 40 000 perovskite photovoltaic devices has been recently published. This resource enables the application of predictive data‐driven models to correlate device structure with photovoltaic performance, whereas the literature usually focuses on specific device layers. Herein, the concept of device‐level performance prediction is explored using gradient‐boosted regression trees as the core algorithm and Shapley values analysis to interpret and rationalize the results. The main pitfalls and conceptual limitations of the approach are discussed and correlated with the database structure and dimension, by comparing the performance of different choices of descriptors and dataset size. Evidence suggests that the additional features introduced herein, in particular chemical descriptors of perovskite additives, can boost regression performance at a device level. A specific model is finally trained to predict the performance of unseen devices and tested on experimental data from the literature. This task is found to be particularly challenging, as the ability of the model to generalize to a new chemical space is limited by several factors, including the amount and the quality of available data.</abstract><doi>10.1002/solr.202300490</doi><orcidid>https://orcid.org/0000-0003-1059-161X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2367-198X
ispartof Solar RRL, 2023-10, Vol.7 (20)
issn 2367-198X
2367-198X
language eng
recordid cdi_crossref_primary_10_1002_solr_202300490
source Wiley Online Library Journals Frontfile Complete
title Assessing Performance and Limitations of a Device‐Level Machine Learning Approach for Perovskite Solar Cells with an Application to Hole Transport Materials
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T01%3A40%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessing%20Performance%20and%20Limitations%20of%20a%20Device%E2%80%90Level%20Machine%20Learning%20Approach%20for%20Perovskite%20Solar%20Cells%20with%20an%20Application%20to%20Hole%20Transport%20Materials&rft.jtitle=Solar%20RRL&rft.au=Sala,%20Simone&rft.date=2023-10&rft.volume=7&rft.issue=20&rft.issn=2367-198X&rft.eissn=2367-198X&rft_id=info:doi/10.1002/solr.202300490&rft_dat=%3Ccrossref%3E10_1002_solr_202300490%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true