Towards more sustainable and trustworthy reporting in machine learning
With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on predic...
Gespeichert in:
Veröffentlicht in: | Data mining and knowledge discovery 2024-07, Vol.38 (4), p.1909-1928 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1928 |
---|---|
container_issue | 4 |
container_start_page | 1909 |
container_title | Data mining and knowledge discovery |
container_volume | 38 |
creator | Fischer, Raphael Liebig, Thomas Morik, Katharina |
description | With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness. |
doi_str_mv | 10.1007/s10618-024-01020-3 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3086149301</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3086149301</sourcerecordid><originalsourceid>FETCH-LOGICAL-c314t-d0a3c2b55988cc0ecc21b4ccf91b17570867752d8a8deea714e1462fb78f74753</originalsourceid><addsrcrecordid>eNp9kE9LAzEUxIMoWKtfwFPAc_S9_NmkRylWhYKXCt5CNpttt7TZmmwp_fZGK3jz9IZhfvNgCLlFuEcA_ZARKjQMuGSAwIGJMzJCpQXTqvo4L1oYyZRBuCRXOa8BQHEBIzJb9AeXmky3fQo07_PguujqTaAuNnRIxTj0aVgdaQq7Irq4pF2kW-dXXQx0E1yKxbsmF63b5HDze8fkffa0mL6w-dvz6_RxzrxAObAGnPC8VmpijPcQvOdYS-_bCdaolQZTaa14Y5xpQnAaZUBZ8bbWptVSKzEmd6feXeo_9yEPdt3vUywvrSgwyokALCl-SvnU55xCa3ep27p0tAj2ey972suWvezPXlYUSJygXMJxGdJf9T_UF_Aybfw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3086149301</pqid></control><display><type>article</type><title>Towards more sustainable and trustworthy reporting in machine learning</title><source>SpringerLink Journals</source><creator>Fischer, Raphael ; Liebig, Thomas ; Morik, Katharina</creator><creatorcontrib>Fischer, Raphael ; Liebig, Thomas ; Morik, Katharina</creatorcontrib><description>With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.</description><identifier>ISSN: 1384-5810</identifier><identifier>EISSN: 1573-756X</identifier><identifier>DOI: 10.1007/s10618-024-01020-3</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Artificial Intelligence ; Chemistry and Earth Sciences ; Computer Science ; Data Mining and Knowledge Discovery ; Information Storage and Retrieval ; Machine learning ; Physics ; Statistics for Engineering ; Sustainability ; Trustworthiness</subject><ispartof>Data mining and knowledge discovery, 2024-07, Vol.38 (4), p.1909-1928</ispartof><rights>The Author(s) 2024</rights><rights>The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c314t-d0a3c2b55988cc0ecc21b4ccf91b17570867752d8a8deea714e1462fb78f74753</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10618-024-01020-3$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10618-024-01020-3$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Fischer, Raphael</creatorcontrib><creatorcontrib>Liebig, Thomas</creatorcontrib><creatorcontrib>Morik, Katharina</creatorcontrib><title>Towards more sustainable and trustworthy reporting in machine learning</title><title>Data mining and knowledge discovery</title><addtitle>Data Min Knowl Disc</addtitle><description>With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.</description><subject>Artificial Intelligence</subject><subject>Chemistry and Earth Sciences</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Information Storage and Retrieval</subject><subject>Machine learning</subject><subject>Physics</subject><subject>Statistics for Engineering</subject><subject>Sustainability</subject><subject>Trustworthiness</subject><issn>1384-5810</issn><issn>1573-756X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9kE9LAzEUxIMoWKtfwFPAc_S9_NmkRylWhYKXCt5CNpttt7TZmmwp_fZGK3jz9IZhfvNgCLlFuEcA_ZARKjQMuGSAwIGJMzJCpQXTqvo4L1oYyZRBuCRXOa8BQHEBIzJb9AeXmky3fQo07_PguujqTaAuNnRIxTj0aVgdaQq7Irq4pF2kW-dXXQx0E1yKxbsmF63b5HDze8fkffa0mL6w-dvz6_RxzrxAObAGnPC8VmpijPcQvOdYS-_bCdaolQZTaa14Y5xpQnAaZUBZ8bbWptVSKzEmd6feXeo_9yEPdt3vUywvrSgwyokALCl-SvnU55xCa3ep27p0tAj2ey972suWvezPXlYUSJygXMJxGdJf9T_UF_Aybfw</recordid><startdate>20240701</startdate><enddate>20240701</enddate><creator>Fischer, Raphael</creator><creator>Liebig, Thomas</creator><creator>Morik, Katharina</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20240701</creationdate><title>Towards more sustainable and trustworthy reporting in machine learning</title><author>Fischer, Raphael ; Liebig, Thomas ; Morik, Katharina</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c314t-d0a3c2b55988cc0ecc21b4ccf91b17570867752d8a8deea714e1462fb78f74753</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial Intelligence</topic><topic>Chemistry and Earth Sciences</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Information Storage and Retrieval</topic><topic>Machine learning</topic><topic>Physics</topic><topic>Statistics for Engineering</topic><topic>Sustainability</topic><topic>Trustworthiness</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fischer, Raphael</creatorcontrib><creatorcontrib>Liebig, Thomas</creatorcontrib><creatorcontrib>Morik, Katharina</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Data mining and knowledge discovery</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fischer, Raphael</au><au>Liebig, Thomas</au><au>Morik, Katharina</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Towards more sustainable and trustworthy reporting in machine learning</atitle><jtitle>Data mining and knowledge discovery</jtitle><stitle>Data Min Knowl Disc</stitle><date>2024-07-01</date><risdate>2024</risdate><volume>38</volume><issue>4</issue><spage>1909</spage><epage>1928</epage><pages>1909-1928</pages><issn>1384-5810</issn><eissn>1573-756X</eissn><abstract>With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10618-024-01020-3</doi><tpages>20</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1384-5810 |
ispartof | Data mining and knowledge discovery, 2024-07, Vol.38 (4), p.1909-1928 |
issn | 1384-5810 1573-756X |
language | eng |
recordid | cdi_proquest_journals_3086149301 |
source | SpringerLink Journals |
subjects | Artificial Intelligence Chemistry and Earth Sciences Computer Science Data Mining and Knowledge Discovery Information Storage and Retrieval Machine learning Physics Statistics for Engineering Sustainability Trustworthiness |
title | Towards more sustainable and trustworthy reporting in machine learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T07%3A03%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Towards%20more%20sustainable%20and%20trustworthy%20reporting%20in%20machine%20learning&rft.jtitle=Data%20mining%20and%20knowledge%20discovery&rft.au=Fischer,%20Raphael&rft.date=2024-07-01&rft.volume=38&rft.issue=4&rft.spage=1909&rft.epage=1928&rft.pages=1909-1928&rft.issn=1384-5810&rft.eissn=1573-756X&rft_id=info:doi/10.1007/s10618-024-01020-3&rft_dat=%3Cproquest_cross%3E3086149301%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3086149301&rft_id=info:pmid/&rfr_iscdi=true |