Towards more sustainable and trustworthy reporting in machine learning

With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on predic...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Data mining and knowledge discovery 2024-07, Vol.38 (4), p.1909-1928
Hauptverfasser: Fischer, Raphael, Liebig, Thomas, Morik, Katharina
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1928
container_issue 4
container_start_page 1909
container_title Data mining and knowledge discovery
container_volume 38
creator Fischer, Raphael
Liebig, Thomas
Morik, Katharina
description With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.
doi_str_mv 10.1007/s10618-024-01020-3
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3086149301</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3086149301</sourcerecordid><originalsourceid>FETCH-LOGICAL-c314t-d0a3c2b55988cc0ecc21b4ccf91b17570867752d8a8deea714e1462fb78f74753</originalsourceid><addsrcrecordid>eNp9kE9LAzEUxIMoWKtfwFPAc_S9_NmkRylWhYKXCt5CNpttt7TZmmwp_fZGK3jz9IZhfvNgCLlFuEcA_ZARKjQMuGSAwIGJMzJCpQXTqvo4L1oYyZRBuCRXOa8BQHEBIzJb9AeXmky3fQo07_PguujqTaAuNnRIxTj0aVgdaQq7Irq4pF2kW-dXXQx0E1yKxbsmF63b5HDze8fkffa0mL6w-dvz6_RxzrxAObAGnPC8VmpijPcQvOdYS-_bCdaolQZTaa14Y5xpQnAaZUBZ8bbWptVSKzEmd6feXeo_9yEPdt3vUywvrSgwyokALCl-SvnU55xCa3ep27p0tAj2ey972suWvezPXlYUSJygXMJxGdJf9T_UF_Aybfw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3086149301</pqid></control><display><type>article</type><title>Towards more sustainable and trustworthy reporting in machine learning</title><source>SpringerLink Journals</source><creator>Fischer, Raphael ; Liebig, Thomas ; Morik, Katharina</creator><creatorcontrib>Fischer, Raphael ; Liebig, Thomas ; Morik, Katharina</creatorcontrib><description>With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.</description><identifier>ISSN: 1384-5810</identifier><identifier>EISSN: 1573-756X</identifier><identifier>DOI: 10.1007/s10618-024-01020-3</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Artificial Intelligence ; Chemistry and Earth Sciences ; Computer Science ; Data Mining and Knowledge Discovery ; Information Storage and Retrieval ; Machine learning ; Physics ; Statistics for Engineering ; Sustainability ; Trustworthiness</subject><ispartof>Data mining and knowledge discovery, 2024-07, Vol.38 (4), p.1909-1928</ispartof><rights>The Author(s) 2024</rights><rights>The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c314t-d0a3c2b55988cc0ecc21b4ccf91b17570867752d8a8deea714e1462fb78f74753</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10618-024-01020-3$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10618-024-01020-3$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Fischer, Raphael</creatorcontrib><creatorcontrib>Liebig, Thomas</creatorcontrib><creatorcontrib>Morik, Katharina</creatorcontrib><title>Towards more sustainable and trustworthy reporting in machine learning</title><title>Data mining and knowledge discovery</title><addtitle>Data Min Knowl Disc</addtitle><description>With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.</description><subject>Artificial Intelligence</subject><subject>Chemistry and Earth Sciences</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Information Storage and Retrieval</subject><subject>Machine learning</subject><subject>Physics</subject><subject>Statistics for Engineering</subject><subject>Sustainability</subject><subject>Trustworthiness</subject><issn>1384-5810</issn><issn>1573-756X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9kE9LAzEUxIMoWKtfwFPAc_S9_NmkRylWhYKXCt5CNpttt7TZmmwp_fZGK3jz9IZhfvNgCLlFuEcA_ZARKjQMuGSAwIGJMzJCpQXTqvo4L1oYyZRBuCRXOa8BQHEBIzJb9AeXmky3fQo07_PguujqTaAuNnRIxTj0aVgdaQq7Irq4pF2kW-dXXQx0E1yKxbsmF63b5HDze8fkffa0mL6w-dvz6_RxzrxAObAGnPC8VmpijPcQvOdYS-_bCdaolQZTaa14Y5xpQnAaZUBZ8bbWptVSKzEmd6feXeo_9yEPdt3vUywvrSgwyokALCl-SvnU55xCa3ep27p0tAj2ey972suWvezPXlYUSJygXMJxGdJf9T_UF_Aybfw</recordid><startdate>20240701</startdate><enddate>20240701</enddate><creator>Fischer, Raphael</creator><creator>Liebig, Thomas</creator><creator>Morik, Katharina</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20240701</creationdate><title>Towards more sustainable and trustworthy reporting in machine learning</title><author>Fischer, Raphael ; Liebig, Thomas ; Morik, Katharina</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c314t-d0a3c2b55988cc0ecc21b4ccf91b17570867752d8a8deea714e1462fb78f74753</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial Intelligence</topic><topic>Chemistry and Earth Sciences</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Information Storage and Retrieval</topic><topic>Machine learning</topic><topic>Physics</topic><topic>Statistics for Engineering</topic><topic>Sustainability</topic><topic>Trustworthiness</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fischer, Raphael</creatorcontrib><creatorcontrib>Liebig, Thomas</creatorcontrib><creatorcontrib>Morik, Katharina</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Data mining and knowledge discovery</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fischer, Raphael</au><au>Liebig, Thomas</au><au>Morik, Katharina</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Towards more sustainable and trustworthy reporting in machine learning</atitle><jtitle>Data mining and knowledge discovery</jtitle><stitle>Data Min Knowl Disc</stitle><date>2024-07-01</date><risdate>2024</risdate><volume>38</volume><issue>4</issue><spage>1909</spage><epage>1928</epage><pages>1909-1928</pages><issn>1384-5810</issn><eissn>1573-756X</eissn><abstract>With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10618-024-01020-3</doi><tpages>20</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1384-5810
ispartof Data mining and knowledge discovery, 2024-07, Vol.38 (4), p.1909-1928
issn 1384-5810
1573-756X
language eng
recordid cdi_proquest_journals_3086149301
source SpringerLink Journals
subjects Artificial Intelligence
Chemistry and Earth Sciences
Computer Science
Data Mining and Knowledge Discovery
Information Storage and Retrieval
Machine learning
Physics
Statistics for Engineering
Sustainability
Trustworthiness
title Towards more sustainable and trustworthy reporting in machine learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T07%3A03%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Towards%20more%20sustainable%20and%20trustworthy%20reporting%20in%20machine%20learning&rft.jtitle=Data%20mining%20and%20knowledge%20discovery&rft.au=Fischer,%20Raphael&rft.date=2024-07-01&rft.volume=38&rft.issue=4&rft.spage=1909&rft.epage=1928&rft.pages=1909-1928&rft.issn=1384-5810&rft.eissn=1573-756X&rft_id=info:doi/10.1007/s10618-024-01020-3&rft_dat=%3Cproquest_cross%3E3086149301%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3086149301&rft_id=info:pmid/&rfr_iscdi=true