Resampling strategies for imbalanced regression: a survey and empirical analysis

Imbalanced problems can arise in different real-world situations, and to address this, certain strategies in the form of resampling or balancing algorithms are proposed. This issue has largely been studied in the context of classification, and yet, the same problem features in regression tasks, wher...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	The Artificial intelligence review 2024-03, Vol.57 (4), p.82, Article 82
Hauptverfasser:	Avelino, Juscimara G., Cavalcanti, George D. C., Cruz, Rafael M. O.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Artificial Intelligence Balancing Computer Science Context Empirical analysis Learning Prediction models Regression models Resampling Taxonomy
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	4
container_start_page	82
container_title	The Artificial intelligence review
container_volume	57
creator	Avelino, Juscimara G. Cavalcanti, George D. C. Cruz, Rafael M. O.
description	Imbalanced problems can arise in different real-world situations, and to address this, certain strategies in the form of resampling or balancing algorithms are proposed. This issue has largely been studied in the context of classification, and yet, the same problem features in regression tasks, where target values are continuous. This work presents an extensive experimental study comprising various balancing and predictive models, and wich uses metrics to capture important elements for the user and to evaluate the predictive model in an imbalanced regression data context. It also proposes a taxonomy for imbalanced regression approaches based on three crucial criteria: regression model, learning process, and evaluation metrics. The study offers new insights into the use of such strategies, highlighting the advantages they bring to each model’s learning process, and indicating directions for further studies. The code, data and further information related to the experiments performed herein can be found on GitHub: https://github.com/JusciAvelino/imbalancedRegression .
doi_str_mv	10.1007/s10462-024-10724-3
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2934007442</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2934007442</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-b7d18e321073f3e42a52a992269edff4ab5cdd201fed42b0e269248e7ea8d1723</originalsourceid><addsrcrecordid>eNp9kMtKAzEUhoMoWKsv4CrgejS3aTrupGgVCoroOmQmJ0PK3MyZCn17U0fQlZtz_f_D4SPkkrNrzpi-Qc7UQmRMqIwznaI8IjOea5npND_-U5-SM8QtYywXSs7IyyugbYcmdDXFMdoR6gBIfR9paEvb2K4CRyPUERBD391SS3EXP2FPbecotEOIobJN6myzx4Dn5MTbBuHiJ8_J-8P92-ox2zyvn1Z3m6ySCzlmpXZ8CVKkb6WXoITNhS0KIRYFOO-VLfPKOcG4B6dEySAthFqCBrt0XAs5J1fT3SH2HzvA0Wz7XUxPoBGFVAmKUgeVmFRV7BEjeDPE0Nq4N5yZAzkzkTOJnPkmZ2QyycmESdzVEH9P_-P6Ar2tcYM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2934007442</pqid></control><display><type>article</type><title>Resampling strategies for imbalanced regression: a survey and empirical analysis</title><source>Springer Nature OA Free Journals</source><source>Springer Nature - Complete Springer Journals</source><creator>Avelino, Juscimara G. ; Cavalcanti, George D. C. ; Cruz, Rafael M. O.</creator><creatorcontrib>Avelino, Juscimara G. ; Cavalcanti, George D. C. ; Cruz, Rafael M. O.</creatorcontrib><description>Imbalanced problems can arise in different real-world situations, and to address this, certain strategies in the form of resampling or balancing algorithms are proposed. This issue has largely been studied in the context of classification, and yet, the same problem features in regression tasks, where target values are continuous. This work presents an extensive experimental study comprising various balancing and predictive models, and wich uses metrics to capture important elements for the user and to evaluate the predictive model in an imbalanced regression data context. It also proposes a taxonomy for imbalanced regression approaches based on three crucial criteria: regression model, learning process, and evaluation metrics. The study offers new insights into the use of such strategies, highlighting the advantages they bring to each model’s learning process, and indicating directions for further studies. The code, data and further information related to the experiments performed herein can be found on GitHub: https://github.com/JusciAvelino/imbalancedRegression .</description><identifier>ISSN: 1573-7462</identifier><identifier>ISSN: 0269-2821</identifier><identifier>EISSN: 1573-7462</identifier><identifier>DOI: 10.1007/s10462-024-10724-3</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Algorithms ; Artificial Intelligence ; Balancing ; Computer Science ; Context ; Empirical analysis ; Learning ; Prediction models ; Regression models ; Resampling ; Taxonomy</subject><ispartof>The Artificial intelligence review, 2024-03, Vol.57 (4), p.82, Article 82</ispartof><rights>The Author(s) 2024</rights><rights>The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-b7d18e321073f3e42a52a992269edff4ab5cdd201fed42b0e269248e7ea8d1723</citedby><cites>FETCH-LOGICAL-c363t-b7d18e321073f3e42a52a992269edff4ab5cdd201fed42b0e269248e7ea8d1723</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10462-024-10724-3$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://doi.org/10.1007/s10462-024-10724-3$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,27903,27904,41099,41467,42168,42536,51297,51554</link.rule.ids></links><search><creatorcontrib>Avelino, Juscimara G.</creatorcontrib><creatorcontrib>Cavalcanti, George D. C.</creatorcontrib><creatorcontrib>Cruz, Rafael M. O.</creatorcontrib><title>Resampling strategies for imbalanced regression: a survey and empirical analysis</title><title>The Artificial intelligence review</title><addtitle>Artif Intell Rev</addtitle><description>Imbalanced problems can arise in different real-world situations, and to address this, certain strategies in the form of resampling or balancing algorithms are proposed. This issue has largely been studied in the context of classification, and yet, the same problem features in regression tasks, where target values are continuous. This work presents an extensive experimental study comprising various balancing and predictive models, and wich uses metrics to capture important elements for the user and to evaluate the predictive model in an imbalanced regression data context. It also proposes a taxonomy for imbalanced regression approaches based on three crucial criteria: regression model, learning process, and evaluation metrics. The study offers new insights into the use of such strategies, highlighting the advantages they bring to each model’s learning process, and indicating directions for further studies. The code, data and further information related to the experiments performed herein can be found on GitHub: https://github.com/JusciAvelino/imbalancedRegression .</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Balancing</subject><subject>Computer Science</subject><subject>Context</subject><subject>Empirical analysis</subject><subject>Learning</subject><subject>Prediction models</subject><subject>Regression models</subject><subject>Resampling</subject><subject>Taxonomy</subject><issn>1573-7462</issn><issn>0269-2821</issn><issn>1573-7462</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9kMtKAzEUhoMoWKsv4CrgejS3aTrupGgVCoroOmQmJ0PK3MyZCn17U0fQlZtz_f_D4SPkkrNrzpi-Qc7UQmRMqIwznaI8IjOea5npND_-U5-SM8QtYywXSs7IyyugbYcmdDXFMdoR6gBIfR9paEvb2K4CRyPUERBD391SS3EXP2FPbecotEOIobJN6myzx4Dn5MTbBuHiJ8_J-8P92-ox2zyvn1Z3m6ySCzlmpXZ8CVKkb6WXoITNhS0KIRYFOO-VLfPKOcG4B6dEySAthFqCBrt0XAs5J1fT3SH2HzvA0Wz7XUxPoBGFVAmKUgeVmFRV7BEjeDPE0Nq4N5yZAzkzkTOJnPkmZ2QyycmESdzVEH9P_-P6Ar2tcYM</recordid><startdate>20240304</startdate><enddate>20240304</enddate><creator>Avelino, Juscimara G.</creator><creator>Cavalcanti, George D. C.</creator><creator>Cruz, Rafael M. O.</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20240304</creationdate><title>Resampling strategies for imbalanced regression: a survey and empirical analysis</title><author>Avelino, Juscimara G. ; Cavalcanti, George D. C. ; Cruz, Rafael M. O.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-b7d18e321073f3e42a52a992269edff4ab5cdd201fed42b0e269248e7ea8d1723</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Balancing</topic><topic>Computer Science</topic><topic>Context</topic><topic>Empirical analysis</topic><topic>Learning</topic><topic>Prediction models</topic><topic>Regression models</topic><topic>Resampling</topic><topic>Taxonomy</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Avelino, Juscimara G.</creatorcontrib><creatorcontrib>Cavalcanti, George D. C.</creatorcontrib><creatorcontrib>Cruz, Rafael M. O.</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>The Artificial intelligence review</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Avelino, Juscimara G.</au><au>Cavalcanti, George D. C.</au><au>Cruz, Rafael M. O.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Resampling strategies for imbalanced regression: a survey and empirical analysis</atitle><jtitle>The Artificial intelligence review</jtitle><stitle>Artif Intell Rev</stitle><date>2024-03-04</date><risdate>2024</risdate><volume>57</volume><issue>4</issue><spage>82</spage><pages>82-</pages><artnum>82</artnum><issn>1573-7462</issn><issn>0269-2821</issn><eissn>1573-7462</eissn><abstract>Imbalanced problems can arise in different real-world situations, and to address this, certain strategies in the form of resampling or balancing algorithms are proposed. This issue has largely been studied in the context of classification, and yet, the same problem features in regression tasks, where target values are continuous. This work presents an extensive experimental study comprising various balancing and predictive models, and wich uses metrics to capture important elements for the user and to evaluate the predictive model in an imbalanced regression data context. It also proposes a taxonomy for imbalanced regression approaches based on three crucial criteria: regression model, learning process, and evaluation metrics. The study offers new insights into the use of such strategies, highlighting the advantages they bring to each model’s learning process, and indicating directions for further studies. The code, data and further information related to the experiments performed herein can be found on GitHub: https://github.com/JusciAvelino/imbalancedRegression .</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><doi>10.1007/s10462-024-10724-3</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1573-7462
ispartof	The Artificial intelligence review, 2024-03, Vol.57 (4), p.82, Article 82
issn	1573-7462 0269-2821 1573-7462
language	eng
recordid	cdi_proquest_journals_2934007442
source	Springer Nature OA Free Journals; Springer Nature - Complete Springer Journals
subjects	Algorithms Artificial Intelligence Balancing Computer Science Context Empirical analysis Learning Prediction models Regression models Resampling Taxonomy
title	Resampling strategies for imbalanced regression: a survey and empirical analysis
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T18%3A39%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Resampling%20strategies%20for%20imbalanced%20regression:%20a%20survey%20and%20empirical%20analysis&rft.jtitle=The%20Artificial%20intelligence%20review&rft.au=Avelino,%20Juscimara%20G.&rft.date=2024-03-04&rft.volume=57&rft.issue=4&rft.spage=82&rft.pages=82-&rft.artnum=82&rft.issn=1573-7462&rft.eissn=1573-7462&rft_id=info:doi/10.1007/s10462-024-10724-3&rft_dat=%3Cproquest_cross%3E2934007442%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2934007442&rft_id=info:pmid/&rfr_iscdi=true