Solid Waste Analysis Using Open-Access Socio-Economic Data

Nowadays, problems related with solid waste management become a challenge for most countries due to the rising generation of waste, related environmental issues, and associated costs of produced wastes. Effective waste management systems at different geographic levels require accurate forecasting of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Sustainability 2022-02, Vol.14 (3), p.1233
Hauptverfasser: Dunkel, Jürgen, Dominguez, David, Borzdynski, Óscar G., Sánchez, Ángel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 3
container_start_page 1233
container_title Sustainability
container_volume 14
creator Dunkel, Jürgen
Dominguez, David
Borzdynski, Óscar G.
Sánchez, Ángel
description Nowadays, problems related with solid waste management become a challenge for most countries due to the rising generation of waste, related environmental issues, and associated costs of produced wastes. Effective waste management systems at different geographic levels require accurate forecasting of future waste generation. In this work, we investigate how open-access data, such as provided from the Organisation for Economic Co-operation and Development (OECD), can be used for the analysis of waste data. The main idea of this study is finding the links between socio-economic and demographic variables that determine the amounts of types of solid wastes produced by countries. This would make it possible to accurately predict at the country level the waste production and determine the requirements for the development of effective waste management strategies. In particular, we use several machine learning data regression (Support Vector, Gradient Boosting, and Random Forest) and clustering models (k-means) to respectively predict waste production for OECD countries along years and also to perform clustering among these countries according to similar characteristics. The main contributions of our work are: (1) waste analysis at the OECD country-level to compare and cluster countries according to similar waste features predicted; (2) the detection of most relevant features for prediction models; and (3) the comparison between several regression models with respect to accuracy in predictions. Coefficient of determination (R2), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE), respectively, are used as indices of the efficiency of the developed models. Our experiments have shown that some data pre-processings on the OECD data are an essential stage required in the analysis; that Random Forest Regressor (RFR) produced the best prediction results over the dataset; and that these results are highly influenced by the quality of available socio-economic data. In particular, the RFR model exhibited the highest accuracy in predictions for most waste types. For example, for “municipal” waste, it produced, respectively, R2 = 1 and MAPE=4.31 global error values for the test set; and for “household” waste, it, respectively, produced R2 = 1 and MAPE=3.03. Our results indicate that the considered models (and specially RFR) all are effective in predicting the amount of produced wastes derived from input data for the considered
doi_str_mv 10.3390/su14031233
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2627838199</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2627838199</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-9a68a7b7dec4ee68ce55eda18a7e53e030a1f85146f908b46e1569183e9562e23</originalsourceid><addsrcrecordid>eNpNUMFKw0AUXETBUnvxCwLehOi-fdnNrrdSWxUKPdTiMWw3L5KSZmteeujfG6mgc5lhGIZhhLgF-YDo5CMfIZMICvFCjJTMIQWp5eU_fS0mzDs5ABEcmJF4WsemLpMPzz0l09Y3J6452XDdfiarA7XpNARiTtYx1DGdh9jGfR2SZ9_7G3FV-YZp8stjsVnM32ev6XL18jabLtOgnO5T5431-TYvKWRExgbSmkoPg0kaSaL0UFkNmamctNvMEGjjwCI5bRQpHIu7c--hi19H4r7YxWM3TOVCGZVbtODckLo_p0IXmTuqikNX7313KkAWP_cUf_fgN8aeVR8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2627838199</pqid></control><display><type>article</type><title>Solid Waste Analysis Using Open-Access Socio-Economic Data</title><source>MDPI - Multidisciplinary Digital Publishing Institute</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Dunkel, Jürgen ; Dominguez, David ; Borzdynski, Óscar G. ; Sánchez, Ángel</creator><creatorcontrib>Dunkel, Jürgen ; Dominguez, David ; Borzdynski, Óscar G. ; Sánchez, Ángel</creatorcontrib><description>Nowadays, problems related with solid waste management become a challenge for most countries due to the rising generation of waste, related environmental issues, and associated costs of produced wastes. Effective waste management systems at different geographic levels require accurate forecasting of future waste generation. In this work, we investigate how open-access data, such as provided from the Organisation for Economic Co-operation and Development (OECD), can be used for the analysis of waste data. The main idea of this study is finding the links between socio-economic and demographic variables that determine the amounts of types of solid wastes produced by countries. This would make it possible to accurately predict at the country level the waste production and determine the requirements for the development of effective waste management strategies. In particular, we use several machine learning data regression (Support Vector, Gradient Boosting, and Random Forest) and clustering models (k-means) to respectively predict waste production for OECD countries along years and also to perform clustering among these countries according to similar characteristics. The main contributions of our work are: (1) waste analysis at the OECD country-level to compare and cluster countries according to similar waste features predicted; (2) the detection of most relevant features for prediction models; and (3) the comparison between several regression models with respect to accuracy in predictions. Coefficient of determination (R2), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE), respectively, are used as indices of the efficiency of the developed models. Our experiments have shown that some data pre-processings on the OECD data are an essential stage required in the analysis; that Random Forest Regressor (RFR) produced the best prediction results over the dataset; and that these results are highly influenced by the quality of available socio-economic data. In particular, the RFR model exhibited the highest accuracy in predictions for most waste types. For example, for “municipal” waste, it produced, respectively, R2 = 1 and MAPE=4.31 global error values for the test set; and for “household” waste, it, respectively, produced R2 = 1 and MAPE=3.03. Our results indicate that the considered models (and specially RFR) all are effective in predicting the amount of produced wastes derived from input data for the considered countries.</description><identifier>ISSN: 2071-1050</identifier><identifier>EISSN: 2071-1050</identifier><identifier>DOI: 10.3390/su14031233</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Artificial intelligence ; Clustering ; Data analysis ; Datasets ; Demographic variables ; Design of experiments ; Genetic algorithms ; Learning algorithms ; Machine learning ; Open data ; Prediction models ; Regression analysis ; Root-mean-square errors ; Socioeconomic factors ; Socioeconomics ; Solid waste management ; Solid wastes ; Support vector machines ; Sustainability ; System effectiveness ; Waste analysis ; Waste management ; Wastes</subject><ispartof>Sustainability, 2022-02, Vol.14 (3), p.1233</ispartof><rights>2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-9a68a7b7dec4ee68ce55eda18a7e53e030a1f85146f908b46e1569183e9562e23</citedby><cites>FETCH-LOGICAL-c295t-9a68a7b7dec4ee68ce55eda18a7e53e030a1f85146f908b46e1569183e9562e23</cites><orcidid>0000-0001-6598-3448 ; 0000-0001-9069-6985 ; 0000-0003-3567-1173 ; 0000-0003-0911-1834</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Dunkel, Jürgen</creatorcontrib><creatorcontrib>Dominguez, David</creatorcontrib><creatorcontrib>Borzdynski, Óscar G.</creatorcontrib><creatorcontrib>Sánchez, Ángel</creatorcontrib><title>Solid Waste Analysis Using Open-Access Socio-Economic Data</title><title>Sustainability</title><description>Nowadays, problems related with solid waste management become a challenge for most countries due to the rising generation of waste, related environmental issues, and associated costs of produced wastes. Effective waste management systems at different geographic levels require accurate forecasting of future waste generation. In this work, we investigate how open-access data, such as provided from the Organisation for Economic Co-operation and Development (OECD), can be used for the analysis of waste data. The main idea of this study is finding the links between socio-economic and demographic variables that determine the amounts of types of solid wastes produced by countries. This would make it possible to accurately predict at the country level the waste production and determine the requirements for the development of effective waste management strategies. In particular, we use several machine learning data regression (Support Vector, Gradient Boosting, and Random Forest) and clustering models (k-means) to respectively predict waste production for OECD countries along years and also to perform clustering among these countries according to similar characteristics. The main contributions of our work are: (1) waste analysis at the OECD country-level to compare and cluster countries according to similar waste features predicted; (2) the detection of most relevant features for prediction models; and (3) the comparison between several regression models with respect to accuracy in predictions. Coefficient of determination (R2), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE), respectively, are used as indices of the efficiency of the developed models. Our experiments have shown that some data pre-processings on the OECD data are an essential stage required in the analysis; that Random Forest Regressor (RFR) produced the best prediction results over the dataset; and that these results are highly influenced by the quality of available socio-economic data. In particular, the RFR model exhibited the highest accuracy in predictions for most waste types. For example, for “municipal” waste, it produced, respectively, R2 = 1 and MAPE=4.31 global error values for the test set; and for “household” waste, it, respectively, produced R2 = 1 and MAPE=3.03. Our results indicate that the considered models (and specially RFR) all are effective in predicting the amount of produced wastes derived from input data for the considered countries.</description><subject>Artificial intelligence</subject><subject>Clustering</subject><subject>Data analysis</subject><subject>Datasets</subject><subject>Demographic variables</subject><subject>Design of experiments</subject><subject>Genetic algorithms</subject><subject>Learning algorithms</subject><subject>Machine learning</subject><subject>Open data</subject><subject>Prediction models</subject><subject>Regression analysis</subject><subject>Root-mean-square errors</subject><subject>Socioeconomic factors</subject><subject>Socioeconomics</subject><subject>Solid waste management</subject><subject>Solid wastes</subject><subject>Support vector machines</subject><subject>Sustainability</subject><subject>System effectiveness</subject><subject>Waste analysis</subject><subject>Waste management</subject><subject>Wastes</subject><issn>2071-1050</issn><issn>2071-1050</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNpNUMFKw0AUXETBUnvxCwLehOi-fdnNrrdSWxUKPdTiMWw3L5KSZmteeujfG6mgc5lhGIZhhLgF-YDo5CMfIZMICvFCjJTMIQWp5eU_fS0mzDs5ABEcmJF4WsemLpMPzz0l09Y3J6452XDdfiarA7XpNARiTtYx1DGdh9jGfR2SZ9_7G3FV-YZp8stjsVnM32ev6XL18jabLtOgnO5T5431-TYvKWRExgbSmkoPg0kaSaL0UFkNmamctNvMEGjjwCI5bRQpHIu7c--hi19H4r7YxWM3TOVCGZVbtODckLo_p0IXmTuqikNX7313KkAWP_cUf_fgN8aeVR8</recordid><startdate>20220201</startdate><enddate>20220201</enddate><creator>Dunkel, Jürgen</creator><creator>Dominguez, David</creator><creator>Borzdynski, Óscar G.</creator><creator>Sánchez, Ángel</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>4U-</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><orcidid>https://orcid.org/0000-0001-6598-3448</orcidid><orcidid>https://orcid.org/0000-0001-9069-6985</orcidid><orcidid>https://orcid.org/0000-0003-3567-1173</orcidid><orcidid>https://orcid.org/0000-0003-0911-1834</orcidid></search><sort><creationdate>20220201</creationdate><title>Solid Waste Analysis Using Open-Access Socio-Economic Data</title><author>Dunkel, Jürgen ; Dominguez, David ; Borzdynski, Óscar G. ; Sánchez, Ángel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-9a68a7b7dec4ee68ce55eda18a7e53e030a1f85146f908b46e1569183e9562e23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Artificial intelligence</topic><topic>Clustering</topic><topic>Data analysis</topic><topic>Datasets</topic><topic>Demographic variables</topic><topic>Design of experiments</topic><topic>Genetic algorithms</topic><topic>Learning algorithms</topic><topic>Machine learning</topic><topic>Open data</topic><topic>Prediction models</topic><topic>Regression analysis</topic><topic>Root-mean-square errors</topic><topic>Socioeconomic factors</topic><topic>Socioeconomics</topic><topic>Solid waste management</topic><topic>Solid wastes</topic><topic>Support vector machines</topic><topic>Sustainability</topic><topic>System effectiveness</topic><topic>Waste analysis</topic><topic>Waste management</topic><topic>Wastes</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dunkel, Jürgen</creatorcontrib><creatorcontrib>Dominguez, David</creatorcontrib><creatorcontrib>Borzdynski, Óscar G.</creatorcontrib><creatorcontrib>Sánchez, Ángel</creatorcontrib><collection>CrossRef</collection><collection>University Readers</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Sustainability</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dunkel, Jürgen</au><au>Dominguez, David</au><au>Borzdynski, Óscar G.</au><au>Sánchez, Ángel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Solid Waste Analysis Using Open-Access Socio-Economic Data</atitle><jtitle>Sustainability</jtitle><date>2022-02-01</date><risdate>2022</risdate><volume>14</volume><issue>3</issue><spage>1233</spage><pages>1233-</pages><issn>2071-1050</issn><eissn>2071-1050</eissn><abstract>Nowadays, problems related with solid waste management become a challenge for most countries due to the rising generation of waste, related environmental issues, and associated costs of produced wastes. Effective waste management systems at different geographic levels require accurate forecasting of future waste generation. In this work, we investigate how open-access data, such as provided from the Organisation for Economic Co-operation and Development (OECD), can be used for the analysis of waste data. The main idea of this study is finding the links between socio-economic and demographic variables that determine the amounts of types of solid wastes produced by countries. This would make it possible to accurately predict at the country level the waste production and determine the requirements for the development of effective waste management strategies. In particular, we use several machine learning data regression (Support Vector, Gradient Boosting, and Random Forest) and clustering models (k-means) to respectively predict waste production for OECD countries along years and also to perform clustering among these countries according to similar characteristics. The main contributions of our work are: (1) waste analysis at the OECD country-level to compare and cluster countries according to similar waste features predicted; (2) the detection of most relevant features for prediction models; and (3) the comparison between several regression models with respect to accuracy in predictions. Coefficient of determination (R2), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE), respectively, are used as indices of the efficiency of the developed models. Our experiments have shown that some data pre-processings on the OECD data are an essential stage required in the analysis; that Random Forest Regressor (RFR) produced the best prediction results over the dataset; and that these results are highly influenced by the quality of available socio-economic data. In particular, the RFR model exhibited the highest accuracy in predictions for most waste types. For example, for “municipal” waste, it produced, respectively, R2 = 1 and MAPE=4.31 global error values for the test set; and for “household” waste, it, respectively, produced R2 = 1 and MAPE=3.03. Our results indicate that the considered models (and specially RFR) all are effective in predicting the amount of produced wastes derived from input data for the considered countries.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/su14031233</doi><orcidid>https://orcid.org/0000-0001-6598-3448</orcidid><orcidid>https://orcid.org/0000-0001-9069-6985</orcidid><orcidid>https://orcid.org/0000-0003-3567-1173</orcidid><orcidid>https://orcid.org/0000-0003-0911-1834</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2071-1050
ispartof Sustainability, 2022-02, Vol.14 (3), p.1233
issn 2071-1050
2071-1050
language eng
recordid cdi_proquest_journals_2627838199
source MDPI - Multidisciplinary Digital Publishing Institute; EZB-FREE-00999 freely available EZB journals
subjects Artificial intelligence
Clustering
Data analysis
Datasets
Demographic variables
Design of experiments
Genetic algorithms
Learning algorithms
Machine learning
Open data
Prediction models
Regression analysis
Root-mean-square errors
Socioeconomic factors
Socioeconomics
Solid waste management
Solid wastes
Support vector machines
Sustainability
System effectiveness
Waste analysis
Waste management
Wastes
title Solid Waste Analysis Using Open-Access Socio-Economic Data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T22%3A45%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Solid%20Waste%20Analysis%20Using%20Open-Access%20Socio-Economic%20Data&rft.jtitle=Sustainability&rft.au=Dunkel,%20J%C3%BCrgen&rft.date=2022-02-01&rft.volume=14&rft.issue=3&rft.spage=1233&rft.pages=1233-&rft.issn=2071-1050&rft.eissn=2071-1050&rft_id=info:doi/10.3390/su14031233&rft_dat=%3Cproquest_cross%3E2627838199%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2627838199&rft_id=info:pmid/&rfr_iscdi=true