Prediction of the Daily Mean PM10 Concentrations Using Linear Models

The performance of five linear models to predict the daily mean PM10 concentrations was compared. The linear models proposed were: i) multiple linear regression; ii) principal component regression; iii) independent component regression; iv) quantile regression; and v) partial least squares regressio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:American journal of environmental sciences 2008-05, Vol.4 (5), p.445-453
Hauptverfasser: Pires, J. C. M., Martins, F. G., Sousa, S. I. V., Alvim-Ferr, M. C. M., Pereira, M. C.
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 453
container_issue 5
container_start_page 445
container_title American journal of environmental sciences
container_volume 4
creator Pires, J. C. M.
Martins, F. G.
Sousa, S. I. V.
Alvim-Ferr, M. C. M.
Pereira, M. C.
description The performance of five linear models to predict the daily mean PM10 concentrations was compared. The linear models proposed were: i) multiple linear regression; ii) principal component regression; iii) independent component regression; iv) quantile regression; and v) partial least squares regression. The study was based on data from an urban site in Oporto Metropolitan Area and the analysed period was from January 2003 to December 2005. The linear models were evaluated with two datasets of different sizes belonging to the analysed period. Environmental data (SO2, CO, NO, NO2 and PM10 concentrations) and meteorological data (temperature, relative humidity and wind speed) were used as PM10 predictors. During the training step, quantile regression presented the lowest residual errors for the two datasets. Independent component regression was the worst model using the larger dataset. Multiple linear regression, principal component regression and partial least squares regression presented similar results for both datasets. During the test step, independent component regression and quantile regression showed bad performance, while multiple linear regression, principal component regression and partial least squares regression presented similar results using the larger dataset. For the smaller dataset, the models that remove the correlation of the variables (principal component regression, independent component regression and partial least squares regression) presented better results than multiple linear regression and quantile regression. Independent component regression was the linear model with the lowest value of residual error. Concluding, the dataset size is also an important parameter for the evaluation of the models concerning the prediction of variables. The prediction of the daily mean PM10 concentrations was more efficient when using independent component regression for the smaller dataset and partial least squares regression for the larger datasets.
doi_str_mv 10.3844/ajessp.2008.445.453
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_20371723</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>20371723</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-56dae433d11e18f0e769745755978723635048426964e7e30ce4680519d51c853</originalsourceid><addsrcrecordid>eNot0DtPwzAYhWEPIFEuv4DFE1uCnc-3jKjlJrWiA5XYLMv5Aq5Su9jp0H9PqjKd5dUZHkLuOavBCPHotljKvm4YM7UQshYSLsiMSwkVCPl1Ra5L2TIGqtF6RhbrjF3wY0iRpp6OP0gXLgxHukIX6XrFGZ2n6DGO2Z2iQjclxG-6DBFdpqvU4VBuyWXvhoJ3_3tDNi_Pn_O3avnx-j5_Wla-UXyspOocCoCOc-SmZ6hVq4XUUrba6AYUSCaMaFSrBGoE5lEowyRvO8m9kXBDHs6_-5x-D1hGuwvF4zC4iOlQbMNA8-loCuEc-pxKydjbfQ47l4-WM3tSsmcle1Kyk5KdlOAPgF5b7w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>20371723</pqid></control><display><type>article</type><title>Prediction of the Daily Mean PM10 Concentrations Using Linear Models</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Free Full-Text Journals in Chemistry</source><creator>Pires, J. C. M. ; Martins, F. G. ; Sousa, S. I. V. ; Alvim-Ferr, M. C. M. ; Pereira, M. C.</creator><creatorcontrib>Pires, J. C. M. ; Martins, F. G. ; Sousa, S. I. V. ; Alvim-Ferr, M. C. M. ; Pereira, M. C.</creatorcontrib><description>The performance of five linear models to predict the daily mean PM10 concentrations was compared. The linear models proposed were: i) multiple linear regression; ii) principal component regression; iii) independent component regression; iv) quantile regression; and v) partial least squares regression. The study was based on data from an urban site in Oporto Metropolitan Area and the analysed period was from January 2003 to December 2005. The linear models were evaluated with two datasets of different sizes belonging to the analysed period. Environmental data (SO2, CO, NO, NO2 and PM10 concentrations) and meteorological data (temperature, relative humidity and wind speed) were used as PM10 predictors. During the training step, quantile regression presented the lowest residual errors for the two datasets. Independent component regression was the worst model using the larger dataset. Multiple linear regression, principal component regression and partial least squares regression presented similar results for both datasets. During the test step, independent component regression and quantile regression showed bad performance, while multiple linear regression, principal component regression and partial least squares regression presented similar results using the larger dataset. For the smaller dataset, the models that remove the correlation of the variables (principal component regression, independent component regression and partial least squares regression) presented better results than multiple linear regression and quantile regression. Independent component regression was the linear model with the lowest value of residual error. Concluding, the dataset size is also an important parameter for the evaluation of the models concerning the prediction of variables. The prediction of the daily mean PM10 concentrations was more efficient when using independent component regression for the smaller dataset and partial least squares regression for the larger datasets.</description><identifier>ISSN: 1553-345X</identifier><identifier>DOI: 10.3844/ajessp.2008.445.453</identifier><language>eng</language><ispartof>American journal of environmental sciences, 2008-05, Vol.4 (5), p.445-453</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-56dae433d11e18f0e769745755978723635048426964e7e30ce4680519d51c853</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Pires, J. C. M.</creatorcontrib><creatorcontrib>Martins, F. G.</creatorcontrib><creatorcontrib>Sousa, S. I. V.</creatorcontrib><creatorcontrib>Alvim-Ferr, M. C. M.</creatorcontrib><creatorcontrib>Pereira, M. C.</creatorcontrib><title>Prediction of the Daily Mean PM10 Concentrations Using Linear Models</title><title>American journal of environmental sciences</title><description>The performance of five linear models to predict the daily mean PM10 concentrations was compared. The linear models proposed were: i) multiple linear regression; ii) principal component regression; iii) independent component regression; iv) quantile regression; and v) partial least squares regression. The study was based on data from an urban site in Oporto Metropolitan Area and the analysed period was from January 2003 to December 2005. The linear models were evaluated with two datasets of different sizes belonging to the analysed period. Environmental data (SO2, CO, NO, NO2 and PM10 concentrations) and meteorological data (temperature, relative humidity and wind speed) were used as PM10 predictors. During the training step, quantile regression presented the lowest residual errors for the two datasets. Independent component regression was the worst model using the larger dataset. Multiple linear regression, principal component regression and partial least squares regression presented similar results for both datasets. During the test step, independent component regression and quantile regression showed bad performance, while multiple linear regression, principal component regression and partial least squares regression presented similar results using the larger dataset. For the smaller dataset, the models that remove the correlation of the variables (principal component regression, independent component regression and partial least squares regression) presented better results than multiple linear regression and quantile regression. Independent component regression was the linear model with the lowest value of residual error. Concluding, the dataset size is also an important parameter for the evaluation of the models concerning the prediction of variables. The prediction of the daily mean PM10 concentrations was more efficient when using independent component regression for the smaller dataset and partial least squares regression for the larger datasets.</description><issn>1553-345X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2008</creationdate><recordtype>article</recordtype><recordid>eNot0DtPwzAYhWEPIFEuv4DFE1uCnc-3jKjlJrWiA5XYLMv5Aq5Su9jp0H9PqjKd5dUZHkLuOavBCPHotljKvm4YM7UQshYSLsiMSwkVCPl1Ra5L2TIGqtF6RhbrjF3wY0iRpp6OP0gXLgxHukIX6XrFGZ2n6DGO2Z2iQjclxG-6DBFdpqvU4VBuyWXvhoJ3_3tDNi_Pn_O3avnx-j5_Wla-UXyspOocCoCOc-SmZ6hVq4XUUrba6AYUSCaMaFSrBGoE5lEowyRvO8m9kXBDHs6_-5x-D1hGuwvF4zC4iOlQbMNA8-loCuEc-pxKydjbfQ47l4-WM3tSsmcle1Kyk5KdlOAPgF5b7w</recordid><startdate>20080501</startdate><enddate>20080501</enddate><creator>Pires, J. C. M.</creator><creator>Martins, F. G.</creator><creator>Sousa, S. I. V.</creator><creator>Alvim-Ferr, M. C. M.</creator><creator>Pereira, M. C.</creator><scope>AAYXX</scope><scope>CITATION</scope><scope>7TV</scope><scope>C1K</scope></search><sort><creationdate>20080501</creationdate><title>Prediction of the Daily Mean PM10 Concentrations Using Linear Models</title><author>Pires, J. C. M. ; Martins, F. G. ; Sousa, S. I. V. ; Alvim-Ferr, M. C. M. ; Pereira, M. C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-56dae433d11e18f0e769745755978723635048426964e7e30ce4680519d51c853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2008</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Pires, J. C. M.</creatorcontrib><creatorcontrib>Martins, F. G.</creatorcontrib><creatorcontrib>Sousa, S. I. V.</creatorcontrib><creatorcontrib>Alvim-Ferr, M. C. M.</creatorcontrib><creatorcontrib>Pereira, M. C.</creatorcontrib><collection>CrossRef</collection><collection>Pollution Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><jtitle>American journal of environmental sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pires, J. C. M.</au><au>Martins, F. G.</au><au>Sousa, S. I. V.</au><au>Alvim-Ferr, M. C. M.</au><au>Pereira, M. C.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of the Daily Mean PM10 Concentrations Using Linear Models</atitle><jtitle>American journal of environmental sciences</jtitle><date>2008-05-01</date><risdate>2008</risdate><volume>4</volume><issue>5</issue><spage>445</spage><epage>453</epage><pages>445-453</pages><issn>1553-345X</issn><abstract>The performance of five linear models to predict the daily mean PM10 concentrations was compared. The linear models proposed were: i) multiple linear regression; ii) principal component regression; iii) independent component regression; iv) quantile regression; and v) partial least squares regression. The study was based on data from an urban site in Oporto Metropolitan Area and the analysed period was from January 2003 to December 2005. The linear models were evaluated with two datasets of different sizes belonging to the analysed period. Environmental data (SO2, CO, NO, NO2 and PM10 concentrations) and meteorological data (temperature, relative humidity and wind speed) were used as PM10 predictors. During the training step, quantile regression presented the lowest residual errors for the two datasets. Independent component regression was the worst model using the larger dataset. Multiple linear regression, principal component regression and partial least squares regression presented similar results for both datasets. During the test step, independent component regression and quantile regression showed bad performance, while multiple linear regression, principal component regression and partial least squares regression presented similar results using the larger dataset. For the smaller dataset, the models that remove the correlation of the variables (principal component regression, independent component regression and partial least squares regression) presented better results than multiple linear regression and quantile regression. Independent component regression was the linear model with the lowest value of residual error. Concluding, the dataset size is also an important parameter for the evaluation of the models concerning the prediction of variables. The prediction of the daily mean PM10 concentrations was more efficient when using independent component regression for the smaller dataset and partial least squares regression for the larger datasets.</abstract><doi>10.3844/ajessp.2008.445.453</doi><tpages>9</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1553-345X
ispartof American journal of environmental sciences, 2008-05, Vol.4 (5), p.445-453
issn 1553-345X
language eng
recordid cdi_proquest_miscellaneous_20371723
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Free Full-Text Journals in Chemistry
title Prediction of the Daily Mean PM10 Concentrations Using Linear Models
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T00%3A16%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20the%20Daily%20Mean%20PM10%20Concentrations%20Using%20Linear%20Models&rft.jtitle=American%20journal%20of%20environmental%20sciences&rft.au=Pires,%20J.%20C.%20M.&rft.date=2008-05-01&rft.volume=4&rft.issue=5&rft.spage=445&rft.epage=453&rft.pages=445-453&rft.issn=1553-345X&rft_id=info:doi/10.3844/ajessp.2008.445.453&rft_dat=%3Cproquest_cross%3E20371723%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=20371723&rft_id=info:pmid/&rfr_iscdi=true