The sensitivity of satellite-based PM 2.5 estimates to its inputs: Implications to model development in data-poor regions

Exposure to fine particulate matter (PM ) has been associated with a wide range of negative health outcomes. The overwhelming majority of the epidemiological studies that helped establish such associations was conducted in regions with sufficient ground observations and other supporting data, i.e.,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Environment international 2018-12, Vol.121 (Pt 1), p.550
Hauptverfasser: Geng, Guannan, Murray, Nancy L, Chang, Howard H, Liu, Yang
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue Pt 1
container_start_page 550
container_title Environment international
container_volume 121
creator Geng, Guannan
Murray, Nancy L
Chang, Howard H
Liu, Yang
description Exposure to fine particulate matter (PM ) has been associated with a wide range of negative health outcomes. The overwhelming majority of the epidemiological studies that helped establish such associations was conducted in regions with sufficient ground observations and other supporting data, i.e., the data-rich regions. However, air pollution health effects research in the data-poor regions, where pollution levels are often the highest, is still very limited due to the lack of high-quality exposure estimates. To improve our understanding of the desired input datasets for the application of satellite-based PM exposure models in data-poor areas, we applied a Bayesian ensemble model in the southeast U.S. that was selected as a representative data-rich region. We designed four groups of sensitivity tests to simulate various data-poor scenarios. The factors considered that would influence the model performance included the temporal sampling frequency of the monitors, the number of ground monitors, the accuracy of the chemical transport model simulation of PM concentrations, and different combinations of the additional predictors. While our full model achieved a 10-fold cross-validated (CV) R of 0.82, we found that when reducing the sampling frequency from the current 1-in-3 day to 1-in-9 day, the CV R decreased to 0.58, and the predictions could not capture the daily variations of PM . Half of the current stations (i.e., 30 monitors) could still support a robust model with a CV R of 0.79. With 20 monitors, the CV R decreased from 0.71 to 0.55 when 100% additional random errors were added to the original CMAQ simulations. However, with a sufficient number of ground monitors (e.g., 30 monitors), our Bayesian ensemble model had the ability to tolerate CMAQ errors with only a slight decrease in CV R (from 0.79 to 0.75). With fewer than 15 monitors, our full model collapsed and failed to fit any covariates, while the models with only time-varying variables could still converge even with only five monitors left. A model without the land use parameters lacked fine spatial details in the prediction maps, but could still capture the daily variability of PM (CV R  ≥ 0.67) and might support a study of the acute health effects of PM exposure.
doi_str_mv 10.1016/j.envint.2018.09.051
format Article
fullrecord <record><control><sourceid>pubmed</sourceid><recordid>TN_cdi_pubmed_primary_30300813</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>30300813</sourcerecordid><originalsourceid>FETCH-pubmed_primary_303008133</originalsourceid><addsrcrecordid>eNqFjsFKAzEURYMgtlr_QOT9wMSXidOObkWxC8FF9yVtXjUlk4R5rwPz946ia1d3cQ6cq9SNQW3QLO-OmtIQkugaTavxQWNjztTctCtbLVcNztQl8xER6_u2uVAzixaxNXauxs0nAVPiIGEIMkI-ADuhGINQtXNMHt7foNYNEEvoJsQgGYIwhFROwo-w7koMeychpx_WZU8RPA0Uc-koyWSCd-KqknMPPX18mwt1fnCR6fp3r9Tty_Pm6bUqp11Hflv6qdaP27-v9l_hC58oUkY</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>The sensitivity of satellite-based PM 2.5 estimates to its inputs: Implications to model development in data-poor regions</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Geng, Guannan ; Murray, Nancy L ; Chang, Howard H ; Liu, Yang</creator><creatorcontrib>Geng, Guannan ; Murray, Nancy L ; Chang, Howard H ; Liu, Yang</creatorcontrib><description>Exposure to fine particulate matter (PM ) has been associated with a wide range of negative health outcomes. The overwhelming majority of the epidemiological studies that helped establish such associations was conducted in regions with sufficient ground observations and other supporting data, i.e., the data-rich regions. However, air pollution health effects research in the data-poor regions, where pollution levels are often the highest, is still very limited due to the lack of high-quality exposure estimates. To improve our understanding of the desired input datasets for the application of satellite-based PM exposure models in data-poor areas, we applied a Bayesian ensemble model in the southeast U.S. that was selected as a representative data-rich region. We designed four groups of sensitivity tests to simulate various data-poor scenarios. The factors considered that would influence the model performance included the temporal sampling frequency of the monitors, the number of ground monitors, the accuracy of the chemical transport model simulation of PM concentrations, and different combinations of the additional predictors. While our full model achieved a 10-fold cross-validated (CV) R of 0.82, we found that when reducing the sampling frequency from the current 1-in-3 day to 1-in-9 day, the CV R decreased to 0.58, and the predictions could not capture the daily variations of PM . Half of the current stations (i.e., 30 monitors) could still support a robust model with a CV R of 0.79. With 20 monitors, the CV R decreased from 0.71 to 0.55 when 100% additional random errors were added to the original CMAQ simulations. However, with a sufficient number of ground monitors (e.g., 30 monitors), our Bayesian ensemble model had the ability to tolerate CMAQ errors with only a slight decrease in CV R (from 0.79 to 0.75). With fewer than 15 monitors, our full model collapsed and failed to fit any covariates, while the models with only time-varying variables could still converge even with only five monitors left. A model without the land use parameters lacked fine spatial details in the prediction maps, but could still capture the daily variability of PM (CV R  ≥ 0.67) and might support a study of the acute health effects of PM exposure.</description><identifier>EISSN: 1873-6750</identifier><identifier>DOI: 10.1016/j.envint.2018.09.051</identifier><identifier>PMID: 30300813</identifier><language>eng</language><publisher>Netherlands</publisher><ispartof>Environment international, 2018-12, Vol.121 (Pt 1), p.550</ispartof><rights>Copyright © 2018 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30300813$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Geng, Guannan</creatorcontrib><creatorcontrib>Murray, Nancy L</creatorcontrib><creatorcontrib>Chang, Howard H</creatorcontrib><creatorcontrib>Liu, Yang</creatorcontrib><title>The sensitivity of satellite-based PM 2.5 estimates to its inputs: Implications to model development in data-poor regions</title><title>Environment international</title><addtitle>Environ Int</addtitle><description>Exposure to fine particulate matter (PM ) has been associated with a wide range of negative health outcomes. The overwhelming majority of the epidemiological studies that helped establish such associations was conducted in regions with sufficient ground observations and other supporting data, i.e., the data-rich regions. However, air pollution health effects research in the data-poor regions, where pollution levels are often the highest, is still very limited due to the lack of high-quality exposure estimates. To improve our understanding of the desired input datasets for the application of satellite-based PM exposure models in data-poor areas, we applied a Bayesian ensemble model in the southeast U.S. that was selected as a representative data-rich region. We designed four groups of sensitivity tests to simulate various data-poor scenarios. The factors considered that would influence the model performance included the temporal sampling frequency of the monitors, the number of ground monitors, the accuracy of the chemical transport model simulation of PM concentrations, and different combinations of the additional predictors. While our full model achieved a 10-fold cross-validated (CV) R of 0.82, we found that when reducing the sampling frequency from the current 1-in-3 day to 1-in-9 day, the CV R decreased to 0.58, and the predictions could not capture the daily variations of PM . Half of the current stations (i.e., 30 monitors) could still support a robust model with a CV R of 0.79. With 20 monitors, the CV R decreased from 0.71 to 0.55 when 100% additional random errors were added to the original CMAQ simulations. However, with a sufficient number of ground monitors (e.g., 30 monitors), our Bayesian ensemble model had the ability to tolerate CMAQ errors with only a slight decrease in CV R (from 0.79 to 0.75). With fewer than 15 monitors, our full model collapsed and failed to fit any covariates, while the models with only time-varying variables could still converge even with only five monitors left. A model without the land use parameters lacked fine spatial details in the prediction maps, but could still capture the daily variability of PM (CV R  ≥ 0.67) and might support a study of the acute health effects of PM exposure.</description><issn>1873-6750</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNqFjsFKAzEURYMgtlr_QOT9wMSXidOObkWxC8FF9yVtXjUlk4R5rwPz946ia1d3cQ6cq9SNQW3QLO-OmtIQkugaTavxQWNjztTctCtbLVcNztQl8xER6_u2uVAzixaxNXauxs0nAVPiIGEIMkI-ADuhGINQtXNMHt7foNYNEEvoJsQgGYIwhFROwo-w7koMeychpx_WZU8RPA0Uc-koyWSCd-KqknMPPX18mwt1fnCR6fp3r9Tty_Pm6bUqp11Hflv6qdaP27-v9l_hC58oUkY</recordid><startdate>201812</startdate><enddate>201812</enddate><creator>Geng, Guannan</creator><creator>Murray, Nancy L</creator><creator>Chang, Howard H</creator><creator>Liu, Yang</creator><scope>NPM</scope></search><sort><creationdate>201812</creationdate><title>The sensitivity of satellite-based PM 2.5 estimates to its inputs: Implications to model development in data-poor regions</title><author>Geng, Guannan ; Murray, Nancy L ; Chang, Howard H ; Liu, Yang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-pubmed_primary_303008133</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Geng, Guannan</creatorcontrib><creatorcontrib>Murray, Nancy L</creatorcontrib><creatorcontrib>Chang, Howard H</creatorcontrib><creatorcontrib>Liu, Yang</creatorcontrib><collection>PubMed</collection><jtitle>Environment international</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Geng, Guannan</au><au>Murray, Nancy L</au><au>Chang, Howard H</au><au>Liu, Yang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The sensitivity of satellite-based PM 2.5 estimates to its inputs: Implications to model development in data-poor regions</atitle><jtitle>Environment international</jtitle><addtitle>Environ Int</addtitle><date>2018-12</date><risdate>2018</risdate><volume>121</volume><issue>Pt 1</issue><spage>550</spage><pages>550-</pages><eissn>1873-6750</eissn><abstract>Exposure to fine particulate matter (PM ) has been associated with a wide range of negative health outcomes. The overwhelming majority of the epidemiological studies that helped establish such associations was conducted in regions with sufficient ground observations and other supporting data, i.e., the data-rich regions. However, air pollution health effects research in the data-poor regions, where pollution levels are often the highest, is still very limited due to the lack of high-quality exposure estimates. To improve our understanding of the desired input datasets for the application of satellite-based PM exposure models in data-poor areas, we applied a Bayesian ensemble model in the southeast U.S. that was selected as a representative data-rich region. We designed four groups of sensitivity tests to simulate various data-poor scenarios. The factors considered that would influence the model performance included the temporal sampling frequency of the monitors, the number of ground monitors, the accuracy of the chemical transport model simulation of PM concentrations, and different combinations of the additional predictors. While our full model achieved a 10-fold cross-validated (CV) R of 0.82, we found that when reducing the sampling frequency from the current 1-in-3 day to 1-in-9 day, the CV R decreased to 0.58, and the predictions could not capture the daily variations of PM . Half of the current stations (i.e., 30 monitors) could still support a robust model with a CV R of 0.79. With 20 monitors, the CV R decreased from 0.71 to 0.55 when 100% additional random errors were added to the original CMAQ simulations. However, with a sufficient number of ground monitors (e.g., 30 monitors), our Bayesian ensemble model had the ability to tolerate CMAQ errors with only a slight decrease in CV R (from 0.79 to 0.75). With fewer than 15 monitors, our full model collapsed and failed to fit any covariates, while the models with only time-varying variables could still converge even with only five monitors left. A model without the land use parameters lacked fine spatial details in the prediction maps, but could still capture the daily variability of PM (CV R  ≥ 0.67) and might support a study of the acute health effects of PM exposure.</abstract><cop>Netherlands</cop><pmid>30300813</pmid><doi>10.1016/j.envint.2018.09.051</doi></addata></record>
fulltext fulltext
identifier EISSN: 1873-6750
ispartof Environment international, 2018-12, Vol.121 (Pt 1), p.550
issn 1873-6750
language eng
recordid cdi_pubmed_primary_30300813
source Elsevier ScienceDirect Journals Complete
title The sensitivity of satellite-based PM 2.5 estimates to its inputs: Implications to model development in data-poor regions
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T18%3A35%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pubmed&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20sensitivity%20of%20satellite-based%20PM%202.5%20estimates%20to%20its%20inputs:%20Implications%20to%20model%20development%20in%20data-poor%20regions&rft.jtitle=Environment%20international&rft.au=Geng,%20Guannan&rft.date=2018-12&rft.volume=121&rft.issue=Pt%201&rft.spage=550&rft.pages=550-&rft.eissn=1873-6750&rft_id=info:doi/10.1016/j.envint.2018.09.051&rft_dat=%3Cpubmed%3E30300813%3C/pubmed%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/30300813&rfr_iscdi=true