Descriptive inference using large, unrepresentative nonprobability samples: An introduction for ecologists

Biodiversity monitoring usually involves drawing inferences about some variable of interest across a defined landscape from observations made at a sample of locations within that landscape. If the variable of interest differs between sampled and nonsampled locations, and no mitigating action is take...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Ecology (Durham) 2024-02, Vol.105 (2), p.e4214-n/a
Hauptverfasser: Boyd, Robin J., Stewart, Gavin B., Pescott, Oliver L.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 2
container_start_page e4214
container_title Ecology (Durham)
container_volume 105
creator Boyd, Robin J.
Stewart, Gavin B.
Pescott, Oliver L.
description Biodiversity monitoring usually involves drawing inferences about some variable of interest across a defined landscape from observations made at a sample of locations within that landscape. If the variable of interest differs between sampled and nonsampled locations, and no mitigating action is taken, then the sample is unrepresentative and inferences drawn from it will be biased. It is possible to adjust unrepresentative samples so that they more closely resemble the wider landscape in terms of “auxiliary variables.” A good auxiliary variable is a common cause of sample inclusion and the variable of interest, and if it explains an appreciable portion of the variance in both, then inferences drawn from the adjusted sample will be closer to the truth. We applied six types of survey sample adjustment—subsampling, quasirandomization, poststratification, superpopulation modeling, a “doubly robust” procedure, and multilevel regression and poststratification—to a simple two‐part biodiversity monitoring problem. The first part was to estimate the mean occupancy of the plant Calluna vulgaris in Great Britain in two time periods (1987–1999 and 2010–2019); the second was to estimate the difference between the two (i.e., the trend). We estimated the means and trend using large, but (originally) unrepresentative, samples from a citizen science dataset. Compared with the unadjusted estimates, the means and trends estimated using most adjustment methods were more accurate, although standard uncertainty intervals generally did not cover the true values. Completely unbiased inference is not possible from an unrepresentative sample without knowing and having data on all relevant auxiliary variables. Adjustments can reduce the bias if auxiliary variables are available and selected carefully, but the potential for residual bias should be acknowledged and reported.
doi_str_mv 10.1002/ecy.4214
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10929663</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920453931</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4394-d4b153a9234b6fd7a6ef4c295ceb17d868f274ff3f9c5a0daa8cbe3bb2d2a5603</originalsourceid><addsrcrecordid>eNp1kctqHDEQRYVJsMeOwV8QGrLJwu3o1T1SNsFMnAcYskkWXglJXZpo0Ehtqdth_j7yM3EgtalFHQ63uAidEHxGMKbvwO7OOCV8Dy2IZLKVZIlfoAXGhLay78QBOixlg-sQLvbRARNYCNyTBdp8hGKzHyd_A42PDjJEC81cfFw3Qec1nDZzzDBmKBAnfcfFFMecjDY--GnXFL0dA5T3zXmsiimnYbaTT7FxKTdgU0hrX6byCr10OhQ4fthH6Meni--rL-3lt89fV-eXreVM8nbghnRMS8q46d2w1D04bqnsLBiyHEQvHF1y55iTttN40FpYA8wYOlDd9ZgdoQ_33nE2WxhsjZ11UGP2W513Kmmvnl-i_6nW6UYRLKnse1YNbx8MOV3PUCa19cVCCDpCmouiElPJqWBdRd_8g27SnGP9r1IU845JRv4IbU6lZHBPaQhWtw2q2qC6bbCir_9O_wQ-VlaB9h745QPs_itSF6urO-FvSh6ohg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920453931</pqid></control><display><type>article</type><title>Descriptive inference using large, unrepresentative nonprobability samples: An introduction for ecologists</title><source>MEDLINE</source><source>Wiley Online Library Journals Frontfile Complete</source><creator>Boyd, Robin J. ; Stewart, Gavin B. ; Pescott, Oliver L.</creator><creatorcontrib>Boyd, Robin J. ; Stewart, Gavin B. ; Pescott, Oliver L.</creatorcontrib><description>Biodiversity monitoring usually involves drawing inferences about some variable of interest across a defined landscape from observations made at a sample of locations within that landscape. If the variable of interest differs between sampled and nonsampled locations, and no mitigating action is taken, then the sample is unrepresentative and inferences drawn from it will be biased. It is possible to adjust unrepresentative samples so that they more closely resemble the wider landscape in terms of “auxiliary variables.” A good auxiliary variable is a common cause of sample inclusion and the variable of interest, and if it explains an appreciable portion of the variance in both, then inferences drawn from the adjusted sample will be closer to the truth. We applied six types of survey sample adjustment—subsampling, quasirandomization, poststratification, superpopulation modeling, a “doubly robust” procedure, and multilevel regression and poststratification—to a simple two‐part biodiversity monitoring problem. The first part was to estimate the mean occupancy of the plant Calluna vulgaris in Great Britain in two time periods (1987–1999 and 2010–2019); the second was to estimate the difference between the two (i.e., the trend). We estimated the means and trend using large, but (originally) unrepresentative, samples from a citizen science dataset. Compared with the unadjusted estimates, the means and trends estimated using most adjustment methods were more accurate, although standard uncertainty intervals generally did not cover the true values. Completely unbiased inference is not possible from an unrepresentative sample without knowing and having data on all relevant auxiliary variables. Adjustments can reduce the bias if auxiliary variables are available and selected carefully, but the potential for residual bias should be acknowledged and reported.</description><identifier>ISSN: 0012-9658</identifier><identifier>EISSN: 1939-9170</identifier><identifier>DOI: 10.1002/ecy.4214</identifier><identifier>PMID: 38088061</identifier><language>eng</language><publisher>Hoboken, USA: John Wiley &amp; Sons, Inc</publisher><subject>Bias ; Biodiversity ; biodiversity monitoring ; Ecological monitoring ; Ecology ; Inference ; nonprobability samples ; Robustness (mathematics) ; Surveys and Questionnaires ; Trends ; Variables ; weighting</subject><ispartof>Ecology (Durham), 2024-02, Vol.105 (2), p.e4214-n/a</ispartof><rights>2023 The Authors. published by Wiley Periodicals LLC on behalf of The Ecological Society of America.</rights><rights>2023 The Authors. Ecology published by Wiley Periodicals LLC on behalf of The Ecological Society of America.</rights><rights>2023. This article is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4394-d4b153a9234b6fd7a6ef4c295ceb17d868f274ff3f9c5a0daa8cbe3bb2d2a5603</citedby><cites>FETCH-LOGICAL-c4394-d4b153a9234b6fd7a6ef4c295ceb17d868f274ff3f9c5a0daa8cbe3bb2d2a5603</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fecy.4214$$EPDF$$P50$$Gwiley$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fecy.4214$$EHTML$$P50$$Gwiley$$Hfree_for_read</linktohtml><link.rule.ids>230,314,780,784,885,1416,27922,27923,45572,45573</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38088061$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Boyd, Robin J.</creatorcontrib><creatorcontrib>Stewart, Gavin B.</creatorcontrib><creatorcontrib>Pescott, Oliver L.</creatorcontrib><title>Descriptive inference using large, unrepresentative nonprobability samples: An introduction for ecologists</title><title>Ecology (Durham)</title><addtitle>Ecology</addtitle><description>Biodiversity monitoring usually involves drawing inferences about some variable of interest across a defined landscape from observations made at a sample of locations within that landscape. If the variable of interest differs between sampled and nonsampled locations, and no mitigating action is taken, then the sample is unrepresentative and inferences drawn from it will be biased. It is possible to adjust unrepresentative samples so that they more closely resemble the wider landscape in terms of “auxiliary variables.” A good auxiliary variable is a common cause of sample inclusion and the variable of interest, and if it explains an appreciable portion of the variance in both, then inferences drawn from the adjusted sample will be closer to the truth. We applied six types of survey sample adjustment—subsampling, quasirandomization, poststratification, superpopulation modeling, a “doubly robust” procedure, and multilevel regression and poststratification—to a simple two‐part biodiversity monitoring problem. The first part was to estimate the mean occupancy of the plant Calluna vulgaris in Great Britain in two time periods (1987–1999 and 2010–2019); the second was to estimate the difference between the two (i.e., the trend). We estimated the means and trend using large, but (originally) unrepresentative, samples from a citizen science dataset. Compared with the unadjusted estimates, the means and trends estimated using most adjustment methods were more accurate, although standard uncertainty intervals generally did not cover the true values. Completely unbiased inference is not possible from an unrepresentative sample without knowing and having data on all relevant auxiliary variables. Adjustments can reduce the bias if auxiliary variables are available and selected carefully, but the potential for residual bias should be acknowledged and reported.</description><subject>Bias</subject><subject>Biodiversity</subject><subject>biodiversity monitoring</subject><subject>Ecological monitoring</subject><subject>Ecology</subject><subject>Inference</subject><subject>nonprobability samples</subject><subject>Robustness (mathematics)</subject><subject>Surveys and Questionnaires</subject><subject>Trends</subject><subject>Variables</subject><subject>weighting</subject><issn>0012-9658</issn><issn>1939-9170</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>WIN</sourceid><sourceid>EIF</sourceid><recordid>eNp1kctqHDEQRYVJsMeOwV8QGrLJwu3o1T1SNsFMnAcYskkWXglJXZpo0Ehtqdth_j7yM3EgtalFHQ63uAidEHxGMKbvwO7OOCV8Dy2IZLKVZIlfoAXGhLay78QBOixlg-sQLvbRARNYCNyTBdp8hGKzHyd_A42PDjJEC81cfFw3Qec1nDZzzDBmKBAnfcfFFMecjDY--GnXFL0dA5T3zXmsiimnYbaTT7FxKTdgU0hrX6byCr10OhQ4fthH6Meni--rL-3lt89fV-eXreVM8nbghnRMS8q46d2w1D04bqnsLBiyHEQvHF1y55iTttN40FpYA8wYOlDd9ZgdoQ_33nE2WxhsjZ11UGP2W513Kmmvnl-i_6nW6UYRLKnse1YNbx8MOV3PUCa19cVCCDpCmouiElPJqWBdRd_8g27SnGP9r1IU845JRv4IbU6lZHBPaQhWtw2q2qC6bbCir_9O_wQ-VlaB9h745QPs_itSF6urO-FvSh6ohg</recordid><startdate>202402</startdate><enddate>202402</enddate><creator>Boyd, Robin J.</creator><creator>Stewart, Gavin B.</creator><creator>Pescott, Oliver L.</creator><general>John Wiley &amp; Sons, Inc</general><general>Ecological Society of America</general><scope>24P</scope><scope>WIN</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7SN</scope><scope>7SS</scope><scope>7ST</scope><scope>7T7</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>K9.</scope><scope>P64</scope><scope>RC3</scope><scope>SOI</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>202402</creationdate><title>Descriptive inference using large, unrepresentative nonprobability samples: An introduction for ecologists</title><author>Boyd, Robin J. ; Stewart, Gavin B. ; Pescott, Oliver L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4394-d4b153a9234b6fd7a6ef4c295ceb17d868f274ff3f9c5a0daa8cbe3bb2d2a5603</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Bias</topic><topic>Biodiversity</topic><topic>biodiversity monitoring</topic><topic>Ecological monitoring</topic><topic>Ecology</topic><topic>Inference</topic><topic>nonprobability samples</topic><topic>Robustness (mathematics)</topic><topic>Surveys and Questionnaires</topic><topic>Trends</topic><topic>Variables</topic><topic>weighting</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Boyd, Robin J.</creatorcontrib><creatorcontrib>Stewart, Gavin B.</creatorcontrib><creatorcontrib>Pescott, Oliver L.</creatorcontrib><collection>Wiley-Blackwell Open Access Titles</collection><collection>Wiley Free Content</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Environment Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>Environment Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Ecology (Durham)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Boyd, Robin J.</au><au>Stewart, Gavin B.</au><au>Pescott, Oliver L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Descriptive inference using large, unrepresentative nonprobability samples: An introduction for ecologists</atitle><jtitle>Ecology (Durham)</jtitle><addtitle>Ecology</addtitle><date>2024-02</date><risdate>2024</risdate><volume>105</volume><issue>2</issue><spage>e4214</spage><epage>n/a</epage><pages>e4214-n/a</pages><issn>0012-9658</issn><eissn>1939-9170</eissn><abstract>Biodiversity monitoring usually involves drawing inferences about some variable of interest across a defined landscape from observations made at a sample of locations within that landscape. If the variable of interest differs between sampled and nonsampled locations, and no mitigating action is taken, then the sample is unrepresentative and inferences drawn from it will be biased. It is possible to adjust unrepresentative samples so that they more closely resemble the wider landscape in terms of “auxiliary variables.” A good auxiliary variable is a common cause of sample inclusion and the variable of interest, and if it explains an appreciable portion of the variance in both, then inferences drawn from the adjusted sample will be closer to the truth. We applied six types of survey sample adjustment—subsampling, quasirandomization, poststratification, superpopulation modeling, a “doubly robust” procedure, and multilevel regression and poststratification—to a simple two‐part biodiversity monitoring problem. The first part was to estimate the mean occupancy of the plant Calluna vulgaris in Great Britain in two time periods (1987–1999 and 2010–2019); the second was to estimate the difference between the two (i.e., the trend). We estimated the means and trend using large, but (originally) unrepresentative, samples from a citizen science dataset. Compared with the unadjusted estimates, the means and trends estimated using most adjustment methods were more accurate, although standard uncertainty intervals generally did not cover the true values. Completely unbiased inference is not possible from an unrepresentative sample without knowing and having data on all relevant auxiliary variables. Adjustments can reduce the bias if auxiliary variables are available and selected carefully, but the potential for residual bias should be acknowledged and reported.</abstract><cop>Hoboken, USA</cop><pub>John Wiley &amp; Sons, Inc</pub><pmid>38088061</pmid><doi>10.1002/ecy.4214</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0012-9658
ispartof Ecology (Durham), 2024-02, Vol.105 (2), p.e4214-n/a
issn 0012-9658
1939-9170
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10929663
source MEDLINE; Wiley Online Library Journals Frontfile Complete
subjects Bias
Biodiversity
biodiversity monitoring
Ecological monitoring
Ecology
Inference
nonprobability samples
Robustness (mathematics)
Surveys and Questionnaires
Trends
Variables
weighting
title Descriptive inference using large, unrepresentative nonprobability samples: An introduction for ecologists
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T00%3A51%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Descriptive%20inference%20using%20large,%20unrepresentative%20nonprobability%20samples:%20An%20introduction%20for%20ecologists&rft.jtitle=Ecology%20(Durham)&rft.au=Boyd,%20Robin%20J.&rft.date=2024-02&rft.volume=105&rft.issue=2&rft.spage=e4214&rft.epage=n/a&rft.pages=e4214-n/a&rft.issn=0012-9658&rft.eissn=1939-9170&rft_id=info:doi/10.1002/ecy.4214&rft_dat=%3Cproquest_pubme%3E2920453931%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2920453931&rft_id=info:pmid/38088061&rfr_iscdi=true