Bayesian and Likelihood Inference for 2 × 2 Ecological Tables: An Incomplete-Data Approach

Ecological inference is a statistical problem where aggregate-level data are used to make inferences about individual-level behavior. In this article, we conduct a theoretical and empirical study of Bayesian and likelihood inference for 2 × 2 ecological tables by applying the general statistical fra...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Political analysis 2008, Vol.16 (1), p.41-69
Hauptverfasser: Imai, Kosuke, Lu, Ying, Strauss, Aaron
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 69
container_issue 1
container_start_page 41
container_title Political analysis
container_volume 16
creator Imai, Kosuke
Lu, Ying
Strauss, Aaron
description Ecological inference is a statistical problem where aggregate-level data are used to make inferences about individual-level behavior. In this article, we conduct a theoretical and empirical study of Bayesian and likelihood inference for 2 × 2 ecological tables by applying the general statistical framework of incomplete data. We first show that the ecological inference problem can be decomposed into three factors: distributional effects, which address the possible misspecification of parametric modeling assumptions about the unknown distribution of missing data; contextual effects, which represent the possible correlation between missing data and observed variables; and aggregation effects, which are directly related to the loss of information caused by data aggregation. We then examine how these three factors affect inference and offer new statistical methods to address each of them. To deal with distributional effects, we propose a nonparametric Bayesian model based on a Dirichlet process prior, which relaxes common parametric assumptions. We also identify the statistical adjustments necessary to account for contextual effects. Finally, although little can be done to cope with aggregation effects, we offer a method to quantify the magnitude of such effects in order to formally assess its severity. We use simulated and real data sets to empirically investigate the consequences of these three factors and to evaluate the performance of our proposed methods. C code, along with an easy-to-use R interface, is publicly available for implementing our proposed methods (Imai, Lu, and Strauss, forthcoming).
doi_str_mv 10.1093/pan/mpm017
format Article
fullrecord <record><control><sourceid>jstor_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1093_pan_mpm017</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><cupid>10_1093_pan_mpm017</cupid><jstor_id>25791916</jstor_id><oup_id>10.1093/pan/mpm017</oup_id><sourcerecordid>25791916</sourcerecordid><originalsourceid>FETCH-LOGICAL-c273t-86cd45e5f4f9f42013811a42293430e5d7b46f54d510793822f93557b1e9922e3</originalsourceid><addsrcrecordid>eNp9kE1Lw0AQhoMo-HnxLuzFixi7n9mst1qrLRQUrCJ6WDbJbk1NsmE3gv0l_iD_mCspPYmXmYH3YWZ4ougYwQsEBRm0qhnUbQ0R34r2EOVJTEUqtsMMKY-RSPlutO_9EgaCC7EXvV6plfalaoBqCjAr33VVvllbgGljtNNNroGxDmDw_RXKOLeVXZS5qsBcZZX2l2DYBDS3dVvpTsfXqlNg2LbOqvztMNoxqvL6aN0Poseb8Xw0iWd3t9PRcBbnmJMuTpO8oEwzQ40wFENEUoQUxVgQSqBmBc9oYhgtGIJckBRjIwhjPENaCIw1OYjO-r25s947bWTrylq5lURQ_mqRQYvstQT4tIftR_s_d9JzS99ZtyEx4wIJlIQ87vPSd_pzkyv3LhNOOJOT5xd5P0le7m-eHiQO_Pn6SVVnriwWWi7th2uCmL_O_wBnaone</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Bayesian and Likelihood Inference for 2 × 2 Ecological Tables: An Incomplete-Data Approach</title><source>Cambridge Journals</source><source>JSTOR Archive Collection A-Z Listing</source><source>Political Science Complete</source><creator>Imai, Kosuke ; Lu, Ying ; Strauss, Aaron</creator><creatorcontrib>Imai, Kosuke ; Lu, Ying ; Strauss, Aaron</creatorcontrib><description>Ecological inference is a statistical problem where aggregate-level data are used to make inferences about individual-level behavior. In this article, we conduct a theoretical and empirical study of Bayesian and likelihood inference for 2 × 2 ecological tables by applying the general statistical framework of incomplete data. We first show that the ecological inference problem can be decomposed into three factors: distributional effects, which address the possible misspecification of parametric modeling assumptions about the unknown distribution of missing data; contextual effects, which represent the possible correlation between missing data and observed variables; and aggregation effects, which are directly related to the loss of information caused by data aggregation. We then examine how these three factors affect inference and offer new statistical methods to address each of them. To deal with distributional effects, we propose a nonparametric Bayesian model based on a Dirichlet process prior, which relaxes common parametric assumptions. We also identify the statistical adjustments necessary to account for contextual effects. Finally, although little can be done to cope with aggregation effects, we offer a method to quantify the magnitude of such effects in order to formally assess its severity. We use simulated and real data sets to empirically investigate the consequences of these three factors and to evaluate the performance of our proposed methods. C code, along with an easy-to-use R interface, is publicly available for implementing our proposed methods (Imai, Lu, and Strauss, forthcoming).</description><identifier>ISSN: 1047-1987</identifier><identifier>EISSN: 1476-4989</identifier><identifier>DOI: 10.1093/pan/mpm017</identifier><language>eng</language><publisher>New York, US: Cambridge University Press</publisher><subject>Aggregation ; Datasets ; Ecological modeling ; Ecology ; Inference ; Missing data ; Modeling ; Nonparametric models ; Parametric models ; Population ecology</subject><ispartof>Political analysis, 2008, Vol.16 (1), p.41-69</ispartof><rights>Copyright © The Author 2007. Published by Oxford University Press on behalf of the Society for Political Methodology</rights><rights>Copyright © 2008 Society for Political Methodology</rights><rights>The Author 2007. Published by Oxford University Press on behalf of the Society for Political Methodology. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org 2008</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c273t-86cd45e5f4f9f42013811a42293430e5d7b46f54d510793822f93557b1e9922e3</citedby><cites>FETCH-LOGICAL-c273t-86cd45e5f4f9f42013811a42293430e5d7b46f54d510793822f93557b1e9922e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/25791916$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.cambridge.org/core/product/identifier/S1047198700006689/type/journal_article$$EHTML$$P50$$Gcambridge$$H</linktohtml><link.rule.ids>164,314,780,784,803,4024,27923,27924,27925,55628,58017,58250</link.rule.ids></links><search><creatorcontrib>Imai, Kosuke</creatorcontrib><creatorcontrib>Lu, Ying</creatorcontrib><creatorcontrib>Strauss, Aaron</creatorcontrib><title>Bayesian and Likelihood Inference for 2 × 2 Ecological Tables: An Incomplete-Data Approach</title><title>Political analysis</title><addtitle>Polit. anal</addtitle><description>Ecological inference is a statistical problem where aggregate-level data are used to make inferences about individual-level behavior. In this article, we conduct a theoretical and empirical study of Bayesian and likelihood inference for 2 × 2 ecological tables by applying the general statistical framework of incomplete data. We first show that the ecological inference problem can be decomposed into three factors: distributional effects, which address the possible misspecification of parametric modeling assumptions about the unknown distribution of missing data; contextual effects, which represent the possible correlation between missing data and observed variables; and aggregation effects, which are directly related to the loss of information caused by data aggregation. We then examine how these three factors affect inference and offer new statistical methods to address each of them. To deal with distributional effects, we propose a nonparametric Bayesian model based on a Dirichlet process prior, which relaxes common parametric assumptions. We also identify the statistical adjustments necessary to account for contextual effects. Finally, although little can be done to cope with aggregation effects, we offer a method to quantify the magnitude of such effects in order to formally assess its severity. We use simulated and real data sets to empirically investigate the consequences of these three factors and to evaluate the performance of our proposed methods. C code, along with an easy-to-use R interface, is publicly available for implementing our proposed methods (Imai, Lu, and Strauss, forthcoming).</description><subject>Aggregation</subject><subject>Datasets</subject><subject>Ecological modeling</subject><subject>Ecology</subject><subject>Inference</subject><subject>Missing data</subject><subject>Modeling</subject><subject>Nonparametric models</subject><subject>Parametric models</subject><subject>Population ecology</subject><issn>1047-1987</issn><issn>1476-4989</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2008</creationdate><recordtype>article</recordtype><recordid>eNp9kE1Lw0AQhoMo-HnxLuzFixi7n9mst1qrLRQUrCJ6WDbJbk1NsmE3gv0l_iD_mCspPYmXmYH3YWZ4ougYwQsEBRm0qhnUbQ0R34r2EOVJTEUqtsMMKY-RSPlutO_9EgaCC7EXvV6plfalaoBqCjAr33VVvllbgGljtNNNroGxDmDw_RXKOLeVXZS5qsBcZZX2l2DYBDS3dVvpTsfXqlNg2LbOqvztMNoxqvL6aN0Poseb8Xw0iWd3t9PRcBbnmJMuTpO8oEwzQ40wFENEUoQUxVgQSqBmBc9oYhgtGIJckBRjIwhjPENaCIw1OYjO-r25s947bWTrylq5lURQ_mqRQYvstQT4tIftR_s_d9JzS99ZtyEx4wIJlIQ87vPSd_pzkyv3LhNOOJOT5xd5P0le7m-eHiQO_Pn6SVVnriwWWi7th2uCmL_O_wBnaone</recordid><startdate>2008</startdate><enddate>2008</enddate><creator>Imai, Kosuke</creator><creator>Lu, Ying</creator><creator>Strauss, Aaron</creator><general>Cambridge University Press</general><general>Oxford University Press</general><scope>BSCLL</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>2008</creationdate><title>Bayesian and Likelihood Inference for 2 × 2 Ecological Tables: An Incomplete-Data Approach</title><author>Imai, Kosuke ; Lu, Ying ; Strauss, Aaron</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c273t-86cd45e5f4f9f42013811a42293430e5d7b46f54d510793822f93557b1e9922e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Aggregation</topic><topic>Datasets</topic><topic>Ecological modeling</topic><topic>Ecology</topic><topic>Inference</topic><topic>Missing data</topic><topic>Modeling</topic><topic>Nonparametric models</topic><topic>Parametric models</topic><topic>Population ecology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Imai, Kosuke</creatorcontrib><creatorcontrib>Lu, Ying</creatorcontrib><creatorcontrib>Strauss, Aaron</creatorcontrib><collection>Istex</collection><collection>CrossRef</collection><jtitle>Political analysis</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Imai, Kosuke</au><au>Lu, Ying</au><au>Strauss, Aaron</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bayesian and Likelihood Inference for 2 × 2 Ecological Tables: An Incomplete-Data Approach</atitle><jtitle>Political analysis</jtitle><addtitle>Polit. anal</addtitle><date>2008</date><risdate>2008</risdate><volume>16</volume><issue>1</issue><spage>41</spage><epage>69</epage><pages>41-69</pages><issn>1047-1987</issn><eissn>1476-4989</eissn><abstract>Ecological inference is a statistical problem where aggregate-level data are used to make inferences about individual-level behavior. In this article, we conduct a theoretical and empirical study of Bayesian and likelihood inference for 2 × 2 ecological tables by applying the general statistical framework of incomplete data. We first show that the ecological inference problem can be decomposed into three factors: distributional effects, which address the possible misspecification of parametric modeling assumptions about the unknown distribution of missing data; contextual effects, which represent the possible correlation between missing data and observed variables; and aggregation effects, which are directly related to the loss of information caused by data aggregation. We then examine how these three factors affect inference and offer new statistical methods to address each of them. To deal with distributional effects, we propose a nonparametric Bayesian model based on a Dirichlet process prior, which relaxes common parametric assumptions. We also identify the statistical adjustments necessary to account for contextual effects. Finally, although little can be done to cope with aggregation effects, we offer a method to quantify the magnitude of such effects in order to formally assess its severity. We use simulated and real data sets to empirically investigate the consequences of these three factors and to evaluate the performance of our proposed methods. C code, along with an easy-to-use R interface, is publicly available for implementing our proposed methods (Imai, Lu, and Strauss, forthcoming).</abstract><cop>New York, US</cop><pub>Cambridge University Press</pub><doi>10.1093/pan/mpm017</doi><tpages>29</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1047-1987
ispartof Political analysis, 2008, Vol.16 (1), p.41-69
issn 1047-1987
1476-4989
language eng
recordid cdi_crossref_primary_10_1093_pan_mpm017
source Cambridge Journals; JSTOR Archive Collection A-Z Listing; Political Science Complete
subjects Aggregation
Datasets
Ecological modeling
Ecology
Inference
Missing data
Modeling
Nonparametric models
Parametric models
Population ecology
title Bayesian and Likelihood Inference for 2 × 2 Ecological Tables: An Incomplete-Data Approach
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T06%3A21%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bayesian%20and%20Likelihood%20Inference%20for%202%20%C3%97%202%20Ecological%20Tables:%20An%20Incomplete-Data%20Approach&rft.jtitle=Political%20analysis&rft.au=Imai,%20Kosuke&rft.date=2008&rft.volume=16&rft.issue=1&rft.spage=41&rft.epage=69&rft.pages=41-69&rft.issn=1047-1987&rft.eissn=1476-4989&rft_id=info:doi/10.1093/pan/mpm017&rft_dat=%3Cjstor_cross%3E25791916%3C/jstor_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_cupid=10_1093_pan_mpm017&rft_jstor_id=25791916&rft_oup_id=10.1093/pan/mpm017&rfr_iscdi=true