Simpson's Paradox in COVID-19 Case Fatality Rates: A Mediation Analysis of Age-Related Causal Effects

We point out an instantiation of Simpson's paradox in COVID-19 case fatality rates ( cfr s): comparing a large-scale study from China (February 17) with early reports from Italy (March 9), we find that cfr s are lower in Italy for every age group, but higher overall. This phenomenon is explaine...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on artificial intelligence 2021-02, Vol.2 (1), p.18-27
Hauptverfasser: von Kugelgen, Julius, Gresele, Luigi, Scholkopf, Bernhard
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 27
container_issue 1
container_start_page 18
container_title IEEE transactions on artificial intelligence
container_volume 2
creator von Kugelgen, Julius
Gresele, Luigi
Scholkopf, Bernhard
description We point out an instantiation of Simpson's paradox in COVID-19 case fatality rates ( cfr s): comparing a large-scale study from China (February 17) with early reports from Italy (March 9), we find that cfr s are lower in Italy for every age group, but higher overall. This phenomenon is explained by a stark difference in case demographic between the two countries. Using this as a motivating example, we introduce basic concepts from mediation analysis and show how these can be used to quantify different direct and indirect effects when assuming a coarse-grained causal graph involving country, age, and case fatality. We curate an age-stratified cfr dataset with >750 k cases and conduct a case study, investigating total, direct, and indirect (age-mediated) causal effects between different countries and at different points in time. This allows us to separate age-related effects from others unrelated to age and facilitates a more transparent comparison of cfr s across countries at different stages of the COVID-19 pandemic. Using longitudinal data from Italy, we discover a sign reversal of the direct causal effect in mid-March, which temporally aligns with the reported collapse of the healthcare system in parts of the country. Moreover, we find that direct and indirect effects across 132 pairs of countries are only weakly correlated, suggesting that a country's policy and case demographic may be largely unrelated. We point out limitations and extensions for future work, and finally, discuss the role of causal reasoning in the broader context of using AI to combat the COVID-19 pandemic. Impact Statement -During a global pandemic, understanding the causal effects of risk factors such as age on COVID-19 fatality is an important scientific question. Since randomised controlled trials are typically infeasible or unethical in this context, causal investigations based on observational data-such as the one carried out in this article-will, therefore, be crucial in guiding our understanding of the available data. Causal inference, in particular mediation analysis, can be used to resolve apparent statistical paradoxes; help educate the public and decision-makers alike; avoid unsound comparisons; and answer a range of causal questions pertaining to the pandemic, subject to transparently stated assumptions. Our exposition helps clarify how mediation analysis can be used to investigate direct and indirect
doi_str_mv 10.1109/TAI.2021.3073088
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_2635248596</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9404149</ieee_id><sourcerecordid>2635248596</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3318-4310539ab08c148e43c6865c1c38f7b9bcaaae7309ba4101f14c576759e403bc3</originalsourceid><addsrcrecordid>eNpVkV1v0zAUhi0EYtPYPRIS8h3cpPj4I7G5QIrKNioNDY3BrXXingyjNO7iFNF_v1Qt1biypfO8rz8exl6DmAEI9-GuXsykkDBTolLC2mfsVJYOCm0sPH-yP2HnOf8WQkgDUsrqJTtRRiplTHnK6HtcrXPq32X-DQdcpr889nx-83PxuQDH55iJX-KIXRy3_BZHyh95zb_SMuIYU8_rHrttjpmnltf3VNxSN0HLKbjJ2PGLtqUw5lfsRYtdpvPDesZ-XF7czb8U1zdXi3l9XQSlwBZagTDKYSNsAG1Jq1Da0gQIyrZV45qAiDQ91jWoQUALOpiqrIwjLVQT1Bn7tO9db5oVLQP144CdXw9xhcPWJ4z-_0kff_n79MfbyoFW5VTw_lAwpIcN5dGvYg7UddhT2mQvy-nrtDVuh4o9GoaU80Dt8RgQfifIT4L8TpA_CJoib59e7xj4p2MC3uyBSETHsdNCg3bqETypkpo</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2635248596</pqid></control><display><type>article</type><title>Simpson's Paradox in COVID-19 Case Fatality Rates: A Mediation Analysis of Age-Related Causal Effects</title><source>IEEE Electronic Library (IEL)</source><creator>von Kugelgen, Julius ; Gresele, Luigi ; Scholkopf, Bernhard</creator><creatorcontrib>von Kugelgen, Julius ; Gresele, Luigi ; Scholkopf, Bernhard</creatorcontrib><description>We point out an instantiation of Simpson's paradox in COVID-19 case fatality rates ( cfr s): comparing a large-scale study from China (February 17) with early reports from Italy (March 9), we find that cfr s are lower in Italy for every age group, but higher overall. This phenomenon is explained by a stark difference in case demographic between the two countries. Using this as a motivating example, we introduce basic concepts from mediation analysis and show how these can be used to quantify different direct and indirect effects when assuming a coarse-grained causal graph involving country, age, and case fatality. We curate an age-stratified cfr dataset with &lt;inline-formula&gt;&lt;tex-math notation="LaTeX"&gt;&gt;&lt;/tex-math&gt;&lt;/inline-formula&gt;750 k cases and conduct a case study, investigating total, direct, and indirect (age-mediated) causal effects between different countries and at different points in time. This allows us to separate age-related effects from others unrelated to age and facilitates a more transparent comparison of cfr s across countries at different stages of the COVID-19 pandemic. Using longitudinal data from Italy, we discover a sign reversal of the direct causal effect in mid-March, which temporally aligns with the reported collapse of the healthcare system in parts of the country. Moreover, we find that direct and indirect effects across 132 pairs of countries are only weakly correlated, suggesting that a country's policy and case demographic may be largely unrelated. We point out limitations and extensions for future work, and finally, discuss the role of causal reasoning in the broader context of using AI to combat the COVID-19 pandemic. Impact Statement -During a global pandemic, understanding the causal effects of risk factors such as age on COVID-19 fatality is an important scientific question. Since randomised controlled trials are typically infeasible or unethical in this context, causal investigations based on observational data-such as the one carried out in this article-will, therefore, be crucial in guiding our understanding of the available data. Causal inference, in particular mediation analysis, can be used to resolve apparent statistical paradoxes; help educate the public and decision-makers alike; avoid unsound comparisons; and answer a range of causal questions pertaining to the pandemic, subject to transparently stated assumptions. Our exposition helps clarify how mediation analysis can be used to investigate direct and indirect effects along different causal paths and thus serves as a stepping stone for future studies of other important risk factors for COVID-19 besides age.</description><identifier>ISSN: 2691-4581</identifier><identifier>EISSN: 2691-4581</identifier><identifier>DOI: 10.1109/TAI.2021.3073088</identifier><identifier>PMID: 35233556</identifier><identifier>CODEN: ITAICB</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Causal inference ; Cognition ; COVID-19 ; Data models ; Intelligent systems ; Mediation ; mediation analysis ; Pandemics ; Senior citizens ; Simpson's paradox</subject><ispartof>IEEE transactions on artificial intelligence, 2021-02, Vol.2 (1), p.18-27</ispartof><rights>This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.</rights><rights>2021 Author</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3318-4310539ab08c148e43c6865c1c38f7b9bcaaae7309ba4101f14c576759e403bc3</citedby><cites>FETCH-LOGICAL-c3318-4310539ab08c148e43c6865c1c38f7b9bcaaae7309ba4101f14c576759e403bc3</cites><orcidid>0000-0002-8177-0925 ; 0000-0001-6469-4118 ; 0000-0001-8837-6720</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9404149$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>230,314,780,784,796,885,27924,27925,54758</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35233556$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>von Kugelgen, Julius</creatorcontrib><creatorcontrib>Gresele, Luigi</creatorcontrib><creatorcontrib>Scholkopf, Bernhard</creatorcontrib><title>Simpson's Paradox in COVID-19 Case Fatality Rates: A Mediation Analysis of Age-Related Causal Effects</title><title>IEEE transactions on artificial intelligence</title><addtitle>TAI</addtitle><addtitle>IEEE Trans Artif Intell</addtitle><description>We point out an instantiation of Simpson's paradox in COVID-19 case fatality rates ( cfr s): comparing a large-scale study from China (February 17) with early reports from Italy (March 9), we find that cfr s are lower in Italy for every age group, but higher overall. This phenomenon is explained by a stark difference in case demographic between the two countries. Using this as a motivating example, we introduce basic concepts from mediation analysis and show how these can be used to quantify different direct and indirect effects when assuming a coarse-grained causal graph involving country, age, and case fatality. We curate an age-stratified cfr dataset with &lt;inline-formula&gt;&lt;tex-math notation="LaTeX"&gt;&gt;&lt;/tex-math&gt;&lt;/inline-formula&gt;750 k cases and conduct a case study, investigating total, direct, and indirect (age-mediated) causal effects between different countries and at different points in time. This allows us to separate age-related effects from others unrelated to age and facilitates a more transparent comparison of cfr s across countries at different stages of the COVID-19 pandemic. Using longitudinal data from Italy, we discover a sign reversal of the direct causal effect in mid-March, which temporally aligns with the reported collapse of the healthcare system in parts of the country. Moreover, we find that direct and indirect effects across 132 pairs of countries are only weakly correlated, suggesting that a country's policy and case demographic may be largely unrelated. We point out limitations and extensions for future work, and finally, discuss the role of causal reasoning in the broader context of using AI to combat the COVID-19 pandemic. Impact Statement -During a global pandemic, understanding the causal effects of risk factors such as age on COVID-19 fatality is an important scientific question. Since randomised controlled trials are typically infeasible or unethical in this context, causal investigations based on observational data-such as the one carried out in this article-will, therefore, be crucial in guiding our understanding of the available data. Causal inference, in particular mediation analysis, can be used to resolve apparent statistical paradoxes; help educate the public and decision-makers alike; avoid unsound comparisons; and answer a range of causal questions pertaining to the pandemic, subject to transparently stated assumptions. Our exposition helps clarify how mediation analysis can be used to investigate direct and indirect effects along different causal paths and thus serves as a stepping stone for future studies of other important risk factors for COVID-19 besides age.</description><subject>Causal inference</subject><subject>Cognition</subject><subject>COVID-19</subject><subject>Data models</subject><subject>Intelligent systems</subject><subject>Mediation</subject><subject>mediation analysis</subject><subject>Pandemics</subject><subject>Senior citizens</subject><subject>Simpson's paradox</subject><issn>2691-4581</issn><issn>2691-4581</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><recordid>eNpVkV1v0zAUhi0EYtPYPRIS8h3cpPj4I7G5QIrKNioNDY3BrXXingyjNO7iFNF_v1Qt1biypfO8rz8exl6DmAEI9-GuXsykkDBTolLC2mfsVJYOCm0sPH-yP2HnOf8WQkgDUsrqJTtRRiplTHnK6HtcrXPq32X-DQdcpr889nx-83PxuQDH55iJX-KIXRy3_BZHyh95zb_SMuIYU8_rHrttjpmnltf3VNxSN0HLKbjJ2PGLtqUw5lfsRYtdpvPDesZ-XF7czb8U1zdXi3l9XQSlwBZagTDKYSNsAG1Jq1Da0gQIyrZV45qAiDQ91jWoQUALOpiqrIwjLVQT1Bn7tO9db5oVLQP144CdXw9xhcPWJ4z-_0kff_n79MfbyoFW5VTw_lAwpIcN5dGvYg7UddhT2mQvy-nrtDVuh4o9GoaU80Dt8RgQfifIT4L8TpA_CJoib59e7xj4p2MC3uyBSETHsdNCg3bqETypkpo</recordid><startdate>20210201</startdate><enddate>20210201</enddate><creator>von Kugelgen, Julius</creator><creator>Gresele, Luigi</creator><creator>Scholkopf, Bernhard</creator><general>IEEE</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-8177-0925</orcidid><orcidid>https://orcid.org/0000-0001-6469-4118</orcidid><orcidid>https://orcid.org/0000-0001-8837-6720</orcidid></search><sort><creationdate>20210201</creationdate><title>Simpson's Paradox in COVID-19 Case Fatality Rates: A Mediation Analysis of Age-Related Causal Effects</title><author>von Kugelgen, Julius ; Gresele, Luigi ; Scholkopf, Bernhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3318-4310539ab08c148e43c6865c1c38f7b9bcaaae7309ba4101f14c576759e403bc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Causal inference</topic><topic>Cognition</topic><topic>COVID-19</topic><topic>Data models</topic><topic>Intelligent systems</topic><topic>Mediation</topic><topic>mediation analysis</topic><topic>Pandemics</topic><topic>Senior citizens</topic><topic>Simpson's paradox</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>von Kugelgen, Julius</creatorcontrib><creatorcontrib>Gresele, Luigi</creatorcontrib><creatorcontrib>Scholkopf, Bernhard</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>IEEE transactions on artificial intelligence</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>von Kugelgen, Julius</au><au>Gresele, Luigi</au><au>Scholkopf, Bernhard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Simpson's Paradox in COVID-19 Case Fatality Rates: A Mediation Analysis of Age-Related Causal Effects</atitle><jtitle>IEEE transactions on artificial intelligence</jtitle><stitle>TAI</stitle><addtitle>IEEE Trans Artif Intell</addtitle><date>2021-02-01</date><risdate>2021</risdate><volume>2</volume><issue>1</issue><spage>18</spage><epage>27</epage><pages>18-27</pages><issn>2691-4581</issn><eissn>2691-4581</eissn><coden>ITAICB</coden><abstract>We point out an instantiation of Simpson's paradox in COVID-19 case fatality rates ( cfr s): comparing a large-scale study from China (February 17) with early reports from Italy (March 9), we find that cfr s are lower in Italy for every age group, but higher overall. This phenomenon is explained by a stark difference in case demographic between the two countries. Using this as a motivating example, we introduce basic concepts from mediation analysis and show how these can be used to quantify different direct and indirect effects when assuming a coarse-grained causal graph involving country, age, and case fatality. We curate an age-stratified cfr dataset with &lt;inline-formula&gt;&lt;tex-math notation="LaTeX"&gt;&gt;&lt;/tex-math&gt;&lt;/inline-formula&gt;750 k cases and conduct a case study, investigating total, direct, and indirect (age-mediated) causal effects between different countries and at different points in time. This allows us to separate age-related effects from others unrelated to age and facilitates a more transparent comparison of cfr s across countries at different stages of the COVID-19 pandemic. Using longitudinal data from Italy, we discover a sign reversal of the direct causal effect in mid-March, which temporally aligns with the reported collapse of the healthcare system in parts of the country. Moreover, we find that direct and indirect effects across 132 pairs of countries are only weakly correlated, suggesting that a country's policy and case demographic may be largely unrelated. We point out limitations and extensions for future work, and finally, discuss the role of causal reasoning in the broader context of using AI to combat the COVID-19 pandemic. Impact Statement -During a global pandemic, understanding the causal effects of risk factors such as age on COVID-19 fatality is an important scientific question. Since randomised controlled trials are typically infeasible or unethical in this context, causal investigations based on observational data-such as the one carried out in this article-will, therefore, be crucial in guiding our understanding of the available data. Causal inference, in particular mediation analysis, can be used to resolve apparent statistical paradoxes; help educate the public and decision-makers alike; avoid unsound comparisons; and answer a range of causal questions pertaining to the pandemic, subject to transparently stated assumptions. Our exposition helps clarify how mediation analysis can be used to investigate direct and indirect effects along different causal paths and thus serves as a stepping stone for future studies of other important risk factors for COVID-19 besides age.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>35233556</pmid><doi>10.1109/TAI.2021.3073088</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-8177-0925</orcidid><orcidid>https://orcid.org/0000-0001-6469-4118</orcidid><orcidid>https://orcid.org/0000-0001-8837-6720</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2691-4581
ispartof IEEE transactions on artificial intelligence, 2021-02, Vol.2 (1), p.18-27
issn 2691-4581
2691-4581
language eng
recordid cdi_proquest_miscellaneous_2635248596
source IEEE Electronic Library (IEL)
subjects Causal inference
Cognition
COVID-19
Data models
Intelligent systems
Mediation
mediation analysis
Pandemics
Senior citizens
Simpson's paradox
title Simpson's Paradox in COVID-19 Case Fatality Rates: A Mediation Analysis of Age-Related Causal Effects
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T03%3A48%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Simpson's%20Paradox%20in%20COVID-19%20Case%20Fatality%20Rates:%20A%20Mediation%20Analysis%20of%20Age-Related%20Causal%20Effects&rft.jtitle=IEEE%20transactions%20on%20artificial%20intelligence&rft.au=von%20Kugelgen,%20Julius&rft.date=2021-02-01&rft.volume=2&rft.issue=1&rft.spage=18&rft.epage=27&rft.pages=18-27&rft.issn=2691-4581&rft.eissn=2691-4581&rft.coden=ITAICB&rft_id=info:doi/10.1109/TAI.2021.3073088&rft_dat=%3Cproquest_pubme%3E2635248596%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2635248596&rft_id=info:pmid/35233556&rft_ieee_id=9404149&rfr_iscdi=true