A Multi-resolution Theory for Approximating Infinite-p-Zero-n: Transitional Inference, Individualized Predictions, and a World Without Bias-Variance Tradeoff

Transitional inference is an empiricism concept, rooted and practiced in clinical medicine since ancient Greece. Knowledge and experiences gained from treating one entity (e.g., a disease or a group of patients) are applied to treat a related but distinctively different one (e.g., a similar disease...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of the American Statistical Association 2021-03, Vol.116 (533), p.353-367
Hauptverfasser: Li, Xinran, Meng, Xiao-Li
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 367
container_issue 533
container_start_page 353
container_title Journal of the American Statistical Association
container_volume 116
creator Li, Xinran
Meng, Xiao-Li
description Transitional inference is an empiricism concept, rooted and practiced in clinical medicine since ancient Greece. Knowledge and experiences gained from treating one entity (e.g., a disease or a group of patients) are applied to treat a related but distinctively different one (e.g., a similar disease or a new patient). This notion of "transition to the similar" renders individualized treatments an operational meaning, yet its theoretical foundation defies the familiar inductive inference framework. The uniqueness of entities is the result of potentially an infinite number of attributes (hence ), which entails zero direct training sample size (i.e., n = 0) because genuine guinea pigs do not exist. However, the literature on wavelets and on sieve methods for nonparametric estimation suggests a principled approximation theory for transitional inference via a multi-resolution (MR) perspective, where we use the resolution level to index the degree of approximation to ultimate individuality. MR inference seeks a primary resolution indexing an indirect training sample, which provides enough matched attributes to increase the relevance of the results to the target individuals and yet still accumulate sufficient indirect sample sizes for robust estimation. Theoretically, MR inference relies on an infinite-term ANOVA-type decomposition, providing an alternative way to model sparsity via the decay rate of the resolution bias as a function of the primary resolution level. Unexpectedly, this decomposition reveals a world without variance when the outcome is a deterministic function of potentially infinitely many predictors. In this deterministic world, the optimal resolution prefers over-fitting in the traditional sense when the resolution bias decays sufficiently rapidly. Furthermore, there can be many "descents" in the prediction error curve, when the contributions of predictors are inhomogeneous and the ordering of their importance does not align with the order of their inclusion in prediction. These findings may hint at a deterministic approximation theory for understanding the apparently over-fitting resistant phenomenon of some over-saturated models in machine learning.
doi_str_mv 10.1080/01621459.2020.1844210
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2499236113</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2499236113</sourcerecordid><originalsourceid>FETCH-LOGICAL-c446t-27fade615a831ec8650dfe20f20eca7319da69ae3a2651182071aeead268b09e3</originalsourceid><addsrcrecordid>eNp9kc1u1DAUhSNEJYaWR0CyxLZu_RMnDiuGCkqlonYxUMTGusTX1FVqD3YCDO_Cu2Jryrbe2Lr-zpHuOU3zkrMTzjQ7ZbwTvFXDiWCijHTbCs6eNCuuZE9F33552qwqQyv0rHme8x0rp9d61fxdk4_LNHuaMMdpmX0MZHOLMe2Ii4mst9sUf_t7mH34Ti6C88HPSLf0K6ZIw2uySRCyrzKY6j8mDCMel6f1P71dYPJ_0JLrhNaPFcvHBIIlQG5imiy58fNtXGby1kOmnyF5KPLqajE6d9QcOJgyvni4D5tP799tzj7Qy6vzi7P1JR3btpvLjq7wHVegJcdRd4pZh4I5wXCEXvLBQjcAShCd4lwL1nNABCs6_Y0NKA-bV3vfsu2PBfNs7uKSykrZCCX1oDhT7FGqHQYhO85lodSeGlPMOaEz21QCTDvDmal9mf99mdqXeeir6N7sdT6U5O_hV83HzLCbYnIl5dFnIx-3-Ae6lZ3z</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2499236113</pqid></control><display><type>article</type><title>A Multi-resolution Theory for Approximating Infinite-p-Zero-n: Transitional Inference, Individualized Predictions, and a World Without Bias-Variance Tradeoff</title><source>Taylor &amp; Francis</source><creator>Li, Xinran ; Meng, Xiao-Li</creator><creatorcontrib>Li, Xinran ; Meng, Xiao-Li</creatorcontrib><description>Transitional inference is an empiricism concept, rooted and practiced in clinical medicine since ancient Greece. Knowledge and experiences gained from treating one entity (e.g., a disease or a group of patients) are applied to treat a related but distinctively different one (e.g., a similar disease or a new patient). This notion of "transition to the similar" renders individualized treatments an operational meaning, yet its theoretical foundation defies the familiar inductive inference framework. The uniqueness of entities is the result of potentially an infinite number of attributes (hence ), which entails zero direct training sample size (i.e., n = 0) because genuine guinea pigs do not exist. However, the literature on wavelets and on sieve methods for nonparametric estimation suggests a principled approximation theory for transitional inference via a multi-resolution (MR) perspective, where we use the resolution level to index the degree of approximation to ultimate individuality. MR inference seeks a primary resolution indexing an indirect training sample, which provides enough matched attributes to increase the relevance of the results to the target individuals and yet still accumulate sufficient indirect sample sizes for robust estimation. Theoretically, MR inference relies on an infinite-term ANOVA-type decomposition, providing an alternative way to model sparsity via the decay rate of the resolution bias as a function of the primary resolution level. Unexpectedly, this decomposition reveals a world without variance when the outcome is a deterministic function of potentially infinitely many predictors. In this deterministic world, the optimal resolution prefers over-fitting in the traditional sense when the resolution bias decays sufficiently rapidly. Furthermore, there can be many "descents" in the prediction error curve, when the contributions of predictors are inhomogeneous and the ordering of their importance does not align with the order of their inclusion in prediction. These findings may hint at a deterministic approximation theory for understanding the apparently over-fitting resistant phenomenon of some over-saturated models in machine learning.</description><identifier>ISSN: 0162-1459</identifier><identifier>EISSN: 1537-274X</identifier><identifier>DOI: 10.1080/01621459.2020.1844210</identifier><language>eng</language><publisher>Alexandria: Taylor &amp; Francis</publisher><subject>Approximation ; Attributes ; Bias ; Clinical medicine ; Decay rate ; Decomposition ; Double descent ; Empiricism ; Greek civilization ; Guinea pigs ; Indexing ; Inference ; Machine learning ; Mathematical analysis ; Multiple descents ; Personalized medicine ; Regression analysis ; Sieve methods ; Sparsity ; Statistical methods ; Statistics ; Training ; Transition to the similar ; Uniqueness ; Wavelets</subject><ispartof>Journal of the American Statistical Association, 2021-03, Vol.116 (533), p.353-367</ispartof><rights>2020 American Statistical Association 2020</rights><rights>2020 American Statistical Association</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c446t-27fade615a831ec8650dfe20f20eca7319da69ae3a2651182071aeead268b09e3</citedby><cites>FETCH-LOGICAL-c446t-27fade615a831ec8650dfe20f20eca7319da69ae3a2651182071aeead268b09e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.tandfonline.com/doi/pdf/10.1080/01621459.2020.1844210$$EPDF$$P50$$Ginformaworld$$H</linktopdf><linktohtml>$$Uhttps://www.tandfonline.com/doi/full/10.1080/01621459.2020.1844210$$EHTML$$P50$$Ginformaworld$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,59620,60409</link.rule.ids></links><search><creatorcontrib>Li, Xinran</creatorcontrib><creatorcontrib>Meng, Xiao-Li</creatorcontrib><title>A Multi-resolution Theory for Approximating Infinite-p-Zero-n: Transitional Inference, Individualized Predictions, and a World Without Bias-Variance Tradeoff</title><title>Journal of the American Statistical Association</title><description>Transitional inference is an empiricism concept, rooted and practiced in clinical medicine since ancient Greece. Knowledge and experiences gained from treating one entity (e.g., a disease or a group of patients) are applied to treat a related but distinctively different one (e.g., a similar disease or a new patient). This notion of "transition to the similar" renders individualized treatments an operational meaning, yet its theoretical foundation defies the familiar inductive inference framework. The uniqueness of entities is the result of potentially an infinite number of attributes (hence ), which entails zero direct training sample size (i.e., n = 0) because genuine guinea pigs do not exist. However, the literature on wavelets and on sieve methods for nonparametric estimation suggests a principled approximation theory for transitional inference via a multi-resolution (MR) perspective, where we use the resolution level to index the degree of approximation to ultimate individuality. MR inference seeks a primary resolution indexing an indirect training sample, which provides enough matched attributes to increase the relevance of the results to the target individuals and yet still accumulate sufficient indirect sample sizes for robust estimation. Theoretically, MR inference relies on an infinite-term ANOVA-type decomposition, providing an alternative way to model sparsity via the decay rate of the resolution bias as a function of the primary resolution level. Unexpectedly, this decomposition reveals a world without variance when the outcome is a deterministic function of potentially infinitely many predictors. In this deterministic world, the optimal resolution prefers over-fitting in the traditional sense when the resolution bias decays sufficiently rapidly. Furthermore, there can be many "descents" in the prediction error curve, when the contributions of predictors are inhomogeneous and the ordering of their importance does not align with the order of their inclusion in prediction. These findings may hint at a deterministic approximation theory for understanding the apparently over-fitting resistant phenomenon of some over-saturated models in machine learning.</description><subject>Approximation</subject><subject>Attributes</subject><subject>Bias</subject><subject>Clinical medicine</subject><subject>Decay rate</subject><subject>Decomposition</subject><subject>Double descent</subject><subject>Empiricism</subject><subject>Greek civilization</subject><subject>Guinea pigs</subject><subject>Indexing</subject><subject>Inference</subject><subject>Machine learning</subject><subject>Mathematical analysis</subject><subject>Multiple descents</subject><subject>Personalized medicine</subject><subject>Regression analysis</subject><subject>Sieve methods</subject><subject>Sparsity</subject><subject>Statistical methods</subject><subject>Statistics</subject><subject>Training</subject><subject>Transition to the similar</subject><subject>Uniqueness</subject><subject>Wavelets</subject><issn>0162-1459</issn><issn>1537-274X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kc1u1DAUhSNEJYaWR0CyxLZu_RMnDiuGCkqlonYxUMTGusTX1FVqD3YCDO_Cu2Jryrbe2Lr-zpHuOU3zkrMTzjQ7ZbwTvFXDiWCijHTbCs6eNCuuZE9F33552qwqQyv0rHme8x0rp9d61fxdk4_LNHuaMMdpmX0MZHOLMe2Ii4mst9sUf_t7mH34Ti6C88HPSLf0K6ZIw2uySRCyrzKY6j8mDCMel6f1P71dYPJ_0JLrhNaPFcvHBIIlQG5imiy58fNtXGby1kOmnyF5KPLqajE6d9QcOJgyvni4D5tP799tzj7Qy6vzi7P1JR3btpvLjq7wHVegJcdRd4pZh4I5wXCEXvLBQjcAShCd4lwL1nNABCs6_Y0NKA-bV3vfsu2PBfNs7uKSykrZCCX1oDhT7FGqHQYhO85lodSeGlPMOaEz21QCTDvDmal9mf99mdqXeeir6N7sdT6U5O_hV83HzLCbYnIl5dFnIx-3-Ae6lZ3z</recordid><startdate>20210301</startdate><enddate>20210301</enddate><creator>Li, Xinran</creator><creator>Meng, Xiao-Li</creator><general>Taylor &amp; Francis</general><general>Taylor &amp; Francis Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8BJ</scope><scope>FQK</scope><scope>JBE</scope><scope>K9.</scope></search><sort><creationdate>20210301</creationdate><title>A Multi-resolution Theory for Approximating Infinite-p-Zero-n: Transitional Inference, Individualized Predictions, and a World Without Bias-Variance Tradeoff</title><author>Li, Xinran ; Meng, Xiao-Li</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c446t-27fade615a831ec8650dfe20f20eca7319da69ae3a2651182071aeead268b09e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Approximation</topic><topic>Attributes</topic><topic>Bias</topic><topic>Clinical medicine</topic><topic>Decay rate</topic><topic>Decomposition</topic><topic>Double descent</topic><topic>Empiricism</topic><topic>Greek civilization</topic><topic>Guinea pigs</topic><topic>Indexing</topic><topic>Inference</topic><topic>Machine learning</topic><topic>Mathematical analysis</topic><topic>Multiple descents</topic><topic>Personalized medicine</topic><topic>Regression analysis</topic><topic>Sieve methods</topic><topic>Sparsity</topic><topic>Statistical methods</topic><topic>Statistics</topic><topic>Training</topic><topic>Transition to the similar</topic><topic>Uniqueness</topic><topic>Wavelets</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Xinran</creatorcontrib><creatorcontrib>Meng, Xiao-Li</creatorcontrib><collection>CrossRef</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>International Bibliography of the Social Sciences</collection><collection>International Bibliography of the Social Sciences</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><jtitle>Journal of the American Statistical Association</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Xinran</au><au>Meng, Xiao-Li</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Multi-resolution Theory for Approximating Infinite-p-Zero-n: Transitional Inference, Individualized Predictions, and a World Without Bias-Variance Tradeoff</atitle><jtitle>Journal of the American Statistical Association</jtitle><date>2021-03-01</date><risdate>2021</risdate><volume>116</volume><issue>533</issue><spage>353</spage><epage>367</epage><pages>353-367</pages><issn>0162-1459</issn><eissn>1537-274X</eissn><abstract>Transitional inference is an empiricism concept, rooted and practiced in clinical medicine since ancient Greece. Knowledge and experiences gained from treating one entity (e.g., a disease or a group of patients) are applied to treat a related but distinctively different one (e.g., a similar disease or a new patient). This notion of "transition to the similar" renders individualized treatments an operational meaning, yet its theoretical foundation defies the familiar inductive inference framework. The uniqueness of entities is the result of potentially an infinite number of attributes (hence ), which entails zero direct training sample size (i.e., n = 0) because genuine guinea pigs do not exist. However, the literature on wavelets and on sieve methods for nonparametric estimation suggests a principled approximation theory for transitional inference via a multi-resolution (MR) perspective, where we use the resolution level to index the degree of approximation to ultimate individuality. MR inference seeks a primary resolution indexing an indirect training sample, which provides enough matched attributes to increase the relevance of the results to the target individuals and yet still accumulate sufficient indirect sample sizes for robust estimation. Theoretically, MR inference relies on an infinite-term ANOVA-type decomposition, providing an alternative way to model sparsity via the decay rate of the resolution bias as a function of the primary resolution level. Unexpectedly, this decomposition reveals a world without variance when the outcome is a deterministic function of potentially infinitely many predictors. In this deterministic world, the optimal resolution prefers over-fitting in the traditional sense when the resolution bias decays sufficiently rapidly. Furthermore, there can be many "descents" in the prediction error curve, when the contributions of predictors are inhomogeneous and the ordering of their importance does not align with the order of their inclusion in prediction. These findings may hint at a deterministic approximation theory for understanding the apparently over-fitting resistant phenomenon of some over-saturated models in machine learning.</abstract><cop>Alexandria</cop><pub>Taylor &amp; Francis</pub><doi>10.1080/01621459.2020.1844210</doi><tpages>15</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0162-1459
ispartof Journal of the American Statistical Association, 2021-03, Vol.116 (533), p.353-367
issn 0162-1459
1537-274X
language eng
recordid cdi_proquest_journals_2499236113
source Taylor & Francis
subjects Approximation
Attributes
Bias
Clinical medicine
Decay rate
Decomposition
Double descent
Empiricism
Greek civilization
Guinea pigs
Indexing
Inference
Machine learning
Mathematical analysis
Multiple descents
Personalized medicine
Regression analysis
Sieve methods
Sparsity
Statistical methods
Statistics
Training
Transition to the similar
Uniqueness
Wavelets
title A Multi-resolution Theory for Approximating Infinite-p-Zero-n: Transitional Inference, Individualized Predictions, and a World Without Bias-Variance Tradeoff
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T22%3A16%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Multi-resolution%20Theory%20for%20Approximating%20Infinite-p-Zero-n:%20Transitional%20Inference,%20Individualized%20Predictions,%20and%20a%20World%20Without%20Bias-Variance%20Tradeoff&rft.jtitle=Journal%20of%20the%20American%20Statistical%20Association&rft.au=Li,%20Xinran&rft.date=2021-03-01&rft.volume=116&rft.issue=533&rft.spage=353&rft.epage=367&rft.pages=353-367&rft.issn=0162-1459&rft.eissn=1537-274X&rft_id=info:doi/10.1080/01621459.2020.1844210&rft_dat=%3Cproquest_cross%3E2499236113%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2499236113&rft_id=info:pmid/&rfr_iscdi=true