Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions
Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a genera...
Gespeichert in:
Veröffentlicht in: | Sociological methods & research 2021-02, Vol.50 (1), p.365-400 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 400 |
---|---|
container_issue | 1 |
container_start_page | 365 |
container_title | Sociological methods & research |
container_volume | 50 |
creator | Cai, Tianji Xia, Yiwei Zhou, Yisu |
description | Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors. |
doi_str_mv | 10.1177/0049124118782535 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2476813281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ericid>EJ1281926</ericid><sage_id>10.1177_0049124118782535</sage_id><sourcerecordid>2476813281</sourcerecordid><originalsourceid>FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</originalsourceid><addsrcrecordid>eNp1kM1LAzEQxYMoWKt3L0LA82o-9iPxVmqtlRYPKh6XZDepqdtNTbJI_etNWbEgeJiZB795b2AAOMfoCuOiuEYo5ZikGLOCkYxmB2CAs4wkjPD0EAx2ONnxY3Di_QohTApEB6CcqlY50ZgvVcNZqxsRorg1vnIqKLiwtWr8DRzBp-AiWm5hsPDVunf4acIbXHRNMGtbi2bviSI4I7tgbOtPwZEWjVdnP3MIXu4mz-P7ZP44nY1H86SiBQ1JhlleC45krTnGOGWECUkzzaSueB47lYUiFSFUpzIXksuaqwrXihOilRR0CC773I2zH53yoVzZzrXxZEnSImeYklhDgPqtylnvndLlxpm1cNsSo3L3xvLvG6PlorcoZ6rf9ckDjnmc5JEnPfdiqfZH_837Bs5RfDc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2476813281</pqid></control><display><type>article</type><title>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</title><source>SAGE Complete A-Z List</source><source>Sociological Abstracts</source><creator>Cai, Tianji ; Xia, Yiwei ; Zhou, Yisu</creator><creatorcontrib>Cai, Tianji ; Xia, Yiwei ; Zhou, Yisu</creatorcontrib><description>Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.</description><identifier>ISSN: 0049-1241</identifier><identifier>EISSN: 1552-8294</identifier><identifier>DOI: 10.1177/0049124118782535</identifier><language>eng</language><publisher>Los Angeles, CA: SAGE Publications</publisher><subject>Adolescents ; Attitude surveys ; Behavior problems ; Bias ; Classification ; Computation ; Goodness of Fit ; Health surveys ; Inflation ; Longitudinal Studies ; Maximum Likelihood Statistics ; Monte Carlo Methods ; Multimodality ; Probability ; Statistical Analysis ; Statistical Bias ; Statistical Distributions ; Statistical Inference</subject><ispartof>Sociological methods & research, 2021-02, Vol.50 (1), p.365-400</ispartof><rights>The Author(s) 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</citedby><cites>FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</cites><orcidid>0000-0001-7360-732X ; 0000-0002-8962-2660</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/0049124118782535$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/0049124118782535$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>314,776,780,21798,27901,27902,33751,43597,43598</link.rule.ids><backlink>$$Uhttp://eric.ed.gov/ERICWebPortal/detail?accno=EJ1281926$$DView record in ERIC$$Hfree_for_read</backlink></links><search><creatorcontrib>Cai, Tianji</creatorcontrib><creatorcontrib>Xia, Yiwei</creatorcontrib><creatorcontrib>Zhou, Yisu</creatorcontrib><title>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</title><title>Sociological methods & research</title><description>Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.</description><subject>Adolescents</subject><subject>Attitude surveys</subject><subject>Behavior problems</subject><subject>Bias</subject><subject>Classification</subject><subject>Computation</subject><subject>Goodness of Fit</subject><subject>Health surveys</subject><subject>Inflation</subject><subject>Longitudinal Studies</subject><subject>Maximum Likelihood Statistics</subject><subject>Monte Carlo Methods</subject><subject>Multimodality</subject><subject>Probability</subject><subject>Statistical Analysis</subject><subject>Statistical Bias</subject><subject>Statistical Distributions</subject><subject>Statistical Inference</subject><issn>0049-1241</issn><issn>1552-8294</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>BHHNA</sourceid><recordid>eNp1kM1LAzEQxYMoWKt3L0LA82o-9iPxVmqtlRYPKh6XZDepqdtNTbJI_etNWbEgeJiZB795b2AAOMfoCuOiuEYo5ZikGLOCkYxmB2CAs4wkjPD0EAx2ONnxY3Di_QohTApEB6CcqlY50ZgvVcNZqxsRorg1vnIqKLiwtWr8DRzBp-AiWm5hsPDVunf4acIbXHRNMGtbi2bviSI4I7tgbOtPwZEWjVdnP3MIXu4mz-P7ZP44nY1H86SiBQ1JhlleC45krTnGOGWECUkzzaSueB47lYUiFSFUpzIXksuaqwrXihOilRR0CC773I2zH53yoVzZzrXxZEnSImeYklhDgPqtylnvndLlxpm1cNsSo3L3xvLvG6PlorcoZ6rf9ckDjnmc5JEnPfdiqfZH_837Bs5RfDc</recordid><startdate>202102</startdate><enddate>202102</enddate><creator>Cai, Tianji</creator><creator>Xia, Yiwei</creator><creator>Zhou, Yisu</creator><general>SAGE Publications</general><general>SAGE PUBLICATIONS, INC</general><scope>7SW</scope><scope>BJH</scope><scope>BNH</scope><scope>BNI</scope><scope>BNJ</scope><scope>BNO</scope><scope>ERI</scope><scope>PET</scope><scope>REK</scope><scope>WWN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7U4</scope><scope>8BJ</scope><scope>BHHNA</scope><scope>DWI</scope><scope>FQK</scope><scope>JBE</scope><scope>WZK</scope><orcidid>https://orcid.org/0000-0001-7360-732X</orcidid><orcidid>https://orcid.org/0000-0002-8962-2660</orcidid></search><sort><creationdate>202102</creationdate><title>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</title><author>Cai, Tianji ; Xia, Yiwei ; Zhou, Yisu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Adolescents</topic><topic>Attitude surveys</topic><topic>Behavior problems</topic><topic>Bias</topic><topic>Classification</topic><topic>Computation</topic><topic>Goodness of Fit</topic><topic>Health surveys</topic><topic>Inflation</topic><topic>Longitudinal Studies</topic><topic>Maximum Likelihood Statistics</topic><topic>Monte Carlo Methods</topic><topic>Multimodality</topic><topic>Probability</topic><topic>Statistical Analysis</topic><topic>Statistical Bias</topic><topic>Statistical Distributions</topic><topic>Statistical Inference</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cai, Tianji</creatorcontrib><creatorcontrib>Xia, Yiwei</creatorcontrib><creatorcontrib>Zhou, Yisu</creatorcontrib><collection>ERIC</collection><collection>ERIC (Ovid)</collection><collection>ERIC</collection><collection>ERIC</collection><collection>ERIC (Legacy Platform)</collection><collection>ERIC( SilverPlatter )</collection><collection>ERIC</collection><collection>ERIC PlusText (Legacy Platform)</collection><collection>Education Resources Information Center (ERIC)</collection><collection>ERIC</collection><collection>CrossRef</collection><collection>Sociological Abstracts (pre-2017)</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>Sociological Abstracts</collection><collection>Sociological Abstracts</collection><collection>International Bibliography of the Social Sciences</collection><collection>International Bibliography of the Social Sciences</collection><collection>Sociological Abstracts (Ovid)</collection><jtitle>Sociological methods & research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cai, Tianji</au><au>Xia, Yiwei</au><au>Zhou, Yisu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><ericid>EJ1281926</ericid><atitle>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</atitle><jtitle>Sociological methods & research</jtitle><date>2021-02</date><risdate>2021</risdate><volume>50</volume><issue>1</issue><spage>365</spage><epage>400</epage><pages>365-400</pages><issn>0049-1241</issn><eissn>1552-8294</eissn><abstract>Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.</abstract><cop>Los Angeles, CA</cop><pub>SAGE Publications</pub><doi>10.1177/0049124118782535</doi><tpages>36</tpages><orcidid>https://orcid.org/0000-0001-7360-732X</orcidid><orcidid>https://orcid.org/0000-0002-8962-2660</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0049-1241 |
ispartof | Sociological methods & research, 2021-02, Vol.50 (1), p.365-400 |
issn | 0049-1241 1552-8294 |
language | eng |
recordid | cdi_proquest_journals_2476813281 |
source | SAGE Complete A-Z List; Sociological Abstracts |
subjects | Adolescents Attitude surveys Behavior problems Bias Classification Computation Goodness of Fit Health surveys Inflation Longitudinal Studies Maximum Likelihood Statistics Monte Carlo Methods Multimodality Probability Statistical Analysis Statistical Bias Statistical Distributions Statistical Inference |
title | Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T16%3A50%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Generalized%20Inflated%20Discrete%20Models:%20A%20Strategy%20to%20Work%20with%20Multimodal%20Discrete%20Distributions&rft.jtitle=Sociological%20methods%20&%20research&rft.au=Cai,%20Tianji&rft.date=2021-02&rft.volume=50&rft.issue=1&rft.spage=365&rft.epage=400&rft.pages=365-400&rft.issn=0049-1241&rft.eissn=1552-8294&rft_id=info:doi/10.1177/0049124118782535&rft_dat=%3Cproquest_cross%3E2476813281%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2476813281&rft_id=info:pmid/&rft_ericid=EJ1281926&rft_sage_id=10.1177_0049124118782535&rfr_iscdi=true |