Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions

Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a genera...

Bibliographic details
Published in: Sociological methods & research 2021-02, Vol.50 (1), p.365-400
Main authors: Cai, Tianji, Xia, Yiwei, Zhou, Yisu
Format: Article
Language: English
Subjects:
Online access: Full text
container_end_page 400
container_issue 1
container_start_page 365
container_title Sociological methods & research
container_volume 50
creator Cai, Tianji
Xia, Yiwei
Zhou, Yisu
description Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.
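The description above refers to count outcomes with more than one inflated value. A minimal sketch of one common way to write such a model is given here: a mixture of point masses at the inflated values and an ordinary Poisson component. This is an illustration under stated assumptions, not necessarily the authors' exact specification. The function names, the inflated values 0 and 5, and the constant mixing probabilities are all hypothetical; the article itself considers covariates and several outcome types.

```python
import math


def poisson_pmf(k, lam):
    """Poisson probability mass at a non-negative integer k (lam > 0)."""
    return math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))


def gidm_poisson_pmf(k, lam, inflated):
    """Mixture pmf: point masses at the inflated values plus a Poisson part.

    `inflated` maps each inflated value to its mixing probability.
    These are constants here (an assumption made for brevity).
    """
    total = sum(inflated.values())
    if not 0.0 <= total < 1.0:
        raise ValueError("inflation probabilities must sum to less than 1")
    # Extra mass if k is an inflated value, plus the down-weighted Poisson mass.
    return inflated.get(k, 0.0) + (1.0 - total) * poisson_pmf(k, lam)


def log_likelihood(ys, lam, inflated):
    """Log-likelihood of observed counts ys under the mixture above."""
    return sum(math.log(gidm_poisson_pmf(y, lam, inflated)) for y in ys)


# Hypothetical example: extra mass at 0 and at 5 on top of Poisson(3).
example_inflated = {0: 0.2, 5: 0.1}
example_ll = log_likelihood([0, 5, 2, 3, 0], 3.0, example_inflated)
```

Maximizing `log_likelihood` over `lam` and the mixing probabilities would give maximum likelihood estimates for this simplified case; fitting a plain Poisson to the same data ignores the extra mass at the inflated values, which is the kind of misspecification the description associates with biased estimates.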
doi_str_mv 10.1177/0049124118782535
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2476813281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ericid>EJ1281926</ericid><sage_id>10.1177_0049124118782535</sage_id><sourcerecordid>2476813281</sourcerecordid><originalsourceid>FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</originalsourceid><addsrcrecordid>eNp1kM1LAzEQxYMoWKt3L0LA82o-9iPxVmqtlRYPKh6XZDepqdtNTbJI_etNWbEgeJiZB795b2AAOMfoCuOiuEYo5ZikGLOCkYxmB2CAs4wkjPD0EAx2ONnxY3Di_QohTApEB6CcqlY50ZgvVcNZqxsRorg1vnIqKLiwtWr8DRzBp-AiWm5hsPDVunf4acIbXHRNMGtbi2bviSI4I7tgbOtPwZEWjVdnP3MIXu4mz-P7ZP44nY1H86SiBQ1JhlleC45krTnGOGWECUkzzaSueB47lYUiFSFUpzIXksuaqwrXihOilRR0CC773I2zH53yoVzZzrXxZEnSImeYklhDgPqtylnvndLlxpm1cNsSo3L3xvLvG6PlorcoZ6rf9ckDjnmc5JEnPfdiqfZH_837Bs5RfDc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2476813281</pqid></control><display><type>article</type><title>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</title><source>SAGE Complete A-Z List</source><source>Sociological Abstracts</source><creator>Cai, Tianji ; Xia, Yiwei ; Zhou, Yisu</creator><creatorcontrib>Cai, Tianji ; Xia, Yiwei ; Zhou, Yisu</creatorcontrib><description>Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. 
We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.</description><identifier>ISSN: 0049-1241</identifier><identifier>EISSN: 1552-8294</identifier><identifier>DOI: 10.1177/0049124118782535</identifier><language>eng</language><publisher>Los Angeles, CA: SAGE Publications</publisher><subject>Adolescents ; Attitude surveys ; Behavior problems ; Bias ; Classification ; Computation ; Goodness of Fit ; Health surveys ; Inflation ; Longitudinal Studies ; Maximum Likelihood Statistics ; Monte Carlo Methods ; Multimodality ; Probability ; Statistical Analysis ; Statistical Bias ; Statistical Distributions ; Statistical Inference</subject><ispartof>Sociological methods &amp; research, 2021-02, Vol.50 (1), p.365-400</ispartof><rights>The Author(s) 
2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</citedby><cites>FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</cites><orcidid>0000-0001-7360-732X ; 0000-0002-8962-2660</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/0049124118782535$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/0049124118782535$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>314,776,780,21798,27901,27902,33751,43597,43598</link.rule.ids><backlink>$$Uhttp://eric.ed.gov/ERICWebPortal/detail?accno=EJ1281926$$DView record in ERIC$$Hfree_for_read</backlink></links><search><creatorcontrib>Cai, Tianji</creatorcontrib><creatorcontrib>Xia, Yiwei</creatorcontrib><creatorcontrib>Zhou, Yisu</creatorcontrib><title>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</title><title>Sociological methods &amp; research</title><description>Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. 
We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.</description><subject>Adolescents</subject><subject>Attitude surveys</subject><subject>Behavior problems</subject><subject>Bias</subject><subject>Classification</subject><subject>Computation</subject><subject>Goodness of Fit</subject><subject>Health surveys</subject><subject>Inflation</subject><subject>Longitudinal Studies</subject><subject>Maximum Likelihood Statistics</subject><subject>Monte Carlo Methods</subject><subject>Multimodality</subject><subject>Probability</subject><subject>Statistical Analysis</subject><subject>Statistical Bias</subject><subject>Statistical Distributions</subject><subject>Statistical 
Inference</subject><issn>0049-1241</issn><issn>1552-8294</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>BHHNA</sourceid><recordid>eNp1kM1LAzEQxYMoWKt3L0LA82o-9iPxVmqtlRYPKh6XZDepqdtNTbJI_etNWbEgeJiZB795b2AAOMfoCuOiuEYo5ZikGLOCkYxmB2CAs4wkjPD0EAx2ONnxY3Di_QohTApEB6CcqlY50ZgvVcNZqxsRorg1vnIqKLiwtWr8DRzBp-AiWm5hsPDVunf4acIbXHRNMGtbi2bviSI4I7tgbOtPwZEWjVdnP3MIXu4mz-P7ZP44nY1H86SiBQ1JhlleC45krTnGOGWECUkzzaSueB47lYUiFSFUpzIXksuaqwrXihOilRR0CC773I2zH53yoVzZzrXxZEnSImeYklhDgPqtylnvndLlxpm1cNsSo3L3xvLvG6PlorcoZ6rf9ckDjnmc5JEnPfdiqfZH_837Bs5RfDc</recordid><startdate>202102</startdate><enddate>202102</enddate><creator>Cai, Tianji</creator><creator>Xia, Yiwei</creator><creator>Zhou, Yisu</creator><general>SAGE Publications</general><general>SAGE PUBLICATIONS, INC</general><scope>7SW</scope><scope>BJH</scope><scope>BNH</scope><scope>BNI</scope><scope>BNJ</scope><scope>BNO</scope><scope>ERI</scope><scope>PET</scope><scope>REK</scope><scope>WWN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7U4</scope><scope>8BJ</scope><scope>BHHNA</scope><scope>DWI</scope><scope>FQK</scope><scope>JBE</scope><scope>WZK</scope><orcidid>https://orcid.org/0000-0001-7360-732X</orcidid><orcidid>https://orcid.org/0000-0002-8962-2660</orcidid></search><sort><creationdate>202102</creationdate><title>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</title><author>Cai, Tianji ; Xia, Yiwei ; Zhou, Yisu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c373t-5186da90bdf91114828ab35f8bfc968bf3b7e2c223f4b6ab9bd9ec1de922feba3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Adolescents</topic><topic>Attitude surveys</topic><topic>Behavior problems</topic><topic>Bias</topic><topic>Classification</topic><topic>Computation</topic><topic>Goodness of 
Fit</topic><topic>Health surveys</topic><topic>Inflation</topic><topic>Longitudinal Studies</topic><topic>Maximum Likelihood Statistics</topic><topic>Monte Carlo Methods</topic><topic>Multimodality</topic><topic>Probability</topic><topic>Statistical Analysis</topic><topic>Statistical Bias</topic><topic>Statistical Distributions</topic><topic>Statistical Inference</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cai, Tianji</creatorcontrib><creatorcontrib>Xia, Yiwei</creatorcontrib><creatorcontrib>Zhou, Yisu</creatorcontrib><collection>ERIC</collection><collection>ERIC (Ovid)</collection><collection>ERIC</collection><collection>ERIC</collection><collection>ERIC (Legacy Platform)</collection><collection>ERIC( SilverPlatter )</collection><collection>ERIC</collection><collection>ERIC PlusText (Legacy Platform)</collection><collection>Education Resources Information Center (ERIC)</collection><collection>ERIC</collection><collection>CrossRef</collection><collection>Sociological Abstracts (pre-2017)</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>Sociological Abstracts</collection><collection>Sociological Abstracts</collection><collection>International Bibliography of the Social Sciences</collection><collection>International Bibliography of the Social Sciences</collection><collection>Sociological Abstracts (Ovid)</collection><jtitle>Sociological methods &amp; research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cai, Tianji</au><au>Xia, Yiwei</au><au>Zhou, Yisu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><ericid>EJ1281926</ericid><atitle>Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions</atitle><jtitle>Sociological methods &amp; 
research</jtitle><date>2021-02</date><risdate>2021</risdate><volume>50</volume><issue>1</issue><spage>365</spage><epage>400</epage><pages>365-400</pages><issn>0049-1241</issn><eissn>1552-8294</eissn><abstract>Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. 
The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.</abstract><cop>Los Angeles, CA</cop><pub>SAGE Publications</pub><doi>10.1177/0049124118782535</doi><tpages>36</tpages><orcidid>https://orcid.org/0000-0001-7360-732X</orcidid><orcidid>https://orcid.org/0000-0002-8962-2660</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0049-1241
ispartof Sociological methods & research, 2021-02, Vol.50 (1), p.365-400
issn 0049-1241
1552-8294
language eng
recordid cdi_proquest_journals_2476813281
source SAGE Complete A-Z List; Sociological Abstracts
subjects Adolescents
Attitude surveys
Behavior problems
Bias
Classification
Computation
Goodness of Fit
Health surveys
Inflation
Longitudinal Studies
Maximum Likelihood Statistics
Monte Carlo Methods
Multimodality
Probability
Statistical Analysis
Statistical Bias
Statistical Distributions
Statistical Inference
title Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T16%3A50%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Generalized%20Inflated%20Discrete%20Models:%20A%20Strategy%20to%20Work%20with%20Multimodal%20Discrete%20Distributions&rft.jtitle=Sociological%20methods%20&%20research&rft.au=Cai,%20Tianji&rft.date=2021-02&rft.volume=50&rft.issue=1&rft.spage=365&rft.epage=400&rft.pages=365-400&rft.issn=0049-1241&rft.eissn=1552-8294&rft_id=info:doi/10.1177/0049124118782535&rft_dat=%3Cproquest_cross%3E2476813281%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2476813281&rft_id=info:pmid/&rft_ericid=EJ1281926&rft_sage_id=10.1177_0049124118782535&rfr_iscdi=true