A Comparison of Several Goodness-of-Fit Statistics

A study was conducted to evaluate four goodness- of-fit procedures using data simulation techniques. The procedures were evaluated using data generated ac cording to three different item response theory models and a factor analytic model. Three different distribu tions of ability were used, as were...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied psychological measurement 1985-03, Vol.9 (1), p.49-57
Hauptverfasser: McKinley, Robert L., Mills, Craig N.
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 57
container_issue 1
container_start_page 49
container_title Applied psychological measurement
container_volume 9
creator McKinley, Robert L.
Mills, Craig N.
description A study was conducted to evaluate four goodness- of-fit procedures using data simulation techniques. The procedures were evaluated using data generated ac cording to three different item response theory models and a factor analytic model. Three different distribu tions of ability were used, as were three different sam ple sizes. It was concluded that the likelihood ratio chi-square procedure yielded the fewest erroneous re jections of the hypothesis of fit, whereas Bock's chi- square procedure yielded the fewest erroneous accep tances of fit. It was found that sample sizes some where between 500 and 1,000 were best. Shifts in the mean of the ability distribution were found to cause minor fluctuations, but they did not appear to be a major issue.
doi_str_mv 10.1177/014662168500900105
format Article
fullrecord <record><control><sourceid>sage_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1177_014662168500900105</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.1177_014662168500900105</sage_id><sourcerecordid>10.1177_014662168500900105</sourcerecordid><originalsourceid>FETCH-LOGICAL-c397t-7a3409689958d09e20e4f0e42844316d7f268f5742e876fd6058e2024cecb1333</originalsourceid><addsrcrecordid>eNp9j81KAzEUhYMoOFZfwNW8QOzNf7Isg61CwUV1PcSZRKa0k5IbBd_eGepOcHG4m--7nEPIPYMHxoxZApNac6atAnAADNQFqZhSnArpzCWpZoDOxDW5QdwDgNBOVYSv6iYdTz4PmMY6xXoXvkL2h3qTUj8GRJoiXQ-l3hVfBixDh7fkKvoDhrvfuyBv68fX5oluXzbPzWpLO-FMocYLCU5b55TtwQUOQcYp3EopmO5N5NpGZSQP1ujYa1B2grjsQvfOhBALws9_u5wQc4jtKQ9Hn79bBu28uv27epKWZwn9R2j36TOPU8f_jB9EklTf</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Comparison of Several Goodness-of-Fit Statistics</title><source>SAGE Complete A-Z List</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>McKinley, Robert L. ; Mills, Craig N.</creator><creatorcontrib>McKinley, Robert L. ; Mills, Craig N.</creatorcontrib><description>A study was conducted to evaluate four goodness- of-fit procedures using data simulation techniques. The procedures were evaluated using data generated ac cording to three different item response theory models and a factor analytic model. Three different distribu tions of ability were used, as were three different sam ple sizes. It was concluded that the likelihood ratio chi-square procedure yielded the fewest erroneous re jections of the hypothesis of fit, whereas Bock's chi- square procedure yielded the fewest erroneous accep tances of fit. It was found that sample sizes some where between 500 and 1,000 were best. Shifts in the mean of the ability distribution were found to cause minor fluctuations, but they did not appear to be a major issue.</description><identifier>ISSN: 0146-6216</identifier><identifier>EISSN: 1552-3497</identifier><identifier>DOI: 10.1177/014662168500900105</identifier><language>eng</language><publisher>Thousand Oaks, CA: Sage Publications</publisher><ispartof>Applied psychological measurement, 1985-03, Vol.9 (1), p.49-57</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c397t-7a3409689958d09e20e4f0e42844316d7f268f5742e876fd6058e2024cecb1333</citedby><cites>FETCH-LOGICAL-c397t-7a3409689958d09e20e4f0e42844316d7f268f5742e876fd6058e2024cecb1333</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/014662168500900105$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/014662168500900105$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>314,776,780,21798,27901,27902,43597,43598</link.rule.ids></links><search><creatorcontrib>McKinley, Robert L.</creatorcontrib><creatorcontrib>Mills, Craig N.</creatorcontrib><title>A Comparison of Several Goodness-of-Fit Statistics</title><title>Applied psychological measurement</title><description>A study was conducted to evaluate four goodness- of-fit procedures using data simulation techniques. The procedures were evaluated using data generated ac cording to three different item response theory models and a factor analytic model. Three different distribu tions of ability were used, as were three different sam ple sizes. It was concluded that the likelihood ratio chi-square procedure yielded the fewest erroneous re jections of the hypothesis of fit, whereas Bock's chi- square procedure yielded the fewest erroneous accep tances of fit. It was found that sample sizes some where between 500 and 1,000 were best. Shifts in the mean of the ability distribution were found to cause minor fluctuations, but they did not appear to be a major issue.</description><issn>0146-6216</issn><issn>1552-3497</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1985</creationdate><recordtype>article</recordtype><recordid>eNp9j81KAzEUhYMoOFZfwNW8QOzNf7Isg61CwUV1PcSZRKa0k5IbBd_eGepOcHG4m--7nEPIPYMHxoxZApNac6atAnAADNQFqZhSnArpzCWpZoDOxDW5QdwDgNBOVYSv6iYdTz4PmMY6xXoXvkL2h3qTUj8GRJoiXQ-l3hVfBixDh7fkKvoDhrvfuyBv68fX5oluXzbPzWpLO-FMocYLCU5b55TtwQUOQcYp3EopmO5N5NpGZSQP1ujYa1B2grjsQvfOhBALws9_u5wQc4jtKQ9Hn79bBu28uv27epKWZwn9R2j36TOPU8f_jB9EklTf</recordid><startdate>198503</startdate><enddate>198503</enddate><creator>McKinley, Robert L.</creator><creator>Mills, Craig N.</creator><general>Sage Publications</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>198503</creationdate><title>A Comparison of Several Goodness-of-Fit Statistics</title><author>McKinley, Robert L. ; Mills, Craig N.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c397t-7a3409689958d09e20e4f0e42844316d7f268f5742e876fd6058e2024cecb1333</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1985</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>McKinley, Robert L.</creatorcontrib><creatorcontrib>Mills, Craig N.</creatorcontrib><collection>CrossRef</collection><jtitle>Applied psychological measurement</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>McKinley, Robert L.</au><au>Mills, Craig N.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Comparison of Several Goodness-of-Fit Statistics</atitle><jtitle>Applied psychological measurement</jtitle><date>1985-03</date><risdate>1985</risdate><volume>9</volume><issue>1</issue><spage>49</spage><epage>57</epage><pages>49-57</pages><issn>0146-6216</issn><eissn>1552-3497</eissn><abstract>A study was conducted to evaluate four goodness- of-fit procedures using data simulation techniques. The procedures were evaluated using data generated ac cording to three different item response theory models and a factor analytic model. Three different distribu tions of ability were used, as were three different sam ple sizes. It was concluded that the likelihood ratio chi-square procedure yielded the fewest erroneous re jections of the hypothesis of fit, whereas Bock's chi- square procedure yielded the fewest erroneous accep tances of fit. It was found that sample sizes some where between 500 and 1,000 were best. Shifts in the mean of the ability distribution were found to cause minor fluctuations, but they did not appear to be a major issue.</abstract><cop>Thousand Oaks, CA</cop><pub>Sage Publications</pub><doi>10.1177/014662168500900105</doi><tpages>9</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0146-6216
ispartof Applied psychological measurement, 1985-03, Vol.9 (1), p.49-57
issn 0146-6216
1552-3497
language eng
recordid cdi_crossref_primary_10_1177_014662168500900105
source SAGE Complete A-Z List; EZB-FREE-00999 freely available EZB journals
title A Comparison of Several Goodness-of-Fit Statistics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T06%3A26%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-sage_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Comparison%20of%20Several%20Goodness-of-Fit%20Statistics&rft.jtitle=Applied%20psychological%20measurement&rft.au=McKinley,%20Robert%20L.&rft.date=1985-03&rft.volume=9&rft.issue=1&rft.spage=49&rft.epage=57&rft.pages=49-57&rft.issn=0146-6216&rft.eissn=1552-3497&rft_id=info:doi/10.1177/014662168500900105&rft_dat=%3Csage_cross%3E10.1177_014662168500900105%3C/sage_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_sage_id=10.1177_014662168500900105&rfr_iscdi=true