A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines

Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Educational and psychological measurement 2019-12, Vol.79 (6), p.1064-1074
1. Verfasser: Kalinowski, Steven T.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1074
container_issue 6
container_start_page 1064
container_title Educational and psychological measurement
container_volume 79
creator Kalinowski, Steven T.
description Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates.
doi_str_mv 10.1177/0013164419846234
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6777066</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ericid>EJ1227406</ericid><sage_id>10.1177_0013164419846234</sage_id><sourcerecordid>2461147203</sourcerecordid><originalsourceid>FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</originalsourceid><addsrcrecordid>eNqFkc2L1DAYh4Mo7uzq3YsQ8OKlmjRvk_QiLOt-ySyCjueQpm-nWTpNTTrC_PemzLLigphLDr_n9-TjJeQNZx84V-ojY1xwCcBrDbIU8IyseFWVhdBaPyerJS6W_IScpnTP8gLOX5KTXFoqbEW-n9PraKfeOzvQO5z70NIuRPrZp2mwBz9u6dwjvQstDvTKzzR09HbGHf2GaQpjQrrpMcQD3UTrkK79iOkVedHZIeHrh_2M_Li63FzcFOuv17cX5-vCQQ1zwUE3VnZCN51UAoVumeRaQ6OrulW2bkrRAlSZUoi2Btc5aCRY5dCKloE4I5-O3mnf7LB1OM7RDmaKfmfjwQTrzd_J6HuzDb-MVEoxKbPg_YMghp97TLPZ-eRwGOyIYZ9MCZJzUCUT_0cFk1BLCTyj756g92Efx_wTpix1xQCyMlPsSLkYUorYPd6bM7MM1zwdbq68PVYweveIX37hZamALc8pjnmyW_xz6D99vwFafKni</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2285044472</pqid></control><display><type>article</type><title>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>SAGE Complete A-Z List</source><source>PubMed Central</source><creator>Kalinowski, Steven T.</creator><creatorcontrib>Kalinowski, Steven T.</creatorcontrib><description>Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates.</description><identifier>ISSN: 0013-1644</identifier><identifier>EISSN: 1552-3888</identifier><identifier>DOI: 10.1177/0013164419846234</identifier><identifier>PMID: 31619840</identifier><language>eng</language><publisher>Los Angeles, CA: SAGE Publications</publisher><subject>Computer Simulation ; Confidence intervals ; Educational Assessment ; Educational Development ; Educational evaluation ; Educational Testing ; Error Patterns ; Goodness of Fit ; Graphs ; Intervals ; Item Response Theory ; Probability ; Simulation ; Student Evaluation ; Students ; Test Items ; Tests ; Vignettes</subject><ispartof>Educational and psychological measurement, 2019-12, Vol.79 (6), p.1064-1074</ispartof><rights>The Author(s) 2019</rights><rights>The Author(s) 2019 2019 SAGE Publications</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</citedby><cites>FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</cites><orcidid>0000-0001-8504-4923</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6777066/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6777066/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,21819,27924,27925,43621,43622,53791,53793</link.rule.ids><backlink>$$Uhttp://eric.ed.gov/ERICWebPortal/detail?accno=EJ1227406$$DView record in ERIC$$Hfree_for_read</backlink></links><search><creatorcontrib>Kalinowski, Steven T.</creatorcontrib><title>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</title><title>Educational and psychological measurement</title><description>Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates.</description><subject>Computer Simulation</subject><subject>Confidence intervals</subject><subject>Educational Assessment</subject><subject>Educational Development</subject><subject>Educational evaluation</subject><subject>Educational Testing</subject><subject>Error Patterns</subject><subject>Goodness of Fit</subject><subject>Graphs</subject><subject>Intervals</subject><subject>Item Response Theory</subject><subject>Probability</subject><subject>Simulation</subject><subject>Student Evaluation</subject><subject>Students</subject><subject>Test Items</subject><subject>Tests</subject><subject>Vignettes</subject><issn>0013-1644</issn><issn>1552-3888</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNqFkc2L1DAYh4Mo7uzq3YsQ8OKlmjRvk_QiLOt-ySyCjueQpm-nWTpNTTrC_PemzLLigphLDr_n9-TjJeQNZx84V-ojY1xwCcBrDbIU8IyseFWVhdBaPyerJS6W_IScpnTP8gLOX5KTXFoqbEW-n9PraKfeOzvQO5z70NIuRPrZp2mwBz9u6dwjvQstDvTKzzR09HbGHf2GaQpjQrrpMcQD3UTrkK79iOkVedHZIeHrh_2M_Li63FzcFOuv17cX5-vCQQ1zwUE3VnZCN51UAoVumeRaQ6OrulW2bkrRAlSZUoi2Btc5aCRY5dCKloE4I5-O3mnf7LB1OM7RDmaKfmfjwQTrzd_J6HuzDb-MVEoxKbPg_YMghp97TLPZ-eRwGOyIYZ9MCZJzUCUT_0cFk1BLCTyj756g92Efx_wTpix1xQCyMlPsSLkYUorYPd6bM7MM1zwdbq68PVYweveIX37hZamALc8pjnmyW_xz6D99vwFafKni</recordid><startdate>20191201</startdate><enddate>20191201</enddate><creator>Kalinowski, Steven T.</creator><general>SAGE Publications</general><general>SAGE PUBLICATIONS, INC</general><scope>7SW</scope><scope>BJH</scope><scope>BNH</scope><scope>BNI</scope><scope>BNJ</scope><scope>BNO</scope><scope>ERI</scope><scope>PET</scope><scope>REK</scope><scope>WWN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-8504-4923</orcidid></search><sort><creationdate>20191201</creationdate><title>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</title><author>Kalinowski, Steven T.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Simulation</topic><topic>Confidence intervals</topic><topic>Educational Assessment</topic><topic>Educational Development</topic><topic>Educational evaluation</topic><topic>Educational Testing</topic><topic>Error Patterns</topic><topic>Goodness of Fit</topic><topic>Graphs</topic><topic>Intervals</topic><topic>Item Response Theory</topic><topic>Probability</topic><topic>Simulation</topic><topic>Student Evaluation</topic><topic>Students</topic><topic>Test Items</topic><topic>Tests</topic><topic>Vignettes</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kalinowski, Steven T.</creatorcontrib><collection>ERIC</collection><collection>ERIC (Ovid)</collection><collection>ERIC</collection><collection>ERIC</collection><collection>ERIC (Legacy Platform)</collection><collection>ERIC( SilverPlatter )</collection><collection>ERIC</collection><collection>ERIC PlusText (Legacy Platform)</collection><collection>Education Resources Information Center (ERIC)</collection><collection>ERIC</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Educational and psychological measurement</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kalinowski, Steven T.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><ericid>EJ1227406</ericid><atitle>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</atitle><jtitle>Educational and psychological measurement</jtitle><date>2019-12-01</date><risdate>2019</risdate><volume>79</volume><issue>6</issue><spage>1064</spage><epage>1074</epage><pages>1064-1074</pages><issn>0013-1644</issn><eissn>1552-3888</eissn><abstract>Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates.</abstract><cop>Los Angeles, CA</cop><pub>SAGE Publications</pub><pmid>31619840</pmid><doi>10.1177/0013164419846234</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0001-8504-4923</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0013-1644
ispartof Educational and psychological measurement, 2019-12, Vol.79 (6), p.1064-1074
issn 0013-1644
1552-3888
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6777066
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; SAGE Complete A-Z List; PubMed Central
subjects Computer Simulation
Confidence intervals
Educational Assessment
Educational Development
Educational evaluation
Educational Testing
Error Patterns
Goodness of Fit
Graphs
Intervals
Item Response Theory
Probability
Simulation
Student Evaluation
Students
Test Items
Tests
Vignettes
title A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T18%3A07%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Graphical%20Method%20for%20Displaying%20the%20Model%20Fit%20of%20Item%20Response%20Theory%20Trace%20Lines&rft.jtitle=Educational%20and%20psychological%20measurement&rft.au=Kalinowski,%20Steven%20T.&rft.date=2019-12-01&rft.volume=79&rft.issue=6&rft.spage=1064&rft.epage=1074&rft.pages=1064-1074&rft.issn=0013-1644&rft.eissn=1552-3888&rft_id=info:doi/10.1177/0013164419846234&rft_dat=%3Cproquest_pubme%3E2461147203%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2285044472&rft_id=info:pmid/31619840&rft_ericid=EJ1227406&rft_sage_id=10.1177_0013164419846234&rfr_iscdi=true