A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines
Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing....
Published in: | Educational and psychological measurement 2019-12, Vol.79 (6), p.1064-1074 |
---|---|
Main author: | Kalinowski, Steven T. |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | 1074 |
---|---|
container_issue | 6 |
container_start_page | 1064 |
container_title | Educational and psychological measurement |
container_volume | 79 |
creator | Kalinowski, Steven T. |
description | Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates. |
doi_str_mv | 10.1177/0013164419846234 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6777066</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ericid>EJ1227406</ericid><sage_id>10.1177_0013164419846234</sage_id><sourcerecordid>2461147203</sourcerecordid><originalsourceid>FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</originalsourceid><addsrcrecordid>eNqFkc2L1DAYh4Mo7uzq3YsQ8OKlmjRvk_QiLOt-ySyCjueQpm-nWTpNTTrC_PemzLLigphLDr_n9-TjJeQNZx84V-ojY1xwCcBrDbIU8IyseFWVhdBaPyerJS6W_IScpnTP8gLOX5KTXFoqbEW-n9PraKfeOzvQO5z70NIuRPrZp2mwBz9u6dwjvQstDvTKzzR09HbGHf2GaQpjQrrpMcQD3UTrkK79iOkVedHZIeHrh_2M_Li63FzcFOuv17cX5-vCQQ1zwUE3VnZCN51UAoVumeRaQ6OrulW2bkrRAlSZUoi2Btc5aCRY5dCKloE4I5-O3mnf7LB1OM7RDmaKfmfjwQTrzd_J6HuzDb-MVEoxKbPg_YMghp97TLPZ-eRwGOyIYZ9MCZJzUCUT_0cFk1BLCTyj756g92Efx_wTpix1xQCyMlPsSLkYUorYPd6bM7MM1zwdbq68PVYweveIX37hZamALc8pjnmyW_xz6D99vwFafKni</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2285044472</pqid></control><display><type>article</type><title>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>SAGE Complete A-Z List</source><source>PubMed Central</source><creator>Kalinowski, Steven T.</creator><creatorcontrib>Kalinowski, Steven T.</creatorcontrib><description>Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. 
Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates.</description><identifier>ISSN: 0013-1644</identifier><identifier>EISSN: 1552-3888</identifier><identifier>DOI: 10.1177/0013164419846234</identifier><identifier>PMID: 31619840</identifier><language>eng</language><publisher>Los Angeles, CA: SAGE Publications</publisher><subject>Computer Simulation ; Confidence intervals ; Educational Assessment ; Educational Development ; Educational evaluation ; Educational Testing ; Error Patterns ; Goodness of Fit ; Graphs ; Intervals ; Item Response Theory ; Probability ; Simulation ; Student Evaluation ; Students ; Test Items ; Tests ; Vignettes</subject><ispartof>Educational and psychological measurement, 2019-12, Vol.79 (6), p.1064-1074</ispartof><rights>The Author(s) 2019</rights><rights>The Author(s) 2019 2019 SAGE 
Publications</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</citedby><cites>FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</cites><orcidid>0000-0001-8504-4923</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6777066/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6777066/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,21819,27924,27925,43621,43622,53791,53793</link.rule.ids><backlink>$$Uhttp://eric.ed.gov/ERICWebPortal/detail?accno=EJ1227406$$DView record in ERIC$$Hfree_for_read</backlink></links><search><creatorcontrib>Kalinowski, Steven T.</creatorcontrib><title>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</title><title>Educational and psychological measurement</title><description>Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). 
Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates.</description><subject>Computer Simulation</subject><subject>Confidence intervals</subject><subject>Educational Assessment</subject><subject>Educational Development</subject><subject>Educational evaluation</subject><subject>Educational Testing</subject><subject>Error Patterns</subject><subject>Goodness of Fit</subject><subject>Graphs</subject><subject>Intervals</subject><subject>Item Response Theory</subject><subject>Probability</subject><subject>Simulation</subject><subject>Student Evaluation</subject><subject>Students</subject><subject>Test 
Items</subject><subject>Tests</subject><subject>Vignettes</subject><issn>0013-1644</issn><issn>1552-3888</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNqFkc2L1DAYh4Mo7uzq3YsQ8OKlmjRvk_QiLOt-ySyCjueQpm-nWTpNTTrC_PemzLLigphLDr_n9-TjJeQNZx84V-ojY1xwCcBrDbIU8IyseFWVhdBaPyerJS6W_IScpnTP8gLOX5KTXFoqbEW-n9PraKfeOzvQO5z70NIuRPrZp2mwBz9u6dwjvQstDvTKzzR09HbGHf2GaQpjQrrpMcQD3UTrkK79iOkVedHZIeHrh_2M_Li63FzcFOuv17cX5-vCQQ1zwUE3VnZCN51UAoVumeRaQ6OrulW2bkrRAlSZUoi2Btc5aCRY5dCKloE4I5-O3mnf7LB1OM7RDmaKfmfjwQTrzd_J6HuzDb-MVEoxKbPg_YMghp97TLPZ-eRwGOyIYZ9MCZJzUCUT_0cFk1BLCTyj756g92Efx_wTpix1xQCyMlPsSLkYUorYPd6bM7MM1zwdbq68PVYweveIX37hZamALc8pjnmyW_xz6D99vwFafKni</recordid><startdate>20191201</startdate><enddate>20191201</enddate><creator>Kalinowski, Steven T.</creator><general>SAGE Publications</general><general>SAGE PUBLICATIONS, INC</general><scope>7SW</scope><scope>BJH</scope><scope>BNH</scope><scope>BNI</scope><scope>BNJ</scope><scope>BNO</scope><scope>ERI</scope><scope>PET</scope><scope>REK</scope><scope>WWN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-8504-4923</orcidid></search><sort><creationdate>20191201</creationdate><title>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</title><author>Kalinowski, Steven T.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c494t-148ba6f38bf673e38d061884b859d7a9b23d44548b7eea94cfc4b64a7cea3d043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Simulation</topic><topic>Confidence intervals</topic><topic>Educational Assessment</topic><topic>Educational Development</topic><topic>Educational evaluation</topic><topic>Educational Testing</topic><topic>Error Patterns</topic><topic>Goodness of 
Fit</topic><topic>Graphs</topic><topic>Intervals</topic><topic>Item Response Theory</topic><topic>Probability</topic><topic>Simulation</topic><topic>Student Evaluation</topic><topic>Students</topic><topic>Test Items</topic><topic>Tests</topic><topic>Vignettes</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kalinowski, Steven T.</creatorcontrib><collection>ERIC</collection><collection>ERIC (Ovid)</collection><collection>ERIC</collection><collection>ERIC</collection><collection>ERIC (Legacy Platform)</collection><collection>ERIC( SilverPlatter )</collection><collection>ERIC</collection><collection>ERIC PlusText (Legacy Platform)</collection><collection>Education Resources Information Center (ERIC)</collection><collection>ERIC</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Educational and psychological measurement</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kalinowski, Steven T.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><ericid>EJ1227406</ericid><atitle>A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines</atitle><jtitle>Educational and psychological measurement</jtitle><date>2019-12-01</date><risdate>2019</risdate><volume>79</volume><issue>6</issue><spage>1064</spage><epage>1074</epage><pages>1064-1074</pages><issn>0013-1644</issn><eissn>1552-3888</eissn><abstract>Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. 
The graph, which is referred to herein as a “bin plot,” is the IRT equivalent of a scatterplot for linear regression. Bin plots display a conventional IRT trace line (with ability on the horizontal axis and probability correct on the vertical axis). Students are binned according to how well they performed on the entire test, and the proportion of students in each bin who answered the focal question correctly is displayed on the graph as points above or below the trace line. With this arrangement, the difference between each point and the trace line is the residual for the bin. Confidence intervals can be added to the observed proportions in order to display uncertainty. Computer simulations were used to test four alternative ways for binning students. These simulations showed that binning students according to number of questions they answered correctly on the entire test works best. Simulations also showed confidence intervals for bin plots had coverage probabilities close to nominal values for common testing scenarios, but that there are scenarios in which confidence intervals had inflated error rates.</abstract><cop>Los Angeles, CA</cop><pub>SAGE Publications</pub><pmid>31619840</pmid><doi>10.1177/0013164419846234</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0001-8504-4923</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0013-1644 |
ispartof | Educational and psychological measurement, 2019-12, Vol.79 (6), p.1064-1074 |
issn | 0013-1644; 1552-3888 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6777066 |
source | Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; SAGE Complete A-Z List; PubMed Central |
subjects | Computer Simulation; Confidence intervals; Educational Assessment; Educational Development; Educational evaluation; Educational Testing; Error Patterns; Goodness of Fit; Graphs; Intervals; Item Response Theory; Probability; Simulation; Student Evaluation; Students; Test Items; Tests; Vignettes |
title | A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T18%3A07%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Graphical%20Method%20for%20Displaying%20the%20Model%20Fit%20of%20Item%20Response%20Theory%20Trace%20Lines&rft.jtitle=Educational%20and%20psychological%20measurement&rft.au=Kalinowski,%20Steven%20T.&rft.date=2019-12-01&rft.volume=79&rft.issue=6&rft.spage=1064&rft.epage=1074&rft.pages=1064-1074&rft.issn=0013-1644&rft.eissn=1552-3888&rft_id=info:doi/10.1177/0013164419846234&rft_dat=%3Cproquest_pubme%3E2461147203%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2285044472&rft_id=info:pmid/31619840&rft_ericid=EJ1227406&rft_sage_id=10.1177_0013164419846234&rfr_iscdi=true |
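
The binning procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation, and the abstract does not give these details: the function names (`trace_line`, `wilson_interval`, `bin_plot_points`, `simulate`), the simulated item parameters, and the use of a Wilson score interval for the confidence intervals are all assumptions made for illustration. The sketch groups students by their number-correct score on the entire test and computes, for each bin, the observed proportion answering the focal item correctly. It does not place the bins on the ability axis, since the abstract does not specify how that is done.

```python
import math
import random


def trace_line(theta, a, b, c):
    """Three-parameter logistic (3PL) probability of a correct answer.

    a = discrimination, b = difficulty, c = lower asymptote (guessing).
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion.

    Assumption: the interval type is chosen here for illustration only.
    """
    p_hat = successes / n
    denom = 1.0 + z * z / n
    centre = (p_hat + z * z / (2 * n)) / denom
    half = z * math.sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)


def bin_plot_points(responses, focal_item):
    """Bin students by number-correct score on the entire test.

    responses: list of per-student lists of 0/1 item scores.
    Returns {score: (n_students, proportion_correct, (ci_low, ci_high))}.
    """
    counts = {}
    for row in responses:
        score = sum(row)  # the focal item counts toward the total score
        n, k = counts.get(score, (0, 0))
        counts[score] = (n + 1, k + row[focal_item])
    return {
        score: (n, k / n, wilson_interval(k, n))
        for score, (n, k) in sorted(counts.items())
    }


def simulate(n_students, items, seed=0):
    """Simulate 0/1 responses under the 3PL model (hypothetical parameters)."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_students):
        theta = rng.gauss(0.0, 1.0)
        data.append(
            [int(rng.random() < trace_line(theta, *item)) for item in items]
        )
    return data


if __name__ == "__main__":
    # Ten hypothetical items with increasing difficulty.
    items = [(1.2, -1.0 + 0.2 * i, 0.2) for i in range(10)]
    data = simulate(500, items)
    for score, (n, p, (lo, hi)) in bin_plot_points(data, focal_item=4).items():
        print(f"score={score:2d} n={n:3d} p={p:.2f} CI=({lo:.2f}, {hi:.2f})")
```

Plotting each bin's proportion and interval against the fitted trace line would then show the per-bin residual as the vertical distance between the point and the curve.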