Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting
This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learner...
Gespeichert in:
Veröffentlicht in: | Egitim ve Bilim 2025-01, Vol.49 (220), p.239-257 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 257 |
---|---|
container_issue | 220 |
container_start_page | 239 |
container_title | Egitim ve Bilim |
container_volume | 49 |
creator | Savuran, Yigit Cubukcu, Zuhal |
description | This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. To provide warrants, we assumed a 50% probability of a test-taker being at a CEFR level for Chi-square goodness-of-fit tests, which were conducted to assess the statistical significance between the expected and observed numbers of students under and above the cut score for each level. The distribution of student scores--with acceptable item difficulty and discrimination indices--cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. Keywords Turkish L2 Test validation Assessment-use argument Standard-setting |
doi_str_mv | 10.15390/EB.2024.12959 |
format | Article |
fullrecord | <record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_journals_3141583330</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A817540701</galeid><sourcerecordid>A817540701</sourcerecordid><originalsourceid>FETCH-LOGICAL-g670-95b6ca9f3f3e2b684894bbbc3033b8294b2c98f334d6206702fd3b86ba40e55c3</originalsourceid><addsrcrecordid>eNo1j81rwzAMxX3YYKXrdWfDzslkK5-7paX7gMJgDbsWJ7ZTt6nTxc5g__08uqGDeE_S7yFC7hjELMUSHtbLmANPYsbLtLwiM4YAEUPMb8jCuQMAMMg5y7MZ6T9Eb6Twxna0Vs47OmhaT-PRuD3dcPquWnX25kvR7dH0vXuklaXV2E0nZX20FE5J-o8YLN36SX5Tvx-HqdsHJawUo6Rb5X8Tbsm1Fr1Ti78-J_XTul69RJu359dVtYm6LIeoTJusFaVGjYo3WZEUZdI0TYuA2BQ8CN6WhUZMZMYhXHAtwyBrRAIqTVuck_sL9jwOn1N4ancYptGGxB2yhKUFYmDNSXzZ6kSvdsbqwY-iDSXVybSDVdoEvypYniaQA8Mf1YxoJw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3141583330</pqid></control><display><type>article</type><title>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</title><source>Education Source</source><creator>Savuran, Yigit ; Cubukcu, Zuhal</creator><creatorcontrib>Savuran, Yigit ; Cubukcu, Zuhal</creatorcontrib><description>This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. To provide warrants, we assumed a 50% probability of a test-taker being at a CEFR level for Chi-square goodness-of-fit tests, which were conducted to assess the statistical significance between the expected and observed numbers of students under and above the cut score for each level. The distribution of student scores--with acceptable item difficulty and discrimination indices--cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. Keywords Turkish L2 Test validation Assessment-use argument Standard-setting</description><identifier>ISSN: 1300-1337</identifier><identifier>DOI: 10.15390/EB.2024.12959</identifier><language>eng</language><publisher>Ankara: Turkish Education Association</publisher><subject>Adult Basic Education ; Adult education ; Adult Learning ; Adult Students ; Analysis ; Cutting Scores ; Data collection ; Educational standards ; Language tests ; Receptive language ; Second language learning ; Standard Setting ; Standard Setting (Scoring) ; Test validity and reliability ; Turkish language ; Validation studies ; Validity</subject><ispartof>Egitim ve Bilim, 2025-01, Vol.49 (220), p.239-257</ispartof><rights>COPYRIGHT 2025 Turkish Education Association</rights><rights>Copyright Turk Egitim Dernegi 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Savuran, Yigit</creatorcontrib><creatorcontrib>Cubukcu, Zuhal</creatorcontrib><title>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</title><title>Egitim ve Bilim</title><description>This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. To provide warrants, we assumed a 50% probability of a test-taker being at a CEFR level for Chi-square goodness-of-fit tests, which were conducted to assess the statistical significance between the expected and observed numbers of students under and above the cut score for each level. The distribution of student scores--with acceptable item difficulty and discrimination indices--cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. Keywords Turkish L2 Test validation Assessment-use argument Standard-setting</description><subject>Adult Basic Education</subject><subject>Adult education</subject><subject>Adult Learning</subject><subject>Adult Students</subject><subject>Analysis</subject><subject>Cutting Scores</subject><subject>Data collection</subject><subject>Educational standards</subject><subject>Language tests</subject><subject>Receptive language</subject><subject>Second language learning</subject><subject>Standard Setting</subject><subject>Standard Setting (Scoring)</subject><subject>Test validity and reliability</subject><subject>Turkish language</subject><subject>Validation studies</subject><subject>Validity</subject><issn>1300-1337</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNo1j81rwzAMxX3YYKXrdWfDzslkK5-7paX7gMJgDbsWJ7ZTt6nTxc5g__08uqGDeE_S7yFC7hjELMUSHtbLmANPYsbLtLwiM4YAEUPMb8jCuQMAMMg5y7MZ6T9Eb6Twxna0Vs47OmhaT-PRuD3dcPquWnX25kvR7dH0vXuklaXV2E0nZX20FE5J-o8YLN36SX5Tvx-HqdsHJawUo6Rb5X8Tbsm1Fr1Ti78-J_XTul69RJu359dVtYm6LIeoTJusFaVGjYo3WZEUZdI0TYuA2BQ8CN6WhUZMZMYhXHAtwyBrRAIqTVuck_sL9jwOn1N4ancYptGGxB2yhKUFYmDNSXzZ6kSvdsbqwY-iDSXVybSDVdoEvypYniaQA8Mf1YxoJw</recordid><startdate>20250101</startdate><enddate>20250101</enddate><creator>Savuran, Yigit</creator><creator>Cubukcu, Zuhal</creator><general>Turkish Education Association</general><general>Turk Egitim Dernegi</general><scope>0-V</scope><scope>3V.</scope><scope>7T9</scope><scope>7XB</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>DWQXO</scope><scope>EDSIH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>M0P</scope><scope>M2O</scope><scope>MBDVC</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20250101</creationdate><title>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</title><author>Savuran, Yigit ; Cubukcu, Zuhal</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-g670-95b6ca9f3f3e2b684894bbbc3033b8294b2c98f334d6206702fd3b86ba40e55c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Adult Basic Education</topic><topic>Adult education</topic><topic>Adult Learning</topic><topic>Adult Students</topic><topic>Analysis</topic><topic>Cutting Scores</topic><topic>Data collection</topic><topic>Educational standards</topic><topic>Language tests</topic><topic>Receptive language</topic><topic>Second language learning</topic><topic>Standard Setting</topic><topic>Standard Setting (Scoring)</topic><topic>Test validity and reliability</topic><topic>Turkish language</topic><topic>Validation studies</topic><topic>Validity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Savuran, Yigit</creatorcontrib><creatorcontrib>Cubukcu, Zuhal</creatorcontrib><collection>ProQuest Social Sciences Premium Collection</collection><collection>ProQuest Central (Corporate)</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>ProQuest Central Korea</collection><collection>Turkey Database</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>Education Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Egitim ve Bilim</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Savuran, Yigit</au><au>Cubukcu, Zuhal</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</atitle><jtitle>Egitim ve Bilim</jtitle><date>2025-01-01</date><risdate>2025</risdate><volume>49</volume><issue>220</issue><spage>239</spage><epage>257</epage><pages>239-257</pages><issn>1300-1337</issn><abstract>This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. To provide warrants, we assumed a 50% probability of a test-taker being at a CEFR level for Chi-square goodness-of-fit tests, which were conducted to assess the statistical significance between the expected and observed numbers of students under and above the cut score for each level. The distribution of student scores--with acceptable item difficulty and discrimination indices--cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. Keywords Turkish L2 Test validation Assessment-use argument Standard-setting</abstract><cop>Ankara</cop><pub>Turkish Education Association</pub><doi>10.15390/EB.2024.12959</doi><tpages>19</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1300-1337 |
ispartof | Egitim ve Bilim, 2025-01, Vol.49 (220), p.239-257 |
issn | 1300-1337 |
language | eng |
recordid | cdi_proquest_journals_3141583330 |
source | Education Source |
subjects | Adult Basic Education Adult education Adult Learning Adult Students Analysis Cutting Scores Data collection Educational standards Language tests Receptive language Second language learning Standard Setting Standard Setting (Scoring) Test validity and reliability Turkish language Validation studies Validity |
title | Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T19%3A17%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Validating%20Tests%20of%20Turkish%20L2%20Receptive%20Skills:%20An%20Argument-Based%20Validation%20Study%20through%20Standard%20Setting&rft.jtitle=Egitim%20ve%20Bilim&rft.au=Savuran,%20Yigit&rft.date=2025-01-01&rft.volume=49&rft.issue=220&rft.spage=239&rft.epage=257&rft.pages=239-257&rft.issn=1300-1337&rft_id=info:doi/10.15390/EB.2024.12959&rft_dat=%3Cgale_proqu%3EA817540701%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3141583330&rft_id=info:pmid/&rft_galeid=A817540701&rfr_iscdi=true |