Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting

This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learner...

Detailed description

Bibliographic details
Published in:	Egitim ve Bilim 2025-01, Vol.49 (220), p.239-257
Main authors:	Savuran, Yigit; Cubukcu, Zuhal
Format:	Article
Language:	eng
Subjects:	Adult Basic Education; Adult education; Adult Learning; Adult Students; Analysis; Cutting Scores; Data collection; Educational standards; Language tests; Receptive language; Second language learning; Standard Setting; Standard Setting (Scoring); Test validity and reliability; Turkish language; Validation studies; Validity
Online access:	Full text

container_end_page	257
container_issue	220
container_start_page	239
container_title	Egitim ve Bilim
container_volume	49
creator	Savuran, Yigit ; Cubukcu, Zuhal
description	This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. To provide warrants, we conducted chi-square goodness-of-fit tests, assuming a 50% probability of a test-taker being at a given CEFR level, to assess whether the observed numbers of students below and above the cut score for each level differed significantly from the expected numbers. The distribution of student scores (with acceptable item difficulty and discrimination indices), cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. Keywords: Turkish L2; Test validation; Assessment-use argument; Standard-setting
doi_str_mv	10.15390/EB.2024.12959
format	Article
fullrecord	<record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_journals_3141583330</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A817540701</galeid><sourcerecordid>A817540701</sourcerecordid><originalsourceid>FETCH-LOGICAL-g670-95b6ca9f3f3e2b684894bbbc3033b8294b2c98f334d6206702fd3b86ba40e55c3</originalsourceid><addsrcrecordid>eNo1j81rwzAMxX3YYKXrdWfDzslkK5-7paX7gMJgDbsWJ7ZTt6nTxc5g__08uqGDeE_S7yFC7hjELMUSHtbLmANPYsbLtLwiM4YAEUPMb8jCuQMAMMg5y7MZ6T9Eb6Twxna0Vs47OmhaT-PRuD3dcPquWnX25kvR7dH0vXuklaXV2E0nZX20FE5J-o8YLN36SX5Tvx-HqdsHJawUo6Rb5X8Tbsm1Fr1Ti78-J_XTul69RJu359dVtYm6LIeoTJusFaVGjYo3WZEUZdI0TYuA2BQ8CN6WhUZMZMYhXHAtwyBrRAIqTVuck_sL9jwOn1N4ancYptGGxB2yhKUFYmDNSXzZ6kSvdsbqwY-iDSXVybSDVdoEvypYniaQA8Mf1YxoJw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3141583330</pqid></control><display><type>article</type><title>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</title><source>Education Source</source><creator>Savuran, Yigit ; Cubukcu, Zuhal</creator><creatorcontrib>Savuran, Yigit ; Cubukcu, Zuhal</creatorcontrib><description>This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. 
To provide warrants, we assumed a 50% probability of a test-taker being at a CEFR level for Chi-square goodness-of-fit tests, which were conducted to assess the statistical significance between the expected and observed numbers of students under and above the cut score for each level. The distribution of student scores--with acceptable item difficulty and discrimination indices--cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. Keywords Turkish L2 Test validation Assessment-use argument Standard-setting</description><identifier>ISSN: 1300-1337</identifier><identifier>DOI: 10.15390/EB.2024.12959</identifier><language>eng</language><publisher>Ankara: Turkish Education Association</publisher><subject>Adult Basic Education ; Adult education ; Adult Learning ; Adult Students ; Analysis ; Cutting Scores ; Data collection ; Educational standards ; Language tests ; Receptive language ; Second language learning ; Standard Setting ; Standard Setting (Scoring) ; Test validity and reliability ; Turkish language ; Validation studies ; Validity</subject><ispartof>Egitim ve Bilim, 2025-01, Vol.49 (220), p.239-257</ispartof><rights>COPYRIGHT 2025 Turkish Education Association</rights><rights>Copyright Turk Egitim Dernegi 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Savuran, 
Yigit</creatorcontrib><creatorcontrib>Cubukcu, Zuhal</creatorcontrib><title>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</title><title>Egitim ve Bilim</title><description>This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. To provide warrants, we assumed a 50% probability of a test-taker being at a CEFR level for Chi-square goodness-of-fit tests, which were conducted to assess the statistical significance between the expected and observed numbers of students under and above the cut score for each level. The distribution of student scores--with acceptable item difficulty and discrimination indices--cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. 
Keywords Turkish L2 Test validation Assessment-use argument Standard-setting</description><subject>Adult Basic Education</subject><subject>Adult education</subject><subject>Adult Learning</subject><subject>Adult Students</subject><subject>Analysis</subject><subject>Cutting Scores</subject><subject>Data collection</subject><subject>Educational standards</subject><subject>Language tests</subject><subject>Receptive language</subject><subject>Second language learning</subject><subject>Standard Setting</subject><subject>Standard Setting (Scoring)</subject><subject>Test validity and reliability</subject><subject>Turkish language</subject><subject>Validation studies</subject><subject>Validity</subject><issn>1300-1337</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNo1j81rwzAMxX3YYKXrdWfDzslkK5-7paX7gMJgDbsWJ7ZTt6nTxc5g__08uqGDeE_S7yFC7hjELMUSHtbLmANPYsbLtLwiM4YAEUPMb8jCuQMAMMg5y7MZ6T9Eb6Twxna0Vs47OmhaT-PRuD3dcPquWnX25kvR7dH0vXuklaXV2E0nZX20FE5J-o8YLN36SX5Tvx-HqdsHJawUo6Rb5X8Tbsm1Fr1Ti78-J_XTul69RJu359dVtYm6LIeoTJusFaVGjYo3WZEUZdI0TYuA2BQ8CN6WhUZMZMYhXHAtwyBrRAIqTVuck_sL9jwOn1N4ancYptGGxB2yhKUFYmDNSXzZ6kSvdsbqwY-iDSXVybSDVdoEvypYniaQA8Mf1YxoJw</recordid><startdate>20250101</startdate><enddate>20250101</enddate><creator>Savuran, Yigit</creator><creator>Cubukcu, Zuhal</creator><general>Turkish Education Association</general><general>Turk Egitim 
Dernegi</general><scope>0-V</scope><scope>3V.</scope><scope>7T9</scope><scope>7XB</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>DWQXO</scope><scope>EDSIH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>M0P</scope><scope>M2O</scope><scope>MBDVC</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20250101</creationdate><title>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</title><author>Savuran, Yigit ; Cubukcu, Zuhal</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-g670-95b6ca9f3f3e2b684894bbbc3033b8294b2c98f334d6206702fd3b86ba40e55c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Adult Basic Education</topic><topic>Adult education</topic><topic>Adult Learning</topic><topic>Adult Students</topic><topic>Analysis</topic><topic>Cutting Scores</topic><topic>Data collection</topic><topic>Educational standards</topic><topic>Language tests</topic><topic>Receptive language</topic><topic>Second language learning</topic><topic>Standard Setting</topic><topic>Standard Setting (Scoring)</topic><topic>Test validity and reliability</topic><topic>Turkish language</topic><topic>Validation studies</topic><topic>Validity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Savuran, Yigit</creatorcontrib><creatorcontrib>Cubukcu, Zuhal</creatorcontrib><collection>ProQuest Social Sciences Premium Collection</collection><collection>ProQuest Central (Corporate)</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>ProQuest Central (purchase pre-March 
2016)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>ProQuest Central Korea</collection><collection>Turkey Database</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>Education Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Egitim ve Bilim</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Savuran, Yigit</au><au>Cubukcu, Zuhal</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting</atitle><jtitle>Egitim ve Bilim</jtitle><date>2025-01-01</date><risdate>2025</risdate><volume>49</volume><issue>220</issue><spage>239</spage><epage>257</epage><pages>239-257</pages><issn>1300-1337</issn><abstract>This paper reports on a validation study based on an assessment-use argument for four level tests developed under a larger project for adult learners of Turkish as a second language (L2). 
We treat the test scores as data on which we build the validity claim that the tests accurately classify learners into the intended Common European Framework of Reference for Languages (CEFR) levels. Four level tests (A1, A2, B1, and B2), each comprising listening and reading tasks, were administered to mixed groups of students, including those at Pre-A1 and C1 levels. Cut scores for each of the four tests were determined through the Angoff method and were considered as backing for the validity claim. To provide warrants, we assumed a 50% probability of a test-taker being at a CEFR level for Chi-square goodness-of-fit tests, which were conducted to assess the statistical significance between the expected and observed numbers of students under and above the cut score for each level. The distribution of student scores--with acceptable item difficulty and discrimination indices--cut scores placed in the intervals between adjacent levels, and chi-square analyses of all four tests enabled us to conclude that the tests have the potential to validly demonstrate the intended learner performance. With its innovative design and techniques in data collection and analysis, this paper offers theoretical, methodological and practical insights for practitioners in Turkish L2, based on solid empirical evidence. Keywords Turkish L2 Test validation Assessment-use argument Standard-setting</abstract><cop>Ankara</cop><pub>Turkish Education Association</pub><doi>10.15390/EB.2024.12959</doi><tpages>19</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1300-1337
ispartof	Egitim ve Bilim, 2025-01, Vol.49 (220), p.239-257
issn	1300-1337
language	eng
recordid	cdi_proquest_journals_3141583330
source	Education Source
subjects	Adult Basic Education ; Adult education ; Adult Learning ; Adult Students ; Analysis ; Cutting Scores ; Data collection ; Educational standards ; Language tests ; Receptive language ; Second language learning ; Standard Setting ; Standard Setting (Scoring) ; Test validity and reliability ; Turkish language ; Validation studies ; Validity
title	Validating Tests of Turkish L2 Receptive Skills: An Argument-Based Validation Study through Standard Setting
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T19%3A17%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Validating%20Tests%20of%20Turkish%20L2%20Receptive%20Skills:%20An%20Argument-Based%20Validation%20Study%20through%20Standard%20Setting&rft.jtitle=Egitim%20ve%20Bilim&rft.au=Savuran,%20Yigit&rft.date=2025-01-01&rft.volume=49&rft.issue=220&rft.spage=239&rft.epage=257&rft.pages=239-257&rft.issn=1300-1337&rft_id=info:doi/10.15390/EB.2024.12959&rft_dat=%3Cgale_proqu%3EA817540701%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3141583330&rft_id=info:pmid/&rft_galeid=A817540701&rfr_iscdi=true
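
Illustrative note on the method named in the description field: the abstract mentions Angoff cut scores and chi-square goodness-of-fit tests against a 50% expectation, but gives no computational detail. The following is a minimal sketch of how such a check might look, not the authors' procedure. The function names, the judge ratings, the counts, the treatment of scores equal to the cut, and the choice of the basic (unmodified) Angoff variant are all assumptions made for illustration.

```python
import math


def angoff_cut_score(ratings):
    """Basic Angoff cut score (assumed variant).

    ratings[j][i] is judge j's estimated probability that a borderline
    test-taker answers item i correctly. Each judge's estimates are summed
    to a raw-score cut, and the cuts are averaged across judges.
    """
    per_judge = [sum(judge) for judge in ratings]
    return sum(per_judge) / len(per_judge)


def chi_square_gof_5050(n_below, n_at_or_above):
    """Chi-square goodness-of-fit test against an expected 50/50 split.

    Returns (statistic, p_value). With two categories there is one degree
    of freedom, for which the upper-tail probability is erfc(sqrt(x / 2)).
    """
    total = n_below + n_at_or_above
    expected = total / 2
    stat = ((n_below - expected) ** 2
            + (n_at_or_above - expected) ** 2) / expected
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Hypothetical ratings from two judges on a four-item test.
ratings = [
    [0.6, 0.5, 0.7, 0.4],
    [0.5, 0.5, 0.8, 0.6],
]
cut = angoff_cut_score(ratings)

# Hypothetical counts of test-takers below vs. at or above the cut score.
stat, p_value = chi_square_gof_5050(n_below=30, n_at_or_above=10)
print(f"cut score: {cut:.2f}, chi-square: {stat:.2f}, p: {p_value:.4f}")
```

A small p-value here would indicate that the observed split around the cut score departs from the assumed 50/50 expectation; how that outcome bears on the validity argument depends on the group tested and is set out in the article itself.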