Coding energy knowledge in constructed responses with explainable NLP models

Background : Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed re...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of computer assisted learning 2022, Vol.39 (3), p.767-786
Hauptverfasser:	Gombert, Sebastian, Di Mitri, Daniele, Karademir, Onur, Kubsch, Marcus, Kolbe, Hannah, Tautz, Simon, Grimm, Adrian, Bohm, Isabell, Neumann, Knut, Drachsler, Hendrik
Format:	Artikel
Sprache:	eng
Schlagworte:	Antwort Aufgabe Automatisierung Bewertung Codierung Computerunterstützter Unterricht Deutschland Empirische Untersuchung Energie Leistungsbeurteilung Modell Physikunterricht Schüler Sekundarstufe I Text Wissen
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	786
container_issue	3
container_start_page	767
container_title	Journal of computer assisted learning
container_volume	39
creator	Gombert, Sebastian Di Mitri, Daniele Karademir, Onur Kubsch, Marcus Kolbe, Hannah Tautz, Simon Grimm, Adrian Bohm, Isabell Neumann, Knut Drachsler, Hendrik
description	Background : Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task and requires the application of natural language processing methodology. In this article, we implement and evaluate multiple machine learning models for coding energy knowledge in free-text responses of German K-12 students to items in formative science assessments which were conducted during synchronous online learning sessions. Dataset : The dataset we collected for this purpose consists of German constructed responses from 38 different items dealing with aspects of energy such as manifestation and transformation. The units and items were implemented with the help of project-based pedagogy and evidence-centered design, and the responses were coded for seven core ideas concerning the manifestation and transformation of energy. The data was collected from students in seventh, eighth and ninth grade. Methodology : We train various transformer- and feature-based models and compare their ability to recognize the respective ideas in students' writing. Moreover, as domain knowledge and its development can be formally modeled through knowledge networks, we evaluate how well the detection of the ideas within responses translated into accurate co-occurrence-based knowledge networks. Finally, in terms of the descriptive accuracy of our models, we inspect what features played a role for which prediction outcome and if the models pick up on undesired shortcuts. In addition to this, we analyze how much the models match human coders in what evidence within responses they consider important for their coding decisions. Results : A model based on a modified GBERT-large can achieve the overall most promising results, although descriptive accuracy varies much more than predictive accuracy for the different ideas assessed. For reasons of comparability, we also evaluate the same machine learning architecture using the SciEntsBank 3-Way benchmark with an English RoBERTa-large model, where it achieves state-of-the-art results in two out of three evaluation categories. (DIPF/Orig.)
doi_str_mv	10.1111/jcal.12767 10.25656/01:28441
format	Article
fullrecord	<record><control><sourceid>dipf</sourceid><recordid>TN_cdi_dipf_primary_A49854</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>A49854</sourcerecordid><originalsourceid>FETCH-dipf_primary_A498543</originalsourceid><addsrcrecordid>eNp9ibsOgjAUQDtoIj4WR6f7A2B5w2iIxsEYB3dS6QWLpZAWg_y9DM6e5eTkELJ1qeNO7OuCScf14iieEcv1o9D2Yi9dkKUxNaU0TqPEIpes5UJVgAp1NcJLtYNEXiEIBUWrTK_fRY8cNJpuSjQwiP4J-OkkE4o9JML1coOm5SjNmsxLJg1ufl6R3el4z842F12Zd1o0TI_5IUiTMPD_zi8eTzu5</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Coding energy knowledge in constructed responses with explainable NLP models</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Gombert, Sebastian ; Di Mitri, Daniele ; Karademir, Onur ; Kubsch, Marcus ; Kolbe, Hannah ; Tautz, Simon ; Grimm, Adrian ; Bohm, Isabell ; Neumann, Knut ; Drachsler, Hendrik</creator><creatorcontrib>Gombert, Sebastian ; Di Mitri, Daniele ; Karademir, Onur ; Kubsch, Marcus ; Kolbe, Hannah ; Tautz, Simon ; Grimm, Adrian ; Bohm, Isabell ; Neumann, Knut ; Drachsler, Hendrik</creatorcontrib><description>Background : Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task and requires the application of natural language processing methodology. In this article, we implement and evaluate multiple machine learning models for coding energy knowledge in free-text responses of German K-12 students to items in formative science assessments which were conducted during synchronous online learning sessions. Dataset : The dataset we collected for this purpose consists of German constructed responses from 38 different items dealing with aspects of energy such as manifestation and transformation. The units and items were implemented with the help of project-based pedagogy and evidence-centered design, and the responses were coded for seven core ideas concerning the manifestation and transformation of energy. The data was collected from students in seventh, eighth and ninth grade. Methodology : We train various transformer- and feature-based models and compare their ability to recognize the respective ideas in students' writing. Moreover, as domain knowledge and its development can be formally modeled through knowledge networks, we evaluate how well the detection of the ideas within responses translated into accurate co-occurrence-based knowledge networks. Finally, in terms of the descriptive accuracy of our models, we inspect what features played a role for which prediction outcome and if the models pick up on undesired shortcuts. In addition to this, we analyze how much the models match human coders in what evidence within responses they consider important for their coding decisions. Results : A model based on a modified GBERT-large can achieve the overall most promising results, although descriptive accuracy varies much more than predictive accuracy for the different ideas assessed. For reasons of comparability, we also evaluate the same machine learning architecture using the SciEntsBank 3-Way benchmark with an English RoBERTa-large model, where it achieves state-of-the-art results in two out of three evaluation categories. (DIPF/Orig.)</description><identifier>ISSN: 1365-2729</identifier><identifier>DOI: 10.1111/jcal.12767</identifier><identifier>DOI: 10.25656/01:28441</identifier><language>eng</language><subject>Antwort ; Aufgabe ; Automatisierung ; Bewertung ; Codierung ; Computerunterstützter Unterricht ; Deutschland ; Empirische Untersuchung ; Energie ; Leistungsbeurteilung ; Modell ; Physikunterricht ; Schüler ; Sekundarstufe I ; Text ; Wissen</subject><ispartof>Journal of computer assisted learning, 2022, Vol.39 (3), p.767-786</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,4010,27900,27901,27902</link.rule.ids><backlink>$$Uhttp://www.fachportal-paedagogik.de/fis_bildung/suche/fis_set.html?FId=A49854$$DAccess content in the German Education Portal$$Hfree_for_read</backlink></links><search><creatorcontrib>Gombert, Sebastian</creatorcontrib><creatorcontrib>Di Mitri, Daniele</creatorcontrib><creatorcontrib>Karademir, Onur</creatorcontrib><creatorcontrib>Kubsch, Marcus</creatorcontrib><creatorcontrib>Kolbe, Hannah</creatorcontrib><creatorcontrib>Tautz, Simon</creatorcontrib><creatorcontrib>Grimm, Adrian</creatorcontrib><creatorcontrib>Bohm, Isabell</creatorcontrib><creatorcontrib>Neumann, Knut</creatorcontrib><creatorcontrib>Drachsler, Hendrik</creatorcontrib><title>Coding energy knowledge in constructed responses with explainable NLP models</title><title>Journal of computer assisted learning</title><description>Background : Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task and requires the application of natural language processing methodology. In this article, we implement and evaluate multiple machine learning models for coding energy knowledge in free-text responses of German K-12 students to items in formative science assessments which were conducted during synchronous online learning sessions. Dataset : The dataset we collected for this purpose consists of German constructed responses from 38 different items dealing with aspects of energy such as manifestation and transformation. The units and items were implemented with the help of project-based pedagogy and evidence-centered design, and the responses were coded for seven core ideas concerning the manifestation and transformation of energy. The data was collected from students in seventh, eighth and ninth grade. Methodology : We train various transformer- and feature-based models and compare their ability to recognize the respective ideas in students' writing. Moreover, as domain knowledge and its development can be formally modeled through knowledge networks, we evaluate how well the detection of the ideas within responses translated into accurate co-occurrence-based knowledge networks. Finally, in terms of the descriptive accuracy of our models, we inspect what features played a role for which prediction outcome and if the models pick up on undesired shortcuts. In addition to this, we analyze how much the models match human coders in what evidence within responses they consider important for their coding decisions. Results : A model based on a modified GBERT-large can achieve the overall most promising results, although descriptive accuracy varies much more than predictive accuracy for the different ideas assessed. For reasons of comparability, we also evaluate the same machine learning architecture using the SciEntsBank 3-Way benchmark with an English RoBERTa-large model, where it achieves state-of-the-art results in two out of three evaluation categories. (DIPF/Orig.)</description><subject>Antwort</subject><subject>Aufgabe</subject><subject>Automatisierung</subject><subject>Bewertung</subject><subject>Codierung</subject><subject>Computerunterstützter Unterricht</subject><subject>Deutschland</subject><subject>Empirische Untersuchung</subject><subject>Energie</subject><subject>Leistungsbeurteilung</subject><subject>Modell</subject><subject>Physikunterricht</subject><subject>Schüler</subject><subject>Sekundarstufe I</subject><subject>Text</subject><subject>Wissen</subject><issn>1365-2729</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9ibsOgjAUQDtoIj4WR6f7A2B5w2iIxsEYB3dS6QWLpZAWg_y9DM6e5eTkELJ1qeNO7OuCScf14iieEcv1o9D2Yi9dkKUxNaU0TqPEIpes5UJVgAp1NcJLtYNEXiEIBUWrTK_fRY8cNJpuSjQwiP4J-OkkE4o9JML1coOm5SjNmsxLJg1ufl6R3el4z842F12Zd1o0TI_5IUiTMPD_zi8eTzu5</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Gombert, Sebastian</creator><creator>Di Mitri, Daniele</creator><creator>Karademir, Onur</creator><creator>Kubsch, Marcus</creator><creator>Kolbe, Hannah</creator><creator>Tautz, Simon</creator><creator>Grimm, Adrian</creator><creator>Bohm, Isabell</creator><creator>Neumann, Knut</creator><creator>Drachsler, Hendrik</creator><scope>9S6</scope></search><sort><creationdate>2022</creationdate><title>Coding energy knowledge in constructed responses with explainable NLP models</title><author>Gombert, Sebastian ; Di Mitri, Daniele ; Karademir, Onur ; Kubsch, Marcus ; Kolbe, Hannah ; Tautz, Simon ; Grimm, Adrian ; Bohm, Isabell ; Neumann, Knut ; Drachsler, Hendrik</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-dipf_primary_A498543</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Antwort</topic><topic>Aufgabe</topic><topic>Automatisierung</topic><topic>Bewertung</topic><topic>Codierung</topic><topic>Computerunterstützter Unterricht</topic><topic>Deutschland</topic><topic>Empirische Untersuchung</topic><topic>Energie</topic><topic>Leistungsbeurteilung</topic><topic>Modell</topic><topic>Physikunterricht</topic><topic>Schüler</topic><topic>Sekundarstufe I</topic><topic>Text</topic><topic>Wissen</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gombert, Sebastian</creatorcontrib><creatorcontrib>Di Mitri, Daniele</creatorcontrib><creatorcontrib>Karademir, Onur</creatorcontrib><creatorcontrib>Kubsch, Marcus</creatorcontrib><creatorcontrib>Kolbe, Hannah</creatorcontrib><creatorcontrib>Tautz, Simon</creatorcontrib><creatorcontrib>Grimm, Adrian</creatorcontrib><creatorcontrib>Bohm, Isabell</creatorcontrib><creatorcontrib>Neumann, Knut</creatorcontrib><creatorcontrib>Drachsler, Hendrik</creatorcontrib><collection>FIS Bildung Literaturdatenbank</collection><jtitle>Journal of computer assisted learning</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gombert, Sebastian</au><au>Di Mitri, Daniele</au><au>Karademir, Onur</au><au>Kubsch, Marcus</au><au>Kolbe, Hannah</au><au>Tautz, Simon</au><au>Grimm, Adrian</au><au>Bohm, Isabell</au><au>Neumann, Knut</au><au>Drachsler, Hendrik</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Coding energy knowledge in constructed responses with explainable NLP models</atitle><jtitle>Journal of computer assisted learning</jtitle><date>2022</date><risdate>2022</risdate><volume>39</volume><issue>3</issue><spage>767</spage><epage>786</epage><pages>767-786</pages><issn>1365-2729</issn><abstract>Background : Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task and requires the application of natural language processing methodology. In this article, we implement and evaluate multiple machine learning models for coding energy knowledge in free-text responses of German K-12 students to items in formative science assessments which were conducted during synchronous online learning sessions. Dataset : The dataset we collected for this purpose consists of German constructed responses from 38 different items dealing with aspects of energy such as manifestation and transformation. The units and items were implemented with the help of project-based pedagogy and evidence-centered design, and the responses were coded for seven core ideas concerning the manifestation and transformation of energy. The data was collected from students in seventh, eighth and ninth grade. Methodology : We train various transformer- and feature-based models and compare their ability to recognize the respective ideas in students' writing. Moreover, as domain knowledge and its development can be formally modeled through knowledge networks, we evaluate how well the detection of the ideas within responses translated into accurate co-occurrence-based knowledge networks. Finally, in terms of the descriptive accuracy of our models, we inspect what features played a role for which prediction outcome and if the models pick up on undesired shortcuts. In addition to this, we analyze how much the models match human coders in what evidence within responses they consider important for their coding decisions. Results : A model based on a modified GBERT-large can achieve the overall most promising results, although descriptive accuracy varies much more than predictive accuracy for the different ideas assessed. For reasons of comparability, we also evaluate the same machine learning architecture using the SciEntsBank 3-Way benchmark with an English RoBERTa-large model, where it achieves state-of-the-art results in two out of three evaluation categories. (DIPF/Orig.)</abstract><doi>10.1111/jcal.12767</doi><doi>10.25656/01:28441</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 1365-2729
ispartof	Journal of computer assisted learning, 2022, Vol.39 (3), p.767-786
issn	1365-2729
language	eng
recordid	cdi_dipf_primary_A49854
source	Wiley Online Library Journals Frontfile Complete
subjects	Antwort Aufgabe Automatisierung Bewertung Codierung Computerunterstützter Unterricht Deutschland Empirische Untersuchung Energie Leistungsbeurteilung Modell Physikunterricht Schüler Sekundarstufe I Text Wissen
title	Coding energy knowledge in constructed responses with explainable NLP models
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T20%3A12%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-dipf&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Coding%20energy%20knowledge%20in%20constructed%20responses%20with%20explainable%20NLP%20models&rft.jtitle=Journal%20of%20computer%20assisted%20learning&rft.au=Gombert,%20Sebastian&rft.date=2022&rft.volume=39&rft.issue=3&rft.spage=767&rft.epage=786&rft.pages=767-786&rft.issn=1365-2729&rft_id=info:doi/10.1111/jcal.12767&rft_dat=%3Cdipf%3EA49854%3C/dipf%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true