Practical and ethical challenges of large language models in education: A systematic scoping review

Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	British journal of educational technology 2024-01, Vol.55 (1), p.90-112
Hauptverfasser:	Yan, Lixiang, Sha, Lele, Zhao, Linxuan, Li, Yuheng, Martinez‐Maldonado, Roberto, Chen, Guanliang, Li, Xinyu, Jin, Yueqiao, Gašević, Dragan
Format:	Artikel
Sprache:	eng
Schlagworte:	artificial intelligence Automation BERT ChatGPT Education Educational technology Empirical analysis Ethics Feedback Generative artificial intelligence GPT‐3 Grading Innovation Innovations Knowledge representation Language Language Processing Large language models Natural language processing pre‐trained language models Questions Speech recognition systematic scoping review Task complexity Teaching Methods Technological change
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	112
container_issue	1
container_start_page	90
container_title	British journal of educational technology
container_volume	55
creator	Yan, Lixiang Sha, Lele Zhao, Linxuan Li, Yuheng Martinez‐Maldonado, Roberto Chen, Guanliang Li, Xinyu Jin, Yueqiao Gašević, Dragan
description	Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs‐based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer‐reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state‐of‐the‐art models (eg, GPT‐3/4), embracing the initiative of open‐sourcing models/systems, and adopting a human‐centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models. Practitioner notes What is currently known about this topic Generating and analysing text‐based content are time‐consuming and laborious tasks. Large language models are capable of efficiently analysing an unprecedented amount of textual content and completing complex natural language processing and generation tasks. Large language models have been increasingly used to develop educational technologies that aim to automate the generation and analysis of textual content, such as automated question generation and essay scoring. What this paper adds A comprehensive list of different educational tasks that could potentially benefit from LLMs‐based innovations through automat
doi_str_mv	10.1111/bjet.13370
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2917702377</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2917702377</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3370-23792a50be9616884e136e30a84fb618b2c62e3546400fbf06e6180d6e8592573</originalsourceid><addsrcrecordid>eNp9kMlOwzAQhi0EEqVw4QkscUNKGceJk3ArVdlUCQ7lbDnOJE2VxsVOqPr2OA1n5jCbvln0E3LLYMa8PeRb7GaM8wTOyIRFIgnSmMfnZAIAScCA8Uty5dzWl8DjaEL0p1W6q7VqqGoLit3mlOuNahpsK3TUlLRRtkLv26pXPtmZAhtH65Zi0WvV1aZ9pHPqjq7DnS81ddrs67aiFn9qPFyTi1I1Dm_-4pR8PS_Xi9dg9fHytpivAj08HIQ8yUIVQ46ZYCJNI2RcIAeVRmUuWJqHWoTovxYRQJmXINB3oRCYxlkYJ3xK7sa9e2u-e3Sd3Jretv6kDDOWJOAvDNT9SGlrnLNYyr2td8oeJQM5iCgHEeVJRA-zET7UDR7_IeXT-3I9zvwC6G1zTA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2917702377</pqid></control><display><type>article</type><title>Practical and ethical challenges of large language models in education: A systematic scoping review</title><source>Access via Wiley Online Library</source><creator>Yan, Lixiang ; Sha, Lele ; Zhao, Linxuan ; Li, Yuheng ; Martinez‐Maldonado, Roberto ; Chen, Guanliang ; Li, Xinyu ; Jin, Yueqiao ; Gašević, Dragan</creator><creatorcontrib>Yan, Lixiang ; Sha, Lele ; Zhao, Linxuan ; Li, Yuheng ; Martinez‐Maldonado, Roberto ; Chen, Guanliang ; Li, Xinyu ; Jin, Yueqiao ; Gašević, Dragan</creatorcontrib><description>Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs‐based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer‐reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state‐of‐the‐art models (eg, GPT‐3/4), embracing the initiative of open‐sourcing models/systems, and adopting a human‐centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models. Practitioner notes What is currently known about this topic Generating and analysing text‐based content are time‐consuming and laborious tasks. Large language models are capable of efficiently analysing an unprecedented amount of textual content and completing complex natural language processing and generation tasks. Large language models have been increasingly used to develop educational technologies that aim to automate the generation and analysis of textual content, such as automated question generation and essay scoring. What this paper adds A comprehensive list of different educational tasks that could potentially benefit from LLMs‐based innovations through automation. A structured assessment of the practicality and ethicality of existing LLMs‐based innovations from seven important aspects using established frameworks. Three recommendations that could potentially support future studies to develop LLMs‐based innovations that are practical and ethical to implement in authentic educational contexts. Implications for practice and/or policy Updating existing innovations with state‐of‐the‐art models may further reduce the amount of manual effort required for adapting existing models to different educational tasks. The reporting standards of empirical research that aims to develop educational technologies using large language models need to be improved. Adopting a human‐centred approach throughout the developmental process could contribute to resolving the practical and ethical challenges of large language models in education.</description><identifier>ISSN: 0007-1013</identifier><identifier>EISSN: 1467-8535</identifier><identifier>DOI: 10.1111/bjet.13370</identifier><language>eng</language><publisher>Coventry: Blackwell Publishing Ltd</publisher><subject>artificial intelligence ; Automation ; BERT ; ChatGPT ; Education ; Educational technology ; Empirical analysis ; Ethics ; Feedback ; Generative artificial intelligence ; GPT‐3 ; Grading ; Innovation ; Innovations ; Knowledge representation ; Language ; Language Processing ; Large language models ; Natural language processing ; pre‐trained language models ; Questions ; Speech recognition ; systematic scoping review ; Task complexity ; Teaching Methods ; Technological change</subject><ispartof>British journal of educational technology, 2024-01, Vol.55 (1), p.90-112</ispartof><rights>2023 The Authors. published by John Wiley & Sons Ltd on behalf of British Educational Research Association.</rights><rights>2023. This article is published under http://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3370-23792a50be9616884e136e30a84fb618b2c62e3546400fbf06e6180d6e8592573</citedby><cites>FETCH-LOGICAL-c3370-23792a50be9616884e136e30a84fb618b2c62e3546400fbf06e6180d6e8592573</cites><orcidid>0000-0002-8236-3133 ; 0000-0003-3818-045X ; 0000-0002-5971-8469 ; 0000-0001-5564-0185</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1111%2Fbjet.13370$$EPDF$$P50$$Gwiley$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1111%2Fbjet.13370$$EHTML$$P50$$Gwiley$$Hfree_for_read</linktohtml><link.rule.ids>315,781,785,1418,27929,27930,45579,45580</link.rule.ids></links><search><creatorcontrib>Yan, Lixiang</creatorcontrib><creatorcontrib>Sha, Lele</creatorcontrib><creatorcontrib>Zhao, Linxuan</creatorcontrib><creatorcontrib>Li, Yuheng</creatorcontrib><creatorcontrib>Martinez‐Maldonado, Roberto</creatorcontrib><creatorcontrib>Chen, Guanliang</creatorcontrib><creatorcontrib>Li, Xinyu</creatorcontrib><creatorcontrib>Jin, Yueqiao</creatorcontrib><creatorcontrib>Gašević, Dragan</creatorcontrib><title>Practical and ethical challenges of large language models in education: A systematic scoping review</title><title>British journal of educational technology</title><description>Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs‐based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer‐reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state‐of‐the‐art models (eg, GPT‐3/4), embracing the initiative of open‐sourcing models/systems, and adopting a human‐centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models. Practitioner notes What is currently known about this topic Generating and analysing text‐based content are time‐consuming and laborious tasks. Large language models are capable of efficiently analysing an unprecedented amount of textual content and completing complex natural language processing and generation tasks. Large language models have been increasingly used to develop educational technologies that aim to automate the generation and analysis of textual content, such as automated question generation and essay scoring. What this paper adds A comprehensive list of different educational tasks that could potentially benefit from LLMs‐based innovations through automation. A structured assessment of the practicality and ethicality of existing LLMs‐based innovations from seven important aspects using established frameworks. Three recommendations that could potentially support future studies to develop LLMs‐based innovations that are practical and ethical to implement in authentic educational contexts. Implications for practice and/or policy Updating existing innovations with state‐of‐the‐art models may further reduce the amount of manual effort required for adapting existing models to different educational tasks. The reporting standards of empirical research that aims to develop educational technologies using large language models need to be improved. Adopting a human‐centred approach throughout the developmental process could contribute to resolving the practical and ethical challenges of large language models in education.</description><subject>artificial intelligence</subject><subject>Automation</subject><subject>BERT</subject><subject>ChatGPT</subject><subject>Education</subject><subject>Educational technology</subject><subject>Empirical analysis</subject><subject>Ethics</subject><subject>Feedback</subject><subject>Generative artificial intelligence</subject><subject>GPT‐3</subject><subject>Grading</subject><subject>Innovation</subject><subject>Innovations</subject><subject>Knowledge representation</subject><subject>Language</subject><subject>Language Processing</subject><subject>Large language models</subject><subject>Natural language processing</subject><subject>pre‐trained language models</subject><subject>Questions</subject><subject>Speech recognition</subject><subject>systematic scoping review</subject><subject>Task complexity</subject><subject>Teaching Methods</subject><subject>Technological change</subject><issn>0007-1013</issn><issn>1467-8535</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>WIN</sourceid><recordid>eNp9kMlOwzAQhi0EEqVw4QkscUNKGceJk3ArVdlUCQ7lbDnOJE2VxsVOqPr2OA1n5jCbvln0E3LLYMa8PeRb7GaM8wTOyIRFIgnSmMfnZAIAScCA8Uty5dzWl8DjaEL0p1W6q7VqqGoLit3mlOuNahpsK3TUlLRRtkLv26pXPtmZAhtH65Zi0WvV1aZ9pHPqjq7DnS81ddrs67aiFn9qPFyTi1I1Dm_-4pR8PS_Xi9dg9fHytpivAj08HIQ8yUIVQ46ZYCJNI2RcIAeVRmUuWJqHWoTovxYRQJmXINB3oRCYxlkYJ3xK7sa9e2u-e3Sd3Jretv6kDDOWJOAvDNT9SGlrnLNYyr2td8oeJQM5iCgHEeVJRA-zET7UDR7_IeXT-3I9zvwC6G1zTA</recordid><startdate>202401</startdate><enddate>202401</enddate><creator>Yan, Lixiang</creator><creator>Sha, Lele</creator><creator>Zhao, Linxuan</creator><creator>Li, Yuheng</creator><creator>Martinez‐Maldonado, Roberto</creator><creator>Chen, Guanliang</creator><creator>Li, Xinyu</creator><creator>Jin, Yueqiao</creator><creator>Gašević, Dragan</creator><general>Blackwell Publishing Ltd</general><scope>24P</scope><scope>WIN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-8236-3133</orcidid><orcidid>https://orcid.org/0000-0003-3818-045X</orcidid><orcidid>https://orcid.org/0000-0002-5971-8469</orcidid><orcidid>https://orcid.org/0000-0001-5564-0185</orcidid></search><sort><creationdate>202401</creationdate><title>Practical and ethical challenges of large language models in education: A systematic scoping review</title><author>Yan, Lixiang ; Sha, Lele ; Zhao, Linxuan ; Li, Yuheng ; Martinez‐Maldonado, Roberto ; Chen, Guanliang ; Li, Xinyu ; Jin, Yueqiao ; Gašević, Dragan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3370-23792a50be9616884e136e30a84fb618b2c62e3546400fbf06e6180d6e8592573</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>artificial intelligence</topic><topic>Automation</topic><topic>BERT</topic><topic>ChatGPT</topic><topic>Education</topic><topic>Educational technology</topic><topic>Empirical analysis</topic><topic>Ethics</topic><topic>Feedback</topic><topic>Generative artificial intelligence</topic><topic>GPT‐3</topic><topic>Grading</topic><topic>Innovation</topic><topic>Innovations</topic><topic>Knowledge representation</topic><topic>Language</topic><topic>Language Processing</topic><topic>Large language models</topic><topic>Natural language processing</topic><topic>pre‐trained language models</topic><topic>Questions</topic><topic>Speech recognition</topic><topic>systematic scoping review</topic><topic>Task complexity</topic><topic>Teaching Methods</topic><topic>Technological change</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yan, Lixiang</creatorcontrib><creatorcontrib>Sha, Lele</creatorcontrib><creatorcontrib>Zhao, Linxuan</creatorcontrib><creatorcontrib>Li, Yuheng</creatorcontrib><creatorcontrib>Martinez‐Maldonado, Roberto</creatorcontrib><creatorcontrib>Chen, Guanliang</creatorcontrib><creatorcontrib>Li, Xinyu</creatorcontrib><creatorcontrib>Jin, Yueqiao</creatorcontrib><creatorcontrib>Gašević, Dragan</creatorcontrib><collection>Wiley-Blackwell Open Access Titles</collection><collection>Wiley Free Content</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>British journal of educational technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yan, Lixiang</au><au>Sha, Lele</au><au>Zhao, Linxuan</au><au>Li, Yuheng</au><au>Martinez‐Maldonado, Roberto</au><au>Chen, Guanliang</au><au>Li, Xinyu</au><au>Jin, Yueqiao</au><au>Gašević, Dragan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Practical and ethical challenges of large language models in education: A systematic scoping review</atitle><jtitle>British journal of educational technology</jtitle><date>2024-01</date><risdate>2024</risdate><volume>55</volume><issue>1</issue><spage>90</spage><epage>112</epage><pages>90-112</pages><issn>0007-1013</issn><eissn>1467-8535</eissn><abstract>Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs‐based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer‐reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state‐of‐the‐art models (eg, GPT‐3/4), embracing the initiative of open‐sourcing models/systems, and adopting a human‐centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models. Practitioner notes What is currently known about this topic Generating and analysing text‐based content are time‐consuming and laborious tasks. Large language models are capable of efficiently analysing an unprecedented amount of textual content and completing complex natural language processing and generation tasks. Large language models have been increasingly used to develop educational technologies that aim to automate the generation and analysis of textual content, such as automated question generation and essay scoring. What this paper adds A comprehensive list of different educational tasks that could potentially benefit from LLMs‐based innovations through automation. A structured assessment of the practicality and ethicality of existing LLMs‐based innovations from seven important aspects using established frameworks. Three recommendations that could potentially support future studies to develop LLMs‐based innovations that are practical and ethical to implement in authentic educational contexts. Implications for practice and/or policy Updating existing innovations with state‐of‐the‐art models may further reduce the amount of manual effort required for adapting existing models to different educational tasks. The reporting standards of empirical research that aims to develop educational technologies using large language models need to be improved. Adopting a human‐centred approach throughout the developmental process could contribute to resolving the practical and ethical challenges of large language models in education.</abstract><cop>Coventry</cop><pub>Blackwell Publishing Ltd</pub><doi>10.1111/bjet.13370</doi><tpages>23</tpages><orcidid>https://orcid.org/0000-0002-8236-3133</orcidid><orcidid>https://orcid.org/0000-0003-3818-045X</orcidid><orcidid>https://orcid.org/0000-0002-5971-8469</orcidid><orcidid>https://orcid.org/0000-0001-5564-0185</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0007-1013
ispartof	British journal of educational technology, 2024-01, Vol.55 (1), p.90-112
issn	0007-1013 1467-8535
language	eng
recordid	cdi_proquest_journals_2917702377
source	Access via Wiley Online Library
subjects	artificial intelligence Automation BERT ChatGPT Education Educational technology Empirical analysis Ethics Feedback Generative artificial intelligence GPT‐3 Grading Innovation Innovations Knowledge representation Language Language Processing Large language models Natural language processing pre‐trained language models Questions Speech recognition systematic scoping review Task complexity Teaching Methods Technological change
title	Practical and ethical challenges of large language models in education: A systematic scoping review
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T08%3A07%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Practical%20and%20ethical%20challenges%20of%20large%20language%20models%20in%20education:%20A%20systematic%20scoping%20review&rft.jtitle=British%20journal%20of%20educational%20technology&rft.au=Yan,%20Lixiang&rft.date=2024-01&rft.volume=55&rft.issue=1&rft.spage=90&rft.epage=112&rft.pages=90-112&rft.issn=0007-1013&rft.eissn=1467-8535&rft_id=info:doi/10.1111/bjet.13370&rft_dat=%3Cproquest_cross%3E2917702377%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2917702377&rft_id=info:pmid/&rfr_iscdi=true