Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction

Chinese Text Error Correction (CTEC) aims to detect and correct errors in the input text, which benefits human daily life and various downstream tasks. Recent approaches mainly employ Pre-trained Language Models (PLMs) to resolve CTEC. Although PLMs have achieved remarkable success in CTEC, we argue...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Li, Yinghui, Ma, Shirong, Chen, Shaoshen, Huang, Haojing, Huang, Shulin, Li, Yangning, Zheng, Hai-Tao, Shen, Ying
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Li, Yinghui
Ma, Shirong
Chen, Shaoshen
Huang, Haojing
Huang, Shulin
Li, Yangning
Zheng, Hai-Tao
Shen, Ying
description Chinese Text Error Correction (CTEC) aims to detect and correct errors in the input text, which benefits human daily life and various downstream tasks. Recent approaches mainly employ Pre-trained Language Models (PLMs) to resolve CTEC. Although PLMs have achieved remarkable success in CTEC, we argue that previous studies still overlook the importance of human thinking patterns. To enhance the development of PLMs for CTEC, inspired by humans' daily error-correcting behavior, we propose a novel model-agnostic progressive learning framework, named ProTEC, which guides PLMs-based CTEC models to learn to correct like humans. During the training process, ProTEC guides the model to learn text error correction by incorporating these sub-tasks into a progressive paradigm. During the inference process, the model completes these sub-tasks in turn to generate the correction results. Extensive experiments and detailed analyses demonstrate the effectiveness and efficiency of our proposed model-agnostic ProTEC framework.
doi_str_mv 10.48550/arxiv.2306.17447
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2306_17447</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2306_17447</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2306_174473</originalsourceid><addsrcrecordid>eNqFzrEOgkAQBNBrLIz6AVbuD4gghxhbAqGgsMD6cjELbpA7s4eIf29Ae6tJJpPJE2Id-J48RpG_0zxQ7-1D_-AFsZTxXFwSy4zXDgpqEPJnq407wZltzegc9QgFajZkashYt_iy3EBlGZIbGXQIJQ4dpMxj9b0ia5ZiVum7w9UvF2KTpWWSbyeAejC1mt9qhKgJEv5ffABFlz5w</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction</title><source>arXiv.org</source><creator>Li, Yinghui ; Ma, Shirong ; Chen, Shaoshen ; Huang, Haojing ; Huang, Shulin ; Li, Yangning ; Zheng, Hai-Tao ; Shen, Ying</creator><creatorcontrib>Li, Yinghui ; Ma, Shirong ; Chen, Shaoshen ; Huang, Haojing ; Huang, Shulin ; Li, Yangning ; Zheng, Hai-Tao ; Shen, Ying</creatorcontrib><description>Chinese Text Error Correction (CTEC) aims to detect and correct errors in the input text, which benefits human daily life and various downstream tasks. Recent approaches mainly employ Pre-trained Language Models (PLMs) to resolve CTEC. Although PLMs have achieved remarkable success in CTEC, we argue that previous studies still overlook the importance of human thinking patterns. To enhance the development of PLMs for CTEC, inspired by humans' daily error-correcting behavior, we propose a novel model-agnostic progressive learning framework, named ProTEC, which guides PLMs-based CTEC models to learn to correct like humans. During the training process, ProTEC guides the model to learn text error correction by incorporating these sub-tasks into a progressive paradigm. During the inference process, the model completes these sub-tasks in turn to generate the correction results. Extensive experiments and detailed analyses demonstrate the effectiveness and efficiency of our proposed model-agnostic ProTEC framework.</description><identifier>DOI: 10.48550/arxiv.2306.17447</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2023-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2306.17447$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2306.17447$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Li, Yinghui</creatorcontrib><creatorcontrib>Ma, Shirong</creatorcontrib><creatorcontrib>Chen, Shaoshen</creatorcontrib><creatorcontrib>Huang, Haojing</creatorcontrib><creatorcontrib>Huang, Shulin</creatorcontrib><creatorcontrib>Li, Yangning</creatorcontrib><creatorcontrib>Zheng, Hai-Tao</creatorcontrib><creatorcontrib>Shen, Ying</creatorcontrib><title>Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction</title><description>Chinese Text Error Correction (CTEC) aims to detect and correct errors in the input text, which benefits human daily life and various downstream tasks. Recent approaches mainly employ Pre-trained Language Models (PLMs) to resolve CTEC. Although PLMs have achieved remarkable success in CTEC, we argue that previous studies still overlook the importance of human thinking patterns. To enhance the development of PLMs for CTEC, inspired by humans' daily error-correcting behavior, we propose a novel model-agnostic progressive learning framework, named ProTEC, which guides PLMs-based CTEC models to learn to correct like humans. During the training process, ProTEC guides the model to learn text error correction by incorporating these sub-tasks into a progressive paradigm. During the inference process, the model completes these sub-tasks in turn to generate the correction results. Extensive experiments and detailed analyses demonstrate the effectiveness and efficiency of our proposed model-agnostic ProTEC framework.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFzrEOgkAQBNBrLIz6AVbuD4gghxhbAqGgsMD6cjELbpA7s4eIf29Ae6tJJpPJE2Id-J48RpG_0zxQ7-1D_-AFsZTxXFwSy4zXDgpqEPJnq407wZltzegc9QgFajZkashYt_iy3EBlGZIbGXQIJQ4dpMxj9b0ia5ZiVum7w9UvF2KTpWWSbyeAejC1mt9qhKgJEv5ffABFlz5w</recordid><startdate>20230630</startdate><enddate>20230630</enddate><creator>Li, Yinghui</creator><creator>Ma, Shirong</creator><creator>Chen, Shaoshen</creator><creator>Huang, Haojing</creator><creator>Huang, Shulin</creator><creator>Li, Yangning</creator><creator>Zheng, Hai-Tao</creator><creator>Shen, Ying</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230630</creationdate><title>Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction</title><author>Li, Yinghui ; Ma, Shirong ; Chen, Shaoshen ; Huang, Haojing ; Huang, Shulin ; Li, Yangning ; Zheng, Hai-Tao ; Shen, Ying</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2306_174473</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Li, Yinghui</creatorcontrib><creatorcontrib>Ma, Shirong</creatorcontrib><creatorcontrib>Chen, Shaoshen</creatorcontrib><creatorcontrib>Huang, Haojing</creatorcontrib><creatorcontrib>Huang, Shulin</creatorcontrib><creatorcontrib>Li, Yangning</creatorcontrib><creatorcontrib>Zheng, Hai-Tao</creatorcontrib><creatorcontrib>Shen, Ying</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Li, Yinghui</au><au>Ma, Shirong</au><au>Chen, Shaoshen</au><au>Huang, Haojing</au><au>Huang, Shulin</au><au>Li, Yangning</au><au>Zheng, Hai-Tao</au><au>Shen, Ying</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction</atitle><date>2023-06-30</date><risdate>2023</risdate><abstract>Chinese Text Error Correction (CTEC) aims to detect and correct errors in the input text, which benefits human daily life and various downstream tasks. Recent approaches mainly employ Pre-trained Language Models (PLMs) to resolve CTEC. Although PLMs have achieved remarkable success in CTEC, we argue that previous studies still overlook the importance of human thinking patterns. To enhance the development of PLMs for CTEC, inspired by humans' daily error-correcting behavior, we propose a novel model-agnostic progressive learning framework, named ProTEC, which guides PLMs-based CTEC models to learn to correct like humans. During the training process, ProTEC guides the model to learn text error correction by incorporating these sub-tasks into a progressive paradigm. During the inference process, the model completes these sub-tasks in turn to generate the correction results. Extensive experiments and detailed analyses demonstrate the effectiveness and efficiency of our proposed model-agnostic ProTEC framework.</abstract><doi>10.48550/arxiv.2306.17447</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2306.17447
ispartof
issn
language eng
recordid cdi_arxiv_primary_2306_17447
source arXiv.org
subjects Computer Science - Computation and Language
title Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T12%3A31%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Correct%20Like%20Humans:%20Progressive%20Learning%20Framework%20for%20Chinese%20Text%20Error%20Correction&rft.au=Li,%20Yinghui&rft.date=2023-06-30&rft_id=info:doi/10.48550/arxiv.2306.17447&rft_dat=%3Carxiv_GOX%3E2306_17447%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true