Method and device for training text auditing model

The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training l...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHEN YONGFENG, WANG ZANBO, HUANG SHUO, CAO YUHUI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHEN YONGFENG
WANG ZANBO
HUANG SHUO
CAO YUHUI
description The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114969332A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114969332A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114969332A3</originalsourceid><addsrcrecordid>eNrjZDDyTS3JyE9RSMxLUUhJLctMTlVIyy9SKClKzMzLzEtXKEmtKFFILE3JLAHxcvNTUnN4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmlmaWxsZGjMTFqAKgyK9c</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method and device for training text auditing model</title><source>esp@cenet</source><creator>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</creator><creatorcontrib>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</creatorcontrib><description>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained.</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220830&amp;DB=EPODOC&amp;CC=CN&amp;NR=114969332A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220830&amp;DB=EPODOC&amp;CC=CN&amp;NR=114969332A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN YONGFENG</creatorcontrib><creatorcontrib>WANG ZANBO</creatorcontrib><creatorcontrib>HUANG SHUO</creatorcontrib><creatorcontrib>CAO YUHUI</creatorcontrib><title>Method and device for training text auditing model</title><description>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDyTS3JyE9RSMxLUUhJLctMTlVIyy9SKClKzMzLzEtXKEmtKFFILE3JLAHxcvNTUnN4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmlmaWxsZGjMTFqAKgyK9c</recordid><startdate>20220830</startdate><enddate>20220830</enddate><creator>CHEN YONGFENG</creator><creator>WANG ZANBO</creator><creator>HUANG SHUO</creator><creator>CAO YUHUI</creator><scope>EVB</scope></search><sort><creationdate>20220830</creationdate><title>Method and device for training text auditing model</title><author>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114969332A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN YONGFENG</creatorcontrib><creatorcontrib>WANG ZANBO</creatorcontrib><creatorcontrib>HUANG SHUO</creatorcontrib><creatorcontrib>CAO YUHUI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN YONGFENG</au><au>WANG ZANBO</au><au>HUANG SHUO</au><au>CAO YUHUI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method and device for training text auditing model</title><date>2022-08-30</date><risdate>2022</risdate><abstract>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN114969332A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Method and device for training text auditing model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T06%3A56%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20YONGFENG&rft.date=2022-08-30&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114969332A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true