Method and device for training text auditing model

The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, a domain language model, a student model, unlabeled data and...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHEN YONGFENG, WANG ZANBO, HUANG SHUO, CAO YUHUI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHEN YONGFENG
WANG ZANBO
HUANG SHUO
CAO YUHUI
description The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, a domain language model, a student model, unlabeled data and labeled data are obtained, and the labeled data comprise text information and auditing labels; taking the text information and the auditing label in the annotation data as input and expected output respectively, and performing fine tuning training on the domain language model to obtain a teacher model; inputting the unlabeled data into the teacher model, and outputting a pseudo-audit label to obtain pseudo-labeled data; and training the student model through the pseudo-annotation data to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is o
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114970540A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114970540A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114970540A3</originalsourceid><addsrcrecordid>eNrjZDDyTS3JyE9RSMxLUUhJLctMTlVIyy9SKClKzMzLzEtXKEmtKFFILE3JLAHxcvNTUnN4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmluYGpiYGjMTFqAKboK8k</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method and device for training text auditing model</title><source>esp@cenet</source><creator>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</creator><creatorcontrib>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</creatorcontrib><description>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, a domain language model, a student model, unlabeled data and labeled data are obtained, and the labeled data comprise text information and auditing labels; taking the text information and the auditing label in the annotation data as input and expected output respectively, and performing fine tuning training on the domain language model to obtain a teacher model; inputting the unlabeled data into the teacher model, and outputting a pseudo-audit label to obtain pseudo-labeled data; and training the student model through the pseudo-annotation data to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is o</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220830&amp;DB=EPODOC&amp;CC=CN&amp;NR=114970540A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25555,76308</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220830&amp;DB=EPODOC&amp;CC=CN&amp;NR=114970540A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN YONGFENG</creatorcontrib><creatorcontrib>WANG ZANBO</creatorcontrib><creatorcontrib>HUANG SHUO</creatorcontrib><creatorcontrib>CAO YUHUI</creatorcontrib><title>Method and device for training text auditing model</title><description>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, a domain language model, a student model, unlabeled data and labeled data are obtained, and the labeled data comprise text information and auditing labels; taking the text information and the auditing label in the annotation data as input and expected output respectively, and performing fine tuning training on the domain language model to obtain a teacher model; inputting the unlabeled data into the teacher model, and outputting a pseudo-audit label to obtain pseudo-labeled data; and training the student model through the pseudo-annotation data to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is o</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDyTS3JyE9RSMxLUUhJLctMTlVIyy9SKClKzMzLzEtXKEmtKFFILE3JLAHxcvNTUnN4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmluYGpiYGjMTFqAKboK8k</recordid><startdate>20220830</startdate><enddate>20220830</enddate><creator>CHEN YONGFENG</creator><creator>WANG ZANBO</creator><creator>HUANG SHUO</creator><creator>CAO YUHUI</creator><scope>EVB</scope></search><sort><creationdate>20220830</creationdate><title>Method and device for training text auditing model</title><author>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114970540A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN YONGFENG</creatorcontrib><creatorcontrib>WANG ZANBO</creatorcontrib><creatorcontrib>HUANG SHUO</creatorcontrib><creatorcontrib>CAO YUHUI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN YONGFENG</au><au>WANG ZANBO</au><au>HUANG SHUO</au><au>CAO YUHUI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method and device for training text auditing model</title><date>2022-08-30</date><risdate>2022</risdate><abstract>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, a domain language model, a student model, unlabeled data and labeled data are obtained, and the labeled data comprise text information and auditing labels; taking the text information and the auditing label in the annotation data as input and expected output respectively, and performing fine tuning training on the domain language model to obtain a teacher model; inputting the unlabeled data into the teacher model, and outputting a pseudo-audit label to obtain pseudo-labeled data; and training the student model through the pseudo-annotation data to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is o</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN114970540A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title Method and device for training text auditing model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T18%3A54%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20YONGFENG&rft.date=2022-08-30&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114970540A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true