Method and device for training text auditing model
The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training l...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | CHEN YONGFENG WANG ZANBO HUANG SHUO CAO YUHUI |
description | The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained. |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114969332A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114969332A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114969332A3</originalsourceid><addsrcrecordid>eNrjZDDyTS3JyE9RSMxLUUhJLctMTlVIyy9SKClKzMzLzEtXKEmtKFFILE3JLAHxcvNTUnN4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmlmaWxsZGjMTFqAKgyK9c</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method and device for training text auditing model</title><source>esp@cenet</source><creator>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</creator><creatorcontrib>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</creatorcontrib><description>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained.</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220830&DB=EPODOC&CC=CN&NR=114969332A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220830&DB=EPODOC&CC=CN&NR=114969332A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN YONGFENG</creatorcontrib><creatorcontrib>WANG ZANBO</creatorcontrib><creatorcontrib>HUANG SHUO</creatorcontrib><creatorcontrib>CAO YUHUI</creatorcontrib><title>Method and device for training text auditing model</title><description>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDyTS3JyE9RSMxLUUhJLctMTlVIyy9SKClKzMzLzEtXKEmtKFFILE3JLAHxcvNTUnN4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmlmaWxsZGjMTFqAKgyK9c</recordid><startdate>20220830</startdate><enddate>20220830</enddate><creator>CHEN YONGFENG</creator><creator>WANG ZANBO</creator><creator>HUANG SHUO</creator><creator>CAO YUHUI</creator><scope>EVB</scope></search><sort><creationdate>20220830</creationdate><title>Method and device for training text auditing model</title><author>CHEN YONGFENG ; WANG ZANBO ; HUANG SHUO ; CAO YUHUI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114969332A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN YONGFENG</creatorcontrib><creatorcontrib>WANG ZANBO</creatorcontrib><creatorcontrib>HUANG SHUO</creatorcontrib><creatorcontrib>CAO YUHUI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN YONGFENG</au><au>WANG ZANBO</au><au>HUANG SHUO</au><au>CAO YUHUI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method and device for training text auditing model</title><date>2022-08-30</date><risdate>2022</risdate><abstract>The invention provides a method and device for training a text auditing model, and relates to the field of artificial intelligence, in particular to the field of natural language processing. According to the specific implementation scheme, the method comprises the steps of obtaining a pre-training language model, a pre-training language micro model, annotation data and non-annotation data; inputting the annotation data into a pre-training language model for supervised training to obtain a teacher model; inputting the annotation data into a pre-training language micro model for supervised training to obtain a student model; and respectively inputting the unlabeled data into the teacher model and the student model, and distilling the student model by using the teacher model to obtain a text auditing model. According to the embodiment, training can be carried out on small-scale manual annotation data and large-scale non-annotation data, and the text auditing model with a good effect and a high speed is obtained.</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN114969332A |
source | esp@cenet |
subjects | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS |
title | Method and device for training text auditing model |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T06%3A56%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20YONGFENG&rft.date=2022-08-30&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114969332A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |