Data screening method and device, storage medium and electronic equipment

The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the co...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: SUN XIUSONG, HE YI, MA ZEJUN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator SUN XIUSONG
HE YI
MA ZEJUN
description The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance. 本公开涉及一种数据筛选方法、装置、存储介质及
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114970880A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114970880A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114970880A3</originalsourceid><addsrcrecordid>eNrjZPB0SSxJVChOLkpNzcvMS1fITS3JyE9RSMxLUUhJLctMTtVRKC7JL0pMTwVKpWSW5oKlUnNSk0uK8vMykxVSC0szC3JT80p4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmluYGFhYGjMTFqAL5_NI0</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Data screening method and device, storage medium and electronic equipment</title><source>esp@cenet</source><creator>SUN XIUSONG ; HE YI ; MA ZEJUN</creator><creatorcontrib>SUN XIUSONG ; HE YI ; MA ZEJUN</creatorcontrib><description>The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance. 本公开涉及一种数据筛选方法、装置、存储介质及</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220830&amp;DB=EPODOC&amp;CC=CN&amp;NR=114970880A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220830&amp;DB=EPODOC&amp;CC=CN&amp;NR=114970880A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SUN XIUSONG</creatorcontrib><creatorcontrib>HE YI</creatorcontrib><creatorcontrib>MA ZEJUN</creatorcontrib><title>Data screening method and device, storage medium and electronic equipment</title><description>The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance. 本公开涉及一种数据筛选方法、装置、存储介质及</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPB0SSxJVChOLkpNzcvMS1fITS3JyE9RSMxLUUhJLctMTtVRKC7JL0pMTwVKpWSW5oKlUnNSk0uK8vMykxVSC0szC3JT80p4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmluYGFhYGjMTFqAL5_NI0</recordid><startdate>20220830</startdate><enddate>20220830</enddate><creator>SUN XIUSONG</creator><creator>HE YI</creator><creator>MA ZEJUN</creator><scope>EVB</scope></search><sort><creationdate>20220830</creationdate><title>Data screening method and device, storage medium and electronic equipment</title><author>SUN XIUSONG ; HE YI ; MA ZEJUN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114970880A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>SUN XIUSONG</creatorcontrib><creatorcontrib>HE YI</creatorcontrib><creatorcontrib>MA ZEJUN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SUN XIUSONG</au><au>HE YI</au><au>MA ZEJUN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Data screening method and device, storage medium and electronic equipment</title><date>2022-08-30</date><risdate>2022</risdate><abstract>The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance. 本公开涉及一种数据筛选方法、装置、存储介质及</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN114970880A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title Data screening method and device, storage medium and electronic equipment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T05%3A38%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SUN%20XIUSONG&rft.date=2022-08-30&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114970880A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true