Data screening method and device, storage medium and electronic equipment
The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the co...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | SUN XIUSONG HE YI MA ZEJUN |
description | The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance.
本公开涉及一种数据筛选方法、装置、存储介质及 |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114970880A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114970880A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114970880A3</originalsourceid><addsrcrecordid>eNrjZPB0SSxJVChOLkpNzcvMS1fITS3JyE9RSMxLUUhJLctMTtVRKC7JL0pMTwVKpWSW5oKlUnNSk0uK8vMykxVSC0szC3JT80p4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmluYGFhYGjMTFqAL5_NI0</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Data screening method and device, storage medium and electronic equipment</title><source>esp@cenet</source><creator>SUN XIUSONG ; HE YI ; MA ZEJUN</creator><creatorcontrib>SUN XIUSONG ; HE YI ; MA ZEJUN</creatorcontrib><description>The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance.
本公开涉及一种数据筛选方法、装置、存储介质及</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220830&DB=EPODOC&CC=CN&NR=114970880A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220830&DB=EPODOC&CC=CN&NR=114970880A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SUN XIUSONG</creatorcontrib><creatorcontrib>HE YI</creatorcontrib><creatorcontrib>MA ZEJUN</creatorcontrib><title>Data screening method and device, storage medium and electronic equipment</title><description>The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance.
本公开涉及一种数据筛选方法、装置、存储介质及</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPB0SSxJVChOLkpNzcvMS1fITS3JyE9RSMxLUUhJLctMTtVRKC7JL0pMTwVKpWSW5oKlUnNSk0uK8vMykxVSC0szC3JT80p4GFjTEnOKU3mhNDeDoptriLOHbmpBfnxqcUFicmpeakm8s5-hoYmluYGFhYGjMTFqAL5_NI0</recordid><startdate>20220830</startdate><enddate>20220830</enddate><creator>SUN XIUSONG</creator><creator>HE YI</creator><creator>MA ZEJUN</creator><scope>EVB</scope></search><sort><creationdate>20220830</creationdate><title>Data screening method and device, storage medium and electronic equipment</title><author>SUN XIUSONG ; HE YI ; MA ZEJUN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114970880A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>SUN XIUSONG</creatorcontrib><creatorcontrib>HE YI</creatorcontrib><creatorcontrib>MA ZEJUN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SUN XIUSONG</au><au>HE YI</au><au>MA ZEJUN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Data screening method and device, storage medium and electronic equipment</title><date>2022-08-30</date><risdate>2022</risdate><abstract>The invention relates to a data screening method and device, a storage medium and electronic equipment, and the method comprises the steps: carrying out the recognition of to-be-recognized data through a recognition model, generating a first initial label, adding the to-be-recognized data and the corresponding first initial label into a data training set corresponding to the recognition model, and training an identification model based on the data training set, generating the confusion degree of the to-be-screened data through the trained identification model, and determining the to-be-screened data as target data when the confusion degree is greater than a preset confusion degree threshold value. Therefore, it is guaranteed that the repetition rate of the data screened out through iteration each time and an existing data training set is low, the data training set is distributed more evenly, and the recognition model obtained through final training has better generalization performance.
本公开涉及一种数据筛选方法、装置、存储介质及</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN114970880A |
source | esp@cenet |
subjects | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING HANDLING RECORD CARRIERS PHYSICS PRESENTATION OF DATA RECOGNITION OF DATA RECORD CARRIERS |
title | Data screening method and device, storage medium and electronic equipment |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T05%3A38%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SUN%20XIUSONG&rft.date=2022-08-30&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114970880A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |