OCR data generation method and device, computer equipment and storage medium

The invention relates to an OCR data generation method and device, computer equipment and a storage medium. The method comprises: preparing original data material; analyzing and batch-modifying the original data material to obtain an annotated image; and identifyin...

Bibliographic details
Main authors: ZHOU XIANDE, LI AILIN, ZHANG HUAN
Format: Patent
Language: chi ; eng
container_end_page
container_issue
container_start_page
container_title
container_volume
creator ZHOU XIANDE
LI AILIN
ZHANG HUAN
description The invention relates to an OCR data generation method and device, computer equipment and a storage medium. The method comprises: preparing original data material; analyzing and batch-modifying the original data material to obtain an annotated image; and recognizing the annotated image to extract the position information of the text areas, thereby forming OCR data. Using the JavaScript interface provided by Photoshop, the original data material is analyzed and modified in batches to form an annotated image in which the text areas are marked. Text-area recognition is then performed on the annotated image to obtain the position information of the text areas, from which OCR data for training OCR are formed. Generation of the OCR data is thus completed automatically: data generation is fast, adjusting the details of the data is simple, and practicability is high. 本发明涉及OCR数据生成方法、装置、计算机设备及存储介质,该方法包括制作原始
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN109948549A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN109948549A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN109948549A3</originalsourceid><addsrcrecordid>eNrjZPDxdw5SSEksSVRIT81LLUosyczPU8hNLcnIT1FIzEtRSEkty0xO1VFIzs8tKC1JLVJILSzNLMhNzSsBSxeX5BclpqcCdaRklubyMLCmJeYUp_JCaW4GRTfXEGcP3dSC_PjU4oLEZKAdJfHOfoYGlpYmFqYmlo7GxKgBAFI1NX8</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>OCR data generation method and device, computer equipment and storage medium</title><source>esp@cenet</source><creator>ZHOU XIANDE ; LI AILIN ; ZHANG HUAN</creator><creatorcontrib>ZHOU XIANDE ; LI AILIN ; ZHANG HUAN</creatorcontrib><description>The invention relates to an OCR data generation method and device, computer equipment and a storage medium. The method comprises the steps that manufacturing an original data material; carrying out analysis and batch modification on the original data material to obtain a marked image; and identifying the mark image to extract the position information of the text area, and forming OCR data. According to the invention, the JavaScript interface is provided by means of the Photoshop; the original data material is analyzed and modified in batches; according to the OCR data generation method and device, the annotated image with the text area annotation is formed, text area recognition is conducted on the annotated image to obtain the position information of the text area, OCR data used for training OCR are formed accordingly, generation of the OCR data is automatically completed, the data generation speed is high, data adjustment details are simple, and practicability is high. 
本发明涉及OCR数据生成方法、装置、计算机设备及存储介质,该方法包括制作原始</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2019</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20190628&amp;DB=EPODOC&amp;CC=CN&amp;NR=109948549A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20190628&amp;DB=EPODOC&amp;CC=CN&amp;NR=109948549A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZHOU XIANDE</creatorcontrib><creatorcontrib>LI AILIN</creatorcontrib><creatorcontrib>ZHANG HUAN</creatorcontrib><title>OCR data generation method and device, computer equipment and storage medium</title><description>The invention relates to an OCR data generation method and device, computer equipment and a storage medium. The method comprises the steps that manufacturing an original data material; carrying out analysis and batch modification on the original data material to obtain a marked image; and identifying the mark image to extract the position information of the text area, and forming OCR data. 
According to the invention, the JavaScript interface is provided by means of the Photoshop; the original data material is analyzed and modified in batches; according to the OCR data generation method and device, the annotated image with the text area annotation is formed, text area recognition is conducted on the annotated image to obtain the position information of the text area, OCR data used for training OCR are formed accordingly, generation of the OCR data is automatically completed, the data generation speed is high, data adjustment details are simple, and practicability is high. 本发明涉及OCR数据生成方法、装置、计算机设备及存储介质,该方法包括制作原始</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2019</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPDxdw5SSEksSVRIT81LLUosyczPU8hNLcnIT1FIzEtRSEkty0xO1VFIzs8tKC1JLVJILSzNLMhNzSsBSxeX5BclpqcCdaRklubyMLCmJeYUp_JCaW4GRTfXEGcP3dSC_PjU4oLEZKAdJfHOfoYGlpYmFqYmlo7GxKgBAFI1NX8</recordid><startdate>20190628</startdate><enddate>20190628</enddate><creator>ZHOU XIANDE</creator><creator>LI AILIN</creator><creator>ZHANG HUAN</creator><scope>EVB</scope></search><sort><creationdate>20190628</creationdate><title>OCR data generation method and device, computer equipment and storage medium</title><author>ZHOU XIANDE ; LI AILIN ; ZHANG HUAN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN109948549A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2019</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF 
DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>ZHOU XIANDE</creatorcontrib><creatorcontrib>LI AILIN</creatorcontrib><creatorcontrib>ZHANG HUAN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZHOU XIANDE</au><au>LI AILIN</au><au>ZHANG HUAN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>OCR data generation method and device, computer equipment and storage medium</title><date>2019-06-28</date><risdate>2019</risdate><abstract>The invention relates to an OCR data generation method and device, computer equipment and a storage medium. The method comprises the steps that manufacturing an original data material; carrying out analysis and batch modification on the original data material to obtain a marked image; and identifying the mark image to extract the position information of the text area, and forming OCR data. According to the invention, the JavaScript interface is provided by means of the Photoshop; the original data material is analyzed and modified in batches; according to the OCR data generation method and device, the annotated image with the text area annotation is formed, text area recognition is conducted on the annotated image to obtain the position information of the text area, OCR data used for training OCR are formed accordingly, generation of the OCR data is automatically completed, the data generation speed is high, data adjustment details are simple, and practicability is high. 本发明涉及OCR数据生成方法、装置、计算机设备及存储介质,该方法包括制作原始</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN109948549A
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title OCR data generation method and device, computer equipment and storage medium
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T08%3A54%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZHOU%20XIANDE&rft.date=2019-06-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN109948549A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
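
The final step named in the description, recognizing the annotated image to extract text-area positions, can be illustrated with a minimal sketch. The abstract does not say how text areas are marked or detected, so everything here is an assumption made for illustration: the annotated image is taken to be a grid of RGB tuples in which each text area has been filled with one distinct marker color, and the hypothetical helper `find_marked_boxes` groups marker pixels into 4-connected regions and reports one bounding box per region. It is not the patented method.

```python
from collections import deque


def find_marked_boxes(image, marker):
    """Return bounding boxes of 4-connected regions whose pixels equal `marker`.

    `image` is a list of rows, each row a list of pixel values (assumed layout).
    Each box is (left, top, right, bottom) with inclusive pixel coordinates.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []

    for y in range(height):
        for x in range(width):
            if seen[y][x] or image[y][x] != marker:
                continue
            # Breadth-first flood fill over one marked region,
            # growing the bounding box as pixels are visited.
            left = right = x
            top = bottom = y
            queue = deque([(x, y)])
            seen[y][x] = True
            while queue:
                cx, cy = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    if (
                        0 <= nx < width
                        and 0 <= ny < height
                        and not seen[ny][nx]
                        and image[ny][nx] == marker
                    ):
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((left, top, right, bottom))
    return boxes
```

Called with a small grid and the assumed marker color, the function returns one box per marked region in row-major discovery order; pairing each box with the text that was rendered into that region would give one labelled sample for OCR training.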