Method and system for labeling text segment

A method and system for labeling text segment. The method includes the following steps. First, a document to be recognized is provided, and the document to be recognized includes multiple text images. Then, at least one text segment is recognized and the text image in the text segment is converted i...

Detailed description

Bibliographic details
Main authors: CHAO, SHIH-LUNG, LIU, YIN-LI, LIN, TZUUAN, LIN, YIN, SHEN, SHENG-SYUN, HUANG, SHIHNG
Format: Patent
Language: chi ; eng
Subjects:
Online access: Order full text
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHAO, SHIH-LUNG
LIU, YIN-LI
LIN, TZUUAN
LIN, YIN
SHEN, SHENG-SYUN
HUANG, SHIHNG
description A method and system for labeling text segment. The method includes the following steps. First, a document to be recognized is provided, and the document to be recognized includes multiple text images. Then, at least one text segment is recognized and the text image in the text segment is converted into editable text. Thereafter, at least one first correlation information between the text segment and the document to be recognized is evaluated, and the editable text and the first correlation information are converted into a first feature matrix. Furthermore, a plurality of second correlation information of each text segment and other text segments is evaluated, and the first feature matrix is converted into a second feature matrix by the second correlation information. Then, the second feature matrix is converted into a third feature matrix which represents the confidence level. The third feature matrix is converted into a one-dimensional matrix, and each element of the one-dimensional matrix represents a label
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_TWI787651BB</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>TWI787651BB</sourcerecordid><originalsourceid>FETCH-epo_espacenet_TWI787651BB3</originalsourceid><addsrcrecordid>eNrjZND2TS3JyE9RSMxLUSiuLC5JzVVIyy9SyElMSs3JzEtXKEmtKFEoTk3PTc0r4WFgTUvMKU7lhdLcDApuriHOHrqpBfnxqcUFicmpeakl8SHhnuYW5mamhk5OxkQoAQAQoCl2</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method and system for labeling text segment</title><source>esp@cenet</source><creator>CHAO, SHIH-LUNG ; LIU, YIN-LI ; LIN, TZUUAN ; LIN, YIN ; SHEN, SHENG-SYUN ; HUANG, SHIHNG</creator><creatorcontrib>CHAO, SHIH-LUNG ; LIU, YIN-LI ; LIN, TZUUAN ; LIN, YIN ; SHEN, SHENG-SYUN ; HUANG, SHIHNG</creatorcontrib><description>A method and system for labeling text segment. The method includes the following steps. First, a document to be recognized is provided, and the document to be recognized includes multiple text images. Then, at least one text segment is recognized and the text image in the text segment is converted into editable text. Thereafter, at least one first correlation information between the text segment and the document to be recognized is evaluated, and the editable text and the first correlation information are converted into a first feature matrix. Furthermore, a plurality of second correlation information of each text segment and other text segments is evaluated, and the first feature matrix is converted into a second feature matrix by the second correlation information. Then, the second feature matrix is converted into a third feature matrix which represents the confidence level. 
The third feature matrix is converted into a one-dimensional matrix, and each element of the one-dimensional matrix represents a label</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20221221&amp;DB=EPODOC&amp;CC=TW&amp;NR=I787651B$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20221221&amp;DB=EPODOC&amp;CC=TW&amp;NR=I787651B$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHAO, SHIH-LUNG</creatorcontrib><creatorcontrib>LIU, YIN-LI</creatorcontrib><creatorcontrib>LIN, TZUUAN</creatorcontrib><creatorcontrib>LIN, YIN</creatorcontrib><creatorcontrib>SHEN, SHENG-SYUN</creatorcontrib><creatorcontrib>HUANG, SHIHNG</creatorcontrib><title>Method and system for labeling text segment</title><description>A method and system for labeling text segment. The method includes the following steps. First, a document to be recognized is provided, and the document to be recognized includes multiple text images. Then, at least one text segment is recognized and the text image in the text segment is converted into editable text. 
Thereafter, at least one first correlation information between the text segment and the document to be recognized is evaluated, and the editable text and the first correlation information are converted into a first feature matrix. Furthermore, a plurality of second correlation information of each text segment and other text segments is evaluated, and the first feature matrix is converted into a second feature matrix by the second correlation information. Then, the second feature matrix is converted into a third feature matrix which represents the confidence level. The third feature matrix is converted into a one-dimensional matrix, and each element of the one-dimensional matrix represents a label</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZND2TS3JyE9RSMxLUSiuLC5JzVVIyy9SyElMSs3JzEtXKEmtKFEoTk3PTc0r4WFgTUvMKU7lhdLcDApuriHOHrqpBfnxqcUFicmpeakl8SHhnuYW5mamhk5OxkQoAQAQoCl2</recordid><startdate>20221221</startdate><enddate>20221221</enddate><creator>CHAO, SHIH-LUNG</creator><creator>LIU, YIN-LI</creator><creator>LIN, TZUUAN</creator><creator>LIN, YIN</creator><creator>SHEN, SHENG-SYUN</creator><creator>HUANG, SHIHNG</creator><scope>EVB</scope></search><sort><creationdate>20221221</creationdate><title>Method and system for labeling text segment</title><author>CHAO, SHIH-LUNG ; LIU, YIN-LI ; LIN, TZUUAN ; LIN, YIN ; SHEN, SHENG-SYUN ; HUANG, 
SHIHNG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_TWI787651BB3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHAO, SHIH-LUNG</creatorcontrib><creatorcontrib>LIU, YIN-LI</creatorcontrib><creatorcontrib>LIN, TZUUAN</creatorcontrib><creatorcontrib>LIN, YIN</creatorcontrib><creatorcontrib>SHEN, SHENG-SYUN</creatorcontrib><creatorcontrib>HUANG, SHIHNG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHAO, SHIH-LUNG</au><au>LIU, YIN-LI</au><au>LIN, TZUUAN</au><au>LIN, YIN</au><au>SHEN, SHENG-SYUN</au><au>HUANG, SHIHNG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method and system for labeling text segment</title><date>2022-12-21</date><risdate>2022</risdate><abstract>A method and system for labeling text segment. The method includes the following steps. First, a document to be recognized is provided, and the document to be recognized includes multiple text images. Then, at least one text segment is recognized and the text image in the text segment is converted into editable text. Thereafter, at least one first correlation information between the text segment and the document to be recognized is evaluated, and the editable text and the first correlation information are converted into a first feature matrix. 
Furthermore, a plurality of second correlation information of each text segment and other text segments is evaluated, and the first feature matrix is converted into a second feature matrix by the second correlation information. Then, the second feature matrix is converted into a third feature matrix which represents the confidence level. The third feature matrix is converted into a one-dimensional matrix, and each element of the one-dimensional matrix represents a label</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_TWI787651BB
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title Method and system for labeling text segment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T17%3A59%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHAO,%20SHIH-LUNG&rft.date=2022-12-21&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ETWI787651BB%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
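
The pipeline summarized in the description field can be illustrated with a small sketch. This is not the patented implementation: the record gives no formulas, so the feature choices, the distance-based affinity, the weight matrix WEIGHTS, the label names, and every function name below are assumptions made only for illustration. The sketch starts from already-recognized segments (editable text plus a bounding box), builds a first feature matrix from the text and its position in the document, mixes it with segment-to-segment affinities to get a second matrix, turns that into per-label confidences (third matrix), and reduces it to a one-dimensional list of labels.

```python
import math

# Hypothetical label set; the record does not name any labels.
LABEL_NAMES = ["key", "value", "other"]

def text_features(text):
    """Simple descriptors of the editable text (assumed features)."""
    n = max(len(text), 1)
    digits = sum(ch.isdigit() for ch in text) / n
    letters = sum(ch.isalpha() for ch in text) / n
    return [digits, letters, min(len(text) / 32.0, 1.0)]

def layout_features(box, page_w, page_h):
    """First correlation information: segment position and size relative to the page."""
    x, y, w, h = box
    return [x / page_w, y / page_h, w / page_w, h / page_h]

def first_feature_matrix(segments, page_w, page_h):
    """One row per segment: text features followed by layout features."""
    return [text_features(s["text"]) + layout_features(s["box"], page_w, page_h)
            for s in segments]

def segment_affinity(segments):
    """Second correlation information: row-normalized inverse-distance weights
    between segment centers (an assumed choice of correlation)."""
    centers = [(x + w / 2, y + h / 2) for x, y, w, h in (s["box"] for s in segments)]
    rows = []
    for cx, cy in centers:
        weights = [1.0 / (1.0 + math.hypot(cx - ox, cy - oy)) for ox, oy in centers]
        total = sum(weights)
        rows.append([wt / total for wt in weights])
    return rows

def matmul(a, b):
    """Plain list-of-lists matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Turn raw scores into confidences that sum to 1."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def label_segments(segments, page_w, page_h, weights):
    f1 = first_feature_matrix(segments, page_w, page_h)       # first feature matrix
    f2 = matmul(segment_affinity(segments), f1)               # second feature matrix
    f3 = [softmax(row) for row in matmul(f2, weights)]        # third matrix: confidences
    labels = [max(range(len(row)), key=row.__getitem__) for row in f3]  # 1-D result
    return f3, labels

# Toy input standing in for OCR output; boxes are (x, y, width, height) in pixels.
SEGMENTS = [
    {"text": "Invoice No.", "box": (40, 50, 120, 20)},
    {"text": "12345", "box": (180, 50, 60, 20)},
    {"text": "Total", "box": (40, 700, 60, 20)},
]

# Hand-written 7x3 weights (7 features, 3 labels); a real system would learn these.
WEIGHTS = [
    [-1.0, 2.0, 0.0],
    [1.5, -1.0, 0.0],
    [0.5, 0.0, 0.2],
    [-0.5, 0.5, 0.0],
    [0.0, 0.0, 0.3],
    [0.2, 0.1, 0.0],
    [0.0, 0.0, 0.1],
]

confidence, labels = label_segments(SEGMENTS, 600, 800, WEIGHTS)
for seg, k in zip(SEGMENTS, labels):
    print(seg["text"], "->", LABEL_NAMES[k])
```

Each element of labels is an index into LABEL_NAMES, matching the abstract's final step in which each element of the one-dimensional matrix represents a label. Multiplying by the affinity matrix is one simple way to let neighboring segments influence each other; the patent may use a different mechanism.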