APPARATUS AND METHOD FOR DOCUMENT RECOGNITION

Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer wh...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHO SOO AH, YOON JEONG, LEE JOON SEOK, SONG HYO SEOB
Format: Patent
Sprache:eng ; kor
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHO SOO AH
YOON JEONG
LEE JOON SEOK
SONG HYO SEOB
description Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document. 색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_KR20220050356A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>KR20220050356A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_KR20220050356A3</originalsourceid><addsrcrecordid>eNrjZNB1DAhwDHIMCQ1WcPRzUfB1DfHwd1Fw8w9ScPF3DvV19QtRCHJ19nf38wzx9PfjYWBNS8wpTuWF0twMym6uIc4euqkF-fGpxQWJyal5qSXx3kFGBkZGBgamBsamZo7GxKkCAFKjJk4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><source>esp@cenet</source><creator>CHO SOO AH ; YOON JEONG ; LEE JOON SEOK ; SONG HYO SEOB</creator><creatorcontrib>CHO SOO AH ; YOON JEONG ; LEE JOON SEOK ; SONG HYO SEOB</creatorcontrib><description>Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document. 색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다.</description><language>eng ; kor</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220425&amp;DB=EPODOC&amp;CC=KR&amp;NR=20220050356A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,777,882,25545,76296</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220425&amp;DB=EPODOC&amp;CC=KR&amp;NR=20220050356A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHO SOO AH</creatorcontrib><creatorcontrib>YOON JEONG</creatorcontrib><creatorcontrib>LEE JOON SEOK</creatorcontrib><creatorcontrib>SONG HYO SEOB</creatorcontrib><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><description>Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document. 색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZNB1DAhwDHIMCQ1WcPRzUfB1DfHwd1Fw8w9ScPF3DvV19QtRCHJ19nf38wzx9PfjYWBNS8wpTuWF0twMym6uIc4euqkF-fGpxQWJyal5qSXx3kFGBkZGBgamBsamZo7GxKkCAFKjJk4</recordid><startdate>20220425</startdate><enddate>20220425</enddate><creator>CHO SOO AH</creator><creator>YOON JEONG</creator><creator>LEE JOON SEOK</creator><creator>SONG HYO SEOB</creator><scope>EVB</scope></search><sort><creationdate>20220425</creationdate><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><author>CHO SOO AH ; YOON JEONG ; LEE JOON SEOK ; SONG HYO SEOB</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_KR20220050356A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; kor</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHO SOO AH</creatorcontrib><creatorcontrib>YOON JEONG</creatorcontrib><creatorcontrib>LEE JOON SEOK</creatorcontrib><creatorcontrib>SONG HYO SEOB</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHO SOO AH</au><au>YOON JEONG</au><au>LEE JOON SEOK</au><au>SONG HYO SEOB</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><date>2022-04-25</date><risdate>2022</risdate><abstract>Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document. 색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; kor
recordid cdi_epo_espacenet_KR20220050356A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title APPARATUS AND METHOD FOR DOCUMENT RECOGNITION
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T00%3A57%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHO%20SOO%20AH&rft.date=2022-04-25&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EKR20220050356A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true