APPARATUS AND METHOD FOR DOCUMENT RECOGNITION
Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer wh...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | eng ; kor |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | CHO SOO AH YOON JEONG LEE JOON SEOK SONG HYO SEOB |
description | Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document.
색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다. |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_KR20220050356A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>KR20220050356A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_KR20220050356A3</originalsourceid><addsrcrecordid>eNrjZNB1DAhwDHIMCQ1WcPRzUfB1DfHwd1Fw8w9ScPF3DvV19QtRCHJ19nf38wzx9PfjYWBNS8wpTuWF0twMym6uIc4euqkF-fGpxQWJyal5qSXx3kFGBkZGBgamBsamZo7GxKkCAFKjJk4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><source>esp@cenet</source><creator>CHO SOO AH ; YOON JEONG ; LEE JOON SEOK ; SONG HYO SEOB</creator><creatorcontrib>CHO SOO AH ; YOON JEONG ; LEE JOON SEOK ; SONG HYO SEOB</creatorcontrib><description>Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document.
색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다.</description><language>eng ; kor</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220425&DB=EPODOC&CC=KR&NR=20220050356A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,777,882,25545,76296</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220425&DB=EPODOC&CC=KR&NR=20220050356A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHO SOO AH</creatorcontrib><creatorcontrib>YOON JEONG</creatorcontrib><creatorcontrib>LEE JOON SEOK</creatorcontrib><creatorcontrib>SONG HYO SEOB</creatorcontrib><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><description>Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document.
색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZNB1DAhwDHIMCQ1WcPRzUfB1DfHwd1Fw8w9ScPF3DvV19QtRCHJ19nf38wzx9PfjYWBNS8wpTuWF0twMym6uIc4euqkF-fGpxQWJyal5qSXx3kFGBkZGBgamBsamZo7GxKkCAFKjJk4</recordid><startdate>20220425</startdate><enddate>20220425</enddate><creator>CHO SOO AH</creator><creator>YOON JEONG</creator><creator>LEE JOON SEOK</creator><creator>SONG HYO SEOB</creator><scope>EVB</scope></search><sort><creationdate>20220425</creationdate><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><author>CHO SOO AH ; YOON JEONG ; LEE JOON SEOK ; SONG HYO SEOB</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_KR20220050356A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; kor</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHO SOO AH</creatorcontrib><creatorcontrib>YOON JEONG</creatorcontrib><creatorcontrib>LEE JOON SEOK</creatorcontrib><creatorcontrib>SONG HYO SEOB</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHO SOO AH</au><au>YOON JEONG</au><au>LEE JOON SEOK</au><au>SONG HYO SEOB</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>APPARATUS AND METHOD FOR DOCUMENT RECOGNITION</title><date>2022-04-25</date><risdate>2022</risdate><abstract>Disclosed are a method and device for type analysis and key-value extraction of a document by using a feature vector of a document image generated by color space conversion. According to an embodiment of the present invention, the device for document recognition includes: a document type analyzer which analyzes a type of recognition target document based on document feature vector extracted from one or more partial images obtained by color space conversion of one or more partial regions of the recognition target document; and an information extractor which extracts value information from one or more information search images organized in a grid form based on a position of key information of the recognition target document. Accordingly, it is possible to reduce the influence of color and contamination of an input document.
색공간 변환으로 생성된 문서 이미지의 특징 벡터를 이용하여 문서의 유형 분석 및 키-값을 추출하기 위한 방법 및 장치가 개시된다. 일 실시예에 따른 문서 인식 장치는 인식 대상 문서에 대한 하나 이상의 부분 영역을 색공간 변환시킨 하나 이상의 부분 이미지로부터 추출한 문서 특징 벡터를 기초로 인식 대상 문서의 유형을 분석하는 문서 유형 분석부; 및 인식 대상 문서의 키(key) 정보의 위치를 기초로 격자 형태로 구성된 하나 이상의 정보 검색 이미지에서 값(value) 정보를 추출하는 정보 추출부를 포함할 수 있다.</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | eng ; kor |
recordid | cdi_epo_espacenet_KR20220050356A |
source | esp@cenet |
subjects | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING HANDLING RECORD CARRIERS PHYSICS PRESENTATION OF DATA RECOGNITION OF DATA RECORD CARRIERS |
title | APPARATUS AND METHOD FOR DOCUMENT RECOGNITION |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T00%3A57%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHO%20SOO%20AH&rft.date=2022-04-25&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EKR20220050356A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |