Form analysis and understanding based on knowledge

Forms differ from ordinary documents and are more difficult to process; traditional document-analysis techniques handle them poorly. The purpose of this thesis is to investigate the key technologies of form analysis and understanding, targeting forms that are multi-type, larg...

Bibliographic details
Main authors: Xiuling He, Yang Yang, Zengzhao Chen, Ying Yu, Cailin Dong
Format: Conference proceedings
Language: chi ; eng
Subjects:
container_end_page 9291
container_issue
container_start_page 9286
container_title
container_volume
creator Xiuling He
Yang Yang
Zengzhao Chen
Ying Yu
Cailin Dong
description Forms differ from ordinary documents and are more difficult to process; traditional document-analysis techniques handle them poorly. The purpose of this thesis is to investigate the key technologies of form analysis and understanding, targeting the forms used by banks and revenue agencies in our country, which are multi-type, high-volume, noisy, and printed on paper of varying quality. Based on an analysis of form knowledge, a form model is put forward that uses an object-oriented sorting-tree knowledge base and a triple-node representation. For form feature extraction, the hierarchical regulated hit-or-miss transform (HRHMT) is proposed, and a feature extraction algorithm is given. The algorithm is proved to be theoretically feasible. Theoretical complexity analysis and experiments demonstrate the efficiency and superiority of the proposed methods.
doi_str_mv 10.1109/WCICA.2008.4594401
format Conference Proceeding
identifier ISBN: 9781424421138
identifier EISBN: 1424421144
identifier EISBN: 9781424421145
identifier LCCN: 2008900670
publisher IEEE
date 2008-06
tpages 6
fulltext fulltext_linktorsrc
identifier ISBN: 1424421136
ispartof 2008 7th World Congress on Intelligent Control and Automation, 2008, p.9286-9291
issn
language chi ; eng
recordid cdi_ieee_primary_4594401
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Algorithm design and analysis
Analytical models
Complexity theory
Feature extraction
form
Hierarchical Regulated Hit or Miss Transform
Mathematics
Optical character recognition software
presentation model
Transforms
title Form analysis and understanding based on knowledge
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T08%3A33%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Form%20analysis%20and%20understanding%20based%20on%20knowledge&rft.btitle=2008%207th%20World%20Congress%20on%20Intelligent%20Control%20and%20Automation&rft.au=Xiuling%20He&rft.date=2008-06&rft.spage=9286&rft.epage=9291&rft.pages=9286-9291&rft.isbn=1424421136&rft.isbn_list=9781424421138&rft_id=info:doi/10.1109/WCICA.2008.4594401&rft_dat=%3Cieee_6IE%3E4594401%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424421144&rft.eisbn_list=9781424421145&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4594401&rfr_iscdi=true