Form analysis and understanding based on knowledge
Forms are different from common documents and more difficult to process. It is not very well done if processed by traditional document analyzing technology. The purpose of this thesis is to research on the key technologies of form analysis and understanding aiming at forms which are multi-type, larg...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 9291 |
---|---|
container_issue | |
container_start_page | 9286 |
container_title | |
container_volume | |
creator | Xiuling He Yang Yang Zengzhao Chen Ying Yu Cailin Dong |
description | Forms are different from common documents and more difficult to process. It is not very well done if processed by traditional document analyzing technology. The purpose of this thesis is to research on the key technologies of form analysis and understanding aiming at forms which are multi-type, large quantity, noisy, and variety of paper quality used by bank and revenue in our country. Based on analyzing knowledge of forms, a model of form based on object-orient sorting tree knowledge base and triple node representation is put forward. As for form feature extraction, hierarchical regulated hit or miss transform (HRHMT) is proposed. Feature extraction algorithm is also given. The algorithm is proved to be feasibility theoretically. Analysis on theoretical complexity and experiments shows their efficiency and superiority. |
doi_str_mv | 10.1109/WCICA.2008.4594401 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4594401</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4594401</ieee_id><sourcerecordid>4594401</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-9a7be1be53283b78f832e79ee1c05c2a4fbe5a07c0d8c5dac9da6ca7725d99a03</originalsourceid><addsrcrecordid>eNpFT9tKw0AUXJGC9vID-pIfSDx7y-4-SrAqFHxR-lhOdk_KappItiL9e1csOC8zwwyHM4zdcKg4B3e3bZ6b-0oA2EpppxTwCzbnSiglOFfq8t_Iesbmv0UHUBu4YquU3iFDaVm7-pqJ9TgdChywP6WYsgjF1xBoSscs47AvWkwUinEoPobxu6ewpyWbddgnWp15wd7WD6_NU7l5ecx_bcrIjT6WDk1LvCUthZWtsZ2Vgowj4h60F6i6nCEYD8F6HdC7gLVHY4QOziHIBbv9uxuJaPc5xQNOp915sPwBDBRHtw</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Form analysis and understanding based on knowledge</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Xiuling He ; Yang Yang ; Zengzhao Chen ; Ying Yu ; Cailin Dong</creator><creatorcontrib>Xiuling He ; Yang Yang ; Zengzhao Chen ; Ying Yu ; Cailin Dong</creatorcontrib><description>Forms are different from common documents and more difficult to process. It is not very well done if processed by traditional document analyzing technology. The purpose of this thesis is to research on the key technologies of form analysis and understanding aiming at forms which are multi-type, large quantity, noisy, and variety of paper quality used by bank and revenue in our country. Based on analyzing knowledge of forms, a model of form based on object-orient sorting tree knowledge base and triple node representation is put forward. As for form feature extraction, hierarchical regulated hit or miss transform (HRHMT) is proposed. Feature extraction algorithm is also given. The algorithm is proved to be feasibility theoretically. Analysis on theoretical complexity and experiments shows their efficiency and superiority.</description><identifier>ISBN: 1424421136</identifier><identifier>ISBN: 9781424421138</identifier><identifier>EISBN: 1424421144</identifier><identifier>EISBN: 9781424421145</identifier><identifier>DOI: 10.1109/WCICA.2008.4594401</identifier><identifier>LCCN: 2008900670</identifier><language>chi ; eng</language><publisher>IEEE</publisher><subject>Algorithm design and analysis ; Analytical models ; Complexity theory ; Feature extraction ; form ; Hierarchical Regulated Hit or Miss Transform ; Mathematics ; Optical character recognition software ; presentation model ; Transforms</subject><ispartof>2008 7th World Congress on Intelligent Control and Automation, 2008, p.9286-9291</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4594401$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4594401$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Xiuling He</creatorcontrib><creatorcontrib>Yang Yang</creatorcontrib><creatorcontrib>Zengzhao Chen</creatorcontrib><creatorcontrib>Ying Yu</creatorcontrib><creatorcontrib>Cailin Dong</creatorcontrib><title>Form analysis and understanding based on knowledge</title><title>2008 7th World Congress on Intelligent Control and Automation</title><addtitle>WCICA</addtitle><description>Forms are different from common documents and more difficult to process. It is not very well done if processed by traditional document analyzing technology. The purpose of this thesis is to research on the key technologies of form analysis and understanding aiming at forms which are multi-type, large quantity, noisy, and variety of paper quality used by bank and revenue in our country. Based on analyzing knowledge of forms, a model of form based on object-orient sorting tree knowledge base and triple node representation is put forward. As for form feature extraction, hierarchical regulated hit or miss transform (HRHMT) is proposed. Feature extraction algorithm is also given. The algorithm is proved to be feasibility theoretically. Analysis on theoretical complexity and experiments shows their efficiency and superiority.</description><subject>Algorithm design and analysis</subject><subject>Analytical models</subject><subject>Complexity theory</subject><subject>Feature extraction</subject><subject>form</subject><subject>Hierarchical Regulated Hit or Miss Transform</subject><subject>Mathematics</subject><subject>Optical character recognition software</subject><subject>presentation model</subject><subject>Transforms</subject><isbn>1424421136</isbn><isbn>9781424421138</isbn><isbn>1424421144</isbn><isbn>9781424421145</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpFT9tKw0AUXJGC9vID-pIfSDx7y-4-SrAqFHxR-lhOdk_KappItiL9e1csOC8zwwyHM4zdcKg4B3e3bZ6b-0oA2EpppxTwCzbnSiglOFfq8t_Iesbmv0UHUBu4YquU3iFDaVm7-pqJ9TgdChywP6WYsgjF1xBoSscs47AvWkwUinEoPobxu6ewpyWbddgnWp15wd7WD6_NU7l5ecx_bcrIjT6WDk1LvCUthZWtsZ2Vgowj4h60F6i6nCEYD8F6HdC7gLVHY4QOziHIBbv9uxuJaPc5xQNOp915sPwBDBRHtw</recordid><startdate>200806</startdate><enddate>200806</enddate><creator>Xiuling He</creator><creator>Yang Yang</creator><creator>Zengzhao Chen</creator><creator>Ying Yu</creator><creator>Cailin Dong</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>200806</creationdate><title>Form analysis and understanding based on knowledge</title><author>Xiuling He ; Yang Yang ; Zengzhao Chen ; Ying Yu ; Cailin Dong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-9a7be1be53283b78f832e79ee1c05c2a4fbe5a07c0d8c5dac9da6ca7725d99a03</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>chi ; eng</language><creationdate>2008</creationdate><topic>Algorithm design and analysis</topic><topic>Analytical models</topic><topic>Complexity theory</topic><topic>Feature extraction</topic><topic>form</topic><topic>Hierarchical Regulated Hit or Miss Transform</topic><topic>Mathematics</topic><topic>Optical character recognition software</topic><topic>presentation model</topic><topic>Transforms</topic><toplevel>online_resources</toplevel><creatorcontrib>Xiuling He</creatorcontrib><creatorcontrib>Yang Yang</creatorcontrib><creatorcontrib>Zengzhao Chen</creatorcontrib><creatorcontrib>Ying Yu</creatorcontrib><creatorcontrib>Cailin Dong</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Xiuling He</au><au>Yang Yang</au><au>Zengzhao Chen</au><au>Ying Yu</au><au>Cailin Dong</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Form analysis and understanding based on knowledge</atitle><btitle>2008 7th World Congress on Intelligent Control and Automation</btitle><stitle>WCICA</stitle><date>2008-06</date><risdate>2008</risdate><spage>9286</spage><epage>9291</epage><pages>9286-9291</pages><isbn>1424421136</isbn><isbn>9781424421138</isbn><eisbn>1424421144</eisbn><eisbn>9781424421145</eisbn><abstract>Forms are different from common documents and more difficult to process. It is not very well done if processed by traditional document analyzing technology. The purpose of this thesis is to research on the key technologies of form analysis and understanding aiming at forms which are multi-type, large quantity, noisy, and variety of paper quality used by bank and revenue in our country. Based on analyzing knowledge of forms, a model of form based on object-orient sorting tree knowledge base and triple node representation is put forward. As for form feature extraction, hierarchical regulated hit or miss transform (HRHMT) is proposed. Feature extraction algorithm is also given. The algorithm is proved to be feasibility theoretically. Analysis on theoretical complexity and experiments shows their efficiency and superiority.</abstract><pub>IEEE</pub><doi>10.1109/WCICA.2008.4594401</doi><tpages>6</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISBN: 1424421136 |
ispartof | 2008 7th World Congress on Intelligent Control and Automation, 2008, p.9286-9291 |
issn | |
language | chi ; eng |
recordid | cdi_ieee_primary_4594401 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Algorithm design and analysis Analytical models Complexity theory Feature extraction form Hierarchical Regulated Hit or Miss Transform Mathematics Optical character recognition software presentation model Transforms |
title | Form analysis and understanding based on knowledge |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T08%3A33%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Form%20analysis%20and%20understanding%20based%20on%20knowledge&rft.btitle=2008%207th%20World%20Congress%20on%20Intelligent%20Control%20and%20Automation&rft.au=Xiuling%20He&rft.date=2008-06&rft.spage=9286&rft.epage=9291&rft.pages=9286-9291&rft.isbn=1424421136&rft.isbn_list=9781424421138&rft_id=info:doi/10.1109/WCICA.2008.4594401&rft_dat=%3Cieee_6IE%3E4594401%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424421144&rft.eisbn_list=9781424421145&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4594401&rfr_iscdi=true |