INFORMATION EXTRACTION METHOD AND APPARATUS FOR TEXT WITH LAYOUT

The present application relates to the technical field of information and the technical field of artificial intelligence, and provides an information extraction method and apparatus for text with layout. It is beneficial to improving the accuracy of information extraction of text with layout. The me...

Detailed description

Saved in:
Bibliographic details
Main authors: JIANG, Xuan ; HAO, Licui ; CHEN, Minqin ; WU, Peng ; YUE, Rongzhong
Format: Patent
Language: chi ; eng ; fre
Subjects:
Online access: Order full text
container_end_page
container_issue
container_start_page
container_title
container_volume
creator JIANG, Xuan
HAO, Licui
CHEN, Minqin
WU, Peng
YUE, Rongzhong
description The present application relates to the technical field of information and the technical field of artificial intelligence, and provides an information extraction method and apparatus for text with layout. It is beneficial to improving the accuracy of information extraction of text with layout. The method comprises: first, determining that a text block, which belongs to a target category, in text with layout needs to be extracted (S101); then, identifying a text block, which belongs to the target category, in the text with layout on the basis of feature information of text block granularity (S103); and next, outputting the identifier of the text block, which belongs to the target category, in the text with layout (S104). La présente invention se rapporte au domaine technique de l'information et au domaine technique de l'intelligence artificielle, et concerne un procédé et un appareil d'extraction d'informations pour un texte ayant une mise en page. Elle est bénéfique pour améliorer la précision d'extraction d'i
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_WO2022105237A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>WO2022105237A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_WO2022105237A13</originalsourceid><addsrcrecordid>eNrjZHDw9HPzD_J1DPH091NwjQgJcnQGM31dQzz8XRQc_YA4IMAxyDEkNFgBqFIhBKhIIdwzxEPBxzHSPzSEh4E1LTGnOJUXSnMzKLu5hjh76KYW5MenFhckJqfmpZbEh_sbGRgZGRqYGhmbOxoaE6cKAIAeK9k</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>INFORMATION EXTRACTION METHOD AND APPARATUS FOR TEXT WITH LAYOUT</title><source>esp@cenet</source><creator>JIANG, Xuan ; HAO, Licui ; CHEN, Minqin ; WU, Peng ; YUE, Rongzhong</creator><creatorcontrib>JIANG, Xuan ; HAO, Licui ; CHEN, Minqin ; WU, Peng ; YUE, Rongzhong</creatorcontrib><description>The present application relates to the technical field of information and the technical field of artificial intelligence, and provides an information extraction method and apparatus for text with layout. It is beneficial to improving the accuracy of information extraction of text with layout. The method comprises: first, determining that a text block, which belongs to a target category, in text with layout needs to be extracted (S101); then, identifying a text block, which belongs to the target category, in the text with layout on the basis of feature information of text block granularity (S103); and next, outputting the identifier of the text block, which belongs to the target category, in the text with layout (S104). La présente invention se rapporte au domaine technique de l'information et au domaine technique de l'intelligence artificielle, et concerne un procédé et un appareil d'extraction d'informations pour un texte ayant une mise en page. 
Elle est bénéfique pour améliorer la précision d'extraction d'i</description><language>chi ; eng ; fre</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220527&amp;DB=EPODOC&amp;CC=WO&amp;NR=2022105237A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25563,76318</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220527&amp;DB=EPODOC&amp;CC=WO&amp;NR=2022105237A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>JIANG, Xuan</creatorcontrib><creatorcontrib>HAO, Licui</creatorcontrib><creatorcontrib>CHEN, Minqin</creatorcontrib><creatorcontrib>WU, Peng</creatorcontrib><creatorcontrib>YUE, Rongzhong</creatorcontrib><title>INFORMATION EXTRACTION METHOD AND APPARATUS FOR TEXT WITH LAYOUT</title><description>The present application relates to the technical field of information and the technical field of artificial intelligence, and provides an information extraction method and apparatus for text with layout. It is beneficial to improving the accuracy of information extraction of text with layout. 
The method comprises: first, determining that a text block, which belongs to a target category, in text with layout needs to be extracted (S101); then, identifying a text block, which belongs to the target category, in the text with layout on the basis of feature information of text block granularity (S103); and next, outputting the identifier of the text block, which belongs to the target category, in the text with layout (S104). La présente invention se rapporte au domaine technique de l'information et au domaine technique de l'intelligence artificielle, et concerne un procédé et un appareil d'extraction d'informations pour un texte ayant une mise en page. Elle est bénéfique pour améliorer la précision d'extraction d'i</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHDw9HPzD_J1DPH091NwjQgJcnQGM31dQzz8XRQc_YA4IMAxyDEkNFgBqFIhBKhIIdwzxEPBxzHSPzSEh4E1LTGnOJUXSnMzKLu5hjh76KYW5MenFhckJqfmpZbEh_sbGRgZGRqYGhmbOxoaE6cKAIAeK9k</recordid><startdate>20220527</startdate><enddate>20220527</enddate><creator>JIANG, Xuan</creator><creator>HAO, Licui</creator><creator>CHEN, Minqin</creator><creator>WU, Peng</creator><creator>YUE, Rongzhong</creator><scope>EVB</scope></search><sort><creationdate>20220527</creationdate><title>INFORMATION EXTRACTION METHOD AND APPARATUS FOR TEXT WITH LAYOUT</title><author>JIANG, Xuan ; HAO, Licui ; CHEN, Minqin ; WU, Peng ; YUE, Rongzhong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_WO2022105237A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng ; fre</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA 
PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>JIANG, Xuan</creatorcontrib><creatorcontrib>HAO, Licui</creatorcontrib><creatorcontrib>CHEN, Minqin</creatorcontrib><creatorcontrib>WU, Peng</creatorcontrib><creatorcontrib>YUE, Rongzhong</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>JIANG, Xuan</au><au>HAO, Licui</au><au>CHEN, Minqin</au><au>WU, Peng</au><au>YUE, Rongzhong</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>INFORMATION EXTRACTION METHOD AND APPARATUS FOR TEXT WITH LAYOUT</title><date>2022-05-27</date><risdate>2022</risdate><abstract>The present application relates to the technical field of information and the technical field of artificial intelligence, and provides an information extraction method and apparatus for text with layout. It is beneficial to improving the accuracy of information extraction of text with layout. The method comprises: first, determining that a text block, which belongs to a target category, in text with layout needs to be extracted (S101); then, identifying a text block, which belongs to the target category, in the text with layout on the basis of feature information of text block granularity (S103); and next, outputting the identifier of the text block, which belongs to the target category, in the text with layout (S104). La présente invention se rapporte au domaine technique de l'information et au domaine technique de l'intelligence artificielle, et concerne un procédé et un appareil d'extraction d'informations pour un texte ayant une mise en page. Elle est bénéfique pour améliorer la précision d'extraction d'i</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng ; fre
recordid cdi_epo_espacenet_WO2022105237A1
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title INFORMATION EXTRACTION METHOD AND APPARATUS FOR TEXT WITH LAYOUT
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T02%3A40%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=JIANG,%20Xuan&rft.date=2022-05-27&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EWO2022105237A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true