Document key information extraction method and system based on keyword splitting technology

The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out ke...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ZHAO ZENGTAO, SHE JUN, LUO YONG, YU SHAOFENG, LIAO CHONGYANG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator ZHAO ZENGTAO
SHE JUN
LUO YONG
YU SHAOFENG
LIAO CHONGYANG
description The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN113850056A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN113850056A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN113850056A3</originalsourceid><addsrcrecordid>eNqNi0EKwjAQRbtxIeodxgMILaXiVqriypU7FyWm0zaYzIRkRHN7g3gAV__z_vvz4nZg_XRIAg9MYGjg4JQYJsC3BKW_1aFM3IOiHmKKgg7uKmIPecqvF4fMvTUihkYQ1BOx5TEti9mgbMTVLxfF-nS8tucNeu4weqWRULr2UlX1rinLZruv_3E-5UU8Xw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Document key information extraction method and system based on keyword splitting technology</title><source>esp@cenet</source><creator>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</creator><creatorcontrib>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</creatorcontrib><description>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20211228&amp;DB=EPODOC&amp;CC=CN&amp;NR=113850056A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20211228&amp;DB=EPODOC&amp;CC=CN&amp;NR=113850056A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZHAO ZENGTAO</creatorcontrib><creatorcontrib>SHE JUN</creatorcontrib><creatorcontrib>LUO YONG</creatorcontrib><creatorcontrib>YU SHAOFENG</creatorcontrib><creatorcontrib>LIAO CHONGYANG</creatorcontrib><title>Document key information extraction method and system based on keyword splitting technology</title><description>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi0EKwjAQRbtxIeodxgMILaXiVqriypU7FyWm0zaYzIRkRHN7g3gAV__z_vvz4nZg_XRIAg9MYGjg4JQYJsC3BKW_1aFM3IOiHmKKgg7uKmIPecqvF4fMvTUihkYQ1BOx5TEti9mgbMTVLxfF-nS8tucNeu4weqWRULr2UlX1rinLZruv_3E-5UU8Xw</recordid><startdate>20211228</startdate><enddate>20211228</enddate><creator>ZHAO ZENGTAO</creator><creator>SHE JUN</creator><creator>LUO YONG</creator><creator>YU SHAOFENG</creator><creator>LIAO CHONGYANG</creator><scope>EVB</scope></search><sort><creationdate>20211228</creationdate><title>Document key information extraction method and system based on keyword splitting technology</title><author>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN113850056A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>ZHAO ZENGTAO</creatorcontrib><creatorcontrib>SHE JUN</creatorcontrib><creatorcontrib>LUO YONG</creatorcontrib><creatorcontrib>YU SHAOFENG</creatorcontrib><creatorcontrib>LIAO CHONGYANG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZHAO ZENGTAO</au><au>SHE JUN</au><au>LUO YONG</au><au>YU SHAOFENG</au><au>LIAO CHONGYANG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Document key information extraction method and system based on keyword splitting technology</title><date>2021-12-28</date><risdate>2021</risdate><abstract>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN113850056A
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Document key information extraction method and system based on keyword splitting technology
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T01%3A11%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZHAO%20ZENGTAO&rft.date=2021-12-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113850056A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true