Document key information extraction method and system based on keyword splitting technology

The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out ke...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	ZHAO ZENGTAO, SHE JUN, LUO YONG, YU SHAOFENG, LIAO CHONGYANG
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	ZHAO ZENGTAO SHE JUN LUO YONG YU SHAOFENG LIAO CHONGYANG
description	The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN113850056A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN113850056A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN113850056A3</originalsourceid><addsrcrecordid>eNqNi0EKwjAQRbtxIeodxgMILaXiVqriypU7FyWm0zaYzIRkRHN7g3gAV__z_vvz4nZg_XRIAg9MYGjg4JQYJsC3BKW_1aFM3IOiHmKKgg7uKmIPecqvF4fMvTUihkYQ1BOx5TEti9mgbMTVLxfF-nS8tucNeu4weqWRULr2UlX1rinLZruv_3E-5UU8Xw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Document key information extraction method and system based on keyword splitting technology</title><source>esp@cenet</source><creator>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</creator><creatorcontrib>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</creatorcontrib><description>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&CC=CN&NR=113850056A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&CC=CN&NR=113850056A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZHAO ZENGTAO</creatorcontrib><creatorcontrib>SHE JUN</creatorcontrib><creatorcontrib>LUO YONG</creatorcontrib><creatorcontrib>YU SHAOFENG</creatorcontrib><creatorcontrib>LIAO CHONGYANG</creatorcontrib><title>Document key information extraction method and system based on keyword splitting technology</title><description>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi0EKwjAQRbtxIeodxgMILaXiVqriypU7FyWm0zaYzIRkRHN7g3gAV__z_vvz4nZg_XRIAg9MYGjg4JQYJsC3BKW_1aFM3IOiHmKKgg7uKmIPecqvF4fMvTUihkYQ1BOx5TEti9mgbMTVLxfF-nS8tucNeu4weqWRULr2UlX1rinLZruv_3E-5UU8Xw</recordid><startdate>20211228</startdate><enddate>20211228</enddate><creator>ZHAO ZENGTAO</creator><creator>SHE JUN</creator><creator>LUO YONG</creator><creator>YU SHAOFENG</creator><creator>LIAO CHONGYANG</creator><scope>EVB</scope></search><sort><creationdate>20211228</creationdate><title>Document key information extraction method and system based on keyword splitting technology</title><author>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN113850056A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>ZHAO ZENGTAO</creatorcontrib><creatorcontrib>SHE JUN</creatorcontrib><creatorcontrib>LUO YONG</creatorcontrib><creatorcontrib>YU SHAOFENG</creatorcontrib><creatorcontrib>LIAO CHONGYANG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZHAO ZENGTAO</au><au>SHE JUN</au><au>LUO YONG</au><au>YU SHAOFENG</au><au>LIAO CHONGYANG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Document key information extraction method and system based on keyword splitting technology</title><date>2021-12-28</date><risdate>2021</risdate><abstract>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN113850056A
source	esp@cenet
subjects	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
title	Document key information extraction method and system based on keyword splitting technology
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T01%3A11%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZHAO%20ZENGTAO&rft.date=2021-12-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113850056A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true