Document key information extraction method and system based on keyword splitting technology
The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out ke...
Main authors: | ZHAO ZENGTAO, SHE JUN, LUO YONG, YU SHAOFENG, LIAO CHONGYANG |
---|---|
Format: | Patent |
Language: | chi ; eng |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG |
description | The invention provides a method and system for extracting key information from documents based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises: obtaining a target document and converting it into an XML format document; and extracting key information from the XML format document based on a keyword splitting detection technology. XML (Extensible Markup Language) is a markup language used to mark up an electronic file so that the file has structure. Converting the target document into an XML format document therefore facilitates subsequent information extraction. In the extraction step, structured key field information can be extracted from continuous natural language text. The problem t |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN113850056A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN113850056A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN113850056A3</originalsourceid><addsrcrecordid>eNqNi0EKwjAQRbtxIeodxgMILaXiVqriypU7FyWm0zaYzIRkRHN7g3gAV__z_vvz4nZg_XRIAg9MYGjg4JQYJsC3BKW_1aFM3IOiHmKKgg7uKmIPecqvF4fMvTUihkYQ1BOx5TEti9mgbMTVLxfF-nS8tucNeu4weqWRULr2UlX1rinLZruv_3E-5UU8Xw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Document key information extraction method and system based on keyword splitting technology</title><source>esp@cenet</source><creator>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</creator><creatorcontrib>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</creatorcontrib><description>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. 
The problem t</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&CC=CN&NR=113850056A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211228&DB=EPODOC&CC=CN&NR=113850056A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZHAO ZENGTAO</creatorcontrib><creatorcontrib>SHE JUN</creatorcontrib><creatorcontrib>LUO YONG</creatorcontrib><creatorcontrib>YU SHAOFENG</creatorcontrib><creatorcontrib>LIAO CHONGYANG</creatorcontrib><title>Document key information extraction method and system based on keyword splitting technology</title><description>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. 
Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi0EKwjAQRbtxIeodxgMILaXiVqriypU7FyWm0zaYzIRkRHN7g3gAV__z_vvz4nZg_XRIAg9MYGjg4JQYJsC3BKW_1aFM3IOiHmKKgg7uKmIPecqvF4fMvTUihkYQ1BOx5TEti9mgbMTVLxfF-nS8tucNeu4weqWRULr2UlX1rinLZruv_3E-5UU8Xw</recordid><startdate>20211228</startdate><enddate>20211228</enddate><creator>ZHAO ZENGTAO</creator><creator>SHE JUN</creator><creator>LUO YONG</creator><creator>YU SHAOFENG</creator><creator>LIAO CHONGYANG</creator><scope>EVB</scope></search><sort><creationdate>20211228</creationdate><title>Document key information extraction method and system based on keyword splitting technology</title><author>ZHAO ZENGTAO ; SHE JUN ; LUO YONG ; YU SHAOFENG ; LIAO CHONGYANG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN113850056A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>ZHAO ZENGTAO</creatorcontrib><creatorcontrib>SHE JUN</creatorcontrib><creatorcontrib>LUO YONG</creatorcontrib><creatorcontrib>YU SHAOFENG</creatorcontrib><creatorcontrib>LIAO 
CHONGYANG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZHAO ZENGTAO</au><au>SHE JUN</au><au>LUO YONG</au><au>YU SHAOFENG</au><au>LIAO CHONGYANG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Document key information extraction method and system based on keyword splitting technology</title><date>2021-12-28</date><risdate>2021</risdate><abstract>The invention provides a document key information extraction method and system based on a keyword splitting technology, and relates to the field of document key information extraction. The method comprises the steps: converting an obtained target document into an XML format document; carrying out key information extraction on the XML format document based on a keyword splitting detection technology; and obtaining a target document and converting the target document into an XML format document. XML is an extensible markup language, and is a markup language used for marking an electronic file to enable the electronic file to have a structural property. Therefore, the target document is converted into the XML format document, and subsequent information extraction is facilitated. Key information extraction is carried out on the XML format document based on a keyword splitting detection technology. In the step, structured key field information can be extracted from a continuous natural language text. The problem t</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN113850056A |
source | esp@cenet |
subjects | CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS |
title | Document key information extraction method and system based on keyword splitting technology |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T01%3A11%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZHAO%20ZENGTAO&rft.date=2021-12-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113850056A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
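
The abstract describes two steps: converting the target document to XML, then extracting structured key fields from continuous text by splitting on keywords. The record does not disclose the actual algorithm, so the following is only a minimal sketch of one plausible reading. The sample document, the keyword list, the `keyword: value` layout, and the function names `extract_text` and `split_by_keywords` are all assumptions made for illustration and are not taken from the patent.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical input: a target document that has already been converted to XML.
XML_DOC = """<document>
  <p>Contract No: A-1024 Party A: Example Co. Party B: Sample Ltd. Amount: 5000 USD</p>
</document>"""

# Hypothetical keyword list; the abstract does not enumerate any keywords.
KEYWORDS = ["Contract No", "Party A", "Party B", "Amount"]


def extract_text(xml_string):
    """Concatenate all non-empty text nodes of the XML document into one string."""
    root = ET.fromstring(xml_string)
    return " ".join(t.strip() for t in root.itertext() if t.strip())


def split_by_keywords(text, keywords):
    """Split running text at each 'keyword:' marker and map keyword -> value."""
    # Try longer keywords first so that overlapping keywords match the longest one.
    ordered = sorted(keywords, key=len, reverse=True)
    alternation = "|".join(re.escape(k) for k in ordered)
    pattern = re.compile(r"(" + alternation + r")\s*:\s*")
    # With one capturing group, re.split returns
    # [prefix, key1, value1, key2, value2, ...].
    parts = pattern.split(text)
    fields = {}
    for key, value in zip(parts[1::2], parts[2::2]):
        fields[key] = value.strip()
    return fields


fields = split_by_keywords(extract_text(XML_DOC), KEYWORDS)
print(fields)
```

Splitting on a single compiled alternation keeps the values in document order and avoids scanning the text once per keyword. A real system would likely need to handle keywords that appear without a colon or inside values, which this sketch does not attempt.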