Power grid domain phrase identification and classification method and system based on Baidu encyclopedia
The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency...
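Main authors: | CHEN YING, LIU XINGWEI, PI JUNBO, XU ZHENGQI, WU KUN, FAN SHIXIONG, LIAO ZHIFANG, LI BIN, LI ZEKE, LIN JINGHUAI, FAN HAIWEI, WANG JING, HAN YE, FENG CHANGYOU |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Online access: | Order full text |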
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | CHEN YING ; LIU XINGWEI ; PI JUNBO ; XU ZHENGQI ; WU KUN ; FAN SHIXIONG ; LIAO ZHIFANG ; LI BIN ; LI ZEKE ; LIN JINGHUAI ; FAN HAIWEI ; WANG JING ; HAN YE ; FENG CHANGYOU |
description | The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN111552809A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN111552809A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN111552809A3</originalsourceid><addsrcrecordid>eNqNizEKwkAQRbexEPUO4wEEowS01KBYiYV9GHcm2YFkd9lZkdzeIIKt1YP__psadwsvTtAmIaDQo3iILqEyCLHP0ojFLMEDegLboepv6jm7QB-jg2bu4TGGBKM6otAT2NvBdiEyCc7NpMFOefHlzCzPp3t1WXEMNWtEy55zXV2LoijLzW69P2z_-bwBFbhAcA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><source>esp@cenet</source><creator>CHEN YING ; LIU XINGWEI ; PI JUNBO ; XU ZHENGQI ; WU KUN ; FAN SHIXIONG ; LIAO ZHIFANG ; LI BIN ; LI ZEKE ; LIN JINGHUAI ; FAN HAIWEI ; WANG JING ; HAN YE ; FENG CHANGYOU</creator><creatorcontrib>CHEN YING ; LIU XINGWEI ; PI JUNBO ; XU ZHENGQI ; WU KUN ; FAN SHIXIONG ; LIAO ZHIFANG ; LI BIN ; LI ZEKE ; LIN JINGHUAI ; FAN HAIWEI ; WANG JING ; HAN YE ; FENG CHANGYOU</creatorcontrib><description>The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. 
The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2020</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20200818&DB=EPODOC&CC=CN&NR=111552809A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20200818&DB=EPODOC&CC=CN&NR=111552809A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN YING</creatorcontrib><creatorcontrib>LIU XINGWEI</creatorcontrib><creatorcontrib>PI JUNBO</creatorcontrib><creatorcontrib>XU 
ZHENGQI</creatorcontrib><creatorcontrib>WU KUN</creatorcontrib><creatorcontrib>FAN SHIXIONG</creatorcontrib><creatorcontrib>LIAO ZHIFANG</creatorcontrib><creatorcontrib>LI BIN</creatorcontrib><creatorcontrib>LI ZEKE</creatorcontrib><creatorcontrib>LIN JINGHUAI</creatorcontrib><creatorcontrib>FAN HAIWEI</creatorcontrib><creatorcontrib>WANG JING</creatorcontrib><creatorcontrib>HAN YE</creatorcontrib><creatorcontrib>FENG CHANGYOU</creatorcontrib><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><description>The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA 
PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2020</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNizEKwkAQRbexEPUO4wEEowS01KBYiYV9GHcm2YFkd9lZkdzeIIKt1YP__psadwsvTtAmIaDQo3iILqEyCLHP0ojFLMEDegLboepv6jm7QB-jg2bu4TGGBKM6otAT2NvBdiEyCc7NpMFOefHlzCzPp3t1WXEMNWtEy55zXV2LoijLzW69P2z_-bwBFbhAcA</recordid><startdate>20200818</startdate><enddate>20200818</enddate><creator>CHEN YING</creator><creator>LIU XINGWEI</creator><creator>PI JUNBO</creator><creator>XU ZHENGQI</creator><creator>WU KUN</creator><creator>FAN SHIXIONG</creator><creator>LIAO ZHIFANG</creator><creator>LI BIN</creator><creator>LI ZEKE</creator><creator>LIN JINGHUAI</creator><creator>FAN HAIWEI</creator><creator>WANG JING</creator><creator>HAN YE</creator><creator>FENG CHANGYOU</creator><scope>EVB</scope></search><sort><creationdate>20200818</creationdate><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><author>CHEN YING ; LIU XINGWEI ; PI JUNBO ; XU ZHENGQI ; WU KUN ; FAN SHIXIONG ; LIAO ZHIFANG ; LI BIN ; LI ZEKE ; LIN JINGHUAI ; FAN HAIWEI ; WANG JING ; HAN YE ; FENG CHANGYOU</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN111552809A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2020</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN YING</creatorcontrib><creatorcontrib>LIU XINGWEI</creatorcontrib><creatorcontrib>PI JUNBO</creatorcontrib><creatorcontrib>XU ZHENGQI</creatorcontrib><creatorcontrib>WU KUN</creatorcontrib><creatorcontrib>FAN SHIXIONG</creatorcontrib><creatorcontrib>LIAO 
ZHIFANG</creatorcontrib><creatorcontrib>LI BIN</creatorcontrib><creatorcontrib>LI ZEKE</creatorcontrib><creatorcontrib>LIN JINGHUAI</creatorcontrib><creatorcontrib>FAN HAIWEI</creatorcontrib><creatorcontrib>WANG JING</creatorcontrib><creatorcontrib>HAN YE</creatorcontrib><creatorcontrib>FENG CHANGYOU</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN YING</au><au>LIU XINGWEI</au><au>PI JUNBO</au><au>XU ZHENGQI</au><au>WU KUN</au><au>FAN SHIXIONG</au><au>LIAO ZHIFANG</au><au>LI BIN</au><au>LI ZEKE</au><au>LIN JINGHUAI</au><au>FAN HAIWEI</au><au>WANG JING</au><au>HAN YE</au><au>FENG CHANGYOU</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><date>2020-08-18</date><risdate>2020</risdate><abstract>The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. 
The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN111552809A |
source | esp@cenet |
subjects | CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS |
title | Power grid domain phrase identification and classification method and system based on Baidu encyclopedia |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T17%3A57%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20YING&rft.date=2020-08-18&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN111552809A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
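
The steps named in the description field (frequency thresholding over a corpus C with threshold t, redundant phrase filtering, encyclopedia lookup, then classification of the remaining "legal" phrases) can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the record does not say how redundancy is defined, how Baidu encyclopedia is queried, or what the pre-trained model looks like. The redundancy rule used here (drop a phrase contained in a longer candidate with the same count), the injected `lookup` and `classify` callables, and all function names are assumptions made for illustration.

```python
from collections import Counter
from typing import Callable, Dict, Iterable, List, Optional, Tuple

Phrase = Tuple[str, ...]


def extract_candidates(corpus: Iterable[List[str]], t: int, max_len: int = 3) -> Dict[Phrase, int]:
    """Count token n-grams (length 1..max_len) and keep those with frequency >= t."""
    counts: Counter = Counter()
    for tokens in corpus:
        for n in range(1, max_len + 1):
            for i in range(len(tokens) - n + 1):
                counts[tuple(tokens[i:i + n])] += 1
    return {p: c for p, c in counts.items() if c >= t}


def is_subphrase(short: Phrase, long: Phrase) -> bool:
    """True if `short` occurs contiguously inside the strictly longer `long`."""
    n = len(short)
    return n < len(long) and any(long[i:i + n] == short for i in range(len(long) - n + 1))


def filter_redundant(cands: Dict[Phrase, int]) -> Dict[Phrase, int]:
    """Assumed rule: a phrase is redundant if a longer candidate with the same count contains it."""
    return {
        p: c for p, c in cands.items()
        if not any(c == c2 and is_subphrase(p, q) for q, c2 in cands.items())
    }


def identify_and_classify(
    corpus: Iterable[List[str]],
    t: int,
    lookup: Callable[[str], Optional[str]],   # hypothetical: returns entry text, or None if no entry
    classify: Callable[[str, str], str],      # hypothetical stand-in for the pre-trained model
    sep: str = "",
) -> Dict[str, str]:
    """Run the four steps and return {legal phrase: predicted class}."""
    cands = filter_redundant(extract_candidates(corpus, t))
    result: Dict[str, str] = {}
    for phrase in sorted(sep.join(p) for p in cands):
        explanation = lookup(phrase)
        if explanation is None:
            continue  # no entry explanation found: treated as an illegal phrase and removed
        result[phrase] = classify(phrase, explanation)
    return result
```

Passing `lookup` and `classify` as callables keeps the sketch free of any real network or model dependency; a caller would supply its own crawler and its own trained classifier. The `sep` parameter defaults to the empty string on the assumption that tokens are Chinese words, and can be set to a space for English test data.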