Power grid domain phrase identification and classification method and system based on Baidu encyclopedia

The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHEN YING, LIU XINGWEI, PI JUNBO, XU ZHENGQI, WU KUN, FAN SHIXIONG, LIAO ZHIFANG, LI BIN, LI ZEKE, LIN JINGHUAI, FAN HAIWEI, WANG JING, HAN YE, FENG CHANGYOU
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHEN YING
LIU XINGWEI
PI JUNBO
XU ZHENGQI
WU KUN
FAN SHIXIONG
LIAO ZHIFANG
LI BIN
LI ZEKE
LIN JINGHUAI
FAN HAIWEI
WANG JING
HAN YE
FENG CHANGYOU
description The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN111552809A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN111552809A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN111552809A3</originalsourceid><addsrcrecordid>eNqNizEKwkAQRbexEPUO4wEEowS01KBYiYV9GHcm2YFkd9lZkdzeIIKt1YP__psadwsvTtAmIaDQo3iILqEyCLHP0ojFLMEDegLboepv6jm7QB-jg2bu4TGGBKM6otAT2NvBdiEyCc7NpMFOefHlzCzPp3t1WXEMNWtEy55zXV2LoijLzW69P2z_-bwBFbhAcA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><source>esp@cenet</source><creator>CHEN YING ; LIU XINGWEI ; PI JUNBO ; XU ZHENGQI ; WU KUN ; FAN SHIXIONG ; LIAO ZHIFANG ; LI BIN ; LI ZEKE ; LIN JINGHUAI ; FAN HAIWEI ; WANG JING ; HAN YE ; FENG CHANGYOU</creator><creatorcontrib>CHEN YING ; LIU XINGWEI ; PI JUNBO ; XU ZHENGQI ; WU KUN ; FAN SHIXIONG ; LIAO ZHIFANG ; LI BIN ; LI ZEKE ; LIN JINGHUAI ; FAN HAIWEI ; WANG JING ; HAN YE ; FENG CHANGYOU</creatorcontrib><description>The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2020</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200818&amp;DB=EPODOC&amp;CC=CN&amp;NR=111552809A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200818&amp;DB=EPODOC&amp;CC=CN&amp;NR=111552809A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN YING</creatorcontrib><creatorcontrib>LIU XINGWEI</creatorcontrib><creatorcontrib>PI JUNBO</creatorcontrib><creatorcontrib>XU ZHENGQI</creatorcontrib><creatorcontrib>WU KUN</creatorcontrib><creatorcontrib>FAN SHIXIONG</creatorcontrib><creatorcontrib>LIAO ZHIFANG</creatorcontrib><creatorcontrib>LI BIN</creatorcontrib><creatorcontrib>LI ZEKE</creatorcontrib><creatorcontrib>LIN JINGHUAI</creatorcontrib><creatorcontrib>FAN HAIWEI</creatorcontrib><creatorcontrib>WANG JING</creatorcontrib><creatorcontrib>HAN YE</creatorcontrib><creatorcontrib>FENG CHANGYOU</creatorcontrib><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><description>The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2020</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNizEKwkAQRbexEPUO4wEEowS01KBYiYV9GHcm2YFkd9lZkdzeIIKt1YP__psadwsvTtAmIaDQo3iILqEyCLHP0ojFLMEDegLboepv6jm7QB-jg2bu4TGGBKM6otAT2NvBdiEyCc7NpMFOefHlzCzPp3t1WXEMNWtEy55zXV2LoijLzW69P2z_-bwBFbhAcA</recordid><startdate>20200818</startdate><enddate>20200818</enddate><creator>CHEN YING</creator><creator>LIU XINGWEI</creator><creator>PI JUNBO</creator><creator>XU ZHENGQI</creator><creator>WU KUN</creator><creator>FAN SHIXIONG</creator><creator>LIAO ZHIFANG</creator><creator>LI BIN</creator><creator>LI ZEKE</creator><creator>LIN JINGHUAI</creator><creator>FAN HAIWEI</creator><creator>WANG JING</creator><creator>HAN YE</creator><creator>FENG CHANGYOU</creator><scope>EVB</scope></search><sort><creationdate>20200818</creationdate><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><author>CHEN YING ; LIU XINGWEI ; PI JUNBO ; XU ZHENGQI ; WU KUN ; FAN SHIXIONG ; LIAO ZHIFANG ; LI BIN ; LI ZEKE ; LIN JINGHUAI ; FAN HAIWEI ; WANG JING ; HAN YE ; FENG CHANGYOU</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN111552809A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2020</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN YING</creatorcontrib><creatorcontrib>LIU XINGWEI</creatorcontrib><creatorcontrib>PI JUNBO</creatorcontrib><creatorcontrib>XU ZHENGQI</creatorcontrib><creatorcontrib>WU KUN</creatorcontrib><creatorcontrib>FAN SHIXIONG</creatorcontrib><creatorcontrib>LIAO ZHIFANG</creatorcontrib><creatorcontrib>LI BIN</creatorcontrib><creatorcontrib>LI ZEKE</creatorcontrib><creatorcontrib>LIN JINGHUAI</creatorcontrib><creatorcontrib>FAN HAIWEI</creatorcontrib><creatorcontrib>WANG JING</creatorcontrib><creatorcontrib>HAN YE</creatorcontrib><creatorcontrib>FENG CHANGYOU</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN YING</au><au>LIU XINGWEI</au><au>PI JUNBO</au><au>XU ZHENGQI</au><au>WU KUN</au><au>FAN SHIXIONG</au><au>LIAO ZHIFANG</au><au>LI BIN</au><au>LI ZEKE</au><au>LIN JINGHUAI</au><au>FAN HAIWEI</au><au>WANG JING</au><au>HAN YE</au><au>FENG CHANGYOU</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Power grid domain phrase identification and classification method and system based on Baidu encyclopedia</title><date>2020-08-18</date><risdate>2020</risdate><abstract>The invention discloses a power grid domain phrase identification and classification method and system based on Baidu encyclopedia. The method comprises the steps of extracting phrases of which the occurrence frequency is greater than or equal to a threshold t from a given corpus C as high-frequency candidate phrases; carrying out redundant phrase filtering on the extracted high-frequency candidate phrases; crawling entry explanations corresponding to the remaining high-frequency candidate phrases after phrase filtering from Baidu encyclopedia on the Internet; regarding the high-frequency candidate phrases which cannot be crawled to the entry explanation as illegal phrases to be removed, and regarding the high-frequency candidate phrases which can be crawled to the entry explanation as legal phrases to be reserved; and recognizing and classifying the high-frequency candidate phrases which are regarded as legal phrases through a pre-trained power grid domain phrase recognition and classification model, and out</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN111552809A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Power grid domain phrase identification and classification method and system based on Baidu encyclopedia
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T17%3A57%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20YING&rft.date=2020-08-18&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN111552809A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true