Text classification method for open network questions in specific field

The invention belongs to the technical field of text classification processing, and particularly relates to a text classification method for open network questions in a specific field. According to the method, the problems of lack of enough available corpus sets with category marks, low network text...

Bibliographic details
Main authors: YU RICHANG, YANG HUI, LI RONGSHENG, LIU WANGYANG, ZHANG BAIJIA, HUANG SHAOBIN, SHEN LINSHAN, LI YI
Format: Patent
Language: chi ; eng
container_end_page
container_issue
container_start_page
container_title
container_volume
creator YU RICHANG
YANG HUI
LI RONGSHENG
LIU WANGYANG
ZHANG BAIJIA
HUANG SHAOBIN
SHEN LINSHAN
LI YI
description The invention belongs to the technical field of text classification processing, and particularly relates to a text classification method for open network questions in a specific field. According to the method, the problems of lack of enough available corpus sets with category marks, low network text information amount and high noise under the condition of executing network open text classification tasks in certain specific fields are solved, and a new method is provided for hierarchical classification of open network questions in the fields. According to the method, open network questions and written texts in a specific domain are utilized to enable word embedding representation in the domain to better conform to domain knowledge features, and meanwhile, a semi-supervised method is used for accelerating classification model training and reducing required marked samples; and in addition, category classification at a multi-granularity level is realized in combination with conditional probability. The method can a
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN111046179A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN111046179A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN111046179A3</originalsourceid><addsrcrecordid>eNrjZHAPSa0oUUjOSSwuzkzLTE4syczPU8hNLcnIT1FIyy9SyC9IzVPISy0pzy_KVigsTS0GKShWyMxTKC5ITQZpUUjLTM1J4WFgTUvMKU7lhdLcDIpuriHOHrqpBfnxqcUFicmpQFPinf0MDQ0NTMwMzS0djYlRAwCd5zRL</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Text classification method for open network questions in specific field</title><source>esp@cenet</source><creator>YU RICHANG ; YANG HUI ; LI RONGSHENG ; LIU WANGYANG ; ZHANG BAIJIA ; HUANG SHAOBIN ; SHEN LINSHAN ; LI YI</creator><creatorcontrib>YU RICHANG ; YANG HUI ; LI RONGSHENG ; LIU WANGYANG ; ZHANG BAIJIA ; HUANG SHAOBIN ; SHEN LINSHAN ; LI YI</creatorcontrib><description>The invention belongs to the technical field of text classification processing, and particularly relates to a text classification method for open network questions in a specific field. According to the method, the problems of lack of enough available corpus sets with category marks, low network text information amount and high noise under the condition of executing network open text classificationtasks in certain specific fields are solved, and a new method is provided for hierarchical classification of open network questions in the fields. 
According to the method, open network questions andwritten texts in a specific domain are utilized to enable word embedding representation in the domain to better conform to domain knowledge features, and meanwhile, a semi-supervised method is used for accelerating classification model training and reducing required marked samples; and in addition, category classification at a multi-granularity level is realized in combination with conditional probability. The method can a</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; HANDLING RECORD CARRIERS ; PHYSICS ; PRESENTATION OF DATA ; RECOGNITION OF DATA ; RECORD CARRIERS</subject><creationdate>2020</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200421&amp;DB=EPODOC&amp;CC=CN&amp;NR=111046179A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,309,781,886,25569,76552</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200421&amp;DB=EPODOC&amp;CC=CN&amp;NR=111046179A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>YU RICHANG</creatorcontrib><creatorcontrib>YANG HUI</creatorcontrib><creatorcontrib>LI RONGSHENG</creatorcontrib><creatorcontrib>LIU WANGYANG</creatorcontrib><creatorcontrib>ZHANG BAIJIA</creatorcontrib><creatorcontrib>HUANG SHAOBIN</creatorcontrib><creatorcontrib>SHEN LINSHAN</creatorcontrib><creatorcontrib>LI YI</creatorcontrib><title>Text classification method for open network questions in specific 
field</title><description>The invention belongs to the technical field of text classification processing, and particularly relates to a text classification method for open network questions in a specific field. According to the method, the problems of lack of enough available corpus sets with category marks, low network text information amount and high noise under the condition of executing network open text classificationtasks in certain specific fields are solved, and a new method is provided for hierarchical classification of open network questions in the fields. According to the method, open network questions andwritten texts in a specific domain are utilized to enable word embedding representation in the domain to better conform to domain knowledge features, and meanwhile, a semi-supervised method is used for accelerating classification model training and reducing required marked samples; and in addition, category classification at a multi-granularity level is realized in combination with conditional probability. 
The method can a</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>HANDLING RECORD CARRIERS</subject><subject>PHYSICS</subject><subject>PRESENTATION OF DATA</subject><subject>RECOGNITION OF DATA</subject><subject>RECORD CARRIERS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2020</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHAPSa0oUUjOSSwuzkzLTE4syczPU8hNLcnIT1FIyy9SyC9IzVPISy0pzy_KVigsTS0GKShWyMxTKC5ITQZpUUjLTM1J4WFgTUvMKU7lhdLcDIpuriHOHrqpBfnxqcUFicmpQFPinf0MDQ0NTMwMzS0djYlRAwCd5zRL</recordid><startdate>20200421</startdate><enddate>20200421</enddate><creator>YU RICHANG</creator><creator>YANG HUI</creator><creator>LI RONGSHENG</creator><creator>LIU WANGYANG</creator><creator>ZHANG BAIJIA</creator><creator>HUANG SHAOBIN</creator><creator>SHEN LINSHAN</creator><creator>LI YI</creator><scope>EVB</scope></search><sort><creationdate>20200421</creationdate><title>Text classification method for open network questions in specific field</title><author>YU RICHANG ; YANG HUI ; LI RONGSHENG ; LIU WANGYANG ; ZHANG BAIJIA ; HUANG SHAOBIN ; SHEN LINSHAN ; LI YI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN111046179A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2020</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>HANDLING RECORD CARRIERS</topic><topic>PHYSICS</topic><topic>PRESENTATION OF DATA</topic><topic>RECOGNITION OF DATA</topic><topic>RECORD CARRIERS</topic><toplevel>online_resources</toplevel><creatorcontrib>YU RICHANG</creatorcontrib><creatorcontrib>YANG 
HUI</creatorcontrib><creatorcontrib>LI RONGSHENG</creatorcontrib><creatorcontrib>LIU WANGYANG</creatorcontrib><creatorcontrib>ZHANG BAIJIA</creatorcontrib><creatorcontrib>HUANG SHAOBIN</creatorcontrib><creatorcontrib>SHEN LINSHAN</creatorcontrib><creatorcontrib>LI YI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>YU RICHANG</au><au>YANG HUI</au><au>LI RONGSHENG</au><au>LIU WANGYANG</au><au>ZHANG BAIJIA</au><au>HUANG SHAOBIN</au><au>SHEN LINSHAN</au><au>LI YI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Text classification method for open network questions in specific field</title><date>2020-04-21</date><risdate>2020</risdate><abstract>The invention belongs to the technical field of text classification processing, and particularly relates to a text classification method for open network questions in a specific field. According to the method, the problems of lack of enough available corpus sets with category marks, low network text information amount and high noise under the condition of executing network open text classificationtasks in certain specific fields are solved, and a new method is provided for hierarchical classification of open network questions in the fields. According to the method, open network questions andwritten texts in a specific domain are utilized to enable word embedding representation in the domain to better conform to domain knowledge features, and meanwhile, a semi-supervised method is used for accelerating classification model training and reducing required marked samples; and in addition, category classification at a multi-granularity level is realized in combination with conditional probability. The method can a</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN111046179A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
HANDLING RECORD CARRIERS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
title Text classification method for open network questions in specific field
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T07%3A12%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=YU%20RICHANG&rft.date=2020-04-21&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN111046179A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true