Literature new word discovery method and system based on mask language model

The invention discloses a method and system for discovering new words in literature based on a mask language model, and belongs to the technical field of natural language processing in artificial intelligence. The method employs a mask language training component, a model dependency relationship operation compon...

Bibliographic details
Main authors: YANG XI, GU GANG, ZHU JIABING, YIN JINGGANG
Format: Patent
Language: chi ; eng
creator YANG XI
GU GANG
ZHU JIABING
YIN JINGGANG
description The invention discloses a method and system for discovering new words in literature based on a mask language model, and belongs to the technical field of natural language processing in artificial intelligence. The method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component. The mask language training component performs data cleaning and sentence segmentation on the literature data. An Attention mechanism and a feedforward neural network are constructed on the Word Embedding vector representation of the training set and combined into a group of Encoders, from which an Encoder training model is built. The coded training set is then randomly masked: the partially masked token sequence is used as training input and the masked tokens are used as output. This scheme serves as a data generator for training the deep bidirectional representation network. According t
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN113901811A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN113901811A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN113901811A3</originalsourceid><addsrcrecordid>eNqNyjEOgkAQBVAaC6PeYTyAiRsaLQ3RWBArezIyXyTu7pCdRcLtbTyA1WvesqjrPiNxHhMoYqJJk5D01uoHaaaA_FIhjkI2W0agBxuENFJge5Pn2I3cgYIK_LpYPNkbNj9XxfZyvlfXHQZtYAO3iMhNdXOuPO7dwblT-c_5AqRBNbo</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Literature new word discovery method and system based on mask language model</title><source>esp@cenet</source><creator>YANG XI ; GU GANG ; ZHU JIABING ; YIN JINGGANG</creator><creatorcontrib>YANG XI ; GU GANG ; ZHU JIABING ; YIN JINGGANG</creatorcontrib><description>The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component, the mask language training component carries out data cleaning and sentence segment segmentation on literature data; an Attention mechanism and a feedforward neural network are constructed through a training composition vector identifier Word Embedding of the training set, combining the Attention mechanism and the feedforward neural network into a group of Encoder, and constructing an Encoder training model; the coded training set is subjected to random shielding, part of the input token is used as training set input, the shielded token is used as output, the mode is used as a data generator, and the deep bidirectional representation network is trained. 
According t</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220107&amp;DB=EPODOC&amp;CC=CN&amp;NR=113901811A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220107&amp;DB=EPODOC&amp;CC=CN&amp;NR=113901811A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>YANG XI</creatorcontrib><creatorcontrib>GU GANG</creatorcontrib><creatorcontrib>ZHU JIABING</creatorcontrib><creatorcontrib>YIN JINGGANG</creatorcontrib><title>Literature new word discovery method and system based on mask language model</title><description>The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component, the mask language training component carries out data cleaning and sentence segment segmentation on literature data; an Attention mechanism and a feedforward neural network are constructed through a training composition vector identifier Word Embedding of the training set, combining the Attention mechanism and the feedforward neural network into a group of Encoder, and 
constructing an Encoder training model; the coded training set is subjected to random shielding, part of the input token is used as training set input, the shielded token is used as output, the mode is used as a data generator, and the deep bidirectional representation network is trained. According t</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyjEOgkAQBVAaC6PeYTyAiRsaLQ3RWBArezIyXyTu7pCdRcLtbTyA1WvesqjrPiNxHhMoYqJJk5D01uoHaaaA_FIhjkI2W0agBxuENFJge5Pn2I3cgYIK_LpYPNkbNj9XxfZyvlfXHQZtYAO3iMhNdXOuPO7dwblT-c_5AqRBNbo</recordid><startdate>20220107</startdate><enddate>20220107</enddate><creator>YANG XI</creator><creator>GU GANG</creator><creator>ZHU JIABING</creator><creator>YIN JINGGANG</creator><scope>EVB</scope></search><sort><creationdate>20220107</creationdate><title>Literature new word discovery method and system based on mask language model</title><author>YANG XI ; GU GANG ; ZHU JIABING ; YIN JINGGANG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN113901811A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>YANG XI</creatorcontrib><creatorcontrib>GU GANG</creatorcontrib><creatorcontrib>ZHU JIABING</creatorcontrib><creatorcontrib>YIN JINGGANG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote 
Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>YANG XI</au><au>GU GANG</au><au>ZHU JIABING</au><au>YIN JINGGANG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Literature new word discovery method and system based on mask language model</title><date>2022-01-07</date><risdate>2022</risdate><abstract>The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component, the mask language training component carries out data cleaning and sentence segment segmentation on literature data; an Attention mechanism and a feedforward neural network are constructed through a training composition vector identifier Word Embedding of the training set, combining the Attention mechanism and the feedforward neural network into a group of Encoder, and constructing an Encoder training model; the coded training set is subjected to random shielding, part of the input token is used as training set input, the shielded token is used as output, the mode is used as a data generator, and the deep bidirectional representation network is trained. According t</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
language chi ; eng
recordid cdi_epo_espacenet_CN113901811A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Literature new word discovery method and system based on mask language model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T11%3A09%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=YANG%20XI&rft.date=2022-01-07&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113901811A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
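
The random masking step named in the description (part of each token sequence is kept as input, the masked tokens become the output, and the scheme acts as a data generator) can be illustrated with a minimal sketch. It is not the patent's implementation: the function names, the "[MASK]" symbol and the 0.15 masking rate are assumptions chosen for illustration, since the record does not specify any of them.

```python
import random

MASK = "[MASK]"  # hypothetical mask symbol; the record does not name one


def mask_tokens(tokens, mask_prob=0.15, rng=None):
    """Randomly mask tokens and return (masked_input, targets).

    targets maps each masked position to the original token, so a model
    can be trained to predict the hidden tokens from the visible ones.
    mask_prob=0.15 is an assumed rate, not a value from the record.
    """
    if rng is None:
        rng = random.Random()
    masked, targets = [], {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:
            masked.append(MASK)   # hide this token in the input
            targets[i] = tok      # remember it as the expected output
        else:
            masked.append(tok)
    return masked, targets


def data_generator(sentences, mask_prob=0.15, seed=0):
    """Yield one (masked_input, targets) pair per tokenized sentence."""
    rng = random.Random(seed)
    for tokens in sentences:
        yield mask_tokens(tokens, mask_prob, rng)


if __name__ == "__main__":
    corpus = [list("mask language model"), list("new word discovery")]
    for masked, targets in data_generator(corpus, mask_prob=0.3):
        print("".join(t if t != MASK else "_" for t in masked), targets)
```

The sketch works on characters for brevity; a real pipeline would feed the masked sequences, after cleaning and sentence segmentation, into the Encoder model that the description outlines.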