Literature new word discovery method and system based on mask language model
The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation compon...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | YANG XI GU GANG ZHU JIABING YIN JINGGANG |
description | The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component, the mask language training component carries out data cleaning and sentence segment segmentation on literature data; an Attention mechanism and a feedforward neural network are constructed through a training composition vector identifier Word Embedding of the training set, combining the Attention mechanism and the feedforward neural network into a group of Encoder, and constructing an Encoder training model; the coded training set is subjected to random shielding, part of the input token is used as training set input, the shielded token is used as output, the mode is used as a data generator, and the deep bidirectional representation network is trained. According t |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN113901811A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN113901811A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN113901811A3</originalsourceid><addsrcrecordid>eNqNyjEOgkAQBVAaC6PeYTyAiRsaLQ3RWBArezIyXyTu7pCdRcLtbTyA1WvesqjrPiNxHhMoYqJJk5D01uoHaaaA_FIhjkI2W0agBxuENFJge5Pn2I3cgYIK_LpYPNkbNj9XxfZyvlfXHQZtYAO3iMhNdXOuPO7dwblT-c_5AqRBNbo</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Literature new word discovery method and system based on mask language model</title><source>esp@cenet</source><creator>YANG XI ; GU GANG ; ZHU JIABING ; YIN JINGGANG</creator><creatorcontrib>YANG XI ; GU GANG ; ZHU JIABING ; YIN JINGGANG</creatorcontrib><description>The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component, the mask language training component carries out data cleaning and sentence segment segmentation on literature data; an Attention mechanism and a feedforward neural network are constructed through a training composition vector identifier Word Embedding of the training set, combining the Attention mechanism and the feedforward neural network into a group of Encoder, and constructing an Encoder training model; the coded training set is subjected to random shielding, part of the input token is used as training set input, the shielded token is used as output, the mode is used as a data generator, and the deep bidirectional representation network is trained. According t</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220107&DB=EPODOC&CC=CN&NR=113901811A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220107&DB=EPODOC&CC=CN&NR=113901811A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>YANG XI</creatorcontrib><creatorcontrib>GU GANG</creatorcontrib><creatorcontrib>ZHU JIABING</creatorcontrib><creatorcontrib>YIN JINGGANG</creatorcontrib><title>Literature new word discovery method and system based on mask language model</title><description>The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component, the mask language training component carries out data cleaning and sentence segment segmentation on literature data; an Attention mechanism and a feedforward neural network are constructed through a training composition vector identifier Word Embedding of the training set, combining the Attention mechanism and the feedforward neural network into a group of Encoder, and constructing an Encoder training model; the coded training set is subjected to random shielding, part of the input token is used as training set input, the shielded token is used as output, the mode is used as a data generator, and the deep bidirectional representation network is trained. According t</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyjEOgkAQBVAaC6PeYTyAiRsaLQ3RWBArezIyXyTu7pCdRcLtbTyA1WvesqjrPiNxHhMoYqJJk5D01uoHaaaA_FIhjkI2W0agBxuENFJge5Pn2I3cgYIK_LpYPNkbNj9XxfZyvlfXHQZtYAO3iMhNdXOuPO7dwblT-c_5AqRBNbo</recordid><startdate>20220107</startdate><enddate>20220107</enddate><creator>YANG XI</creator><creator>GU GANG</creator><creator>ZHU JIABING</creator><creator>YIN JINGGANG</creator><scope>EVB</scope></search><sort><creationdate>20220107</creationdate><title>Literature new word discovery method and system based on mask language model</title><author>YANG XI ; GU GANG ; ZHU JIABING ; YIN JINGGANG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN113901811A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>YANG XI</creatorcontrib><creatorcontrib>GU GANG</creatorcontrib><creatorcontrib>ZHU JIABING</creatorcontrib><creatorcontrib>YIN JINGGANG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>YANG XI</au><au>GU GANG</au><au>ZHU JIABING</au><au>YIN JINGGANG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Literature new word discovery method and system based on mask language model</title><date>2022-01-07</date><risdate>2022</risdate><abstract>The invention discloses a literature new word discovery method and system based on a mask language model, and belongs to the technical field of artificial intelligence natural language processing, the method employs a mask language training component, a model dependency relationship operation component and a maximum probability operation component, the mask language training component carries out data cleaning and sentence segment segmentation on literature data; an Attention mechanism and a feedforward neural network are constructed through a training composition vector identifier Word Embedding of the training set, combining the Attention mechanism and the feedforward neural network into a group of Encoder, and constructing an Encoder training model; the coded training set is subjected to random shielding, part of the input token is used as training set input, the shielded token is used as output, the mode is used as a data generator, and the deep bidirectional representation network is trained. According t</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN113901811A |
source | esp@cenet |
subjects | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS |
title | Literature new word discovery method and system based on mask language model |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T11%3A09%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=YANG%20XI&rft.date=2022-01-07&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113901811A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |