Text abstract automatic generation method based on XLNet

The invention discloses a text abstract automatic generation method based on XLNet, and mainly solves the problems of low sentence fluency and accuracy in the text abstract automatic generation process. The method comprises the following steps: obtaining paired text and abstract data, and constructi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ZHANG HUAIYU, LIU HONGYING, SHANG FANHUA, SHEN XIONGJIE, WANG ZHONGSHU, CHEN SUNHU
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention discloses a text abstract automatic generation method based on XLNet, and mainly solves the problems of low sentence fluency and accuracy in the text abstract automatic generation process. The method comprises the following steps: obtaining paired text and abstract data, and constructing a training set; constructing a dictionary containing all common vocabularies and characters; building a backbone network by taking the pre-trained XLNet as an encoder and the Transformer-XL as a decoder; performing word segmentation on the text data in the training set and encoding the text datainto vectors to obtain network input, and finely tuning the network; and performing word segmentation and coding on the test text, and sending the test text to the trained network N to obtain an abstract result. The abstract generated by the method has good accuracy and language fluency, and has a certain practical value. 本发明公开了一种基于XLNet的文本摘要自动生成方法,主要解决文本摘要自动生成过程中,句子流畅性和准确性不高的问题。其实现过程是:获取成对的文本、摘要数据,构建训练集;构建一个包含所有常见词汇和字符的词