Cellular user App use data synthesis method based on large language model

The invention belongs to the technical field of data synthesis, and particularly relates to a cellular user App use data synthesis method based on a large language model, which comprises the following steps: (1) a text coding method: carrying out text coding on real table data, and converting App ta...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHEN WENXIONG, LIU SONGLIN, LYU FENG, DUAN SIJING
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention belongs to the technical field of data synthesis, and particularly relates to a cellular user App use data synthesis method based on a large language model, which comprises the following steps: (1) a text coding method: carrying out text coding on real table data, and converting App table data into a text sequence for representation; (2) pre-training fine tuning: performing fine tuning on the pre-training generation type large language model by using a text data set; (3) sampling and synthesizing App data, sampling and generating text sequence data by using the fine-tuned pre-training generative large language model, and converting the text sequence data into table data to obtain a synthesized table data set; compared with the prior art, the method has the following beneficial effects that the user is allowed to perform probability control on the data generation process through various input prompt settings, the user can flexibly define conditions according to own requirements to realize control