Text image adversarial generation system, method, device, and medium based on multi-modal information prompts

Bibliographic details
Main authors: QIN XUNHUI, SHEN FAHAI, LIU KE, SHI FANG
Format: Patent
Language: chi; eng
Description
Summary: The invention provides a text image adversarial generation method based on multi-modal information prompts, which aims to synthesize text images with different character types and interference attributes against a background of a specified style. Rich interference attributes and character type information are introduced as attribute prompts for text generation. These prompts are combined with a random character image and input into an attribute encoder and a character encoder for feature extraction. Meanwhile, a style encoder extracts background style features from a background image, and these background style features are converted in the decoder part. Multi-level deep feature fusion is performed on the attribute and character features to generate a text image that matches the background and has the specified attributes, character content, and style. Multi-task adversarial training is carried out on the generated image, the authenticity of the text image ge