An Impartial Transformer for Story Visualization
Story Visualization is an advanced task of computed vision that targets sequential image synthesis, where the generated samples need to be realistic, faithful to their conditioning and sequentially consistent. Our work proposes a novel architectural and training approach: the Impartial Transformer a...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Story Visualization is an advanced task of computed vision that targets
sequential image synthesis, where the generated samples need to be realistic,
faithful to their conditioning and sequentially consistent. Our work proposes a
novel architectural and training approach: the Impartial Transformer achieves
both text-relevant plausible scenes and sequential consistency utilizing as few
trainable parameters as possible. This enhancement is even able to handle
synthesis of 'hard' samples with occluded objects, achieving improved
evaluation metrics comparing to past approaches. |
---|---|
DOI: | 10.48550/arxiv.2301.03563 |