METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING

Bibliographic details
Main authors: SHUTKIN, Andrey Sergeevich, PLETNEV, Alexander Andreevich, PARKHOMENKO, Denis Vladimirovich, MA, Xiang, ILYIN, Ivan Iurevich, KIRILLOV, Ivan Vladimirovich, LETUNOVSKIY, Alexey Aleksandrovich
Format: Patent
Language: eng; fre; ger
Description
Summary: Methods and apparatuses are described for encoding and decoding image data, including processing two or more representations of the image data with a multi-scale reconstruction network. Feature extraction networks generate a latent representation for each of the representations. The latent representations are processed by a generation neural network to obtain reconstructed image data. The representations are encoded into a bitstream according to a set of split parameters indicating target bitrates.