METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING

Bibliographic details
Main authors:	SHUTKIN, Andrey Sergeevich; PLETNEV, Alexander Andreevich; PARKHOMENKO, Denis Vladimirovich; MA, Xiang; ILYIN, Ivan Iurevich; KIRILLOV, Ivan Vladimirovich; LETUNOVSKIY, Alexey Aleksandrovich
Format:	Patent
Language:	English; French; German
Subjects:	CALCULATING; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS; COMPUTING; COUNTING; ELECTRIC COMMUNICATION TECHNIQUE; ELECTRICITY; PHYSICS; PICTORIAL COMMUNICATION, e.g. TELEVISION

Description
Summary:	Methods and apparatuses are described for encoding and decoding image data, including processing two or more representations of the image data with a multi-scale reconstruction network. Feature extraction networks generate a latent representation for each of the representations. A generation neural network processes the latent representations to obtain reconstructed image data. The representations are encoded into a bitstream according to a set of split parameters indicating target bitrates.
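
The summary does not say how the split parameters map to target bitrates. As an illustration only, the sketch below assumes they are non-negative relative weights applied to a total bitrate budget, one weight per representation. The function name `split_target_bitrates` and the parameters `total_kbps` and `split_params` are hypothetical and do not come from the patent.

```python
def split_target_bitrates(total_kbps, split_params):
    """Return one target bitrate per representation.

    Assumption (not from the patent): each split parameter is a
    non-negative relative weight, and the weights share a single
    total bitrate budget proportionally.
    """
    if not split_params:
        raise ValueError("at least one split parameter is required")
    if any(weight < 0 for weight in split_params):
        raise ValueError("split parameters must be non-negative")

    total_weight = sum(split_params)
    if total_weight == 0:
        raise ValueError("split parameters must not all be zero")

    # Proportional allocation: a representation's share of the budget
    # equals its share of the total weight.
    return [total_kbps * weight / total_weight for weight in split_params]


if __name__ == "__main__":
    # Two representations sharing a 1000 kbit/s budget with weights 3:1.
    print(split_target_bitrates(1000.0, [3, 1]))
```

Under this assumption, changing the weights moves bitrate between the representations without changing the total. A real codec could equally signal absolute per-representation rates, which the summary leaves open.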