METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING

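Methods and apparatuses are described for encoding and decoding of image data, which includes processing two or more representations of the image data by a multi-scale reconstruction network. Feature extraction networks generate latent representations for each of the representations. The latent representations are processed by a generation neural network to obtain reconstructed image data. The representations are encoded into a bitstream according to a set of split parameters indicating target bitrates.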

Detailed description

Bibliographic details
Main authors:	SHUTKIN, Andrey Sergeevich ; PLETNEV, Alexander Andreevich ; PARKHOMENKO, Denis Vladimirovich ; MA, Xiang ; ILYIN, Ivan Iurevich ; KIRILLOV, Ivan Vladimirovich ; LETUNOVSKIY, Alexey Aleksandrovich
Format:	Patent
Language:	eng ; fre ; ger
Subjects:	CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC COMMUNICATION TECHNIQUE ; ELECTRICITY ; PHYSICS ; PICTORIAL COMMUNICATION, e.g. TELEVISION
Online access:	Order full text

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	SHUTKIN, Andrey Sergeevich ; PLETNEV, Alexander Andreevich ; PARKHOMENKO, Denis Vladimirovich ; MA, Xiang ; ILYIN, Ivan Iurevich ; KIRILLOV, Ivan Vladimirovich ; LETUNOVSKIY, Alexey Aleksandrovich
description	Methods and apparatuses are described for encoding and decoding of image data, which includes processing two or more representations of the image data by a multi-scale reconstruction network. Feature extraction networks generate latent representations for each of the representations. The latent representations are processed by a generation neural network to obtain reconstructed image data. The representations are encoded into a bitstream according to a set of split parameters indicating target bitrates.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP4445609A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP4445609A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP4445609A13</originalsourceid><addsrcrecordid>eNrjZDDwdQ3x8HdR8HdTCPN0cfVXcPZ38fRzV3CKVPAN9Qnx1PX1d3H0UQgI8nd2DQ4GyvAwsKYl5hSn8kJpbgYFN9cQZw_d1IL8-NTigsTk1LzUknjXABMTE1MzA0tHQ2MilAAAzCAlvQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING</title><source>esp@cenet</source><creator>SHUTKIN, Andrey Sergeevich ; PLETNEV, Alexander Andreevich ; PARKHOMENKO, Denis Vladimirovich ; MA, Xiang ; ILYIN, Ivan Iurevich ; KIRILLOV, Ivan Vladimirovich ; LETUNOVSKIY, Alexey Aleksandrovich</creator><creatorcontrib>SHUTKIN, Andrey Sergeevich ; PLETNEV, Alexander Andreevich ; PARKHOMENKO, Denis Vladimirovich ; MA, Xiang ; ILYIN, Ivan Iurevich ; KIRILLOV, Ivan Vladimirovich ; LETUNOVSKIY, Alexey Aleksandrovich</creatorcontrib><description>Methods and apparatuses are described for encoding and decoding of image data, which includes processing two or more representations of the image data by a multi-scale reconstruction network. Feature extraction networks generate latent representations for each of the representations. The latent representations are processed by a generation neural network to obtain reconstructed image data. The representations are encoded into a bitstream according to a set of split parameters indication target bitrates.</description><language>eng ; fre ; ger</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC COMMUNICATION TECHNIQUE ; ELECTRICITY ; PHYSICS ; PICTORIAL COMMUNICATION, e.g. 
TELEVISION</subject><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20241016&DB=EPODOC&CC=EP&NR=4445609A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20241016&DB=EPODOC&CC=EP&NR=4445609A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SHUTKIN, Andrey Sergeevich</creatorcontrib><creatorcontrib>PLETNEV, Alexander Andreevich</creatorcontrib><creatorcontrib>PARKHOMENKO, Denis Vladimirovich</creatorcontrib><creatorcontrib>MA, Xiang</creatorcontrib><creatorcontrib>ILYIN, Ivan Iurevich</creatorcontrib><creatorcontrib>KIRILLOV, Ivan Vladimirovich</creatorcontrib><creatorcontrib>LETUNOVSKIY, Alexey Aleksandrovich</creatorcontrib><title>METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING</title><description>Methods and apparatuses are described for encoding and decoding of image data, which includes processing two or more representations of the image data by a multi-scale reconstruction network. Feature extraction networks generate latent representations for each of the representations. The latent representations are processed by a generation neural network to obtain reconstructed image data. 
The representations are encoded into a bitstream according to a set of split parameters indication target bitrates.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC COMMUNICATION TECHNIQUE</subject><subject>ELECTRICITY</subject><subject>PHYSICS</subject><subject>PICTORIAL COMMUNICATION, e.g. TELEVISION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2024</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDwdQ3x8HdR8HdTCPN0cfVXcPZ38fRzV3CKVPAN9Qnx1PX1d3H0UQgI8nd2DQ4GyvAwsKYl5hSn8kJpbgYFN9cQZw_d1IL8-NTigsTk1LzUknjXABMTE1MzA0tHQ2MilAAAzCAlvQ</recordid><startdate>20241016</startdate><enddate>20241016</enddate><creator>SHUTKIN, Andrey Sergeevich</creator><creator>PLETNEV, Alexander Andreevich</creator><creator>PARKHOMENKO, Denis Vladimirovich</creator><creator>MA, Xiang</creator><creator>ILYIN, Ivan Iurevich</creator><creator>KIRILLOV, Ivan Vladimirovich</creator><creator>LETUNOVSKIY, Alexey Aleksandrovich</creator><scope>EVB</scope></search><sort><creationdate>20241016</creationdate><title>METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING</title><author>SHUTKIN, Andrey Sergeevich ; PLETNEV, Alexander Andreevich ; PARKHOMENKO, Denis Vladimirovich ; MA, Xiang ; ILYIN, Ivan Iurevich ; KIRILLOV, Ivan Vladimirovich ; LETUNOVSKIY, Alexey Aleksandrovich</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP4445609A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2024</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC COMMUNICATION TECHNIQUE</topic><topic>ELECTRICITY</topic><topic>PHYSICS</topic><topic>PICTORIAL COMMUNICATION, e.g. 
TELEVISION</topic><toplevel>online_resources</toplevel><creatorcontrib>SHUTKIN, Andrey Sergeevich</creatorcontrib><creatorcontrib>PLETNEV, Alexander Andreevich</creatorcontrib><creatorcontrib>PARKHOMENKO, Denis Vladimirovich</creatorcontrib><creatorcontrib>MA, Xiang</creatorcontrib><creatorcontrib>ILYIN, Ivan Iurevich</creatorcontrib><creatorcontrib>KIRILLOV, Ivan Vladimirovich</creatorcontrib><creatorcontrib>LETUNOVSKIY, Alexey Aleksandrovich</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SHUTKIN, Andrey Sergeevich</au><au>PLETNEV, Alexander Andreevich</au><au>PARKHOMENKO, Denis Vladimirovich</au><au>MA, Xiang</au><au>ILYIN, Ivan Iurevich</au><au>KIRILLOV, Ivan Vladimirovich</au><au>LETUNOVSKIY, Alexey Aleksandrovich</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING</title><date>2024-10-16</date><risdate>2024</risdate><abstract>Methods and apparatuses are described for encoding and decoding of image data, which includes processing two or more representations of the image data by a multi-scale reconstruction network. Feature extraction networks generate latent representations for each of the representations. The latent representations are processed by a generation neural network to obtain reconstructed image data. The representations are encoded into a bitstream according to a set of split parameters indication target bitrates.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng ; fre ; ger
recordid	cdi_epo_espacenet_EP4445609A1
source	esp@cenet
subjects	CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC COMMUNICATION TECHNIQUE ; ELECTRICITY ; PHYSICS ; PICTORIAL COMMUNICATION, e.g. TELEVISION
title	METHOD OF VIDEO CODING BY MULTI-MODAL PROCESSING
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T14%3A47%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SHUTKIN,%20Andrey%20Sergeevich&rft.date=2024-10-16&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP4445609A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true