A Novel Video Coding Framework Using a Self-Adaptive Dictionary

In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adapti...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on circuits and systems for video technology 2018-12, Vol.28 (12), p.3478-3491
Hauptverfasser:	Xue, Yuanyi, Wang, Yao
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptive algorithms Coding Coding standards Counting Dictionaries Discrete cosine transform Distortion Encoding L0-norm minimization Matching pursuit algorithms Mathematical analysis Partitioning algorithms Partitions Predictions Predictive coding Rate-distortion rate-distortion optimization sparse coding Switching theory Video coding Video compression
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	3491
container_issue	12
container_start_page	3478
container_title	IEEE transactions on circuits and systems for video technology
container_volume	28
creator	Xue, Yuanyi Wang, Yao
description	In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified discrete cosine transform (DCT) dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes ( 16\times 16 , 8\times 8 , and 4\times 4 ), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better RD performance, compared with the HEVC video coding standard for P-frames.
doi_str_mv	10.1109/TCSVT.2017.2760143
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2151460805</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8058496</ieee_id><sourcerecordid>2151460805</sourcerecordid><originalsourceid>FETCH-LOGICAL-c246t-91969e15fc7238445aab90c0fe76dc07de8fa11a33fffe81922887e2348ab30b3</originalsourceid><addsrcrecordid>eNo9kMtOwzAQRS0EEqXwA7CJxDrF40fsrFAVKCBVsOhja7nJGLm0dXHSIv4elyJWMyPde2fmEHINdABAy7tpNZlPB4yCGjBVUBD8hPRASp0zRuVp6qmEXDOQ5-SibZc0SbRQPXI_zF7DHlfZ3DcYsio0fvOejaJd41eIH9msPcw2m-DK5cPGbju_x-zB150PGxu_L8mZs6sWr_5qn8xGj9PqOR-_Pb1Uw3FeM1F0eQllUSJIVyvGtRDS2kVJa-pQFU1NVYPaWQDLuXMONZSMaa2QcaHtgtMF75PbY-42hs8dtp1Zhl3cpJUmPQWioJrKpGJHVR1D20Z0Zhv9Op1pgJoDKPMLyhxAmT9QyXRzNHlE_DekOC3Kgv8Az2pi3Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2151460805</pqid></control><display><type>article</type><title>A Novel Video Coding Framework Using a Self-Adaptive Dictionary</title><source>IEEE Electronic Library (IEL)</source><creator>Xue, Yuanyi ; Wang, Yao</creator><creatorcontrib>Xue, Yuanyi ; Wang, Yao</creatorcontrib><description><![CDATA[In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified discrete cosine transform (DCT) dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes (<inline-formula> <tex-math notation="LaTeX">16\times 16 </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">8\times 8 </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">4\times 4 </tex-math></inline-formula>), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better RD performance, compared with the HEVC video coding standard for P-frames.]]></description><identifier>ISSN: 1051-8215</identifier><identifier>EISSN: 1558-2205</identifier><identifier>DOI: 10.1109/TCSVT.2017.2760143</identifier><identifier>CODEN: ITCTEM</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Adaptive algorithms ; Coding ; Coding standards ; Counting ; Dictionaries ; Discrete cosine transform ; Distortion ; Encoding ; L0-norm minimization ; Matching pursuit algorithms ; Mathematical analysis ; Partitioning algorithms ; Partitions ; Predictions ; Predictive coding ; Rate-distortion ; rate-distortion optimization ; sparse coding ; Switching theory ; Video coding ; Video compression</subject><ispartof>IEEE transactions on circuits and systems for video technology, 2018-12, Vol.28 (12), p.3478-3491</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c246t-91969e15fc7238445aab90c0fe76dc07de8fa11a33fffe81922887e2348ab30b3</cites><orcidid>0000-0003-3199-3802 ; 0000-0001-5449-0103</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8058496$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8058496$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Xue, Yuanyi</creatorcontrib><creatorcontrib>Wang, Yao</creatorcontrib><title>A Novel Video Coding Framework Using a Self-Adaptive Dictionary</title><title>IEEE transactions on circuits and systems for video technology</title><addtitle>TCSVT</addtitle><description><![CDATA[In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified discrete cosine transform (DCT) dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes (<inline-formula> <tex-math notation="LaTeX">16\times 16 </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">8\times 8 </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">4\times 4 </tex-math></inline-formula>), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better RD performance, compared with the HEVC video coding standard for P-frames.]]></description><subject>Adaptive algorithms</subject><subject>Coding</subject><subject>Coding standards</subject><subject>Counting</subject><subject>Dictionaries</subject><subject>Discrete cosine transform</subject><subject>Distortion</subject><subject>Encoding</subject><subject>L0-norm minimization</subject><subject>Matching pursuit algorithms</subject><subject>Mathematical analysis</subject><subject>Partitioning algorithms</subject><subject>Partitions</subject><subject>Predictions</subject><subject>Predictive coding</subject><subject>Rate-distortion</subject><subject>rate-distortion optimization</subject><subject>sparse coding</subject><subject>Switching theory</subject><subject>Video coding</subject><subject>Video compression</subject><issn>1051-8215</issn><issn>1558-2205</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kMtOwzAQRS0EEqXwA7CJxDrF40fsrFAVKCBVsOhja7nJGLm0dXHSIv4elyJWMyPde2fmEHINdABAy7tpNZlPB4yCGjBVUBD8hPRASp0zRuVp6qmEXDOQ5-SibZc0SbRQPXI_zF7DHlfZ3DcYsio0fvOejaJd41eIH9msPcw2m-DK5cPGbju_x-zB150PGxu_L8mZs6sWr_5qn8xGj9PqOR-_Pb1Uw3FeM1F0eQllUSJIVyvGtRDS2kVJa-pQFU1NVYPaWQDLuXMONZSMaa2QcaHtgtMF75PbY-42hs8dtp1Zhl3cpJUmPQWioJrKpGJHVR1D20Z0Zhv9Op1pgJoDKPMLyhxAmT9QyXRzNHlE_DekOC3Kgv8Az2pi3Q</recordid><startdate>20181201</startdate><enddate>20181201</enddate><creator>Xue, Yuanyi</creator><creator>Wang, Yao</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0003-3199-3802</orcidid><orcidid>https://orcid.org/0000-0001-5449-0103</orcidid></search><sort><creationdate>20181201</creationdate><title>A Novel Video Coding Framework Using a Self-Adaptive Dictionary</title><author>Xue, Yuanyi ; Wang, Yao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c246t-91969e15fc7238445aab90c0fe76dc07de8fa11a33fffe81922887e2348ab30b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Adaptive algorithms</topic><topic>Coding</topic><topic>Coding standards</topic><topic>Counting</topic><topic>Dictionaries</topic><topic>Discrete cosine transform</topic><topic>Distortion</topic><topic>Encoding</topic><topic>L0-norm minimization</topic><topic>Matching pursuit algorithms</topic><topic>Mathematical analysis</topic><topic>Partitioning algorithms</topic><topic>Partitions</topic><topic>Predictions</topic><topic>Predictive coding</topic><topic>Rate-distortion</topic><topic>rate-distortion optimization</topic><topic>sparse coding</topic><topic>Switching theory</topic><topic>Video coding</topic><topic>Video compression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xue, Yuanyi</creatorcontrib><creatorcontrib>Wang, Yao</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on circuits and systems for video technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Xue, Yuanyi</au><au>Wang, Yao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Novel Video Coding Framework Using a Self-Adaptive Dictionary</atitle><jtitle>IEEE transactions on circuits and systems for video technology</jtitle><stitle>TCSVT</stitle><date>2018-12-01</date><risdate>2018</risdate><volume>28</volume><issue>12</issue><spage>3478</spage><epage>3491</epage><pages>3478-3491</pages><issn>1051-8215</issn><eissn>1558-2205</eissn><coden>ITCTEM</coden><abstract><![CDATA[In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified discrete cosine transform (DCT) dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes (<inline-formula> <tex-math notation="LaTeX">16\times 16 </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">8\times 8 </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">4\times 4 </tex-math></inline-formula>), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better RD performance, compared with the HEVC video coding standard for P-frames.]]></abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TCSVT.2017.2760143</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0003-3199-3802</orcidid><orcidid>https://orcid.org/0000-0001-5449-0103</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1051-8215
ispartof	IEEE transactions on circuits and systems for video technology, 2018-12, Vol.28 (12), p.3478-3491
issn	1051-8215 1558-2205
language	eng
recordid	cdi_proquest_journals_2151460805
source	IEEE Electronic Library (IEL)
subjects	Adaptive algorithms Coding Coding standards Counting Dictionaries Discrete cosine transform Distortion Encoding L0-norm minimization Matching pursuit algorithms Mathematical analysis Partitioning algorithms Partitions Predictions Predictive coding Rate-distortion rate-distortion optimization sparse coding Switching theory Video coding Video compression
title	A Novel Video Coding Framework Using a Self-Adaptive Dictionary
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T15%3A05%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Novel%20Video%20Coding%20Framework%20Using%20a%20Self-Adaptive%20Dictionary&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems%20for%20video%20technology&rft.au=Xue,%20Yuanyi&rft.date=2018-12-01&rft.volume=28&rft.issue=12&rft.spage=3478&rft.epage=3491&rft.pages=3478-3491&rft.issn=1051-8215&rft.eissn=1558-2205&rft.coden=ITCTEM&rft_id=info:doi/10.1109/TCSVT.2017.2760143&rft_dat=%3Cproquest_RIE%3E2151460805%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2151460805&rft_id=info:pmid/&rft_ieee_id=8058496&rfr_iscdi=true