On Transcoding a B-Frame to a P-Frame in the Compressed Domain

Only a limited number of methods have been proposed to realize heterogeneous transcoding, for example from MPEG-2 to H.263, or from H.264 to H.263. The major difficulties of transcoding a B-picture to a P-picture are that the incoming discrete cosine transform (DCT) coefficients of the B-frame are p...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on multimedia 2007-10, Vol.9 (6), p.1093-1102
Hauptverfasser: SIU, Wan-Chi, CHAN, Yui-Lam, FUNG, Kai-Tat
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1102
container_issue 6
container_start_page 1093
container_title IEEE transactions on multimedia
container_volume 9
creator SIU, Wan-Chi
CHAN, Yui-Lam
FUNG, Kai-Tat
description Only a limited number of methods have been proposed to realize heterogeneous transcoding, for example from MPEG-2 to H.263, or from H.264 to H.263. The major difficulties of transcoding a B-picture to a P-picture are that the incoming discrete cosine transform (DCT) coefficients of the B-frame are prediction errors arising from both forward and backward predictions, whilst the prediction errors in the DCT domain arising from the prediction using the previous frame alone are not available. The required new prediction errors need to be re-estimated in the pixel domain. This process involves highly complex computation and introduces re-encoding errors. We propose a new approach to convert a B-picture into a P-picture by making use of some properties of motion compensation in the DCT domain and the direct addition of DCT coefficients. We derive a set of equations and formulate the problem of how to obtain the DCT coefficients. One difficulty is that the last P-frame inside a GOP with an IBBP structure, for example, needs to be transcoded to become the last P-frame in the IPPP structure, and it has to be linked to the previous reconstructed P-frame instead of to the I-frame. We increased the speed of the transcoding process by making use of the motion activity which is expressed in terms of the correlation between pictures. The whole transcoding process is done in the transform domain, hence re-encoding errors are completely avoided. Results from our experimental work show that the proposed video transcoder not only achieves a speed-up of two to six times that of the conventional video transcoder, but it also substantially improves the quality of the video.
doi_str_mv 10.1109/TMM.2007.902895
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TMM_2007_902895</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4303035</ieee_id><sourcerecordid>880680478</sourcerecordid><originalsourceid>FETCH-LOGICAL-c391t-434b53f0dbd4c86a6ebb995f5af58ef30c93fa87403aaba3dbd8b76f5e21ed93</originalsourceid><addsrcrecordid>eNpdkMtLAzEQhxdRsFbPHrwsgnjadrJ5XwStT2iph72HbDbRLfuoyfbgf2_KFgWZQ2aS7zeEL0kuEcwQAjkvVqtZDsBnEnIh6VEyQZKgLN7w49jTHDKZIzhNzkLYACBCgU-Su3WXFl53wfRV3X2kOn3Inr1ubTr0cXg_DHWXDp82XfTt1tsQbJU-9q2uu_PkxOkm2IvDOU2K56di8Zot1y9vi_tlZrBEQ0YwKSl2UJUVMYJpZstSSuqodlRYh8FI7LTgBLDWpcaREyVnjtoc2UriaXI7rt36_mtnw6DaOhjbNLqz_S4oIYAJIFxE8vofuel3vot_U4IRThgnOELzETK-D8Fbp7a-brX_VgjUXqaKMtVephplxsTNYa0ORjcuGjN1-ItJhAhhLHJXI1dba3-fCYZYFP8ADip68g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>864746743</pqid></control><display><type>article</type><title>On Transcoding a B-Frame to a P-Frame in the Compressed Domain</title><source>IEEE Electronic Library (IEL)</source><creator>SIU, Wan-Chi ; CHAN, Yui-Lam ; FUNG, Kai-Tat</creator><creatorcontrib>SIU, Wan-Chi ; CHAN, Yui-Lam ; FUNG, Kai-Tat</creatorcontrib><description>Only a limited number of methods have been proposed to realize heterogeneous transcoding, for example from MPEG-2 to H.263, or from H.264 to H.263. The major difficulties of transcoding a B-picture to a P-picture are that the incoming discrete cosine transform (DCT) coefficients of the B-frame are prediction errors arising from both forward and backward predictions, whilst the prediction errors in the DCT domain arising from the prediction using the previous frame alone are not available. The required new prediction errors need to be re-estimated in the pixel domain. This process involves highly complex computation and introduces re-encoding errors. We propose a new approach to convert a B-picture into a P-picture by making use of some properties of motion compensation in the DCT domain and the direct addition of DCT coefficients. We derive a set of equations and formulate the problem of how to obtain the DCT coefficients. One difficulty is that the last P-frame inside a GOP with an IBBP structure, for example, needs to be transcoded to become the last P-frame in the IPPP structure, and it has to be linked to the previous reconstructed P-frame instead of to the I-frame. We increased the speed of the transcoding process by making use of the motion activity which is expressed in terms of the correlation between pictures. The whole transcoding process is done in the transform domain, hence re-encoding errors are completely avoided. Results from our experimental work show that the proposed video transcoder not only achieves a speed-up of two to six times that of the conventional video transcoder, but it also substantially improves the quality of the video.</description><identifier>ISSN: 1520-9210</identifier><identifier>EISSN: 1941-0077</identifier><identifier>DOI: 10.1109/TMM.2007.902895</identifier><identifier>CODEN: ITMUF8</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Applied sciences ; Artificial intelligence ; B-picture and P-picture ; Bandwidth ; Compressed ; compressed domain processing ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Correlation ; Decoding ; Discrete cosine transforms ; Equations ; Errors ; Exact sciences and technology ; heterogeneous transcoding ; IP networks ; Mathematical analysis ; Motion compensation ; Multimedia ; Pattern recognition. Digital image processing. Computational geometry ; Software ; Studies ; Transcoders ; Transcoding ; Transform coding ; Transforms ; Video coding ; video coding and transcoding ; Video compression</subject><ispartof>IEEE transactions on multimedia, 2007-10, Vol.9 (6), p.1093-1102</ispartof><rights>2007 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2007</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c391t-434b53f0dbd4c86a6ebb995f5af58ef30c93fa87403aaba3dbd8b76f5e21ed93</citedby><cites>FETCH-LOGICAL-c391t-434b53f0dbd4c86a6ebb995f5af58ef30c93fa87403aaba3dbd8b76f5e21ed93</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4303035$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,777,781,793,27905,27906,54739</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4303035$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=19114466$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>SIU, Wan-Chi</creatorcontrib><creatorcontrib>CHAN, Yui-Lam</creatorcontrib><creatorcontrib>FUNG, Kai-Tat</creatorcontrib><title>On Transcoding a B-Frame to a P-Frame in the Compressed Domain</title><title>IEEE transactions on multimedia</title><addtitle>TMM</addtitle><description>Only a limited number of methods have been proposed to realize heterogeneous transcoding, for example from MPEG-2 to H.263, or from H.264 to H.263. The major difficulties of transcoding a B-picture to a P-picture are that the incoming discrete cosine transform (DCT) coefficients of the B-frame are prediction errors arising from both forward and backward predictions, whilst the prediction errors in the DCT domain arising from the prediction using the previous frame alone are not available. The required new prediction errors need to be re-estimated in the pixel domain. This process involves highly complex computation and introduces re-encoding errors. We propose a new approach to convert a B-picture into a P-picture by making use of some properties of motion compensation in the DCT domain and the direct addition of DCT coefficients. We derive a set of equations and formulate the problem of how to obtain the DCT coefficients. One difficulty is that the last P-frame inside a GOP with an IBBP structure, for example, needs to be transcoded to become the last P-frame in the IPPP structure, and it has to be linked to the previous reconstructed P-frame instead of to the I-frame. We increased the speed of the transcoding process by making use of the motion activity which is expressed in terms of the correlation between pictures. The whole transcoding process is done in the transform domain, hence re-encoding errors are completely avoided. Results from our experimental work show that the proposed video transcoder not only achieves a speed-up of two to six times that of the conventional video transcoder, but it also substantially improves the quality of the video.</description><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>B-picture and P-picture</subject><subject>Bandwidth</subject><subject>Compressed</subject><subject>compressed domain processing</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Correlation</subject><subject>Decoding</subject><subject>Discrete cosine transforms</subject><subject>Equations</subject><subject>Errors</subject><subject>Exact sciences and technology</subject><subject>heterogeneous transcoding</subject><subject>IP networks</subject><subject>Mathematical analysis</subject><subject>Motion compensation</subject><subject>Multimedia</subject><subject>Pattern recognition. Digital image processing. Computational geometry</subject><subject>Software</subject><subject>Studies</subject><subject>Transcoders</subject><subject>Transcoding</subject><subject>Transform coding</subject><subject>Transforms</subject><subject>Video coding</subject><subject>video coding and transcoding</subject><subject>Video compression</subject><issn>1520-9210</issn><issn>1941-0077</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2007</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkMtLAzEQhxdRsFbPHrwsgnjadrJ5XwStT2iph72HbDbRLfuoyfbgf2_KFgWZQ2aS7zeEL0kuEcwQAjkvVqtZDsBnEnIh6VEyQZKgLN7w49jTHDKZIzhNzkLYACBCgU-Su3WXFl53wfRV3X2kOn3Inr1ubTr0cXg_DHWXDp82XfTt1tsQbJU-9q2uu_PkxOkm2IvDOU2K56di8Zot1y9vi_tlZrBEQ0YwKSl2UJUVMYJpZstSSuqodlRYh8FI7LTgBLDWpcaREyVnjtoc2UriaXI7rt36_mtnw6DaOhjbNLqz_S4oIYAJIFxE8vofuel3vot_U4IRThgnOELzETK-D8Fbp7a-brX_VgjUXqaKMtVephplxsTNYa0ORjcuGjN1-ItJhAhhLHJXI1dba3-fCYZYFP8ADip68g</recordid><startdate>20071001</startdate><enddate>20071001</enddate><creator>SIU, Wan-Chi</creator><creator>CHAN, Yui-Lam</creator><creator>FUNG, Kai-Tat</creator><general>IEEE</general><general>Institute of Electrical and Electronic Engineers</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>F28</scope><scope>FR3</scope></search><sort><creationdate>20071001</creationdate><title>On Transcoding a B-Frame to a P-Frame in the Compressed Domain</title><author>SIU, Wan-Chi ; CHAN, Yui-Lam ; FUNG, Kai-Tat</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c391t-434b53f0dbd4c86a6ebb995f5af58ef30c93fa87403aaba3dbd8b76f5e21ed93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2007</creationdate><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>B-picture and P-picture</topic><topic>Bandwidth</topic><topic>Compressed</topic><topic>compressed domain processing</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Correlation</topic><topic>Decoding</topic><topic>Discrete cosine transforms</topic><topic>Equations</topic><topic>Errors</topic><topic>Exact sciences and technology</topic><topic>heterogeneous transcoding</topic><topic>IP networks</topic><topic>Mathematical analysis</topic><topic>Motion compensation</topic><topic>Multimedia</topic><topic>Pattern recognition. Digital image processing. Computational geometry</topic><topic>Software</topic><topic>Studies</topic><topic>Transcoders</topic><topic>Transcoding</topic><topic>Transform coding</topic><topic>Transforms</topic><topic>Video coding</topic><topic>video coding and transcoding</topic><topic>Video compression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>SIU, Wan-Chi</creatorcontrib><creatorcontrib>CHAN, Yui-Lam</creatorcontrib><creatorcontrib>FUNG, Kai-Tat</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><jtitle>IEEE transactions on multimedia</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SIU, Wan-Chi</au><au>CHAN, Yui-Lam</au><au>FUNG, Kai-Tat</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>On Transcoding a B-Frame to a P-Frame in the Compressed Domain</atitle><jtitle>IEEE transactions on multimedia</jtitle><stitle>TMM</stitle><date>2007-10-01</date><risdate>2007</risdate><volume>9</volume><issue>6</issue><spage>1093</spage><epage>1102</epage><pages>1093-1102</pages><issn>1520-9210</issn><eissn>1941-0077</eissn><coden>ITMUF8</coden><abstract>Only a limited number of methods have been proposed to realize heterogeneous transcoding, for example from MPEG-2 to H.263, or from H.264 to H.263. The major difficulties of transcoding a B-picture to a P-picture are that the incoming discrete cosine transform (DCT) coefficients of the B-frame are prediction errors arising from both forward and backward predictions, whilst the prediction errors in the DCT domain arising from the prediction using the previous frame alone are not available. The required new prediction errors need to be re-estimated in the pixel domain. This process involves highly complex computation and introduces re-encoding errors. We propose a new approach to convert a B-picture into a P-picture by making use of some properties of motion compensation in the DCT domain and the direct addition of DCT coefficients. We derive a set of equations and formulate the problem of how to obtain the DCT coefficients. One difficulty is that the last P-frame inside a GOP with an IBBP structure, for example, needs to be transcoded to become the last P-frame in the IPPP structure, and it has to be linked to the previous reconstructed P-frame instead of to the I-frame. We increased the speed of the transcoding process by making use of the motion activity which is expressed in terms of the correlation between pictures. The whole transcoding process is done in the transform domain, hence re-encoding errors are completely avoided. Results from our experimental work show that the proposed video transcoder not only achieves a speed-up of two to six times that of the conventional video transcoder, but it also substantially improves the quality of the video.</abstract><cop>New York, NY</cop><pub>IEEE</pub><doi>10.1109/TMM.2007.902895</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1520-9210
ispartof IEEE transactions on multimedia, 2007-10, Vol.9 (6), p.1093-1102
issn 1520-9210
1941-0077
language eng
recordid cdi_crossref_primary_10_1109_TMM_2007_902895
source IEEE Electronic Library (IEL)
subjects Applied sciences
Artificial intelligence
B-picture and P-picture
Bandwidth
Compressed
compressed domain processing
Computer science
control theory
systems
Computer systems and distributed systems. User interface
Correlation
Decoding
Discrete cosine transforms
Equations
Errors
Exact sciences and technology
heterogeneous transcoding
IP networks
Mathematical analysis
Motion compensation
Multimedia
Pattern recognition. Digital image processing. Computational geometry
Software
Studies
Transcoders
Transcoding
Transform coding
Transforms
Video coding
video coding and transcoding
Video compression
title On Transcoding a B-Frame to a P-Frame in the Compressed Domain
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T15%3A34%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=On%20Transcoding%20a%20B-Frame%20to%20a%20P-Frame%20in%20the%20Compressed%20Domain&rft.jtitle=IEEE%20transactions%20on%20multimedia&rft.au=SIU,%20Wan-Chi&rft.date=2007-10-01&rft.volume=9&rft.issue=6&rft.spage=1093&rft.epage=1102&rft.pages=1093-1102&rft.issn=1520-9210&rft.eissn=1941-0077&rft.coden=ITMUF8&rft_id=info:doi/10.1109/TMM.2007.902895&rft_dat=%3Cproquest_RIE%3E880680478%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=864746743&rft_id=info:pmid/&rft_ieee_id=4303035&rfr_iscdi=true