Universal lossless coding for sources with repeating statistics

A lower bound is derived on the achievable redundancy for universal lossless coding of parametric sources with piecewise stationary, abruptly changing, occasionally repeating statistics. In particular, it is shown that if the number of repeating statistical parameter vectors (or states) is not too l...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on information theory 2004-08, Vol.50 (8), p.1620-1635
Hauptverfasser: Shamir, G.I., Costello, D.J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1635
container_issue 8
container_start_page 1620
container_title IEEE transactions on information theory
container_volume 50
creator Shamir, G.I.
Costello, D.J.
description A lower bound is derived on the achievable redundancy for universal lossless coding of parametric sources with piecewise stationary, abruptly changing, occasionally repeating statistics. In particular, it is shown that if the number of repeating statistical parameter vectors (or states) is not too large, for any uniquely decipherable code, for almost every set of states that govern all the different segments in the data sequence, for almost every arrangement of these states in the different segments, and for almost every vector of transition times, the minimum achievable redundancy is composed of 0.5 log d extra code bits for each unknown component of each state, log m extra code bits for each unknown transition time, and log s extra code bits for each repetition of a state, where d is the average duration of each state in the input string, TO is the average length of a segment, and s is the total number of states. If s is essentially large compared to TO, it is shown that the minimum redundancy is composed of 0.5 log 77i bits for each unknown component in each segment and log TO bits for each unknown transition time, which is the same lower bound as that of general piecewise stationary sources (PSSs). These results are true also in the minimax and maximin senses. The lower bound is shown to be achievable through construction of mixture and estimation based codes. Different special cases are reviewed, and it is shown that unless s is essentially large compared to m, optimal codes that are designed particularly for sources with repeating statistics outperform codes designed for PSSs when coding sources with repeating statistics. In particular, the bound for general PSSs is shown to be a special case of the new bound.
doi_str_mv 10.1109/TIT.2004.831759
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_28507127</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1317110</ieee_id><sourcerecordid>749659731</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-5757e68f79daeae50d84d84ac3853f51069f172373e1c98dc7b89bd31c7fc8423</originalsourceid><addsrcrecordid>eNpdkM1LAzEQxYMoWKtnD14WQW_bZjZJk5xEih-Fgpf2HNLsrG7Z7tbMVvG_N6VCQRgYhvebYd5j7Br4CIDb8WK2GBWcy5ERoJU9YQNQSud2ouQpG3AOJrdSmnN2QbROo1RQDNjDsq2_MJJvsqYjapAoC11Zt-9Z1cWMul0MSNl33X9kEbfo-71EferU14Eu2VnlG8Krvz5ky-enxfQ1n7-9zKaP8zwIqftcaaVxYiptS48eFS-NTOWDMEpUCvjEVqALoQVCsKYMemXsqhQQdBWMLMSQ3R_ubmP3uUPq3aamgE3jW-x25AqjuIZCJ_D2H7hOHtr0mwOrjJVK8wSND1CIyXTEym1jvfHxxwF3-zRdStPt03SHNNPG3d9ZT8E3VfRtqOm4NgENACJxNweuRsSjLPYyF79drXzu</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>195894570</pqid></control><display><type>article</type><title>Universal lossless coding for sources with repeating statistics</title><source>IEEE Xplore</source><creator>Shamir, G.I. ; Costello, D.J.</creator><creatorcontrib>Shamir, G.I. ; Costello, D.J.</creatorcontrib><description>A lower bound is derived on the achievable redundancy for universal lossless coding of parametric sources with piecewise stationary, abruptly changing, occasionally repeating statistics. In particular, it is shown that if the number of repeating statistical parameter vectors (or states) is not too large, for any uniquely decipherable code, for almost every set of states that govern all the different segments in the data sequence, for almost every arrangement of these states in the different segments, and for almost every vector of transition times, the minimum achievable redundancy is composed of 0.5 log d extra code bits for each unknown component of each state, log m extra code bits for each unknown transition time, and log s extra code bits for each repetition of a state, where d is the average duration of each state in the input string, TO is the average length of a segment, and s is the total number of states. If s is essentially large compared to TO, it is shown that the minimum redundancy is composed of 0.5 log 77i bits for each unknown component in each segment and log TO bits for each unknown transition time, which is the same lower bound as that of general piecewise stationary sources (PSSs). These results are true also in the minimax and maximin senses. The lower bound is shown to be achievable through construction of mixture and estimation based codes. Different special cases are reviewed, and it is shown that unless s is essentially large compared to m, optimal codes that are designed particularly for sources with repeating statistics outperform codes designed for PSSs when coding sources with repeating statistics. In particular, the bound for general PSSs is shown to be a special case of the new bound.</description><identifier>ISSN: 0018-9448</identifier><identifier>EISSN: 1557-9654</identifier><identifier>DOI: 10.1109/TIT.2004.831759</identifier><identifier>CODEN: IETTAW</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Application software ; Applied sciences ; Capacity ; Cities and towns ; Codes ; Coding, codes ; Exact sciences and technology ; Gas insulated transmission lines ; Information theory ; Information, signal and communications theory ; Mathematics ; Minimax techniques ; Parameter estimation ; Parametric statistics ; Signal and communications theory ; Source coding ; Telecommunications and information theory</subject><ispartof>IEEE transactions on information theory, 2004-08, Vol.50 (8), p.1620-1635</ispartof><rights>2004 INIST-CNRS</rights><rights>Copyright Institute of Electrical and Electronics Engineers, Inc. (IEEE) Aug 2004</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c347t-5757e68f79daeae50d84d84ac3853f51069f172373e1c98dc7b89bd31c7fc8423</citedby><cites>FETCH-LOGICAL-c347t-5757e68f79daeae50d84d84ac3853f51069f172373e1c98dc7b89bd31c7fc8423</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1317110$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1317110$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=16171113$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Shamir, G.I.</creatorcontrib><creatorcontrib>Costello, D.J.</creatorcontrib><title>Universal lossless coding for sources with repeating statistics</title><title>IEEE transactions on information theory</title><addtitle>TIT</addtitle><description>A lower bound is derived on the achievable redundancy for universal lossless coding of parametric sources with piecewise stationary, abruptly changing, occasionally repeating statistics. In particular, it is shown that if the number of repeating statistical parameter vectors (or states) is not too large, for any uniquely decipherable code, for almost every set of states that govern all the different segments in the data sequence, for almost every arrangement of these states in the different segments, and for almost every vector of transition times, the minimum achievable redundancy is composed of 0.5 log d extra code bits for each unknown component of each state, log m extra code bits for each unknown transition time, and log s extra code bits for each repetition of a state, where d is the average duration of each state in the input string, TO is the average length of a segment, and s is the total number of states. If s is essentially large compared to TO, it is shown that the minimum redundancy is composed of 0.5 log 77i bits for each unknown component in each segment and log TO bits for each unknown transition time, which is the same lower bound as that of general piecewise stationary sources (PSSs). These results are true also in the minimax and maximin senses. The lower bound is shown to be achievable through construction of mixture and estimation based codes. Different special cases are reviewed, and it is shown that unless s is essentially large compared to m, optimal codes that are designed particularly for sources with repeating statistics outperform codes designed for PSSs when coding sources with repeating statistics. In particular, the bound for general PSSs is shown to be a special case of the new bound.</description><subject>Application software</subject><subject>Applied sciences</subject><subject>Capacity</subject><subject>Cities and towns</subject><subject>Codes</subject><subject>Coding, codes</subject><subject>Exact sciences and technology</subject><subject>Gas insulated transmission lines</subject><subject>Information theory</subject><subject>Information, signal and communications theory</subject><subject>Mathematics</subject><subject>Minimax techniques</subject><subject>Parameter estimation</subject><subject>Parametric statistics</subject><subject>Signal and communications theory</subject><subject>Source coding</subject><subject>Telecommunications and information theory</subject><issn>0018-9448</issn><issn>1557-9654</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkM1LAzEQxYMoWKtnD14WQW_bZjZJk5xEih-Fgpf2HNLsrG7Z7tbMVvG_N6VCQRgYhvebYd5j7Br4CIDb8WK2GBWcy5ERoJU9YQNQSud2ouQpG3AOJrdSmnN2QbROo1RQDNjDsq2_MJJvsqYjapAoC11Zt-9Z1cWMul0MSNl33X9kEbfo-71EferU14Eu2VnlG8Krvz5ky-enxfQ1n7-9zKaP8zwIqftcaaVxYiptS48eFS-NTOWDMEpUCvjEVqALoQVCsKYMemXsqhQQdBWMLMSQ3R_ubmP3uUPq3aamgE3jW-x25AqjuIZCJ_D2H7hOHtr0mwOrjJVK8wSND1CIyXTEym1jvfHxxwF3-zRdStPt03SHNNPG3d9ZT8E3VfRtqOm4NgENACJxNweuRsSjLPYyF79drXzu</recordid><startdate>20040801</startdate><enddate>20040801</enddate><creator>Shamir, G.I.</creator><creator>Costello, D.J.</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20040801</creationdate><title>Universal lossless coding for sources with repeating statistics</title><author>Shamir, G.I. ; Costello, D.J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-5757e68f79daeae50d84d84ac3853f51069f172373e1c98dc7b89bd31c7fc8423</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Application software</topic><topic>Applied sciences</topic><topic>Capacity</topic><topic>Cities and towns</topic><topic>Codes</topic><topic>Coding, codes</topic><topic>Exact sciences and technology</topic><topic>Gas insulated transmission lines</topic><topic>Information theory</topic><topic>Information, signal and communications theory</topic><topic>Mathematics</topic><topic>Minimax techniques</topic><topic>Parameter estimation</topic><topic>Parametric statistics</topic><topic>Signal and communications theory</topic><topic>Source coding</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shamir, G.I.</creatorcontrib><creatorcontrib>Costello, D.J.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) Online</collection><collection>IEEE Xplore</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on information theory</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shamir, G.I.</au><au>Costello, D.J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Universal lossless coding for sources with repeating statistics</atitle><jtitle>IEEE transactions on information theory</jtitle><stitle>TIT</stitle><date>2004-08-01</date><risdate>2004</risdate><volume>50</volume><issue>8</issue><spage>1620</spage><epage>1635</epage><pages>1620-1635</pages><issn>0018-9448</issn><eissn>1557-9654</eissn><coden>IETTAW</coden><abstract>A lower bound is derived on the achievable redundancy for universal lossless coding of parametric sources with piecewise stationary, abruptly changing, occasionally repeating statistics. In particular, it is shown that if the number of repeating statistical parameter vectors (or states) is not too large, for any uniquely decipherable code, for almost every set of states that govern all the different segments in the data sequence, for almost every arrangement of these states in the different segments, and for almost every vector of transition times, the minimum achievable redundancy is composed of 0.5 log d extra code bits for each unknown component of each state, log m extra code bits for each unknown transition time, and log s extra code bits for each repetition of a state, where d is the average duration of each state in the input string, TO is the average length of a segment, and s is the total number of states. If s is essentially large compared to TO, it is shown that the minimum redundancy is composed of 0.5 log 77i bits for each unknown component in each segment and log TO bits for each unknown transition time, which is the same lower bound as that of general piecewise stationary sources (PSSs). These results are true also in the minimax and maximin senses. The lower bound is shown to be achievable through construction of mixture and estimation based codes. Different special cases are reviewed, and it is shown that unless s is essentially large compared to m, optimal codes that are designed particularly for sources with repeating statistics outperform codes designed for PSSs when coding sources with repeating statistics. In particular, the bound for general PSSs is shown to be a special case of the new bound.</abstract><cop>New York, NY</cop><pub>IEEE</pub><doi>10.1109/TIT.2004.831759</doi><tpages>16</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0018-9448
ispartof IEEE transactions on information theory, 2004-08, Vol.50 (8), p.1620-1635
issn 0018-9448
1557-9654
language eng
recordid cdi_proquest_miscellaneous_28507127
source IEEE Xplore
subjects Application software
Applied sciences
Capacity
Cities and towns
Codes
Coding, codes
Exact sciences and technology
Gas insulated transmission lines
Information theory
Information, signal and communications theory
Mathematics
Minimax techniques
Parameter estimation
Parametric statistics
Signal and communications theory
Source coding
Telecommunications and information theory
title Universal lossless coding for sources with repeating statistics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T07%3A18%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Universal%20lossless%20coding%20for%20sources%20with%20repeating%20statistics&rft.jtitle=IEEE%20transactions%20on%20information%20theory&rft.au=Shamir,%20G.I.&rft.date=2004-08-01&rft.volume=50&rft.issue=8&rft.spage=1620&rft.epage=1635&rft.pages=1620-1635&rft.issn=0018-9448&rft.eissn=1557-9654&rft.coden=IETTAW&rft_id=info:doi/10.1109/TIT.2004.831759&rft_dat=%3Cproquest_RIE%3E749659731%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=195894570&rft_id=info:pmid/&rft_ieee_id=1317110&rfr_iscdi=true