Universal lossless coding for sources with repeating statistics
A lower bound is derived on the achievable redundancy for universal lossless coding of parametric sources with piecewise stationary, abruptly changing, occasionally repeating statistics. In particular, it is shown that if the number of repeating statistical parameter vectors (or states) is not too l...
Published in: | IEEE transactions on information theory 2004-08, Vol.50 (8), p.1620-1635 |
---|---|
Main authors: | Shamir, G.I.; Costello, D.J. |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | 1635 |
---|---|
container_issue | 8 |
container_start_page | 1620 |
container_title | IEEE transactions on information theory |
container_volume | 50 |
creator | Shamir, G.I.; Costello, D.J. |
description | A lower bound is derived on the achievable redundancy for universal lossless coding of parametric sources with piecewise stationary, abruptly changing, occasionally repeating statistics. In particular, it is shown that if the number of repeating statistical parameter vectors (or states) is not too large, for any uniquely decipherable code, for almost every set of states that govern all the different segments in the data sequence, for almost every arrangement of these states in the different segments, and for almost every vector of transition times, the minimum achievable redundancy is composed of 0.5 log d extra code bits for each unknown component of each state, log m extra code bits for each unknown transition time, and log s extra code bits for each repetition of a state, where d is the average duration of each state in the input string, m is the average length of a segment, and s is the total number of states. If s is essentially large compared to m, it is shown that the minimum redundancy is composed of 0.5 log m bits for each unknown component in each segment and log m bits for each unknown transition time, which is the same lower bound as that of general piecewise stationary sources (PSSs). These results are true also in the minimax and maximin senses. The lower bound is shown to be achievable through construction of mixture and estimation-based codes. Different special cases are reviewed, and it is shown that unless s is essentially large compared to m, optimal codes that are designed particularly for sources with repeating statistics outperform codes designed for PSSs when coding sources with repeating statistics. In particular, the bound for general PSSs is shown to be a special case of the new bound. |
doi_str_mv | 10.1109/TIT.2004.831759 |
format | Article |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0018-9448 |
ispartof | IEEE transactions on information theory, 2004-08, Vol.50 (8), p.1620-1635 |
issn | 0018-9448; 1557-9654 |
language | eng |
recordid | cdi_proquest_miscellaneous_28507127 |
source | IEEE Xplore |
subjects | Application software; Applied sciences; Capacity; Cities and towns; Codes; Coding, codes; Exact sciences and technology; Gas insulated transmission lines; Information theory; Information, signal and communications theory; Mathematics; Minimax techniques; Parameter estimation; Parametric statistics; Signal and communications theory; Source coding; Telecommunications and information theory |
title | Universal lossless coding for sources with repeating statistics |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T07%3A18%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Universal%20lossless%20coding%20for%20sources%20with%20repeating%20statistics&rft.jtitle=IEEE%20transactions%20on%20information%20theory&rft.au=Shamir,%20G.I.&rft.date=2004-08-01&rft.volume=50&rft.issue=8&rft.spage=1620&rft.epage=1635&rft.pages=1620-1635&rft.issn=0018-9448&rft.eissn=1557-9654&rft.coden=IETTAW&rft_id=info:doi/10.1109/TIT.2004.831759&rft_dat=%3Cproquest_RIE%3E749659731%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=195894570&rft_id=info:pmid/&rft_ieee_id=1317110&rfr_iscdi=true |
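
The redundancy terms named in the description can be illustrated with a small numerical sketch. This is not taken from the paper. It only adds up the leading terms as the abstract words them, and it rests on several assumptions: logarithms are base 2, every state has the same number k of unknown components, d = n / s and m = n / segments for a sequence of length n, the number of transition times is segments - 1, and the number of repetitions is segments - s. The function names and parameters are hypothetical, and lower-order terms are ignored.

```python
import math


def repeating_stats_redundancy(n, k, s, segments):
    """Leading redundancy terms (in bits) for a source with repeating statistics.

    Assumptions (illustrative, not from the paper's formal statement):
    n = sequence length, k = unknown components per state,
    s = number of distinct states, segments = number of segments.
    """
    if not (1 <= s <= segments <= n):
        raise ValueError("require 1 <= s <= segments <= n")
    d = n / s              # average duration of each state
    m = n / segments       # average length of a segment
    transitions = segments - 1
    repetitions = segments - s   # segments that reuse an earlier state
    return (0.5 * k * s * math.log2(d)
            + transitions * math.log2(m)
            + repetitions * math.log2(s))


def pss_redundancy(n, k, segments):
    """Leading redundancy terms (in bits) for a general piecewise stationary source."""
    if not (1 <= segments <= n):
        raise ValueError("require 1 <= segments <= n")
    m = n / segments
    return 0.5 * k * segments * math.log2(m) + (segments - 1) * math.log2(m)


if __name__ == "__main__":
    n, k, segments = 2 ** 20, 1, 16
    for s in (4, 16):
        print(s, repeating_stats_redundancy(n, k, s, segments),
              pss_redundancy(n, k, segments))
```

Under these assumptions, a small s gives a lower total than the PSS expression, and setting s equal to the number of segments makes the two expressions coincide, which matches the abstract's remark that the PSS bound is a special case of the new bound.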