About Adaptive Coding on Countable Alphabets: Max-Stable Envelope Classes

Type: Article

Publication Date: 2015-07-20

Citations: 12

DOI: https://doi.org/10.1109/tit.2015.2455058

Abstract

In this paper, we study the problem of lossless universal source coding for stationary memoryless sources on countably infinite alphabets. This task is generally not achievable without restricting the class of sources over which universality is desired. Building on our prior work, we propose natural families of sources characterized by a common dominating envelope. We particularly emphasize the notion of adaptivity, which is the ability to perform as well as an oracle knowing the envelope, without actually knowing it. This is closely related to the notion of hierarchical universal source coding, but with the important difference that families of envelope classes are not discretely indexed and not necessarily nested. Our contribution is to extend the classes of envelopes over which adaptive universal source coding is possible, namely by including max-stable (heavy-tailed) envelopes which are excellent models in many applications, such as natural language modeling. We derive a minimax lower bound on the redundancy of any code on such envelope classes, including an oracle that knows the envelope. We then propose a constructive code that does not use knowledge of the envelope. The code is computationally efficient and is structured to use an {E}xpanding {T}hreshold for {A}uto-{C}ensoring, and we therefore dub it the \textsc{ETAC}-code. We prove that the \textsc{ETAC}-code achieves the lower bound on the minimax redundancy within a factor logarithmic in the sequence length, and can be therefore qualified as a near-adaptive code over families of heavy-tailed envelopes. For finite and light-tailed envelopes the penalty is even less, and the same code follows closely previous results that explicitly made the light-tailed assumption. Our technical results are founded on methods from regular variation theory and concentration of measure.

Locations

  • arXiv (Cornell University) - View - PDF
  • HAL (Le Centre pour la Communication Scientifique Directe) - View
  • DataCite API - View
  • IEEE Transactions on Information Theory - View

Similar Works

Action Title Year Authors
+ PDF Chat About adaptive coding on countable alphabets: max-stable envelope classes 2015 Boucheron Stephane
Élisabeth Gassiat
Mesrob I. Ohannessian
+ Pattern Coding Meets Censoring: (almost) Adaptive Coding on Countable Alphabets 2016 Anna Ben-Hamou
Stéphane Boucheron
Élisabeth Gassiat
+ PDF Chat Coding on Countably Infinite Alphabets 2009 Stéphane Boucheron
Aurélien Garivier
Élisabeth Gassiat
+ PDF Chat About Adaptive Coding on Countable Alphabets 2014 Dominique Bontemps
Stéphane Boucheron
Élisabeth Gassiat
+ PDF Chat On Universal D-Semifaithful Coding for Memoryless Sources With Infinite Alphabets 2021 Jorge F. Silva
Pablo Piantanida
+ On Universal D-Semifaithful Coding for Memoryless Sources with Infinite Alphabets 2021 Jorge F. Silva
Pablo Piantanida
+ On redundancy of memoryless sources over countable alphabets 2014 Maryam Hosseini
Narayana Santhanam
+ On redundancy of memoryless sources over countable alphabets 2014 Maryam Hosseini
Narayana Santhanam
+ PDF Chat Tail Redundancy and its Characterization of Compression of Memoryless Sources 2022 Maryam Hosseini
Narayana Santhanam
+ Redundancy of unbounded memory Markov classes with continuity conditions 2018 Changlong Wu
Maryam Hosseini
Narayana Santhanam
+ Tail redundancy and its characterization of compression of memoryless sources 2018 Maryam Hosseini
Narayana Santhanam
+ Redundancy of Markov Family with Unbounded Memory 2018 Changlong Wu
Maryam Hosseini
Narayana Santhanam
+ PDF Chat Universal Weak Variable-Length Source Coding on Countably Infinite Alphabets 2019 Jorge F. Silva
Pablo Piantanida
+ Universal compression of Gaussian sources with unknown parameters 2014 Alon Orlitsky
Narayana Santhanam
+ Fundamental Limits of Universal Variable-to-Fixed Length Coding of Parametric Sources 2017 Nematollah Iri
Oliver Kosut
+ Fundamental Limits of Universal Variable-to-Fixed Length Coding of Parametric Sources 2017 Nematollah Iri
Oliver Kosut
+ Universal Compression of Envelope Classes: Tight Characterization via Poisson Sampling 2014 Jayadev Acharya
Ashkan Jafarpour
Alon Orlitsky
Ananda Theertha Suresh
+ PDF Chat Universal covertness for Discrete Memoryless Sources 2016 RĂ©mi A. Chou
Matthieu R. Bloch
Aylin Yener
+ Universal Covertness for Discrete Memoryless Sources 2021 RĂ©mi A. Chou
Matthieu R. Bloch
Aylin Yener
+ PDF Chat Results on the redundancy of universal compression for finite-length sequences 2011 Ahmad Beirami
Faramarz Fekri