Channel Capacity of Coding System on Tsallis Entropy and q-Statistics

Tsuruyama, Tatsuaki

doi:10.3390/e19120682

Open AccessArticle

Channel Capacity of Coding System on Tsallis Entropy and q-Statistics

by

Tatsuaki Tsuruyama

Department of Pathology, Kyoto University, Graduate School of Medicine, Yoshida-Konoe-cho, Sakyo-ku, Kyoto 606-8315, Japan

Entropy 2017, 19(12), 682; https://doi.org/10.3390/e19120682

Submission received: 12 August 2017 / Revised: 4 December 2017 / Accepted: 8 December 2017 / Published: 12 December 2017

(This article belongs to the Section Information Theory, Probability and Statistics)

Download Versions Notes

Abstract

:

The field of information science has greatly developed, and applications in various fields have emerged. In this paper, we evaluated the coding system in the theory of Tsallis entropy for transmission of messages and aimed to formulate the channel capacity by maximization of the Tsallis entropy within a given condition of code length. As a result, we obtained a simple relational expression between code length and code appearance probability and, additionally, a generalized formula of the channel capacity on the basis of Tsallis entropy statistics. This theoretical framework may contribute to data processing techniques and other applications.

Keywords:

Tsallis entropy; channel capacity

1. Introduction

Information theory has developed greatly in recent years, and has found broader applications in various research fields [1]. Shannon developed the information theory of entropy [2], and there have been extensive studies of entropy generalizations. On the other hand, the theory of Tsallis statistics originated in the 1980s, and the principle of entropy maximization was a means of expanding on the basic statistics theory. It was based on the fact that Boltzmann–Gibbs statistical mechanics could be reconstructed by the entropy maximization principle, a development beginning in 1957 with the work of Jaynes [3,4]; the framework has been developed primarily for extending statistical mechanics. Tsallis entropy was introduced in 1988 by Constantino Tsallis as a basis for generalizing the standard statistical mechanics on the basis of q-statistics [5]. It is possible to derive Tsallis distributions from the optimization of Tsallis entropy. For example, the q-Gaussian is one of the probability distributions that arise from the maximization of Tsallis entropy.

In this short paper, we aim to define the channel capacity of the coding system for transmission of messages on the basis of Tsallis entropy and aim to understand Tsallis entropy theory from the viewpoint of coding theory and seek a way to maximize the entropy corresponding to the signal event number. For the purpose, we introduce the coding symbols, the appearance probability of the symbols, and the message duration. Through this theory-based development, we reconsider the significance of Tsallis entropy in coding theory. Especially with respect to source coding, the objective is to determine the relational expression when applying the entropy maximization principle to establishing the relationship between code length and the appearance probability of the code used to calculate the channel capacity of a coding system. This theory regarding the relationship between code length and the appearance probability of the code was developed by Brillouin and the simple formula was obtained by maximizing information in the coding system, on the basis of information theory [6,7,8]. Several research progressions have been achieved regarding mutual information [7]; however, the relational expression between source code length and the appearance probability of the code remains to be improved in the theory.

2. Source Coding for Tsallis Entropy Formulation

Consider all the possible distinct messages that correspond to all the possible combinations of the code A_j, whose code length is τ_j. For instance, a message is described using n types of code symbols, S_j (1 ≤ j ≤ n) as follows:

A₁ A₃ A₂ A₃ A₁ A₄ A₃ A₅ A₃ A₃ A₃

(1)

Our aim is to identify the way of coding in which the total information within a given duration can be maximized. The messages, which consist of symbols A_j with numbers N_j (1 ≤ j ≤ n), will correspond to all the possible combinations of symbols A_j. Therefore, N₁ = 2, N₂ = 1, N₃ = 6, and n = 5 in the message (1). Here, we consider Ψ, the total number of such distinct messages, in the selection of n symbols. We assume absolutely no restrictions, constraints, or correlations in using various symbols. We obtain information I derived from the above messages consisting of N_j,

I = K \log ψ .

(2)

Here, K is an arbitrary constant. If we use entropy unit, we take K = k_B, Boltzmann’s constant. On the other hand, in information science, K is equivalent to log₂e. Shannon defines the channel capacity as follows [2]:

C = \max \lim_{τ \to \infty} \frac{K \log ψ}{τ} = \max \lim_{τ \to \infty} \frac{I}{τ} .

(3)

Here, Ψ is signified as a function of τ, a message of total duration. The unit of channel capacity is given by bits per second, if the total duration is measured in seconds. We define the total number of code symbols N in a given message as:

N = \sum_{j = 1}^{n} N_{j}

(4)

For example, N = 11 in (1). Thus, N is variable in different individual messages. Next, p_j is the appearance probability of the S_j symbol in the messages consisting of a total of N symbols. In the following summations over j:

p_{j} ≜ \frac{N_{j}}{N}

(5)

and

\sum_{j = 1}^{n} p_{j} = 1

(6)

Using (5), we can rewrite (3) as follows:

C = \max \lim_{τ \to \infty} (- K N \sum_{j = 1}^{n} p_{j} \log p_{j} / τ) = \max \lim_{τ \to \infty} (K S / τ) .

(7)

Here, S represents the Shannon entropy, S. In this study, we investigate the channel capacity when the entropy is given by Tsallis entropy. Here, we introduce the q-duration of the message, as follows:

τ_{q} = N \sum_{j = 1}^{n} ϕ_{j} τ_{j}

(8)

τ_j signifies the jth code length. Here, we used escort probability φ_j:

ϕ_{j} ≜ \frac{p_{j}^{q}}{c_{q}},

(9)

In actuality,

\sum_{j = 1}^{n} ϕ_{j} = \sum_{j = 1}^{n} \frac{p_{j}^{q}}{c_{q}} = 1

(10)

For simplification, we use general notation in Tsallis statistics:

c_{q} ≜ \sum_{j = 1}^{n} p_{j}^{q}

(11)

Tsallis entropy is given by [6] (http://www.tsakkus,cat.cbpf.br/TEMUCO.pdf):

S_{q} = N \sum_{j = 1}^{n} \frac{p_{j} - p_{j}^{q}}{q - 1}

(12)

The theory of Tsallis statistics, based on the generalized form of entropy S_q (q ∈ R), when q→1, recovers the Shannon entropy:

S = - N \sum_{j = 1}^{n} p_{j} \log p_{j} .

(13)

We aimed to maximize Tsallis entropy (12) [8], S_q, instead of Shannon entropy. Then, we introduced a function G, using non-determined parameters β and γ, in reference to (8), (9), and (10):

G (p_{1}, p_{2}, \dots p_{n}; N) ≜ S_{q} - β \sum_{j = 1}^{n} ϕ_{j} - γ N \sum_{j = 1}^{n} ϕ_{j} τ_{j}

(14)

Then

\frac{\partial}{\partial p_{j}} G (p_{1}, p_{2}, \dots p_{n}; N) = N \frac{1 - q p_{j}^{q - 1}}{q - 1} - (β + γ N τ_{j}) \frac{q p_{j}^{q - 1} (- p_{j}^{q} + c_{q})}{c_{q}^{2}}

(15)

\frac{\partial}{\partial N} G (p_{1}, p_{2}, \dots p_{n}; N) = \sum_{j = 1}^{n} \frac{p_{j} - p_{j}^{q}}{q - 1} - \sum_{j = 1}^{n} γ \frac{p_{j}^{q}}{c_{q}} τ_{j}

(16)

For calculation of (15), we used

\frac{\partial ϕ_{j}}{\partial p_{j}} = \frac{q p_{j}^{q - 1} (- p_{j}^{q} + c_{q})}{c_{q}^{2}}

(17)

For maximization of

G (p_{1}, p_{2}, \dots p_{n}; N)

, setting the right sides of (15) and (16) equal to zero, we have:

(β + γ N τ_{j}) \frac{q p_{j}^{q - 1} (- p_{j}^{q} + c_{q})}{c_{q}^{2}} = N \frac{1 - q p_{j}^{q - 1}}{q - 1}

(18)

γ \frac{p_{j}^{q}}{c_{q}} τ_{j} = \frac{p_{j} - p_{j}^{q}}{q - 1}

(19)

and solving the above equations with respect to undetermined coefficients β and γ, we have:

β = \frac{N c_{q} p_{j}^{- q} (p_{j}^{q} (- p_{j} + p_{j}^{q}) q + (p_{j} + q p_{j} - 2 q p_{j}^{q}) c_{q})}{q (q - 1) (- p_{j}^{q} + c_{q})}

(20)

and

γ_{q} ≜ \frac{γ}{c_{q}} = - \frac{1 - p_{j}^{1 - q}}{(q - 1) τ_{j}}

(21)

Rewriting (21), using the q-logarithm function,

\log_{q} x = \frac{1 - x^{1 - q}}{q - 1},

(22)

and we obtain from (8), (21) and (22):

- \log_{q} p_{j} = γ_{q} τ_{j}

(23)

Equation (23) implies that most probable code symbols must be short, while the improbable code may be long. In fact, when q approaches 1, (23) gives the logarithm according to Brillouin’s work using another constant γ’:

- \log p_{j} = γ' τ_{j}

(24)

Thus, the probability of symbol appearance can be described using the Tsallis duration in (22), which is similar to the Shannon entropy coding in (24) [9]. The above result is explicitly a natural extension of Brillouin’s theory, regarding the relationship between coding and Shannon entropy, to the concept of Tsallis entropy.

3. Channel Capacity and Tsallis Entropy

As shown in (23), γ_q is equivalent to the Tsallis average entropy production rate σ_q during the transmission of the message:

γ_{q} = - \log_{q} p_{j} / τ_{j} ≜ σ_{q}

(25)

Our definition now yields the q-channel capacity C_q, in reference to (3), (7), (8), and (21) as follows:

C_{q} ≜ \lim_{τ_{q} \to \infty} \frac{- K N \sum_{j = 1}^{n} p_{j}^{q} \log_{q} p_{j}}{τ_{q}} = \lim_{τ_{q} \to \infty} \frac{K N σ_{q} \sum_{j = 1}^{n} p_{j}^{q} τ_{j}}{τ_{q}} = K_{q} σ_{q}

(26)

with

K_{q} ≜ K c_{q}

(27)

Here, K is an arbitrary constant. Therefore, the channel capacity has a dimension identical to the entropy production rate and is equivalent to Tsallis average entropy production rate. Thus, the above result is explicitly a natural extension of channel capacity on Tsallis entropy.

4. Conclusions

In this short article, we achieved an important formulation between code length and appearance probability in Equation (26). In a similar way to how Shannon’s entropy was extended to Tsallis entropy, the source-coding theory based on the former entropy by Brillouin [9,10] was theoretically generalized for the theory based on Tsallis entropy [5,7,11]. On the other hand, it remains to be discussed how one should interpret q-duration in Equations (7) and (19). The duration of a given message event is generally shorter than the actual message duration; however, when the appearance probability distribution obeys a q-Gaussian, the q-duration is an indicator available for use in the analysis of the coding background, in place of the actual duration. From this perspective, we will further investigate the definition of the q-duration and the interpretation of the limitation of our calculation in future.

My theoretical attempt to generalize source coding can be applied to data management within the areas of data communications, processing, and conversion, particularly in the development of imaging applications. Further investigation is needed with regards to the tractability of Tsallis entropy and q-statistics in evaluating actual experimental data. We actually applied the medical imaging technique, following previous reports [12,13]; in future work, we look to investigate systems in which entropy is measurable and/or definable.

Acknowledgments

This work was supported by a Grant-in-Aid from the Ministry of Education, Culture, Sports, Science, and Technology of Japan (Synergy of Fluctuation and Structure: Quest for Universal Laws in Non-Equilibrium Systems, P2013-201 Grant-in-Aid for Scientific Research on Innovative Areas, MEXT, Japan). We thank Kenichi Yoshikawa of Doshisha University, for his advices.

Conflicts of Interest

The author declare no conflict of interest.

References

Zoltan, D. Generalized information functions. Inf. Control 1970, 16, 36–51. [Google Scholar]
Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics II. Phys. Rev. 1957, 108, 171–190. [Google Scholar] [CrossRef]
Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
Rosso, O.A.R.; Teresa, M.; Larrondo, H.A.; Kowalski, A.; Plastino, A. Generalized statistical complexity: A new tool for dynamical systems. In Concepts and Recent Advances in Generalized Information Measures and Statistics; Kowalski, A.M., Rossignoli, R.D., Curado, E.M.F., Eds.; Bentham Science Publishers: Emirate of Sharjah, United Arab Emirates, 2013. [Google Scholar]
Tsallis, C. Introduction to Nonextensive Statistical Mechanics: Approaching a Complex World; Springer: New York, NY, USA, 2009; pp. 329–334. [Google Scholar]
Livadiotis, G. Entropy Maximization in Part 1 Theory and Formalism. In Kappa Distribution: Theory and Applications in Plasmas; Elsevier: Amsterdam, The Netherlands; Cambridge, UK; Atlanta, GA, USA, 2017; pp. 22–27. [Google Scholar]
Brillouin, L. Chapter 4 Principle of coding. In Science and Information Theory, 2nd ed.; Dover Publications Inc.: Mineola, NY, USA, 2013; pp. 28–50. [Google Scholar]
Brillouin, L. Chapter 8 The analysis of coding. In Science and Information Theory, 2nd ed.; Dover Publications Inc.: Mineola, NY, USA, 2013; pp. 78–113. [Google Scholar]
Angulo, J.M.; Esquivel, F.J. Multifractal dimensional dependence assessment based on Tsallis mutual information. Entropy 2015, 17, 5382–5401. [Google Scholar] [CrossRef]
Hamza, A.B. An information-theoretic method for multimodality medical image registration. Expert Syst. Appl. 2012, 39, 5548–5556. [Google Scholar]
Tarmissi, K. Information-theoretic hashing of 3D objects using spectral graph theory. Expert Syst. Appl. 2009, 36, 9409–9414. [Google Scholar] [CrossRef]

© 2017 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tsuruyama, T. Channel Capacity of Coding System on Tsallis Entropy and q-Statistics. Entropy 2017, 19, 682. https://doi.org/10.3390/e19120682

AMA Style

Tsuruyama T. Channel Capacity of Coding System on Tsallis Entropy and q-Statistics. Entropy. 2017; 19(12):682. https://doi.org/10.3390/e19120682

Chicago/Turabian Style

Tsuruyama, Tatsuaki. 2017. "Channel Capacity of Coding System on Tsallis Entropy and q-Statistics" Entropy 19, no. 12: 682. https://doi.org/10.3390/e19120682

APA Style

Tsuruyama, T. (2017). Channel Capacity of Coding System on Tsallis Entropy and q-Statistics. Entropy, 19(12), 682. https://doi.org/10.3390/e19120682

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Channel Capacity of Coding System on Tsallis Entropy and q-Statistics

Abstract

1. Introduction

2. Source Coding for Tsallis Entropy Formulation

3. Channel Capacity and Tsallis Entropy

4. Conclusions

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI