cs0511078/papi.tex
1: 
2: 
3: %----------------------------------------------------------------
4: %%%%%%%%%%%%%%%%%%%%5Check-
5: 
6: % check whether to use pseudo-additivity or nonextensive additivity
7: 
8: 
9: 
50: %
51: \documentclass[12pt]{iopart}
52: \newcommand{\gguide}{{\it Preparing graphics for IOP journals}}
53: 
54: %==============================
55: %Mine
56: 
57: \usepackage{amssymb}
58: \usepackage{amsthm}
59: 
60: %------------------theorem env---------------
61:  \newtheorem{theorem}{Theorem}[section]
62:        \newtheorem{lemma}[theorem]{Lemma}
63:        \newtheorem{proposition}[theorem]{Proposition}
64:        \newtheorem{corollary}[theorem]{Corollary}
65:     \newtheorem{definition}[theorem]{Definition}
66:        \newtheorem{remark}[theorem]{Remark}
67: % \def\QED{\mbox{\rule[0pt]{1.5ex}{1.5ex}}}
68: % \def\proof{\noindent\hspace{2em}{\it Proof: }}
69: % \def\endproof{\hspace*{\fill}~\QED\par\endtrivlist\unskip}
70: %--------------------------------------------
71: \newcommand{\ud}{\mathrm{d}}
72: %=====================================
73: 
74: 
75: %Uncomment next line if AMS fonts required
76: %\usepackage{iopams}  
77: \begin{document}
78: 
79: %\title[R\'{e}nyi's Recipe and Nonextensitivity]{R\'{e}nyi's
80: %Recipe and Nonextensitivity: A Characterization Theorem for Tsallis
81: %Entropy}
82: 
83: \title[]{Uniqueness of Nonextensive entropy under \\ R\'{e}nyi's Recipe}
84: 
85: \author{Ambedkar Dukkipati\footnote{Corresponding author}, M Narasimha
86: Murty and Shalabh Bhatnagar}
87: 
88: \address{Department of Computer Science and Automation,
89: Indian Institute of Science, Bangalore-560012, India.}
90: \ead{\mailto{ambedkar@csa.iisc.ernet.in},
91: \mailto{mnm@csa.iisc.ernet.in}, \mailto{shalabh@csa.iisc.ernet.in}}
92: 
93: 
94: %----------------------------------------
95: \begin{abstract}
96: 	By replacing linear
97:         averaging in Shannon entropy with Kolmogorov-Nagumo
98:         averages (KN-averages), or quasilinear means, and further
99:         imposing the additivity   
100:         constraint, R\'{e}nyi proposed the first formal generalization of
101:         Shannon entropy. Using this recipe of R\'{e}nyi, one can prepare only
102:         two information measures: 
103: 	Shannon and R\'{e}nyi entropy. Indeed, using this formalism
104:         R\'{e}nyi characterized these additive entropies in terms of 
105:         axioms of quasilinear means. Just as additivity is a characteristic
106:         property of Shannon entropy, pseudo-additivity of the form $x \oplus_{q}
107:         y = x + y + (1-q)x y$ is a characteristic property of
108:         nonextensive (or Tsallis)
109:         entropy.
110: 	One can apply R\'{e}nyi's recipe in the nonextensive case by
111:         replacing the linear averaging in
112: 	Tsallis entropy with KN-averages and thereby imposing the
113: 	constraint of 
114: 	pseudo-additivity.
115: 	In this paper we show that nonextensive entropy is unique
116:         under R\'{e}nyi's recipe, and thereby give a
117:         characterization.
118: \end{abstract}
119: 
120: %Uncomment for PACS numbers title message
121: \pacs{ 65.40.Gr, 89.70.+c, 02.70.Rr}
122: % Keywords required only for MST, PB, PMB, PM, JOA, JOB? 
123: %\vspace{2pc}
124: %\noindent{\it Keywords}: Article preparation, IOP journals
125: % Uncomment for Submitted to journal title message
126: %\submitto{\JPA}
127: % Comment out if separate title page not required
128: \maketitle
129: 
130: %=========================Introduction===========================
131: \section{Introduction}
132: 
133: 	In recent years, interest in generalized information measures
134:          has increased dramatically, after the introduction of
135:          {\em nonextensive entropy} in Physics
136:          in 1988 by
137:          Tsallis~\cite{Tsallis:1988:GeneralizationOfBoltzmannGibbsStatistics}.
138: 	One can get this nonextensive entropy, or Tsallis entropy, by
139:          generalizing the information of a single 
140: 	event in the definition of Shannon entropy, replacing the
141: 	logarithm with the so-called
142: 	$q$-logarithm, 
143: 	which is defined as 
144:         $\ln_{q} x = \frac{x^{1-q}-1}{1-q}$. Tsallis entropy does not
145:          satisfy the additivity property which is a characteristic
146:          property of Shannon entropy.  Instead, it satisfies
147:          pseudo-additivity of the form 
148: 	$x \oplus_{q} y = x + y + (1-q)xy$ and this
149:          definition of entropy 
150:         (also known as nonextensive entropy) led
151:         to the field of nonextensive statistical mechanics in
152:         Physics. In this paper we use the term pseudo-addition to
153:          represent the binary operation $x \oplus_{q} y = x + y +
154:          (1-q)xy$ for any real $q > 0$.
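
         As a small illustration of this operation, one may note that
         $x \oplus_{q} y$ is commutative, has $0$ as its identity element,
         and reduces to ordinary addition when $q = 1$. It also satisfies
         the factorization
         \begin{displaymath}
         1 + (1-q)(x \oplus_{q} y) = \left(1 + (1-q)x\right)
         \left(1 + (1-q)y\right) \enspace,
         \end{displaymath}
         which can be checked by expanding the right-hand side, and from
         which associativity of $\oplus_{q}$ follows.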
155: 
156: 	 Tsallis entropy is considered a useful
157:          measure in describing the thermostatistical properties of a
158:          certain class of physical systems that entail long-range
159:          interactions, long-term memories and multi-fractal structures.
160: 	 Tsallis entropy is also studied in information theory, and the
161:          Shannon-Khinchin axioms have been generalized to
162:          the nonextensive case. While 
163:          canonical distributions resulting from maximization of
164:          Shannon entropy are exponential in nature, in the
165:          Tsallis case, these result in power-law distributions. To a great extent, the success of Tsallis' proposal is due to
166: 	the ubiquity of power-law distributions in nature.
167: 
168: 	Indeed, the starting point of the theory of generalized measures of
169: 	information is due to Alfred
170: 	R{\'{e}}nyi~\cite{Renyi:1960:SomeFundamentalQuestionsOfInformationTheory,Renyi:1961:OnMeasuresOfEntropyAndInformation}.
171: 	By using Kolmogorov-Nagumo averages (KN-averages),
172: 	R\'{e}nyi introduced a
173: 	generalized information measure, known as $\alpha$-entropy or
174: 	R\'{e}nyi entropy, the first well-known formal generalization
175: 	of Shannon entropy.
176:         A {\em KN-average}, or quasilinear mean (we use these two
177: 	terms interchangeably), is of
178:         the form
179: 	${\langle x 
180:         \rangle}_{\psi} = \psi^{-1} \left (\sum_{k} p_{k}
181:         \psi(x_{k})\right)$,
182: 	where $\psi$ is an arbitrary continuous
183:         and strictly monotone function.
184: 	Replacing linear
185:         averaging in Shannon entropy with KN-averages and further
186: 	imposing the additivity   
187:         constraint -- a characteristic property of the underlying
188:         information associated with a single event, which is
189:         logarithmic -- leads to {\em R\'{e}nyi
190:         entropy}. 
191: 	Using this recipe of R\'{e}nyi, one can prepare only
192:         two information measures: 
193: 	Shannon and R\'{e}nyi entropy. Using this formalism
194:         R\'{e}nyi characterized these additive entropies in terms of
195:         axioms of KN-averages.
196: 
197: 	One can apply R\'{e}nyi's recipe in the nonextensive case by
198:         replacing the linear averaging in
199: 	Tsallis entropy with KN-averages and thereby imposing the
200: 	constraint of 
201: 	pseudo-additivity. 
202: 	A natural question arises: what are all the pseudo-additive
203: 	information measures one can prepare with this recipe? We
204: 	prove that only Tsallis entropy is possible in this case,
205: 	which allows us to characterize
206: 	Tsallis entropy based on axioms of KN-averages.
207: 
208: %	Tsallis and R{\'{e}}nyi entropy measures are two possible
209: %	different generalization of the Shannon entropy but are not
210: %	generalizations of each other.
211: 
212: 	To understand these generalizations, the so-called Hartley
213:         function~\cite{Hartley:1928:TransmissionOfInformation} of a
214:         single stochastic event plays a fundamental role. We discuss
215:         the Hartley function in
216:         \S~\ref{Section:KN-avearagesAndInformationMeasures} along 
217:         with a brief discussion of quasilinear means and R\'{e}nyi
218:         entropy. The main results of this paper, the uniqueness of Tsallis
219:         entropy under R\'{e}nyi's recipe and a
220:         characterization of Tsallis entropy, are presented in
221:         \S~\ref{Section:RenyisRecipieAndTsallisEntropy} and 
222:         \S~\ref{Section:AcharacterizationTheoremForTsallisEntropy}
223:         respectively.  
224: 
225: %====================================================================
226: \section{KN-averages and Information measures}
227: \label{Section:KN-avearagesAndInformationMeasures}
228: 
229:   \subsection{Hartley Function and Shannon Entropy}
230: 
231: 	Let $X$ be a discrete random variable (r.v) defined on some
232: 	probability space, which takes only $n$ values, $n < \infty$.  
233: 	We denote the set of all such random
234: 	variables by $\mathcal{X}$. Corresponding
235: 	to the $n$-tuple $(x_{1}, \ldots, x_{n})$ of values which $X$
236: 	takes, the probability mass function (pmf) of
237: 	$X$ is denoted by $p = (p_{1}, \ldots, p_{n})$, where $p_{k}
238: 	\geq 0$ for $k = 1, \ldots, n$ and $\sum_{k=1}^{n} p_{k}
239: 	=1$. The expectation of r.v $X$ is denoted by $EX$ or $\langle X
240: 	\rangle$; in this paper we use both notations
241: 	interchangeably. 
242: 
243: 	Shannon entropy, a logarithmic measure of information on $X$ denoted by $S(X)$,
244: 	reads~\cite{Shannon:1948:MathematicalTheoryOfCommunication_BellLabs} 
245: 	\begin{equation}
246: 	\label{Equation:DefinitionOfShannonEntropy}
247: 	S(X) = - \sum_{k=1}^{n} p_{k} \ln p_{k} \enspace,		
248: 	\end{equation}
249: 	and measures the average lack of information that is
250: 	inherent in $p$. 
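
	As a numerical illustration, with a distribution chosen here purely
	as an example, let $n = 3$ and $p = (\frac{1}{2}, \frac{1}{4},
	\frac{1}{4})$. Then (\ref{Equation:DefinitionOfShannonEntropy}) gives
	\begin{displaymath}
	S(X) = \frac{1}{2} \ln 2 + \frac{1}{4} \ln 4 + \frac{1}{4} \ln 4
	= \frac{3}{2} \ln 2 \approx 1.04 \enspace,
	\end{displaymath}
	which is slightly below $\ln 3 \approx 1.10$, the value obtained for
	the uniform distribution on three points.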
251: 
252: 	The motivation to quantify information in terms of logarithmic
253: 	functions is due to
254: 	Hartley~\cite{Hartley:1928:TransmissionOfInformation}, who
255: 	first used a logarithmic function to define the uncertainty
256: 	associated with a finite set.
257: 	This is known as the Hartley information measure.
258:         The Hartley information measure  of a 
259:         finite set $A$ with $n$ elements is defined as
260:         $H(A) = \log_{b} n$.
261:         If the base of the logarithm is $2$, then the uncertainty is
262:         measured in {\em bits}, and in the case of the natural logarithm,
263: 	the unit is {\em nats}. Throughout this paper we use only the natural
264: 	logarithm. 
265: 
266: 	The Hartley information measure, which is the special case of
267: 	Shannon entropy for a uniform distribution, can be defined more
268: 	generally as follows. Define a function $H:
269: 	\{x_{1}, \ldots, x_{n}  \} \rightarrow \mathbb{R}$ of the
270: 	values taken by r.v $X \in \mathcal{X}$ with corresponding
271: 	p.m.f $p = (p_{1}, \ldots, p_{n})$ 
272: 	as~\cite{AczelDaroczy:1975:OnMeasuresOfInformationAndTheirCharacterization} 
273: 	\begin{equation}
274: 	\label{Equation:HartleyFunctionForRV}
275: 	H(x_{k}) = \ln \frac{1}{p_{k}} \enspace,\:\: \forall k = 1, \ldots, n.
276: 	\end{equation}
277: 	$H$ is also known as the entropy of a single event and plays an
278: 	important role in all classical measures of information. It can be
279: 	interpreted either as a measure of how unexpected the event was,
280: 	or as a measure of the information yielded by the event.
281: 	The Hartley function satisfies: (i) $H$ is {\em
282: 	nonnegative}: $H(x_{k})  \geq 0$; (ii) $H$ is {\em additive}:
283: 	$H(x_{i}x_{j}) = H(x_{i}) + H(x_{j})$; (iii) $H$ is {\em
284: 	normalized}:  $H(x_{k}) = 1$ whenever $p_{k} = \frac{1}{e}$
285: 	(in the case of the logarithm with 
286: 	base $2$, the same is satisfied for $p_{k} = \frac{1}{2}$). These properties
287: 	are both necessary and
288: 	sufficient~\cite{AczelDaroczy:1975:OnMeasuresOfInformationAndTheirCharacterization}. 
289: 
290: 	Now, Shannon
291: 	entropy~(\ref{Equation:DefinitionOfShannonEntropy}) can be
292: 	written as the expectation of the Hartley 
293: 	function as   
294: 	\begin{equation}
295: 	\label{Equation:Definition_ShannonEntropy}
296: 	     S (X) = {\langle H \rangle} = \sum_{k=1}^{n} p_{k} H_{k} \enspace,
297: 	\end{equation}
298: 	where $H_{k} = H(x_{k}),\: \forall k = 1, \ldots, n$, with the
299: 	understanding that ${\langle H \rangle} = {\langle H(X)
300: 	\rangle}$.
301: 	
302: 	The characteristic additive property of Shannon entropy
303: 	\begin{equation}
304: 	\label{Equation:AdditivityOfShannonEntropy}
305: 	     S(X \times Y) = S(X) + S(Y) \enspace,
306: 	\end{equation}
307: 	for two independent random variables $X$ and
308: 	$Y$ now follows as a consequence of the additivity property of
309: 	Hartley function. 
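
	The step from additivity of the Hartley function to
	(\ref{Equation:AdditivityOfShannonEntropy}) can be sketched as
	follows. Write $r = (r_{1}, \ldots, r_{m})$ for the p.m.f of $Y$ (a
	notation introduced here only for illustration), so that, by
	independence, the joint outcome $(x_{i}, y_{j})$ has probability
	$p_{i} r_{j}$. Then
	\begin{eqnarray*}
	S(X \times Y) &=& \sum_{i=1}^{n} \sum_{j=1}^{m} p_{i} r_{j}
	\left( \ln \frac{1}{p_{i}} + \ln \frac{1}{r_{j}} \right) \\
	&=& \sum_{i=1}^{n} p_{i} \ln \frac{1}{p_{i}} + \sum_{j=1}^{m} r_{j}
	\ln \frac{1}{r_{j}} = S(X) + S(Y) \enspace,
	\end{eqnarray*}
	where the second line uses $\sum_{i} p_{i} = \sum_{j} r_{j} = 1$.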
310: 
311: 	There are two postulates involved in defining Shannon entropy
312: 	as expectation of Hartley function. One is the additivity of
313: 	information which is the characteristic property of Hartley
314: 	function, and the other is
315: 	that if different amounts of information occur with different
316: 	probabilities, the total information will be the
317: 	average of the individual informations weighted by the
318: 	probabilities of their occurrences. 
319: 
320: 	The basic idea behind R\'{e}nyi's generalization is that any
321:         putative candidate for an entropy should be a mean,
322:         together with the well-known
323:         fact in mathematics   
324:         that the linear mean, though most widely used, is not the only
325:         possible way of averaging: one can define a mean with
326:         respect to an arbitrary continuous and strictly monotone
327:         function. Here we briefly discuss
328:         generalized averages and their properties that are essential for
329:         the results we present in this paper.
330: 
331:   %-----------------------------------------------------------------
332:   \subsection{Kolmogorov-Nagumo Averages or Quasilinear Mean}
333: 
334:         In the general theory of means, the quasilinear mean of a random variable
335:         $X$ is defined as{\footnote{Kolmogorov~\cite{Kolmogorov:1930:SurLaNotionDeLaMoyenne} and Nagumo~\cite{Nagumo:1930:UberEineKlasseVonMittlewerte}
336:         first characterized the quasilinear mean ${\langle x
337:         \rangle}_{\psi}$ for a vector $(x_{1}, \ldots,
338:         x_{n})$ as ${\langle x \rangle}_{\psi} =
339:         \psi^{-1}\left(\sum_{k=1}^{n} \frac{1}{n} \psi(x_{k})\right)$
340:         where $\psi$ is a continuous and strictly monotone
341:         function. De Finetti~\cite{DeFinetti:1931:SulConcettoDiMedia}
342:         extended their result to the case of simple (finite)
343:         probability distributions. The version of the quasilinear mean
344:         representation theorem referred to in
345:         \S~\ref{Section:AcharacterizationTheoremForTsallisEntropy} is
346:         due to Hardy, Littlewood and
347:         P{\'{o}}lya~\cite{HardyLittlewoodPolya:1934:Inequalities}, which
348:         followed closely the approach of de
349:         Finetti. Acz{\'{e}}l~\cite{Aczel:1948:OnMeanValues} proved a
350:         characterization of the quasilinear mean using functional
351:         equations.
352:         Ben-Tal~\cite{Ben-Tal:1977:OnGeneralizedMeansAndGeneralizedConvexFucntions}
353:         showed that quasilinear means are ordinary arithmetic means
354:         under suitably defined addition and scalar multiplication
355:         operations.
356:         Norris~\cite{Norris:1976:GeneralMeansAndStatisticalTheory} gave
357:         a survey of quasilinear means and their more restrictive forms in
358:         statistics. A more recent survey of generalized means can be
359:         found
360:         in~\cite{OstasiewiczOstasiewicz:2000:MeansAndTheirAppliacations}.
361: 	Applications of quasilinear means can be found in economics
362:         (for example, 
363:         \cite{EpsteinZin:1989:SubstitutionRisk_SecondaryRef}) and 
364:         decision theory (for example,
365:         \cite{KrepsPorteus:1978:TemporalResolution_SecondaryRef}).
366:         Recently Czachor and
367:         Naudts~\cite{CzachorNaudts:2002:ThermostatisticsBasedOnKolmogorov-NagumoAverages}
368:         studied generalized thermostatistics based on quasilinear means.}%ENDfootnote
369:         \begin{equation}
370:         \label{Equation:Definition_KNaverages}
371:          E_{\psi}X = {\langle X \rangle}_{\psi} = \psi^{-1} \left( \sum_{k=1}^{n}
372:         p_{k} \psi\left(x_{k} \right)    \right) \enspace,
373:         \end{equation}
374:         where $\psi$ is continuous and strictly monotonic (increasing
375:         or decreasing) in which
376:         case it has an inverse $\psi^{-1}$ which satisfies the same
377:         conditions. In the context of generalized means, $\psi$ is
378:         referred to as Kolmogorov-Nagumo 
379:         function or KN-function.
380:         If, in particular, $\psi$ is linear, then 
381:         (\ref{Equation:Definition_KNaverages}) reduces to the
382:         expression of linear averaging,
383:         $EX = {\langle X \rangle} = \sum_{k=1}^{n} p_{k} x_{k}$.
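
        Other familiar means arise from (\ref{Equation:Definition_KNaverages})
        in the same way. As an illustration, assuming all $x_{k} > 0$ so
        that the functions below are continuous and strictly monotone on
        the relevant range, one finds
        \begin{eqnarray*}
        \psi(x) = \ln x &:& \quad {\langle X \rangle}_{\psi} =
        \prod_{k=1}^{n} x_{k}^{p_{k}} \quad \mbox{(geometric mean)} \enspace, \\
        \psi(x) = \frac{1}{x} &:& \quad {\langle X \rangle}_{\psi} =
        \left( \sum_{k=1}^{n} \frac{p_{k}}{x_{k}} \right)^{-1} \quad
        \mbox{(harmonic mean)} \enspace, \\
        \psi(x) = x^{2} &:& \quad {\langle X \rangle}_{\psi} =
        \left( \sum_{k=1}^{n} p_{k} x_{k}^{2} \right)^{1/2} \quad
        \mbox{(root mean square)} \enspace.
        \end{eqnarray*}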
384: 
385:         The following theorem justifies calling ${\langle X \rangle}_{\psi}$ a mean.
386: 	%THEOREM:KN-average as a Mean----
387:         \begin{theorem}
388:         \label{Theorem:KN:KNaverageAsMean}
389:                 If  $\psi$ is continuous and strictly monotone in
390:                         $a \leq x \leq b$, $a \leq x_{k} \leq b,\:\:\:
391:         k = 1, \ldots, n$, $p_{k} > 0 $ and $\sum_{k=1}^{n} p_{k} =1 $,
392:         then
393:                 there exists a unique $x_{0} \in (a,b)$ such that
394:                 \begin{displaymath}
395:                  \psi(x_{0}) = \sum_{k=1}^{n} p_{k} \psi(x_{k})
396:                 \end{displaymath}
397:                 and $x_{0}$ is greater than some and less than
398:                 others of the $x_{k}$ unless all $x_{k}$ are equal.
399:         \end{theorem}
400: 
401:         Thus, the mean ${\langle \, . \,\rangle}_{\psi}$ is determined when the
402:         function $\psi$ is given. We may ask whether the converse is
403:         true: if ${\langle X \rangle}_{\psi_{1}} ={\langle
404:         X \rangle}_{\psi_{2}} $ for all $X \in \mathcal{X}$, is
405:         $\psi_{1}$ 
406:         necessarily the same function as $\psi_{2}$?  
407:         First we give the following definition.
408:         %DEFINITION:Equivalent Mean-----
409:         \begin{definition}
410:         \label{Definition:KNequivalentFunctions}
411:         Continuous and strictly monotone functions $\psi_{1}$ and $\psi_{2}$ are
412:         said to be {\em KN-equivalent} if ${\langle X \rangle}_{\psi_{1}} =
413:         {\langle X \rangle}_{\psi_{2}}$ for all $X \in \mathcal{X}$.
414:         \end{definition}
415:         Note that when we compare two means, it is to be understood
416:         that the underlying probabilities are the same. The following
417:         theorem characterizes KN-equivalent functions.
418:         %THEOREM:Condition for KN-equivalent Functions
419:         \begin{theorem}
420:         \label{Theorem:ConditionForKNequivalentFuntions}
421:         In order that two continuous and strictly monotone functions
422:         $\psi_{1}$ and $\psi_{2}$ are KN-equivalent, it is necessary and sufficient
423:         that
424:         \begin{displaymath}
425:                 \psi_{1} = \alpha \psi_{2} + \beta \enspace,
426:         \end{displaymath}
427:         where $\alpha$ and $\beta$ are constants and $\alpha \neq 0$.
428:         \end{theorem}
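
        The sufficiency part can be checked directly; the following is
        only a sketch of that easy direction. If $\psi_{1} = \alpha
        \psi_{2} + \beta$ with $\alpha \neq 0$, then $\psi_{1}^{-1}(y) =
        \psi_{2}^{-1}\left( (y - \beta)/\alpha \right)$, and since
        $\sum_{k=1}^{n} p_{k} = 1$,
        \begin{displaymath}
        {\langle X \rangle}_{\psi_{1}} = \psi_{2}^{-1} \left(
        \frac{\sum_{k=1}^{n} p_{k} \left( \alpha \psi_{2}(x_{k}) + \beta
        \right) - \beta}{\alpha} \right) = \psi_{2}^{-1} \left(
        \sum_{k=1}^{n} p_{k} \psi_{2}(x_{k}) \right) = {\langle X
        \rangle}_{\psi_{2}} \enspace.
        \end{displaymath}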
429:         
430:         \begin{corollary}
431:         Let $\psi$ be a KN-function. Then ${\langle X \rangle}_{\psi} =
432:         {\langle X \rangle}_{-\psi}$ .
433:         \end{corollary}
434:         Hence, whenever required, without loss of generality, one
435:         can assume that $\psi$ is an increasing function. 
436:         The following theorem characterizes additivity of quasilinear means.
437:         \begin{theorem}
438:         \label{Theorem:AdditivityOfKNaverages}
439:         Let $\psi$ be a KN-function and $c$ be a real constant. Then
440:         ${\langle X + c\rangle}_{\psi} = {\langle X \rangle}_{\psi} +
441:         c$ i.e.,
442:         \begin{displaymath}
443:          \psi^{-1} \left( \sum_{k=1}^{n}
444:         p_{k} \psi\left(x_{k} + c \right) \right) = \psi^{-1} \left( \sum_{k=1}^{n}
445:         p_{k} \psi\left(x_{k} \right) \right) + c
446:         \end{displaymath}
447:         if and only if $\psi$ is either linear or exponential.
448:         \end{theorem}
449: 	Proofs of
450:         Theorems~\ref{Theorem:KN:KNaverageAsMean},
451:         \ref{Theorem:ConditionForKNequivalentFuntions} and
452:         \ref{Theorem:AdditivityOfKNaverages}  
453:         can be found in the book on inequalities by Hardy, Littlewood and
454:         P{\'{o}}lya~\cite{HardyLittlewoodPolya:1934:Inequalities}.  
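
        The exponential case of
        Theorem~\ref{Theorem:AdditivityOfKNaverages} is easy to verify by
        hand. As a sketch, take $\psi(x) = e^{\lambda x}$ with a constant
        $\lambda \neq 0$ (the symbol $\lambda$ is introduced here only for
        this check), so that $\psi^{-1}(y) = \frac{1}{\lambda} \ln y$ and
        \begin{displaymath}
        {\langle X + c \rangle}_{\psi} = \frac{1}{\lambda} \ln \left(
        e^{\lambda c} \sum_{k=1}^{n} p_{k} e^{\lambda x_{k}} \right) = c +
        \frac{1}{\lambda} \ln \left( \sum_{k=1}^{n} p_{k} e^{\lambda
        x_{k}} \right) = {\langle X \rangle}_{\psi} + c \enspace.
        \end{displaymath}
        By contrast, $\psi(x) = x^{2}$ on $x > 0$ fails the test: for
        $p = (\frac{1}{2}, \frac{1}{2})$ and values $(1, 7)$ the mean is
        $5$, whereas for the shifted values $(2, 8)$ it is $\sqrt{34}
        \approx 5.83$ rather than $6$.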
455: 
456:   %-----------------------------------------------------        
457:   \subsection{R\'{e}nyi Entropy}
458: 
459:         In the definition of Shannon entropy
460:         (\ref{Equation:Definition_ShannonEntropy}), if the standard
461:         mean 
462:         of Hartley function $H$
463:         is replaced with the quasilinear
464:         mean~(\ref{Equation:Definition_KNaverages}), one can obtain a
465:         generalized measure of information of r.v $X$ with respect to
466:         a KN-function $\psi$ as
467:         \begin{equation}
468: 	\label{Equation:QuasilinearEntropy}
469:         S_{\psi}(X) = \psi^{-1} \left(\sum_{k=1}^{n} p_{k} \psi \left(
470:         \ln \frac{1}{p_{k}} \right) \right) = \psi^{-1}
471:         \left(\sum_{k=1}^{n} p_{k} \psi \left( 
472:         H_{k} \right) \right) \enspace,
473:         \end{equation}
474:         where $\psi$ is a KN-function. We refer to
475:         (\ref{Equation:QuasilinearEntropy}) as quasilinear entropy
476:         with respect to the KN-function $\psi$.
477:         If we impose the constraint of additivity on $S_{\psi}$, then
478:         $\psi$  should
479:         satisfy~\cite{Renyi:1960:SomeFundamentalQuestionsOfInformationTheory}  
480:         \begin{equation}
481:         \label{Equation:AdditivityEquationForKNaverages}
482:         {\langle X + c \rangle}_{\psi} = {\langle X \rangle}_{\psi} +
483:         c \enspace, 
484:         \end{equation}
485:         for any random variable $X \in \mathcal{X}$ and a constant
486:         $c$. 
487: 
488:         R\'{e}nyi employed this formalism to define a
489:         one-parameter family 
490:         of measures of information ($\alpha$-entropies) as follows:
491: 	%Equation: Definition of Renyi entropy
492:         \begin{equation}
493: 	\label{Equation:Definition_RenyiEntropy}
494:         S_{\alpha}(X) = \frac{1}{1-\alpha} \ln \left(\sum_{k=1}^{n}
495:         p_{k}^{\alpha} \right) \enspace,
496:         \end{equation}
497:         where the KN-function $\psi$ is chosen in 
498:         (\ref{Equation:QuasilinearEntropy}) as   
499:         $\psi(x) = e^{(1-\alpha)x}$, a choice motivated by 
500:         Theorem~\ref{Theorem:AdditivityOfKNaverages}. If we choose
501:         $\psi$ as a 
502:         linear function in quasilinear
503:         entropy~(\ref{Equation:QuasilinearEntropy}), we recover
504:         Shannon entropy.  
505:         R\'{e}nyi entropy is a
506:         one-parameter generalization of Shannon entropy in the sense
507:         that the limit $\alpha \rightarrow 1$ in
508:         (\ref{Equation:Definition_RenyiEntropy}) retrieves Shannon
509:         entropy. 
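
        Both statements can be made explicit in a few lines; the
        following is a sketch assuming $\alpha \neq 1$ and $p_{k} > 0$.
        With $\psi(x) = e^{(1-\alpha)x}$ we have $\psi \left( \ln
        \frac{1}{p_{k}} \right) = p_{k}^{\alpha - 1}$ and $\psi^{-1}(y) =
        \frac{1}{1-\alpha} \ln y$, so that
        (\ref{Equation:QuasilinearEntropy}) becomes
        \begin{displaymath}
        S_{\psi}(X) = \frac{1}{1-\alpha} \ln \left( \sum_{k=1}^{n} p_{k}
        \, p_{k}^{\alpha - 1} \right) = \frac{1}{1-\alpha} \ln \left(
        \sum_{k=1}^{n} p_{k}^{\alpha} \right) \enspace,
        \end{displaymath}
        which is (\ref{Equation:Definition_RenyiEntropy}). For the limit
        $\alpha \rightarrow 1$, both $\ln \sum_{k} p_{k}^{\alpha}$ and
        $1 - \alpha$ vanish at $\alpha = 1$, and l'H\^{o}pital's rule gives
        \begin{displaymath}
        \lim_{\alpha \rightarrow 1} S_{\alpha}(X) = \lim_{\alpha
        \rightarrow 1} \frac{\sum_{k=1}^{n} p_{k}^{\alpha} \ln
        p_{k}}{- \sum_{k=1}^{n} p_{k}^{\alpha}} = - \sum_{k=1}^{n} p_{k}
        \ln p_{k} = S(X) \enspace.
        \end{displaymath}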
510: 
511:         %applications
512:         Despite its formal origin, R\'{e}nyi entropy has proved important
513:         in a variety of practical applications in coding
514:         theory~\cite{AczelDaroczy:1975:OnMeasuresOfInformationAndTheirCharacterization},
515:         statistical
516:         inference~\cite{ArimitsuArimitsu:2000:TsallisStatisticsAndTurbulence_SecondaryRef,ArimitsuArimitsu:2001:AnalysisOfTurbulence_SecondaryRef}, quantum 
517:         mechanics~\cite{MaassenUffink:1988:GeneralizedEntropicUncertaintyRelations}, and
518:         chaotic dynamical
519:         systems~\cite{HalseyJensenKadanoffProcacciaShraiman:1986:FractalMeasuresAndThierSingularities}.
520:         Thermodynamic properties of systems with multi-fractal 
521:         structures have been studied by extending the notion of
522:         Gibbs-Shannon entropy to a more general framework, namely R\'{e}nyi
523:         entropy~\cite{JizbaArimitsu:2004:ObservabilityOfRenyiEntropy}.
524: 
525: %=============================================================
526: \section{R\'{e}nyi's Recipe and Tsallis Entropy}
527: \label{Section:RenyisRecipieAndTsallisEntropy}
528: 
529:   %--------------------------------------------------
530:   \subsection{Tsallis Entropy}
531: 
532: 	Due to an increasing interest in long-range correlated systems
533: 	and non-equilibrium phenomena, there has recently been much
534: 	focus on the Tsallis (or nonextensive)
535: 	entropy. Although first introduced by Havrda and Charv\'{a}t
536: 	\cite{HavrdaCharvat:1967:QuantificationMethodOfClassificationProcess}
537: 	in the context of cybernetics theory 
538:         and later studied by
539: 	Dar{\'{o}}czy~\cite{Daroczy:1970:GeneralizedInformationFunctions},
540: 	it was 
541: 	Tsallis~\cite{Tsallis:1988:GeneralizationOfBoltzmannGibbsStatistics}
542: 	who exploited its nonextensive features and placed it in a
543: 	physical setting. Hence it is also known as
544: 	Havrda-Charv\'{a}t-Dar\'{o}czy-Tsallis entropy. Throughout this
545: 	paper we refer to this as Tsallis or nonextensive
546: 	entropy. The Tsallis entropy of an r.v $X \in \mathcal{X}$ with p.m.f
547: 	$p=(p_{1}, \ldots, p_{n})$ is defined as
548: 	\begin{equation}
549: 	\label{Equation:Definition_TsallisEntropy}
550: 	  S_{q}(X) = \frac{1 - \sum_{k=1}^{n} p_{k}^{q}}{q-1} \enspace,
551: 	\end{equation}
552: 	where $q >0$ is called the nonextensive index.
553: 	%($q$ is positive in
554: 	%order to ensure the concavity of $S_{q}$).
555: 	Like R\'{e}nyi entropy, Tsallis entropy is a
556: 	one-parameter generalization of 
557: 	Shannon entropy in the sense that the limit $q \rightarrow 1$ in
558: 	(\ref{Equation:Definition_TsallisEntropy}) retrieves Shannon
559: 	entropy. Tsallis entropy is
560: 	concave for all $q > 0$, but R\'{e}nyi entropy is concave only
561: 	for $0 < \alpha < 1 $.  The index $q$ characterizes the
562: 	degree of 
563: 	nonextensivity reflected in the pseudo-additivity property
564: 	\begin{equation}
565: 	\label{Equation:PseudoAdditivityOfTsallisEntropy}
566: 	S_{q}(X \times Y) = S_{q}(X) \oplus_{q} S_{q}(Y) = S_{q}(X) + S_{q}(Y) +
567: 	(1-q) S_{q}(X) S_{q}(Y) \enspace,
568: 	\end{equation}
569: 	where $X,Y \in \mathcal{X}$ are two independent random variables.
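
	The pseudo-additivity
	(\ref{Equation:PseudoAdditivityOfTsallisEntropy}) can be verified
	directly. As a sketch for $q \neq 1$, write $r = (r_{1}, \ldots,
	r_{m})$ for the p.m.f of $Y$ and use the shorthand $A =
	\sum_{i} p_{i}^{q}$, $B = \sum_{j} r_{j}^{q}$ (notation introduced
	only for this check). Independence gives $\sum_{i,j} (p_{i}
	r_{j})^{q} = AB$, while
	(\ref{Equation:Definition_TsallisEntropy}) gives $A = 1 + (1-q)
	S_{q}(X)$ and $B = 1 + (1-q) S_{q}(Y)$. Hence
	\begin{displaymath}
	S_{q}(X \times Y) = \frac{AB - 1}{1-q} = S_{q}(X) + S_{q}(Y) +
	(1-q) S_{q}(X) S_{q}(Y) \enspace.
	\end{displaymath}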
570: 
571: 
572:   %----------------------------------------------------------
573:   \subsection{Nongeneralizability of Tsallis Entropy}
574: 
575: 	Though the original derivation of Tsallis entropy
576: 	in 1988~\cite{Tsallis:1988:GeneralizationOfBoltzmannGibbsStatistics} is slightly different, one can understand this
577: 	generalization using the $q$-logarithm
578: 	function (see~(\ref{Equation:Definition_q-Logorithm})), where
579: 	one first replaces the logarithm in the 
580: 	Hartley information with the $q$-logarithm and defines the $q$-Hartley
581: 	function $\widetilde{H}: \{x_{1}, \ldots, x_{n}\} \rightarrow
582: 	\mathbb{R}$ of r.v $X$ as
583: 	~\cite{Tsallis:1999:NonextensiveStatisticalMechanics} 
584: 	\begin{equation}
585: 	\label{Equation:Definition_q-HartleyInformationMeasure}
586: 	\widetilde{H}_{k}=\widetilde{H}(x_{k}) = \ln_{q}
587: 	\frac{1}{p_{k}}\enspace, \quad k=1,\ldots, n \enspace.
588: 	\end{equation}
589: 	The $q$-logarithm
590: 	in~(\ref{Equation:Definition_q-HartleyInformationMeasure}) is
591: 	defined as 
592: 	\begin{equation}
593: 	\label{Equation:Definition_q-Logorithm}
594: 	\ln_{q}(x) = \frac{x^{1-q}-1}{1-q} \enspace,
595: 	\end{equation}
596: 	which satisfies pseudo-additivity of the form
597: 	$\ln_{q}(xy)=\ln_{q}x \oplus_{q}
598: 	\ln_{q}y$ and in the limit $q \to 1$, we have $\ln_{q} x \to \ln x$.
599: 	Now Tsallis entropy
600: 	(\ref{Equation:Definition_TsallisEntropy}) 
601: 	can be defined as the expectation of $q$-Hartley function $\widetilde{H}$
602: 	as 
603: 	\begin{equation}
604: 	\label{Equation:Definition_TsallisEntropy_2}
605: 	S_{q}(X) = {\left\langle \widetilde{H} \right\rangle} \enspace.
606: 	\end{equation}
607: 	Note that the characteristic pseudo-additivity property of Tsallis
608: 	entropy~(\ref{Equation:PseudoAdditivityOfTsallisEntropy}) 
609: 	is a consequence of the additivity property of the Hartley
610: 	function, which translates into pseudo-additivity of the $q$-Hartley function. 
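
	Both facts used above can be checked in a few lines; what follows
	is a sketch for $q \neq 1$. From
	(\ref{Equation:Definition_q-Logorithm}) we have $1 + (1-q) \ln_{q}
	x = x^{1-q}$, and a direct expansion shows $1 + (1-q)(a \oplus_{q}
	b) = (1 + (1-q)a)(1 + (1-q)b)$ for any reals $a$ and $b$. Therefore
	\begin{displaymath}
	1 + (1-q) \left( \ln_{q} x \oplus_{q} \ln_{q} y \right) = x^{1-q}
	y^{1-q} = (xy)^{1-q} \enspace,
	\end{displaymath}
	that is, $\ln_{q} x \oplus_{q} \ln_{q} y = \ln_{q}(xy)$. Similarly,
	\begin{displaymath}
	{\left\langle \widetilde{H} \right\rangle} = \sum_{k=1}^{n} p_{k}
	\, \frac{p_{k}^{q-1} - 1}{1-q} = \frac{\sum_{k=1}^{n} p_{k}^{q} -
	1}{1-q} = \frac{1 - \sum_{k=1}^{n} p_{k}^{q}}{q-1} \enspace,
	\end{displaymath}
	which agrees with (\ref{Equation:Definition_TsallisEntropy}).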
611: 
612: 	Before we present the main results of this paper, we briefly
613: 	discuss the context of quasilinear means where there is a
614: 	relation between Tsallis and R\'{e}nyi entropy.
615: 	The $q$-Hartley function can be written as
616: 	\begin{displaymath}
617: 	\widetilde{H}_{k} = \ln_{q} \frac{1}{p_{k}} = \phi_{q}(H_{k})\enspace,
618: 	\end{displaymath}
619: 	where
620: 	 \begin{equation}
621:         \label{Equation:KN:ModfiedKNfunction}
622:         \phi_{q}(x) = \frac{e^{(1-q)x} -1}{1 - q} =
623: 	\ln_{q}(e^{x}) \enspace. 
624:         \end{equation}
625: 	Note that $\phi_{q}$ is KN-equivalent to $e^{(1-q)x}$
626: 	(by Theorem~\ref{Theorem:ConditionForKNequivalentFuntions}), the 
627: 	KN-function used in R\'{e}nyi entropy. Hence 
628: 	Tsallis entropy is related to R\'{e}nyi entropy as 
629: 	\begin{equation}
630:         \label{Equation:RelationBetweenTsallisAndRenyi_ViaKN}
631: 	S_{q}^{\mbox{T}} = \phi_{q}(S_{q}^{\mbox{R}}) \enspace,
632: 	\end{equation}
633: 	where $S_{q}^{\mbox{T}}$ and $S_{q}^{\mbox{R}}$ denote the
634: 	Tsallis and R\'{e}nyi entropy respectively with a real number
635: 	$q$ as a parameter.
636: 	Hence, Tsallis entropy and R\'{e}nyi entropy are monotonic
637: 	functions of each other and, as a result, both must be
638: 	maximized by the same probability distribution. 
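
	The relation
	(\ref{Equation:RelationBetweenTsallisAndRenyi_ViaKN}) can be
	confirmed by substitution. Taking $S_{q}^{\mbox{R}} =
	\frac{1}{1-q} \ln \sum_{k} p_{k}^{q}$, that is,
	(\ref{Equation:Definition_RenyiEntropy}) with $\alpha = q$, we get
	\begin{displaymath}
	\phi_{q}(S_{q}^{\mbox{R}}) = \frac{\exp \left( \ln \sum_{k=1}^{n}
	p_{k}^{q} \right) - 1}{1-q} = \frac{\sum_{k=1}^{n} p_{k}^{q} -
	1}{1-q} = S_{q}^{\mbox{T}} \enspace.
	\end{displaymath}
	Monotonicity follows from $\phi_{q}'(x) = e^{(1-q)x} > 0$. As a
	numerical example, with values chosen here only for illustration,
	let $p = (\frac{1}{2}, \frac{1}{2})$ and $q = 2$. Then
	$S_{2}^{\mbox{R}} = \ln 2$ and $S_{2}^{\mbox{T}} = \frac{1}{2}$, and
	indeed $\phi_{2}(\ln 2) = 1 - e^{-\ln 2} = \frac{1}{2}$.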
639: 
640: 	A natural question now arises:
641: 	can one generalize Tsallis
642: 	entropy using R\'{e}nyi's recipe, i.e., by replacing the linear average in
643: 	(\ref{Equation:Definition_TsallisEntropy_2}) by KN-averages
644: 	and imposing the 
645: 	condition of pseudo-additivity? This is equivalent to determining
646: 	the KN-function $\psi$ for which the so-called $q$-quasilinear
647: 	entropy, defined as 
648: 	\begin{equation}
649: 	\label{Equation:Definition_q-QuasilinearEntropy}
650: 	\widetilde{S}_{\psi} (X) = {\left\langle \widetilde{H}
651: 	\right\rangle}_{\psi} = \psi^{-1}
652: 	\left[ \sum_{k=1}^{n} p_{k} \psi \left( \widetilde{H}_{k}
653: 	\right) \right] \enspace,
654: 	\end{equation}
	where $\widetilde{H}_{k} = \widetilde{H}(x_{k})\: \forall k = 
	1, \ldots, n$, satisfies the pseudo-additive property.
657: 
658: 	First, we present the following result which characterizes the
659: 	pseudo-additivity of quasilinear means.
660: 	%THEOREM:Nonextensive Additivity of Two Random Variables
661:         \begin{theorem}
662:         \label{Theorem:NonextensiveAditivityOfTwoRandomVariables}
663: 	Let $X,Y \in \mathcal{X}$ be two independent random
664:         variables. Let $\psi$ be any KN-function. Then 
665: 	\begin{equation}
666: 	\label{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form1}
667: 	{\langle X \oplus_{q} Y \rangle}_{\psi} = {\langle X \rangle}_{\psi} \oplus_{q}{\langle Y \rangle}_{\psi}
668: 	\end{equation}
669: 	if and only if $\psi$ is linear.
670:         \end{theorem}
671: 	%PROOF....
672:         \proof
	Let $p$ and $r$ be the p.m.fs of random variables $X, Y \in
	\mathcal{X}$ respectively.
	Sufficiency is straightforward: if $\psi$ is linear, then
677: 	\begin{displaymath}
678: 	{\langle X \oplus_{q} Y \rangle}_{\psi} = {\langle X
679: 	\oplus_{q} Y \rangle} = \sum_{i=1}^{n} \sum_{j=1}^{n}
680: 	p_{i}r_{j} (x_{i} \oplus_{q} y_{j}) \enspace,
681: 	\end{displaymath}
682: 	and by the definition of $\oplus_{q}$, we have
683: 	{\setlength\arraycolsep{0pt}
684:         \begin{eqnarray}
685: 	{\langle X \oplus_{q} Y \rangle} &=& \sum_{i=1}^{n} \sum_{j=1}^{n}
686: 	p_{i}r_{j} (x_{i} + y_{j} + (1-q) x_{i} y_{j}) \nonumber\\
687: 	& = & \sum_{i=1}^{n} p_{i} x_{i} + \sum_{j=1}^{n} r_{j} y_{j}
688:         + (1-q) \sum_{i=1}^{n} p_{i} x_{i} \sum_{j=1}^{n} r_{j} y_{j}\enspace.
689: 	\nonumber 
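        \end{eqnarray}}
	The last expression equals ${\langle X \rangle} \oplus_{q} {\langle Y
	\rangle}$, which is
	(\ref{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form1})
	for linear $\psi$.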
691: 
692: 	To prove the converse, we need to determine all forms of $\psi$ which
693: 	satisfy
694:         \begin{equation}
695:         \label{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form2}
696:         \psi^{-1} \left(\sum_{i=1}^{n} \sum_{j=1}^{n} p_{i}r_{j} 
697:         \psi \left( x_{i} \oplus_{q} y_{j}
698:         \right)  \right)  
699:          = \psi^{-1} \left(\sum_{i=1}^{n} p_{i} \psi \left( x_{i}
700:         \right)  \right) \oplus_{q} \psi^{-1} \left(\sum_{j=1}^{n}
701:         r_{j} \psi \left( y_{j} \right)  \right) \enspace.
702:         \end{equation}
703: 
704:         Since~(\ref{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form2})
        must hold for arbitrary p.m.fs $p$, $r$ and for arbitrary
706:         numbers  
707:         $\{x_{1}, \ldots, x_{n}\}$ and $\{y_{1}, \ldots, y_{n}\}$, one
708:         can choose $y_{j} = c$ independently of $j$.  Then 
709:         (\ref{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form2})
710:         yields  
711:         \begin{equation}
712:         \label{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form3}
        \psi^{-1} \left(\sum_{i=1}^{n} p_{i}
        \psi \left( x_{i} \oplus_{q} c \right)  \right) =
        \psi^{-1} \left(\sum_{i=1}^{n} p_{i} \psi \left(
716:         x_{i} \right) \right) \oplus_{q} c \enspace.
717:         \end{equation}
718: 	That is, $\psi$ should satisfy
719:         \begin{equation}
720:         \label{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form4}
721:         {\langle X \oplus_{q} c \rangle}_{\psi} = {\langle X
722:         \rangle}_{\psi} \oplus_{q} c \enspace,
723:         \end{equation}
724:         for any $X \in \mathcal{X}$ and any constant $c$. This can be
725:         rearranged as
726:         \begin{displaymath}
727:         {\langle (1 + (1-q) c) X + c \rangle}_{\psi} =
728:           (1 + (1-q) c) {\langle X \rangle}_{\psi} + c 
729:         \end{displaymath}
730: 	by using the definition of $\oplus_{q}$.
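	Explicitly, for each value $x_{i}$ of $X$, the definition of
	$\oplus_{q}$ gives
	\begin{displaymath}
	x_{i} \oplus_{q} c = x_{i} + c + (1-q) x_{i} c = (1 + (1-q) c)\, x_{i}
	+ c \enspace,
	\end{displaymath}
	and likewise ${\langle X \rangle}_{\psi} \oplus_{q} c = (1+(1-q)c)
	{\langle X \rangle}_{\psi} + c$.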
731:         Since  $q$ is independent of other quantities, $\psi$ should
732:         satisfy an equation of the form 
733:         \begin{equation}
734:         \label{Equation:NonextensiveAdditivityOfKN-averages_Condition_Form5}
735:         {\langle dX + c \rangle}_{\psi} = d {\langle X \rangle}_{\psi}
736:         + c \enspace,
737:         \end{equation}
738:         where $d \neq 0$ (by writing $d =(1+(1-q)c)$).
        In particular, taking $d = 1$ and $c = 0$ respectively, $\psi$ must satisfy
740:         \begin{equation}
741:         \label{Equation:NonextensiveAdditivityOfKN-averages_Condition_Sub1}
742:         {\langle X + c \rangle}_{\psi} = {\langle X \rangle}_{\psi} + c
743:         \end{equation}
744:         and 
745:         \begin{equation}
746:         \label{Equation:NonextensiveAdditivityOfKN-averages_Condition_Sub2}
747:         {\langle dX \rangle}_{\psi} = d {\langle X \rangle}_{\psi} \enspace,
748:         \end{equation}
749:         for any $X \in \mathcal{X}$ and any constants $d$, $c$. 
750:         From Theorem~\ref{Theorem:AdditivityOfKNaverages}, the condition 
751:         (\ref{Equation:NonextensiveAdditivityOfKN-averages_Condition_Sub1}) 
752:         is satisfied only when $\psi$ is linear or exponential.
753: 
	To complete the proof we have to show that
755:         KN-averages do not satisfy condition
756:         (\ref{Equation:NonextensiveAdditivityOfKN-averages_Condition_Sub2})
757:         when $\psi$ is exponential.
758: 	For a particular choice of
759:         $\psi(x) = e^{(1- \alpha)x}$, assume that
760:         \begin{equation}
761: 	\label{Equation:ToGetTheContradiction_ForTheTheorem}
762:         {\langle d X \rangle}_{\psi} = d {\langle X
763:         \rangle}_{\psi} \enspace,
764:         \end{equation}
765:         where
766:         \begin{displaymath}
        {\langle d X \rangle}_{\psi} = \frac{1}{1-\alpha} \ln
        \left( \sum_{k=1}^{n} p_{k} e^{(1-\alpha) d x_{k}} \right) \enspace,
        \end{displaymath}
	and
        \begin{displaymath}
        d {\langle X \rangle}_{\psi} = \frac{d}{1-\alpha} \ln
773:         \left( \sum_{k=1}^{n} p_{k} e^{(1-\alpha) x_{k}} \right)  \enspace.
774:         \end{displaymath}
775:         Now define a KN-function $\psi'$  as $\psi'(x) = e^{(1-
776:         \alpha)dx}$, for which 
777:         \begin{displaymath}
778:         {\langle X \rangle}_{\psi'} = \frac{1}{d(1-\alpha)} \ln 
779:         \left( \sum_{k=1}^{n} p_{k} e^{(1-\alpha) d x_{k}} \right) \enspace.
780:         \end{displaymath}
781: 	Condition
782:         (\ref{Equation:ToGetTheContradiction_ForTheTheorem}) implies
783:        	\begin{displaymath}
784:         {\langle X \rangle}_{\psi} = {\langle X \rangle}_{\psi'} \enspace,
785: 	\end{displaymath}
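	for every $X \in \mathcal{X}$, and hence, by 
        Theorem~\ref{Theorem:ConditionForKNequivalentFuntions},
        $\psi$ and $\psi'$ would be 
        KN-equivalent. For $d \neq 1$ this is a contradiction, since
        $e^{(1-\alpha)dx}$ is not an affine function of $e^{(1-\alpha)x}$.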
790: 
791:         \endproof
792: 	%ENDPROOF.....
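
	For a concrete instance of this failure (the values are chosen here
	purely for illustration), take $\alpha = 0$, so that $\psi(x) = e^{x}$,
	let $X$ take the values $0$ and $1$ with probability $\frac{1}{2}$
	each, and let $d = 2$. Then
	\begin{displaymath}
	{\langle 2X \rangle}_{\psi} = \ln \frac{1 + e^{2}}{2} \enspace, \qquad
	2 {\langle X \rangle}_{\psi} = 2 \ln \frac{1 + e}{2} = \ln
	\frac{(1+e)^{2}}{4} \enspace,
	\end{displaymath}
	and
	\begin{displaymath}
	\frac{1 + e^{2}}{2} - \frac{(1+e)^{2}}{4} = \frac{(e-1)^{2}}{4} > 0
	\enspace,
	\end{displaymath}
	so ${\langle 2X \rangle}_{\psi} > 2 {\langle X \rangle}_{\psi}$ and
	(\ref{Equation:NonextensiveAdditivityOfKN-averages_Condition_Sub2})
	fails for this exponential $\psi$.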
793: 
	Note that the above proof avoids solving 
	functional equations, unlike the proof of
	Theorem~\ref{Theorem:AdditivityOfKNaverages} (see
	\cite{AczelDaroczy:1975:OnMeasuresOfInformationAndTheirCharacterization});
	instead, it relies only on
	basic results on KN-averages. 
	The following corollary is an immediate consequence of 
	Theorem~\ref{Theorem:NonextensiveAditivityOfTwoRandomVariables}. 
802: 	%Theorem: Nongeneralizability of Tsallis Entropy------
803:         \begin{corollary}
804:         \label{Corollary:NongenralizabilityOfTsallisEntropy}
	The $q$-quasilinear entropy $\widetilde{S}_{\psi}$ (defined as
	in~(\ref{Equation:Definition_q-QuasilinearEntropy})) with respect to
	a KN-function $\psi$ satisfies pseudo-additivity if 
        and only if $\widetilde{S}_{\psi}$ is the Tsallis entropy.
809:         \end{corollary}
810:         \proof
	Let $X,Y \in \mathcal{X}$ be two independent random variables
	with p.m.fs $p$ and $r$ respectively. 
        By the pseudo-additivity constraint, $\psi$ should satisfy
815:         \begin{equation}
816:         \label{Equation:KNtsallis_PseudoAdditivity_Condition_Form1}
817:         \widetilde{S}_{\psi}(X \times Y) = \widetilde{S}_{\psi}(X) \oplus_{q}
        \widetilde{S}_{\psi}(Y) \enspace.
819:         \end{equation}
        By the property of the $q$-logarithm that $\ln_{q} (x y) = \ln_{q}x
        \oplus_{q} \ln_{q}y$, we need
822:         {\setlength\arraycolsep{0pt}
823:         \begin{eqnarray}
824:         \label{Equation:KNtsallis_PseudoAdditivity_Condition_Form2}
825:         \psi^{-1}  && \left(\sum_{i=1}^{n} \sum_{j=1}^{n} p_{i}r_{j} \psi
826:         \left( \ln_{q} \frac{1}{p_{i}r_{j}}  \right)  \right)  \nonumber\\
827:         && = \psi^{-1} \left(\sum_{i=1}^{n} p_{i} \psi \left( \ln_{q}
828:         \frac{1}{p_{i}}  \right)  \right) \oplus_{q}
829:         \psi^{-1} \left(\sum_{j=1}^{n} r_{j} \psi \left( \ln_{q}
830:         \frac{1}{r_{j}}  \right)  \right) \enspace.
        \end{eqnarray}}
832:         Equivalently, we need
833:         {\setlength\arraycolsep{0pt}
834:         \begin{eqnarray}
835:         \psi^{-1} && \left(\sum_{i=1}^{n} \sum_{j=1}^{n} p_{i}r_{j} 
836:         \psi \left( \widetilde{H}_{i}^{p} \oplus_{q} \widetilde{H}_{j}^{r}
837:         \right)  \right)   \nonumber \\
838:          && = \psi^{-1} \left(\sum_{i=1}^{n} p_{i} \psi \left(
839:         \widetilde{H}_{i}^{p}   \right)  \right) \oplus_{q} 
840:         \psi^{-1} \left(\sum_{j=1}^{n} r_{j} \psi
841:         \left(\widetilde{H}_{j}^{r} \right)  \right) \enspace, \nonumber
        \end{eqnarray}}
843:         where $\widetilde{H}^{p}$ and $\widetilde{H}^{r}$ represent
844:         the $q$-Hartley functions corresponding to probability distributions $p$
845:         and $r$ respectively.
846: 	That is, $\psi$ should satisfy
847: 	\begin{displaymath}
848: 	{\langle \widetilde{H}^{p} \oplus_{q}  \widetilde{H}^{r}
849: 	\rangle}_{\psi}  =  {\langle \widetilde{H}^{p} \rangle}_{\psi}
850: 	\oplus_{q}  {\langle \widetilde{H}^{r} \rangle}_{\psi} \enspace.
851: 	\end{displaymath}
	Hence, by 
	Theorem~\ref{Theorem:NonextensiveAditivityOfTwoRandomVariables},
	$\psi$ is linear, and therefore $\widetilde{S}_{\psi}$ is the Tsallis entropy.
855:         \endproof
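
	The $q$-logarithm identity used in the proof can be checked directly,
	assuming the standard definition $\ln_{q} x = \frac{x^{1-q} -
	1}{1-q}$, which is consistent with
	(\ref{Equation:KN:ModfiedKNfunction}). Writing $a = x^{1-q}$ and
	$b = y^{1-q}$,
	\begin{displaymath}
	\ln_{q} x \oplus_{q} \ln_{q} y
	= \frac{(a-1) + (b-1) + (a-1)(b-1)}{1-q}
	= \frac{ab - 1}{1-q} = \ln_{q} (xy) \enspace.
	\end{displaymath}
	With $x = 1/p_{i}$ and $y = 1/r_{j}$ this gives $\ln_{q}
	\frac{1}{p_{i} r_{j}} = \widetilde{H}_{i}^{p} \oplus_{q}
	\widetilde{H}_{j}^{r}$.
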
	Corollary~\ref{Corollary:NongenralizabilityOfTsallisEntropy}
	shows that, using R\'{e}nyi's recipe in the nonextensive
	case, one can obtain only Tsallis entropy, while in the
	classical case there are two possibilities, namely Shannon and
	R\'{e}nyi entropies.
860: 
861: %=============================================================
862: \section{A Characterization Theorem for Tsallis Entropy}
863: \label{Section:AcharacterizationTheoremForTsallisEntropy}
864: 
	The importance of R\'{e}nyi's formalism for generalizing Shannon
	entropy lies in the characterization of Shannon entropy that it
	provides in terms of the axioms of quasilinear
	means~\cite{Renyi:1960:SomeFundamentalQuestionsOfInformationTheory}.
	Using
	Theorem~\ref{Theorem:NonextensiveAditivityOfTwoRandomVariables},
	one can give a similar
	characterization of 
	Tsallis entropy in terms of the axioms of quasilinear means. For such a
	characterization one assumes that the entropy is the expectation
	of a function of the underlying random variable. In the classical case
	this function is the Hartley function, while in the nonextensive case
	it is the $q$-Hartley function.
878: 
	Since the characterization of quasilinear means is given in terms of
	the cumulative distribution function of a random variable, we use the 
	following definitions and notation.
882: 	
883: 	Let $F:{\mathbb{R}} \rightarrow
884:         {\mathbb{R}}$ denote the cumulative distribution function of
885:         random variable $X \in \mathcal{X}$. Corresponding to a
886:         KN-function $\psi: {\mathbb{R}} \rightarrow {\mathbb{R}}$,
        the generalized mean of $F$ (or $X$) can be written as
888:         \begin{equation}
889:         \label{Equation:KN-averagesInTermsOfCumulativeDistribution}
890:           E_{\psi}(F)= E_{\psi}(X) = {\langle X \rangle}_{\psi} =
891:         \psi^{-1}\left(\int \psi \, \ud 
892:         F \right) \enspace,
893:         \end{equation}
	which is the continuous analogue of
        (\ref{Equation:Definition_KNaverages}) and is axiomatized by
        Kolmogorov, Nagumo and De Finetti (see 
        \cite[Theorem 215]{HardyLittlewoodPolya:1934:Inequalities}) as
        follows.
899: 
900: 
901:         %Theorem: Axioms of Kolmogorov Nagumo Averages
902:         \begin{theorem}
903:         \label{Theorem:AxiomsForKN-averages}
904:         Let $\mathcal{F}_{I}$ be the set of all cumulative
905:         distribution functions defined on some interval $I$ of the
906:         real line ${\mathbb{R}}$. A functional $\kappa:
907:         {\mathcal{F}}_{I} \rightarrow {\mathbb{R}}$ satisfies the
908:         following axioms:
909:         \begin{description}
910:           \item[axiom 1:] $\kappa(\delta_{x}) = x$, where $\delta_{x} \in 
911:         {\mathcal{F}}_{I}$ denotes the step function at
        $x$ (\textit{Consistency with certainty}),
913: 
914:           \item[axiom 2:] $F,G \in
915:           {\mathcal{F}}_{I}$, if $F \leq G $ then $\kappa(F) \leq
916:           \kappa(G)$; the equality holds if and only if $F = G$
917:           (\textit{Monotonicity}) and,
918: 
919: %         \item[axiom 2:] (\textit{Substitution}) $F,G \in
920: %         {\mathcal{F}}_{I}$, if $E(F) = E(G)$ then
921: %         $\forall \beta \in (0,1) \:\: \exist \gamma \in (0,1)$ such
922: %         that $ E(\beta F + (1-\beta)H) = E( \gamma
923: %         G + (1-\gamma)H)$, for any $H \in {\mathcal{F}}_{I}$ 
924: 
925:           \item[axiom 3:] $F,G \in
926:           {\mathcal{F}}_{I}$, if $\kappa(F) = \kappa(G)$ then
927:           $ \kappa(\beta F + (1-\beta)H) = \kappa( \beta
928:           G + (1-\beta)H)$, for any $H \in {\mathcal{F}}_{I}$
929:           (\textit{Quasilinearity})  
930: 
931:         \end{description}
932:         if and only if
933:         there is a continuous strictly monotone function $\psi$ such
934:         that
935:         \begin{displaymath}
936:         \kappa(F) = 
937:         \psi^{-1}\left(\int \psi \, \ud F \right) \enspace.
938:         \end{displaymath}
939:         \end{theorem}
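
        The ``if'' direction can be checked by direct substitution; the
        following sketch covers axioms 1 and 3 only. For the step function
        $\delta_{x}$ we have $\int \psi \, \ud \delta_{x} = \psi(x)$, so
        $\kappa(\delta_{x}) = \psi^{-1}(\psi(x)) = x$. For axiom 3,
        linearity of the integral in the distribution function gives
        \begin{displaymath}
        \int \psi \, \ud (\beta F + (1-\beta) H) = \beta \int \psi \, \ud F +
        (1-\beta) \int \psi \, \ud H \enspace,
        \end{displaymath}
        and $\kappa(F) = \kappa(G)$ implies $\int \psi \, \ud F = \int \psi
        \, \ud G$, so the right-hand side is unchanged when $F$ is replaced
        by $G$.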
940:         
        Modified axioms for the quasilinear mean can be found in
        \cite{Chew:1983:AgeneralizationOfTheQuasilinearMean,Fishburn:1986:ImplicitMeanValues,OstasiewiczOstasiewicz:2000:MeansAndTheirAppliacations}.
943:         Now we give our characterization theorem for Tsallis entropy
944:         that is similar to the
945:         characterization of Shannon entropy given by
946:         R\'{e}nyi~\cite{Renyi:1960:SomeFundamentalQuestionsOfInformationTheory}.  
947:         \begin{theorem}
948:         \label{Theorem:CharacterizationOfTsallisEntropy}
949: 	Let $X \in \mathcal{X}$ be a random variable. An information measure 
        defined as a (generalized) mean $\kappa$ of the $q$-Hartley function of
951:         $X$ is Tsallis entropy if and only if
952:         \begin{enumerate}
          \item $\kappa$ satisfies the axioms of quasilinear means given in 
          Theorem~\ref{Theorem:AxiomsForKN-averages}, and 
955: 
          \item if $X,Y \in \mathcal{X}$ are two independent random
          variables, then
958: 	\begin{displaymath}
959: 	\kappa(X \oplus_{q} Y) =
960:            \kappa(X) \oplus_{q} \kappa(Y) \enspace.
961: 	\end{displaymath}
962:         \end{enumerate}
963:         \end{theorem}
	Theorem~\ref{Theorem:CharacterizationOfTsallisEntropy} is a
          direct consequence of
          Theorems~\ref{Theorem:NonextensiveAditivityOfTwoRandomVariables}
          and \ref{Theorem:AxiomsForKN-averages}. 
          This characterization of Tsallis entropy differs from the
	characterization of Shannon
	entropy given by
	R\'{e}nyi~\cite{Renyi:1960:SomeFundamentalQuestionsOfInformationTheory}
	only in that the additivity constraint is replaced with
	pseudo-additivity. Moreover, it does not make use
	of the postulate $\kappa(H) + \kappa(-H)=0$, which is needed in the
	classical case to
	distinguish Shannon entropy from R\'{e}nyi entropy. This
	is possible because Tsallis entropy is the unique entropy obtained
	from KN-averages under pseudo-additivity.
977: 
978: 
979: %         \proof
980: %         From the Theorem~\ref{Theorem:AxiomsForKN-averages} we have
981: %         \begin{displaymath}
982: %         E(H) = {\langle H \rangle}_{\psi} =
983: %         \psi^{-1}\left(\int \psi \, \ud F \right) \enspace,
984: %         \end{displaymath}
985: %         where $\psi$ is strictly monotone and continuous. From the
986: %         postulate (2) and
987: %         Theorem~\ref{Theorem:NonextensiveAditivityOfTwoRandomVariables} we
988: %         have the remaining proof.
989: %         \endproof
990: 
991: %====================================================================
992: \section{Conclusions}
993: \label{Section:Conclusions}
994: 
	Passing an information measure through R\'{e}nyi's formalism,
	the procedure followed by R\'{e}nyi to generalize Shannon entropy,
	allows one to study its possible generalizations and to
	characterize the information measure in terms of the
	axioms of quasilinear means. In this paper we studied this
	technique for nonextensive entropy and showed that Tsallis
	entropy is unique under R\'{e}nyi's recipe.
	In view of the attempts to study generalized thermostatistics 
	based on
	KN-averages (for example
	\cite{CzachorNaudts:2002:ThermostatisticsBasedOnKolmogorov-NagumoAverages}),
	the results presented in this paper further clarify the 
	relation between entropic measures and generalized averages.
1008: 
1009: \section*{References}
1010: 
1011: \bibliographystyle{unsrt}
1012: \bibliography{papi}
1013: 
1014: 
1015: \end{document}
1016: 
1017: 
1018: 
1019: 
1020: 
1021: