Abstract
In the theory of complex systems, long tailed probability distributions are often discussed. For such a probability distribution, a deformed expectation with respect to an escort distribution is more useful than the standard expectation. In this paper, by generalizing such escort distributions, a sequence of escort distributions is introduced. As a consequence, it is shown that deformed expectations with respect to sequential escort distributions effectively work for anomalous statistics. In particular, it is shown that a Fisher metric on a q-exponential family can be obtained from the escort expectation with respect to the second escort distribution, and a cubic form (or an Amari–Chentsov tensor field, equivalently) is obtained from the escort expectation with respect to the third escort distribution.
Keywords:
escort distribution; escort expectation; statistical manifold; deformed exponential family; Tsallis statistics; information geometry MSC:
53A15; 53B50; 62F99; 94A14
1. Introduction
Long tailed probability distributions and their related probability distributions are important objects in anomalous statistical physics (cf. [1,2,3]). For such long tailed probability distributions, the standard expectation does not exist in general. Therefore, the notion of escort distribution has been introduced [4]. Since an escort distribution gives a suitable weight for tail probability, the escort expectation which is the expectation with respect to an escort distribution is more useful than the standard one.
In anomalous statistics, a deformed exponential function and a deformed logarithm function play essential roles. In fact, a deformed exponential family is an important statistical model in anomalous statistics. Such a statistical model is described by such a deformed exponential function. In particular, the set of all q-normal distributions (or Student’s t-distributions, equivalently) is a q-exponential family, which is described by a q-deformed exponential function [5] (see also [6,7]).
On the other hand, a generalized score function is defined from a deformed logarithm function. In the previous works, the author showed that a deformed score function is unbiased with respect to the escort expectation [8,9]. This implies that a deformed score function is regarded as an estimating function on a deformed exponential family. In addition, in information geometry, it is known that a deformed exponential family has a statistical manifold structure. Then a deformed score function is regarded as a tangent vector on this statistical manifold [6,10]. Therefore, properties of escort expectations are closely related to geometric structures on a deformed exponential family.
In this paper, we introduce a sequence of escort distributions, then we consider a sequential structure of escort expectations. It is known that a deformed exponential family naturally has at least three kind of different statistical manifold structures [6,11]. Then we show that such statistical manifold structures can be obtained from a sequential structure of escort expectations. In particular, we show that a Fisher metric on a q-exponential family can be obtained from the deformed expectations with respect to the second escort distribution, and a cubic form (or an Amari–Chentsov tensor field, equivalently) is obtained from the deformed expectations with respect to the third escort distribution.
This paper is written based on the proceeding paper [7]. However, this paper focuses on deformed expectations of a q-exponential family, whereas the previous paper focused on deformed independences. We remark that several authors have been studying deformed expectations recently. See [12,13], for example.
2. Deformed Exponential Families
In this paper, we assume that all objects are smooth for simplicity. Let us review preliminary facts about deformed exponential functions and deformed exponential families. For more details, see [2,6], for example. Historically, Tsallis [14] introduced the notion of q-exponential function and Naudts [5] introduced the notion of q-exponential family together with a further generalization. Such a historical note is provided in [2].
Let be the set of all positive real numbers, . Let χ be a strictly increasing function from to . We define a χ-logarithm function or a deformed logarithm function by
The inverse of is called a χ-exponential function or a deformed exponential function, which is defined by
where the function is given by .
From now on, we suppose that χ is a power function, that is, . Then the deformed logarithm and the deformed exponential are defined by
We say that is a q-logarithm function and is a q-exponential function. In this case, the function is given by
By taking a limit , these functions coincide with the standard logarithm and the standard exponential , respectively.
A statistical model is called a q-exponential family if
where are functions on a sample space Ω, is a parameter, and is the normalization with respect to the parameter θ. Under suitable conditions, is regarded as a manifold with a local coordinate system . In this case, we call a natural coordinate system.
In this paper, we focus on the q-exponential case. However, many results for the q-exponential family can be generalized for the χ-exponential family (cf. [6,8]). We remark that a q-exponential family and a χ-exponential family have further generalizations. See [15], for example.
Example 1
(Student’s t-distribution (cf. [2,6,7])). Fix a number , and set . We define a d-dimensional Student’s t-distribution with degree of freedom ν or a q-Gaussian distribution by
where is a random vector on , is a location vector on and Σ is a scale matrix on . For simplicity, we assume that Σ is invertible. Otherwise, we should choose a suitable basis on such that . Then, the set of all Student’s t-distributions is a q-exponential family. In fact, setting parameters by
we have
Since and , the set of all Student’s t-distributions is a -dimensional q-exponential family. The normalization is given by
A univariate Student’s t-distribution is a well-known probability distribution in elementary statistics. We denote it by
where is a location parameter, is a scale parameter, and is the normalization defined by
In this case, the degree of freedom is . Conversely, the parameter q is give by
3. Escort Distributions and Generalizations of Expectations
In anomalous statistics, a generalized expectation, called an escort expectation, is often discussed since the standard expectation does not exist in general (cf. [2,5,6]). In this section, we recall generalizations of expectations and introduce a sequential structure of escort distributions.
Let be a q-exponential family. For a given we define the q-escort distribution of and the normalized q-escort distribution by
respectively. For a q-exponential family , the set of normalized escort distributions is a -exponential family with .
Example 2.
Let be a univariate Student’s t-distribution with degree of freedom ν. Then its normalized escort distribution is also a univariate Student’s t-distribution with degree of freedom . In fact, from Equation (4), a direct calculation shows that
This implies that the degree of freedom . Therefore, we obtain a sequence of escort distributions from a given Student’s t-distribution :
This sequence is called a τ-sequence, and the procedure to obtain from a given t-distribution to another t-distribution through an escort distribution is called the τ-transformation [16].
For a given , we can define the escort of an escort distribution
We call the second escort distribution of . The coefficient q before comes from considerations of U-information geometry [17]. We will discuss in the latter part of Section 5.
Similarly, we can define the n-th escort distribution from the sequence of escort distributions:
Let be a function on Ω. The q-expectation and the normalized q-expectation with respect to are defined by
respectively. We denote by the expectation with respect to the second escort distribution , that is,
Since a differential of a power function is also a power function, we can give a characterization for escort distributions.
Proposition 1.
Suppose that is a q-exponential family defined by (1). Then the n-th escort distribution is given by the n-th differential of q-exponential function. That is, by setting , we have the following formula:
Proof.
Since a q-exponential function is , its differential is given by
Therefore, we obtain .
By induction, the n-th differential of coincides with the n-th escort distribution , which is given by Equation (5). ☐
4. Statistical Manifolds and Their Generalized Conformal Structures
In this section, we us review the geometry of statistical manifolds. For more details about the geometry of statistical manifolds, see [18,19].
Let be a Riemannian manifold and ∇ be a torsion-free affine connection on S. We say that the triplet is a statistical manifold if is totally symmetric. In this case, we can define a totally symmetric -tensor field by
where and Z are arbitrary vector fields on S. The tensor field C is called a cubic form or an Amari–Chentsov tensor field.
The notion of statistical manifold was introduced by Lauritzen [20]. He called the triplet a statistical manifold. In this paper, the definition is followed to Kurose [18]. Though these two definitions are different, the other statistical manifold structure can be obtained from a given one, However, the motivation for the notion of conformal equivalence using is different from that one using , which we will discuss in the latter part of this section.
For a given statistical manifold , we can define another torsion-free affine connection on S by
The connection is called the dual connection of ∇ with respect to g. The triplet is also a statistical manifold, which is called the dual statistical manifold of . The cubic form is given by the difference of two affine connections and ∇:
We define generalized conformal structures for statistical manifolds followed to Kurose [18]. Two statistical manifolds and are said to be 1-conformally equivalent if there exists a function such that
where is the gradient vector field of with respect to g, that is, . We say that is 1-conformally flat if is locally 1-conformally equivalent to a flat statistical manifold.
Two statistical manifolds and are said to be -conformally equivalent if there exists a function such that
where . If two statistical manifolds and are 1-conformally equivalent, then their dual statistical manifolds and are -conformally equivalent.
Proposition 2.
If two statistical manifolds and are 1-conformally equivalent, then their cubic forms have the following relation:
Proof.
From Equations (7) and (8), we obtain
By taking an inner product with respect to g, we obtain the result. ☐
5. Statistical Manifold Structures on q-Exponential Families
In this section, we consider statistical manifold structures on a q-exponential family. It is known that a q-exponential family naturally has at least three kinds of statistical manifold structures (cf. [6,8]). We reformulate these structures from the viewpoint of the sequence of escort distributions. In this paper, we omit the details about information geometry. See [21,22] for further details.
Firstly, we review basic facts about q-exponential family. Let be a q-exponential family. The normalization on is convex, but may not be strictly convex. In fact, we obtain the following proposition.
Proposition 3.
Let be a q-exponential family. Then the normalization function is convex.
Proof.
Set and . Then we have
Since and , we have
For an arbitrary vector , since and , we have
This implies that the Hessian matrix is semi-positive definite. ☐
We assume that ψ is strictly convex in this paper. Under this assumption, we can induce many geometric structures for a q-exponential family.
Since ψ is strictly convex, we can define a Riemannian metric and a cubic form by
We call and a q-Fisher metric and a q-cubic form, respectively [23,24]. Since is a Hessian of a function ψ, is a Hessian metric, and ψ is the potential of with respect to the natural coordinate [25].
For a fixed real number α, set
where is the Levi-Civita connection with respect to . Since is a Hessian metric, from standard arguments in Hessian geometry [25], and are flat affine connections and mutually dual with respect to . Therefore, the triplets and are flat statistical manifolds, and the quadruplet is a dually flat space.
Under q-expectations, we have the following proposition (cf. [10]).
Proposition 4.
For a q-exponential family, (1) Set . Then is a -affine coordinate system such that
(2) Set , then is the potential of with respect to .
Next, let us consider the standard Fisher metric and the standard cubic form. Suppose that is a statistical model. Set , for simplicity. We define the (standard) Fisher metric on by
and the (standard) cubic form or the Amari–Chentsov vector field by
From similar arguments of (11), we can define an α-connection on , and we can obtain a statistical manifold structure . In this case, is called an invariant statistical manifold [21,22].
A Fisher metric and a cubic form have the following representation using a sequence of escort distributions,
Theorem 1.
Let be a q-exponential family. For , suppose that and are the second and the third escort distribution of , respectively. Then the Fisher metric and the cubic form are given as follows:
Proof.
Differentiating the q-logarithm, we have
Therefore, we obtain
By a similar argument, we obtain the representation for . ☐
We define an α-divergence with and a q-relative entropy (or a normalized Tsallis relative entropy) by
respectively. It is known that the α-divergence induces a statistical manifold structure , where is the Fisher metric on and is the α-connection with , and the q-relative entropy induces .
Theorem 2
(cf. [10,24]). For a q-exponential family , two statistical manifolds and are 1-conformally equivalent. In particular, an invariant statistical manifold is 1-conformally flat. Riemannian metrics and cubic forms have the following relations:
Proof.
The results were essentially obtained in [10]. However, we give a simpler proof for Equations (16) and (17). The key idea is a sequence of escort distributions and the escort representations of and in Theorem 1.
From Equation (10), we directly obtain the conformal equivalence relation (16) using the escort representation of in (12).
By differentiating (9) and taking an integration, we obtain
Since , we have
From the escort representation of in (13), and Proposition 1, we obtain Equation (17) since and . ☐
We remark that the cubic form of is not but .
The difference of a α-divergence and a q-relative entropy is only the normalization . This implies that a normalization for probability density imposes a generalized conformal change for a statistical model.
In the next part of this section, let us consider another statistical manifold on (cf. [6,17,26]).
Recall that a Fisher metric has the following representation:
In information geometry, is called an e-representation (exponential representation) of , and is called a m-representation (mixture representation). Intuitively, and are regarded as tangent vectors on a statistical model. Hence a Fisher metric is regarded as a -inner product of e- and m-representations.
Let us generalize e- and m-representations for a q-exponential family. For , we call a q-score function. Then we define a Riemannian metric by
By differentiating the above equation, we can define mutually dual torsion-free affine connections and :
where and are the Christoffel symbols of and of the first kind, respectively. It is known that is a Hessian metric, and the quadruplet is a dually flat space. In addition, a natural parameter is a -affine coordinate sysem. Therefore, the cubic form for is
We remark that the statistical manifold structure is induced from a β-divergence [17,26] (or a density power divergence [27]):
Theorem 3.
For the statistical manifold structure , the escort representations of the Riemannian metric and the cubic form are given as follows:
Proof.
For the Riemannian metric , since , we immediately obtain Equation (21) from the definition of .
Let us consider the expression for cubic form (22). The q-score function is unbiased under the q-expectation. In fact,
From Equation (19), we obtain
☐
We remark that Naudts [5] gave another generalization of Fisher metric , which is defined by
The metric is conformally equivalent to with conformal factor . That is, . (See also [6]). Naudts gave a further generalization of Fisher metric and he showed a Cramér–Rao type bound theorem [5].
6. Concluding Remarks
In this paper, we introduced a sequence of escort distributions. Then we gave representations of Riemannian metrics and cubic forms from a viewpoint of the sequence of escort distributions.
In particular, we can define the following -tensor fields on a q-exponential family. For , set .
- (1)
- From the standard expectation, we obtainThe tensor G is a covariance matrix. However, G may not be important in anomalous statistics.
- (2)
- From the q-expectation, we obtainThe Riemannian metric is a Hessian metric, and it is induced from the β-divergence (20).
- (3)
- From the expectation with respect to the second escort distribution, we obtain
The Riemannian metric is a Fisher metric. Hence is invariant to the choice of reference measure on Ω, but it is not a Hessian metric. In addition, is induced from the α-divergence (14). The conformal Riemannian metric is a q-Fisher metric. It is a Hessian metric, and it is induced from a normalized Tsallis relative entropy (15).
We may define a Riemannian metric and a cubic form from higher order escort expectations:
Then we obtain a sequence of statistical manifold structures.
However, the geometric meaning of this sequence is not clear at this moment. Elucidating geometric properties of this sequence is a future problem.
Acknowledgments
This research was partially supported by JSPS (Japan Society for the Promotion of Science), KAKENHI (Grants-in-Aid for Scientific Research) Grant Numbers JP26108003 and JP15K04842.
Conflicts of Interest
The author declares no conflict of interest.
References
- Tsallis, C. Introduction to Nonextensive Statistical Mechanics: Approaching a Complex World; Springer: New York, NY, USA, 2009. [Google Scholar]
- Naudts, J. Generalised Thermostatistics; Springer: London, UK, 2011. [Google Scholar]
- Kaniadakis, G. Theoretical foundations and mathematical formalism of the power-law tailed statistical distributions. Entropy 2013, 15, 3983–4010. [Google Scholar] [CrossRef]
- Beck, C.; Schlögl, F. Thermodynamics of Chaotic Systems: An Introduction; Cambridge University Press: Cambridge, UK, 1993. [Google Scholar]
- Naudts, J. Estimators, escort probabilities, and ϕ-exponential families in statistical physics. J. Inequal. Pure Appl. Math. 2004, 5, 102. [Google Scholar]
- Matsuzoe, H.; Henmi, M. Hessian structures and divergence functions on deformed exponential families. In Geometric Theory of Information, Signals and Communication Technology; Nielsen, F., Ed.; Springer: Basel, Switzerland, 2014; pp. 57–80. [Google Scholar]
- Sakamoto, M.; Matsuzoe, H. A generalization of independence and multivariate Student’s t-distributions. In Geometric Science of Information, Proceedings of Second International Conference on Geometric Science of Information (GSI 2015), Palaiseau, France, 28–30 October 2015; Volume 9389, pp. 740–749.
- Matsuzoe, H.; Wada, T. Deformed algebras and generalizations of independence on deformed exponential families. Entropy 2015, 17, 5729–5751. [Google Scholar] [CrossRef]
- Wada, T.; Matsuzoe, H.; Scarfone, A.M. Dualistic hessian structures among the thermodynamic potentials in the κ-Thermostatistics. Entropy 2015, 17, 7213–7229. [Google Scholar] [CrossRef]
- Matsuzoe, H. Statistical manifolds and geometry of estimating functions. In Prospects of Differential Geometry and Its Related Fields, Proceedings of the 3rd International Colloquium on Differential Geomentry and Its Related Fields, Veliko Tarnovo, Bulgaria, 3–7 September 2012; Adachi, T., Hashimoto, H., Hristov, M.J., Eds.; World Scientific: Hackensack, NJ, USA, 2013; pp. 187–202. [Google Scholar]
- Matsuzoe, H.; Henmi, M. Hessian structures on deformed exponential families. In Geometric Science of Information, Proceedings of First International Conference on Geometric Science of Information (GSI 2013), Paris, France, 28–30 August 2013; Springer: Berlin/Heidelberg, Germany, 2015; Volume 8085, pp. 275–282. [Google Scholar]
- Eguchi, S.; Komori, O. Path connectedness on a space of probability density functions. In Geometric Science of Information, Proceedings of Second International Conference on Geometric Science of Information (GSI 2015), Palaiseau, France, 28–30 October 2015; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9389, pp. 615–624. [Google Scholar]
- Scarfone, A.M.; Matsuzoe, H.; Wada, T. Consistency of the structure of Legendre transform in thermodynamics with the Kolmogorov-Nagumo average. Phys. Lett. A 2016, 380, 3022–3028. [Google Scholar] [CrossRef]
- Tsallis, C. What are the numbers that experiments provide? Quim. Nova 1994, 17, 468–471. [Google Scholar]
- Zhang, J. On monotone embedding in information geometry. Entropy 2015, 17, 4485–4499. [Google Scholar] [CrossRef]
- Tanaka, M. Meaning of an escort distribution and τ-transformation. J. Phys. Conf. Ser. 2010, 201, 012007. [Google Scholar] [CrossRef]
- Murata, N.; Takenouchi, T.; Kanamori, T.; Eguchi, S. Information geometry of U-boost and Bregman divergence. Neural Comput. 2004, 16, 1437–1481. [Google Scholar] [CrossRef] [PubMed]
- Kurose, T. On the divergences of 1-conformally flat statistical manifolds. Tôhoku Math. J. 1994, 46, 427–433. [Google Scholar] [CrossRef]
- Matsuzoe, H. Statistical manifolds and affine differential geometry. Adv. Stud. Pure Math. 2010, 57, 303–321. [Google Scholar]
- Lauritzen, S.L. Statistical manifolds. In Differential Geometry in Statistical Inferences; Gupta, S.S., Ed.; IMS Lecture Notes Monograph Series 10; Institute of Mathematical Statistics: Hayward, CA, USA, 1987; pp. 96–163. [Google Scholar]
- Amari, S.; Nagaoka, H. Method of Information Geometry; Translations of Mathematical Monographs; American Mathematical Society: Providence, RI, USA; Oxford University Press: Oxford, UK, 2000. [Google Scholar]
- Amari, S. Information Geometry and Its Applications; Springer: Tokyo, Japan, 2016. [Google Scholar]
- Amari, S.; Ohara, A.; Matsuzoe, H. Geometry of deformed exponential families: Invariant, dually-flat and conformal geometry. Physica A 2012, 391, 4308–4319. [Google Scholar] [CrossRef]
- Matsuzoe, H.; Ohara, A. Geometry for q-exponential families. In Recent Progress in Differential Geometry and Its Related Fields, Proceedings of the 2nd International Colloquium on Differential Geomentry and Its Related Fields, Veliko Tarnovo, Bulgaria, 6–10 September 2010; Adachi, T., Hashimoto, H., Hristov, M.J., Eds.; World Scientific: Hackensack, NJ, USA, 2011; pp. 55–71. [Google Scholar]
- Shima, H. The Geometry of Hessian Structures; World Scientific: Hackensack, NJ, USA, 2007. [Google Scholar]
- Ohara, A.; Wada, T. Information geometry of q-Gaussian densities and behaviors of solutions to related diffusion equations. J. Phys. A 2010, 43, 035002. [Google Scholar] [CrossRef]
- Basu, A.; Harris, I.R.; Hjort, N.L.; Jones, M.C. Robust and efficient estimation by minimising a density power divergence. Biometrika 1998, 85, 549–559. [Google Scholar] [CrossRef]
© 2016 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).