1. Introduction
Long tailed probability distributions and their related probability distributions are important objects in anomalous statistical physics (cf. [
1,
2,
3]). For such long tailed probability distributions, the standard expectation does not exist in general. Therefore, the notion of escort distribution has been introduced [
4]. Since an escort distribution gives a suitable weight for tail probability, the escort expectation which is the expectation with respect to an escort distribution is more useful than the standard one.
In anomalous statistics, a deformed exponential function and a deformed logarithm function play essential roles. In fact, a deformed exponential family is an important statistical model in anomalous statistics. Such a statistical model is described by such a deformed exponential function. In particular, the set of all 
q-normal distributions (or Student’s 
t-distributions, equivalently) is a 
q-exponential family, which is described by a 
q-deformed exponential function [
5] (see also [
6,
7]).
On the other hand, a generalized score function is defined from a deformed logarithm function. In the previous works, the author showed that a deformed score function is unbiased with respect to the escort expectation [
8,
9]. This implies that a deformed score function is regarded as an estimating function on a deformed exponential family. In addition, in information geometry, it is known that a deformed exponential family has a statistical manifold structure. Then a deformed score function is regarded as a tangent vector on this statistical manifold [
6,
10]. Therefore, properties of escort expectations are closely related to geometric structures on a deformed exponential family.
In this paper, we introduce a sequence of escort distributions, then we consider a sequential structure of escort expectations. It is known that a deformed exponential family naturally has at least three kind of different statistical manifold structures [
6,
11]. Then we show that such statistical manifold structures can be obtained from a sequential structure of escort expectations. In particular, we show that a Fisher metric on a 
q-exponential family can be obtained from the deformed expectations with respect to the second escort distribution, and a cubic form (or an Amari–Chentsov tensor field, equivalently) is obtained from the deformed expectations with respect to the third escort distribution.
This paper is written based on the proceeding paper [
7]. However, this paper focuses on deformed expectations of a 
q-exponential family, whereas the previous paper focused on deformed independences. We remark that several authors have been studying deformed expectations recently. See [
12,
13], for example.
  2. Deformed Exponential Families
In this paper, we assume that all objects are smooth for simplicity. Let us review preliminary facts about deformed exponential functions and deformed exponential families. For more details, see [
2,
6], for example. Historically, Tsallis [
14] introduced the notion of 
q-exponential function and Naudts [
5] introduced the notion of 
q-exponential family together with a further generalization. Such a historical note is provided in [
2].
Let 
 be the set of all positive real numbers, 
. Let 
χ be a strictly increasing function from 
 to 
. We define a 
χ-logarithm function or a 
deformed logarithm function by
      
The inverse of 
 is called a 
χ-exponential function or a 
deformed exponential function, which is defined by
      
      where the function 
 is given by 
.
From now on, we suppose that 
χ is a power function, that is, 
. Then the deformed logarithm and the deformed exponential are defined by
      
We say that 
 is a 
q-logarithm function and 
 is a 
q-exponential function. In this case, the function 
 is given by
      
By taking a limit , these functions coincide with the standard logarithm  and the standard exponential , respectively.
A statistical model 
 is called a 
q-exponential family if
      
      where 
 are functions on a sample space Ω, 
 is a parameter, and 
 is the normalization with respect to the parameter 
θ. Under suitable conditions, 
 is regarded as a manifold with a local coordinate system 
. In this case, we call 
 a 
natural coordinate system.
In this paper, we focus on the 
q-exponential case. However, many results for the 
q-exponential family can be generalized for the 
χ-exponential family (cf. [
6,
8]). We remark that a 
q-exponential family and a 
χ-exponential family have further generalizations. See [
15], for example.
Example 1 (Student’s 
t-distribution (cf. [
2,
6,
7]))
. Fix a number , and set . 
We define a d-dimensional Student’s t-distribution with degree of freedom ν or a q-Gaussian distribution bywhere  is a random vector on ,  is a location vector on  and Σ 
is a scale matrix on . 
For simplicity, we assume that Σ 
is invertible. Otherwise, we should choose a suitable basis  on  such that . 
Then, the set of all Student’s t-distributions is a q-exponential family. In fact, setting parameters bywe haveSince  and , the set of all Student’s t-distributions is a -
dimensional q-exponential family. The normalization  is given by A univariate Student’s 
t-distribution is a well-known probability distribution in elementary statistics. We denote it by
      
      where 
 is a location parameter, 
 is a scale parameter, and 
 is the normalization defined by
      
	  In this case, the degree of freedom is 
. Conversely, the parameter 
q is give by
      
  3. Escort Distributions and Generalizations of Expectations
In anomalous statistics, a generalized expectation, called an escort expectation, is often discussed since the standard expectation does not exist in general (cf. [
2,
5,
6]). In this section, we recall generalizations of expectations and introduce a sequential structure of escort distributions.
Let 
 be a 
q-exponential family. For a given 
 we define the 
q-escort distribution  of 
 and the 
normalized q-escort distribution  by
      
      respectively. For a 
q-exponential family 
, the set of normalized escort distributions 
 is a 
-exponential family with 
.
Example 2. Let  be a univariate Student’s t-distribution with degree of freedom ν. Then its normalized escort distribution is also a univariate Student’s t-distribution with degree of freedom . 
In fact, from Equation (4), a direct calculation shows thatThis implies that the degree of freedom . 
Therefore, we obtain a sequence of escort distributions from a given Student’s t-distribution :This sequence is called a τ-sequence, and the procedure to obtain from a given t-distribution to another t-distribution through an escort distribution is called the τ-transformation [16].  For a given 
, we can define the escort of an escort distribution
      
We call 
 the 
second escort distribution of 
. The coefficient 
q before 
 comes from considerations of 
U-information geometry [
17]. We will discuss in the latter part of 
Section 5.
Similarly, we can define the 
n-th escort distribution  from the sequence of escort distributions:
	  
Let 
 be a function on Ω. The 
q-expectation  and the 
normalized q-expectation  with respect to 
 are defined by
      
      respectively. We denote by 
 the expectation with respect to the second escort distribution 
, that is,
      
Since a differential of a power function is also a power function, we can give a characterization for escort distributions.
Proposition 1. Suppose that  is a q-exponential family defined by (1). Then the n-th escort distribution is given by the n-th differential of q-exponential function. That is, by setting , we have the following formula:  Proof.  Since a 
q-exponential function is 
, its differential is given by
		
		Therefore, we obtain 
.
 By induction, the n-th differential of  coincides with the n-th escort distribution , which is given by Equation (5). ☐
  4. Statistical Manifolds and Their Generalized Conformal Structures
In this section, we us review the geometry of statistical manifolds. For more details about the geometry of statistical manifolds, see [
18,
19].
Let 
 be a Riemannian manifold and ∇ be a torsion-free affine connection on 
S. We say that the triplet 
 is a 
statistical manifold if 
 is totally symmetric. In this case, we can define a totally symmetric 
-tensor field by
      
      where 
 and 
Z are arbitrary vector fields on 
S. The tensor field 
C is called a 
cubic form or an 
Amari–Chentsov tensor field.
The notion of statistical manifold was introduced by Lauritzen [
20]. He called the triplet 
 a statistical manifold. In this paper, the definition is followed to Kurose [
18]. Though these two definitions are different, the other statistical manifold structure can be obtained from a given one, However, the motivation for the notion of conformal equivalence using 
 is different from that one using 
, which we will discuss in the latter part of this section.
For a given statistical manifold 
, we can define another torsion-free affine connection 
 on 
S by
      
The connection 
 is called the 
dual connection of ∇ with respect to 
g. The triplet 
 is also a statistical manifold, which is called the 
dual statistical manifold of 
. The cubic form is given by the difference of two affine connections 
 and ∇:
	  
We define generalized conformal structures for statistical manifolds followed to Kurose [
18]. Two statistical manifolds 
 and 
 are said to be 
1-conformally equivalent if there exists a function 
 such that
      
      where 
 is the gradient vector field of 
 with respect to 
g, that is, 
. We say that 
 is 
1-conformally flat if 
 is locally 1-conformally equivalent to a flat statistical manifold.
Two statistical manifolds 
 and 
 are said to be 
-conformally equivalent if there exists a function 
 such that
      
      where 
. If two statistical manifolds 
 and 
 are 1-conformally equivalent, then their dual statistical manifolds 
 and 
 are 
-conformally equivalent.
Proposition 2. If two statistical manifolds  and  are 1-conformally equivalent, then their cubic forms have the following relation:  Proof.  From Equations (7) and (8), we obtain
		
 By taking an inner product with respect to g, we obtain the result.   ☐
  5. Statistical Manifold Structures on q-Exponential Families
In this section, we consider statistical manifold structures on a 
q-exponential family. It is known that a 
q-exponential family naturally has at least three kinds of statistical manifold structures (cf. [
6,
8]). We reformulate these structures from the viewpoint of the sequence of escort distributions. In this paper, we omit the details about information geometry. See [
21,
22] for further details.
Firstly, we review basic facts about q-exponential family. Let  be a q-exponential family. The normalization  on  is convex, but may not be strictly convex. In fact, we obtain the following proposition.
Proposition 3. Let  be a q-exponential family. Then the normalization function  is convex.
 Proof.  Set 
 and 
. Then we have
		
 Since 
 and 
, we have
      
For an arbitrary vector 
, since 
 and 
, we have
      
This implies that the Hessian matrix  is semi-positive definite.   ☐
We assume that ψ is strictly convex in this paper. Under this assumption, we can induce many geometric structures for a q-exponential family.
Since 
ψ is strictly convex, we can define a Riemannian metric and a cubic form by
      
We call 
 and 
 a 
q-Fisher metric and a 
q-cubic form, respectively [
23,
24]. Since 
 is a Hessian of a function 
ψ, 
 is a 
Hessian metric, and 
ψ is the 
potential of 
 with respect to the natural coordinate 
 [
25].
For a fixed real number 
α, set
      
      where 
 is the Levi-Civita connection with respect to 
. Since 
 is a Hessian metric, from standard arguments in Hessian geometry [
25], 
 and 
 are flat affine connections and mutually dual with respect to 
. Therefore, the triplets 
 and 
 are flat statistical manifolds, and the quadruplet 
 is a dually flat space.
Under 
q-expectations, we have the following proposition (cf. [
10]).
Proposition 4. For  a q-exponential family, (1) 
Set . 
Then  is a -
affine coordinate system such that
		  (2) 
Set , then  is the potential of  with respect to .
  Next, let us consider the standard Fisher metric and the standard cubic form. Suppose that 
 is a statistical model. Set 
, for simplicity. We define the 
(standard) Fisher metric  on 
 by
      
      and the 
(standard) cubic form or the 
Amari–Chentsov vector field  by
      
From similar arguments of (11), we can define an 
α-connection  on 
, and we can obtain a statistical manifold structure 
. In this case, 
 is called an 
invariant statistical manifold [
21,
22].
A Fisher metric and a cubic form have the following representation using a sequence of escort distributions,
Theorem 1. Let  be a q-exponential family. For , suppose that  and  are the second and the third escort distribution of , respectively. Then the Fisher metric  and the cubic form  are given as follows:  Proof.  Differentiating the 
q-logarithm, we have
		
 By a similar argument, we obtain the representation for .   ☐
We define an 
α-divergence  with 
 and a 
q-relative entropy (or a 
normalized Tsallis relative entropy) 
 by
      
      respectively. It is known that the 
α-divergence 
 induces a statistical manifold structure 
, where 
 is the Fisher metric on 
 and 
 is the 
α-connection with 
, and the 
q-relative entropy 
 induces 
.
Theorem 2 (cf. [
10,
24])
. For a q-exponential family , two statistical manifolds  and  are 1-conformally equivalent. In particular, an invariant statistical manifold  is 1-conformally flat. Riemannian metrics and cubic forms have the following relations: Proof.  The results were essentially obtained in [
10]. However, we give a simpler proof for Equations (16) and (17). The key idea is a sequence of escort distributions and the escort representations of 
 and 
 in Theorem 1.
 From Equation (10), we directly obtain the conformal equivalence relation (16) using the escort representation of  in (12).
By differentiating (9) and taking an integration, we obtain
      
Since 
, we have
      
From the escort representation of  in (13), and Proposition 1, we obtain Equation (17) since  and .   ☐
We remark that the cubic form of  is not  but .
The difference of a α-divergence and a q-relative entropy is only the normalization . This implies that a normalization for probability density imposes a generalized conformal change for a statistical model.
In the next part of this section, let us consider another statistical manifold on 
 (cf. [
6,
17,
26]).
Recall that a Fisher metric 
 has the following representation:
	  
In information geometry,  is called an e-representation (exponential representation) of , and  is called a m-representation (mixture representation). Intuitively,  and  are regarded as tangent vectors on a statistical model. Hence a Fisher metric is regarded as a -inner product of e- and m-representations.
Let us generalize 
e- and 
m-representations for a 
q-exponential family. For 
, we call 
 a 
q-score function. Then we define a Riemannian metric 
 by
      
By differentiating the above equation, we can define mutually dual torsion-free affine connections 
 and 
:
	  
	  where 
 and 
 are the Christoffel symbols of 
 and 
 of the first kind, respectively. It is known that 
 is a Hessian metric, and the quadruplet 
 is a dually flat space. In addition, a natural parameter 
 is a 
-affine coordinate sysem. Therefore, the cubic form for 
 is
      
We remark that the statistical manifold structure 
 is induced from a 
β-divergence [
17,
26] (or a 
density power divergence [
27]):
	  
Theorem 3. For the statistical manifold structure , the escort representations of the Riemannian metric  and the cubic form  are given as follows:  Proof.  For the Riemannian metric , since , we immediately obtain Equation (21) from the definition of .
 Let us consider the expression for cubic form (22). The 
q-score function 
 is unbiased under the 
q-expectation. In fact,
      
From Equation (19), we obtain
      
	   ☐
We remark that Naudts [
5] gave another generalization of Fisher metric 
, which is defined by
      
The metric 
 is conformally equivalent to 
 with conformal factor 
. That is, 
. (See also [
6]). Naudts gave a further generalization of Fisher metric and he showed a Cramér–Rao type bound theorem [
5].