1. Introduction
In information geometry, signal processing is one of the most important applications. In particular, an information geometric approach to various linear time series models has been also well-known [
1–
7]. The geometric description of the linear systems is not confined to the pursuit of mathematical beauty. Komaki’s work [
8] is in the line of developing practical tools for Bayesian inference. Using the Kullback–Leibler divergence as a risk function for estimation, he found that superharmonic shrinkage priors outperform the Jeffreys prior in the viewpoint of information theory. Better prediction in the Bayesian framework is attainable by the Komaki priors.
However, a difficult part of Komaki’s idea in practice is verifying whether or not a prior function is superharmonic. In particular, when high-dimensional statistical manifolds are considered, it is technically tricky to test the superharmonicity of prior functions because Laplace–Beltrami operators on the manifolds are non-trivial. Although some superharmonic priors for the autoregressive (AR) models were found not only in the two-dimensional cases [
5,
7] but also in arbitrary dimensions [
6], there is no clue about the Bayesian shrinkage priors of more complicated models such as the autoregressive moving average (ARMA) models, the fractionally integrated ARMA (ARFIMA) models, and any arbitrary signal filters. Additionally, generic algorithms for systematically obtaining the information shrinkage priors are not known yet.
The connection between Kähler manifolds and information geometry has been reported [
4,
9–
12] and the mathematical correspondence between a Kähler manifold and the information geometry of a linear system is recently revealed. It is found that the information geometry of a signal filter with a finite complex cepstrum norm is a Kähler manifold [
7]. In particular, the Hermitian condition on the Kählerian information manifolds is clearly seen under conditions on the transfer function of the linear system. Moreover, many practical aspects of introducing Kähler manifolds to information geometry for signal processing were also reported in the same literature [
7]. One of the benefits in the Kählerian information geometry is that the simpler form of the Laplace–Beltrami operator on the Kähler manifold is beneficial to finding the Komaki priors.
In this paper, we construct Komaki-style shrinkage priors for Kählerian signal filters. By introducing an algorithm which is based on the characteristics of Kähler manifolds, the Bayesian predictive priors outperforming the Jeffreys prior can be obtained in a more efficient and more robust way. Several prior ansätze are also suggested. Among the ansätze, the geometric shrinkage priors related to Kähler potential are intrinsic priors on the information manifold because the geometry is given by the Kähler potential. We also provide the geometric priors for the ARFIMA models where the Komaki priors have not been reported. The structure of this paper is as follows. In next section, theoretical backgrounds of Kählerian information geometry and superharmonic priors are introduced. In Section 3, an algorithm and ansätze for the geometric shrinkage priors are suggested. The implication of the algorithm to the ARFIMA models is given in Section 4. We conclude the paper in the last section.
3. Geometric Shrinkage Priors
As shown in the previous section, Kähler manifolds in information geometry are useful in order to obtain the superharmonic priors. In this section, we introduce an algorithm to find the geometric shrinkage priors by using the properties of Kähler geometry. Moreover, several ansätze for the priors are suggested.
For further discussions, let us set
where u* is a constant in
and its complex conjugate
. The following lemma is worthwhile when the algorithm for the prior functions is constructed.
Lemma 1. On a Kähler manifold, a function is superharmonic if is in the form of such that κ is subharmonic (or harmonic) and Ψ′(τ) > 0, Ψ″(τ) ≤ 0 (or Ψ′(τ) > 0, Ψ″(τ) < 0).
Proof. The Laplace–Beltrami operator on
ψ is given by
where the derivatives on Ψ are taken with respect to
τ. It is obvious that if
κ is subharmonic (or harmonic) and if Ψ′(
τ) > 0, Ψ″(
τ)
≤ 0 (or Ψ′(
τ) > 0, Ψ″(
τ) < 0), then the right-hand side is negative,
i.e.,
ψ is a superharmonic function.□
According to Lemma 1, superharmonic functions are easily obtained from subharmonic or harmonic functions by simply plugging the (sub-)harmonic functions as κ into Lemma 1.
By considering that a prior function should be positive, it is able to utilize Lemma 1 for obtaining the superharmonic prior functions. Let us confine the function ψ in Lemma 1 to be positive.
Theorem 1. On a Kähler manifold, a positive function ψ = Ψ(u*−κ) is a superharmonic prior function if κ is subharmonic (or harmonic) and Ψ′(τ) > 0, Ψ″(τ) ≤ 0 (or Ψ′(τ) > 0, Ψ″(τ) < 0).
Proof. Since this is a special case of Lemma 1, the proof is obvious. □
Although any (sub-)harmonic function κ can be used for constructing superharmonic priors, restriction on κ makes finding the ansätze of the geometric priors easier. From now on, upper-bounded functions are only our concerns. Additionally, we assume that κ and u* are real. With these assumptions, it is possible to set u* as a constant greater than the upper bound of κ in order for τ to be positive.
Ansätze for Ψ can be found in the following example.
Example 1. Given subharmonic (or harmonic) κ and positive τ, i.e.,
upper-bounded κ, the following functions are candidates for Ψ
where 0
< a ≤ 1
(or 0
< a < 1
). Proof. We only cover a subharmonic case for
κ here and it is also straightforward for the harmonic case. First of all, Ψ
1 and Ψ
2 are all positive. For Ψ
1, it is easy to verify the followings:
for 0
< a ≤ 1. The similar calculation is repeated for Ψ
2:
for 0
< a ≤ 1.
Both functions Ψ1 and Ψ2 satisfy the conditions for Ψ in Lemma 1.□
It is also possible to find ansätze for upper-bounded subharmonic κ. The following functions are candidates for upper-bounded and subharmonic κ.
Example 2. For positive real numbers ar and bi, the following subharmonic functions are candidates for κ in the cases that those are upper-bounded: Proof. Let us assume that the ansätze are upper-bounded in given domains. For
κ1, it is easy to show that the Kähler potential
K is subharmonic:
The proof for subharmonicity of
κ2 is as follows:
The subharmonicity of
κ3 is tested by
If the upper-boundedness is satisfied, the above subharmonic functions are ansätze for κ. □
Superharmonic prior functions on the Kähler manifolds are efficiently constructed from the following algorithm which exploits Theorem 1 and the ansätze for Ψ and
κ. When we find positive and superharmonic functions, it is automatically the Komaki-style prior functions as usual. If positive, upper-bounded, and (sub-)harmonic functions are found, those functions are plugged into Theorem 1 in order to obtain superharmonic prior functions. Multiplying the Jeffreys prior by the superharmonic prior functions, we finally acquire the geometric shrinkage priors. Additionally, since the ansätze are already given, there is no extra cost to find the Komaki prior functions except for verifying whether or not the information geometry is a Kähler manifold. Comparing with the literature on the Komaki priors of the time series models [
5–
7], obtaining the geometric priors on the Kähler manifolds becomes more efficient and more robust.
4. Example: ARFIMA Models
The ARFIMA model is the generalization of the ARMA model with a fractional differencing parameter in order to model the long memory process. The transfer function of the ARFIMA(
p, d, q) model with parameters
is given by
where
d is the differencing parameter and
μi, λi, σ are a pole, a root, and a gain in the ARMA model, respectively. It is noteworthy that the transfer function of the ARFIMA model is decomposed into the ARMA model part and the fractionally integration part. Additionally, every poles and roots of the linear system are located inside the unit disk,
i.e., |
λi| < 1 for i = 1, ⋯,
p and |
μi| < 1 for i = 1, ⋯,
q.
Similar to the ARMA case [
7], the full geometry of the ARFIMA model is a Kähler manifold and the submanifold of a constant gain
σ is also Kähler geometry. This submanifold also exhibits the explicit Hermitian condition on the metric tensor. It is easy to cross-check the Hermitian structure by fixing
h0 = 1 up to the gain of the signal filter. We will work on this submanifold.
Since the information geometry of the ARFIMA model is a Kähler manifold, the Kähler potential of the ARFIMA geometry is obtained from the square of the Hardy norm of the logarithmic transfer function (or the square of the complex cepstrum norm),
Equation (5), represented with
It is obvious that the Kähler potential for the ARFIMA model,
Equation (8), is reducible to the Kähler potential of the ARMA geometry by setting
d = 0. It is easy to verify that the Kähler potential of the ARFIMA geometry is upper-bounded by
By using
Equation (4), the metric tensor of the Kähler geometry is simply derived from the Kähler potential. The metric tensor of the Kähler-ARFIMA geometry is given by
and it is easy to show that the metric tensor contains the pure ARMA metric. The metric tensor is also in the similar form to the ARFIMA geometry in non-complexified coordinates [
3]. The metric tensor indicates that the ARMA geometry is embedded in the ARFIMA geometry and corresponds to the submanifold of the ARFIMA manifold. The ARMA part of the metric tensor is the same metric with the Kähler-ARMA geometry in Choi and Mullhaupt [
7]. In addition to that, we can cross-check the fact that the ARMA geometry is also a Kähler manifold based on a property of a Kähler manifold that a submanifold of the Kähler geometry is Kähler.
Other geometric objects can be derived from the metric tensor. For example, the non-trivial components of the 0-connection are given by
Equation (6). It is noteworthy that any connection components with the
d-coordinate in the first two indices of the connection are trivially zero and the others might not be vanishing. Similar to the 0-connection, the Ricci tensor components along the fractionally integrated direction are also zero because there is no dependence on
d in the metric tensor. Considering the Schur complement, the non-vanishing Ricci tensor components are decomposed into the Ricci tensor from the pure ARMA part and the term from the mixing between the ARMA part and the fractionally integrated (FI) part:
where
i and
j are not along the
d-coordinate.
It is the time to be back to the geometric shrinkage priors. Since the Kähler potential of a given ARFIMA model is upper-bounded by a constant
, the intrinsic priors on the Kähler manifold can be found as it is proven in the previous section. By using the algorithm and the ansätze related to the Kähler potential, some geometric shrinkage prior functions for the ARFIMA model are constructed as
where 0
< a ≤ 1. It is also noteworthy that when
d = 0 in the Kähler potential, superharmonic priors of the ARMA (or AR/MA) models are obtained and finding the priors becomes much simpler than the literature on the Komaki priors of the AR models [
5–
7]. Similarly,
κ2 and
κ3 are also utilized for the superharmonic prior function ansätze in the ARFIMA models because the both functions are upper-bounded on the ARFIMA manifold. Moreover, if we set
d = 0 for
κ2 or
b0 = 0 for
κ3, the ansätze for the ARFIMA models are reducible to the Komaki priors of the ARMA models.