Numerical Inverse Transformation Methods for Z-Transform

Illés Horváth; András Mészáros; Miklós Telek

doi:10.3390/math8040556

Abstract

Numerical inverse Z-transformation (NIZT) methods have been efficiently used in engineering practice for a long time. In this paper, we compare the abilities of the most widely used NIZT methods, and propose a new variant of a classic NIZT method based on contour integral approximation, which is efficient when the point of interest (at which the value of the function is needed) is smaller than the order of the NIZT method. We also introduce a vastly different NIZT method based on concentrated matrix geometric (CMG) distributions that tackles the limitations of many of the classic methods when the point of interest is larger than the order of the NIZT method.

Keywords:

inverse Z-transformation; numerical analysis; contour integral; finite order approximation; matrix geometric distribution

1. Introduction

Z-transformation is one of the most frequently used non-linear transformations for describing discrete time series [1]. In several engineering and applied mathematical fields, non-linear transformations provide a compact description of the system behavior. Many practically important operations, e.g., discrete convolution, are much easier to handle in Z-transform domain, which makes the use of Z-transform domain system description widespread in many fields. While a lot of important characteristics (e.g., poles, frequency response, initial/final values, etc.) can be obtained directly from Z-transform domain description, there are plenty of practically important cases where explicit time domain values are needed. In this case, inverse Z-transformation (IZT) must be performed.

In some special cases, symbolic IZT is feasible, but in a wide range of practically important cases numerical IZT (NIZT) is required to compute the time domain values based on the Z-transform domain description. This paper focuses on the problem of NIZT.

Since NIZT has been widely applied in practice for a long time, its literature is rather rich. In this paper, we consider only the most efficient methods of the recent literature [2]. These methods are introduced in the subsequent discussions and are used also for numerical comparison.

Section 2 is devoted to the general setup of Z-transformation. Section 3 gives a brief review of general methods in the literature. From among the few NIZT methods available in the literature, one stands out, described by several authors independently [2,3,4]. We will refer to this method as the CIR method, and it is described in Section 4 along with an interpretation based on Contour Integral approximation starting from the positive Real axis. Section 5 provides a variant of this method with the starting point of the Contour Integral Shifted, referred to as the CIS method. Section 6 provides an entirely new method, referred to as the CMG method, based on Concentrated Matrix Geometric (CMG) functions with the necessary background. Section 7 contains numerical comparison of the various NIZT methods. Section 8 concludes the paper.

2. The Z-Transform and Its Inverse

Let

g (t)

be a series with countably many elements. The (unilateral) Z-transform of

g (t)

is defined as

Z {g (t)} = g^{★} (z) = \sum_{ℓ = 0}^{\infty} z^{- ℓ} g (ℓ),

(1)

where

g^{★} (z)

is the Z-transform of

g (t)

.

The inverse transform problem is to find the T-th element of

g (t)

, i.e.,

g (T)

based on

g^{★} (z)

. This paper focuses on the case when symbolic inverse Z-transformation is available and NIZT is required to find an approximate value of

g (T)

based on

g^{★} (z)

.

The region of convergence (ROC) for the summation in (1) is always of the form

{z : | z | > c}

, possibly including some points of the boundary

| z | = c

. (1) is absolute convergent on

{z : | z | > c}

, divergent on

{z : | z | < c}

, and on the boundary can be either absolute convergent, convergent or divergent. The region may also be empty (

c = \infty

), or the entire complex plane (

c = 0

). The real constant c is known as the limit of absolute convergence. In case c is finite, the function

Z {g (t)}

may extend analytically to a domain larger than the region of convergence (e.g., for

g (t) = q^{t}

, (1) is convergent for

| z | > q

, but

Z {g (t)} = \frac{z}{z - q}

extends analytically to

C \ {q}

).

The inverse at point T can be obtained from the contour integral [1]

\begin{matrix} g (T) = \frac{1}{2 π ℑ} \oint_{C} g^{★} (z) z^{T - 1} d z, \end{matrix}

(2)

where C is a counterclockwise closed path encircling the origin and entirely in the region of convergence, and i is the complex unit.

The Z-transform has a natural scaling property:

\begin{matrix} g^{★} (a z) = \sum_{ℓ = 0}^{\infty} z^{- ℓ} a^{- ℓ} g (ℓ) = Z {a^{- t} g (t)}, \end{matrix}

(3)

which allows any NIZT method to be applied to

g^{★} (a z)

instead of

g^{★} (z)

to approximate

a^{- T} g (T)

(and then

g (T)

by multiplying by

a^{T}

). In some cases, the scaling with a has special analytical interpretation; we provide such interpretation for contour integral methods. In the numerical section we study the effect of a for all the methods included in the present paper.

In this work, we assume that

g (t)

is real for any non-negative integer t. This assumption is well-suited for most practical applications and implies

g^{★} (\bar{z}) = {\bar{g}}^{★} (z)

(where

\bar{z}

denotes the complex conjugate of z). Consequently, it is sufficient to evaluate

g^{★} (z)

only at one of each complex conjugate pair.

3. General Inverse Z-Transformation Methods

In this section, we provide a short overview of various NIZT methods proposed in the literature that do not use the contour integral in (2) for the inverse Z-transformation. The contour integral-based NIZT approach of [2,3,4] will be discussed separately in Section 4.

3.1. Inverse Transformation Based on Moments

In [5], Tagliani proposes a method for NIZT that requires the availability of a finite number of the transform’s derivatives. The derivatives are used to calculate the moments of

g (t)

, and based on these moments an approximating “analytical form” is calculated. Consequently, while the author presents it as an inverse Z-transform method, it is more of a moment fitting algorithm. The benefit of Tagliani’s approach is that it can be used as long as the moments of

g (t)

are obtainable even if only a functional equation is available for the Z-transform. However, the performance of the method is significantly worse than numerical integration-based methods both in terms of precision and in computation time.

3.2. Inverse Transformation Based on Orthogonal Decomposition

Rajković et al. propose an inversion method in [6] that approximates

g (t)

with

g_{N, q} (t) = \sum_{n = 0}^{N} c_{n} φ_{q}^{(n)} (t),

where

φ_{q}^{(n)} (t) = \sum_{k = 1}^{n} b_{q, n, k} q^{k t} .

Parameter q can be chosen freely, but should be slightly smaller than 1 (in the numerical experiments of [6]

q = 3 / 4

and

q = 5 / 6

are used). The

b_{q, n, k}

coefficients are calculated as

b_{q, n, k} = {(- 1)}^{n - k} q^{- (\binom{n}{2}) + (\binom{k + 1}{2}) - k n} [\binom{n}{k}] [\binom{n + k - 1}{k - 1}],

where

[n] = \frac{1 - q^{n}}{1 - q}, [n]! = [n] [n - 1] \dots [1], [\binom{n}{k}] = \frac{[n]!}{[n - k]! [k]!} .

The

c_{n}

coefficients are calculated as

c_{n} = \frac{q^{n (1 - n)}}{1 - q^{2 n}} \sum_{j = 1}^{k} b_{q, k, j} g^{★} (1 / q^{j}) .

The above

c_{n}

parameters will minimize

| | g (t) - g_{N} (t) {| |}^{2}

for the given

φ_{q}^{(1)} (t), \dots, φ_{q}^{(N)} (t)

set of series. The idea behind the method is that the

b_{q, n, k}

parameters are chosen such that

φ_{q}^{(1)} (t), \dots, φ_{q}^{(N)} (t)

is a set of orthogonal series, i.e.,

\sum_{t = 0}^{\infty} φ_{q}^{(j)} (t) φ_{q}^{(k)} (t) = δ_{j, k} r (q) \forall j, k \in N

, where

δ_{j, k}

is the Kronecker-delta (

δ_{j, k} = 1

if

j = k

and

δ_{j, k} = 0

otherwise) and

r (q)

is a function of q. It is the consequence of this orthogonality that the

c_{n}

values calculated as above are optimal (in 2-norm).

3.3. Inverse Transformation Based on a Linear System of Equations

The

g (t)

series can also be approximated based on a simple truncation of the Z-transform presented by Merrikh–Bayat in [4]. From the definition of the Z-transform we have

\begin{matrix} g^{★} (z) = \sum_{t = 0}^{\infty} g (t) z^{- t} \approx \sum_{t = 0}^{N} g (t) z^{- t} \end{matrix}

(4)

By using this approximation in the

z_{1}, z_{2}, \dots, z_{m}

set of points chosen from the ROC we obtain

(\begin{matrix} g^{★} (z_{1}) \\ g^{★} (z_{1}) \\ \dots \\ g^{★} (z_{m}) \end{matrix}) \approx (\begin{matrix} 1 & z_{1}^{- 1} & z_{1}^{- 2} & \dots & z_{1}^{- N} \\ 1 & z_{2}^{- 1} & z_{2}^{- 2} & \dots & z_{2}^{- N} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ 1 & z_{m}^{- 1} & z_{m}^{- 2} & \dots & z_{m}^{- N} \end{matrix}) \times (\begin{matrix} g (0) \\ g (1) \\ \dots \\ g (N) \end{matrix}) .

If

m = N

, the above equation has a unique solution (assuming that the matrix is non-singular, which is true if

z_{j} \neq z_{k}, \forall j \neq k

). The issue with the

m = N

case is that it leads to an ill-conditioned problem, thus Merrikh–Bayat proposes to choose

m \approx 1.1 N

then minimize

| | A {\underset{̲}{g}}_{t} - {\underset{̲}{g}}_{z} {| |}_{2}

, where

{\underset{̲}{g}}_{t} = [g (0), \dots, g (N)]

,

{\underset{̲}{g}}_{z} = [g^{★} (z_{1}), \dots, g^{★} (z_{m})]

, and

A = (\begin{matrix} 1 & z_{1}^{- 1} & z_{1}^{- 2} & \dots & z_{1}^{- N} \\ 1 & z_{2}^{- 1} & z_{2}^{- 2} & \dots & z_{2}^{- N} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ 1 & z_{m}^{- 1} & z_{m}^{- 2} & \dots & z_{m}^{- N} \end{matrix}) .

Minimization of

| | A {\underset{̲}{g}}_{t} - {\underset{̲}{g}}_{z} {| |}_{2}

is a least squares problem, which is done using QR decomposition or singular value decomposition in [4]. These have a computational cost of

O (m^{2} N)

(

= O (N^{3})

when

m \approx 1.1 N

) for an

m \times N

matrix, thus this method is computationally expensive compared to numerical integration-based methods.

The error in (4) is small when

g (t)

is rapidly decaying and N is large enough so that

g (t), 0 \leq t < N

captures most of the significant values in the sequence. Accordingly, numerical experiments show that the method gives best results for rapidly decaying

g (t)

sequences when N is sufficiently large; however, increasing N further does not seem to improve the error. Overall, the non-vanishing error renders the applicability of this method limited. This is addressed further in Section 7.

4. Contour Integral-Based Inverse Z-Transformation Methods

Equation (2) can be rewritten as

\begin{matrix} g (T) = \frac{1}{2 π} \int_{0}^{2 π} g^{★} (a e^{ℑ ω}) {(a e^{ℑ ω})}^{T} d ω, \end{matrix}

(5)

where ℑ is the complex unit. Equation (6) corresponds to the case when C is the circle of radius a in (2). a is actually equivalent to the scaling parameter in (3), as shown in Remark 4.

Contour integral-based methods, in general, approximate integral (5) with the finite sum

\begin{matrix} g (T) \approx g_{N} (T) = \frac{1}{2 π} \sum_{k = 1}^{N} (ω_{k} - ω_{k - 1}) g^{★} (a e^{ℑ ω_{k}}) {(a e^{ℑ ω_{k}})}^{T}, \end{matrix}

(6)

where

ω_{k}, k = 0, 1, 2, \dots, N

define a properly chosen partition of

[0, 2 π]

and N is the order of the approximation.

This method is described in [3,4,7] in a slightly different manner independently from each other. In theory any

ω_{k}

partition could be chosen, but the above papers use

\begin{matrix} ω_{k} = \frac{2 k π}{N}, k = 1, 2, \dots, N \end{matrix}

(7)

exclusively. Since the

ω_{k}

’s are equidistant with

ω_{k} - ω_{k - 1} = \frac{2 π}{N}

, (6) can be further simplified as

\begin{matrix} g_{N} (T) = \frac{1}{N} \sum_{k = 1}^{N} g^{★} (a e^{ℑ ω_{k}}) {(a e^{ℑ ω_{k}})}^{T} = \frac{1}{N} \sum_{k = 1}^{N} g^{★} (a β_{k}) {(a β_{k})}^{T}, \end{matrix}

(8)

where

\begin{matrix} β_{k} = e^{ℑ ω_{k}} = exp (\frac{2 k π ℑ}{N}), k = 1, 2, \dots, N . \end{matrix}

(9)

The

β_{k}

parameters are referred to as nodes in the rest of the paper. Choosing the nodes according to (9) has several consequences:

Since the $β_{k}$ s consist of complex conjugate pairs (along with real numbers 1 (and also $(- 1)$ for even N)), approximation (8) is guaranteed to be real.
In case one of the $a β_{k}$ values coincides with a pole of $g^{★}$ , (8) cannot be evaluated (even if they are close, there is numerical instability). A typical example is when $g^{★}$ has a pole at 1 and $a = 1$ .
Selecting a to be larger than the limit of absolute convergence c guarantees that (8) avoids all poles of $g^{★}$ . That said, selecting a too large may also cause issues: depending on the function g (and $g^{★}$ ) evaluating $g^{★} (a β_{k})$ with sufficient precision might be difficult; also, the large factor $a^{T}$ may cause numerical instability if there are cancellations in the sum in (8). The choice of a that provides the most accurate estimate for $g (T)$ is highly dependent on g (and $g^{★}$ ) and can be difficult to determine in general.
We have

$\begin{matrix} g_{N} (T) = \frac{1}{N} \sum_{k = 1}^{N} g^{★} (a β_{k}) {(a β_{k})}^{T} = a^{T} \cdot \frac{1}{N} \sum_{k = 1}^{N} g^{★} (a β_{k}) β_{k}^{T}, \end{matrix}$

showing that in accordance with (3), (8) is indeed the same numerical method applied to $g^{★} (a z)$ instead of $g^{★} (z)$ .
Due to complex conjugate pairs, the actual number of evaluations of $g^{★}$ to compute (8) is $⌊ (N + 1) / 2 ⌋$ . However, we stick with N in the notation as N is in general a better indicator for the properties of the approximation.

4.1. FFT-Based Implementation

Calculating

g_{N} (T)

for

T = 0, 1, \dots, N - 1

with a naive approach has

O (N^{2})

computational cost. This can be reduced, however, by realizing that (8) has the form of an inverse discrete Fourier transform (IDFT). The IDFT of the sequence

{X_{0}, X_{1}, \dots, X_{N - 1}}

is

{x_{0}, \dots, x_{N - 1}}

with

x_{T} = \frac{1}{N} \sum_{k = 0}^{N - 1} X_{k} exp (\frac{2 k T π ℑ}{N}),

thus

g_{N} (0),

g_{N} (1),

\dots,

g_{N} (N - 1)

is the IDFT of

g^{★} (a β_{1}),

g^{★} (a β_{2}),

\dots,

g^{★} (a β_{N})

.

The IDFT of the

g_{N} (0),

g_{N} (1),

\dots,

g_{N} (N - 1)

sequence can be calculated using fast Fourier transform (FFT) algorithms, which have

O (N log (N))

computational cost. The first FFT algorithm was presented by Cooley and Tukey in [8]. Its most commonly used radix-2 version requires that

N = 2^{k}, k \in N

. Efficient implementations of the radix-2 Cooley-Tukey algorithm are available in most major programming languages (see, e.g., [9]). There are multiple other FFT methods (e.g., Rader’s method, the prime-factor algorithm [10], Bluestein’s algorithm [11], etc.). Some of these can calculate the IDFT in

O (N log (N))

time even for prime N; however, in general, choosing

N = 2^{k}

is the most efficient.

4.2. Interpretation Using Approximate Dirac function

This section provides a different interpretation of the approximating function

g_{N} (T)

. Substituting (1) into (8) gives

\begin{matrix} g_{N} (T) & = \frac{1}{N} \sum_{k = 1}^{N} g^{★} (a β_{k}) {(a β_{k})}^{T} = \frac{1}{N} \sum_{k = 1}^{N} \sum_{ℓ = 0}^{\infty} {(a β_{k})}^{- ℓ} g (ℓ) {(a β_{k})}^{T} \\ = \sum_{ℓ = 0}^{\infty} g (ℓ) \frac{1}{N} \sum_{k = 1}^{N} {(a β_{k})}^{T - ℓ} = \sum_{ℓ = 0}^{\infty} g (ℓ) f_{N, a} (T - ℓ), \end{matrix}

(10)

where

\begin{matrix} f_{N, a} (ℓ) = \frac{1}{N} \sum_{k = 1}^{N} {(a β_{k})}^{ℓ} = \frac{1}{N} \sum_{k = 1}^{N} a^{ℓ} exp (\frac{2 k ℓ π ℑ}{N}) . \end{matrix}

(11)

(10) tells us that

g_{N} (t)

is a convolution of the original

g (ℓ)

with the function

f_{N, a} (ℓ)

;

g_{N}

will approximate g well if

f_{N, a} (ℓ)

is close to the function

δ_{0}

(which is defined to be 1 at 0 and 0 at every other integer).

At integer points

f_{N, a} (ℓ)

simplifies to

\begin{matrix} f_{N, a} (ℓ) = \{\begin{matrix} a^{ℓ}, & for ℓ = K \cdot N, K \in Z, \\ 0, & otherwise, \end{matrix} \end{matrix}

(12)

thus

f_{N, a} (0) = 1

, and the closest non-zero values are at

\pm N

. As

N \to \infty

,

f_{N, a} (ℓ)

converges to

δ_{0}

at integer points.

(10) and (12) also ensure that

g_{N} (T)

is N-periodic except for the exponential factor introduced by a, i.e.,

\begin{matrix} g_{N} (N + T) = a^{- N} g_{N} (T) . \end{matrix}

(13)

This periodic behavior indicates that for non-periodic

g (t)

functions the

g_{N} (T)

approximation might be poor.

The error of the approximation is

\begin{matrix} g_{N} (T) - g (T) & = \sum_{ℓ = 0}^{\infty} f_{N, a} (T - ℓ) g (ℓ) - g (T) \\ = \sum_{ℓ = 0}^{T - 1} f_{N, a} (T - ℓ) g (ℓ) + \sum_{ℓ = T + 1}^{\infty} f_{N, a} (T - ℓ) g (ℓ) \\ = \sum_{ℓ = 1}^{T} f_{N, a} (ℓ) g (T - ℓ) + \sum_{ℓ = 1}^{\infty} f_{N, a} (- ℓ) g (T + ℓ) . \end{matrix}

(14)

Remarks about (14):

As long as $N > T$ , the first sum vanishes, since $f_{N, a} (T) = 0$ for $T = 1, 2, \dots, N - 1$ .
Based on (12), (14) can be written as

$\begin{matrix} g_{N} (T) - g (T) = \sum_{k \geq - ⌊ T / N ⌋, k \neq 0} a^{- k N} g (T + k N) = \\ \sum_{k = 1}^{⌊ T / N ⌋} a^{k N} g (T - k N) + \sum_{k = 1}^{\infty} a^{- k N} g (T + k N) . \end{matrix}$

(15)

(15) can be intuitively understood as cutting the sequence $g (T)$ into sections of length N, shifting them over the same interval and summing them, see Figure 1. The first section ( $K = 0$ ) corresponds to values of the original function over the interval $0 \leq T < N$ , while the rest corresponds to the error. One consequence of (15) is that for non-decaying or slowly decaying $z (T)$ sequences, selecting $a > 1$ is necessary to ensure fast decay of the error.

Figure 1. Sections shifted and summed according to (15) when $T < N$ .
If $a > 1$ , then the first sum in (15) is magnified and the second sum is diminished. This is particularly useful when $N > T$ , since in that case the first sum vanishes and the second sum can be diminished arbitrarily (at the cost of loss of precision, more on this later).
If $a < 1$ , the first sum is diminished, and the second sum is magnified.
If $N \leq T$ , the first sum in (15) does not vanish, and is magnified when choosing $a < 1$ . However, choosing $a > 1$ magnifies the second sum in (15) instead; altogether, this results in a non-vanishing error regardless of the choice of a. Consequently, the classic method (or any contour integral-based method) has significant approximation error when $N \leq T$ , as shown by the numerical results in Section 7.
When $g (ℓ) > 0$ , according to (12) the second sum has positive terms only, so there are no cancellations. On the other hand, the approximation preserves nonnegativity, which might be a relevant property in certain applications.
If $g (ℓ)$ has positive and negative values alternating, then there can be cancellations according to (15) which reduce the corresponding error.

5. Shifting the Nodes of the CIR Method

In this section, we present a variant of the CIR method, referred to as CIS method, where the nodes are shifted by a half period compared to (7), i.e., we propose to use

\begin{matrix} β_{k}^{(2)} = exp (\frac{(2 k - 1) π ℑ}{N}), k = 1, 2, \dots, N \end{matrix}

(16)

in (6) instead of (7). Figure 2 and Figure 3 display the positioning of the

β_{k}

nodes for the CIR and the

β_{k}^{(2)}

nodes for the CIS methods for even and odd values of N.

Figure 2. Nodes of the CIR and CIS NIZT methods for

N = 6

in the complex plane.

Figure 3. Nodes of the CIR and CIS NIZT methods for

N = 7

in the complex plane.

Theorem 1.

Applying the

β_{k}^{(2)}

nodes partition in the contour integral-based NIZT according to (8), results in the NIZT procedure

\begin{matrix} g_{N}^{(2)} (T) = \frac{1}{N} \sum_{k = 1}^{N} g^{★} (a β_{k}^{(2)}) {(a β_{k}^{(2)})}^{T}, \end{matrix}

(17)

whose error is

\begin{matrix} g_{N}^{(2)} (T) - g (T) = \sum_{ℓ = 1}^{T} f_{N, a}^{(2)} (ℓ) g (T - ℓ) + \sum_{ℓ = 1}^{\infty} f_{N, a}^{(2)} (- ℓ) g (T + ℓ), \end{matrix}

(18)

where

\begin{matrix} f_{N, a}^{(2)} (ℓ) = \{\begin{matrix} {(- 1)}^{K} a^{ℓ}, & f o r ℓ = K \cdot N, K \in Z, \\ 0, & f o r ℓ \in N \ {K \cdot N : K \in Z} . \end{matrix} \end{matrix}

(19)

As

N \to \infty

,

f_{N, a}^{(2)} (ℓ)

converges to

δ_{0}

at integer points.

Proof.

Applying the

β_{k}^{(2)}

values in (8) gives (17).

Similar to (10), we have

\begin{matrix} g_{N}^{(2)} (T) & = \frac{1}{N} \sum_{k = 1}^{N} g^{★} (a β_{k}^{(2)}) {(a β_{k}^{(2)})}^{T} = \frac{1}{N} \sum_{k = 1}^{N} \sum_{ℓ = 0}^{\infty} {(a β_{k}^{(2)})}^{- ℓ} g (ℓ) {(a β_{k}^{(2)})}^{T} \\ = \sum_{ℓ = 0}^{\infty} g (ℓ) \frac{1}{N} \sum_{k = 1}^{N} {(a β_{k}^{(2)})}^{T - ℓ} = \sum_{ℓ = 0}^{\infty} g (ℓ) f_{N, a}^{(2)} (T - ℓ), \end{matrix}

(20)

where

\begin{matrix} f_{N, a}^{(2)} (ℓ) = \frac{1}{N} \sum_{k = 1}^{N} {(a β_{k}^{(2)})}^{ℓ}, \end{matrix}

(21)

which satisfies (19) in integer points. Subtracting

g (T)

from both sides of (20) gives (18).□

Remarks about (18):

$f_{N, a} (0) = 1$ , and the closest non-zero values are at $\pm N$ , which are negative.
As long as $N > T$ , the first sum vanishes, since $f_{N, a}^{(2)} (T) = 0$ for $T = 1, 2, \dots, N - 1$ .
According to (19), (18) can be rewritten as

$\begin{matrix} g_{N}^{(2)} (T) - g (T) = \sum_{k \geq - ⌊ T / N ⌋, k \neq 0} {(- a)}^{- k N} g (T + k N) = \\ = \sum_{k = 1}^{⌊ T / N ⌋} {(- a)}^{k N} g (T - k N) + \sum_{k = 1}^{\infty} {(- a)}^{- k N} g (T + k N) . \end{matrix}$

(22)
If $a > 1$ , then the first sum is magnified, and the second sum is diminished. This is particularly useful when $N > T$ , since in that case the first sum vanishes, and the second sum can be diminished arbitrarily.
If $a < 1$ , the first sum is diminished, and the second sum is magnified.
When $g (T) > 0$ , (22) has alternating terms, so there are cancellations in the second sum, reducing the corresponding error. This is particularly useful when $g^{★}$ has a pole at c, since the $β_{k}^{(2)}$ values do not include a after the node shift, so (17) can still be evaluated with $a = c$ , and the error from the second sum will be smaller due to cancellations. On the other hand, unlike for CIR, the approximation does not necessarily preserve nonnegativity.
If $g (T)$ is alternating, then there are no cancellations in (22).
Due to complex conjugate pairs, the actual number of evaluations of $g^{★}$ necessary to compute (17) is $⌊ N / 2 ⌋$ . However, just like for CIR, we stick with N as the notation.

6. Concentrated Matrix Geometric Distribution-Based Inverse Transformation

As we have seen in Remark 5 of (14), contour integral-based approximation methods are generally ill-suited to approximate

g (T)

when the node number is smaller than T, i.e.,

N \leq T

. If increasing N further is not feasible (e.g., because the computational cost of evaluating

g^{★} (z)

N times gets too large, or due to the precision loss of the procedure with the given floating point number representation), then a different NIZT procedure is needed.

In this section, we propose an approach for NIZT based on concentrated matrix geometric distributions, which we thus call CMG method and is the application of the CME (concentrated matrix exponential) method [12] for discrete time. The CME method is a numerical inverse Laplace transformation (NILT) procedure that uses the Abate–Whitt framework [13]. In the following, we first introduce the Abate–Whitt framework, then we present the CME method, finally we show how it can be applied to discrete time to obtain the proposed CMG method.

6.1. Abate–Whitt Framework for Numerical Inverse Laplace Transformation

The Laplace-transform of a function

h (t)

is defined as

h^{*} (s) = \int_{t = 0}^{\infty} e^{- s t} h (t) d t .

The inverse transform problem is to find an approximate value of function h at point T (i.e.,

h (T)

) based on

h^{*} (s)

. The Abate–Whitt framework uses the following form for this approximation:

h (T) \approx h_{n} (T) : = \sum_{k = 1}^{n} \frac{η_{k}}{T} h^{*} (\frac{β_{k}}{T}), T > 0 .

This approximation has a simple interpretation based on the reformulation

h_{n} (T) = \frac{1}{T} \sum_{k = 1}^{n} η_{k} h^{*} (\frac{β_{k}}{T}) = \int_{0}^{\infty} h (t) \cdot \frac{1}{T} f_{n} (\frac{t}{T}) d t,

(23)

where

f_{n} (t) = \sum_{k = 1}^{n} η_{k} e^{- β_{k} t} .

(24)

If

f_{n} (t)

was the Dirac impulse function at point T, then the Laplace inversion would be perfect, but depending on the weights

η_{k}

and nodes

β_{k}

, the function

f_{n} (t)

only approximates the Dirac impulse function with a given accuracy. There are multiple different types of

f_{n} (t)

functions that can be used for the approximation [12].

6.2. CME Method

In the CME method the probability density function (pdf) of a matrix exponential distribution is chosen as the

f_{n} (t)

function. The class of matrix exponential (ME) distributions of order N contains positive random variables with pdf of the form

\begin{matrix} f_{X} (t) = - \underset{̲}{α} A e^{A t} \underset{̲}{1}, t \geq 0, \end{matrix}

(25)

where

\underset{̲}{α}

is a real row vector of length N, A is a real matrix of size

N \times N

, and

\underset{̲}{1}

is a column vector of ones of size N [14]. To ensure

\int_{0}^{\infty} f_{X} (t) d t = 1

,

\underset{̲}{α}

and A are such that

\underset{̲}{α} \underset{̲}{1} = 1

and the eigenvalues of A have negative real part.

Nonnegativity of

f_{X} (t)

does not follow from (25), but some

(\underset{̲}{α}, A

) pairs result in

f_{X} (t)

functions that are non-negative for

t \geq 0

[15]. When A is diagonalizable with spectral decomposition

A = \sum_{k = 1}^{N} u_{k} λ_{k} v_{k},

where

λ_{k}

are the eigenvalues,

u_{k}

are the right eigenvectors and

v_{k}

are the left eigenvectors of A for

k = 1, \dots N

, then

f_{X}

can be written as

f_{X} (t) = \sum_{k = 1}^{N} \underset{c_{k}}{\underset{⏟}{- \underset{̲}{α} A \underset{̲}{u_{k}} \underset{̲}{v_{k}} \underset{̲}{1}}} e^{λ_{k} t} = \sum_{k = 1}^{N} c_{k} e^{λ_{k} t} = \sum_{k = 1}^{n} η_{k} e^{- β_{k} t},

(26)

with

η_{k} = c_{k}

and

β_{k} = - λ_{k}

. Comparing (26) and (24) shows that ME distributions with diagonalizable matrix A can be used in the place of

f_{n} (t)

to obtain an ILT method of the Abate–Whitt framework.

As mentioned before, the primary task when using the Abate–Whitt framework is to approximate the Dirac impulse with the

f_{n} (t)

function as closely as possible. The squared coefficient of variation (SCV) measures how concentrated a non-negative normalized function on

R^{+}

is, and it is a good indicator of the quality of the approximation. The SCV of f can be calculated as

\begin{matrix} SCV (f) : = \frac{\int_{t = 0}^{\infty} t^{2} f (t) d t}{{(\int_{t = 0}^{\infty} t f (t) d t)}^{2}} - 1 . \end{matrix}

Function f with

SCV (f) = 0

is the Dirac function and the smaller

SCV (f)

is, the better f approximates the Dirac function. The parameters of CME distributions with low SCV have been calculated for up to order 1000 [15] and can be accessed at [16].

The CME method has several advantages compared to other NILT methods [12]. It is more stable numerically, provides smooth, over- and under-shooting free approximation even for discontinuous functions and, contrary to other methods of the family, its precision gradually improves when increasing its order (N). The application of the CME method for NIZT is discussed in the next section.

6.3. CMG Method

A discrete counterpart of the CME method is formulated in the following theorem.

Theorem 2.

For a discrete function

g : Z \to R

, defining the continuous function

\hat{g} (t) = \{\begin{matrix} 0, & f o r t < 1 / 2, \\ g (ℓ), & f o r ℓ - 1 / 2 \leq t < ℓ + 1 / 2, \end{matrix}

and applying the CME inverse Laplace transformation method at point T with weights

η_{k}

and nodes

β_{k}

, results in the NIZT procedure

\begin{matrix} g_{N} (T) = \{\begin{matrix} g^{★} (0), & f o r T = 0, \\ \sum_{k = 1}^{N} {\bar{η}}_{k} (g^{★} (e^{\frac{β_{k}}{T}}) - g^{★} (0)), & f o r T > 0, \end{matrix} \end{matrix}

(27)

where

{\bar{η}}_{k} = \frac{η_{k}}{β_{k}} (e^{\frac{β_{k}}{2 T}} - e^{- \frac{β_{k}}{2 T}}) .

Proof.

The inverse Laplace transformation of

\hat{g} (t)

according to (23) can be written as

\begin{matrix} {\hat{g}}_{N} (T) & = \frac{1}{T} \sum_{k = 1}^{N} η_{k} \int_{t = 0}^{\infty} e^{- \frac{β_{k}}{T} t} \hat{g} (t) d t = \sum_{ℓ = 1}^{\infty} \int_{t = ℓ - 1 / 2}^{ℓ + 1 / 2} g (ℓ) e^{- \frac{β_{k}}{T} t} d t \\ = \frac{1}{T} \sum_{k = 1}^{N} η_{k} \sum_{ℓ = 1}^{\infty} - \frac{T}{β_{k}} (e^{- \frac{β_{k}}{T} (ℓ + 1 / 2)} - e^{- \frac{β_{k}}{T} (ℓ - 1 / 2)}) g (ℓ) \\ = \frac{1}{T} \sum_{k = 1}^{N} η_{k} \sum_{ℓ = 1}^{\infty} \frac{T}{β_{k}} (e^{\frac{β_{k}}{2 T}} - e^{- \frac{β_{k}}{2 T}}) g (ℓ) e^{- \frac{β_{k}}{T} ℓ} \\ = \sum_{k = 1}^{N} \frac{η_{k}}{β_{k}} (e^{\frac{β_{k}}{2 T}} - e^{- \frac{β_{k}}{2 T}}) \sum_{ℓ = 1}^{\infty} {(e^{\frac{β_{k}}{T}})}^{- ℓ} g (ℓ) . \end{matrix}

Based on this last expression, we can express

{\hat{g}}_{N} (T)

from

g^{★} (z)

using its definition in (1) as

{\hat{g}}_{N} (T) = \sum_{k = 1}^{N} \frac{η_{k}}{β_{k}} (e^{\frac{β_{k}}{2 T}} - e^{- \frac{β_{k}}{2 T}}) \sum_{ℓ = 1}^{\infty} {(e^{\frac{β_{k}}{T}})}^{- ℓ} g (ℓ) = \sum_{k = 1}^{N} {\bar{η}}_{k} (g^{★} (e^{\frac{β_{k}}{T}}) - g (0))

(28)

From the definition of the Z-transform we have

g (0) = g^{★} (0)

, thus (28) takes the form of (27).□

The scaled version of (27) according to (3) is

\begin{matrix} g_{N} (T) = \{\begin{matrix} g^{★} (0), & f o r T = 0, \\ a^{T} \sum_{k = 1}^{N} {\bar{η}}_{k} (g^{★} (a e^{\frac{β_{k}}{T}}) - g^{★} (0)), & f o r T > 0 . \end{matrix} \end{matrix}

(29)

In general, the interpretation of the CMG method is similar to the one of the CME method for NILT in that the approximation is essentially based on a discrete approximation of the Dirac function. The main advantage of this method compared to many other NIZT methods discussed above is that the error of CMG method remains small also when

T \geq N

.

The parameter a affects the “shape” of the discrete approximation of the Dirac function, such that for

a > 1

, the contribution of the

g (t)

terms to the error of

g_{N} (T)

is magnified for

t < T

and diminished for

t > T

, while for

a < 1

, it is the other way around. The effect of a on the accuracy of

g_{N} (T)

is examined in Section 7.

7. Numerical Examples

To evaluate the numerical properties of the above listed NIZT methods we use a collection of test functions according to Table 1. This set of functions exhibits a wide range of behaviors. For each function, Table 1 lists the name, the t domain form,

g (t)

, the z domain form,

g^{★} (z)

, and the limit of absolute convergence, c.

Table 1. Set of test functions.

We check the behavior of the NIZT methods by evaluating the difference of the values computed from

g (T)

with the ones computed from

g^{★} (z)

(

g_{N} (T)

) with different choices of the order, the largest evaluated point and the scaling factor (

N, T_{max}

, and a). Apart from the error in the approximation, we also keep track of the precision loss (number of digits lost due to round-off error) during the calculation of

g_{N} (T)

. The computations were carried out using Wolfram Mathematica. The precision of the applied arithmetic was high enough (200 digits) to dominate the precision loss.

7.1. Numerical Properties When $T < N$

When we are interested in the cases with

T < N

, we use the following error measure based on infinity norm over an interval

[0, T_{max}]

:

\begin{matrix} ∥ g - g_{N} ∥ = ∥ g - g_{N} ∥_{\infty} = max_{0 \leq T \leq T_{max}} | g (T) - g_{N} (T) | \end{matrix}

(30)

T_{max}

is the largest point we are interested in, i.e., we assume

T \leq T_{max} < N

in this subsection.

7.1.1. General Comparison of the NIZT Methods

For this comparison, we included the NIZT methods from the previous sections which provide the best results. Ort1 and Ort2 refer to the method presented in Section 3.2 and [6] (using the suggested

q = 3 / 4

and

q = 5 / 6

parameters), based on orthogonal functions. MB refers to the method in Section 3.3, based on matrix pseudoinverse calculation.

Table 2 compares the error of all methods for the list of functions in Table 1 for

N = 64

and

T_{max} = 32

. In the table,“∼0” means practical zero, a value smaller than

10^{- 100}

, “p.inf.” stands for practical infinity, denoting errors larger than

10^{2}

, while “n/a” means not applicable due to a pole of

g^{★}

which is evaluated by the given NIZT method (e.g., the CIR method with

a = 1

fails for functions with a pole at 1). All calculations related to Table 2 were carried out using high precision (200 digits) floating point arithmetic.

Table 2. Errors for various test functions (

N = 64, T_{max} = 32, a = 1

).

According to Remark 4 at the end of Section 4 (and Remark 5 at the end of Section 5), as long as

T_{max} < N

, the accuracy of the contour integral-based methods CIR and CIS improves as a is increased, and this also helps avoiding the pole at 1 for the CIR method. Table 3 displays this effect by setting

a = 2

instead of

a = 1

.

Table 3. Errors for various test functions (

N = 64, T_{max} = 32, a = 2

).

Based on Table 2 and Table 3, we conclude that the Ort1 method gives the most precise results, the CIR, CIS give precise results when a is sufficiently large, the Ort2 and MB methods are unreliable, and CMG is relatively reliable for

a = 1

(although the error is not as small as for CIR, CIS and Ort1), but unreliable for

a = 2

. Altogether, one might have the impression that Ort1 is the best method; however, we have not yet examined other important questions like precision loss or running time. In the next subsection, precision loss is examined along with a more detailed analysis of the role of a.

Due to their unreliability (c.f. Table 2 and Table 3) methods Ort2 and MB are excluded from further investigations.

7.1.2. Precision Loss and the Effect of a

For a more detailed view on the performance of the methods CIR, CIS, Ort1, and CMG, we investigate their accuracy as a function of parameter a.

Table 4 contains the error and precision loss (p.l., in digits, calculated using Wolfram Mathematica) for the Polynomial(

1 / t

) function with

N = 64, T_{max} = 32

, and a taking the values

1 / 2, 1, 11 / 10, 2, 4

.

Table 4. The error

∥ g - g_{N} ∥

and precision loss for Polynomial(

1 / t

) function,

N = 64, T_{max} = 32

.

For contour integral methods CIR and CIS, as long as

N > T

, setting a to a larger value will diminish the error in the second term of (14) and (18), while the first term cancels out entirely.

For

a = 1 / 2

, the approximations in Table 4 are poor. This is because for

a < 1

, the error in the tail is magnified. If

g (t)

is rapidly decaying, then the error introduced by

a = 0.5

is small, but

a = 1

or

a > 1

are still better choices.

For

a = 1

, the “n/a” values for the CIR method are due to the pole of the function at 1. This is where the CIS method has an advantage: the shifted nodes avoid the pole when

a = 1

, so it gives a meaningful result.

For

a > 1

, the error decreases rapidly for the contour integral methods CIR and CIS as the error in the tail is diminished, at the cost of increased precision loss. Interestingly, the precision loss is of similar order as the error. (This is not necessarily the case in general, but the precision loss does seem to increase rapidly with a in general.)

The error for the CIS method is slightly smaller than for the CIR due to cancellations (see Remark 6 after (18)), but the difference is practically negligible.

The Ort1 method, while gives the lowest error, suffers from huge precision losses. Notably, for any calculations with precision smaller than 142 digits, Ort1 would give meaningless results due to precision loss. The precision loss is inherent to the Ort1 method due to the highly fluctuating orthogonal functions involved. Overall, we recommend avoiding the use of the Ort1 method due to its unpredictable high precision loss, and instead we recommend using either CIR or CIS when

T < N

, with a set to as large as possible, depending on the precision loss tolerated.

Interestingly, with the Polynomial(

1 / t

) function the CMG method works best when setting

a = 1

. An intuitive explanation of this property is as follows. For the CMG method,

a > 1

enlarges the errors that are caused by the non-zero

g (t)

values for

t < T

and diminishes errors that are caused by the non-zero

g (t)

values for

t > T

, and

a < 1

has the opposite effect. Since Polynomial(

1 / t

) is a rather flat function the effect of enlarging the error is more dominant for both

a > 1

and

a < 1

. This property might slightly change for steeper functions, but, in general, we recommend using

a = 1

for the CMG method. We also note that the CMG method involves practically no loss of precision, so it can be used efficiently even with standard precision floating point arithmetic.

Table 5 investigates the error and the precision loss for various Z-transform

g^{★}

functions using the CIR, CIS and Ort1 methods with parameters

N = 64, T_{max} = 32, a = 2

. We note that the CMG method fails due to

a > 1

.

Table 5. Error and precision loss for various test functions (

N = 64, T_{max} = 32, a = 2

).

From Table 5, we conclude that the precision loss heavily depends on the

g^{★}

function. As a result, high precision calculations with precision loss check is recommended for NIZT with the CIR, CIS and Ort1 methods. However, with high precision calculations, the precision loss remains tolerable for CIR and CIS.

7.2. Numerical Properties of NIZT Methods When $T \geq N$

Finally, we examine the case when

T \geq N

. This case is relevant when sampling of the Z-transform

g^{★}

is costly, but we still need to approximate

g (T)

for large values of T.

7.2.1. Failure of Contour Integral Methods

To start off, we display the remarks at the end of Section 4.2 in practice. Table 6 shows the result of applying the CIR approximation method to the z-transform of the Poisson(1) distribution with order

N = 4

and various choices of a, compared with the actual values of the Poisson distribution.

Table 6. Order 4 CIR method approximation of the Poisson(1) distribution.

For

a = 1

, the approximation is relatively accurate up to

T = N - 1 = 3

, but there is a sharp change in the behavior at

T = N

: the approximation is periodic with a period of N, rendering the approximation values for

T \geq N

useless (see also Figure 1). For

a = 2

, the approximation is even more accurate up to

T = N - 1

, but the error is magnified for

T \geq N

(due to the scaling by

a^{N}

in accordance with (13), e.g.,

g_{N} (4) = 5.9014 = a^{4} \cdot g_{N} (0) = 16 \cdot 0.3688

in this case). For

a = 1 / 2

, the error for the

T \geq N

terms is diminished, but the approximation up to

T = N - 1

is much less accurate. Altogether, the CIR method cannot be used to obtain an approximation suitable for both

T < N

and

T \geq N

. The CIS method suffers from the same issues.

7.2.2. Comparing The Error of NIZT Methods When $T \geq N$

Since the error of the cases when

T < N

has been considered in Section 7.1, in this section we define the error as

\begin{matrix} ∥ g - g_{N} ∥ = ∥ g - g_{N} ∥_{\infty} = max_{N \leq T < T_{max}} | g (T) - g_{N} (T) | . \end{matrix}

(31)

Table 7 contains the error of each method for various functions. The parameters are

T_{max} = 32, N = 16

and a either 1 or

1.1

. In this table, “p.inf.” marks elements larger than

10^{3}

. We note that the MB method is not applicable with these parameters, since the pseudoinverse calculation is only applicable when

T_{max} < 1.1 N

.

Table 7. Errors for various test functions (

N = 16, T_{max} = 32

).

Based on Table 7, we conclude that as long as

T \geq N

, the error of the CIR and CIS methods ((14) and (18)) is large either in first or the second term depending on whether

a < 1

or

a > 1

, and the Ort1 and Ort2 methods are also unreliable. Only the CMG method gives meaningful results for the case when

T \geq N

. As for the value of a, we recommend using 1 as generally that seems to give a reliably low error.

Table 8 contains the errors with parameters

T_{max} = 512, N = 256

and

a = 1

. At this parameter setting, all other methods except CMG fail, while CMG gives a reasonably good approximation.

Table 8. Errors for various test functions (

N = 256, T_{max} = 512, a = 1

).

8. Conclusions

In this paper, we collected many of the NIZT methods from the literature and presented two new methods. One of them, the CIS method, is a variant of the contour integral-based CIR method and the other one, the CMG method, is inherited from numerical inverse Laplace transformation.

A wide numerical investigation on these NIZT methods indicated that different methods are accurate when the order of the method is higher than the required parameter (

N > T

) and when it is lower (

N \leq T

). In the first case, the CIR and the CIS methods are the most reliable (moderate error and tolerable precision loss) and they perform rather similarly. Their behavior differs only when one of the methods must evaluate the transform function near one of its poles. In the second case, when

N \leq T

, the CMG method outperforms all other methods in both accuracy and precision loss. The rest of the methods perform poorly, in general, except the Ort1 method, which gives more accurate results than the CIR and the CIS methods if appropriately high precision is used, but its numerical instability often results in larger precision loss than the applied high precision arithmetic.

While the optimal method to choose may depend on multiple factors (e.g., tolerated precision loss, computational time, tolerated error, etc.) as a rule of thumb for

N > T

we recommend to use the CIR and CIS methods, and for

N \leq T

the CMG method. The Mathematica implementation of the considered NIZT methods and some results of the numerical evaluation are available at http://webspn.hit.bme.hu/~telek/tools/nizt.zip.

Author Contributions

The contribution is equally shared by the authors. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially supported by the OTKA K-123914 and the TUDFO/51757/2019-ITM projects.

Conflicts of Interest

The authors declare no conflict of interest.

References

Oppenheim, A.V.; Schafer, R.W. Discrete-Time Signal Processing, 3rd ed.; Pearson: London, UK, 2009. [Google Scholar]
Abate, J.; Choudhury, G.L.; Whitt, W. An Introduction to Numerical Transform Inversion and Its Application to Probability Models. In Computational Probability; Grassmann, W.K., Ed.; Springer US: Boston, MA, USA, 2000; pp. 257–323. [Google Scholar] [CrossRef]
Mills, P.L. Numerical inversion of z-transforms with application to polymerization kinetics. Comput. Chem. 1987, 11, 137–151. [Google Scholar] [CrossRef]
Merrikh-Bayat, F. Two Methods for Numerical Inversion of the Z-Transform. arXiv 2014, arXiv:1409.1727. [Google Scholar]
Tagliani, A. Inverse Z transform and moment problem. Probab. Eng. Inf. Sci. 2000, 14, 393–404. [Google Scholar] [CrossRef]
Rajković, P.M.; Stanković, M.S.; Marinković, S.D. A method for numerical evaluating of inverse Z-transform. Facta Univ.-Ser. Mech. Autom. Control. Robot. 2004, 4, 133–139. [Google Scholar]
Abate, J.; Whitt, W. Numerical inversion of probability generating functions. Oper. Res. Lett. 1992, 12, 245–251. [Google Scholar] [CrossRef]
Cooley, J.W.; Tukey, J.W. An algorithm for the machine calculation of complex Fourier series. Math. Comput. 1965, 19, 297–301. [Google Scholar] [CrossRef]
Rosetta Code. Fast Fourier Transform. 2019. Available online: https://rosettacode.org/wiki/Fast_Fourier_transform (accessed on 1 August 2019).
Duhamel, P.; Vetterli, M. Fast Fourier transforms: A tutorial review and a state of the art. Signal Process. 1990, 19, 259–299. [Google Scholar] [CrossRef]
Bluestein, L. A linear filtering approach to the computation of discrete Fourier transform. IEEE Trans. Audio Electroacoust. 1970, 18, 451–455. [Google Scholar] [CrossRef]
Horváth, G.; Horváth, I.; Almousa, S.A.D.; Telek, M. Numerical Inverse Laplace Transformation by concentrated matrix exponential distributions. Perform. Eval. 2020, 137, 102067. [Google Scholar] [CrossRef]
Abate, J.; Whitt, W. A Unified Framework for Numerically Inverting Laplace Transforms. INFORMS J. Comput. 2006, 18, 408–421. [Google Scholar] [CrossRef]
Asmussen, S.; Bladt, M. Renewal Theory and Queueing Algorithms for Matrix-Exponential Distributions. In Lecture Notes in Pure and Applied Mathematics; Alfa, A.S., Chakravarthy, S.R., Eds.; Marcel Dekker: New York, NY, USA, 1997; pp. 313–341. [Google Scholar]
Horváth, G.; Horváth, I.; Telek, M. High order concentrated matrix-exponential distributions. Stoch. Model. 2019. [Google Scholar] [CrossRef]
inverselaplace.org. Available online: http://inverselaplace.org/ (accessed on 13 February 2020).

Figure 1. Sections shifted and summed according to (15) when

T < N

.

Figure 1. Sections shifted and summed according to (15) when

T < N

.

Figure 2. Nodes of the CIR and CIS NIZT methods for

N = 6

in the complex plane.

Figure 2. Nodes of the CIR and CIS NIZT methods for

N = 6

in the complex plane.

Figure 3. Nodes of the CIR and CIS NIZT methods for

N = 7

in the complex plane.

Figure 3. Nodes of the CIR and CIS NIZT methods for

N = 7

in the complex plane.

Table 1. Set of test functions.

Function	g(t)	$g^{★} (z)$	c
Dirac(10)	$δ^{10}$	z⁻¹⁰	0
Poisson(1)	$\frac{1}{t!} e^{- 1}$	$e^{(\frac{1}{z} - 1)}$	0
Heaviside step	1	$\frac{z}{z - 1}$	1
Geometric( $1 / 2$ )	${(1 / 2)}^{t}$	$\frac{z}{z - 1 / 2}$	1/2
Geometric( $- 1 / 2$ )	${(- 1 / 2)}^{t}$	$\frac{z}{z + 1 / 2}$	1/2
Triangle wave	$\frac{1 + {(- 1)}^{t}}{2}$	$\frac{z^{2}}{z^{2} - 1}$	1
Polynomial( $1 / t$ )	$\frac{1}{t} 1 (t \geq 1)$	$- \log (1 - z^{- 1})$	1
Polynomial(t)	t	$\frac{z}{{(- 1 + z)}^{2}}$	1
Uniform(5,10)	$1 (5 ≦ t ≦ 10)$	$\frac{z^{6} - 1}{z^{11} - z^{10}}$	0

Table 2. Errors for various test functions (

N = 64, T_{max} = 32, a = 1

).

Table 2. Errors for various test functions (

N = 64, T_{max} = 32, a = 1

).

Function	CIR	CIS	Ort1	Ort2	MB	CMG
Dirac(10)	∼0	∼0	∼0	p.inf.	∼0	$1.77 \times 10^{- 2}$
Poisson(1)	$2.90 \times 10^{- 90}$	$2.90 \times 10^{- 90}$	∼0	p.inf.	$7.58 \times 10^{- 30}$	$3.17 \times 10^{- 5}$
Heaviside step	n/a	0.500	$5.10 \times 10^{- 42}$	p.inf.	p.inf.	$\times 10^{1.82 - 5}$
Geometric( $1 / 2$ )	$5.42 \times 10^{- 20}$	$5.42 \times 10^{- 20}$	$7.72 \times 10^{- 66}$	p.inf.	p.inf.	$3.82 \times 10^{- 5}$
Geometric( $- 1 / 2$ )	$5.42 \times 10^{- 20}$	$5.42 \times 10^{- 20}$	$4.45 \times 10^{- 59}$	p.inf.	$1.04 \times 10^{- 7}$	$3.82 \times 10^{- 5}$
Triangle wave	n/a	0.500	$5.11 \times 10^{- 42}$	p.inf.	p.inf.	0.431
Polynomial( $1 / t$ )	n/a	$1.08 \times 10^{- 2}$	$5.69 \times 10^{- 44}$	p.inf.	p.inf.	$6.54 \times 10^{- 5}$
Polynomial(t)	n/a	31.5	$4.56 \times 10^{- 40}$	p.inf.	p.inf.	$3.35 \times 10^{- 3}$
Uniform(5,10)	∼0	∼0	∼0	p.inf.	0.181	$1.40 \times 10^{- 2}$

Table 3. Errors for various test functions (

N = 64, T_{max} = 32, a = 2

).

Table 3. Errors for various test functions (

N = 64, T_{max} = 32, a = 2

).

Function	CIR	CIS	Ort1	Ort2	MB	CMG
Dirac(10)	∼0	∼0	∼0	p.inf.	∼0	p.inf.
Poisson(1)	∼0	∼0	∼0	p.inf.	∼0	p.inf.
Heaviside step	$5.42 \times 10^{- 20}$	$5.42 \times 10^{- 20}$	$1.66 \times 10^{- 56}$	p.inf.	p.inf.	p.inf.
Geometric( $1 / 2$ )	$2.94 \times 10^{- 39}$	$2.94 \times 10^{- 39}$	∼0	p.inf.	p.inf.	p.inf.
Geometric( $- 1 / 2$ )	$2.94 \times 10^{- 39}$	$2.94 \times 10^{- 39}$	∼0	p.inf.	$7.17 \times 10^{- 8}$	p.inf.
Triangle wave	$5.42 \times 10^{- 20}$	$5.42 \times 10^{- 20}$	$4.78 \times 10^{- 50}$	p.inf.	p.inf.	p.inf.
Polynomial( $1 / t$ )	$8.47 \times 10^{- 22}$	$8.47 \times 10^{- 22}$	$1.48 \times 10^{- 58}$	p.inf.	p.inf.	p.inf.
Polynomial(t)	$5.15 \times 10^{- 18}$	$5.15 \times 10^{- 18}$	$1.53 \times 10^{- 54}$	p.inf.	p.inf.	p.inf.
Uniform(5,10)	∼0	∼0	∼0	p.inf.	∼0	p.inf.

Table 4. The error

∥ g - g_{N} ∥

and precision loss for Polynomial(

1 / t

) function,

N = 64, T_{max} = 32

.

Table 4. The error

∥ g - g_{N} ∥

and precision loss for Polynomial(

1 / t

) function,

N = 64, T_{max} = 32

.

	CIR		CIS		Ort1		CMG
$a$	Error	p.l.	Error	p.l.	Error	p.l.	Error	p.l.
$1 / 2$	0.693	12	0.693	3	$1.84 \times 10^{- 29}$	185	$7.52 \times 10^{- 2}$	1
1	n/a	n/a	$1.08 \times 10^{- 2}$	4	$5.69 \times 10^{- 44}$	162	$6.54 \times 10^{- 5}$	1
$11 / 10$	0.214	5	0.212	5	$1.59 \times 10^{- 42}$	162	$2.33 \times 10^{- 2}$	1
2	$5.15 \times 10^{- 18}$	20	$5.15 \times 10^{- 18}$	20	$1.53 \times 10^{- 54}$	143	p.inf.	1
4	$2.79 \times 10^{- 37}$	38	$2.79 \times 10^{- 37}$	38	∼0	142	p.inf.	1

Table 5. Error and precision loss for various test functions (

N = 64, T_{max} = 32, a = 2

).

Table 5. Error and precision loss for various test functions (

N = 64, T_{max} = 32, a = 2

).

	CIR		CIS		Ort1		CMG
	Error	p.l.	Error	p.l.	Error	p.l.	Error	p.l.
Dirac(10)	∼0	19	∼0	19	∼0	92	p.inf.	1
Poisson(1)	∼0	107	∼0	107	∼0	174	p.inf.	1
Heaviside step	$5.42 \times 10^{- 20}$	21	$5.42 \times 10^{- 20}$	21	$1.66 \times 10^{- 56}$	141	p.inf.	1
Geom( $1 / 2$ )	$2.94 \times 10^{- 39}$	39	$2.94 \times 10^{- 39}$	39	∼0	149	p.inf.	1
Geom( $- 1 / 2$ )	$2.94 \times 10^{- 39}$	39	$2.94 \times 10^{- 39}$	39	∼0	149	p.inf.	1
Triangle wave	$5.42 \times 10^{- 20}$	21	$5.42 \times 10^{- 20}$	21	$4.78 \times 10^{- 50}$	199	p.inf.	1
Polynomial( $1 / t$ )	$8.47 \times 10^{- 22}$	22	$8.47 \times 10^{- 22}$	22	$1.48 \times 10^{- 58}$	139	p.inf.	1
Polynomial(t)	$5.15 \times 10^{- 18}$	20	$5.15 \times 10^{- 18}$	20	$1.53 \times 10^{- 54}$	143	p.inf.	1
Uniform(5,10)	∼0	20	∼0	20	∼0	115	116	1

Table 6. Order 4 CIR method approximation of the Poisson(1) distribution.

T	0	1	2	3	4	5	6
$a = 1 / 2$	0.6155	0.4172	0.1921	0.0625	0.0385	0.0261	0.0120
$a = 1$	0.3832	0.3709	0.1845	0.0614	0.3832	0.3709	0.1845
$a = 2$	0.3688	0.3681	0.1840	0.0613	5.9014	5.8891	2.9436
exact	0.3679	0.3679	0.1839	0.0613	0.0153	0.0031	0.0005

Table 7. Errors for various test functions (

N = 16, T_{max} = 32

).

Table 7. Errors for various test functions (

N = 16, T_{max} = 32

).

Function	a	CIR	CIS	Ort1	Ort2	CMG
Dirac(10)	$1 / 2$	$1.53 \times 10^{- 5}$	$1.53 \times 10^{- 5}$	$6.62 \times 10^{- 4}$	$4.77 \times 10^{- 3}$	$2.39 \times 10^{- 5}$
Dirac(10)	1	1.00	1.00	$7.15 \times 10^{- 2}$	0.305	$1.53 \times 10^{- 3}$
Dirac(10)	$11 / 10$	4.59	4.59	0.229	0.541	$2.71 \times 10^{- 3}$
Poisson(1)	$1 / 2$	$5.61 \times 10^{- 6}$	$5.61 \times 10^{- 6}$	$9.46 \times 10^{- 15}$	0.312.	$4.21 \times 10^{- 9}$
Poisson(1)	1	0.368	0.368	$2.56 \times 10^{- 11}$	p.inf.	$1.04 \times 10^{- 4}$
Poisson(1)	$11 / 10$	1.69	1.69	$2.39 \times 10^{- 11}$	p.inf.	$1.00 \times 10^{- 3}$
Heaviside step	$1 / 2$	1.00	1.00	p.inf.	3.58	1.00
Heaviside step	1	n/a	1.50	0.407	p.inf.	$9.46 \times 10^{- 5}$
Heaviside step	$11 / 10$	4.87	4.77	$8.63 \times 10^{- 2}$	p.inf.	$4.66 \times 10^{- 2}$
Geometric( $1 / 2$ )	$1 / 2$	n/a	$2.29 \times 10^{- 5}$	$3.70 \times 10^{- 6}$	0.423	$1.44 \times 10^{- 9}$
Geometric( $1 / 2$ )	1	1.00	1.00	$1.93 \times 10^{- 10}$	p.inf.	$2.22 \times 10^{- 4}$
Geometric( $1 / 2$ )	$11 / 10$	4.59	4.59	$4.72 \times 10^{- 10}$	p.inf.	$2.26 \times 10^{- 3}$
Geometric( $- 1 / 2$ )	$1 / 2$	p.inf.	$2.29 \times 10^{- 5}$	$1.43 \times 10^{- 5}$	0.420	$1.53 \times 10^{- 5}$
Geometric( $- 1 / 2$ )	1	1.00	1.00	$2.67 \times 10^{- 4}$	p.inf.	$1.23 \times 10^{- 4}$
Geometric( $- 1 / 2$ )	$11 / 10$	4.59	4.59	$1.04 \times 10^{- 3}$	p.inf.	$1.37 \times 10^{- 3}$
Triangle wave	$1 / 2$	1.00	1.00	956	1.72	1.00
Triangle wave	1	n/a	1.50	0.703	242	0.500
Triangle wave	$11 / 10$	4.87	4.77	0.677	p.inf.	0.524
Polynomial( $1 / t$ )	$1 / 2$	0.0625	0.0625	71.1	0.679	$6.29 \times 10^{- 2}$
Polynomial( $1 / t$ )	1	n/a	1.02	$9.65 \times 10^{- 3}$	p.inf.	$8.20 \times 10^{- 4}$
Polynomial( $1 / t$ )	$11 / 10$	4.60	4.60	$1.65 \times 10^{- 3}$	p.inf.	$4.44 \times 10^{- 3}$
Polynomial(t)	$1 / 2$	31.0	31.0	p.inf.	36.3	31.0
Polynomial(t)	1	n/a	34.5	16.9	p.inf.	$1.08 \times 10^{- 4}$
Polynomial(t)	$11 / 10$	83.2	76.8	4.05	p.inf.	0.367
Uniform(5,10)	$1 / 2$	$1.53 \times 10^{- 5}$	$1.53 \times 10^{- 5}$	$4.39 \times 10^{- 4}$	$5.78 \times 10^{- 3}$	$2.73 \times 10^{- 5}$
Uniform(5,10)	1	1.0	1.0	$4.06 \times 10^{- 2}$	0.437	$2.65 \times 10^{- 3}$
Uniform(5,10)	$11 / 10$	4.59	4.59	0.119	0.814	$5.34 \times 10^{- 3}$

Table 8. Errors for various test functions (

N = 256, T_{max} = 512, a = 1

).

Table 8. Errors for various test functions (

N = 256, T_{max} = 512, a = 1

).

Function	CIR	CIS	Ort1	Ort2	CMG
Dirac(10)	1.00	1.00	p.inf.	p.inf.	$1.51 \times 10^{- 3}$
Poisson(1)	0.368	0.368	p.inf.	p.inf.	$1.61 \times 10^{- 3}$
Heaviside step	n/a	1.50	p.inf.	p.inf.	$1.01 \times 10^{- 3}$
Geometric( $1 / 2$ )	1.00	1.00	p.inf.	p.inf.	$1.23 \times 10^{- 4}$
Geometric( $- 1 / 2$ )	1.00	1.00	p.inf.	p.inf.	$1.23 \times 10^{- 4}$
Triangle wave	n/a	1.50	p.inf.	p.inf.	0.500
Polynomial( $1 / t$ )	n/a	1.0	p.inf.	p.inf.	$6.11 \times 10^{- 3}$
Polynomial(t)	n/a	575	p.inf.	p.inf.	$9.29 \times 10^{- 4}$
Uniform(5,10)	1.0	1.0	p.inf.	p.inf.	$3.28 \times 10^{- 3}$

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.

Numerical Inverse Transformation Methods for Z-Transform

Abstract

1. Introduction

2. The Z-Transform and Its Inverse

3. General Inverse Z-Transformation Methods

3.1. Inverse Transformation Based on Moments

3.2. Inverse Transformation Based on Orthogonal Decomposition

3.3. Inverse Transformation Based on a Linear System of Equations

4. Contour Integral-Based Inverse Z-Transformation Methods

4.1. FFT-Based Implementation

4.2. Interpretation Using Approximate Dirac function

5. Shifting the Nodes of the CIR Method

6. Concentrated Matrix Geometric Distribution-Based Inverse Transformation

6.1. Abate–Whitt Framework for Numerical Inverse Laplace Transformation

6.2. CME Method

6.3. CMG Method

7. Numerical Examples

7.1. Numerical Properties When T < N

7.1.1. General Comparison of the NIZT Methods

7.1.2. Precision Loss and the Effect of a

7.2. Numerical Properties of NIZT Methods When T ≥ N

7.2.1. Failure of Contour Integral Methods

7.2.2. Comparing The Error of NIZT Methods When T ≥ N

8. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Article Metrics

Article Access Statistics

7.1. Numerical Properties When $T < N$

7.2. Numerical Properties of NIZT Methods When $T \geq N$

7.2.2. Comparing The Error of NIZT Methods When $T \geq N$