Statistical Tests of Symbolic Dynamics

Fernando López; Mariano Matilla-García; Jesús Mur; Manuel Ruiz Marín

doi:10.3390/math9080817

,

and

¹

Departamento de Métodos Cuantitativos, Universidad Politécnica de Cartagena, 30202 Cartagena, Spain

²

Departamento de Economía Aplicada y Estadística, Universidad Nacional de Educación a Distancia (UNED), 28040 Madrid, Spain

³

Departamento de Análisis Económico, Universidad de Zaragoza, 50018 Zaragoza, Spain

^*

Author to whom correspondence should be addressed.

Mathematics2021, 9(8), 817;https://doi.org/10.3390/math9080817

This article belongs to the Special Issue Mathematical Methods on Economic Dynamics

Version Notes

Order Reprints

Abstract

A novel general method for constructing nonparametric hypotheses tests based on the field of symbolic analysis is introduced in this paper. Several existing tests based on symbolic entropy that have been used for testing central hypotheses in several branches of science (particularly in economics and statistics) are particular cases of this general approach. This family of symbolic tests uses few assumptions, which increases the general applicability of any symbolic-based test. Additionally, as a theoretical application of this method, we construct and put forward four new statistics to test for the null hypothesis of spatiotemporal independence. There are very few tests in the specialized literature in this regard. The new tests were evaluated with the mean of several Monte Carlo experiments. The results highlight the outstanding performance of the proposed test.

Keywords:

symbolic dynamics; time series analysis; test hypothesis

1. Introduction

The construction and design of powerful statistical tests are crucial elements for both theoretical and applied scientists. The utility of a test generally depends on its degree of applicability, which is usually related to the assumptions contained in the design of the test, and the restrictions of the scientific field in which the test will be used. Nowadays, the utility of statistical tests also depends on efficiency: reducing the need for computational resources and speed, which are vital for real-time monitoring and control applications. Taking applicability and efficiency into account, in this paper we propose a new general, flexible statistical methodology to design and test central hypotheses, and we establish an asymptotic distribution theory for a wide range of tests by using the new proposed approach.

The new framework is based on symbolic analysis, which is a field of increasing interest for several scientific disciplines (see [1]) Symbolic analysis studies dynamical systems on the basis of the sequences of symbols which are obtained for a suitable (and generally selected by the user) partition of the state space. In other words, the idea behind the symbolic approach is to split the phase space into a finite number of regions, and then each region is labeled with a symbol. From this point of view, the symbolic approach is a coarse-grained description of dynamics. As coarse-grained methods, which are usually used to provide some description of the data generating process, symbolic analysis focuses on some essential features of the generating dynamics which are frequently of interest to the researcher, for example, (in)dependence, cycles and nonlinear structure. In general terms, it can be said that symbolic analysis allows for designing tests that only focus on the relevant information required for the problem at hand.

This approach is not new in science. In the particular case of time series analysis, the symbolic approach implies transforming raw time series into a sequence of symbols. Although seeminglycounter-intuitive, symbolic analysis is rooted in information theory and also in dynamics theory. For example, properties of symbols or codes are central to the theory of communication [2]. Not in vein, there is a well-established mathematical discipline, namely, symbolic dynamics, that studies the behavior of dynamical systems. The name of “symbolic dynamics” was firstly coined by [3], although the discipline started in 1898 with the pioneering work by Hadamard, who developed a symbolic description of sequences of geodesic flows. Interestingly, ref. [4] highlighted the power of the symbolic approach by showing that a complete description of the behavior of a dynamical system can be captured in terms of symbols. Notice that this property is crucial for the understanding of this paper as long as important characteristics of a random variable can also be captured by studying the symbols derived from it.

The symbolic approach has been useful in many areas of scientific research. In the experimentalist realm, relevant contributions have been made in several fields: astrophysics; biology and medicine; chemistry, mechanical systems and fluid flow; artificial intelligence, control and communication; and data mining, classification and rule discovery ([5,6,7], for an overview). In the non-experimentalist realm, symbolic analysis has been interestingly used. In economics and finance, data are transformed and analyzed in terms of particular symbols [8]. Two examples are recession indicators utilized to study and to determine the business cycle, and the indicators used to characterize the stock market bull and bear market periods. In geography, works like that of [9] show how qualitative variables (symbolic analysis) can be used to map descriptions. In spatial econometrics, economic spatial dependence has recently been studied by transforming data into symbols [10,11]. Other interesting applications are [12,13].

Despite all these interesting applications and the scientifically founded roots of the symbolic approach, there is no systematic body of statistical tools for conducting inference based on symbolic sequences. There are some notable exceptions: [14,15,16,17,18,19,20,21]. A common factor to all of these statistical approaches is that they are centered on ordinal patterns, which is one type of symbol. In this paper we present a novel, systematic and general framework for any potential symbol in order to test for wide range of potential null hypotheses that include, as particular cases, most of the previously indicated multidisciplinary situations, namely, ordinal patterns. We also provide a general asymptotic distribution theory for symbolic analysis. Particularly, this paper shows how, by means of symbols, it is possible to design nonparametric tests for a wide class of null hypotheses with special attention to limitations (restrictions) that typically appear in economics and finance. Therefore, this paper aims also to provide the theoretical basis for hypothesis testing by means of symbols.

An appealing advantage to symbolic analysis is that it requires very few assumptions about the data generating process in order to conduct statistical inference. This advantage is promising as the tools based on this method will share the model-free property, which avoids making unnecessary assumptions and provides more general results. Most of the econometric and statistical tests typically used in some of the mentioned disciplines cannot deal with potential nonlinear forms of dependence. By construction, nonlinear structures are not a limitation for symbolic analysis.

The capability of this approach is clearly illustrated by the scope of what we label “the symbolic main theorem” (SMT). Given a null hypothesis H

_{0}

, for example, the null of serial independence, the SMT will give us four nonparametric asymptotic tests for that null, which are distribution free. The transformation of data into symbols is done by means of a symbolization map. Some of its properties are also studied in this paper. These symbolic-based tests have to deal with ordinary statistical problems that usually appear in economics and finance, such as data scarcity and suboptimal empirical power of the test. Given the flexibility of the symbols, we provide theoretical results and strategies to overcome such difficulties.

A clear example of the power of the new tool is illustrated by the spatio-temporal data modeling issues occupying a prominent role in spatial econometrics, geography and regional science, about which we can find a vast amount of literature ([3], and references therein). We constructed several symbolic-based tests by using the SMT. These tests also constitute an added-value of the paper, because there are currently very few available tests designed to deal with spatiotemporal dependence. The problem becomes more difficult if potential nonlinear dependence is considered. A notable exception is [9] who has treated nonlinearity in a spatial framework.

Finally, the results of this paper might be of interest to fields of research where information theory plays a relevant role. Particularly, nonparametric entropy measures and tests for serial dependence have drawn the attention of econometricians (see [22] and references therein). The clearest link between our results and information theory is through the concept of symbolic entropy. In the context of time series analysis, permutation entropy, which is a type of symbolic entropy, uses the probabilities of length-m ordinal patterns in the definition of Shannon entropy. An ordinal pattern is a particular type of symbolization map. Given the characteristics of this map, the SMT allows us to obtain an asymptotic distribution theory for a permutation entropy-based test. Providing the statistical foundation for permutation entropy is specially relevant because: (a) there are very few asymptotic distribution theories available for entropy, in general; and (b) permutation entropy is currently used in computer science due to its relation to “incompressibility”, and is also useful in the study of dynamical systems because of its connection to complexity.

From another point of view, some well-established nonparametric tests can be understood as particular types of symbolic analysis. For example, the nonparametric runs test for randomness by Wald-Wolfowitz (see [23]); joint-counting procedures for spatial association [24]; and in general, categorical data techniques [25] are simple examples that use the very general procedures of translating information into symbols. In this regard, symbolic analysis can be understood as a method related to this literature.

The paper is organized as follows: In Section 2, we provide the main notation and relevant concepts that will be used in the paper. Among them we highlight: symbolization maps, standard or non-standard maps and decomposable maps. Due to the generality of the method, we require the potential tests to be adaptable to different contexts that are to be able to deal with a wide range of null hypotheses. To this end we introduce the notion of perfect and non-perfect set on subindexes in Section 3. This allows us to give general theoretical results to tackle practical situations that might otherwise be intractable because of the problem and/or of the type of hypothesis. Therefore we distinguish between two main classes of theoretical situations that lead us to different statistical solutions. In Section 4 we show how to construct symbolic-based tests via likelihood ratio statistics and via asymptotic normality. Section 5 considers the theoretical case that the null hypothesis cannot be treated under perfect situations, and hence other results are applicable. Section 6 puts forward the main theorem of this paper. Under the general conditions of this theorem, we introduce four tests for serial independence, four tests for spatial independence and four new tests for spatiotemporal independence, in Section 7. These tests are based on different symbolization maps, according to those given in Section 2 and Section 3. Finally, in Section 8, we outline a Monte Carlo simulation experiment to show the capabilities of the spatiotemporal test for independence under linear and nonlinear settings. The paper ends with some conclusions.

2. Notation and Definitions

As indicated in the previous section, we give some definitions and introduce the basic notation that will be used throughout the rest of the paper.

Let

{X_{i}}_{i \in I}

be a stationary real-valued process, where I is a set of indexes.

Let

Γ = {η_{1}, η_{2}, \dots, η_{n}}

be a set of

n > 1

elements that we label as symbols. Now assume that there exists a map

f : {X_{i}}_{i \in I^{'}} \to Γ

for some subset of indexes

I^{'} \subseteq I

. We will say that

i \in I

is of

η

-type if and only if

f (X_{i}) = η

. We will call the map f a symbolization map for

{X_{i}}_{i \in I}

.

Notice that it is possible to expand the definition of a symbolization map to the

k —

dimensional case by introducing the concept of decomposable maps: if

f_{j} : {X_{i j}}_{i \in I^{'}} \to Γ_{j}

,

j = 1, 2, \dots, k

, are k symbolization maps, then the product

F = \prod_{j = 1}^{k} f_{j} : \prod_{j = 1}^{k} {X_{i j}}_{i \in I^{'}} \to \prod_{j = 1}^{k} Γ_{j}

is a symbolization map for the k-dimensional variable

{(X_{i 1}, X_{i 2}, \dots, X_{i k})}_{i \in I^{'}}

. We will call F a k-decomposable symbolization map.

Given a symbolization map

f : {X_{i}}_{i \in I^{'}} \to Γ

and a symbol

η \in Γ

, we denote by

p_{η}

the probability of occurrence of symbol

η

. Symbolization maps can be classified according to their behavior under the null hypothesis. If the symbolization map f is such that under a given null hypothesis (H

_{0}

) all the symbols have the same probability to occur, we will say that f is a standard symbolization map. On the contrary, we will refer to f as a non-standard symbolization map.

The symbolic entropy of a process

{X_{i}}_{i \in I^{'}}

is defined as the Shannon’s entropy of the n distinct symbols as follows:

h (Γ) = - \sum_{η \in Γ} p_{η} ln (p_{η}),

(1)

with the convention

0 \times ln 0 = l \overset{´}{ı} m_{x \to 0^{+}} x ln x = 0 .

Symbolic entropy,

h (Γ)

, can be understood as the information in terms of symbols

η \in Γ

of the process

{X_{i}}_{i \in I^{'}}

. Notice that

0 \leq h (Γ) \leq ln (n)

. Notice also that the lower bound is attained when only one symbol occurs, and the upper bound when all n possible symbols appear with the same probability.

Consider the following index

i \in I^{'}

that we define an indicator random variable

Z_{η i}

as follows:

Z_{η i} = \{\begin{matrix} 1 & if f (X_{i}) = η \\ 0 & otherwise, \end{matrix}

that is, we have that

Z_{η i} = 1

if and only if i is of

η

-type, and

Z_{η i} = 0

otherwise.

Then

Z_{η i}

is a Bernoulli variable with probability of “success”

p_{η}

, where “success” means that i is of

η

-type. It is straightforward to see that

\sum_{η \in Γ} p_{η} = 1

Our interest is in knowing how many is are of

η

-type for all symbol

η \in Γ

. In order to answer the question, we construct the following counting variable:

Y_{η} = \sum_{i \in I^{'}} Z_{η i}

The variable

Y_{η}

can take the values

{0, 1, 2, \dots, R},

where

| I^{'} | = R

.

To complete with notation, we will denote by

n_{η} = |{i \in I^{'} | f (X_{i}) = η}|,

the cardinality of the subset of symbolized indexes

I^{'}

formed by all the elements of

η

-type.

Then, under the conditions above, one could easily compute the relative frequency of a symbol

η \in Γ

by:

{\hat{p}}_{η} : = \frac{|{i \in I^{'} | i is of η - type}|}{R}

which is the maximum likelihood estimator of

p_{η} .

3. On the Independence of Variables ${Z_{η i}}_{i \in I^{'}}$

If the subset of symbolized indexes

I^{'}

is chosen such that the

Z_{η i}

variables are independent for all

i \in I^{'}

, then

Y_{η}

is a binomial random variable

Y_{η} \sim B (R, p_{η}) .

Moreover, if the subset of indexes

I^{'}

is such that the variables

Z_{η_{s}}

and

Z_{σ_{t}}

are independent for all symbol

η_{s}, σ_{t} \in Γ,

s \neq t,

where

s, t \in {1, 2, \dots, n},

then the joint probability density function of the n variables

(Y_{η_{1}}, Y_{η_{2}}, \dots, Y_{η_{n}})

is:

P (Y_{η_{1}} = a_{1}, Y_{η_{2}} = a_{2}, \dots, Y_{η_{n}} = a_{n}) = \frac{(a_{1} + a_{2} + \dots + a_{n})!}{a_{1}! a_{2}! \cdot \dots \cdot a_{n}!} p_{η_{1}}^{a_{1}} p_{η_{2}}^{a_{2}} \dots p_{η_{n}}^{a_{n}}

where

a_{1} + a_{2} + \dots + a_{n} = R

. Consequently the joint distribution of the n variables

(Y_{η_{1}}, Y_{η_{2}}, \dots, Y_{η_{n}})

is a multinomial distribution. In this case we will call the set of symbolized indexes

I^{'}

a perfect subset.

It is possible to develop theoretical results that contemplate situations for which the researcher will benefit of constructing symbolization maps for which the set

I^{'}

is not perfect. Given our previous definition of perfect set, the indexes subset

I^{'}

can be nonperfect in the following cases:

Case (a): $Z_{η i}$ and $Z_{η j}$ are not independent for all $i \neq j$ for some $η \in Γ$ , and hence $Y_{η}$ is not a binomial random variable.
Case (b): $Z_{η_{s}}$ and $Z_{σ_{t}}$ are not independent for all $s \neq t$ for some $σ_{t}, η_{s} \in Γ$ , and hence $(Y_{η_{1}}, Y_{η_{2}}, \dots, Y_{η_{n}})$ is not a multinomial distribution.

Theoretical results for perfect and non-perfect subsets are the topics of the following two sections.

4. Constructing Tests with Perfect Subsets of $I$

In this section we establish a general framework for testing for a null hypothesis H

_{0}

. This is done by focusing on the symbols’ distribution when the subset

I^{'}

of symbolized symbols is perfect. Under this condition, we now show how to construct tests of hypothesis via likelihood ratio statistics and via asymptotic normality.

Procedure

In general, our procedure consists of proceeding systematically as follows:

(step 1) Fix the null hypothesis H $_{0}$ to be tested.
(step 2) Define the set of symbols $Γ$ and the symbolization map f.
(step 3) Compute the distribution of the symbols under H $_{0}$ , namely, $p_{η}^{(0)}$ for all $η \in Γ$ .
(step 4) Finally design and compute a desired test statistic.

Steps 1 to 2 will be developed later in the paper. To accomplish the aim with steps 3 and 4, we first show how to construct a likelihood ratio test.

Recall that when the set is perfect,

(Y_{η_{1}}, \dots, Y_{η_{n}})

follows a multinomial distribution and its likelihood function is:

L (p_{η_{1}}, p_{η_{2}}, \dots, p_{η_{n}}) = \frac{R!}{n_{η_{1}}! n_{η_{2}}! \cdot \dots \cdot n_{η_{n}}!} p_{η_{1}}^{n_{η_{1}}} p_{η_{2}}^{n_{η_{2}}} \dots p_{η_{n}}^{n_{η_{n}}}

and the likelihood ratio test statistic is

λ (Y) = \frac{\frac{R!}{n_{η_{1}}! n_{η_{2}}! \cdot \dots \cdot n_{η_{n}}!} p_{η_{1}}^{(0) n_{η_{1}}} p_{η_{2}}^{(0) n_{η_{2}}} \dots p_{η_{n}}^{(0) n_{η_{n}}}}{\frac{R!}{n_{η_{1}}! n_{η_{2}}! \cdot \dots \cdot n_{η_{n}}!} {\hat{p}}_{η_{1}}^{n_{η_{1}}} {\hat{p}}_{η_{2}}^{n_{η_{2}}} \dots {\hat{p}}_{η_{n}}^{n_{η_{n}}}},

(2)

where

{\hat{p}}_{η_{i}}

is the maximum likelihood estimator of

p_{η_{i}}

for all

i = 1, 2, \dots, n

. In this case, as shown in the Appendix A, maximum likelihood estimators are

{\hat{p}}_{η_{i}} = \frac{n_{η_{i}}}{R} .

Moreover, it is possible to show (see Appendix A.2) that

- 2 [R L n (R) + \sum_{i = 1}^{n} n_{η_{i}} L n (\frac{p_{η_{i}}}{n_{η_{i}}^{0}})] \sim χ_{k}^{2},

where the k degrees of freedom will depend on the set of symbols.

It is also proved that, if standard symbolization maps are considered, then

2 R [L n (n) - h (Γ)] \sim χ_{k}^{2},

(3)

which is an affine transformation of the symbolic entropy (1).

As we have emphasized in the Introduction that nonparametric results of entropy based measures are relevant for econometrics and for other fields of research. Given that the distribution of the affine transformation in (3) holds for standard symbolization maps under broad conditions, this result will be of general applicability. In particular, as we will show in other sections, permutation entropy, as introduced in [26], is a particular type of symbolic entropy that has drew the attention of several scholars for theoretical interests basically because stationary and ergodic processes coincide with Shannon entropy, and for an applied interest in nonlinear and complex systems or processes (see [1]). In this regard, (2) and (3) can be viewed as an initial step towards statistical inference of ordinal pattern distributions.

An alternative to likelihood ratio tests for symbolic maps can be considered by modifying step 4. Given that under a perfect subset

I^{'}

the indicator variables

Z_{η_{i}}

are independent, the random variable

\frac{Y_{η} - R p_{η}}{\sqrt{R p_{η} (1 - p_{η})}}

has a limiting normal distribution with zero mean and unit variance for all symbol

η \in Γ

. Moreover

(\frac{Y_{η_{1}} - R p_{η_{1}}}{\sqrt{R p_{η_{1}} (1 - p_{η_{1}})}}, \frac{Y_{η_{2}} - R p_{η_{2}}}{\sqrt{R p_{η_{2}} (1 - p_{η_{2}})}}, \dots, \frac{Y_{η_{n}} - R p_{η_{n}}}{\sqrt{R p_{η_{n}} (1 - p_{η_{n}})}})

asymptotically distributes as a multivariate normal distribution M

N (0, I)

. In this case, the asymptotic distribution holds for standard or nonstandard symbolization maps at the cost of estimating

p_{η_{i}}

that can be consistently estimated by

{\hat{p}}_{η_{i}}

.

5. Constructing Tests with Non Perfect Subsets of $I$

In this section we establish the equivalent counterpart of the general framework (above presented) when the subset

I^{'}

of symbolized symbols is non-perfect. Accordingly, we now show to what extent and under which situations the previous likelihood and asymptotically normal tests (elaborated under perfect situations) can be adapted to deal with them.

Non perfect sets of indexes I might be very useful for test design, especially for situations or scientific domains characterized by relatively scarce sample size as compared with the number of symbols. In macroeconomics, although not necessarily in finance, data scarcity can be the usual restriction. Ideally, one can be able to design perfect sets of indexes to carry out symbolic-based hypothesis testing. This section then tries to provide symbolic-based methods for constructing hypothesis tests for situations for which an ideal design is not possible because of the nature of the problem, because of the computational capabilities or because of any other potential reason.

5.1. Binomial Approximation

Let us consider that variables

Z_{η i}

and

Z_{η j}

are not independent for all

i \neq j

for some

η \in Γ

. In this case,

Y_{η}

is not a binomial random variable. The interesting question is how far is

Y_{η}

from

B (R, p_{η}) .

We are interested in studying under what assumptions the variable

Y_{η}

can be approximated to a binomial random variable:

Y_{η} \approx B (R, p_{η}) .

In fact, it is possible to compute a bound for this binomial approximation.

Denote by

L (Y_{η})

the distribution of

Y_{η}

, and we are interested in the bound of the binomial approximation of the distribution of

Y_{η}

measured in terms of total variation distance. The total variation distance

d_{T V}

between two probability measures P and Q is defined by

d_{T V} (P, Q) = sup_{A} | P (A) - Q (A) |

where the supremum is taken over all measurable sets of the real line.

Following Theorem 1.1 in [27] and after a few calculations, a bound can be given as follows:

For each

i, j \in I^{'}

let

Z_{η j}, Z_{η i}

and

J_{η i j}

be defined in the same probability space where

J_{η j i} = (Z_{η j} | Z_{η i} = 1)

. Let

W_{i} = \sum_{j \neq i} Z_{η j} V_{i} = \sum_{j \neq i} J_{η j i}

and

C_{R, p} = \frac{1 - p^{R + 1} - q^{R + 1}}{(R + 1) p q} .

W_{i}

counts the number of indexes that are of

η

-type and

p = p_{η}

and

q = 1 - p_{η}

. On the other hand

V_{i} = \sum_{j \neq i} (Z_{η j} | Z_{η i} = 1)

counts the number of indexes that are of

η

-type conditioned to location i is of

η

-type.

Then

d_{T V} (L (Y_{η}), B (R, p)) \leq C_{R, p} p \sum_{i = 1}^{R} E (| W_{i} - V_{i} |) .

Therefore, in order to get a bound for the binomial approximation, we have to get bound

\sum_{i = 1}^{R} E (| W_{i} - V_{i} |) .

On the other hand, we have that

E (| W_{i} - V_{i} |) \leq \sum_{j \neq i} E (| Z_{η j} - J_{η j i} |)

and

p E (J_{η j i}) = E (Z_{η i} Z_{η j})

. Now denote by

B_{i}^{η}

a subset of indexes such that

Z_{η i}

is independent of

{Z_{η j} | j \notin B_{i}}

. Therefore, we obtain that

\begin{matrix} d_{T V} (L (Y_{η}), B (R, p)) & \leq & C_{R, p} p \sum_{i = 1}^{R} E (| W_{i} - V_{i} |) \\ \leq & C_{R, p} p \sum_{i = 1}^{R} \sum_{j \in B_{i} \ {i}} E (| Z_{η j} - J_{η j i} |) \\ \leq & C_{R, p} p \sum_{i = 1}^{R} \sum_{j \in B_{i} \ {i}} (p + E (J_{η j i})) \\ \leq & C_{R, p} (\sum_{i = 1}^{R} p^{2} (| B_{i} | - 1) + \sum_{i = 1}^{R} \sum_{j \in B_{i} \ {i}} E (Z_{η i} Z_{η j})) \end{matrix}

Therefore, we have shown that the sum of dependent indicators can be approximated to a binomial random variable when the following two conditions are satisfied: (1) dependencies among the indicators are weak and (2) the probabilities of the indicators occurring under the null hypothesis are small. Notice that point (1) can be guaranteed by selecting the subset of symbolized indexes

I^{'}

such that

| B_{i} |

is small enough.

5.2. Normal Approximation

Additionally, when the indicator variables

Z_{η i}

are not independent for all

i \in I^{'}

, the following central limit theorem for dependent indicators ensures the convergence to a normal distribution:

Theorem 1.

(Theorem 7.7.5 (Anderson, 1971)). Let

Z_{1}, Z_{2}, \dots

be a stationary stochastic process such that for every integer n and integers

t_{1}, t_{2}, \dots, t_{n}

with

t_{1} < \dots < t_{n},

Z_{t_{1}}, \dots, Z_{t_{n}}

are distributed independently of

Z_{1}, \dots, Z_{t_{1} - m - 1}

and

Z_{t_{n} + m + 1}, \dots

. If

E (Z_{t}) = 0

and

E (Z_{t}^{2}) < \infty

, then

\frac{\sum_{t = 1}^{T} Z_{t}}{\sqrt{T}}

has a limiting normal distribution with mean 0 and variance

E (Z_{1}^{2}) + 2 E (Z_{1} Z_{2}) + \dots + 2 E (Z_{1} Z_{m + 1}) .

Theorem 1 states that, if the dependencies are weak (for instance,

| B_{i} |

is small enough for all

i \in I^{'}

) we get that

\frac{Y_{η} - R p_{η}}{\sqrt{V a r (Y_{η})}} \overset{d}{\to} N (0, 1)

as

R \to \infty

. The variance of the variable

Y_{η}

can be computed as follows:

\begin{matrix} V a r (Y_{η}) & = & E (Y_{η}^{2}) - E {(Y_{η})}^{2} = \sum_{i = 1}^{R} E (Z_{η i}^{2}) + \sum_{i = 1}^{R} \sum_{j \neq i} E (Z_{η i} Z_{η j}) - R^{2} p_{η}^{2} \\ = & R p_{η} - R^{2} p_{η}^{2} + \sum_{i = 1}^{R} (R - | B_{i} |) p_{η}^{2} + \sum_{i = 1}^{R} \sum_{j \in B_{i} \ {i}} E (Z_{η i} Z_{η j}) \\ = & R p_{η} - R^{2} p_{η}^{2} + R^{2} p_{η}^{2} - p_{η}^{2} \sum_{i = 1}^{R} | B_{i} | + \sum_{i = 1}^{R} \sum_{j \in B_{i} \ {i}} E (Z_{η i} Z_{η j}) \\ = & R p_{η} - p_{η}^{2} \sum_{i = 1}^{R} | B_{i} | + \sum_{i = 1}^{R} \sum_{j \in B_{i} \ {i}} E (Z_{η i} Z_{η j}) \end{matrix}

(4)

We now consider the case where

Z_{η_{s}}

and

Z_{σ_{t}}

are not independent for all

s \neq t

for some

σ_{t}, η_{s} \in Γ

, and hence

(Y_{η_{1}}, Y_{η_{2}}, \dots, Y_{η_{n}})

is not a multinomial distribution (i.e., previous case (b)). To this end, denote by

B_{i}^{(η, σ)}

the subset of indexes satisfying that

Z_{η i}

is independent of

{Z_{σ j} | j \notin B_{i}^{(η, σ)}}

and

X_{η} = \frac{Y_{η} - R p_{η}}{\sqrt{V a r (Y_{η})}}

. It is possible to show that

(X_{η_{1}}, X_{η_{2}}, \dots, X_{η_{n}})

is a multivariate normal distribution. In fact, it is equivalent to proving that any linear combination of the

X_{η}

s is normal. In order to do so, notice that each variable

X_{η}

is a sum of indicator variables, and therefore, any linear combination of these variables is again a sum of indicator variables. Consider an arbitrary linear combination as follows:

M = α_{1} X_{η_{1}} + α_{2} X_{η_{2}} + \dots + α_{n} X_{η_{n}}

Notice that the variable

α X_{η}

can be written as

α X_{η} = \sum_{i = 1}^{R} I_{i}^{η}

where

I_{i}^{η}

is an indicator variable for all

i \in I^{'}

. Therefore, it follows that the variable M is a sum of dependent indicator variables. Again, by Theorem 1 we get that M follows a normal distribution whenever the dependencies among the indicator variables are weak (for instance, when the cardinality of the set

B_{i} \cup B_{i}^{(η_{s}, η_{t})}

is small enough for all

i \in I^{'}

and all symbol

η, η_{s}, η_{t} \in Γ

). Then we get that

(X_{η_{1}}, X_{η_{2}}, \dots, X_{η_{n}})

has a limiting multivariate normal distribution and we can compute the variance and covariance matrix for all

η_{s}, η_{t} \in Γ

as follows:

\begin{matrix} C o v (Y_{η_{s}}, Y_{η_{t}}) & = & \sum_{i = 1}^{R} \sum_{j = 1}^{R} C o v (Z_{η_{s} i}, Z_{η_{t} j}) = \sum_{i = 1}^{R} \sum_{j \in B_{i}^{(η_{s}, η_{t})}} C o v (Z_{η_{s} i}, Z_{η_{t} j}) \\ = & \sum_{i = 1}^{R} \sum_{j \in B_{i}^{(η_{s}, η_{t})}} {E (Z_{η_{s} i} Z_{η_{t} j}) - E (Z_{η_{s} i}) E (Z_{η_{t} j})} \\ = & p_{η_{s}} \sum_{i = 1}^{R} \sum_{j \in B_{i}^{(η_{s}, η_{t})}} p (Z_{η_{t} j} = 1 | Z_{η_{s} i} = 1) - p_{η_{s}} p_{η_{t}} \sum_{i = 1}^{R} | B_{i}^{(η_{s}, η_{t})} | . \end{matrix}

(5)

6. The Symbolic Main Theorem

Previous partial results can be collected in the following main theorem.

Theorem 2.

Let

{X_{i}}_{i \in I^{'}}

be a stationary real valued process. Let

Γ = {η_{1}, η_{2}, \dots, η_{n}}

be a finite set of symbols. Let

f : {X_{i}}_{i \in I^{'}} \to Γ

be a standard symbolization map for a null hypothesis H

_{0}

. Then under the null H

_{0}

we have that:

If the set of symbolized indexes $I^{'}$ is perfect:
- Then $G (Γ) = 2 R [ln (n) - h (Γ)]$ is asymptotically $χ_{k}^{2}$ distributed where k is the difference between the number of parameters to be estimated under the alternative hypothesis $H_{1}$ and the number of parameters to be estimated under the null H $_{0}$ .
- Then $(\frac{Y_{η_{1}} - R p_{η_{1}}}{\sqrt{R p_{η_{1}} (1 - p_{η_{1}})}}, \frac{Y_{η_{2}} - R p_{η_{2}}}{\sqrt{R p_{η_{2}} (1 - p_{η_{2}})}}, \dots, \frac{Y_{η_{n}} - R p_{η_{n}}}{\sqrt{R p_{η_{n}} (1 - p_{η_{n}})}})$ is a multivariate normal distribution MN $(0, I)$ .
The set of symbolized indexes $I^{'}$ is not perfect:
- If the sets $B_{i}$ and $B_{i}^{(η_{s}, η_{t})}$ have good properties in the sense that for all symbol $η \in Γ$ the variables $Y_{η}$ have a good approximation to a binomial distribution and $(Y_{η_{1}}, Y_{η_{2}}, \dots, Y_{η_{n}})$ has a good approximation to a multinomial distribution, then $G (Γ) = 2 R [ln (n) - h (Γ)]$ is asymptotically $χ_{k}^{2}$ distributed where k is the difference between the number of parameters to be estimated under the alternative hypothesis $H_{1}$ and the number of parameters to be estimated under the null H $_{0}$ .
- If $I^{'}$ is such that Theorem 1 holds, then $(\frac{Y_{η_{1}} - R p_{η_{1}}}{\sqrt{V a r (Y_{η_{1}})}}, \frac{Y_{η_{2}} - R p_{η_{2}}}{\sqrt{V a r (Y_{η_{2}})}}, \dots, \frac{Y_{η_{n}} - R p_{η_{n}}}{\sqrt{V a r (Y_{η_{n}})}})$ is a multivariate normal distribution MN $(0, Σ)$ where the variance and covariance matrix can be estimated using (4) and (5).

Notice that in the case of nonstandard symbolization maps, an analogous theorem to Theorem 2 will hold. More concretely, for point 1 of the SMT, a likelihood ratio test is also available, although not in the closed form as presented here (see for example [28]). Point 2 of the SMT hold independently of whether the symbolization map is standard.

The usefulness of the theorem will become evident in the next section, where it will be applied to specific null hypotheses. Naturally, it is possible to consider an alternatively bootstrap-based test for symbolization maps, instead of asymptotic ones, although we do not follow this way in this paper, it being a subfield for further research of interest since it partially might avoid taking care of nonperfect indexes sets.

7. Different Symbolizations for Different Nulls Related with Independence

According to the general symbolic theorem, in this section we show how it is possible to test interesting null hypotheses by using symbolic analysis. To concrete, we focus on testing for different nulls of independence as it is a well-known field of research and because recently published articles can be generally understood and extended under this new theoretical framework. Given that each null hypothesis (step 1) will require a particular symbolization map, in this section we present different symbolization procedures (step 2) to test for serial dependence, spatial dependence, and spatiotemporal dependence, respectively. Then we present the results of step 3 and step 4 depending on the statistic technique the researcher wants to use according to Theorem 2, i.e., either likelihood ratio statistics or/and asymptotically normal statistics. Given a null hypothesis, the behavior of the tests obtained from this approach will strongly depend on the expertise of the researcher in constructing the symbolization map. We emphasize that both power analysis of the class of tests, and power competition among alternative nonparametric tests were already given in previous work [10,16]; therefore, we are not going to replicate them here.

As we have indicated, the crucial component of the symbolic procedure is to choose a symbolic mapping which ensures that the distribution of the symbols can detect deviations from the null. The null hypotheses considered in this section are related to the important topic of “statistical independence”. This is a very well-studied topic in time series analysis and therefore there is a generous number of available tests. On the contrary, spatial independence is not so well-known and is non-trivial how to test for it. As we will show, it is needed to use another different symbolization map for detecting spatial patterns. Similar comments can be made for spatio-temporal independence. Needless to say, there are other hypotheses of interest in econometrics, and the researcher will have to design suitable symbolic maps for testing them. For example, in [29], the authors dealt with the opposite problem: how to test for a pure deterministic chaotic process. In these and other cases, the power of the tests will centrally depend on the ability of the research to design the symbolization map for the desired null hypothesis.

7.1. Serial Independence Tests

In the case of time series, refs. [15,16] used the following symbolization procedure to test for serial dependence: Let

{X_{t}}_{t \in I}

be a real-valued time series (in this case the subindex t refers to time) for which we are interested in testing the null of serial independence (step 1). In order to complete step 2, we denote by

Γ_{1} = S_{m}

the symmetric group of order

m!

, that is, the group formed by all the permutations of length m (for a positive integer

m \geq 2)

. Let

π_{i} = (i_{1}, i_{2}, \dots, i_{m}) \in S_{m}

. The positive integer m is usually known as the embedding dimension.

An ordinal pattern for a symbol is defined as

π_{i} = (i_{1}, i_{2}, \dots, i_{m}) \in S_{m}

at a given time

t \in I

. The time series can be embedded in an m-dimensional space:

X_{m} (t) = (X_{t + 1}, X_{t + 2}, \dots, X_{t + m}) for t \in I

It is said that t is of

π_{i} —

type if and only if

π_{i} = (i_{1}, i_{2}, \dots, i_{m})

is the unique symbol in the group

S_{m}

satisfying the two following conditions:

\begin{matrix} (a) X_{t + i_{1}} \leq X_{t + i_{2}} \leq \dots \leq X_{t + i_{m}}, and \\ (b) i_{s - 1} < i_{s} i f X_{t + i_{s - 1}} = X_{t + i_{s}} \end{matrix}

Notice that condition

(b)

guaranties uniqueness of the symbol

π_{i}

. This is justified if the values of

X_{t}

have a continuous distribution so that equal values are very uncommon, with a theoretical probability of occurrence of 0.

In this case, the symbolization map is defined as

f_{1} : {X_{t}}_{t \in I^{'}} \to S_{m}

given by

f_{1} (X_{t}) = (i_{1}, i_{2}, \dots, i_{m})

(6)

where

(i_{1}, i_{2}, \dots, i_{m}) \in S_{m}

is such that t is of

(i_{1}, i_{2}, \dots, i_{m})

-type. Now the design of the symbolization map (step 2) is completed.

Moreover, under the null of independence the distribution of the symbols is uniform and therefore the map

f_{1}

is a standard symbolization map. Additionally, the set of symbolized indexes is

I^{'} = {1, 2, \dots, T - m + 1},

which is not perfect.

Notice that in order to have a perfect set and therefore ensure the independence of the indicator variables

Z_{π t}

, it is enough to consider as a set of symbolized indexes

I^{'} = {0, m - 1, 2 (m - 1), \dots, t (m - 1), \dots} .

Accordingly, using this symbolization map, the next corollary straightforwardly follows from Theorem 2:

Corollary 1.

Let

f_{1} : {X_{t}}_{t \in I^{'}} \to Γ_{1}

be the symbolization map defined in (6) with

| I^{'} | = R

. Denote by

h (Γ_{1})

the permutation entropy defined in (1). If the time series

{X_{t}}_{t \in I}

is independent, then

If the set $I^{'}$ is perfect, then:
1.
The affine transformation of the permutation entropy

$G (S_{m}) = 2 R [L n (m!) - h (S_{m})]$

is asymptotically $χ_{m! - 1}^{2}$ distributed.
2.
Then $(\frac{Y_{π_{1}} - R p_{π_{1}}}{\sqrt{R p_{π_{1}} (1 - p_{π_{1}})}}, \frac{Y_{π_{2}} - R p_{π_{2}}}{\sqrt{R p_{π_{2}} (1 - p_{π_{2}})}}, \dots, \frac{Y_{π_{m!}} - R p_{π_{m!}}}{\sqrt{R p_{π_{m!}} (1 - p_{π_{m!}})}})$ is a multivariate normal distribution NM $(0, I)$ .
If the set $I^{'}$ is not perfect:
1.
Since the sets $B_{i}^{π}$ ’s has cardinality of at most $2 m$ , we can get a good approximation to the following result via [21]. The affine transformation of the permutation entropy

$G (S_{m}) = 2 R [L n (m!) - h (S_{m})]$

is asymptotically $χ_{m! - 1}^{2}$ distributed.
2.
Then $(\frac{Y_{π_{1}} - R p_{π_{1}}}{\sqrt{V a r (Y_{π_{1}})}}, \frac{Y_{π_{2}} - R p_{π_{2}}}{\sqrt{V a r (Y_{π_{2}})}}, \dots, \frac{Y_{π_{m!}} - R p_{π_{m!}}}{\sqrt{V a r (Y_{π_{m!}})}})$ is a multivariate normal distribution MN $(0, Σ)$ where the variance and covariance matrix can be estimated using (4) and (5).

These results for permutation entropy are in relation to a relatively recent line of research based on order patterns for analyzing time series. Ordinal patterns can be, per se, used for descriptive purposes, like autocorrelation, with the added advantage that the require no assumptions such as Gaussianity or linearity. On the contrary, only mild stationary conditions can exist in the underlying process. The above corollary is a further step for the development of statistical inference for ordinal time series. Naturally, it is possible to obtain other kinds of statistical results by adding more assumptions to the generating process. In fact, notorious results can be found in [4]) if Gaussianity and ergodicity are assumed. In this regard, our asymptotic results for order patterns keep assumptions at a minimum. Additionally, by maintaining general applicability at minimum cost (in terms of assumptions) for serial independence tests, some bootstrap-based statistics for ordinal patterns have been put forward in [29].

An interesting property of the symbolization procedure presented in this section is that it can be also used for discrete distributions. To do so it necessary to consider a non-standard version of the map. Under such circumstances, the likelihood ratio (2) can be directly used once the behavior of

p_{η_{i}}

is known under the null of serial independence.

7.2. Spatial Independence Tests

In the case of spatial processes, ref. [10] gave a symbolization procedure to test for spatial independence as follows: Let

{X_{s}}_{s \in S}

be a real-valued spatial process, where S is a set of coordinates. Given a location

s_{0}

, we will denote by

(ρ_{i}^{0}, θ_{i}^{0})

the polar coordinates of location

s_{i}

taking as origin

s_{0}

.

Let

m \in N

with

m \geq 2

. Consider now that the spatial process

{X_{s}}_{s \in S}

is embedded in a different m-dimensional space as follows:

X_{m} (s_{0}) = (X_{s_{0}}, X_{s_{1}}, \dots, X_{s_{m - 1}}) for s_{0} \in S

where

s_{1}, s_{2}, \dots, s_{m - 1}

are the

m - 1

nearest neighbors to

s_{0}

, which are ordered from lesser to higher Euclidean distance with respect to location

s_{0}

. Notice that in the case of two or more locations being equidistant to

s_{0}

, we will choose them in an anticlockwise manner. In formal terms,

s_{1}, s_{2}, \dots, s_{m - 1}

are the

m - 1

nearest neighbors to

s_{0}

satisfying the following two conditions:

(a) $ρ_{1}^{0} \leq ρ_{2}^{0} \leq \dots \leq ρ_{m - 1}^{0}$ ;
(b) If $ρ_{i}^{0} = ρ_{i + 1}^{0}$ , then $θ_{i}^{0} < θ_{i + 1}^{0}$ .

Notice that conditions

(a)

and

(b)

ensure the uniqueness of

X_{m} (s)

for all

s \in S

.

The proposed standard symbolization map f is defined as follows: denote by

M e

the median of the spatial process

{X_{s}}_{s \in S}

and let

δ_{s} = \{\begin{matrix} 0 if X_{s} \leq M e \\ 1 otherwise \end{matrix}

Now, define the indicator function

I_{s_{1} s_{2}} = \{\begin{matrix} 0 if δ_{s_{1}} \neq δ_{s_{2}} \\ 1 otherwise \end{matrix}

(7)

Then, the standard symbolization map

f_{2} : {X_{s}}_{s \in I^{'}} \to Γ_{2}

(8)

is defined as:

f_{2} (X_{s}) = (I_{s s_{1}}, I_{s s_{2}}, \dots, I_{s s_{m - 1}}),

(9)

where

Γ_{2}

stands for the set of symbols defined by

f_{2} .

Notice that under the null of spatial independence, the distribution of the symbols is uniform and therefore the map

f_{2}

is a standard symbolization map.

Moreover, in this case

I^{'} = S

is not a perfect symbolized set. To construct a perfect symbolized set

I^{'}

, one can proceed as follows. Take a location

s_{0} \in S

at random. Let

N_{s}

be the set of nearest neighbors to s. Now select the following element in

I^{'}

by taking

s_{1} \in S

such that

N_{s_{1}} \cap N_{s_{0}} = \emptyset

. Then construct recursively the set

I^{'}

by taking

s_{k} \in S \ {s_{0}, s_{1}, \dots, s_{k - 1}}

satisfying

N_{s_{i}} \cap N_{s_{j}} = \emptyset

for all

i \neq j

with

i, j = 1, 2, \dots, k

.

As it is evident, the method is flexible enough to allow the researcher to select his own set and map of symbols for a given null. For example, if under the previous symbolization procedure, the power (or size) of the test is not satisfactory, one can always consider other possible symbolization procedures for the same null and for the same spatial process

{X_{s}}_{s \in S}

. Let

Γ_{3} = {1, 2, \dots, k} \times {1, 2, \dots, k}

. Again, let

N_{s}

be the set of nearest neighbors to s and let

n_{s}

be its cardinality. Denote by

X_{s}^{N} = \frac{1}{n_{s}} \sum_{s^{'} \in N_{s}} X_{s^{'}}

. Denote by

q_{i}

and

q_{i}^{N}

the i-th quantile of the variables X and

X^{N}

respectively, for

i \in {1, 2, \dots, k - 1}

. We will denote by

q_{0} = min_{s \in S} X_{s}

(resp

q_{0}^{N} = min_{s \in S} X_{s}^{N}

) and

q_{k + 1} = max_{s \in S} X_{s}

(resp.

q_{k + 1}^{N} = max_{s \in S} X_{s}^{N}

). Then we define the symbolization map

f_{3} (X_{s}) = (i, j)

(10)

if and only if

X_{s} \in [q_{i - 1}, q_{i}]

and

X_{s}^{N} \in [q_{j - 1}^{N}, q_{j}^{N}]

.

Again, under the null of independence the distribution of the symbols is uniform and therefore the map

f_{3} : {X_{s}}_{s \in I^{'}} \to {1, 2, \dots, k} \times {1, 2, \dots, k}

is a standard symbolization map.

Again, the same set of recursively constructed symbolized indexes

S^{'}

ensures the independence of the indicator variables

Z_{η s}

. Accordingly, using this symbolization map, the next corollary straightforwardly follows from Theorem 2:

Corollary 2.

Let

f_{i} : {X_{s}}_{s \in S^{'}} \to Γ_{i}

,

i = 2, 3

be the symbolization maps defined in (8) and (10) with

| S^{'} | = R

. Denote by

h (Γ_{i})

the symbolic entropy defined in (1). If the spatial process

{X_{s}}_{s \in S}

is independent, it follows that:

If the set $S^{'}$ is perfect then:
1.
The affine transformation of the symbolic entropy

$S G (Γ_{i}) = 2 R [L n (n_{i}) - h (Γ_{i})]$

is asymptotically $χ_{d_{i}}^{2}$ distributed where $d_{2} = 2^{m - 1}$ , $d_{3} = {(k - 1)}^{2} + 2$ , $n_{2} = 2^{m - 1}$ and $n_{3} = k^{2}$ .
2.
Then $(\frac{Y_{σ_{1}} - R p_{σ_{1}}}{\sqrt{R p_{σ_{1}} (1 - p_{σ_{1}})}}, \frac{Y_{σ_{2}} - R p_{σ_{2}}}{\sqrt{R p_{σ_{2}} (1 - p_{σ_{2}})}}, \dots, \frac{Y_{σ_{n i}} - R p_{σ_{n i}}}{\sqrt{R p_{σ_{n i}} (1 - p_{n i})}})$ is a multivariate normal distribution MN $(0, I)$ .
If the set $S^{'}$ is not perfect, then:
1.
Since the sets $B_{i}^{σ}$ have small cardinality, we can get a good approximation to the following result of [17]. The affine transformation of the symbolic entropy

$S G (Γ_{i}) = 2 R [L n (n_{i}) - h (Γ_{i})]$

is asymptotically $χ_{d_{i}}^{2}$ distributed where $d_{2} = 2^{m - 1}$ , $d_{3} = {(k - 1)}^{2} + 2$ , $n_{2} = 2^{m - 1}$ and $n_{3} = k^{2}$ .
2.
Then $(\frac{Y_{σ_{1}} - R p_{σ_{1}}}{\sqrt{V a r (Y_{σ_{1}})}}, \frac{Y_{σ_{2}} - R p_{σ_{2}}}{\sqrt{V a r (Y_{σ_{2}})}}, \dots, \frac{Y_{σ_{n i}} - R p_{σ_{n i}}}{\sqrt{V a r (Y_{σ_{n i}})}})$ is a multivariate normal distribution MN $(0, Σ)$ where the variance and covariance matrix can be estimated using (4) and (5).

In Section 2 we indicate that there is a class of symbolization maps that are non-standard. Consider a situation in which a reduction in the number of possible symbols under study will benefit the behavior and properties of the test. In this, and other potential situations, non-standard maps might be useful. As an example, we now construct a non-standard symbolization map to test for independence in the spatial context. The following symbolization is an example of the most general procedure that we give in Appendix A.3.

Consider again the set

Γ_{2}

of symbols defined in (9) for a fixed embedding dimension m. Now we will denote by

\bar{a}

the rest of the division of a over

m - 1

.

Now define the following equivalence relation ∼:

(I_{s s_{1}}, I_{s s_{2}}, \dots, I_{s s_{m - 1}}) \sim (I_{s^{'} s_{1}^{'}}, I_{s^{'} s_{2}^{'}}, \dots, I_{s^{'} s_{m - 1}^{'}})

if and only if there exists an integer k such that

I_{s^{'} s_{i}^{'}} = I_{s s_{\bar{i + k}}}

for all

i \in {1, 2, \dots, m - 1}

.

Now we consider as a set of symbols

Γ_{4} = Γ_{2} / \sim

the set of classes in

Γ_{2}

modulo, the equivalence relation ∼.

Notice that, in general, in this case not all the symbols in

Γ_{4}

have the same probability of occurring, and therefore the symbolization map

f_{4} : {X_{s}}_{s \in S^{'}} \to \tilde{Γ_{4}}

is non-standard.

7.3. Spatiotemporal Independence Tests

The issues related to spatiotemporal data modeling occupy a prominent role in current econometrics, where we can find recent literature devoted to this topic (see [9,30]). Spatiotemporal dependence introduces considerable difficulties with respect to modeling, computation and statistical theory. If independence can be taken for granted, and likewise the common assumption of cross-sectional independence, then computations and the application of inference rules simplifies significantly. It seems reasonable therefore to test first for spatiotemporal independence, and if the evidence for independence is strong, then proceed with the well-known methods. Unfortunately, tests for spatiotemporal independence are scarce. The aim of this section is twofold: to contribute to this rather scarce literature, and to highlight the usefulness of the novel general method presented in this paper. To this end we consider the relevant null of spatiotemporal dependence. Of particular interest for our tests is that dependence is not taken as a synonymous with correlation, and therefore nonlinearities are not restrictions for our test.

Consider the process

{X_{t s}}_{t \in I, s \in S}

. As in the previous cases, one can define several standard and non-standard symbolization maps. For simplicity, we adapt the previous symbolizations to the spatiotemporal case as follows:

For a fixed location

s_{0} \in S^{'}

define

{X_{t (s_{0})}}

as the time series

{X_{1 s_{0}}, X_{2 s_{0}}, \dots, X_{p (s_{0})}, \dots}

. Similarly for a fixed period

t_{0} \in I^{'}

we define

{X_{(t_{0}) s}}

as the spatial process

{X_{t_{0} s_{1}}, X_{t_{0} s_{2}}, \dots, X_{t_{0} s_{p}}, \dots}

Let

m_{t}, m_{s} \in N

with

m_{t}, m_{s} \geq 2

be the time and space embedding dimensions respectively. Then under this setting we define the following decomposable symbolization maps

F_{1 i} : {X_{t s}}_{t \in I^{'}, s \in S^{'}} \to §_{m} \times Γ_{i}

for

i = 2, 3

and 4 defined by:

F_{1 i} (X_{t s}) = (f_{1} (X_{t (s)}), f_{i} (X_{(t) s}))

(11)

where

f_{1} : {X_{t (s)}} \to S_{m}

and

f_{i} : {X_{(t) s}} \to Γ_{i}

for

i = 2, 3, 4

are defined as above.

Notice that, when testing for spatiotemporal independence, when

i = 2, 3

the symbolization map

F_{1 i}

is standard, while for

i = 4

is non-standard.

It is also possible to define an extension of the symbolization map

f_{2}

in a spatiotemporal context. Indeed, consider the following map:

g : {X_{t s}}_{t \in I^{'}, s \in S^{'}} \to \prod_{i = 1}^{m_{t}} Γ_{2}

(12)

defined by

g (X_{t s}) = (f (X_{(t) s}), f (X_{(t + 1) s}), \dots, f (X_{(t + m_{t} - 1) s}))

where

f (X_{(t + i) s}) = (I_{t s, (t + i) s_{1}}, I_{t s, (t + i) s_{2}}, \dots, I_{t s, (t + i) s_{m_{t} - 1}}))

for all

i = 0, 1, \dots, m_{t} - 1

and the indicator function

I_{t s, (t + i) s_{j}}

is defined as in (7).

Accordingly, using this symbolization map, the next corollary straightforwardly follows from Theorem 2:

Corollary 3.

Let

F_{1 i} : {X_{t s}}_{t \in I^{'}, s \in S^{'}} \to S_{m_{t}} \times Γ_{i}

and

g : {X_{t s}}_{t \in I^{'}, s \in S^{'}} \to \prod_{i = 1}^{m_{t}} Γ_{2}

be the standard symbolization maps defined in (11) with

i = 2, 3

and in (12) respectively. Denote by

h (S_{m_{t}} \times Γ_{i})

and

h (\prod_{i = 1}^{m_{t}} Γ_{2})

the symbolic entropy defined in (1). If the spatiotemporal process

{X_{t s}}_{t \in I, s \in S}

is independent, then:

If indexes sets are perfect then:
1.
The affine transformations of the symbolic entropy

$\begin{matrix} S T G (S_{m_{t}} \times Γ_{i}) & = & 2 R T [L n (n_{i}) - h (S_{m_{t}} \times Γ_{i})] \end{matrix}$

(13)

$\begin{matrix} S T G (\prod_{i = 1}^{m_{t}} Γ_{2}) & = & 2 R T [L n (2^{m_{t} (m_{s} - 1)}) - h (\prod_{i = 1}^{m_{t}} Γ_{2})] \end{matrix}$

(14)

are asymptotically $χ_{q}^{2}$ distributed. In (13) for $i = 2$ $q = (m_{t} - 1)! 2^{m_{s} - 1}$ and $n_{2} = m_{t}! 2^{m_{s} - 1}$ , and for $i = 3$ $q = (m_{t}! - 1) ({(k - 1)}^{2} + 2)$ and $n_{3} = m_{t}! k^{2}$ . In (14) $q = 2^{m_{t} (m_{s} - 1)}$ .
2.
Then $(\frac{Y_{σ_{1}} - R p_{σ_{1}}}{\sqrt{R p_{σ_{1}} (1 - p_{σ_{1}})}}, \frac{Y_{σ_{2}} - R p_{σ_{2}}}{\sqrt{R p_{σ_{2}} (1 - p_{σ_{2}})}}, \dots, \frac{Y_{σ_{2^{m - 1}}} - R p_{σ n}}{\sqrt{R p_{σ_{n}} (1 - p_{σ_{n}})}})$ is a multivariate normal distribution MN $(0, I)$ where n is the cardinality of the set of symbols.
If indexes sets are not perfect then
1.
If the sets $B_{i}$ ’s have small cardinality for all symbol η, then the affine transformations of the symbolic entropy

$\begin{matrix} S T G (S_{m} \times Γ_{i}) & = & 2 R T [L n (n_{i}) - h (S_{m} \times Γ_{i})] \end{matrix}$

(15)

$\begin{matrix} S T G (\prod_{i = 1}^{m} Γ_{2}) & = & 2 R T [L n (2^{m_{t} (m - 1)}) - h (\prod_{i = 1}^{m_{t}} Γ_{2})] \end{matrix}$

(16)

are asymptotically $χ_{q}^{2}$ distributed. In (15) for $i = 2$ $q = (m_{t} - 1)! 2^{m_{s} - 1}$ and $n_{2} = m_{t}! 2^{m_{s} - 1}$ , and for $i = 3$ $q = (m_{t}! - 1) ({(k - 1)}^{2} + 2)$ and $n_{3} = m_{t}! k^{2}$ . In (16) $q = 2^{m_{t} (m_{s} - 1)}$ .
2.
Then $(\frac{Y_{η_{1}} - R p_{η_{1}}}{\sqrt{V a r (Y_{η_{1}})}}, \frac{Y_{η_{2}} - R p_{η_{2}}}{\sqrt{V a r (Y_{η_{2}})}}, \dots, \frac{Y_{η_{n}} - R p_{η_{n}}}{\sqrt{V a r (Y_{η_{n}})}})$ is a multivariate normal distribution MN $(0, Σ)$ where the variance and covariance matrix can be estimated using (4) and (5) and n is the cardinality of the set of symbols.

8. Empirical Behavior of the Tests for Spatiotemporal Independence

In this section we evaluate the empirical behavior of the STG test with different configurations for the subset

I^{'} .

The first aim of this section is to show the flexibility of Corollary 3 to cope with different scenarios. The second goal is to evaluate the empirical behavior of the new test. An the third intention of this simulation is to evaluate the incidence of the selection of

I^{'}

on the empirical size of the test and on the power.

To those ends we designed a Monte Carlo experiment as follows: Firstly we consider the problem of testing for independence on regular lattices of several orders—R = 64 (8 × 8) and T = 150; R = 100 (10 × 10)—for which we consider two possible temporal scenarios, depending on data availability, T = 200 and T = 800. We also simulated richer regular lattices of order R = 400 (20 × 20), although on this occasion we only considered T = 200. The symbolization map follows from (12) with

m_{s} = 4

and

m_{t} = 2

. The test under study was generated from Corollary 3 under Expression (13). Therefore, we used a perfect indexes subset. This subset was constructed recursively, as indicated in Section 7:

N_{s_{i} t_{j}} \cap N_{s_{r} t_{k}} = \emptyset

, where

N_{s_{i} t_{j}}

is the set conformed with

s_{i},

the three nearest neighbors of

s_{i}

in

t_{j}

and the four spatial locations in the next time period. The power of the test is evaluated with the following DGPs:

\begin{matrix} D G P 1 & : & y_{t} = {(I - γ W)}^{- 1} (α y_{t - 1} + λ + ε_{t}) \\ D G P 2 & : & y_{t} = \sqrt[3]{{(I - γ W)}^{- 1} (α y_{t - 1} + λ + ε_{t})} \end{matrix}

where

ε_{t} \sim N (0, 1)

, which was also used for evaluating the empirical size of the test. Parameters

α, γ

intensified temporal and spatial dependencies, respectively, and

λ

was fixed at five in all simulations. The weighting matrix W has been specified as a binary type using a contiguity criterion and rook-type movements.

Table 1 collects the empirical size and power of STG statistical test for 1000 repetitions. It is straightforward to observe that the size is controlled, and the test is powerful. For low intensity level of parameter (

α = γ = 0.1)

the test is absolutely powerful. We have to set

α = 1 / 40, γ = 1 / 25

(or below) to lose power. This occurred despite the DGP under consideration.

Table 1. Size and power of STG test under a perfect subset of indexes.

Regular spatiotemporal configurations are interesting because (1) time series posit a natural order for observations, (2) lattice data provide the simplest extension of time series and (3) some scientific methods are compatible with this spatiotemporal configuration. However, irregular patterns are of frequent occurrence with spatial data. In geographical settings, data are liable to be recorded across heterogeneously-sized administrative regions, while economic distances do not correspond to regular spacing. Therefore, it is also useful to adapt the STG symbolic test to irregular spatiotemporal settings. In terms of our general methodology (see Corollary 3) this problem in tractable by considering the symbolization map

F_{1 i}, i = 2

where we control the dependence among the indicators by controlling on average the cardinality of the sets

B_{i}

. Particularly, we will select the set of indexes

I^{'}

such that

|{\bar{B}}_{i}| \leq (m_{s} + m_{t} - 1) / 2;

i.e., the average of the cardinality of the sets

B_{i}

is less than half of the number of spationtemporal neighbors.

Therefore, to complete the experiment (in the case of nonperfect lattices) we evaluate the STG-version for irregular lattices where coordinates of each spatial location are drawn from a N(0,1). We have considered the three nearest neighbors for irregular lattices. Afterwards, the resulting matrix was row-standardised in the usual way.

Table 2 collects the size and power for models constructed from DGP1 and DGP2. The introduction of irregular lattices has led us to introduce non-perfect indexes, and accordingly the size of the test slightly increased, although the levels seem acceptable, particularly for generous sample data. Power is as interesting as for the case of perfect indexes, and therefore the same comments applies (similar results are obtained in the case of using the multivariate normal approximation).

Table 2. Size and power of STG test under a non-perfect subset of indexes.

Comparison with Other Spatiotemporal Test for Independence

We now face our test with an unfavorable scenario characterized by small amount of available data on irregular lattices, also in linear and nonlinear setups. To this end we consider pairs of the following sample sizes: (36 × 10), (64 × 10), (100 × 10), (100 × 30) and (200 × 10). According to our theoretical discussion, given data scarcity and irregular spatial configuration, we use the non-perfect subset of indexes. Additionally, we consider the symbolization map based on equivalence relations

{\bar{F}}_{1, 2} = (f_{1}, {\bar{f}}_{2})

as depicted in Appendix A.3 for nonstandard maps.

To complete the empirical study, we compare our test with another nonparametric spatiotemporal test [31] which is described in Appendix A.4 and we refer to it as STBP. Notice, however, that the STBP test requires one to correctly specify the weighting matrix, W; this is not a requirement for the symbolic test.

In terms of empirical size, both tests behave similarly well for linear processes (Table 3). On the contrary, for the nonlinear processes (Table 4), the size of the STBP test is poor, while the symbolic-based test performs as expected. In terms of empirical power, the STBP test outperforms the STG test, especially for low intensity levels of dependence in the case of the linear process. However, under a nonlinear spatiotemporal configuration, the STG clearly presents a better balance between size and power and outperforms the STBP in all cases.

Table 3. Size and power of STG and STBP tests under linearity.

Table 4. Size and power of STG and STBP tests under nonlinearity.

9. Conclusions

Central null hypotheses in experimental and non-experimental branches of science can be easily tested by means of symbolized information. This paper provides with the analytical tools to construct nonparametric hypothesis tests based on symbols. These tools are able to cope with different null hypotheses and with distinct scenarios in which some realistic limitations might be imposed to test designs.

A shared characteristic of all these symbolic test families is that few assumptions are needed to obtain asymptotic results. Therefore, general applicability of this method is guaranteed. In particular, in this paper we have shown that two well-known symbolic-based tests are particular cases of the main symbolic theorem (Theorem 2), which is stated in this paper for the first time. Furthermore, a set of new symbolic-based tests for spatiotemporal independence is put forward by using the main results of this paper collected under the main symbolic theorem (Theorem 2). Monte Carlo simulations provide evidence of the extraordinary power of the contrasted test. Currently, there are circumstances where robustness to speed, noise or computational cost are paramount, so fruitful applications of symbolic analysis are favored.

Further lines of research are worthy. We now indicate some of them on which we and other scholars are currently working: (i) One of the appealing properties of symbolic-based testing is that it requires few assumptions. In this paper we have assumed stationarity; however, it would be interesting to study whether it is possible to be less restrictive. (ii) In the context of time series analysis, most available techniques require the existence of second moments; however, by using certain symbolizations, it might be possible to waive this requirement. This will allow time series researchers to consider a wider variety of model classes. (iii) One of the main contributions of the paper is that it suggests that researchers can design a symbolization procedure (map) to test null hypotheses. It would be interesting to study what types of null hypotheses are more suitable to analysis using symbolic maps.

Author Contributions

Conceptualization, F.L., M.M.-G., J.M. and M.R.M.; Methodology, F.L., M.M.-G., J.M. and M.R.M.; Writing, original draft, F.L., M.M.-G., J.M. and M.R.M. All authors have read and agreed to the published version of the manuscript.

Funding

This was partially funded by Ministerio de Ciencia e Innovación under grant PID2019-107192GB-I00 and under grant PID2019-107800GB-I00/AEI/10.13039/501100011033. This study is part of the collaborative activities carried out under the program Groups of Excellence of the region of Murcia, the Fundacion Seneca, Science, and Technology Agency of the region of Murcia project 19884/GERM/15.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We are deeply saddened by the loss of our dear friend and coauthor of this paper: Jesús Mur. We have worked together for a long time, enjoying a wonderful friendship. Jesús transmitted to us his passion for spatial econometrics and economic analysis, and was always persistent in pushing the frontier of knowledge. We dedicate this paper to his memory.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Maximum Likelihood Estimators

Since

\sum_{i = 1}^{n} p_{η_{i}} = 1

it follows that

L (p_{η_{1}}, p_{η_{2}}, \dots, p_{η_{n}}) = \frac{R!}{n_{η_{1}}! n_{η_{2}}! \cdot \dots \cdot n_{η_{n}}!} p_{η_{1}}^{n_{η_{1}}} p_{η_{2}}^{n_{η_{2}}} \dots {(1 - p_{η_{1}} - p_{η_{2}} - \dots - p_{η_{n - 1}})}^{n_{η_{n}}} .

Then the logarithm of this likelihood function remains as

\begin{matrix} L n (L (p_{η_{1}}, p_{η_{2}}, \dots, p_{η_{n}})) & = & L n (\frac{R!}{n_{η_{1}}! n_{η_{2}}! \cdot \dots \cdot n_{η_{n}}!}) + \sum_{i = i}^{n - 1} n_{η_{i}} L n (p_{η_{i}}) \\ + n_{η_{n}} L n (1 - p_{η_{1}} - p_{η_{2}} - \dots - p_{η_{n - 1}}) . \end{matrix}

Maximum likelihood estimators

{\hat{p}}_{η_{i}} = \frac{n_{η_{i}}}{R}

are obtained by solving the following equation

\frac{\partial L n (L (p_{η_{1}}, p_{η_{2}}, \dots, p_{η_{n}}))}{\partial p_{η_{i}}} = 0 .

Appendix A.2. Chi-Squared Distributions for Symbolic Maps

Notice that

2 L n (λ (Y))

asymptotically follows a Chi-squared distribution with k degrees of freedom, which will depend on the set of symbols. Hence

- 2 L n (λ (Y)) = - 2 [R L n (R) + \sum_{i = 1}^{n} n_{η_{i}} L n (\frac{p_{η_{i}}}{n_{η_{i}}})] \sim χ_{k}^{2}

Now, if the symbolization map f is standard, that is, under the null H

_{0}

all the symbols have the same probability to occur,

p_{η_{i}} = \frac{1}{n}

for all

i = 1, 2, \dots, n

, then it follows that

\begin{matrix} - 2 L n (λ (Y)) & = & - 2 R [L n (R) + \sum_{i = 1}^{n} \frac{n_{η_{i}}}{R} L n (\frac{p_{η_{i}}}{n_{η_{i}}})] \\ = & - 2 R [L n (R) + \sum_{i = 1}^{n} \frac{n_{η_{i}}}{R} (L n (\frac{1}{n}) - L n (n_{η_{i}}))] \\ = & - 2 R [L n (R) + \sum_{i = 1}^{n} \frac{n_{η_{i}}}{R} (L n (\frac{1}{n}) - L n (\frac{n_{η_{i}}}{R}) - L n (R))] \end{matrix}

Now taking into account that

h (Γ) = - \sum_{i = 1}^{n} p_{η_{i}} L n (p_{η_{i}}) = - \sum_{i = 1}^{n} \frac{n_{η_{i}}}{R} L n (\frac{n_{η_{i}}}{R}),

we have that

- 2 L n (λ (Y)) = 2 R [L n (n) - h (Γ)] .

Appendix A.3. Non-Standard Symbolization Maps

This appendix gives a procedure to construct a non-standard symbolization maps.

Let

A = {A_{1}, A_{2}, \dots, A_{d}}

be a family of nonempty subsets of the set of symbols

Γ

. Assume that

A

is a partition of

Γ

, that is,

Γ = ⋃_{i = 1}^{d} A_{i}

and

A_{1} \cap A_{j} = \emptyset

for all

i \neq j

. Now we define the relation ∼ in

Γ

by

η \sim σ

if and only if

η, σ \in A_{i}

for some

i = 1, 2, \dots, d

. Obviously the relation ∼ is an equivalence relation and therefore we can consider the quotient set

\bar{Γ} = Γ / \sim

formed by all the classes of equivalence as a new set of symbols. Denote by

\bar{σ}

the class o equivalence of symbol

σ

. Therefore, there exists a natural projection

π : Γ \to \bar{Γ}

defined by

π (σ) = \bar{σ}

. Moreover, any symbolization map with set of symbols

Γ

, namely

f : {X_{i}}_{i \in I^{'}} \to Γ

, can be extended to a symbolization map with set of symbols

\bar{Γ}

by considering the following map

\bar{f} = π \circ f : {X_{i}}_{i \in I^{'}} \to \bar{Γ}

defined by

\bar{f} (X_{i}) = \bar{f (X_{i})} .

Notice that the cardinality of the set

\bar{Γ}

is d which is always smaller or equal than the cardinality of the former set

Γ

.

Appendix A.4. A Generalization to Spatiotemporal Data of the Brett and Pinkse Test

The test of [32] is a nonparametric test of spatial dependence based on that two variables are independent if the joint characteristic function factorizes into the product of the marginal characteristic functions. We adapt this test to its use for studying spatiotemporal data.

Let

{y_{t s}}_{t \in I, s \in S}

a spatiotemporal realization of a process. The

y_{t s}

’s can have continuous, discrete or mixed distributions, and the distribution functions are generally unknown. Under the null hypothesis, the spatiotemporal process is stationary and independent in space and time.

The design of the test is as follows. Let g be any practitioner-chosen density function with infinite support, and denote by

h (x) = \int e^{i u x} g (u) d u

the Fourier transform of g. Given a location s,

N_{s}

refers to the set of neighbors of coordinate s. Fix a positive integer m and define

N_{t s}^{m} = {t^{'} s^{'} | t^{'} = t, t + 1, \dots, t + m - 1; s^{'} \in N_{s}}

as the set of nearby observations to location s in period t and

n_{t s} = ♯ N_{t s}^{m}

as the number of observations. Let

y_{t s}^{N} = \frac{1}{n_{t s}} \sum_{r k \in N_{t s}^{m}} y_{r k}

stands for the sampling average of the proximate observations to

y_{t s}

. The

S T B P

test null hypothesis is

H₀.

y

_{t s}

and y

_{t s}^{N}

are independent for all t∈ I and s∈ S.

Define

h_{(t_{1} s_{1}, t_{2} s_{2})} = h (y_{t_{1} s_{1}} - y_{t_{2} s_{2}})

and

h_{(t_{1} s_{1}, t_{2} s_{2})}^{N N} = h (y_{t_{1} s_{1}}^{N} - y_{t_{2} s_{2}}^{N})

. Introduce

η_{n 1}, η_{n 2}

and

η_{n 3}

defined by

\begin{matrix} η_{n 1} = n^{- 2} \sum_{t s, r k} h_{(t s, r k)} h_{(t s, r k)}^{N N}; η_{n 2} = n^{- 3} \sum_{t s, r k, u v} h_{(t s, r k)} h_{(t s, u v)}^{N N}; \\ η_{n 3} = n^{- 4} \sum_{t s, r k, u v, p q} h_{(t s, r k)} h_{(u v, p q)}^{N N}, \end{matrix}

where

n = R T

is the number of observations. Let

η_{n} = {(η_{n 1} - η_{n 2})}^{2} + {(η_{n 2} - η_{n 3})}^{2}

and

ν_{n} = {(γ_{n} - μ_{n}^{2})}^{2} n^{- 1} \sum_{t s} n_{t s}^{- 1} (I (n_{t s} > 0) + \sum_{r k} n_{r k}^{- 1} I (r k \in N_{t s}) I (t s \in N_{r k}))

where

μ_{n} = n^{- 2} \sum_{t, s} h_{t s}

,

γ_{n} = n^{- 3} \sum_{s, t, u} h_{t s} h_{t u}

and

I (\cdot)

is an indicator function.

Under the null of the test, the extension of the Brett and Pinkse statistic for a spatiotemporal context is the following:

S T B P = \frac{n η_{n}}{2 ν_{n}}

which is asymptotically

χ_{1}^{2}

distributed.

The following two sufficient conditions are required by the

S T B P

test to be consistent: (1) spatiotemporal dependence of a fixed order, (2) the sequence has to be strongly mixing. Strong mixing is a weak dependence condition, while fixed ordered dependence is a restriction regarding these relationships must be produced between proximate observations. In this case, then the null hypothesis will be asymptotically rejected; however the behavior of the test will be undetermined, when the dependence involves observations that are not geographically or temporally proximate. Using spatial data, this means that an specification of the so-called spatial weighting matrix is needed and that this specification must be correct (Lopez et al, 2011, for more details).

References

Amigo, J.M. Permutation Complexity in Dynamical Systems, 1st ed.; Springer: Berlin, Germany, 2010. [Google Scholar]
Weaver, W.; Shannon, C.E. The Mathematical Theory of Communication; University of Illinois: Urbana, IL, USA, 1949. [Google Scholar]
Morse, M. Recurrent geodesics on a surface of negative curvature. Trans. Am. Math. Soc. 1921, 22, 84. [Google Scholar] [CrossRef]
Collet, P.; Eckmann, J.P. Iterated Maps on the Interval as Dynamical Systems; Brickhauser: Basel, Switzerland, 1980. [Google Scholar]
Ruiz, M.; Matilla-García, M.; García, J.A.; Susillo, J.L.; Romo, A.; Gonzalez, A.; Ruiz, A.; Gayan, J. An Entropy Test for Single-Locus Genetic Association Analysis. BMC Genet. 2010, 11, 1–15. [Google Scholar]
Daw, C.S.; Finney, C.E.A.; Tracy, E.R. A review of symbolic analysis of experimental data. Rev. Sci. Instrum. 2003, 74, 915–930. [Google Scholar] [CrossRef]
Hao, B.; Zheng, W. Applied Symbolic Dynamics and Chaos; World Scientific: Singapore, 1998. [Google Scholar]
Lunde, A.; Timmermann, A.G. Duration Dependence in Stock Prices:An Analysis of Bull and Bear Markets. J. Bus. Econ. Stat. 2004, 22, 253–273. [Google Scholar] [CrossRef]
Robinson, P. Large-Sample Inference on Spatial Dependence. Econom. J. 2009, 12, 68–82. [Google Scholar] [CrossRef]
López, F.; Matilla-García, M.; Mur, J.; Ruiz, M. Non-Parametric Spatial Independence Test Using Symbolic Entropy. Reg. Sci. Urban Econ. 2010, 40, 106–115. [Google Scholar] [CrossRef]
García-Córdoba, J.A.; Matilla-García, M.; Ruiz Marin, M. A test for deterministic dynamics in spatial processes. Spat. Econ. Anal. 2019, 14, 361–377. [Google Scholar] [CrossRef]
Conaway, M.R. Analysis of repeated categorical measurements with conditional likelihood methods. J. Am. Stat. 1989, 84, 53–62. [Google Scholar] [CrossRef]
Stephenson, D.B. Use of the “Odds Ratio” for Diagnosing Forecast Skill. Weather. Forecast. 2000, 15, 221–232. [Google Scholar] [CrossRef]
Bandt, C.; Shiha, F. Order patterns in time series. J. Time Ser. Anal. 2007, 28, 646–665. [Google Scholar] [CrossRef]
Matilla-García, M. A non-parametric test for independence based on symbolic dynamics. J. Econ. Dyn. Control. 2007, 31, 3889–3903. [Google Scholar] [CrossRef]
Matilla-García, M.; Ruiz, M. A non-parametric independence test using permutation entropy. J. Econom. 2008, 144, 139–155. [Google Scholar] [CrossRef]
Sinn, M.; Keller, K. Estimation of ordinal pattern probabilities in Gaussian processes with stationary increments. Comput. Stat. Data Anal. 2011, 55, 1781–1790. [Google Scholar] [CrossRef]
Schnurr, A. An Ordinal Pattern Approach to Detect and to Model Leverage Effects and Dependence Structures Between Financial Time Series. Stat. Pap. 2014, 55, 919–931. [Google Scholar] [CrossRef]
Schnurr, A.; Dehling, H. Testing for Structural Breaks via Ordinal Pattern Dependence. J. Am. Stat. Assoc. 2017. [Google Scholar] [CrossRef]
Caballero-Pintado, M.V.; Matilla-García, M.; Marín, M.R. Symbolic correlation integral. Econom. Rev. 2019, 38, 533–556. [Google Scholar] [CrossRef]
Caballero-Pintado, M.V.; Matilla-García, M.; Ruiz Marín, M. Symbolic recurrence plots to analyze dynamical systems. Chaos Interdiscip. J. Nonlinear Sci. 2018, 28, 063112. [Google Scholar] [CrossRef]
Hong, Y.; White, H. Asymptotic distribution theory for nonparametric entropy measures of serial dependence. Econometrica 2005, 73, 837–901. [Google Scholar] [CrossRef]
Faura, Ú; Lafuente, M.; Matilla-García, M.; Ruiz, M. Identifying the Most Relevant Lag with Runs. Entropy 2015, 17, 2706–2722. [Google Scholar] [CrossRef]
Cliff, A.D.; Ord, J.K. Spatial Processes: Models and Applications; Pion: London, UK, 1981. [Google Scholar]
Agresi, A. An Introduction to Categorical Data Analysis; John Wiley and Sons: Brisbane, Australia, 1996. [Google Scholar]
Bandt, C.; Keller, G.; Pompe, G. Entropy of interval maps via permutations. Nonlinearity 2002, 12, 1595–1602. [Google Scholar] [CrossRef]
Soon Spario, Y.T. Binomial Approximation for dependent indicators. Stat. Sin. 1996, 6, 703–714. [Google Scholar]
Ruiz, M.; López, F.; Páez, A. Testing for spatial association of qualitative data using symbolic dynamics. J. Geogr. 2009, 12, 281–308. [Google Scholar] [CrossRef]
Matilla-García, M.; Marín, M.R. A new test for chaos and determinism based on symbolic dynamics. J. Econ. Behav. Organ. 2010, 76, 600–614. [Google Scholar] [CrossRef]
Baltagi, B.; Kelejian, H.; Prucha, I. Anal. Spatiallydependent Data. J. Econom. 2007, 140, 1–4. [Google Scholar] [CrossRef]
López, F.A.; Matilla-García, M.; Mur, J.; Marín, M.R. Four tests of independence in spatiotemporal data. Pap. Reg. 2011. [Google Scholar] [CrossRef]
Brett, C.; Pinkse, J. Those Taxes are all over the Map A Test for Spatial Independence of Municipal Tax Rates in British Columbia. Int. Reg. Sci. Rev. 1997, 20, 131–151. [Google Scholar] [CrossRef]

Table 1. Size and power of STG test under a perfect subset of indexes.

			DGPs1			DGPs2
		Size	Power
			$α = 0.1$	$α = 2 / 80,$	$α = 1 / 80,$	$α = 0.1$	$α = 2 / 60,$	$α = 1 / 60,$
R	T		$γ = 0.1$	$γ = 2 / 50$	$γ = 1 / 50$	$γ = 0.1$	$γ = 2 / 30$	$γ = 1 / 30$
64	150	0.060	1.000	0.096	0.073	1.000	0.152	0.077
100	200	0.064	1.000	0.149	0.065	1.000	0.220	0.086
100	800	0.053	1.000	0.492	0.112	1.000	0.860	0.196
400	200	0.058	1.000	0.412	0.111	1.000	0.772	0.160

Table 2. Size and power of STG test under a non-perfect subset of indexes.

			DGPs1			DGPs2
		Size	Power
			$α = 0.1$	$α = 2 / 80,$	$α = 1 / 80,$	$α = 0.1$	$α = 2 / 60,$	$α = 1 / 60,$
R	T		$γ = 0.1$	$γ = 2 / 50$	$γ = 1 / 50$	$γ = 0.1$	$γ = 2 / 30$	$γ = 1 / 30$
64	150	0.077	1.000	0.094	0.070	1.000	0.082	0.082
100	200	0.070	1.000	0.211	0.098	1.000	0.286	0.096
100	800	0.077	1.000	0.794	0.201	1.000	0.410	0.274
400	200	0.060	1.000	0.758	0.196	1.000	0.467	0.278

Table 3. Size and power of STG and STBP tests under linearity.

		Size		Power
		STG	STBP	STG	STBP	STG	STBP	STG	STBP
R	T			( $α = 5 / 80, γ = 5 / 50)$		( $α = 10 / 80, γ = 10 / 50)$		( $α = 15 / 80, γ = 15 / 50)$
36	10	0.069	0.046	0.096	0.154	0.328	0.615	0.787	0.989
64	10	0.057	0.054	0.145	0.193	0.504	0.815	0.962	0.999
100	10	0.063	0.066	0.192	0.228	0.677	0.926	0.993	1.000
100	30	0.058	0.054	0.443	0.395	0.982	1.000	1.000	1.000
200	10	0.063	0.067	0.346	0.327	0.935	0.997	1.000	1.000

Table 4. Size and power of STG and STBP tests under nonlinearity.

		Size		Power
		STG	STBP	STG	STBP	STG	STBP	STG	STBP
R	T			( $α = 1 / 10, γ = 1 / 10)$		( $α = 2 / 10, γ = 2 / 10)$		( $α = 3 / 10, γ = 3 / 10)$
36	10	0.060	0.002	0.094	0.007	0.269	0.060	0.507	0.473
64	10	0.055	0.000	0.111	0.002	0.376	0.119	0.784	0.743
100	10	0.067	0.000	0.175	0.011	0.557	0.209	0.941	0.915
100	30	0.056	0.000	0.357	0.005	0.949	0.642	1.000	1.000
200	10	0.062	0.001	0.281	0.007	0.856	0.381	1.000	0.998

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Statistical Tests of Symbolic Dynamics^†

Abstract

1. Introduction

2. Notation and Definitions

3. On the Independence of Variables ${Z_{η i}}_{i \in I^{'}}$

4. Constructing Tests with Perfect Subsets of $I$

Procedure

5. Constructing Tests with Non Perfect Subsets of $I$

5.1. Binomial Approximation

5.2. Normal Approximation

6. The Symbolic Main Theorem

7. Different Symbolizations for Different Nulls Related with Independence

7.1. Serial Independence Tests

7.2. Spatial Independence Tests

7.3. Spatiotemporal Independence Tests

8. Empirical Behavior of the Tests for Spatiotemporal Independence

Comparison with Other Spatiotemporal Test for Independence

9. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. Maximum Likelihood Estimators

Appendix A.2. Chi-Squared Distributions for Symbolic Maps

Appendix A.3. Non-Standard Symbolization Maps

Appendix A.4. A Generalization to Spatiotemporal Data of the Brett and Pinkse Test

References

Article Metrics

Citations

Article Access Statistics

Statistical Tests of Symbolic Dynamics †

Abstract

1. Introduction

2. Notation and Definitions

3. On the Independence of Variables { Z η i } i ∈ I ′

4. Constructing Tests with Perfect Subsets of I

Procedure

5. Constructing Tests with Non Perfect Subsets of I

5.1. Binomial Approximation

5.2. Normal Approximation

6. The Symbolic Main Theorem

7. Different Symbolizations for Different Nulls Related with Independence

7.1. Serial Independence Tests

7.2. Spatial Independence Tests

7.3. Spatiotemporal Independence Tests

8. Empirical Behavior of the Tests for Spatiotemporal Independence

Comparison with Other Spatiotemporal Test for Independence

9. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. Maximum Likelihood Estimators

Appendix A.2. Chi-Squared Distributions for Symbolic Maps

Appendix A.3. Non-Standard Symbolization Maps

Appendix A.4. A Generalization to Spatiotemporal Data of the Brett and Pinkse Test

References

Article Metrics

Citations

Article Access Statistics

Statistical Tests of Symbolic Dynamics^†

3. On the Independence of Variables ${Z_{η i}}_{i \in I^{'}}$

4. Constructing Tests with Perfect Subsets of $I$

5. Constructing Tests with Non Perfect Subsets of $I$