1. Introduction
Reservoir computing (RC) is a computationally cheap method for training dynamical system models; it was initially proposed for recurrent neural networks (RNNs) [
1]. Generally, RNNs are trained using gradient descent, but this is computationally expensive because the signal flow must be tracked through the network over a period of time. Jaeger et al. [
2] and Maass et al. [
3] found that RNNs achieve approximation tasks by training only a static function, converting network states to the output. Their methods were unified as RC, a framework that trains dynamical models by training static functions [
4,
5].
An RC model is a dynamical system model designed to be trained through RC, and it consists of a dynamical system called the “reservoir” and a static function called the “readout.” As shown in
Figure 1, the reservoir processes the input to the model first, and the readout maps the reservoir state to the output. In supervised learning, the model aims to approximate the given target system. During RC model training, only the readout is tuned for each target, but the reservoir does not adapt to targets. Empirically, an RC model performs better using a reservoir with complex dynamics. For example, echo state networks [
2] use an RNN with random parameters as a reservoir.
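As an illustration, the echo-state-network idea can be sketched in a few lines: the recurrent weights are drawn at random once and never trained, and only the states they produce are later passed to a readout. The sketch below is ours, not taken from the cited works; the sizes and the spectral-radius scaling of 0.9 are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Fixed random reservoir (never trained): an RNN with random weights.
n_res, n_in = 100, 1
W_in = rng.uniform(-0.5, 0.5, (n_res, n_in))        # input weights
W = rng.uniform(-0.5, 0.5, (n_res, n_res))          # recurrent weights
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))     # scale spectral radius below 1

def run_reservoir(inputs):
    """Drive the reservoir with an input sequence and collect its states;
    a readout would later map these states to outputs."""
    x = np.zeros(n_res)
    states = []
    for u in inputs:
        x = np.tanh(W @ x + W_in @ np.atleast_1d(u))
        states.append(x.copy())
    return np.array(states)
```

Keeping the spectral radius below 1 is a common heuristic for obtaining fading memory; it is used here only to make the sketch well behaved.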
Another advantage of RC is that various dynamical systems can be used as the reservoir, even if they are difficult to train or adjust. Recently, physical RC, which uses a reservoir implemented as hardware, has been drawing attention in terms of energy efficiency and computation speed [
6,
7,
8]. Many studies have been conducted on various implementations of reservoirs, e.g., electric and electronic circuits [
9,
10,
11,
12,
13], networks [
14] and delay feedback systems [
15,
16] using optical elements, and spin torque oscillators [
17,
18,
19].
An RC model and its reservoir are said to be universal if the model can approximate an arbitrary target with arbitrary precision. The concept of universality concerning RC appeared simultaneously with RC itself [
3]. Maass et al. [
3] also proposed a sufficient condition for a continuous-time RC model to be universal. The sufficient condition in [
3] is the combination of the continuity and injectivity of the reservoir, which is a functional from input functions to output values. The injectivity of the reservoir is also a necessary condition for universality.
Sugiura et al. [
20] proposed the relaxed condition called the neighborhood separation property (NSP), which can be applied to more complex reservoirs with multiple equilibrium states. The authors of [
21] showed that a reservoir with a finite-dimensional output can satisfy the NSP but not the condition in [
3]. In [
21], another sufficient condition was proposed: the existence of a continuous inverse of the reservoir. This condition is also sufficient for the NSP and was used to show that the NSP can be satisfied. Relationships among universality and the reservoir conditions explained above are summarized in
Figure 2.
As mentioned above, several necessary conditions and sufficient conditions for universality are known, but a necessary and sufficient condition, i.e., one equivalent to universality, has remained unknown. Such a condition is critical because it gives an essential answer to the question of which properties of a reservoir enable approximation with an RC model. In this paper, we show that the NSP and the continuous inverse, which are sufficient conditions for a reservoir to be universal, are in fact equivalent to universality itself. Moreover, using the obtained equivalence, we show that a universal reservoir has a “pathological” property. As in previous studies, we consider a continuous-time RC model with a polynomial readout and evaluate the approximation using the maximum error. A dynamical system, e.g., a model, reservoir, or target, is treated as a functional from input functions to output values.
Our result concerning the conditions equivalent to universality can be extended to a general case where the input function space is compactifiable. Hence, our result may be applied to various types of RC not discussed in this paper, such as discrete-time ones.
The pathological property of a universal reservoir is that it has dense discontinuous points. As we show later, a universal reservoir has a continuous inverse map. Hence, if there is a continuous and universal reservoir, it is a homeomorphism. However, the infinite-dimensional space of input functions and the finite-dimensional space of output values cannot be homeomorphic. The same holds if we restrict the reservoir domain to an arbitrary open subset, i.e., a universal reservoir has a discontinuous point in any open subset. This result suggests that a universal reservoir is highly sensitive to inputs, and chaotic reservoirs, such as those described in [
19,
22,
23,
24], are necessary to achieve universality. These facts support the empirical rule that a complex reservoir tends to be effective and provide significant insight into the development of high-performance reservoirs.
Although considering noise in inputs and observations is important in practice, we focus on the deterministic and noiseless case for the following reasons. First, theoretical research on continuous-time RC remains limited even in the deterministic setting. Second, the definition of universality in the stochastic case is not straightforward and has not yet been established.
The remainder of this paper is structured as follows:
Section 2 provides preliminaries and describes RC and previous results. In
Section 3, we prove that the NSP and the continuous inverse of a reservoir are equivalent to universality. In
Section 4, we prove that a universal reservoir has dense discontinuous points. The main symbols used in this paper are summarized in
Table 1.
2. Preliminaries
We discuss a dynamical system represented as a functional on functions of time. Let
be a compact and convex set of input values and
be the limit of the speed of input change. We define the set
V of input functions as follows:
where
is the Euclidean norm, and
is defined as
One must choose
A wide enough and
K large enough to cover the input functions that one considers. In reality, such
A and
K may be unknown or may not exist, but we do not consider these cases here.
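As a concrete illustration, membership in the input set can be checked numerically for a sampled signal. The sketch below is ours: it assumes, for simplicity, that the input value set is a one-dimensional interval [A_min, A_max] and checks the speed limit K on a discrete grid.

```python
import numpy as np

def in_V(samples, t_grid, A_min, A_max, K, tol=1e-9):
    """Check whether a sampled input is admissible: its values stay in the
    compact interval [A_min, A_max] (a 1-D input value set, for simplicity)
    and its rate of change between grid points never exceeds K."""
    samples, t_grid = np.asarray(samples), np.asarray(t_grid)
    in_range = np.all((samples >= A_min - tol) & (samples <= A_max + tol))
    rates = np.abs(np.diff(samples) / np.diff(t_grid))
    return bool(in_range and np.all(rates <= K + tol))
```

The discrete rate check only approximates the true speed limit; a signal can satisfy it on a grid while violating it between samples.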
Because input functions are given on a finite time interval in practice, we define another set of input functions by restricting the domain of functions in
V. For a function
v and
, we write the restriction of
v to
as
. We define the input functions on a finite time interval as
Note that
is not defined for a specific
and contains input functions on time intervals of various lengths. The dynamical system that we discuss is a functional from
to
. In the real world, such a functional is a machine or a device that processes an input
v for a period
in real time and outputs its state at time 0.
For example, we define a functional using the following state-space system:
where
and
are the system state and input at time
. The initial state is
, and the derivative of the state is given by the function
of the state and input. In considering the fixed
, System (
3) determines
for
. Hence, we can define the functional
as
Note that time is shifted so that the input signal starts at time
and ends at time 0. The functional output
is the state to which the system transitions from the initial state
with the input
of the length
t. State-space systems can represent many physical systems and, as in this example, can themselves be represented as functionals. Therefore, our discussion of functionals covers a fairly wide range of dynamical systems.
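The functional defined by a state-space system can be evaluated numerically, e.g., with a forward Euler scheme: start from the fixed initial state at time −t, drive the system with the input, and return the state at time 0. The sketch below is illustrative; functional_output and its arguments are our names, and the step count is arbitrary.

```python
import numpy as np

def functional_output(f, x0, v, t, steps=1000):
    """Approximate the functional defined by the state-space system
    x'(s) = f(x(s), v(s)) on [-t, 0]: start from the fixed initial
    state x0 at time -t, apply the input v, and return the state at 0."""
    x = np.asarray(x0, dtype=float)
    h = t / steps
    for k in range(steps):
        s = -t + k * h                 # current time in [-t, 0]
        x = x + h * f(x, v(s))         # forward Euler step
    return x

# Example: a leaky integrator x' = -x + u driven by a sinusoidal input.
out = functional_output(lambda x, u: -x + u, [0.0], np.sin, 5.0)
```

For a constant input u ≡ 1 and a long interval, the leaky integrator's output approaches 1, matching the exact solution 1 − e^{−t}.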
In supervised learning, reservoir computing (RC) trains a model to approximate a functional we call the “target.” Let and be the target and the model, respectively. We consider the uniform approximation, which is evaluated based on the worst error, . The RC model is represented as the composition of two maps as . The map is the dynamical part of the model called the “reservoir,” and the map is the static part called the “readout.” To approximate the target, RC trains only the readout and fixes the reservoir because training the dynamical part of the model is technically difficult and computationally expensive. Because the reservoir is fixed, we can implement it not only as software on a general-purpose computer but also as hardware. In the field of physical reservoir computing, many physical phenomena have been studied as reservoirs and are promising in terms of computation speed and energy consumption.
Let be a set of targets, which are functionals from to . An RC model and its reservoir are said to be universal in if the model can approximate an arbitrary . Let be the set of polynomial functions from to . If we use a polynomial readout, training the model involves selecting a polynomial function from , and universality in is defined as follows.
Definition 1. Reservoir is said to be universal for uniform approximations in if
Polynomial readouts are not practical because of their limited generalization ability. However, they are theoretically tractable, and the discussion on them can be extended to other types of readouts that can approximate continuous functions. Hence, it suffices to discuss the theoretical aspects by assuming polynomial readouts, even if we use other types of continuous functions, such as feed-forward neural networks. The definition of universality using a polynomial readout and uniform approximation, like Definition 1, has been discussed since the earliest studies on reservoir computing [
3].
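For instance, with a polynomial readout, training reduces to an ordinary least-squares fit over monomial features of the collected reservoir states; the reservoir itself is left untouched. The sketch below is ours; poly_features and fit_readout are illustrative names.

```python
import numpy as np
from itertools import combinations_with_replacement

def poly_features(X, degree):
    """Monomial features of the reservoir states up to the given degree."""
    cols = [np.ones(len(X))]                      # constant term
    for d in range(1, degree + 1):
        for idx in combinations_with_replacement(range(X.shape[1]), d):
            cols.append(np.prod(X[:, idx], axis=1))
    return np.column_stack(cols)

def fit_readout(states, targets, degree=2):
    """Least-squares fit of a polynomial readout; only the readout is
    trained, as in RC -- the reservoir producing the states is fixed."""
    Phi = poly_features(states, degree)
    coef, *_ = np.linalg.lstsq(Phi, targets, rcond=None)
    return lambda X: poly_features(X, degree) @ coef
```

A target that is itself a polynomial of the states (e.g., a product of two coordinates) is fitted exactly, up to numerical precision, once the degree is large enough.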
In previous studies, two sufficient conditions for universality were provided. One condition [
20] is the weakest known to date and has been shown to be satisfiable using the other [
21]. To explain these conditions, we need a metric on
. Let
be a non-increasing function that satisfies
. Using the supremum on the domain, we define the weighted norm
of the function
v as
where
. Let
for any
and define the map
as
For an input function, the map
returns the length of the interval on which that function is defined.
Let
be a strictly increasing, bounded, and continuous function. Using the weighted norm
and function
, we define the map
as
where
,
,
, and
. The map
d is a metric on
under the conditions we describe later. The first term of the distance (
8) compares the inputs on the intersection of their domains. The function
w assigns greater weight to the difference in the newer part of the inputs. The second term of (
8) compares the length of the two inputs via the function
. From the definition of
, the second term is negligible if
and
are sufficiently large. The following proposition [
20] provides the condition that makes
d a metric:
Proposition 1. Suppose that the following triangle inequality holds for any , , , and : Then, is a compact metric space, and is dense in . The triangle inequality (
9) guarantees that
d satisfies other triangle inequalities. The density of
is confirmed as follows: for any
,
converges to
as
. From the density of
, we have
, where the symbol
is the closure. An example of a pair,
, that makes
d a metric is shown by (38) in [
20].
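For intuition, the metric can be evaluated numerically for sampled inputs. The sketch below uses assumed, illustrative choices of the weight w and the function σ (the specific pair given by (38) in [20] is not reproduced here) and approximates the supremum on a grid.

```python
import numpy as np

# Assumed, illustrative choices; the paper's specific (w, sigma) pair
# is given by (38) in [20] and is not reproduced here.
w = lambda age: np.exp(-age)     # non-increasing weight; age = -t >= 0
sigma = np.tanh                  # strictly increasing, bounded, continuous

def metric_d(u, tau_u, v, tau_v, n=2001):
    """Distance between inputs u on [-tau_u, 0] and v on [-tau_v, 0]:
    a weighted sup-norm term on the intersection of their domains plus
    a term comparing their lengths through sigma."""
    t_min = -min(tau_u, tau_v)                   # intersection of domains
    t = np.linspace(t_min, 0.0, n)
    sup_term = np.max(w(-t) * np.abs(u(t) - v(t)))
    return sup_term + abs(sigma(tau_u) - sigma(tau_v))
```

The weight w gives the newest part of the inputs (t near 0) the largest influence, and because sigma is bounded, the length term becomes negligible for two sufficiently long inputs, as described above.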
In the rest of this paper, we assume that targets in
are uniformly continuous following the previous study [
3]. This assumption enables us to extend a functional
onto
, i.e., there is a continuous functional
such that
for any
. Such continuity on a compact set is needed for theoretical discussion on approximation.
The weakest known sufficient condition for universality is called the neighborhood separation property (NSP) [
20]. The NSP means that the reservoir maps the neighborhoods of distinct points to images that are separated from each other. For
and
, we define the set
as
Although the set
is similar to a general neighborhood, note that
holds even if
. The mathematical definition of the NSP of the reservoir
is the following.
Condition 1. For any distinct , , some satisfies
Another sufficient condition for universality is that the reservoir
has a uniformly continuous inverse, as shown below [
21].
Condition 2. There is a uniformly continuous map, , that satisfies .
Condition 2 means that the input function can be continuously reconstructed from the reservoir output and that there is a continuous map from the reservoir outputs to the target outputs through input functions. Condition 2 is sufficient for Condition 1 and was used to show that a universal reservoir exists. The Hahn–Mazurkiewicz theorem [
25] claims that there is a continuous surjection
, called a space-filling curve. Hence, we can obtain the reservoir
, satisfying Condition 2, by restricting the domain of
g for its image to be
and taking a right inverse of the restriction. This result can be easily extended to
with a general
.
3. Necessary and Sufficient Condition for Universality
In this section, we prove the following theorem:
Theorem 1. Let be the set of uniformly continuous functionals from to . Let be a bounded functional. Then, Equation (5), Condition 1, and Condition 2 are equivalent to one another. Theorem 1 means that a reservoir’s universality, NSP, and uniformly continuous inverse are equivalent. This is the first result that provides the necessary and sufficient conditions of a reservoir to be universal. Theorem 1 is proved as a corollary of the following generalization:
Theorem 2. Let X be a metric space with the metric d and be a compactification of X. Suppose that and that is bounded. Then, the following three conditions are equivalent to one another:
- (i)
For any uniformly continuous and , there is some polynomial function, , that satisfies - (ii)
For any distinct and , there is some that satisfieswhere for . - (iii)
There is a uniformly continuous map that satisfieswhere is the identity mapping on X.
From Proposition 1, Theorem 1 is a corollary of Theorem 2 where . Conditions (i)–(iii) correspond to the universality of Definition 1 and Conditions 1 and 2, respectively. Set X does not have to be a strict subset of , i.e., is allowed. Hence, instead of , we can consider the compact set V or as X. In this case, we obtain the same result as that of Theorem 1 for input functions on an infinite time interval. We prove Theorem 2 by proving the following propositions: (iii)⇒(i), ¬(iii)⇒¬(i), and ¬(iii)⇒¬(ii). To prove propositions premised on ¬(iii), we use the following lemma:
Lemma 1. Suppose that Condition (iii) does not hold on the same premise as Theorem 2. Then, there exist the Cauchy sequences and that satisfy Because of
and
, Equation (
14) means that even if two inputs to
g are close, the outputs are not necessarily close. More precisely,
g is not uniformly continuous if (
14) holds. Lemma 1 claims the converse of this proposition.
Proof of Lemma 1. First, we consider the case where no map
satisfies
, i.e.,
does not have a left inverse. Having no left inverse is equivalent to not being injective. Hence, there exist distinct
and
that satisfy
, and the sequences
and
, which do not change with
k, satisfy (
14).
Next, we consider the case where a map
satisfying
exists but is not uniformly continuous, i.e.,
Then, there exist
and the sequences
,
that satisfy
A bounded
means that
is bounded and closed, i.e., compact. An infinite sequence on a compact metric space includes a subsequence that converges on that space. Hence, there exist an infinite set
and
such that the subsequence
converges to
. From the first inequality in (
16), we have
Because
and
are included in
, there exist the sequences
and
that satisfy the following:
We show that
and
include subsequences that satisfy (
14). From (
17) and (
18), the subsequences
and
satisfy the second equation in (
14). Because
is compact, there is some infinite set,
, such that the subsequence
converges. Similarly, there is also some infinite set,
, such that the subsequence
converges. Using
g in (
18) and substituting
, the following holds:
From (
19) and the second inequality in (
16), we have
for any
. Therefore, the Cauchy sequences
and
satisfy the first equation in (
14), which proves Lemma 1. □
Theorem 2 is proved as follows:
Proof of Theorem 2. As explained above, we prove Propositions (iii)⇒(i), ¬(iii)⇒¬(i), ¬(iii)⇒¬(ii), and ¬(ii)⇒¬(iii).
Proof of (iii)⇒(i): Let
and
be an arbitrary uniformly continuous functional and an arbitrary real number. We define a function
as
. Because
, (
11) is equivalent to
The function
q is uniformly continuous because
and
g are uniformly continuous. Hence, there is a unique continuous extension,
, satisfying
for any
. From the Stone–Weierstrass theorem [
26], there is some polynomial function
that satisfies
for any
on the compact set
. The polynomial function
p also satisfies (
20), which proves (iii)⇒(i). The relationship among maps in the proof is shown in
Figure 3.
Proof of ¬(iii)⇒¬(i): From Lemma 1, there exist the Cauchy sequences
and
that satisfy (
14). We define distinct
and
as
Let
be a uniformly continuous function that satisfies
A specific definition of
is not necessary for the proof, but it can be defined as follows:
To prove by contradiction, we suppose that (i) holds and let
. Then, there is some polynomial function
that satisfies
From (
14) and the continuity of
p, the following holds:
However, from (
22) and the limit of (
24) when
, the following also holds:
This contradicts (
25), and we have ¬(i). The relationship among sets and sequences in the proof is shown in
Figure 4.
Proof of ¬(iii)⇒¬(ii): From Lemma 1, there exist the Cauchy sequences
and
that satisfy (
14). We define distinct
and
and
as
Let
be an arbitrary real number. From the first and second equations in (
27), there is a sufficiently large
that satisfies
and
. Because both
and
converge to
as
, we have
, which proves ¬(iii)⇒¬(ii). The relationship among sets and sequences in the proof is shown in
Figure 5.
Proof of ¬(ii)⇒¬(iii): Suppose that distinct
and
satisfy
for any
. Let
. We consider the sequences
and
that converge to
. To prove by contradiction, suppose that a uniformly continuous map
exists and satisfies
. Then, we have
and
. Therefore, although
and
converge to the same point
as
, we have
for any
. This means that
g is not uniformly continuous, and we have ¬(iii). The relationship among sets and sequences in the proof is shown in
Figure 6.
From the above four propositions, we have (iii)⇔(i) and (iii)⇔(ii), which proves Theorem 2. □
4. Pathological Property of Universal Reservoir
Although a previous study [
21] showed that a universal reservoir exists mathematically, whether we can physically construct one is still unknown. The authors of [
21] suggested that the proposed universal reservoir using a space-filling curve has an infinite number of discontinuous points and is difficult to implement. In this section, we show that all universal reservoirs have the same problem as the example in [
21]. The main result of this section is the following:
Theorem 3. Let be the set of uniformly continuous functionals from to . Let be a functional that is bounded and universal for uniform approximations in . Then, the set of points at which is discontinuous is dense in .
Theorem 3 says that a universal reservoir has a discontinuous point in every neighborhood of every point and is thus highly sensitive to input changes. Note that the discontinuity of
does not mean the discontinuity of the time derivative
of states in the state-space representation (
4) of
f. The same holds for
, which has
m outputs. Theorem 3 is proved using the contradiction that if
is continuous on some open set, two spaces with different dimensions are homeomorphic. To this end, we use the following lemma about a topological embedding:
Lemma 2. For any , , and any , there is some topological embedding .
Proof of Lemma 2. Let
be an arbitrary function. Let
and
l be arbitrary real and natural numbers, respectively. Let
, i.e., the domain of
v is
. Because the bounded function
in (
8) is continuous and monotonically increasing, there is some
satisfying
. Let an input value
satisfy
, where
A is the input value space and
K is the maximum Lipschitz constant in (
1). Using
T and
a, we define the map
as
where
is defined for
as
As shown in
Figure 7,
is an extension of
v to
with the values on the added domain defined as some internal division point between
and
a. The division rate
is piece-wise linear and takes the value of
at
for
, which are the borders of the linear pieces.
First, we show that holds for any . Because the convex set A includes a and , the image of is included in A. Because the Lipschitz constant of is at most , and holds, the Lipschitz constant of is less than or equal to K. Hence, we have . The functions v and have the same values on the intersection of their domain. Hence, from the definition of T, we have , i.e., .
Next, we show that
h is an embedding. A continuous bijection from a compact space to a metric space is a homeomorphism, and the domain
of
h is compact. Hence, if
h is a continuous injection, it is homeomorphic to its image, i.e., an embedding. As we explained, we have
for
. Hence, different
gives different
and
, i.e., the map
h is injective. Let
and
for
,
. Then, the distance between
and
is written as
This means that the map
h is continuous. Therefore, the map
is an embedding, which proves Lemma 2. □
Using Lemma 2, we prove Theorem 3 as follows:
Proof of Theorem 3. Let bounded
be universal for uniform approximations in
. To prove by contradiction, suppose that discontinuous points of
are not dense in
, i.e., there exist
and
such that
is continuous on
. From Theorem 1, universality is equivalent to Condition 2, and
has a continuous inverse map. Hence, the restriction
of
to
is a topological embedding because it and its inverse are continuous. From Lemma 2, there is a topological embedding
. The composition
is also an embedding, i.e.,
and
are homeomorphic. The relationship among the domains and images of
h and
is shown in
Figure 8.
We call the small inductive dimension simply a dimension and write the dimension of the space
A as
. Dimensions have the following two properties (see pages 3–4 of [
27]): First, two homeomorphic topological spaces have the same dimension. Second, a topological space has a dimension equal to its subspace or larger. Therefore, we have the contradiction of
which proves Theorem 3. □