1. Introduction
The zero-error capacity of discrete memoryless channels (DMCs) was introduced by Shannon in 1956 [1]. Since then, numerous works have examined this capacity across various channel classes. From the outset, determining $C_0$ for DMCs has been recognized as highly challenging. In response, Shannon posed a key question: can the zero-error capacity of a DMC be expressed via the error capacities of other, suitably chosen channels? A major breakthrough came from Ahlswede, who proved that the $C_0$ value for a DMC equals the maximum-error capacity of a related 0–1 arbitrarily varying channel (AVC) [2].
This paper investigates the algorithmic computability of the zero-error capacity of DMCs and explores the broader computational implications of the Shannon and Ahlswede characterizations. We adopt Turing machine theory as our model of computability, which accurately reflects the capabilities of real-world digital computers.
Shannon’s original theory also provided a graph-theoretic interpretation: each channel corresponds to a simple graph whose Shannon capacity coincides with the channel’s zero-error capacity. In practice, however, channel descriptions are usually given directly by a transition mapping $W : \mathcal{X} \to \mathcal{P}(\mathcal{Y})$, where $\mathcal{X}$ and $\mathcal{Y}$ are finite alphabets, and $\mathcal{P}(\mathcal{Y})$ denotes the set of probability distributions over $\mathcal{Y}$. A notable application of this formulation is remote state estimation and stabilization [3].
The zero-error capacity is also central in quantum channels and entanglement-assisted classical channels. Research has focused on superactivation effects [4,5], entanglement-assisted gains [6], and connections to noncommutative graph theory [7]. Further studies have explored nonlocal correlations [8], no-signaling assistance [9], and noiseless feedback [10]. Surveys of quantum channel capacities provide broader context [11,12], while recent advances in quantum graph theory offer fresh insights into zero-error communication [13].
In general, one seeks the numerical value of $C_0(W)$, which is typically irrational, and strives for reliable approximation algorithms that compute it to any specified precision.
Shannon’s use of graph theory involved defining the confusability graph $G(W)$ of a DMC and using its Shannon capacity [14,15,16,17,18,19]. Since then, information theory has vastly expanded to cover multi-user systems, feedback channels, and advanced coding theory. Significant progress has been made on the zero-error capacity of relay, multiple-access, broadcast, and interference channels [20] and in specific models like binary adder and duplication channels [21,22,23]. Further studies have addressed list decoding [24,25], variable-length coding [26], and adversarial multiple-access channels [27].
Recent work [28] has determined the Shannon capacity for two infinite subclasses of strongly regular graphs and analyzed novel graph-join types, strengthening earlier results.
Today, two main algorithmic strategies exist for approximating the zero-error capacity: Shannon’s graph-theoretic method and Ahlswede’s 0–1 AVC-based method. We show that both approaches are non-recursive: there is no Turing machine that, given W, produces the confusability graph $G(W)$, nor one that constructs the corresponding 0–1 AVC.
Moreover, the zero-error capacity plays a significant role in analyzing the $\lambda$-capacity of compound channels under the average decoding error, even when the compound set has only two elements [29].
This paper is structured as follows: Section 2 introduces computability concepts and the zero-error capacity of noisy channels and clarifies its links to the Shannon graph capacity and Ahlswede’s AVC framework; Section 3 presents our main results: the non-computability of the zero-error capacity and the unresolved computability status of the Shannon graph capacity and the maximum-error AVC capacity; Section 4 analyzes 0–1 AVCs under average error constraints, establishes the computability of their capacity, and shows that the Shannon capacity $\Theta$ is Borel–Turing-computable if and only if the corresponding 0–1 AVC capacity is; Section 6 summarizes our conclusions and discusses future directions. Some findings were previously presented at the IEEE Information Theory Workshop 2021 in Kanazawa [30], and related results from ISIT 2020 [31] are revisited in Section 5.
2. Basic Definitions and Results
We apply the theory of Turing machines [32] and recursive functions [33] to investigate the computability of the zero-error capacity. For brevity, we restrict ourselves to an informal description and refer to [34,35,36,37] for a detailed treatment.
Table 1 gives an overview of the main definitions and notations.
Turing machines provide a mathematical idealization of real-world computational machines. Any algorithm that can be executed by a real-world computer can, in theory, be simulated by a Turing machine, and vice versa. However, unlike real-world computers, Turing machines are not constrained by factors such as energy consumption, computation time, or memory size. Furthermore, all computation steps on a Turing machine are assumed to be executed without error.
Recursive functions form a special subset of the set of partial functions $f : \mathbb{N} \hookrightarrow \mathbb{N}$, where the symbol “↪” denotes a partial mapping. Turing machines and recursive functions are equivalent in the following sense: a function is computable by a Turing machine if and only if it is a recursive function.
Definition 1. A sequence $(r_n)_{n \in \mathbb{N}}$ of rational numbers is said to be computable if recursive functions $a, b, s : \mathbb{N} \to \mathbb{N}$ exist such that $b(n) \neq 0$ and
$$r_n = (-1)^{s(n)} \frac{a(n)}{b(n)}$$
holds true for all $n \in \mathbb{N}$. Likewise, a double sequence $(r_{n,m})_{n,m \in \mathbb{N}}$ of rational numbers is said to be computable if recursive functions $a, b, s : \mathbb{N}^2 \to \mathbb{N}$ exist such that $b(n,m) \neq 0$ and
$$r_{n,m} = (-1)^{s(n,m)} \frac{a(n,m)}{b(n,m)}$$
holds true for all $n, m \in \mathbb{N}$.

Definition 2. A sequence $(x_n)_{n \in \mathbb{N}}$ of real numbers is said to converge effectively towards a number x if a recursive function $e : \mathbb{N} \to \mathbb{N}$ exists such that $|x_n - x| \leq 2^{-N}$ holds true for all $n, N \in \mathbb{N}$ that satisfy $n \geq e(N)$.
Definition 3. A real number x is said to be computable if a computable sequence of rational numbers exists that converges effectively towards x.
We denote the set of computable real numbers as $\mathbb{R}_c$.
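To make Definitions 1–3 concrete, the following minimal Python sketch (ours, not from the paper) packages a computable real as a program that, for every precision parameter N, returns a rational approximation with error at most $2^{-N}$; $\sqrt{2}$ serves as the example.

```python
from fractions import Fraction
from typing import Callable

# A computable real x, represented as a program N -> r_N with
# |r_N - x| <= 2^(-N); this bundles the computable rational sequence
# and the effective modulus of convergence from Definitions 1-3.
ComputableReal = Callable[[int], Fraction]

def sqrt2(N: int) -> Fraction:
    """Rational 2^(-N)-approximation of sqrt(2) via Heron's method."""
    r = Fraction(3, 2)
    # The iteration converges quadratically, so N + 1 steps are more
    # than enough to push the error below 2^(-N).
    for _ in range(N + 1):
        r = (r + 2 / r) / 2
    return r
```

The Specker phenomenon discussed below shows that exactly this effective error bound can fail: a sequence may converge while no recursive modulus of convergence exists.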
Definition 4. A sequence $(x_n)_{n \in \mathbb{N}}$ of computable numbers is called computable if a computable double sequence $(r_{n,m})_{n,m \in \mathbb{N}}$ of rational numbers, as well as a recursive function $e : \mathbb{N}^2 \to \mathbb{N}$, exists such that $|r_{n,m} - x_n| \leq 2^{-M}$ holds true for all $n, m, M \in \mathbb{N}$ that satisfy $m \geq e(n, M)$.

Definition 5. A sequence of functions $(F_n)_{n \in \mathbb{N}}$ with $F_n : \mathbb{R}_c \to \mathbb{R}$ is computable if the mapping $(n, x) \mapsto F_n(x)$ is computable.

Definition 6. A computable sequence of computable functions $(F_N)_{N \in \mathbb{N}}$ is called computably convergent to F if a partial recursive function $e$ exists such that
$$|F(x) - F_N(x)| \leq 2^{-M}$$
holds true for all $M \in \mathbb{N}$, all $N \geq e(M)$, and all x in the common domain.

In the following, we consider Turing machines with only one output state. We interpret this output state as the stopping of the Turing machine. This means that for an input x, the Turing machine ends its calculation after an unknown but finite number of arithmetic steps, or it computes forever.
Definition 7. We call a set $M \subseteq \mathbb{N}$ semi-decidable if there is a Turing machine $TM_M$ that stops for the input n if and only if $n \in M$ applies.
In [38], Specker constructed a monotonically increasing computable sequence $(r_n)_{n \in \mathbb{N}}$ of rational numbers that is bounded by 1, but whose limit $x^*$, which naturally exists, is not a computable number. For all $N \in \mathbb{N}$, an $n_0 = n_0(N)$ exists such that for all $n \geq n_0$, $|r_n - x^*| \leq 2^{-N}$ always holds, but the function $N \mapsto n_0(N)$ is not partial recursive. This means there are computable monotonically increasing sequences of rational numbers, which each converge to a finite limit value, but for which the limit values are not computable numbers and therefore the convergence is not effective. Of course, the set of computable numbers is countable.
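The classical construction behind Specker’s result is easy to state in code. The following hedged Python sketch (our illustration; the concrete step-bounded simulator is left as a parameter) produces a computable, monotonically increasing, bounded sequence of rationals whose limit encodes the halting set and is therefore not computable.

```python
from fractions import Fraction
from typing import Callable

def specker_term(n: int, halts_within: Callable[[int, int], bool]) -> Fraction:
    """n-th element of a Specker-style sequence.

    halts_within(k, t) must be a step-bounded, hence computable,
    simulation: does the k-th Turing machine halt within t steps?
    Each r_n is rational and r_n <= r_(n+1) <= 1, but the limit
    (the sum of 2^(-(k+1)) over the halting set K) carries the
    characteristic function of K in its binary expansion, so it is
    not a computable real.
    """
    return sum(
        (Fraction(1, 2 ** (k + 1)) for k in range(n + 1) if halts_within(k, n)),
        Fraction(0),
    )
```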
We will later examine the zero-error capacity as a function of computable DMCs. To do this, we need to define computable functions generally.
Definition 8. A function $f : \mathbb{R}_c \to \mathbb{R}_c$ is called Banach–Mazur-computable if f maps any given computable sequence $(x_n)_{n \in \mathbb{N}}$ of computable numbers into a computable sequence $(f(x_n))_{n \in \mathbb{N}}$ of real numbers.
Definition 9. A function $f : \mathbb{R}_c \to \mathbb{R}_c$ is called Borel–Turing-computable if there is an algorithm that transforms each given representation of a computable real x into a corresponding representation for $f(x)$.
We note that Turing’s original definition of computability conforms to the definition of Borel–Turing computability above. Banach–Mazur computability (see Definition 8) is the weakest form of computability. For an overview of the logical relations between different notions of computability, we again refer to [39].
Now, we want to define the zero-error capacity. To this end, we need the definition of a discrete memoryless channel. In the theory of zero-error transmission, the receiver must be in a position to successfully decode all of the messages transmitted by the sender.
Let $\mathcal{X}$ be a finite alphabet. We denote the set of probability distributions on $\mathcal{X}$ as $\mathcal{P}(\mathcal{X})$. We define the set of computable probability distributions $\mathcal{P}_c(\mathcal{X})$ as the set of all probability distributions $P \in \mathcal{P}(\mathcal{X})$ such that $P(x) \in \mathbb{R}_c$ for all $x \in \mathcal{X}$. Furthermore, for finite alphabets $\mathcal{X}$ and $\mathcal{Y}$, let $\mathcal{CH}(\mathcal{X}, \mathcal{Y})$ be the set of all conditional probability distributions (or channels) $W : \mathcal{X} \to \mathcal{P}(\mathcal{Y})$. $\mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$ denotes the set of all computable conditional probability distributions, i.e., those with $W(\cdot|x) \in \mathcal{P}_c(\mathcal{Y})$ for every $x \in \mathcal{X}$.

Let $M \subseteq \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$. We call M semi-decidable (see Definition 7) if and only if there is a Turing machine $TM_M$ that either stops or computes forever, depending on whether $W \in M$ is true. That means $TM_M$ accepts exactly the elements of M and calculates forever for an input $W \notin M$.
Definition 10. A discrete memoryless channel (DMC) is a triple $(\mathcal{X}, \mathcal{Y}, W)$, where $\mathcal{X}$ is the finite input alphabet, $\mathcal{Y}$ is the finite output alphabet, and $W(y|x)$ with $x \in \mathcal{X}$, $y \in \mathcal{Y}$ is a stochastic matrix. The probability of a sequence $y^n = (y_1, \ldots, y_n) \in \mathcal{Y}^n$ being received if $x^n = (x_1, \ldots, x_n) \in \mathcal{X}^n$ was sent is defined by
$$W^n(y^n \mid x^n) = \prod_{i=1}^{n} W(y_i \mid x_i).$$

Definition 11. A block code with the rate R and the block length n consists of
- A message set $\mathcal{M} = \{1, \ldots, M\}$ with $M = \lceil 2^{nR} \rceil$;
- An encoding function $f : \mathcal{M} \to \mathcal{X}^n$;
- A decoding function $\phi : \mathcal{Y}^n \to \mathcal{M}$.

We call such a code an $(M, n)$-code.
Definition 12.
1. The individual message probability of error is defined by the conditional probability of error given that the message m is transmitted:
$$e(m) = W^n\left(\{y^n : \phi(y^n) \neq m\} \mid f(m)\right).$$
2. We define the maximal probability of error as $\lambda = \max_{m \in \mathcal{M}} e(m)$.
3. A rate R is said to be achievable if a sequence of $(2^{nR}, n)$-codes exists with a probability of error $\lambda_n \to 0$ as $n \to \infty$.
Two sequences $x^n$ and $\bar{x}^n$ of the length n of input variables are distinguishable by a receiver if the vectors $W^n(\cdot|x^n)$ and $W^n(\cdot|\bar{x}^n)$ are orthogonal. That means if $W^n(y^n|x^n) > 0$, then $W^n(y^n|\bar{x}^n) = 0$, and if $W^n(y^n|\bar{x}^n) > 0$, then $W^n(y^n|x^n) = 0$. We denote by $M(n) = M(n, W)$ the maximum cardinality of a set of mutually orthogonal vectors among $\{W^n(\cdot|x^n)\}$ with $x^n \in \mathcal{X}^n$.

There are different ways to define the capacity of a channel. The so-called pessimistic capacity is defined as $\liminf_{n \to \infty} \frac{\log M(n)}{n}$, and the optimistic capacity is defined as $\limsup_{n \to \infty} \frac{\log M(n)}{n}$. A discussion of these quantities can be found in [40]. We define the zero-error capacity of W as follows:
$$C_0(W) = \liminf_{n \to \infty} \frac{\log M(n, W)}{n}.$$
For the zero-error capacity, the pessimistic capacity and the optimistic capacity are equal.
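For channels whose entries are given exactly (say, as rationals), the quantity $M(n, W)$ can at least be brute-forced for tiny alphabets and blocklengths. The sketch below is ours and only illustrative; the negative results of Section 3 concern channels whose entries are arbitrary computable reals, for which the support test $W(y|x) > 0$ is not decidable.

```python
import itertools
import numpy as np

def distinguishable(W: np.ndarray, u: tuple, v: tuple) -> bool:
    """W^n(.|u) and W^n(.|v) are orthogonal iff in some coordinate i
    the single-letter supports of W(.|u_i) and W(.|v_i) are disjoint."""
    return any(not np.any((W[ui] > 0) & (W[vi] > 0)) for ui, vi in zip(u, v))

def M(n: int, W: np.ndarray) -> int:
    """Maximum size of a zero-error code of blocklength n (exhaustive)."""
    words = list(itertools.product(range(W.shape[0]), repeat=n))
    def best(cands: list) -> int:
        if not cands:
            return 0
        v, rest = cands[0], cands[1:]
        compatible = [u for u in rest if distinguishable(W, v, u)]
        return max(best(rest), 1 + best(compatible))  # exclude / include v
    return best(words)
```

For Shannon’s pentagon channel, where input x reaches outputs x and $x + 1 \bmod 5$ with probability $\frac{1}{2}$ each, this yields $M(1) = 2$ and $M(2) = 5$, hence $C_0 \geq \frac{1}{2} \log 5$; Lovász later showed this bound is tight.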
First, we want to introduce Ahlswede’s representation of the zero-error capacity. Therefore, we need to introduce the arbitrarily varying channel (AVC). This was introduced under a different name by Blackwell, Breiman, and Thomasian [41], and considerable progress has been made in the study of these channels since.
Definition 13. Let $\mathcal{X}$, $\mathcal{Y}$, and $\mathcal{S}$ be finite sets. A (discrete) arbitrarily varying channel (AVC) is determined by a family of channels
$$\{W(\cdot|\cdot, s) : s \in \mathcal{S}\}$$
with a common input alphabet $\mathcal{X}$ and output alphabet $\mathcal{Y}$. The index s is called the state, and the set $\mathcal{S}$ is called the state set. Now, an AVC is defined by a family of sequences of channels
$$W^n(y^n \mid x^n, s^n) = \prod_{i=1}^{n} W(y_i \mid x_i, s_i)$$
for all $s^n \in \mathcal{S}^n$.

Definition 14. An $(n, M)$ code is a system $\{(u_m, D_m) : m = 1, \ldots, M\}$ with $u_m \in \mathcal{X}^n$, $D_m \subseteq \mathcal{Y}^n$, and $D_m \cap D_{m'} = \emptyset$ for $m \neq m'$.
Definition 15.
1. The maximal probability of error of the code for an AVC is
$$\lambda = \max_{s^n \in \mathcal{S}^n} \max_{1 \leq m \leq M} W^n(D_m^c \mid u_m, s^n).$$
2. The average probability of error of the code for an AVC is
$$\bar{\lambda} = \max_{s^n \in \mathcal{S}^n} \frac{1}{M} \sum_{m=1}^{M} W^n(D_m^c \mid u_m, s^n).$$
Definition 16.
1. The capacity of an AVC with the maximal probability of error is the maximal number C such that for all $\epsilon, \lambda > 0$, an $(n, M)$ code of the AVC exists for all large n with a maximal probability of error lower than λ and $\frac{1}{n} \log M > C - \epsilon$;
2. The capacity of an AVC with an average probability of error is the maximal number C such that for all $\epsilon, \lambda > 0$, an $(n, M)$ code of the AVC exists for all large n with an average probability of error lower than λ and $\frac{1}{n} \log M > C - \epsilon$.
In the following, we denote by $\mathcal{W}_{0,1}$ the set of AVCs that satisfy $W(y|x, s) \in \{0, 1\}$ for all $x \in \mathcal{X}$, all $y \in \mathcal{Y}$, and all $s \in \mathcal{S}$.
Theorem 1 (Ahlswede [2]). Let $\mathcal{X}$ and $\mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$.
(i) For all DMCs $W \in \mathcal{CH}(\mathcal{X}, \mathcal{Y})$, a 0–1 AVC $\{W(\cdot|\cdot, s) : s \in \mathcal{S}\} \in \mathcal{W}_{0,1}$ exists such that for the zero-error capacity of W,
$$C_0(W) = C_{\max}\big(\{W(\cdot|\cdot, s) : s \in \mathcal{S}\}\big), \qquad (6)$$
where $C_{\max}$ denotes the capacity with respect to the maximal probability of error.
(ii) Conversely, for each $\{W(\cdot|\cdot, s) : s \in \mathcal{S}\} \in \mathcal{W}_{0,1}$, a DMC W exists such that (6) holds.
The construction is interesting. Therefore, we cite it from [42]:
(i) For a given W, we let $\{W(\cdot|\cdot, s) : s \in \mathcal{S}\}$ be the set of 0–1 stochastic matrices with the index (state) set $\mathcal{S}$ such that for all $x \in \mathcal{X}$, $y \in \mathcal{Y}$, and $s \in \mathcal{S}$, it holds that $W(y|x, s) = 1$ implies $W(y|x) > 0$. Then, for all n, $x^n \in \mathcal{X}^n$, and $y^n \in \mathcal{Y}^n$,
$$W^n(y^n \mid x^n) > 0 \text{ if and only if an } s^n \in \mathcal{S}^n \text{ exists such that } W^n(y^n \mid x^n, s^n) = 1. \qquad (7)$$
Notice that for all $\lambda < 1$, a code for $\{W(\cdot|\cdot, s) : s \in \mathcal{S}\}$ with the maximal probability of error λ is a zero-error code for W. Thus, it follows from (7) that a code is a zero-error code for W if and only if it is a code for $\{W(\cdot|\cdot, s) : s \in \mathcal{S}\}$ with the maximal probability of error $\lambda < 1$.
(ii) For a given 0–1 type AVC $\{W(\cdot|\cdot, s) : s \in \mathcal{S}\}$ (with the state set $\mathcal{S}$) and any probability distribution q on $\mathcal{S}$ with $q(s) > 0$ for all s, let $W(y|x) = \sum_{s \in \mathcal{S}} q(s) W(y|x, s)$. Then, (7) holds.
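When the entries of W are known exactly, Ahlswede’s construction in part (i) is a finite enumeration: a 0–1 stochastic matrix has exactly one 1 per row, so the states are precisely the choice functions selecting an output inside each row’s support. A minimal Python sketch of this enumeration (ours) follows; the point of Section 3 is that no Turing machine can perform it when the entries are merely computable reals, because deciding $W(y|x) > 0$ is then impossible.

```python
import itertools
import numpy as np

def zero_one_avc(W: np.ndarray) -> list:
    """All 0-1 stochastic matrices V with V[x, y] = 1 => W[x, y] > 0.
    Each returned matrix is one state s of the Ahlswede AVC of W;
    the exact support test below is the non-computable step in general."""
    supports = [np.flatnonzero(W[x] > 0) for x in range(W.shape[0])]
    states = []
    for choice in itertools.product(*supports):  # one output per input
        V = np.zeros_like(W)
        for x, y in enumerate(choice):
            V[x, y] = 1.0
        states.append(V)
    return states
```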
The zero-error capacity can be characterized in graph-theoretic terms as well. Let $W \in \mathcal{CH}(\mathcal{X}, \mathcal{Y})$ be given and $G = G(W)$. Shannon [1] introduced the confusability graph $G = (V, E)$ with $V = \mathcal{X}$. In this graph, two letters/vertices x and $x'$ are connected if they can be confused with one another due to the channel noise (i.e., a $y \in \mathcal{Y}$ exists such that $W(y|x) > 0$ and $W(y|x') > 0$). Therefore, the maximum independent set is the maximum set of single-letter messages which can be sent without danger of confusion. In other words, the receiver knows whether the received message is correct or not. It follows that the independence number $\alpha(G)$ is the maximum number of messages which can be sent without danger of confusion. Furthermore, the definition is extended to words of a length n by the strong product $G^{\boxtimes n}$, so that $M(n, W) = \alpha(G^{\boxtimes n})$. Therefore, we can give the following graph-theoretic definition of the Shannon capacity.
Definition 17. The Shannon capacity of a graph G is defined by
$$\Theta(G) = \lim_{n \to \infty} \frac{1}{n} \log \alpha(G^{\boxtimes n}).$$

Shannon discovered the following.

Theorem 2 (Shannon [1]). Let $W \in \mathcal{CH}(\mathcal{X}, \mathcal{Y})$ be a DMC. Then,
$$C_0(W) = \Theta(G(W)).$$
This limit exists and equals the supremum $\sup_{n \in \mathbb{N}} \frac{1}{n} \log \alpha(G^{\boxtimes n})$ according to Fekete’s lemma.

Observe that Theorem 2 yields no further information on whether $C_0(W)$ and $\Theta(G)$ are computable real numbers.
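The graph-theoretic quantities of Definition 17 can likewise be computed for explicitly given small graphs. The following Python sketch (ours; exact rational/float entries assumed) builds the confusability graph, forms strong products, and brute-forces the independence number, which yields the lower bounds $\frac{1}{n} \log \alpha(G^{\boxtimes n}) \leq \Theta(G)$.

```python
import numpy as np

def confusability_graph(W: np.ndarray) -> np.ndarray:
    """Adjacency matrix of G(W): x ~ x' iff some output y has
    W(y|x) > 0 and W(y|x') > 0 (exact entries assumed)."""
    S = (W > 0).astype(int)
    A = ((S @ S.T) > 0).astype(int)
    np.fill_diagonal(A, 0)
    return A

def strong_product(A: np.ndarray, B: np.ndarray) -> np.ndarray:
    """Adjacency matrix of the strong product of two graphs."""
    IA = A + np.eye(len(A), dtype=int)
    IB = B + np.eye(len(B), dtype=int)
    C = (np.kron(IA, IB) > 0).astype(int)
    np.fill_diagonal(C, 0)
    return C

def alpha(A: np.ndarray) -> int:
    """Independence number via include/exclude branching."""
    def best(cands: list) -> int:
        if not cands:
            return 0
        v, rest = cands[0], cands[1:]
        nonneighbors = [u for u in rest if A[v, u] == 0]
        return max(best(rest), 1 + best(nonneighbors))
    return best(list(range(len(A))))
```

For the pentagon graph $C_5$, one obtains $\alpha(C_5) = 2$ and $\alpha(C_5 \boxtimes C_5) = 5$; since $M(n, W) = \alpha(G(W)^{\boxtimes n})$, this reproduces the bounds of the previous sketch on the graph side.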
3. The Algorithmic Computability of the Zero-Error Capacity
In this section, we investigate the algorithmic computability of the zero-error capacity for discrete memoryless channels (DMCs), since no closed-form expression for $C_0$ is known to date. Furthermore, we analyze the algorithmic relationship between Shannon’s and Ahlswede’s characterizations of the zero-error capacity.
We show that the function $W \mapsto C_0(W)$ and, for every blocklength n, the cardinality $M(n, W)$ of a maximum-size zero-error code are not Banach–Mazur-computable. Alon and Lubetzky raised the question of whether the sets $\{G : \Theta(G) < \lambda\}$ are semi-decidable. We provide three equivalent conditions under which the answer is affirmative.
Moreover, we demonstrate that the set of channels with a zero-error capacity of 0 (channels that are useless in this context) is not computable, though it is semi-decidable. To prove this result, we rely on the following auxiliary lemmas.
Lemma 1. Let $\lambda \in \mathbb{R}_c$.
1. There is a Turing machine $TM_{>\lambda}$ that stops for $x \in \mathbb{R}_c$ if and only if $x > \lambda$ applies. Hence, the set $\{x \in \mathbb{R}_c : x > \lambda\}$ is semi-decidable.
2. There is a Turing machine $TM_{<\lambda}$ that stops for $x \in \mathbb{R}_c$ if and only if $x < \lambda$ applies. Hence, the set $\{x \in \mathbb{R}_c : x < \lambda\}$ is semi-decidable.
3. There is no Turing machine that stops for $x \in \mathbb{R}_c$ if and only if $x = \lambda$ applies.
Proof. Let $x \in \mathbb{R}_c$ be given by the quadruple $(a, b, s, e)$ of recursive functions, with
$$r_n = (-1)^{s(n)} \frac{a(n)}{b(n)}, \qquad |r_{e(N)} - x| \leq 2^{-N} \quad (N \in \mathbb{N}).$$
Then, $(\hat{r}_n)_{n \in \mathbb{N}}$ with $\hat{r}_n = \max_{k \leq n} (r_{e(k)} - 2^{-k})$ is a computable monotonically increasing sequence and converges to x. The Turing machine $TM_{>\lambda}$ sequentially computes the sequence $(\hat{r}_n)_{n \in \mathbb{N}}$. Obviously, $\hat{r}_n > \lambda$ holds for some $n \in \mathbb{N}$ if and only if $x > \lambda$. Since $\hat{r}_n$ is always a rational number, $TM_{>\lambda}$ can directly check algorithmically whether $\hat{r}_n > \check{\lambda}_n$ applies, where $(\check{\lambda}_n)_{n \in \mathbb{N}}$ is a computable monotonically decreasing sequence of rationals converging to λ, built from the representation of λ in the same way. We set $TM_{>\lambda}$ to stop as soon as an n with $\hat{r}_n > \check{\lambda}_n$ is found. Then, $x > \lambda$ applies if and only if $TM_{>\lambda}$ stops.
The construction of $TM_{<\lambda}$ is analogous, with the computable sequence $(\check{r}_n)_{n \in \mathbb{N}}$, $\check{r}_n = \min_{k \leq n} (r_{e(k)} + 2^{-k})$, where $(\check{r}_n)_{n \in \mathbb{N}}$ converges monotonically decreasingly to x. Consequently, $TM_{<\lambda}$ stops if and only if $x < \lambda$.
We now want to prove the last statement of this lemma. We provide the proof indirectly. Assume that the corresponding Turing machine $TM_{=\lambda}$ exists. Let $\lambda \in \mathbb{R}_c$ be arbitrary. We consider an arbitrary Turing machine TM, an arbitrary input $n \in \mathbb{N}$, and the computable sequence $(x_k)_{k \in \mathbb{N}}$ of computable numbers:
$$x_k = \begin{cases} \lambda + 2^{-l} & \text{if TM stops for the input } n \text{ after } l \leq k \text{ steps}, \\ \lambda & \text{otherwise}. \end{cases}$$
Obviously, for all $k \in \mathbb{N}$, it holds that $x_k \in \mathbb{R}_c$ and $x := \lim_{k \to \infty} x_k$ exists, where $x = \lambda$ if and only if TM for the input n does not stop in a finite number of steps. For all $k_1, k_2$ with $k_1, k_2 \geq N$, $|x_{k_1} - x_{k_2}| \leq 2^{-N}$ holds, as we will show by considering the following cases:
- Assume that TM stops for the input n after $l \leq N$ steps. For all $k \geq N$, then $x_k = \lambda + 2^{-l}$ applies, and thus $|x_{k_1} - x_{k_2}| = 0$.
- Assume that TM does not stop for the input n after N steps. For all $k \geq N$, then $x_k \in \{\lambda\} \cup \{\lambda + 2^{-l} : l > N\}$, and thus $|x_{k_1} - x_{k_2}| \leq 2^{-N}$.
Hence, we can use the pair consisting of the sequence $(x_k)_{k \in \mathbb{N}}$ and the estimate $|x_k - x| \leq 2^{-k}$ as a computable real number, which we can pass to a potential Turing machine $TM_{=\lambda}$ as input. The identity function serves as a partial recursive modulus of convergence, so this pair is a representation of the computable number x, that is, $x \in \mathbb{R}_c$. Consequently, $TM_{=\lambda}$ stops for the input x if and only if TM for the input n does not stop in a finite number of steps. Running $TM_{=\lambda}$ on x and a step-by-step simulation of TM on n in parallel, exactly one of the two computations stops. Thus, every Turing machine $TM_{=\lambda}$ solves for every input n the halting problem. The halting problem cannot be solved by a Turing machine ([32]). This proves the lemma. □
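The first construction of the proof is short enough to spell out. The following Python sketch (ours) semi-decides $x > \lambda$ for a rational λ, given x as a program with an effective error bound; for a computable λ, one compares against a decreasing rational approximation of λ exactly as in the proof.

```python
from fractions import Fraction
from typing import Callable

def stops_iff_greater(approx: Callable[[int], Fraction], lam: Fraction) -> int:
    """approx(N) must satisfy |approx(N) - x| <= 2^(-N).
    The loop halts (returning a witness N) iff x > lam:
    approx(N) - 2^(-N) is a certified lower bound for x that
    eventually exceeds lam exactly when x > lam."""
    N = 0
    while approx(N) - Fraction(1, 2 ** N) <= lam:
        N += 1  # runs forever when x <= lam: a semi-decision, not a decision
    return N
```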
In the following lemma, we give an example of a function that is not Banach–Mazur-computable.
Lemma 2. Let $\lambda \in \mathbb{R}_c$ be arbitrary. We consider the following function:
$$\Phi_\lambda(x) = \begin{cases} 1 & \text{if } x = \lambda, \\ 0 & \text{if } x \neq \lambda. \end{cases}$$
The function $\Phi_\lambda : \mathbb{R}_c \to \mathbb{R}_c$ is not Banach–Mazur-computable.

Proof. For all $x \in \mathbb{R}_c$, $\Phi_\lambda(x) \in \{0, 1\} \subset \mathbb{R}_c$ holds. We assume that $\Phi_\lambda$ is Banach–Mazur-computable. Let $(x_n)_{n \in \mathbb{N}}$ be an arbitrary computable sequence of computable numbers. Then, the sequence $(\Phi_\lambda(x_n))_{n \in \mathbb{N}}$ is a computable sequence of computable numbers. We take a set $A \subset \mathbb{N}$ that is recursively enumerable but not recursive. Then, let $TM_A$ be a Turing machine that stops for the input n if and only if $n \in A$ holds. $TM_A$ accepts exactly the elements from A. Let $n \in \mathbb{N}$ be arbitrary. We now define
$$x_{n,k} = \begin{cases} \lambda + 2^{-l} & \text{if } TM_A \text{ stops for the input } n \text{ after } l \leq k \text{ steps}, \\ \lambda & \text{otherwise}. \end{cases}$$
Then, $(x_{n,k})_{n,k \in \mathbb{N}}$ is a computable (double) sequence of computable numbers. For $k_1, k_2 \geq N$, the same case distinction as in the proof of Lemma 1 implies
$$|x_{n,k_1} - x_{n,k_2}| \leq 2^{-N}.$$
This means that there is effective convergence for every $n \in \mathbb{N}$. Consequently, for every $n \in \mathbb{N}$, $x_n := \lim_{k \to \infty} x_{n,k} \in \mathbb{R}_c$ with $|x_{n,k} - x_n| \leq 2^{-k}$, and the sequence $(x_n)_{n \in \mathbb{N}}$ is a computable sequence of computable numbers. This means that $(\Phi_\lambda(x_n))_{n \in \mathbb{N}}$ is a computable sequence of computable numbers, where
$$\Phi_\lambda(x_n) = \begin{cases} 1 & \text{if } n \notin A, \\ 0 & \text{if } n \in A \end{cases}$$
applies. The following Turing machine $TM^*$ exists: $TM^*$ computes the value $\Phi_\lambda(x_n)$ for the input n; since the values lie in $\{0, 1\}$, an approximation with error below $\frac{1}{2}$ determines the value exactly. If $\Phi_\lambda(x_n) = 1$, then $TM^*$ outputs that $n \notin A$. If $\Phi_\lambda(x_n) = 0$, then $TM^*$ outputs that $n \in A$. This applies to every $n \in \mathbb{N}$, and therefore A is recursive, which contradicts the assumption. This means that $\Phi_\lambda$ is not Banach–Mazur-computable. □
Theorem 3. Let $\mathcal{X}, \mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$. Then, $C_0 : \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) \to \mathbb{R}$ is not Banach–Mazur-computable.

Proof. Let $|\mathcal{X}| = |\mathcal{Y}| = 2$; then, we will show that $C_0$ is not Banach–Mazur-computable (the general case follows by embedding this binary family). For $x \in \mathbb{R}_c$ with $0 \leq x \leq \frac{1}{2}$, we choose $\mathcal{X} = \mathcal{Y} = \{0, 1\}$ and the channel $W_x$ with $W_x(0|0) = 1$, $W_x(1|0) = 0$, $W_x(0|1) = x$, and $W_x(1|1) = 1 - x$. Then, we have
$$C_0(W_x) = \begin{cases} \log 2 & \text{if } x = 0, \\ 0 & \text{if } x > 0. \end{cases}$$
We consider the function $F : \mathbb{R}_c \cap [0, \frac{1}{2}] \to \mathbb{R}$ with $F(x) = C_0(W_x) = \log 2 \cdot \Phi_0(x)$. It follows from Lemma 2 that F is not Banach–Mazur-computable. Since $x \mapsto W_x$ maps computable sequences of computable numbers to computable sequences of computable channels, $C_0$ is not Banach–Mazur-computable either. □
Therefore, the zero-error capacity cannot be computed algorithmically.
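The discontinuity exploited in the proof is worth displaying. A minimal sketch of the binary channel family used above (our parameterization, matching the proof’s idea):

```python
import numpy as np

def W_family(x: float) -> np.ndarray:
    """Row-stochastic 2x2 channel: for x = 0 the inputs are perfectly
    distinguishable and C0(W_0) = log 2; for every x > 0 both rows put
    positive mass on output 0, the confusability graph becomes complete,
    and C0(W_x) = 0. This jump at x = 0 defeats Banach-Mazur
    computability, since x = 0 cannot be detected from approximations."""
    return np.array([[1.0, 0.0],
                     [x, 1.0 - x]])
```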
Remark 1. There are still some questions that we would like to discuss.
1. It is not clear whether $C_0(W) \in \mathbb{R}_c$ applies to all channels $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$.
2. In addition, it is not clear whether Θ is Borel–Turing-computable. Theorem 3 shows that this does not apply to the zero-error capacity for DMCs; we have shown that $C_0$ is not even Banach–Mazur-computable.
In the following, we want to investigate the semi-decidability of the sets $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) > \lambda\}$.
Theorem 4. Let $\mathcal{X}, \mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$. For all $\lambda \in \mathbb{R}_c$ with $0 \leq \lambda < \log \min(|\mathcal{X}|, |\mathcal{Y}|)$, the sets $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) > \lambda\}$ are not semi-decidable.
Proof. Let $\mathcal{X}$ and $\mathcal{Y}$ be arbitrary finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$, and let $\lambda \in \mathbb{R}_c$. First, we consider the case $\lambda = 0$. Let us consider a channel $W_0 \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$ that maps two distinct inputs noiselessly to two distinct outputs. It holds that $C_0(W_0) \geq \log 2$. For $x \in \mathbb{R}_c$ with $0 \leq x \leq \frac{1}{2}$, we define the channel
$$W_x(y|x') = (1 - x) W_0(y|x') + x \frac{1}{|\mathcal{Y}|}, \qquad x' \in \mathcal{X},\; y \in \mathcal{Y}.$$
It holds that $C_0(W_x) = 0$ for $x > 0$, since all entries of $W_x$ are then strictly positive and the confusability graph is complete. Let us now assume that λ = 0 is such that the set $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) > 0\}$ is semi-decidable. Then, we consider the Turing machine $TM_M$ which accepts this set. Furthermore, we consider for $x \in \mathbb{R}_c$ with $0 \leq x \leq \frac{1}{2}$ the following Turing machine $TM^*$:
- $TM^*$ simulates two Turing machines $TM_1$ and $TM_2$;
- In parallel, $TM_1$ receives the input x and tests if $x > 0$;
- $TM_1$ stops if and only if $x > 0$.
It is shown in Lemma 1 that such a Turing machine exists. For the input $x = 0$, $TM_1$ computes forever. The second Turing machine is defined by
- $TM_2(x) = TM_M(W_x)$;
- For $x = 0$, it holds that $W_x = W_0$ and $C_0(W_0) > 0$;
- Therefore, $TM_2$ stops for x if and only if $x = 0$.
We now let $TM^*$ stop for the input x if and only if one of the two Turing machines $TM_1$ or $TM_2$ stops. Exactly one Turing machine has to stop for every $x \in \mathbb{R}_c$ with $0 \leq x \leq \frac{1}{2}$.
If the Turing machine $TM_1$ stops at the input x, we set $TM^*(x) = 1$. If the Turing machine $TM_2$ stops at the input x, we set $TM^*(x) = 0$. Therefore, we have
$$TM^*(x) = \begin{cases} 0 & \text{if } x = 0, \\ 1 & \text{if } x > 0. \end{cases}$$
In particular, a Turing machine would exist that stops exactly for $x = 0$. We have shown in Lemma 1 that such a Turing machine cannot exist. This proves the theorem for $\lambda = 0$. The proof for $\lambda > 0$ is very similar. □
For $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$ and $n \in \mathbb{N}$, let $M(n, W)$ be the cardinality of a maximum code with a decoding error of 0. This maximum code always exists because we only have a finite set of possible codes for the blocklength n. Of course, a well-defined function $W \mapsto M(n, W)$ exists for every $n \in \mathbb{N}$. Because of Fekete’s lemma, we have
$$C_0(W) = \lim_{n \to \infty} \frac{\log M(n, W)}{n} = \sup_{n \in \mathbb{N}} \frac{\log M(n, W)}{n}. \qquad (14)$$
We now have the following theorem regarding the Banach–Mazur computability of the function $W \mapsto M(n, W)$.
Theorem 5. Let $\mathcal{X}, \mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$. The function
$$W \mapsto M(n, W)$$
is not Banach–Mazur-computable for all $n \in \mathbb{N}$.

Proof. Let $\mathcal{X}$ and $\mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$, and let $n \in \mathbb{N}$ be arbitrary. Consider the “ideal channel” $W_{id}$ with $M(n, W_{id}) = \min(|\mathcal{X}|, |\mathcal{Y}|)^n$. Furthermore, consider any channel W with $W(y|x) > 0$ for all $x \in \mathcal{X}$ and all $y \in \mathcal{Y}$. Then, $M(n, W) = 1$ for all $n \in \mathbb{N}$, and consequently, because of (14), $C_0(W) = 0$. Now, we can directly apply the proof of Theorem 3 to the function $W \mapsto M(n, W)$: along the family $(W_x)$ constructed there, $M(n, W_x)$ takes the value $2^n$ for $x = 0$ and the value 1 for $x > 0$. $M(n, \cdot)$ is therefore not Banach–Mazur-computable. □
We now want to examine the question of whether a computable sequence of Banach–Mazur-computable lower bounds can be found for $C_0$. We set
$$F_n(W) = \frac{1}{n} \log M(n, W).$$
For all $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$ and for all $n \in \mathbb{N}$, we have $F_n(W) \leq C_0(W)$ and $\lim_{n \to \infty} F_n(W) = C_0(W)$. However, this cannot be expressed algorithmically, because due to Theorem 5, the functions $F_n$ are not Banach–Mazur-computable. We next want to show that this is a general phenomenon for $C_0$.
Theorem 6. Let $\mathcal{X}, \mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$. No computable sequence $(F_n)_{n \in \mathbb{N}}$ of Banach–Mazur-computable functions $F_n : \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) \to \mathbb{R}$ exists that simultaneously satisfies the following:
1. For all $n \in \mathbb{N}$, it holds that $F_n(W) \leq C_0(W)$ for all $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$;
2. For all $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$, it holds that $\lim_{n \to \infty} F_n(W) = C_0(W)$.
Proof. Assume to the contrary that finite alphabets $\mathcal{X}$ and $\mathcal{Y}$ exist with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$, as well as a computable sequence $(F_n)_{n \in \mathbb{N}}$ of Banach–Mazur-computable functions, such that the following holds true:
- For all $n \in \mathbb{N}$ and all $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$, we have $F_n(W) \leq C_0(W)$;
- For all $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$, we have $\lim_{n \to \infty} F_n(W) = C_0(W)$.
We consider for $n \in \mathbb{N}$ the function
$$G_n = \max(F_1, \ldots, F_n).$$
The function $G_n$ is Banach–Mazur-computable (see [37]). The sequence $(G_n)_{n \in \mathbb{N}}$ is a computable sequence of Banach–Mazur-computable functions. For all $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$, it holds that $G_n(W) \leq G_{n+1}(W) \leq C_0(W)$ and $\lim_{n \to \infty} G_n(W) = C_0(W)$. Since the sequence $(G_n)_{n \in \mathbb{N}}$ can be computed, we can find a Turing machine $TM_G$, so that for all $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$ and for all $n \in \mathbb{N}$, $TM_G(W, n) = G_n(W)$ applies (see [43]). Let $\lambda \in \mathbb{R}_c$ be given arbitrarily with $0 < \lambda < \log 2$. In addition, we also use the Turing machine $TM_{>\lambda}$, which stops for the input x if and only if $x > \lambda$ (see the proof of Theorem 4, where such a machine was obtained from Lemma 1). Just as in the proof of Theorem 4, we use the two Turing machines $TM_G$ and $TM_{>\lambda}$ to build a Turing machine $TM^*$. The Turing machine $TM^*$ stops exactly when an $n \in \mathbb{N}$ exists, so that $G_n(W) > \lambda$ holds. Such an n exists for W if and only if $C_0(W) > \lambda$. The set $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) > \lambda\}$ will then be semi-decidable, which is a contradiction to Theorem 4. □
We make the following observation: For all $\lambda \in \mathbb{R}_c$ with $\lambda \geq 0$, the sets $\{G : \Theta(G) > \lambda\}$ are semi-decidable. It holds that $\Theta(G) = \sup_{n \in \mathbb{N}} \frac{1}{n} \log \alpha(G^{\boxtimes n})$.
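This observation can be turned into a one-loop procedure. The sketch below (ours) reuses strong_product and alpha from the sketch at the end of Section 2 and halts exactly when some blocklength witnesses $\Theta(G) > \lambda$; for irrational computable λ, the comparison would have to be carried out via rational approximations as in Lemma 1.

```python
import numpy as np

def stops_iff_theta_exceeds(A: np.ndarray, lam: float) -> int:
    """Semi-decision for 'Theta(G) > lam': since Theta(G) is the
    supremum of (1/n) log2 alpha of the n-th strong power, a witness n
    with alpha > 2^(n * lam) exists iff Theta(G) > lam."""
    P, n = A, 1
    while True:
        if alpha(P) > 2 ** (n * lam):
            return n  # witness blocklength found
        P = strong_product(P, A)  # move to the (n+1)-st strong power
        n += 1
```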
Theorem 7. The following three statements A, B, and C are equivalent:
A. For all $\mathcal{X}$ and $\mathcal{Y}$, where $\mathcal{X}$ and $\mathcal{Y}$ are finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$, and for all $\lambda \in \mathbb{R}_c$ with $\lambda > 0$, the sets $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) < \lambda\}$ are semi-decidable.
B. For all $\lambda \in \mathbb{R}_c$ with $\lambda > 0$, the sets $\{G : \Theta(G) < \lambda\}$ are semi-decidable.
C. For all $\mathcal{X}$ and $\mathcal{Y}$, where $\mathcal{X}$ and $\mathcal{Y}$ are finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$, and for all $\lambda \in \mathbb{R}_c$ with $\lambda > 0$, the sets $\{(W(\cdot|\cdot, s))_{s \in \mathcal{S}} \in \mathcal{W}_{0,1} : C_{\max}((W(\cdot|\cdot, s))_{s \in \mathcal{S}}) < \lambda\}$ are semi-decidable.
Proof. First, we show $A \Rightarrow B$. Let $\lambda \in \mathbb{R}_c$ with $\lambda > 0$, and let $G = (V, E)$ be chosen arbitrarily. Let $\mathcal{X}, \mathcal{Y}$ be finite sets with $\mathcal{X} = V$ and $|\mathcal{Y}| = |V| + |E|$. Then, the set $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) < \lambda\}$ is semi-decidable by assumption. Let $TM_A$ be the associated Turing machine. From G, we algorithmically construct a channel W with $G(W) = G$ as follows. We consider the set $E' = E \cup \{\{x\} : x \in V\}$ and an arbitrary output alphabet $\mathcal{Y}$ with the bijection $g : E' \to \mathcal{Y}$. Therefore, it is obvious that $|\mathcal{Y}| = |E| + |V|$. For $x \in V$, we set $K(x) = \{e \in E' : x \in e\}$. We define
$$W(y|x) = \begin{cases} \frac{1}{|K(x)|} & \text{if } g^{-1}(y) \in K(x), \\ 0 & \text{otherwise}. \end{cases}$$
Of course, $G(W) = G$ applies to the confusability graph of W. It holds that $C_0(W) = \Theta(G)$. Therefore, $\Theta(G) < \lambda$ if and only if $C_0(W) < \lambda$. Therefore, if $\Theta(G) < \lambda$, then $TM_A$ stops for the input W. Conversely, if $TM_A$ stops for W, then $C_0(W) < \lambda$. Therefore, $\Theta(G) < \lambda$. Thus, we have shown $A \Rightarrow B$.
Now, we show $B \Rightarrow A$. Let $\mathcal{X}$ and $\mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$, and let $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$. We construct a sequence of confusability graphs as follows.
For all pairs $(x, \bar{x})$ with $x \neq \bar{x}$, we start the first computation step of the Turing machine $TM_{>0}$ from Lemma 1 in parallel for the computable number $\sum_{y \in \mathcal{Y}} W(y|x) W(y|\bar{x})$. That means, for this input, we compute the first step of the calculation of $TM_{>0}$. If the Turing machine stops at this input in the first step, then $G_1$ has the edge $(x, \bar{x})$. If for $(x, \bar{x})$ with $x \neq \bar{x}$ the Turing machine does not stop after the first step, then $(x, \bar{x}) \notin E(G_1)$.
For $N \geq 2$, we construct $G_N$ as follows. For all $(x, \bar{x})$ that have no edge in $G_{N-1}$, we let $TM_{>0}$ calculate the N-th computation step at the input $\sum_{y \in \mathcal{Y}} W(y|x) W(y|\bar{x})$. If $TM_{>0}$ stops, then we set $G_N$ to have the edge $(x, \bar{x})$; $G_N$ also receives all edges of $G_{N-1}$. If for $(x, \bar{x})$ with $x \neq \bar{x}$ the Turing machine $TM_{>0}$ does not stop after the N-th step, then $(x, \bar{x}) \notin E(G_N)$. We continue this process iteratively, generating a sequence of graphs $(G_N)_{N \in \mathbb{N}}$, all sharing the same vertex set $\mathcal{X}$, with the edges satisfying $E(G_N) \subseteq E(G_{N+1})$. The Turing machine $TM_{>0}$ stops for the input $\sum_{y} W(y|x) W(y|\bar{x})$ if and only if $(x, \bar{x})$ is an edge of the confusability graph $G(W)$. We have a number of tests in each step that falls monotonically depending on N (generally not strictly). It holds that an $N_0$ exists such that $G_N = G_{N_0}$ for all $N \geq N_0$, and $G_{N_0}$ is the confusability graph of W. Note that we do not have a computable upper bound for $N_0$. However, the latter is not required for the proof. Therefore,
$$\lim_{N \to \infty} G_N = G(W).$$
Let $TM_B$ be the Turing machine which accepts the set $\{G : \Theta(G) < \lambda\}$. Since $E(G_N) \subseteq E(G(W))$ holds, we have $\Theta(G_N) \geq \Theta(G(W)) = C_0(W)$ for all N, so that an N with $\Theta(G_N) < \lambda$ exists if and only if $C_0(W) < \lambda$. These are all graphs with the same set of nodes, and an $N_0$ with $G_N = G(W)$ for $N \geq N_0$ exists. Furthermore, the sequence is computable. We only have to test, for the sequence $(G_N)_{N \in \mathbb{N}}$, which is generated algorithmically from W, whether $TM_B$ stops for some $G_N$. This means that we have to test whether $TM_B$ stops for a certain N. We compute the first step of $TM_B$ for $G_1$. If the Turing machine stops, then $C_0(W) < \lambda$. Otherwise, we compute the second step for $G_1$ and the first step for $G_2$. We continue recursively like this, and it is clear that the computation stops if and only if an N with $\Theta(G_N) < \lambda$ exists, i.e., if and only if $C_0(W) < \lambda$. Otherwise, the Turing machine computes forever.
Now, we show $A \Rightarrow C$. Let $(W(\cdot|\cdot, s))_{s \in \mathcal{S}} \in \mathcal{W}_{0,1}$ and let $\lambda \in \mathbb{R}_c$ with $\lambda > 0$ be arbitrarily chosen. From $(W(\cdot|\cdot, s))_{s \in \mathcal{S}}$, we can effectively construct a DMC W according to the Ahlswede approach (Theorem 1), so that $C_0(W) = C_{\max}((W(\cdot|\cdot, s))_{s \in \mathcal{S}})$. This means that $C_{\max}((W(\cdot|\cdot, s))_{s \in \mathcal{S}}) < \lambda$ if and only if $C_0(W) < \lambda$. By assumption, the set $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) < \lambda\}$ is semi-decidable. We have used it to construct a Turing machine that stops when $C_{\max}((W(\cdot|\cdot, s))_{s \in \mathcal{S}}) < \lambda$ applies; otherwise, it computes forever. Therefore, C holds.
Now, we show $C \Rightarrow A$. The idea of this part of the proof is similar to that of part $B \Rightarrow A$. Let $W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$ be arbitrary. Similar to case $B \Rightarrow A$, we construct a suitable sequence $((W_N(\cdot|\cdot, s))_{s \in \mathcal{S}_N})_{N \in \mathbb{N}}$ of computable sequences of AVCs on $\mathcal{X}$ and $\mathcal{Y}$, such that the following assertions are satisfied:
- For all N, we have $(W_N(\cdot|\cdot, s))_{s \in \mathcal{S}_N} \in \mathcal{W}_{0,1}$, as well as $W_N(y|x, s) = 1 \Rightarrow W(y|x) > 0$ for all $x \in \mathcal{X}$, all $y \in \mathcal{Y}$, and all $s \in \mathcal{S}_N$.
- An $N_0$ exists such that $(W_N(\cdot|\cdot, s))_{s \in \mathcal{S}_N} = (W_{N_0}(\cdot|\cdot, s))_{s \in \mathcal{S}_{N_0}}$ for all $N \geq N_0$, and this AVC is exactly the 0–1 AVC that the Ahlswede construction assigns to W.
The AVC $(W_{N_0}(\cdot|\cdot, s))_{s \in \mathcal{S}_{N_0}}$ then satisfies the requirements of Theorem 1.
In general, $N_0$ cannot be computed effectively depending on W, but this is not a problem for the semi-decidability for all finite $\mathcal{X}, \mathcal{Y}$ with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$.
So, we have, for $N \geq N_0$,
$$C_{\max}\big((W_N(\cdot|\cdot, s))_{s \in \mathcal{S}_N}\big) = C_0(W),$$
and since $\mathcal{S}_N \subseteq \mathcal{S}_{N+1}$, the capacities $C_{\max}((W_N(\cdot|\cdot, s))_{s \in \mathcal{S}_N})$ are monotonically non-increasing in N and always at least $C_0(W)$, so it holds that an N with $C_{\max}((W_N(\cdot|\cdot, s))_{s \in \mathcal{S}_N}) < \lambda$ exists if and only if $C_0(W) < \lambda$. We can use this property and the semi-decidability requirement in C, just like in the proof of $B \Rightarrow A$, to construct a Turing machine $TM^*$ which stops for W exactly when $C_0(W) < \lambda$ applies, and otherwise computes forever.
This proves the theorem. □
Remark 2 (See also Section 1). Alon and Lubetzky have asked whether the sets $\{G : \Theta(G) < \lambda\}$ are semi-decidable (see [44]). We see that the answer to Alon and Lubetzky’s question is positive if and only if Assertion A from Theorem 7 holds true. This is interesting for the following reason: on the one hand, the set $\{x \in \mathbb{R}_c : x > \lambda\}$ is semi-decidable for $\lambda \in \mathbb{R}_c$ (Lemma 1), but on the other hand, even for $\mathcal{X}$ and $\mathcal{Y}$ with $|\mathcal{X}| = |\mathcal{Y}| = 2$, the set $\{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) > \lambda\}$ is not semi-decidable. So, there is no equivalence regarding the semi-decidability of these sets.

In the next theorem, we look at useless channels in terms of the zero-error capacity. The set of useless channels is defined by
$$N(\mathcal{X}, \mathcal{Y}) = \{W \in \mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) : C_0(W) = 0\},$$
where $\mathcal{X}$ and $\mathcal{Y}$ are finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$. It is clear from our previous theorem that the complement $\mathcal{CH}_c(\mathcal{X}, \mathcal{Y}) \setminus N(\mathcal{X}, \mathcal{Y})$ is not semi-decidable.
Theorem 8. Let $\mathcal{X}$ and $\mathcal{Y}$ be finite alphabets with $|\mathcal{X}| \geq 2$ and $|\mathcal{Y}| \geq 2$. Then, the set $N(\mathcal{X}, \mathcal{Y})$ is semi-decidable.
Proof. For the proof of this theorem, we use the proof of Theorem 7. We have to construct a Turing machine $TM^*$ as follows. $TM^*$ is defined on $\mathcal{CH}_c(\mathcal{X}, \mathcal{Y})$ and stops for an input W if and only if $C_0(W) = 0$; otherwise, it calculates forever. For the input W, we start the Turing machine $TM_{>0}$ in parallel for all $(x, \bar{x})$ with $x \neq \bar{x}$ and all $y \in \mathcal{Y}$ and test $W(y|x) W(y|\bar{x}) > 0$. We let all Turing machines compute one step of the computation in parallel. As soon as a Turing machine stops, it will not be continued. The Turing machine $TM^*$ stops if and only if a y exists for every $(x, \bar{x})$ with $x \neq \bar{x}$ such that the corresponding Turing machine stops. Then, the confusability graph G is a complete graph, and consequently, $\alpha(G) = 1$ and $C_0(W) = \Theta(G) = 0$. □
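The dovetailing in this proof can be sketched directly, assuming each entry $W(y|x)$ is accessible as a program with an effective error bound (our hypothetical interface `approx(x, y, N)`; not part of the paper):

```python
from fractions import Fraction
from itertools import combinations, count

def stops_iff_useless(approx, num_inputs: int, num_outputs: int) -> bool:
    """approx(x, y, N): rational with |approx(x, y, N) - W(y|x)| <= 2^(-N).
    Halts iff every input pair shares an output of positive probability,
    i.e., the confusability graph is complete and C0(W) = 0; otherwise
    it searches forever, which is exactly semi-decidability."""
    pending = set(combinations(range(num_inputs), 2))
    for N in count():  # dovetail over ever finer precision
        eps = Fraction(1, 2 ** N)
        for pair in list(pending):
            x1, x2 = pair
            for y in range(num_outputs):
                # approx > eps certifies that the true value is > 0
                if approx(x1, y, N) > eps and approx(x2, y, N) > eps:
                    pending.discard(pair)
                    break
        if not pending:
            return True
```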
In [45], on the other hand, it was shown that the zero-error capacity for fixed input and output alphabets can be calculated on a Blum–Shub–Smale machine.
5. The Computability of the Zero-Error Capacity with the Kolmogorov Oracle
We have shown that the zero-error capacity $C_0$ is not Banach–Mazur-computable as a function of the channel. The question now arises as to whether a Turing machine with additional input can be found so that, for example, upper bounds for the zero-error capacity can be calculated. This question will be briefly discussed in this section. In [31], we showed that the zero-error capacity is semicomputable if we allow for the Kolmogorov oracle.
To define the Kolmogorov oracle, we need a special enumeration of $\mathbb{N}$. The problem is that the natural listing of the set of natural numbers is inappropriate, because many numbers in $\mathbb{N}$ are too large for the natural enumerations. We start with the set of partial recursive functions from $\mathbb{N}$ to $\mathbb{N}$. A listing $(\phi_i)_{i \in \mathbb{N}}$ of the partial recursive functions is called an optimal listing if, for any other recursive listing $(\psi_j)_{j \in \mathbb{N}}$ of the set of partial recursive functions, there is a constant $c > 0$ such that for all $j \in \mathbb{N}$, the following holds: an $i \in \mathbb{N}$ exists with $\phi_i = \psi_j$ and $i \leq c \cdot j$. This means that all partial recursive functions have a small Gödel number with respect to the system $(\phi_i)_{i \in \mathbb{N}}$. Schnorr [34] has shown that such an optimal recursive listing of the set of partial recursive functions exists. The same holds true for the sets of natural numbers.
For $\mathbb{N}$, let $(\phi_i)_{i \in \mathbb{N}}$ be an optimal listing and $(G_j)_{j \in \mathbb{N}}$ be a numbering of graphs. For the set of graphs, we define the complexity $K(G)$ of a graph G as the smallest index of a program in the optimal listing that produces G. This is the Kolmogorov complexity generated by $(\phi_i)_{i \in \mathbb{N}}$ and $(G_j)_{j \in \mathbb{N}}$.

Definition 18. The Kolmogorov oracle $O_K(\cdot)$ is a function of $\mathbb{N}$ into the power set of the set of graphs that produces a list
$$O_K(n) = \{G : K(G) \leq n\}$$
for each $n \in \mathbb{N}$, where the graphs G are listed by size.

Let TM be a Turing machine. We say that TM can use the oracle $O_K$ if, for every $n \in \mathbb{N}$, for the input n, the Turing machine acquires the list $O_K(n)$. With $TM(O_K)$, we denote a Turing machine that has access to the oracle $O_K$. We now consider, for $\lambda \in \mathbb{R}_c$, the set $\{G : \Theta(G) < \lambda\}$, i.e., the λ-level set of the zero-error capacity. We have the following theorem:
Theorem 11 ([31]). Let $\lambda \in \mathbb{R}_c$. Then, the set $\{G : \Theta(G) < \lambda\}$ is decidable with a Turing machine $TM(O_K)$. This means a Turing machine exists such that the set is computable with this Turing machine with the oracle $O_K$.

Corollary 1 ([31]). Let $\lambda_1, \lambda_2 \in \mathbb{R}_c$ with $\lambda_1 < \lambda_2$. Then, the set $\{G : \lambda_1 \leq \Theta(G) < \lambda_2\}$ is semi-decidable for Turing machines with the oracle $O_K$.

Alon and Lubetzky have asked whether the set $\{G : \Theta(G) < \lambda\}$ is semi-decidable. We gave in [31] a positive answer to this question, provided that we may include the oracle. We do not know if Θ is computable with respect to Turing machines with the oracle $O_K$.
Let $\mu \in \mathbb{R}_c$ be a number with $\mu > 0$. We set $\lambda_k = k \mu$ for $k \in \mathbb{N}$. We have the following theorem:

Theorem 12. A Turing machine $TM^{**}$ exists with access to the oracle $O_K$ such that for all graphs G, it holds that
$$|TM^{**}(G) - \Theta(G)| \leq \mu.$$
Thus, this approach does not directly provide the computability of Θ through Turing machines with the oracle $O_K$. However, we can compute Θ with any given accuracy.
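The mechanism behind Theorem 12 is a grid search over the level sets of Theorem 11. A hedged sketch (ours), with the oracle-backed test abstracted into a hypothetical callable `below(G, lam)` deciding $\Theta(G) < \lambda$:

```python
from typing import Callable

def approximate_theta(G, mu: float, below: Callable[[object, float], bool],
                      upper: float) -> float:
    """Locate Theta(G) within an interval of length mu by scanning the
    grid mu, 2*mu, ... below a trivial upper bound such as log2 |V|.
    below(G, lam) is assumed to decide 'Theta(G) < lam' (Theorem 11
    grants this relative to the Kolmogorov oracle O_K)."""
    k = 0
    while k * mu < upper and not below(G, (k + 1) * mu):
        k += 1
    return k * mu  # Theta(G) lies in [k * mu, (k + 1) * mu)
```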
We have seen that in order to prove the computability of $C_0$ or Θ, we need computable converses. In this sense, the recent characterization by Zuiddam [47] using the functions from the asymptotic spectrum of graphs is interesting.
6. Conclusions and Discussion
This paper revisited Ahlswede’s foundational approach in [2] to characterizing the zero-error capacity using arbitrarily varying channels (AVCs). Although the theoretical connection remains intriguing, it has not yet yielded practical methods for calculating $C_0$ for discrete memoryless channels (DMCs). Obstacles include the absence of explicit formulas for the maximum-error AVC capacities and the impossibility of algorithmically transforming any DMC into a finite 0–1 AVC, as shown by Theorems 3, 4, and 6. These results prove that no Turing machine can realize the map
$$W \mapsto \{W(\cdot|\cdot, s) : s \in \mathcal{S}\}.$$
Table 2 gives an overview of the main results of this paper.
Our focus has been on the computability of the zero-error capacity as a function of a DMC W; we did not address the computability questions arising from a graph-based perspective via the confusability graph $G(W)$. Whether the Shannon capacity $\Theta(G)$ is computable in that representation remains open.
This paper shows that the confusability graph derived from a channel’s transition matrix is not computable in general. This means that one cannot simply calculate the graph from the matrix data. Consequently, knowing the capacity of this confusability graph does not provide a concrete tool for evaluating the performance of the channel.
Furthermore, the capacity of the confusability graph is defined as a regularized limit, which makes it intrinsically difficult to evaluate in practice. In other words, this capacity is not given by a single, computable expression; instead, it emerges only in the limit of increasingly long codes or repeated graph operations. This regularization step complicates any attempt to actually compute or approximate the capacity, rendering it of limited utility in practical scenarios.
Nevertheless, because transition-matrix descriptions of DMCs are standard in practical settings, our negative computability results are of broad significance.
Beyond coding theory, the zero-error capacity is relevant in areas like remote state estimation and quantum communication (see [3,48]). Our findings are part of a broader narrative in information theory: many core problems, such as calculating the finite-state channel capacity [49,50,51], optimizing mutual information, and even constructing capacity-achieving codes [52,53], have been proven to be non-Turing-computable in general.
A compelling open question is whether computable channels W exist for which $C_0(W)$ itself is a non-computable real. If so, this would establish that exact capacity statements require more than algorithmic effort: they confront the fundamental limits of computability. Similar phenomena have been observed in compound channels [54], colored Gaussian channels [55], and Wiener prediction problems [56], suggesting that rich, non-computable structures may also appear in zero-error contexts.
Moving forward, research should probe the algorithmic frontiers of zero-error information theory, especially in connection with automated systems and software-defined communications (see [57,58]). Though Turing computability hits a wall, other computational frameworks, such as Blum–Shub–Smale machines, may offer new possibilities [45]. Understanding these alternative models may be key to effectively navigating the computability landscape of the zero-error capacity.
In summary, while the zero-error capacity remains a cornerstone of information theory, our results clarify that its algorithmic determination is blocked by deep, non-computable obstructions. Characterizing or circumventing these obstructions should be a key priority in future studies.