The Computability of the Channel Reliability Function and Related Bounds

Boche, Holger; Deppe, Christian

doi:10.3390/a18060361

Open AccessArticle

The Computability of the Channel Reliability Function and Related Bounds

by

Holger Boche

¹

and

Christian Deppe

^2,*

¹

Theoretical Information Technology, Technical University of Munich, 80333 Munich, Germany

²

Institute for Communications Technology, Technische Universität Braunschweig, 38106 Brunswick, Germany

^*

Author to whom correspondence should be addressed.

Algorithms 2025, 18(6), 361; https://doi.org/10.3390/a18060361

Submission received: 28 March 2025 / Revised: 21 May 2025 / Accepted: 6 June 2025 / Published: 11 June 2025

(This article belongs to the Special Issue Numerical Optimization and Algorithms: 3rd Edition)

Download Versions Notes

Abstract

The channel reliability function is a crucial tool for characterizing the dependable transmission of messages across communication channels. In many cases, the only upper and lower bounds of this function are known. We investigate the computability of the reliability function and its associated functions, demonstrating that the reliability function is not Turing computable. This also holds true for functions related to the sphere packing bound and the expurgation bound. Additionally, we examine the

R_{\infty}

function and zero-error feedback capacity, as they are vital in the context of the reliability function. Both the

R_{\infty}

function and the zero-error feedback capacity are not Banach–Mazur computable.

Keywords:

Turing computability; channel reliability function; zero-error feedback capacity

1. Introduction

In [1], C. Shannon established the foundations of information theory by characterizing the key mathematical properties of communication channels. For a transmission rate R that is less than the channel capacity C, the probability of erroneous decoding with respect to an optimal code decreases exponentially as the code length

n \in N

increases. Shannon introduced the channel reliability function

E (R)

as being the exponent governing this exponential decrease in relation to the transmission rate R.

A major goal in information theory is to find a closed-form expression for the channel reliability function. This expression should be computable and fully determined by the parameters of the communication task. Naturally, we must define what constitutes a closed-form expression. In [2], Chow, and in [3], Borwein and Crandall discuss different approaches to defining closed-form expressions. All of the various representations satisfy the requirement that the corresponding functions can be computed algorithmically using a digital computer. This can be achieved with great precision, depending on the inputs within their domain of definition.

Shannon’s characterization of the capacity for message transmission via the discrete memoryless channel (DMC) in [1], Ahlswede’s characterization of the capacity for message transmission via the multiple access channel in [4], and Ahlswede and Dueck’s characterization of the identification capacity for DMCs in [5] are all significant examples of closed-form solutions using elementary functions. These provide important instances of the computability of the corresponding performance functions, as defined in the previous context. The precise definition of computability, as outlined by Turing, is presented in Section 2.

Lovász’s characterization of the zero-error capacity for the pentagram also represents a closed-form number according to Chow’s definition in [2], which can be computed algorithmically—an outcome that is desirable. However, the characterization of the zero-error capacity for a cyclical heptagon remains an open problem. Moreover, it is still unclear whether the zero-error capacities of DMCs can take computable values for computable channels. Additionally, the algorithmic computability of the broadcast capacity region is still uncertain.

In the age of artificial intelligence, it is increasingly important to determine whether a digital computer can solve a given problem or compute a given function. Since every function that can be computed by a digital computer can also be computed by a Turing machine (as will be discussed in more detail below), this question is reduced to asking whether a function is computable. It is therefore crucial to distinguish between determining how to compute the zero-error capacity and whether it is computable at all. In this work, we focus on the latter: the computability of the zero-error capacity.

The Lovász

ϑ

-function for graphs was analyzed in [6] from three distinct research perspectives related to various graph invariants. This investigation resulted in new insights into the Shannon capacity of graphs, observations on cospectral and nonisomorphic graphs, and bounds on graph invariants while also serving as a tutorial in zero-error information theory and algebraic graph theory. Further observations on the Lovász

ϑ

-function are provided by the author in [7].

In this paper, we provide a negative answer to the question of whether the channel reliability function and several related bounds are algorithmically computable by Turing machines.

Significant research has been conducted on the channel reliability function, but many aspects of its behavior remain unresolved (see surveys [8] and [9]). In fact, a complete characterization of the channel reliability function is still unknown for binary-input binary-output channels. As a result, considerable efforts have been made to derive computable lower and upper bounds for the function (see [10,11,12]).

Determining the behavior of the channel reliability function across the entire interval

(0, C)

is a challenging problem. Various approaches have attempted to compute the reliability function algorithmically by constructing sequences of upper and lower bounds. The first significant contribution in this direction was made by Shannon, Gallager, and Berlekamp in [13].

A fundamental question that arises is whether the reliability function can be computed in this manner. To investigate this, we employ the framework of Turing computability [14]. In general, a function is considered Turing computable if there exists an algorithm capable of computing it. The Turing machine serves as the most fundamental and powerful model of computation, underpinning theoretical computer science. Unlike physical computers, which have practical constraints, a Turing machine is an abstract mathematical construct that can be rigorously analyzed using formal mathematical methods.

It is important to note that the Turing machine represents the ultimate performance limit of current digital computers, including supercomputers. A Turing machine models an algorithm or a program, where computation consists of step-by-step manipulation of symbols or characters that are read from and written to a memory tape according to a set of rules. These symbols can be interpreted in various ways, including as numbers. To perform computations on abstract sets, the elements of the set must be encoded as strings of symbols on the tape. This approach allows Turing computability to be defined for real and complex numbers.

The use of digital computers to compute approximations of channel capacities or channel reliability functions has been a prominent topic in information theory. The computation of channel capacity for discrete memoryless channels (DMCs) is a convex optimization problem, and in 1972, an algorithm for approximating the capacity of a DMC on digital computers was independently published in [15] and [16].

Even for binary symmetric channels with rational crossover probabilities (excluding the case

p = \frac{1}{2}

), the channel capacity is a transcendental number. As a result, despite the relative simplicity of these channels, their capacity can only be approximated with finite precision by digital computers. In contrast to the problem of computing channel capacity, determining the behavior of the channel reliability function over the entire interval

(0, C) \subset R

is a significantly more complex task. A common approach to this challenge involves considering sequences of upper and lower bounds for

E (R)

(see [13]).

In general, the channel reliability function is a well-studied topic in information theory. Originally introduced and analyzed for discrete memoryless channels (DMCs), the concept has since been significantly extended to various other scenarios and channel models. In [17], the reliability function of a DMC was studied at rates above capacity. Subsequent refinements and theoretical bounds were proposed, such as the Poor–Verdú upper bound addressed in [18]. Extensions beyond DMCs include continuous channels and channels with feedback or secrecy constraints. For instance, upper bounds for Gaussian channels were developed in [19], while the role of feedback in Poisson and Gaussian channels was explored in [20,21]. The impact of signal constraints was analyzed in [22], and improved Gaussian channel bounds were proposed in [23]. Secrecy considerations and cost constraints were incorporated into the analysis of the reliability and secrecy functions in [24]. The reliability function in the presence of side information, as in the Gelfand–Pinsker channel, was considered in [25]. More recently, a new upper bound for DMCs was given in [26], and noisy feedback for binary symmetric channels was studied in [27]. These developments culminated in the analysis of reliability functions in quantum communication settings. Foundational work includes [28,29], and recent advancements include [30].

In this work, we explore whether it is possible to compute the channel reliability function in this manner using a mathematically rigorous formalization of computability. Specifically, our analysis is based on the theory of Turing machines and recursive functions.

In many cases, there is no direct characterization of the behavior of a general function over an abstract set in terms of an algorithm on a Turing machine. Consequently, a common strategy is to approximate the function successively using a sequence of computable upper and lower bounds, for which an algorithm is available. One can then ask the weaker question of whether it is possible to approximate the function in a computable manner. This requires computable sequences of computable upper and lower bounds. This approach is also necessary for the reliability function, and we conducted this analysis. Unfortunately, our results show that the channel reliability function is not a Turing computable performance function when the channel is considered as input.

We also examine several other closely related functions, including the

R_{\infty}

function, the sphere packing bound function, the expurgation bound function, and the zero-error feedback capacity, all of which are closely tied to the reliability function. We treat all of these functions as functions of the channel.

As envisioned, the sixth generation (6G) of mobile networks will introduce a wide range of new features [31]. These innovations bring new challenges to the design of wireless communication systems. Specifically, the Tactile Internet will enable not only the control of data but also the manipulation of physical and virtual objects [31]. With such applications, there arises an increased need to ensure the trustworthiness of the system and its services [32,33].

6G will impose more diverse and demanding quality-of-service (QoS) requirements on network resilience, reliability, service availability, and delay [31]. The channel reliability function plays a vital role in the reliability and delay performance analysis of communication systems. It is therefore of interest to explore whether the reliability and delay performance of communication systems can be verified automatically on digital hardware [33]. Analyzing the channel reliability function with respect to Turing computability becomes crucial in this context. The question of Turing computability for performance functions is a central issue in information theory, as closed-form expressions are only known for a few performance functions. It is therefore important to compute corresponding performance functions on available computers with provable performance, ensuring the strict requirements for future communication systems [31,33].

The structure of this paper is as follows. In Section 2, we begin by presenting the basic definitions and known results that will be used throughout the paper. Section 3 focuses on the

R_{\infty}

function. We examine the decidability of connected sets with the

R_{\infty}

function and demonstrate that only an approximation from below is possible. This has implications for the sphere packing bound, and we show that it is not a Turing computable performance function.

In Section 4, we analyze the reliability function and prove that it is also not Turing computable. The same result holds for the expurgation bound. In Section 5, we investigate the zero-error feedback capacity, which is closely related to the

R_{\infty}

function. We first address a question posed by Alon and Lubetzky in [34] regarding the zero-error capacity with feedback, specifically for the case without feedback (which was examined in [35]). We then show that the zero-error feedback capacity is not Banach–Mazur computable and cannot be approximated by computable increasing sequences of computable functions. Additionally, we characterize the superadditivity of the zero-error feedback capacity and demonstrate that the

R_{\infty}

function is additive.

In Section 6, we analyze the behavior of the expurgation bound rates. Finally, we conclude by summarizing the implications of our results for the channel reliability function. Our findings indicate that, in general, there cannot be a simple recursive closed-form expression for the channel reliability function over a very precise interval.

Some of the results in this paper were presented at the IEEE International Symposium on Information Theory in Espoo, as noted in [36].

2. Definitions and Basic Results

2.1. Basic Concepts of Computability Theory

In this section, we present the basic definitions and results from computability theory that are necessary for this work. We begin with the fundamental definitions of computability, starting with the concept of a Turing machine [14].

A Turing machine serves as a mathematical model for what we intuitively understand as computation machines. In this sense, they provide an abstract idealization of modern-day computers. Any algorithm that can be executed by a real-world computer can, in principle, be simulated by a Turing machine, and vice versa. However, unlike real-world computers, Turing machines are not constrained by limitations such as energy consumption, computation time, or memory size. Furthermore, all computation steps on a Turing machine are assumed to be executed flawlessly, with no possibility of error.

Recursive functions, more specifically known as μ-recursive functions, form a special subset of the set

⋃_{n = 0}^{\infty} \{f : N^{n} ↪ N\}

, where the symbol “↪” denotes a partial mapping. The set of recursive functions provides an alternative characterization of the notion of computability. Turing machines and recursive functions are equivalent in the following sense: a function

f : N^{n} ↪ N

is computable by a Turing machine if and only if it is a partial recursive function.

Next, we introduce several key definitions from computable analysis [37,38,39], which we will apply in the subsequent sections.

Definition 1.

A sequence of rational numbers

{r_{n}}_{n \in N}

is called a computable sequence if there exist recursive functions

a, b, s : N \to N

with

b (n) \neq 0

for all

n \in N

and

r_{n} = {(- 1)}^{s (n)} \frac{a (n)}{b (n)}, n \in N .

Definition 2.

We say that a computable sequence

{r_{n}}_{n \in N}

of rational numbers converges effectively, i.e., computably, to a number x, if a recursive function

a : N \to N

exists such that

| x - r_{n} | < \frac{1}{2^{N}}

for all

N \in N

and all

n \in N

with

n \geq a (N)

applies.

We can now introduce computable numbers.

Definition 3.

A real number x is said to be computable if there exists a computable sequence of rational numbers

{r_{n}}_{n \in N}

, such that

| x - r_{n} | < 2^{- n}

for all

n \in N

. We denote the set of computable real numbers using

R_{c}

.

Next, we need suitable subsets of the natural numbers.

Definition 4.

A set

A \subset N

is called recursive if there exists a recursive function f, such that

f (x) = 1

if

x \in A

and

f (x) = 0

if

x \in A^{c}

, where

A^{c}

stands for the complement set of A.

Definition 5.

A set

A \subset N

is recursively enumerable if there exists a recursive function whose domain is exactly A.

Remark 1.

For the definition of recursive and partial recursive functions, see [37]. Recursive functions

f : N \to N

are the building blocks to develop the framework for computing theory on rational numbers, on real numbers, and on related functions defined over these number fields. This theory captures exactly what can be achieved in theory with digital computers in these number fields. We next introduce the concept of computable performance functions on the basis of computability theory. It is important to note that computability theory formalizes exactly what is computable with perfect digital computers.

2.2. Basic Concepts of Information Theory

To define the reliability function and its related functions, we first need the definition of a discrete memoryless channel. In the theory of transmission, the receiver must be in a position to successfully decode all the messages transmitted by the sender.

Let

X

be a finite alphabet, and denote the set of all probability distributions on

X

using

P (X)

. We define the set of computable probability distributions,

P_{c} (X)

, as the subset of

P (X)

consisting of all distributions

P \in P (X)

for which

P (x) \in R_{c}

holds for all

x \in X

.

Furthermore, for finite alphabets

X

and

Y

, let

CH

denote the set of all conditional probability distributions (or channels)

P_{Y | X} : X \to P (Y)

. We define

{CH}_{c}

as the set of computable conditional probability distributions, i.e., those for which

P_{Y | X} (\cdot | x) \in P_{c} (Y)

holds for every

x \in X

.

Let

M \subset {C H}_{c} (X, Y)

. We call M semi-decidable if and only if there is a Turing machine

T M_{M}

that either stops or computes forever, depending on whether

W \in M

is true. This means

T M_{M}

exactly accepts the elements of M, and for an input

W \in M^{c} = {C H}_{c} (X, Y) ∖ M

, it computes forever.

Definition 6.

A discrete memoryless channel (DMC) is a triple

(X, Y, W)

, where

X

is the finite input alphabet,

Y

is the finite output alphabet, and

W (y | x) \in C H (X, Y)

, with

x \in X

,

y \in Y

. The probability that a sequence

y^{n} \in Y^{n}

is received if

x^{n} \in X^{n}

was sent is defined by

W^{n} (y^{n} | x^{n}) = \prod_{j = 1}^{n} W (y_{j} | x_{j}) .

Definition 7.

A (deterministic) block code

C (n)

with rate R and block length n consists of

A message set $M = {1, 2, . . ., M}$ with $M = 2^{n R} \in N$ ;
An encoding function $e : M \to X^{n}$ ;
aAdecoding function $d : Y^{n} \to M$ .

We call such a code an

(R, n)

-code.

Definition 8.

Let

(X, Y, W)

be a DMC. Using

C (n)

, we denote a block code with the block length n and message set

M

.

1.: The individual message probability of error is defined by the conditional probability of error, given that message $m \in M$ is transmitted:

$P_{e} (C (n), W, m) = P r {d (Y^{n}) \neq m | X^{n} = e (m)} .$
2.: We define the average probability of error by

$P_{e, av} (C (n), W) = \frac{1}{| M |} \sum_{m \in M} P_{e} (C (n), W, m) .$

$P_{e, av} (W, R, n)$ denotes the minimum error probability $P_{e, av} (C (n), W)$ over all block codes $C (n)$ of block length n and with message set $M = 2^{n R}$ .
3.: We define the maximal probability of error by

$P_{e, max} (C (n), W) = max_{m \in M} P_{e} (C (n), W, m) .$

$P_{e, max} (W, R, n)$ denotes the minimum error probability $P_{e, max} (C (n), W)$ over all block codes $C (n)$ of block length n and with message set $M = 2^{n R}$ .
4.: The Shannon capacity for a channel $W \in C H (X, Y)$ is defined by

$C (W) : = sup {R : lim_{n \to \infty} P_{e, max} (W, R, n) = 0} .$
5.: The zero-error capacity for a channel $W \in C H (X, Y)$ is defined by

$C_{0} (W) : = sup {R : P_{e, max} (W, R, n) = 0 for some n} .$

Remark 2.

For R with

C_{0} (W) < R < C (W)

, there exists

A (W, R), B (W, R) \in R^{+}

, such that

2^{- n A (W, R) + o (1)} \leq P_{e, max} (W, R, n) \leq 2^{- n B (W, R) + o (1)} .

We also define the discrete memoryless channel with noiseless feedback (DMCF). By this, we mean that, in addition to the DMC, there exists a return channel that sends the element of

Y

actually received back from the receiving point to the transmitting point. It is assumed that this information is received at the transmitting point before the next letter is sent and can therefore be used to choose the next letter to be sent. We assume that this feedback is noiseless. We denote the feedback capacity of a channel W by

C^{F B} (W)

and the zero-error feedback capacity by

C_{0}^{F B} (W)

. Shannon proved in [40] that

C (W) = C^{F B} (W)

. This is, in general, not true for the zero-error capacity. We see that the zero-error (feedback) capacity is related to the reliability function, which we analyze in this paper. It is defined as follows.

Definition 9.

The channel reliability function (error exponent) is defined by

E (W, R) = \underset{n \to \infty}{lim sup} - \frac{1}{n} {log}_{2} P_{e, max} (W, R, n) .

(1)

Remark 3.

We make use of the common convention that

{log}_{2} 0 : = - \infty

.

Remark 4.

We need the lim sup in (1), because it is not known whether the limit value, i.e., the limts on the right-hand side of (1), exist.

The first simple observation is that for

R > C (W)

, we have

E (W, R) = 0

, and if

C_{0} (W) > 0

for

0 \leq R < C_{0} (W)

, we have

E (W, R) = + \infty

. One well-known upper bound is the sphere packing bound, which can be defined as follows (see [10]).

Definition 10.

Let

X, Y

be finite alphabets, and

(X, Y, W)

be a DMC. Then, for all

R \in (0, C (W))

, we define the sphere packing bound function:

E_{S P} (W, R) = sup_{ρ > 0} max_{P \in P (X)} (- log \sum_{y} {(\sum_{x} P (x) W {(y | x)}^{\frac{1}{1 + ρ}})}^{1 + ρ} - ρ R) .

(2)

Theorem 1

(Fano 1961, Shannon, Gallager, Berlekamp 1967). For any DMC W and for all

R \in (0, C (W))

, it holds that

E (W, R) \leq E_{S P} (W, R) .

The sphere packing upper bound is an important upper bound. The following two lower bounds of the reliability function are also very important. In [41], the random coding bound was defined as follows:

Definition 11.

Let

X, Y

be finite alphabets, and

(X, Y, W)

be a DMC. Then, for all

R \in (0, C (W))

, we define the random coding bound function as

\begin{matrix} E_{r} (W, R) & = max_{0 \leq ρ \leq 1} E_{0} (W, ρ) - ρ R, where \end{matrix}

(3)

\begin{matrix} E_{0} (ρ) & = max_{P \in P (X)} [- log \sum_{y} {(\sum_{x} P (x) W {(y | x)}^{1 / (1 + ρ)})}^{1 + ρ}] . \end{matrix}

(4)

Theorem 2.

Let

X, Y

be finite alphabets and

(X, Y, W)

be a DMC; then,

E (W, R) \geq E_{r} (W, R) .

Gallager also defined in [41] the k-letter expurgation bound as follows:

Definition 12.

Let

X, Y

be finite alphabets and

(X, Y, W)

be a DMC; then, for all

R \in (0, C (W))

, we define the k-letter expurgation bound function:

\begin{matrix} E_{ex} (W, R, k) & = sup_{ρ \geq 1} E_{x} (ρ, k) - ρ R \end{matrix}

(5)

\begin{matrix} E_{x} (ρ, k) & = - \frac{ρ}{k} log min_{P_{X^{k}} \in P (X^{k})} Q^{k} (ρ, P_{X^{k}}) \end{matrix}

(6)

\begin{matrix} Q^{k} (ρ, P_{X^{k}}) & = \sum_{x^{k}, x^{' k}} P_{X^{k}} (x^{k}) P_{X^{k}} (x^{' k}) g_{k} {(x^{k}, x^{' k})}^{\frac{1}{ρ}} \end{matrix}

(7)

\begin{matrix} g_{k} (x^{k}, x^{' k}) & = \sum_{y^{k}} \sqrt{W^{k} (y^{k} | x^{k}) W^{k} (y^{k} | x^{' k})} . \end{matrix}

(8)

Theorem 3.

Let

X, Y

be finite alphabets and

(X, Y, W)

be a DMC. Then, for all

R \in (0, C (W))

, we have

E (W, R) \geq lim_{k \to \infty} E_{ex} (W, R, k) .

(9)

The inequality in (9) follows from Fekete’s lemma.

The smallest value of R, at which the convex curve

E_{S P} (W, R)

meets its supporting line of slope -1, is called the critical rate and is denoted by

R_{c r i t}

[9]. For the certain interval

[R_{c r i t}, C]

, the random coding lower bound corresponds to the sphere packing upper bound. The channel reliability function is therefore known for this interval. The channel reliability function is generally not known for the interval

[0, R_{c r i t}]

. For the interval

[0, R_{c r i t}]

, there are also better lower bounds than the random coding lower bound.

R_{\infty} (W)

is the infimum of all rates

\underset{̲}{R}

such that

E_{S P} (W, \underset{̲}{R})

is finite on the open interval

(\underset{̲}{R}, C (W))

.

C_{0} (W) \leq R_{\infty} (W)

applies if

C_{0} (W) > 0

. The following representation of

R_{\infty}

exists (see [9]):

R_{\infty} (W) = min_{Q \in P (Y)} max_{x \in X} {log}_{2} \frac{1}{\sum_{y : W (y | x) > 0} Q (y)} .

(10)

There exist alphabets

X, Y

and channels

W \in CH

such that

C_{0} (W) = 0

while

R_{\infty} (W) > 0

.

Moreover, for the zero-error feedback capacity

C_{0}^{F B}

, it holds that

C_{0}^{F B} (W) = R_{\infty} (W)

whenever

C_{0} (W) > 0

. However, if

C_{0} (W) = 0

, there exists a channel W for which

C_{0}^{F B} (W) = 0

while

R_{\infty} (W) > 0

(see [9]).

For the zero-error feedback capacity, the following is known.

Theorem 4

(Shannon 1956, [40]). Let

W \in C H (X, Y)

; then,

C_{0}^{F B} (W) = \{\begin{matrix} 0 & if C_{0} (W) = 0 \\ max_{P \in P (X)} min_{y} {log}_{2} \frac{1}{\sum_{x : W (y | x) > 0} P (x)} & otherwise . \end{matrix}

(11)

2.3. Lower and Upper Bounds on the Reliability Function for the Typewriter Channel

As mentioned before, Shannon, Gallager, and Berlekamp assumed in [13] that the expurgation is bound tight. Katsman, Tsfasman, and Vladut showed in [42] a counterexample for the symmetric q-ary channel when

q \geq 49

. Dalai and Polyanskiy found a simpler counterexample in [43]. They showed that the conjecture is already wrong for the q-ary typewriter channel for

q \geq 4

. We would like to briefly present their results here.

Definition 13.

Let

X = Y = Z_{q}

and

0 \leq ϵ \leq \frac{1}{2}

. The typewriter channel

W_{ϵ}

is defined by

W_{ϵ} (y | x) = \{\begin{matrix} 1 - ϵ & y = x \\ ϵ & y = x + 1 mod q . \end{matrix}

(12)

The extension of the channel

W_{ϵ}^{n}

is defined by

W_{ϵ}^{n} (y^{n} | x^{n}) = \prod_{k = 1}^{n} W_{ϵ} (y_{i} | x_{i}) .

(13)

For the reliability function of this channel, the interval

(C_{0} (W_{ϵ}), C (W_{ϵ}))

is of interest. The capacity of a typewriter channel

W_{ϵ}

has the formula

C (W_{ϵ}) = log (q) - h_{2} (ϵ),

where

h_{2}

is the binary entropy function. Shannon showed in [40] that

C_{0} (W_{ϵ})

is positive if

q \geq 4

. He showed that for even q, it holds that

C_{0} (W_{ϵ}) = log (\frac{q}{2})

. It is difficult to get a formula for odd q. Lovász proved in [44] that Shannon’s lower bound for

q = 5

:

C_{0} (W_{ϵ}) = log \sqrt{5}

is tight. For general odd q, Lovász proved

C_{0} (W_{ϵ}) \leq log \frac{cos (π q)}{1 + cos (π q)} q .

It is only known for

q = 5

that this bound is tight. In general, this is not true. For special q, there are special results outlined in [44,45,46,47].

Dalai and Polyanskiy provide upper and lower bounds on the reliability function in [43]. They observed that the zero-error capacity of the pentagon can be determined by a careful study of the expurgated bound.

They present an improved lower bound for the case of even and odd q, showing that it also is a precisely shifted version of the expurgated bound for the BSC. Their result also provides a new elementary disproof of the conjecture suggested in [13] that the expurgated bound is asymptotically tight when computed on arbitrarily large blocks. Furthermore, in [43], Dalai and Polyanskiy present a new upper bound for the case of odd q based on the minimum distance of codes. They use Delsarte’s linear programming method [48] (see also [49]), combining the construction used by Lovász [44] for bounding the graph capacity with the construction used by McEliece–Rodemich–Rumsey–Welch [50] for bounding the minimum distance of codes in Hamming spaces. In the special case

ϵ = 1 / 2

, they give another improved upper bound for the case of odd q, following the ideas of Litsyn [51] and Barg–McGregor [52], which in turn are based on estimates for the spectra of codes originated by Kalai–Linial [53].

2.4. Computable Channels and Computable Performance Functions

We need further basic concepts for computability. We want to investigate the function

E (W, R)

and the upper bounds like

E_{S P} (W, R)

and

E_{e x} (W, R)

for

k \in N

as functions of W and R. These functions are generally only well defined for fixed channels W on sub-intervals of

[0, C (W)]

as functions depending on R. For example, for

W \in C H (X, Y)

with

C_{0} (W) > 0

,

E (W, R)

is infinite for

R < C_{0} (W)

. Hence,

E (W, R)

must be examined and computed as a function of R on the interval

(C_{0} (W), C (W)]

. Similar statements also apply to the other functions that have already been introduced. We now fix non-trivial alphabets

X, Y

and the corresponding set

{C H}_{c} (X, Y)

of the computable channels and

R \in R_{c}

.

Definition 14

(Turing computable channel function). We call a function

f : {C H}_{c} (X, Y) \to R_{c}

a Turing computable channel function if there is a Turing machine that converts any program for the representation of

W \in {C H}_{c} (X, Y)

into a program for the computation of

f (W)

—that is,

f (W) = T M_{f} (W)

,

W \in {C H}_{c} (X, Y)

.

We want to determine whether there is a closed form for the channel reliability function. For this, we need the following definition, which we discuss in more detail in Remark 5 below.

Definition 15

(Turing computable performance function). Let ⊥ be a symbol. We call a function

F : C H {(X, Y)}_{c} \times R_{c}^{+} \to R_{c} \cup {⊥}

a Turing computable performance function if there are two Turing computable channel functions

\underset{̲}{f}

and

\bar{f}

with

\underset{̲}{f} (W) \leq \bar{f} (W)

for

W \in {C H}_{c} (X, Y)

, and a Turing machine

T M_{F}

, which is defined for input

R \in R_{c}^{+}

and

W \in {C H}_{c} (X, Y)

. The Turing machine

T M_{F}

stops for the variables

R \in R_{c}^{+}

and

W \in {C H}_{c} (X, Y)

and any representation for W and R as input if and only if

R \in (\underset{̲}{f} (W), \bar{f} (W))

and the Turing machine

T M_{F}

delivers

F (W, R) = T M_{F} (W, R)

. If

R \notin (\underset{̲}{f} (W), \bar{f} (W))

, then

T M_{F}

does not stop.

Remark 5.

The requirement for function

F : C H {(X, Y)}_{c} \times R_{c}^{+} \to R_{c} \cup {⊥}

to be a Turing computable performance function is relatively weak. For example, let us take W and R as inputs. Then, the interval

(\underset{̲}{f} (W), \bar{f} (W))

is computed first. If R is now in the interval

((\underset{̲}{f} (W), \bar{f} (W))

, then the Turing machine

T M_{F}

must stop for the input

(W, R)

and deliver the result for

F (W, R)

. We impose no requirements on the behavior of the Turing machine for input W and

R \notin (\underset{̲}{f} (W), \bar{f} (W))

. In particular, the Turing machine

T M_{F}

does not have to stop for the input

(W, R)

in this case.

Take, for example, any Turing computable function

G : C H {(X, Y)}_{c} \times R_{c}^{+} \to R_{c} \cup {⊥}

with the corresponding Turing machine

T M_{G}

. Furthermore, let

\underset{̲}{T M} : {C H}_{c} (X, Y) \to R_{c}

and

\bar{T M} : {C H}_{c} (X, Y) \to R_{c}

be any two TMs, so that

\underset{̲}{T M} (W) \leq \bar{T M} (W)

always holds for all

W \in {C H}_{c} (X, Y)

. Then, the following Turing machine

T M : {C H}_{c} (X, Y) \times R_{c} \to R_{c} \cup {⊥}

defines a Turing computable performance function.

1.

For any input

W \in {C H}_{c} (X, Y)

and

R \in R_{c}

, first compute

\underset{̲}{f} (W) = \underset{̲}{T M} (W)

and

\bar{f} (W) = \bar{T M} (W)

.

2.

Compute the following two tests in parallel:

(a): Use the Turing machine $T M_{> \underset{̲}{f} (W)}$ and test $R > \underset{̲}{f} (W)$ using $T M_{> \underset{̲}{f} (W)}$ for input $R \in R_{c}$ .
(b): Use the Turing machine $T M_{< \bar{f} (W)}$ and test $R < \bar{f} (W)$ using $T M_{< \bar{f} (W)}$ for input $R \in R_{c}$ .

Let these two tests run until both Turing machines stop. If both Turing machines stop in 2, then compute

G (W, R)

and set

T M (W, R) = G (W, R)

.

T M

actually generates a Turing computable performance function, and the Turing machine

T M

stops for the input

(W, R)

if and only if

R \in (\underset{̲}{f} (W), \bar{f} (W))

applies. Then, it gives the value

G (W, R)

as output. This follows from the fact that the Turing machine

T M_{> \underset{̲}{f} (W)}

stops for input

R \in R_{c}

if and only if

R > \underset{̲}{f} (W)

. The second Turing machine

T M_{< \bar{f} (W)}

from 2 stops exactly when

R < \bar{f} (W)

, i.e. the Turing machine

T M

in 2., which simulates

T M_{> \underset{̲}{f} (W)}

and

T M_{< \bar{f} (W)}

in parallel, stops exactly when

R \in (\underset{̲}{f} (W), \bar{f} (W))

applies.

Remark 6.

Using the above approach, we can try, for example, to find upper and lower bounds for the channel reliability function by allowing general Turing computable functions

G : C H {(X, Y)}_{c} \times R_{c}^{+} \to R_{c} \cup {⊥}

and algorithmically determine the interval from

R_{c}^{+}

for which the function

G (W, \cdot)

delivers lower or upper bounds for the channel reliability function.

Definition 16

(Banach–Mazur computable channel function). We call

f : {C H}_{c} (X, Y) \to R_{c}

a Banach–Mazur computable channel function if every computable sequence

{W_{r}}_{r \in N}

from

{C H}_{c} (X, Y)

is mapped by f into a computable sequence from

R_{c}

.

For practical applications, it is necessary to have performance functions that satisfy Turing computability. Depending on W, the channel reliability function or the bounds for this function should be computed. This computation is carried out by an algorithm that also receives W as input. This means that the algorithm should also be recursively dependent on W; otherwise, a special algorithm would have to be developed for each W (depending on W but not recursively dependent), since the channel reliability function for this channel, or a bound for this function, is computed.

It is now clear that when defining the Turing computable performance function, the Turing computable channel functions

\underset{̲}{f}, \bar{f}

cannot be dispensed with, because the channel reliability function depends on the specific channel and the permissible rate region for which the function can be computed. For

\bar{f}

, one often has the representation

\bar{f} (W) = C (W)

with

W \in {C H}_{c} (X, Y)

. For

\underset{̲}{f}

, the choice

\underset{̲}{f} (W) = C_{0} (W)

with

W \in {C H}_{c} (X, Y)

for the channel reliability function is a natural choice, because the channel reliability function is only useful for this interval. (We note that we showed in [35] that

C_{0} (W)

is not Turing computable in general.)

For the Turing computability of the channel reliability function or corresponding upper and lower bounds, it is therefore a necessary condition that the dependency of the relevant rate intervals on W be Turing computable—that is, recursive.

Remark 7.

As noted in the Introduction, very few closed-form expressions for performance functions are known in information theory. Even for relatively simple scenarios, such as secure message transmission over a wiretap channel with an active jammer, closed-form solutions are not available (see [54,55,56]). Existing methods in information theory provide convergent multi-letter sequences for determining capacity. While these sequences enable the investigation of important properties of the capacity (see [54,57,58]), they are not yet suitable for direct numerical computation of the capacity. This is due to the reliance on Fekete’s lemma to prove the existence of the limit of these sequences. However, it was shown in [59] that Fekete’s lemma is not constructive, meaning no algorithm can effectively compute the associated limit values.

Moreover, the problem of finding simple optimizers for performance functions is generally not algorithmically solvable [60,61]. For instance, the Blahut–Arimoto algorithm can be used to compute an infinite sequence of input distributions that converge to an optimal distribution. However, there is no way to halt the process based on a reliable approximation error, making it impossible to stop the computation at a specific point (see [60,61]).

3. Results for the Rate Function $R_{\infty}$ and Applications on the Sphere Packing Bound

In this section, we analyze the function

R_{\infty}

and its implications for the sphere packing bound. Specifically, we demonstrate that

R_{\infty}

is not a Turing computable performance function.

We begin by expressing

R_{\infty} (W)

as

R_{\infty} (W) = min_{Q \in P (Y)} max_{x \in X} {log}_{2} \frac{1}{\sum_{y : W (y | x) > 0} Q (y)} .

(14)

From this, we derive the equivalent representations:

\begin{matrix} R_{\infty} (W) & = & min_{Q \in P (Y)} max_{x \in X} {log}_{2} \frac{1}{\sum_{y : W (y | x) > 0} Q (y)} \\ = & min_{Q \in P (Y)} {log}_{2} \frac{1}{{min}_{x \in X} \sum_{y : W (y | x) > 0} Q (y)} \\ = & {log}_{2} min_{Q \in P (Y)} \frac{1}{{min}_{x \in X} \sum_{y : W (y | x) > 0} Q (y)} \\ = & {log}_{2} \frac{1}{{max}_{Q \in P (Y)} {min}_{x \in X} \sum_{y : W (y | x) > 0} Q (y)} \\ = & {log}_{2} \frac{1}{Ψ_{\infty} (W)}, \end{matrix}

where

\begin{matrix} Ψ_{\infty} (W) = max_{Q \in P (Y)} min_{x \in X} \sum_{y : W (y | x) > 0} Q (y) . \end{matrix}

(15)

In summary, the following holds true: let

X, Y

be arbitrary non-trivial finite alphabets; then, for

W \in {C H}_{c} (X, Y)

R_{\infty} (W) = {log}_{2} \frac{1}{Ψ_{\infty} (W)} .

(16)

Lemma 1.

It holds that

R_{\infty} : {C H}_{c} (X, Y) \to R_{c} .

Proof.

Let W be fixed. We consider the vector

{(\begin{matrix} Q (1) & \dots & Q (| Y |) \end{matrix})}^{T}

of the convex set

M_{P r o b} = {u \in R^{| Y |} : u = (\begin{matrix} u_{1} \\ ⋮ \\ u_{| Y |} \end{matrix}), u_{l} \geq 0, l = 1, \dots, | Y |, \sum_{l} u_{l} = 1} .

G (u) : = {min}_{x} \sum_{y : W (y | x) > 0} u_{y}

is a computable continuous function on

M_{P r o b}

. Thus, for

Ψ_{\infty} (W) = {max}_{u \in M_{P r o b}} G (u)

, we always have

Ψ_{\infty} (W) \in R_{c}

with

Ψ_{\infty} (W) > 0

, and thus

R_{\infty} (W) \in R_{c}

. □

Remark 8.

We do not know whether

C_{0} : {C H}_{c} (X, Y) \to R_{c}

holds for any finite

X, Y

. This statement holds for

max {| X |, | Y |} \leq 5

, but the general case is open.

For finite alphabets

X, Y

and

λ \in R_{c}

with

λ > 0

, we want to analyze the set

{W \in {C H}_{c} (X, Y) : R_{\infty} (W) > λ} .

To accomplish this, we refer to the proof of Theorem 23 in [35]. Along the same lines, one can show that the following holds true:

Theorem 5.

Let

X, Y

be non-trivial finite alphabets. For all

λ \in R_{c}

with

0 < λ < {log}_{2} (min {| X |, | Y |})

, the set

{W \in {C H}_{c} (X, Y) : R_{\infty} (W) > λ}

is not semi-decidable.

The following theorem can be derived from a combination of the proof of Theorem 5 and Theorem 24 in [35]. The proof is carried out in the same way as the proof of Theorem 24 in [35].

Theorem 6.

Let

X, Y

be non-trivial finite alphabets. The function

R_{\infty} : {C H}_{c} (X, Y) \to R

is not Banach–Mazur computable.

We now prove a stronger result then what we were able to show for

C_{0}

in [35] so far. We show that the analogous question, like the question in [34] for

C_{0}

for the function

R_{\infty}

, can be answered positively.

We need a concept of distance for

W_{1}, W_{2} \in C H (X, Y)

. Therefore, for fixed and finite alphabets

X, Y

, we define the distance between

W_{1}

and

W_{2}

based on the total variation distance

d_{C} (W_{1}, W_{2}) = max_{x \in X} \sum_{y \in Y} | W_{1} (y | x) - W_{2} (y | x) | .

(17)

Definition 17.

A function

f : C H (X, Y) \to R

is called computable continuously if the following are true:

1.: f is sequentially computable, i.e., f maps every computable sequence ${W_{n}}_{n \in N}$ with $W_{n} \in {C H}_{c} (X, Y)$ into a computable sequence ${f (W_{n})}_{n \in N}$ of computable numbers,
2.: f is effectively uniformly continuous, i.e., there is a recursive function $d : N \to N$ such that for all $W_{1}, W_{2} \in {C H}_{c} (X, Y)$ and all $N \in N$ with $d_{C} (W_{1}, W_{2}) \leq \frac{1}{d (N)}$ , it holds that $| f (W_{1}) - f (W_{2}) | \leq \frac{1}{2^{N}} .$

Theorem 7.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. There exists a computable sequence of computable continuous functions

{F_{N}}_{N \in N}

on

{C H}_{c} (X, Y)

with

1.: $F_{N} (W) \geq F_{N + 1} (W)$ with $W \in C H (X, Y)$ and $N \in N$ ,
2.: ${lim}_{N \to \infty} F_{N} (W) = R_{\infty} (W)$ for all $W \in C H (X, Y)$ .

Proof.

We consider the function

Φ_{N} (W) = max_{Q \in P (Y)} min_{x \in X} \sum_{y \in Y} \frac{N W (y | x)}{1 + N W (y | x)} Q (y)

for

N \in N

. For all

x \in X

we have for all

Q \in P (Y)

\sum_{y \in Y} \frac{N W (y | x)}{1 + N W (y | x)} Q (y) \leq \sum_{y \in Y : W (y | x) > 0} Q (y),

(18)

and for all

N \in N

, we have for all

x \in X

and

Q \in P (Y)

\sum_{y \in Y} \frac{N W (y | x)}{1 + N W (y < x)} Q (y) \leq \sum_{y \in Y : W (y | x) > 0} \frac{(N + 1) W (y | x)}{1 + (N + 1) W (y | x)} Q (y) .

(19)

Φ_{N}

is a computable continuous function, and

{Φ_{N}}_{N \in N}

is a computable sequence of computable continuous functions. So,

F_{N} (W) = {log}_{2} \frac{a}{Φ_{N} (W)},

for

N \in N

and

W \in C H (X, Y)

.

F_{N}

satisfies all properties of the theorem, and point 1 is shown.

It holds

\begin{matrix} |\sum_{y \in Y : W (y | x) > 0} Q (y) - \sum_{y \in Y} \frac{N W (y | x)}{1 + N W (y | x)} Q (y)| \\ = & |\sum_{y \in Y : W (y | x) > 0} \frac{1}{1 + N W (y | x)} Q (y)| \\ \leq & \frac{1}{1 + N {min}_{y \in Y : W (y | x) > 0} W (y | x)} . \end{matrix}

Therefore, we have

\begin{matrix} \sum_{y \in Y : W (y | x) > 0} Q (y) & \leq & \frac{1}{1 + N {min}_{y \in Y : W (y | x) > 0} W (y | x)} \\ + \sum_{y \in Y} \frac{N W (y | x)}{1 + N W (y | x)} Q (y) . \end{matrix}

(20)

Because of (18), we have

Φ_{N} (W) \leq Ψ_{\infty} (W)

for all

W \in {C H}_{c} (X, Y)

. (20) yields

\begin{matrix} \sum_{y \in Y : W (y | x) > 0} Q (y) & \leq & \frac{1}{1 + N {min}_{x \in X} ({min}_{y \in Y : W (y | x) > 0} W (y | x))} \\ + \sum_{y \in Y} \frac{N W (y | x)}{1 + N W (y | x)} Q (y) . \end{matrix}

So,

\begin{matrix} min_{x \in X} \sum_{y \in Y : W (y | x) > 0} Q (y) & \leq & \frac{1}{1 + N {min}_{x \in X} ({min}_{y \in Y : W (y | x) > 0} W (y | x))} \\ + min_{x \in X} \sum_{y \in Y} \frac{N W (y | x)}{1 + N W (y | x)} Q (y) \end{matrix}

and

\begin{matrix} Ψ_{\infty} (W) & \leq & \frac{1}{1 + N {min}_{x \in X} ({min}_{y \in Y : W (y | x) > 0} W (y | x))} + Φ_{N} (W) \end{matrix}

holds. So, we have

0 \leq Ψ_{\infty} (W) - Φ_{N} (W) \leq \frac{1}{1 + N {min}_{x \in X} ({min}_{y \in Y : W (y | x) > 0} W (y | x))} .

□

We now want to prove that the corresponding question in [34] can be answered positively for

R_{\infty}

.

Theorem 8.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. For all

λ \in R_{c}

with

0 < λ < {log}_{2} (min {| X |, | Y |})

, the set

{W \in {C H}_{c} (X, Y) : R_{\infty} (W) < λ}

is semi-decidable.

Proof.

We use the computable sequences of computable continuous functions

F_{N}

from Theorem 7. It holds that

W \in {W \in {C H}_{c} (X, Y) : R_{\infty} (W) < λ}

if and only if there is an

N_{0}

such that

F_{N_{0}} < λ

holds. As in the proof of Theorem 28 from [35], we now use the construction of a Turing machine

T M_{R_{\infty}, < λ}

, which exactly accepts the set

{W \in {C H}_{c} (X, Y) : R_{\infty} (W) < λ} .

□

We now consider the approximability “from below” (this can be seen as a kind of reachability). We have shown that

R_{\infty} (\cdot)

can always be represented as a limit value of monotonically decreasing computable sequences of computable continuous functions. From this, it can be concluded that the sequence is then also a computable sequence of Banach–Mazur computable functions. We now have the following:

Theorem 9.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. There does not exist a sequence of Banach–Mazur computable functions

{F_{N}}_{N \in N}

with

1.: $F_{N} (W) \leq F_{N + 1} (W)$ with $W \in {C H}_{c} (X, Y)$ and $N \in N$ ;
2.: ${lim}_{N \to \infty} F_{N} (W) = R_{\infty} (W)$ for all $W \in C H (X, Y)$ .

Proof.

We assume that such a sequence

{F_{N}}_{N \in N}

does exist. Then, from Theorem 7 and the assumptions from this theorem, it can be concluded that

R_{\infty}

is a Banach–Mazur computable function. This has created a contradiction. □

With this, we immediately get the following:

Corollary 1.

Consider finite alphabets

X, Y

with

| X | \geq 2, | Y | \geq 2

, and let

{F_{N}}_{N \in N}

be a sequence of Banach–Mazur computable functions that satisfies the following:

1.: $F_{N} (W) \leq F_{N + 1} (W)$ with $W \in {C H}_{c} (X, Y)$ and $N \in N$ ,
2.: ${lim}_{N \to \infty} F_{N} (W) = R_{\infty} (W)$ for all $W \in C H (X, Y)$ .

Then, there exists

\hat{W} \in {C H}_{c} (X, Y)

such that

{lim}_{N \to \infty} F_{N} (\hat{W}) < R_{\infty} (\hat{W})

holds true.

We now want to apply the results for

R_{\infty}

to the sphere packing bound as an application. With the results via the rate function, we immediately get

Theorem 10.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. The sphere packing bound

E_{S P} (\cdot, \cdot)

is not a Turing computable performance function for

{C H}_{c} (X, Y) \times R_{c}^{+}

.

Proof.

Assuming that the statement of the theorem is incorrect, then

R_{\infty}

is a Turing computable performance function on

{C H}_{c} (X, Y) \times R_{c}^{+}

. But then the channel functions

\underset{̲}{f} (W) = R_{\infty} (W)

for

W \in {C H}_{c} (X, Y)

and

\bar{f} (W) = C (W)

for

W \in {C H}_{c} (X, Y)

must be Turing computable channel functions. As was already shown, however,

R_{\infty}

is not Banach–Mazur computable. We have thus created a contradiction. □

4. Computability of the Channel Reliability Function and the Sequence of Expurgation Bound Functions

In this section, we consider the reliability function and the expurgation bound and show that these functions are not Turing computable performance functions.

With the help of the results from [35] for

C_{0}

for noisy channels, we immediately get the following theorem:

Theorem 11.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. The channel reliability function

E (\cdot, \cdot)

is not a Turing computable performance function for

{C H}_{c} (X, Y) \times R_{c}

.

Proof.

Here,

\underset{̲}{f} (W) = C_{0} (W)

for

W \in {C H}_{c} (X, Y)

is a Turing computable function, according to Definition 14. We already know that

C_{0}

is not Banach–Mazur computable on

{C H}_{c} (X, Y)

. This gives the proof in the same way as for the sphere packing bound, i.e., the proof of Theorem 10. □

Now, we consider the rate function for the expurgation bound. The k-letter expurgation bound

E_{e x} (W, R, k)

as a function of W and R is a lower bound for the channel reliability function. The latter can only be finite for certain intervals

(R_{k}^{e x} (W), C (W))

. Thus, we want to compute the function in these intervals. In their famous paper [13], Shannon, Gallager, and Berlekamp examined the sequence of functions

{E_{e x} (\cdot, \cdot, k)}_{k \in N}

and analyzed the relationship to the channel reliability function. They conjectured that for all

W \in C H (X, Y)

for all R with

E (W, R) < + \infty

(one would have convergence and also

E_{e x} (W, R, k) < + \infty

), the relation

lim_{k \to \infty} E_{e x} (W, R, k) = E (W, R)

holds. This conjecture was first refuted in [42] and later refuted by a simpler example in [43].

It was already clear with the introduction of the channel reliability function that it had a complicated behavior. A closed-form formula for the channel reliability function is not yet known, and the results of this paper show that such a formula cannot exist. Shannon, Gallager, and Berlekamp tried in [13] in 1967 to find sequences of seemingly simple formulas for the approximation of the channel reliability function. It seems that they considered the sequence of the k-letter expurgation bounds to be very good channel data for its approximation. It was hoped that these sequences could be computed more easily with the use of new powerful digital computers.

Let us now examine the sequence

{E_{e x} (\cdot, \cdot, k)}_{k \in N}

. We have already introduced the concept of computable sequences of computable continuous channel functions. We now introduce the concept of computable sequences of Turing computable performance functions.

Definition 18.

A sequence

{F_{k}}_{k \in N}

of Turing computable performance functions is called a computable sequence if there is a Turing machine that generates the description of

F_{k}

for input k according to the definition of the function

F_{k}

for the values for which the function is defined.

In the following theorem, we prove that the sequence of the k-letter expurgation bounds is not a computable sequence of computable performance functions. So, the hope mentioned above cannot be fulfilled.

Theorem 12.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. The sequence of the expurgation lower bounds

{E_{e x} (\cdot, \cdot, k)}_{k \in N}

is not a computable sequence of Turing computable performance functions.

Proof.

We prove the theorem by contradiction, assuming that there exists a Turing machine

T M_{*}

that generates a description of the function

E_{e x} (\cdot, \cdot, k)

for a given input k, as defined in its formulation. This implies that the sequence

{R_{k}^{e x}}_{k \in N}

is computable, since we have an algorithm that can generate each function in the sequence.

Notably, we can express

{\underset{̲}{f}}_{k} (\cdot)

as

R_{k}^{e x} (\cdot)

. Given an input k, the Turing machine

T M_{*}

produces the description of

E_{e x} (\cdot, \cdot, k)

, from which

R_{k}^{e x}

can be directly obtained via projection (in the sense of primitive recursive functions).

According to Shannon, Gallager, and Berlekamp [13], the following limit holds:

lim_{k \to \infty} R_{k}^{e x} (W) = C_{0} (W)

for all

W \in C H (X, Y)

. Furthermore, the sequence

{R_{k}^{e x} (W)}_{k \in N}

is monotonically increasing, i.e.,

R_{k}^{e x} (W) \leq R_{k + 1}^{e x} (W) for all k \in N and W \in C H (X, Y) .

Let us consider the set

{W \in {C H}_{c} (X, Y) : C_{0} (W) > λ}

for

λ \in R_{c}

with

0 < λ < {log}_{2} (min {| X |, | Y |})

. We are now constructing a Turing machine

T M_{*}

with only one holding state, “stop”, which means that it either stops or computes forever.

T M_{*}

should stop for input

W \in {C H}_{c} (X, Y)

if and only if

C_{0} (W)

applies, that is,

T M_{*}

stops if W is in the above set. According to the assumption,

{R_{k}^{e x} (\cdot)}_{k \in N}

is a computable sequence of Turing computable channel functions. For the input W, we can generate the computable sequence

{R_{k}^{e x} (W)}_{k \in N}

of computable numbers. We now use the Turing machine

T M_{λ}^{1}

, which receives an arbitrary computable number x as input and stops if and only if

x > λ

, i.e.,

T M_{λ}^{1}

has only one hold state and accepts exactly the computable numbers x as input for which

x > λ

holds. We now use this program for the following algorithm.

We start with $l = 1$ and let $T M_{λ}^{1}$ compute one step for input $R_{1}^{e x} (W)$ . If $T M_{λ}^{1} (R_{1}^{e x} (W))$ stops; then, we stop the algorithm.
If $T M_{λ}^{1} (R_{1}^{e x} (W))$ does not stop, we set $l = l + 1$ and compute $l + 1$ steps $T M_{λ}^{1} (R_{r}^{e x} (W))$ for $1 \leq r \leq l + 1$ . If one of these Turing machines stops, then the algorithm stops; if not, we set $l = l + 1$ and repeat the second computation.

The above algorithm stops if and only if there is a

\hat{k} \in N

such that

R_{e x}^{\hat{k}} (W) > λ

. But this is the case (because of the monotony of the sequence

{R_{k}^{e x} (W)}_{k \in N}

) if and only if

C_{0} (W) > λ

. But with this, the set

{W \in {C H}_{c} (X, Y) : C_{0} (W) > λ}

is semi-decidable. So, we have shown that this is not the case. We have thus created a contradiction. □

5. Computability of the Zero-Error Capacity of Noisy Channels with Feedback

In this section, we consider the zero-error capacity for noisy channels with feedback. In our paper [35], we examined the properties of the zero-error capacity without feedback. Let

W \in C H (X, Y)

. We already noted that Shannon showed in [40] that

C_{0}^{F B} = \{\begin{matrix} 0 & if C_{0} (W) = 0 \\ max_{P} min_{y} {log}_{2} \frac{1}{\sum_{x : W (y | x) > 0} P (x)} & o t h e r w i s e . \end{matrix}

(21)

From (15), recall that

Ψ_{\infty} (W) = max_{p \in P (X)} min_{y \in Y} \sum_{x : W (y | x) > 0} P (x) .

(22)

Then, we have for W with

C_{0} (W) \neq 0

,

C_{0}^{F B} = {log}_{2} \frac{1}{Ψ_{\infty} (W)} .

We know that

C_{0}^{F B} (W) = R_{\infty} (W)

if

C_{0} (W) > 0

. If

C_{0} (W) = 0

, then there is a channel W with

C_{0}^{F B} (W) = 0

and

R_{\infty} > 0

. Like in Lemma 1, we can show the following:

Lemma 2.

Let

X, Y

be finite non-trivial alphabets. It holds that

C_{0}^{F B} : {C H}_{c} (X, Y) \to R_{c} .

From Theorem 5 and the relationship between

C_{0}

and

C_{0}^{F B}

, we get the following results for

C_{0}^{F B}

, which we have already proved for

C_{0}

in [35].

Theorem 13.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. For all

λ \in R_{c}

with

0 \leq λ < {log}_{2} min {| X |, | Y |}

, the sets

{W \in {C H}_{c} (X, Y) : C_{0}^{F B} (W) > λ}

are not semi-decidable.

Theorem 14.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. Then,

C_{0}^{F B} : {C H}_{c} (X, Y) \to R

is not Banach-Mazur computable.

Now, we will prove the following:

Theorem 15.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. There is a computable sequence of computable continuous functions G with

1.: $G_{N} (W) \geq G_{N + 1} (W)$ for $W \in C H (X, Y)$ and $N \in N$ ;
2.: ${lim}_{n \to \infty} G_{N} (W) = C_{0}^{F B} (W)$ for $W \in C H (X, Y)$ .

Proof.

We use for

N \in N

,

y \in Y

and

P \in P (X)

the function

\sum_{x \in X} \frac{N W (y | x)}{1 + N W (y | x)} P (x) .

Then, for

Φ_{N} (W) = min_{P \in P (X)} max_{y \in Y} \sum_{x \in X : W (y | x) > 0} P (x),

we have the same properties as in Theorem 7 and

U_{N} (W) = {log}_{2} \frac{1}{Φ_{n} (W)}

is an upper bound for

C_{0}^{F B}

, which is monotonically decreasing. Now, the relation

C_{0}^{F B} (W) > 0

holds for

W \in C H (X, Y)

if and only if there are two

x_{1}, x_{2} \in X

so that

\sum_{y \in Y} W (y | x_{1}) W (y | x_{2}) = 0

holds. We now set

g (\hat{x}, x) = \sum_{y \in Y} W (y | \hat{x}) W (y | x) = g (W, \hat{x}, x)

and have

0 \leq g (\hat{x}, x) \leq 1

for

x, \hat{x} \in X

. g is a computable continuous function with respect to

W \in C H (X, Y)

. Now, we set

V_{N} (W) = (1 - \prod_{x, \hat{x}} g {(W, \hat{x}, x)}^{N}) U_{N} (W)

for

N \in N

.

{V_{N}}_{N \in N}

is thus a computable sequence of computable continuous functions. Obviously,

V_{N} (W) \geq V_{N + 1} (W)

for

W \in C H (X, Y)

and

N \in N

is satisfied.

{(1 - \prod_{x, \hat{x}} g (W, x, h x))}^{N} = 1

if and only if

C_{0}^{F B} > 0

. So, for

C_{0}^{F B} (W) = 0

, we always have

lim_{N \to \infty} V_{N} (W) = 0 .

For W with

C_{0}^{F B} (W)

,

lim_{N \to \infty} V_{N} (W) = lim_{N \to \infty} U_{N} (W) = C_{0}^{F B} (W) .

This is shown in the proof of Theorem 7. □

This immediately gives us the following theorem.

Theorem 16.

Let

X, Y

be finite alphabets with

| X | \geq 2

and

| Y | \geq 2

. For all

λ \in R_{c}

with

0 \leq λ < {log}_{2} min {| X |, | Y |}

, the sets

{W \in {C H}_{c} (X, Y) : C_{0}^{F B} (W) < λ}

are semi-decidable.

Now, we want to look at the consequences of the results above for

C_{0}^{F B}

. The same statements apply here as in Section 3 for

R_{\infty}

with regard to the approximation from below.

C_{0}^{F B}

cannot be approximated by monotonically increasing sequences.

There is an elementary relationship between

R_{\infty}

and

C_{0}^{F B}

, which we use in the following. Again, we assume that

X, Y

are finite non-trivial alphabets. We remember the following functions:

R_{\infty} (W) = {log}_{2} \frac{1}{Ψ_{\infty} (W)},

(23)

where

Ψ_{\infty} (W) = {max}_{Q \in P (Y)} {min}_{x \in X} \sum_{y : W (y | x) > 0} Q (y) .

C_{0}^{F B} = \{\begin{matrix} 0 & C_{0} (W) = 0 \\ G (W) & C_{0} (W) > 0 \end{matrix},

(24)

where

G (W) = {log}_{2} \frac{1}{Ψ_{\infty} (W)}

and

Ψ_{\infty} (W) = min_{p \in P (X)} min_{y \in Y} \sum_{x : W (y | x) > 0} P (x) .

(25)

Let

A (W)

be the

| Y | \times | X |

matrix with

{(A (W))}_{k l} \in {0, 1}

for

1 \leq k \leq | Y |

and

1 \leq l \leq | X |

, such that

{(A (W))}_{k l} = 1

if and only if

W (k (l)) > 0

. Furthermore, let

M_{X} = \{u \in R^{| X |} : u = (\begin{matrix} u_{1} \\ \dots \\ u_{| X |} \end{matrix}), u_{l} \geq 0, \sum_{l = 1}^{| X |} u_{l} = 1\}

(26)

and

M_{Y} = \{v \in R^{| Y |} : v = (\begin{matrix} v_{1} \\ \dots \\ v_{| Y |} \end{matrix}), v_{l} \geq 0, \sum_{l = 1}^{| Y |} v_{l} = 1\} .

(27)

For

v \in R^{| Y |}

and

u \in R^{| X |}

, we consider the function

F (v, u) = v^{T} A (W) u

. The function F is concave in

v \in M_{Y}

and convex in

u \in M_{X}

.

M_{Y}

and

M_{X}

are closed convex and compact sets, and

F (v, u)

is continuous in both variables. So,

max_{v \in M_{Y}} min_{u \in M_{X}} F (v, u) = min_{u \in M_{X}} max_{v \in M_{Y}} F (v, u) .

(28)

Let

v \in M_{Y}

be fixed. Then,

\begin{matrix} F (v, u) & = & (\sum_{l = 1}^{| X |} (\sum_{k = 1}^{| Y |} v_{k} A_{k l} (W)) u_{l}) \end{matrix}

(29)

\begin{matrix} F (v, u) & = & (\sum_{l = 1}^{| X |} d_{l} (v) u_{l}), \end{matrix}

(30)

with

d_{l} (v) = \sum_{k = 1}^{| Y |} v_{k} A_{k l} (W)

. Now,

d_{l} (v) \geq 0

for

1 \leq l \leq | X |

. Hence,

min_{u \in M_{X}} F (v, u) = min_{1 \leq l \leq | X |} d_{l} (v) = min_{1 \leq l \leq | X |} \sum_{k : A_{k} l (W) > 0} v_{k} = min_{x \in X} \sum_{y : W (y | x) > 0} Q_{v} (y),

with

Q_{v} (y) = v_{y}

for

y \in {1, \dots, | Y |}

. So,

max_{v \in M_{Y}} min_{u \in M_{X}} F (v, u) = max_{Q \in P (Y)} min_{x \in X} \sum_{y : W (y | x) > 0} Q_{v} (y) = Ψ_{\infty} (W) .

Furthermore, for

u \in M_{X}

fixed,

\begin{matrix} F (v, u) & = & (\sum_{k = 1}^{| Y |} (\sum_{l = 1}^{| X |} u_{l} A_{k l} (W)) v_{k}) \\ = & (\sum_{k = 1}^{| Y |} β_{k} (u) v_{k}), \end{matrix}

with

β_{k} (u) = \sum_{l = 1}^{| X |} u_{l} A_{k l} (W) \geq 0

and

1 \leq k \leq | Y |

. Therefore,

max_{v \in M_{Y}} F (v, u) = max_{1 \leq k \leq | Y |} β_{k} (u) = max_{1 \leq k \leq | Y |} \sum_{l : A_{k l} (W) > 0} u_{l} = max_{y \in Y} \sum_{x : W (Y | x) > 0} p_{u} (x)

with

p_{u} (x) = u_{x}

for

1 \leq x \leq | X |

. It follows that

min_{u \in M_{X}} max_{v \in M_{Y}} F (v, u) = min_{p \in P (X)} max_{y \in Y} \sum_{x : W (y | x) > 0} P (x) = Ψ_{\infty} (W) .

We get the following lemma.

Lemma 3.

Let

W \in C H (X, Y)

; then,

R_{\infty} (W) = G (W) .

We want to investigate the behavior of

E (\cdot, R)

for the input

W_{1} \otimes W_{2}

, where

W_{1} \otimes W_{2}

denotes the Kronecker product of the matrices

W_{1}

and

W_{2}

compared to

E (W_{1}, R)

and

E (W_{2}, R)

. For this purpose, let

X_{1}, Y_{1}, X_{2}, Y_{2}

be arbitrary finite non-trivial alphabets, and we consider

W_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

.

Theorem 17.

Let

X_{1}, Y_{1}, X_{2}, Y_{2}

be arbitrary finite non-trivial alphabets, and

W_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

. Then, we have

R_{\infty} (W_{1} \otimes W_{2}) = R_{\infty} (W_{1}) + R_{\infty} (W_{2}) .

Proof.

We use the

Ψ_{\infty}

function. It applies to

Q = Q_{1} \cdot Q_{2}

with

Q_{1} \in P (Y_{1})

and

Q_{2} \in P (Y_{2})

, so that

\begin{matrix} min_{x_{1} \in X_{1}, x_{2} \in X_{2}} \sum_{y_{1} : W_{1} (y_{1} | x_{1}) > 0} \sum_{y_{2} : W_{2} (y_{2} | x_{2})} Q_{1} (y_{1}) Q_{2} (y_{2}) \\ = & (min_{x_{1} \in X_{1}} \sum_{y_{1} : W_{1} (y_{1} | x_{1}) > 0} Q_{1} (y_{1})) (min_{x_{1} \in X_{1}} \sum_{y_{2} : W_{2} (y_{2} | x_{2}) > 0} Q_{2} (y_{2})) . \end{matrix}

This applies to all

Q_{1} \in P (Y_{1})

and

Q_{2} \in P (Y_{2})

arbitrarily. So,

Ψ_{\infty} (W_{1} \otimes W_{2}) \geq Ψ_{\infty} (W_{1}) \cdot Ψ_{\infty} (W_{2}) .

Also, we have

\begin{matrix} Ψ_{\infty} (W_{1} \otimes W_{2}) \\ = & min_{P \in P (X_{1} \times X_{2})} max_{(y_{1}, y_{2}) \in Y_{1} \times Y_{2}} \sum_{x_{1} : W_{1} (y_{1} | x_{1}) > 0} \sum_{x_{2} : W_{2} (y_{2} | x_{2}) > 0} P (x_{1}, y_{2}) \\ \leq & Ψ_{\infty} (W_{1}) \cdot Ψ_{\infty} (W_{2}) \end{matrix}

as well. So,

Ψ_{\infty} (W_{1} \otimes W_{2}) = Ψ_{\infty} (W_{1}) \cdot Ψ_{\infty} (W_{2})

and the theorem is proven. □

We want to investigate the behavior of

C_{0}^{F B}

for the input

W_{1} \otimes W_{2}

compared to

C_{0}^{F B} (W_{1})

and

C_{0}^{F B} (W_{2})

. For this purpose, let

X_{1}, Y_{1}, X_{2}, Y_{2}

be arbitrary finite non-trivial alphabets and consider

W_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

.

Theorem 18.

Let

X_{1}, Y_{1}, X_{2}, Y_{2}

be arbitrary finite non-trivial alphabets, and

W_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

. Then, we have

1.: $C_{0}^{F B} (W_{1} \otimes W_{2}) \geq C_{0}^{F B} (W_{1}) + C_{0}^{F B} (W_{2})$

(31)
2.: $C_{0}^{F B} (W_{1} \otimes W_{2}) > C_{0}^{F B} (W_{1}) + C_{0}^{F B} (W_{2})$

(32)

if and only if

$min_{1 \leq l \leq 2} C_{0}^{F B} (W_{l}) = 0 and max_{1 \leq l \leq 2} C_{0}^{F B} (W_{l}) > 0 and min_{1 \leq l \leq 2} R_{\infty} (W_{l}) > 0 .$

(33)

Remark 9.

The condition (33) is equivalent to

min_{1 \leq l \leq 2} C_{0} (W_{l}) = 0 and max_{1 \leq l \leq 2} C_{0} (W_{l}) > 0 and min_{1 \leq l \leq 2} R_{\infty} (W_{l}) > 0 .

(34)

Proof.

(31) follows directly from the operational definition of C. Let (33) now be fulfilled. Then,

C_{0}^{F B} (W_{1} \otimes W_{2}) > 0

must be fulfilled. Without loss of generality, we assume

C_{0}^{F B} (W_{1}) = 0

,

C_{0}^{F B} (W_{2}) > 0

and

R_{\infty} (W_{1}) > 0

,

R_{\infty} (W_{2}) > 0

. Since

C_{0}^{F B} (W_{1} \otimes W_{2}) > 0

,

\begin{matrix} C_{0}^{F B} (W_{1} \otimes W_{2}) & = & R_{\infty} (W_{1} \otimes W_{2}) \\ = & R_{\infty} (W_{1}) + R_{\infty} (W_{2}) \\ = & R_{\infty} (W_{1}) + C_{0}^{F B} (W_{2}) \\ > & 0 + C_{0}^{F B} (W_{2}) \\ = & C_{0}^{F B} (W_{1}) + C_{0}^{F B} (W_{2}) . \end{matrix}

If (32) is fulfilled, then

C_{0}^{F B} (W_{1} \otimes W_{2}) > 0

. Then,

{max}_{1 \leq l \leq 2} C_{0}^{F B} (W_{l}) > 0

must be, because if

{max}_{1 \leq l \leq 2} C_{0}^{F B} (W_{l}) = 0

, then

{max}_{1 \leq l \leq 2} C_{0} (W_{l}) = 0

, and thus

C_{0} (W_{1} \otimes W_{2}) = 0

also (since the

C_{0}

capacity has no super-activation). This means that

C_{0}^{F B} (W_{1} \otimes W_{2}) = 0

, which would be a contradiction.

If

{min}_{1 \leq 2} C_{0}^{F B} (W_{l}) > 0

, then

\begin{matrix} C_{0}^{F B} (W_{1} \otimes W_{2}) & = & R_{\infty} (W_{1} \otimes W_{2}) \\ = & R_{\infty} (W_{1}) + R_{\infty} (W_{2}) \\ = & C_{0}^{F B} (W_{1}) + C_{0}^{F B} (W_{2}) . \end{matrix}

This is a contradiction, and thus

{min}_{1 \leq 2} C_{0}^{F B} (W_{l}) = 0

. Furthermore,

{min}_{1 \leq l \leq 2} R_{\infty} (W_{l}) > 0

must apply, because if

{min}_{1 \leq l \leq 2} R_{\infty} (W_{l}) = 0

, then

R_{\infty} (W_{1}) = 0

without loss of generality. Then,

\begin{matrix} C_{0}^{F B} (W_{1} \otimes W_{2}) & = & R_{\infty} (W_{1} \otimes W_{2}) \\ = & R_{\infty} (W_{1}) + R_{\infty} (W_{2}) \\ = & 0 + R_{\infty} (W_{2}) \\ = & 0 + C_{0}^{F B} (W_{2}) \\ = & C_{0}^{F B} (W_{1}) + C_{0}^{F B} (W_{2}), \end{matrix}

because

C_{0}^{F B} (W_{1}) = 0

when

R_{\infty} (W_{1}) = 0

. This is again a contradiction. With this, we have proven the theorem. □

We still want to show for which alphabet sizes the behavior according to Theorem 18 can occur.

Theorem 19.

1.: If $| X_{1} | = | X_{2} | = | Y_{1} | = | Y_{2} | = 2$ , then for all $W_{l} \in C H (X_{l}, Y_{l})$ with $l = 1, 2$ , we have

$C_{0}^{F B} (W_{1} \otimes W_{2}) = C_{0}^{F B} (W_{1}) + C_{0}^{F B} (W_{2}) .$

(35)
2.: If $X_{1}, X_{2}, Y_{1}, Y_{2}$ are non-trivial alphabets with

$max {min {| X_{1} |, | Y_{1} |}, min {| X_{2} |, | Y_{2} |}} \geq 3,$

then there exists ${\hat{W}}_{l} \in C H (X_{l}, Y_{l})$ with $l = 1, 2$ , such that

$C_{0}^{F B} ({\hat{W}}_{1} \otimes {\hat{W}}_{2}) > C_{0}^{F B} ({\hat{W}}_{1}) + C_{0}^{F B} ({\hat{W}}_{2}) .$

(36)

Proof.

If $C_{0} (W_{1}) = C_{0} (W_{2})$ , then (35) holds, since $C_{0} (W_{1} \otimes W_{2}) = 0$ .
If $max {C_{0} (W_{1}), C_{0} (W_{2})} > 0$ , we can assume without loss of generality that $C_{0} (W_{1}) = 0$ . In this case, $W_{1}$ must be either

$W_{1} = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) or W_{1} = (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}),$

which implies that $C_{0} (W_{2}) = 1$ , and consequently, $C_{0}^{F B} (W_{2}) = 1$ . Furthermore, if $R_{\infty} (W_{2}) > 0$ , then $W_{2}$ must also be one of the two matrices above, ensuring that (35) holds. If instead $R_{\infty} (W_{2}) = 0$ , Theorem 17 guarantees that (35) remains valid.
We now prove (36) under the assumption that $| X_{1} | = | Y_{1} | = 2$ and $| X_{2} | = | Y_{2} | = 3$ . If we have found channels ${\hat{W}}_{1}, {\hat{W}}_{2}$ for this case, such that (36) holds, then it is also clear how general case 2 can be proved. We set ${\hat{W}}_{1} = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix})$ , which means $C_{0} ({\hat{W}}_{1}) = C_{0}^{F B} ({\hat{W}}_{1}) = R_{\infty} ({\hat{W}}_{1}) = 1$ . For ${\hat{W}}_{2}$ , we take the three-ary typewriter channel ${\hat{W}}_{2} (ϵ)$ with $X_{2} = Y_{2} = {0, 1, 2}$ (see [43]):

${\hat{W}}_{2} (ϵ) (y | x) = \{\begin{matrix} 1 - ϵ & y = x, \\ ϵ & y = x + 1 mod 3 . \end{matrix}$

Let $ϵ \in (0, \frac{1}{2})$ be arbitrary, then $C ({\hat{W}}_{2} (ϵ)) = {log}_{2} (3) - H_{2} (ϵ)$ . We have $R_{\infty} ({\hat{W}}_{2} (ϵ)) = {log}_{2} \frac{3}{2}$ and $C_{0} ({\hat{W}}_{2} (ϵ)) = 0$ . This means that $C_{0}^{F B} ({\hat{W}}_{2} (ϵ)) = 0$ . Thus, because $C_{0} ({\hat{W}}_{1} \times {\hat{W}}_{2} (ϵ)) \geq C_{0} ({\hat{W}}_{1}) = 1$ ,

$\begin{matrix} C_{0}^{F B} ({\hat{W}}_{1} \otimes {\hat{W}}_{2} (ϵ)) & = & R_{\infty} ({\hat{W}}_{1}) = R_{\infty} ({\hat{W}}_{2} (ϵ)) \\ = & 1 + {log}_{2} (\frac{3}{2}) > C_{0}^{F B} ({\hat{W}}_{1}) + C_{0}^{F B} ({\hat{W}}_{2} (ϵ)) \end{matrix}$

and we have proven case 2.

□

6. Behavior of the Expurgation-Bound Rates

In this section, we consider the behavior of the expurgation-bound rate.

R_{e x}^{k}

occurs in the expurgation bound as a lower bound for the channel reliability function, where k is the parameter for the k-letter description. Let

X_{1}, Y_{1}, X_{2}, Y_{2}

be arbitrary finite non-trivial alphabets, and

W_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

. We want to examine

R_{e x}^{k}

.

Theorem 20.

There exist non-trivial alphabets

X_{1}, Y_{1}, X_{2}, Y_{2}

and channels

W_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

, such that for all

\hat{k}

, there exists

k \geq \hat{k}

with

R_{e x}^{k} (W_{1} \otimes W_{2}) \neq R_{e x}^{k} (W_{1}) + R_{e x}^{k} (W_{2}) .

Proof.

Assume that for all

X_{1}, Y_{1}, X_{2}, Y_{2}

and

W_{l} \in C H (X_{l}, Y_{l})

with

l = 1, 2

for all

k \in N

,

R_{e x}^{k} (W_{1} \otimes W_{2}) = R_{e x}^{k} (W_{1}) + R_{e x}^{k} (W_{2}) .

We now take

X_{1}^{'}, Y_{1}^{'}, X_{2}^{'}, Y_{2}^{'}

such that

C_{0}

is superadditive. Then, we have for certain

W_{1}^{'}, W_{2}^{'}

with

W_{l}^{'} \in C H (X_{l}^{'}, Y_{l}^{'})

,

C_{0} (W_{1}^{'} \otimes W_{2}^{'}) > C_{0} (W_{1}^{'}) + C_{0} (W_{2}^{'}) .

(37)

Then,

\begin{matrix} C_{0} (W_{1}^{'} \otimes W_{2}^{'}) & = & lim_{k \to \infty} R_{e x}^{k} (W_{1}^{'} \otimes W_{2}^{'}) \\ = & lim_{k \to \infty} R_{e x}^{k} (W_{1}^{'}) + R_{e x}^{k} (W_{2}^{'}) \\ = & C_{0} (W_{1}^{'}) + C_{0} (W_{2}^{'}) . \end{matrix}

This is a contradiction, and thus the theorem is proven. □

We improve the statement of Theorem 20 with the following theorem.

Theorem 21.

There exist non-trivial alphabets

X_{1}, Y_{1}, X_{2}, Y_{2}

and channels

W_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

and a

\hat{k}

, such that for all

k \geq \hat{k}

,

R_{e x}^{k} (W_{1} \otimes W_{2}) > R_{e x}^{k} (W_{1}) + R_{e x}^{k} (W_{2})

holds true.

Proof.

Assume the statement of the theorem is false, which means for all channels

W_{l} \in C H (X_{l}, Y_{l})

with

l = 1, 2

, the following applies: There exists a sequence

{k_{j}}_{j \in N} \subset N

with

{lim}_{j \to \infty} k_{j} = + \infty

, such that

R_{e x}^{k_{l}} (W_{1} \otimes W_{2}) \leq R_{e x}^{k_{l}} (W_{1}) + R_{e x}^{k_{l}} (W_{2})

for

l \in N

. We now take

{\hat{X}}_{1}, {\hat{Y}}_{1}, {\hat{X}}_{2}, {\hat{Y}}_{2}

so that

C_{0}

is superadditive for these alphabets. Then, we have for certain

{\hat{W}}_{1}, {\hat{W}}_{2}

with

{\hat{W}}_{l} \in C H (X_{l}, Y_{l})

for

l = 1, 2

,

C_{0} ({\hat{W}}_{1} \otimes {\hat{W}}_{2}) > C_{0} ({\hat{W}}_{1}) + C_{0} ({\hat{W}}_{2}) .

(38)

Then,

\begin{matrix} C_{0} ({\hat{W}}_{1} \otimes {\hat{W}}_{2}) = lim_{j \to \infty} R_{e x}^{k_{j}} ({\hat{W}}_{1} \otimes {\hat{W}}_{2}) & \leq & lim_{j \to \infty} (R_{e x}^{k_{j}} ({\hat{W}}_{1}) + R_{e x}^{k_{j}} ({\hat{W}}_{2})) \\ = & C_{0} ({\hat{W}}_{1}) + C_{0} ({\hat{W}}_{2}) . \end{matrix}

This is a contradiction to (38), and thus the theorem is proven. □

We have already observed that the function

E (W, \cdot)

exhibits significantly different behavior over certain rate intervals

[R, \hat{R}]

. In particular, we have analyzed the impact of the channel product

W_{1} \otimes W_{2}

on the intervals

(R_{\infty} (W_{1} \otimes W_{2}), C (W_{1} \otimes W_{2}))

and

(E_{e x}^{k} (W_{1} \otimes W_{2}), C (W_{1} \otimes W_{2}))

for

k \in N

.

For the first interval, we established the relation

(R_{\infty} (W_{1} \otimes W_{2}), C (W_{1} \otimes W_{2})) = (R_{\infty} (W_{1}) + R_{\infty} (W_{2}), C (W_{1}) + C (W_{2})) .

However, for the second interval, we have shown that such a simple additive behavior does not hold. Given the proof of Theorem 18, we conclude that there exist channels

W_{1}^{'}, W_{2}^{'}

for which

R_{e x}^{k} (W_{1}^{'} \otimes W_{2}^{'}) > R_{e x}^{k} (W_{1}^{'}) + R_{e x}^{k} (W_{2}^{'})

is satisfied for all

k \geq \hat{k}

.

Another important aspect is understanding the conditions under which the interval

[0, \hat{R})

causes

E (W, r)

to become infinite. This occurs if and only if

C_{0} (W) > 0

, in which case, the interval is given by

[0, C_{0} (W))

. Consequently, there exist channels

W_{1}^{'}, W_{2}^{'}

such that for the function

E (W_{1}^{'} \otimes W_{2}^{'}, \cdot)

, this interval extends beyond

[0, C_{0} (W_{1}^{'}) + C_{0} (W_{2}^{'})]

.

Thus, we conclude that

C_{0}

is generally superadditive.

7. Conclusions

We have shown that the channel reliability function is not a Turing computable performance function. The same conclusion holds for the functions associated with the sphere packing bound and the expurgation bound.

An interesting aspect of our work is that the constraints we impose on Turing computable performance functions are strictly weaker than those typically required for Turing computable functions. Specifically, we do not require that the Turing machine halt for all inputs

(W, R) \in C_{h} \times R_{c}^{+}

. This means we allow the Turing machine to compute indefinitely for certain inputs, i.e., it may never halt for some inputs. Consequently, we permit performance functions that are not defined for all

(W, R) \in C_{h} \times R_{c}^{+}

. However, we do require the Turing machine to halt for inputs

(W, R) \in C_{h} \times R_{c}

whenever the performance function F is defined, and in such cases, the machine must return the computable value

F (W, R)

as output. This ensures that the algorithm generated corresponds to the number

F (W, R)

according to Definition 15.

Additionally, we considered the

R_{\infty}

function and the zero-error feedback capacity, both of which play a critical role in the context of the channel reliability function. We demonstrated that neither the

R_{\infty}

function nor the zero-error feedback capacity is Banach–Mazur computable. Furthermore, we proved that the

R_{\infty}

function is additive.

We also established that for all finite alphabets

X, Y

with

| X | \geq 2

and

| Y | \geq 2

, the channel reliability function itself is not a Turing computable performance function. Moreover, we showed that the commonly studied bounds, which have been extensively examined in the literature, are also not Turing computable performance functions. It remains unclear whether non-trivial upper bounds for the channel reliability function that are Turing computable even exist.

In [13], the sequence of k-letter expurgation bounds was considered an effective method for approximating the channel reliability function. It was hoped that these sequences could be computed more efficiently using modern digital computers. However, we have shown that this is not the case. Table 1 gives an overview of the main results of the paper.

As mentioned in the Introduction, future communication systems, such as 6G, will face stringent requirements for trustworthiness. Ultra-reliability, along with the corresponding performance functions, is central to 6G, and this paper addresses that challenge. It is currently unclear how the non-Turing computability of performance functions will impact the system evaluation and certification of future communication systems. A recent study [62] showed that the non-Turing computability of performance functions in artificial intelligence (AI) leads to digital AI algorithms being unable to meet essential legal requirements. It is an intriguing research question whether similar issues might arise in the context of communication systems.

This work does not claim that machine learning or artificial intelligence (AI) approaches are useless for computing capacity functions. Rather, it demonstrates that certain solutions cannot be found by such methods, or that a computer may not be able to assess how close a given result is to the optimum. Nevertheless, employing machine learning tools remains valuable; one must simply be aware that these approaches do not always guarantee optimality. In such cases, alternative theoretical frameworks may be necessary.

Turing computability and Banach–Mazur computability are two central notions in the theory of computation. Every function that is Turing computable is also Banach–Mazur computable, meaning that Banach–Mazur computability subsumes Turing computability. However, the converse does not hold: not every Banach–Mazur computable function is Turing computable. In fact, if a function is not Banach–Mazur computable, then it cannot be computable under any other standard notion of computability. This underscores the foundational and maximal character of Banach–Mazur computability within the hierarchy of computability concepts. Moreover, as shown in [63], there exist even total functions—functions defined on all computable real numbers—that are Banach–Mazur computable but not Turing computable. For readers interested in a deeper understanding of computability theory—how to determine whether a function is computable, along with illustrative examples and detailed explanations—we recommend the comprehensive work by Soare [64] and Cooper’s New Computational Paradigms [65]. Practical implications of these theoretical analyses, especially their relevance to real-world applications, are further explored in [66], which may be of particular interest to those seeking connections between theory and practice.

Author Contributions

The contributions of both authors are equal across all categories. All authors have read and agreed to the published version of the manuscript.

Funding

The authors gratefully acknowledge the financial support provided by the Federal Ministry of Education and Research (BMBF) of Germany under the “Souverän. Digital. Vernetzt.” program, specifically through the joint project 6G-life, project identification numbers 16KISK002 and 16KISK263. H. Boche and C. Deppe also acknowledge the financial support from the BMBF’s quantum program QuaPhySI under grants 16KIS1598K and 16KIS2234, as well as from the QUIET project under grants 16KISQ093 and 16KISQ0170. Additionally, they were supported by the QC-CamNetz project under grants 16KISQ077 and 16KISQ169. Furthermore, they received funding from the DFG through the project “Post Shannon Theorie und Implementierung,” under grants BO 1734/38-1 and DE 1915/2-1. Special thanks also go to the BMBF within the national initiative for their support of H. Boche under grant 16KIS1003K and of C. Deppe under grant 16KIS1005.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).

Acknowledgments

Holger Boche would like to thank Martin Bossert for insightful discussions and questions regarding the theory of the channel reliability function and the trustworthiness of numerical simulations of this function on digital computers. He also expresses his gratitude to Vince Poor and Martin Bossert for their discussions at ISIT 2019 in Paris, which sparked the research leading to the results presented in this paper. Finally, we express our appreciation to Yannik Böck for his helpful and insightful comments.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423, 623–656. [Google Scholar] [CrossRef]
Chow, T.Y. What is a Closed-Form Number? Amer. Math. Mon. 1999, 106, 440–448. [Google Scholar] [CrossRef]
Borwein, J.; Crandall, R. Closed Forms: What They Are and Why We Care. Not. Am. Math. Soc. 2013, 60, 50–65. [Google Scholar] [CrossRef]
Ahlswede, R. Multi-way communication channels. In Proceedings of the Second International Symposium on Information Theory, Tsahkadsor, Armenia, 2–8 September 1971. [Google Scholar]
Ahlswede, R.; Dueck, G. Identification via channels. IEEE Trans. Inform. Theory 1989, 35, 15–29. [Google Scholar] [CrossRef]
Sason, I. Observations on graph invariants with the Lovász ϑ-function. AIMS Math. 2024, 9, 15385–15468. [Google Scholar] [CrossRef]
Sason, I. Observations on the Lovász θ-Function, Graph Capacity, Eigenvalues, and Strong Products. Entropy 2023, 25, 104. [Google Scholar] [CrossRef]
Gallager, R.G. Information Theory and Reliable Communication; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1968. [Google Scholar]
Haroutunian, E.; Haroutunian, M.; Harutyunyan, A. Reliability Criteria in Information Theory and in Statistical Hypothesis Testing. Found. Trends Commun. Inf. Theory 2008, 4, 97–263. [Google Scholar] [CrossRef]
Blahut, R.E. Principles and Practice of Information Theory; Addison-Wesley Longman Publishing Co., Inc.: Boston, MA, USA, 1987. [Google Scholar]
Elias, P. Coding for noisy channels. IRE Conv. Rec. 1955, 4, 37–46. [Google Scholar]
Fano, R.M. Transmission of information: A statistical theory of communications. Am. J. Phys. 1961, 29, 793–794. [Google Scholar] [CrossRef]
Shannon, C.; Gallager, R.; Berlekamp, E. Lower Bounds to Error Probability for Coding in Discrete Memoryless Channels. Inf. Control 1967, 10, 65–103. [Google Scholar] [CrossRef]
Turing, A. On Computable Numbers, with an Application to the Entscheidungsproblem. Proc. Lond. Math. Soc. 1936, 42, 230–265. [Google Scholar]
Arimoto, S. An algorithm for computing the capacity of arbitrary discrete memoryless channels. IEEE Trans. Inf. Theory 1972, 18, 14–20. [Google Scholar] [CrossRef]
Blahut, R. Computation of channel capacity and rate-distortion functions. IEEE Trans. Inf. Theory 1972, 18, 460–473. [Google Scholar] [CrossRef]
Dueck, G.; Körner, J. Reliability function of a discrete memoryless channel at rates above capacity (corresp.). IEEE Trans. Inf. Theory 1979, 25, 82–85. [Google Scholar] [CrossRef]
Alajaji, F.; Chen, P.; Rached, Z. A note on the Poor-Verdú upper bound for the channel reliability function. IEEE Trans. Inf. Theory 2002, 48, 309–313. [Google Scholar] [CrossRef]
Ashikhmin, A.; Barg, A.; Litsyn, S. A new upper bound on the reliability function of the Gaussian channel. IEEE Trans. Inf. Theory 2000, 46, 1945–1961. [Google Scholar] [CrossRef]
Lapidoth, A. On the reliability function of the ideal Poisson channel with noiseless feedback. IEEE Trans. Inf. Theory 2002, 39, 491–503. [Google Scholar] [CrossRef]
Burnashev, M.; Yamamoto, H. Noisy feedback improves the Gaussian channel reliability function. In Proceedings of the IEEE International Symposium on Information Theory, Honolulu, HI, USA, 29 June–4 July 2014. [Google Scholar]
Hajek, B.; Subramanian, V. Capacity and reliability function for small peak signal constraints. IEEE Trans. Inf. Theory 2002, 48, 828–839. [Google Scholar] [CrossRef]
Ben-Haim, Y.; Litsyn, S. Improved upper bounds on the reliability function of the Gaussian channel. In Proceedings of the IEEE International Symposium on Information Theory, Seattle, WA, USA, 9–14 July 2006. [Google Scholar]
Endo, H.; Sasaki, M. Reliability and secrecy functions of the wiretap channel under cost constraint. IEEE Trans. Inf. Theory 2014, 60, 6819–6843. [Google Scholar]
Tyagi, H.; Narayan, P. The Gelfand-Pinsker channel: Strong converse and upper bound for the reliability function. In Proceedings of the IEEE International Symposium on Information Theory, Seoul, Republic of Korea, 28 June–3 July 2009. [Google Scholar]
Somekh-Baruch, A. An upper bound on the reliability function of discrete memoryless channels. IEEE Trans. Inf. Theory 2024, 70, 3059–3081. [Google Scholar] [CrossRef]
Burnashev, M.; Yamamoto, H. On the reliability function for a BSC with noisy feedback. Probl. Inf. Transm. 2010, 46, 103–121. [Google Scholar] [CrossRef]
Burnashev, M.; Holevo, A. On the reliability function for a quantum communication channel. Probl. Peredachi Informatsii 1998, 34, 3–15. [Google Scholar]
Holevo, A. Reliability function of general classical-quantum channel. IEEE Trans. Inf. Theory 2002, 46, 2256–2261. [Google Scholar] [CrossRef]
Li, K.; Yang, D. Reliability function of classical-quantum channels. Phys. Rev. Lett. 2025, 134, 010802. [Google Scholar] [CrossRef]
Fettweis, G.P.; Boche, H. 6G: The Personal Tactile Internet—And Open Questions for Information Theory. IEEE BITS Inf. Theory Mag. 2021, 1, 71–82. [Google Scholar] [CrossRef]
Boche, H.; Schaefer, R.; Poor, H.; Fettweis, G. Trustworthiness Verification and Integrity Testing for Wireless Communication Systems. In Proceedings of the IEEE International Conference on Communications, Seoul, South Korea and Virtual, 16–20 May 2022. [Google Scholar]
Fettweis, G.P.; Boche, H. On 6G and trustworthiness. Commun. ACM 2022, 65, 48–49. [Google Scholar] [CrossRef]
Alon, N.; Lubetzky, E. The Shannon capacity of a graph and the independence numbers of its powers. IEEE Trans. Inf. Theory 2006, 52, 2172–2176. [Google Scholar] [CrossRef]
Boche, H.; Deppe, C. Computability of the zero-error capacity of noisy channels. In Proceedings of the 2021 IEEE Information Theory Workshop (ITW), Kanazawa, Japan, 17–21 October 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6. [Google Scholar]
Boche, H.; Deppe, C. Computability of the channel reliability function and related bounds. In Proceedings of the 2022 IEEE International Symposium on Information Theory (ISIT), Espoo, Finland, 26 June–1 July 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1530–1535. [Google Scholar]
Weihrauch, K. Computable Analysis: An Introduction, 1st ed.; Springer Publishing Company, Incorporated: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
Soare, R.I. Recursively Enumerable Sets and Degrees, 1st ed.; Springer: Berlin/Heidelberg, Germany, 1987. [Google Scholar] [CrossRef]
Pour-El, M.B.; Richards, J.I. Computability in Analysis and Physics; Perspectives in Logic; Cambridge University Press: Cambridge, UK, 2017. [Google Scholar] [CrossRef]
Shannon, C. The zero error capacity of a noisy channel. IRE Trans. Inf. Theory 1956, 2, 8–19. [Google Scholar] [CrossRef]
Gallager, R. A simple derivation of the coding theorem and some applications. IEEE Trans. Inf. Theory 1965, 11, 3–18. [Google Scholar] [CrossRef]
Katsman, G.L.; Tsfasman, M.A.; Vladuţ, S.G. Spectra of linear codes and error probability of decoding. In Coding theory and Algebraic Geometry (Luminy, 1991); Lecture Notes in Math; Springer: Berlin/Heidelberg, Germany, 1992; Volume 1518, pp. 82–98. [Google Scholar]
Dalai, M.; Polyanskiy, Y. Bounds on the Reliability Function of Typewriter Channels. IEEE Trans. Inf. Theory 2018, 64, 6208–6222. [Google Scholar] [CrossRef]
Lovász, L. On the Shannon capacity of a graph. IEEE Trans. Inf. Theory 1979, 25, 1–7. [Google Scholar] [CrossRef]
Baumert, L.D.; McEliece, R.J.; Rodemich, E.; Rumsey, H.C., Jr.; Stanley, R.; Taylor, H. A combinatorial packing problem. In Proceedings of the Computers in Algebra and Number Theory, New York, NY, USA, 25–26 March 1970; Volume IV, pp. 97–108. [Google Scholar]
Bohman, T. A limit theorem for the Shannon capacities of odd cycles. I. Proc. Am. Math. Soc. 2003, 131, 3559–3569. [Google Scholar] [CrossRef]
Polak, S.C.; Schrijver, A. New lower bound on the Shannon capacity of C7 from circular graphs. Inf. Process. Lett. 2019, 143, 37–40. [Google Scholar] [CrossRef]
Delsarte, P. An algebraic approach to the association schemes of coding theory. Philips Res. Rep. Suppl. 1973, 10, vi+-97. [Google Scholar]
Schrijver, A. A comparison of the Delsarte and Lovász bounds. IEEE Trans. Inform. Theory 1979, 25, 425–429. [Google Scholar] [CrossRef]
McEliece, R.J.; Rodemich, E.R.; Rumsey, H., Jr.; Welch, L.R. New upper bounds on the rate of a code via the Delsarte-MacWilliams inequalities. IEEE Trans. Inform. Theory 1977, IT-23, 157–166. [Google Scholar]
Litsyn, S. New upper bounds on error exponents. IEEE Trans. Inform. Theory 1999, 45, 385–398. [Google Scholar] [CrossRef]
Barg, A.; McGregor, A. Distance distribution of binary codes and the error probability of decoding. IEEE Trans. Inform. Theory 2005, 51, 4237–4246. [Google Scholar] [CrossRef]
Kalai, G.; Linial, N. On the distance distribution of codes. IEEE Trans. Inform. Theory 1995, 41, 1467–1472. [Google Scholar] [CrossRef]
Nötzel, J.; Wiese, M.; Boche, H. The arbitrarily varying wiretap channel—Secret randomness, stability, and super-activation. IEEE Trans. Inf. Theory 2016, 62, 3504–3531. [Google Scholar] [CrossRef]
Schaefer, R.F.; Boche, H.; Poor, H.V. Secure communication under channel uncertainty and adversarial attacks. Proc. IEEE 2015, 103, 1796–1813. [Google Scholar] [CrossRef]
Wiese, M.; Nötzel, J.; Boche, H. A channel under simultaneous jamming and eavesdropping attack—Correlated random coding capacities under strong secrecy criteria. IEEE Trans. Inf. Theory 2016, 62, 3844–3862. [Google Scholar] [CrossRef]
Boche, H.; Deppe, C. Secure identification for wiretap channels; robustness, super-additivity and continuity. IEEE Trans. Inf. Forensics Secur. 2018, 13, 1641–1655. [Google Scholar] [CrossRef]
Boche, H.; Schaefer, R.F. Capacity results and super-activation for wiretap channels with active wiretappers. IEEE Trans. Inf. Forensics Secur. 2013, 8, 1482–1496. [Google Scholar] [CrossRef]
Boche, H.; Böck, Y.; Deppe, C. On Effective Convergence in Fekete’s Lemma and Related Combinatorial Problems in Information Theory. In Festschrift in Memory of Ning Cai: Information Theory and Related Fields; Springer: Cham, Switzerland, 2025; pp. 289–318. [Google Scholar]
Boche, H.; Schaefer, R.F.; Poor, H.V. Algorithmic Computability and Approximability of Capacity-Achieving Input Distributions. IEEE Trans. Inf. Theory 2023, 69, 5449–5462. [Google Scholar] [CrossRef]
Lee, Y.; Boche, H.; Kutyniok, G. Computability of Optimizers. IEEE Trans. Inf. Theory 2024, 70, 2967–2983. [Google Scholar] [CrossRef]
Boche, H.; Fono, A.; Kutyniok, G. A Mathematical Framework for Computability Aspects of Algorithmic Transparency. In Proceedings of the IEEE International Symposium on Information Theory, Athens, Greece, 7–12 July 2024; IEEE: Piscataway, NJ, USA, 2024. [Google Scholar]
Hertling, P. A Banach–Mazur computable but not Markov computable function on the computable real numbers. Ann. Pure Appl. Log. 2005, 132, 227–246. [Google Scholar] [CrossRef]
Soare, R.I. Turing Computability: Theory and Applications, 1st ed.; Springer Publishing Company, Incorporated: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Cooper, S.B.; Löwe, B.; Sorbi, A. New Computational Paradigms: Changing Conceptions of What Is Computable; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
Boche, H.; Fojtik, V.; Fono, A.; Kutyniok, G. Computability of Classification and Deep Learning: From Theoretical Limits to Practical Feasibility through Quantization. J. Fourier Anal. Appl. 2025, 31, 35. [Google Scholar] [CrossRef]

Table 1. Overview of results.

Problem	Result
Computability of capacity	$R_{\infty}$ : Yes $R_{0}$ : Unknown $R_{0}$ for alphabet size $< 5$ : Yes
Semi-decidability of capacity $> λ$	$R_{\infty}$ : No $C_{0}^{F B}$ : No
Semi-decidability of capacity $< λ$	$R_{\infty}$ : Yes $C_{0}^{F B}$ : Yes
Computability of capacity function	$R_{\infty}$ : No $C_{0}^{F B}$ : No
Computability of performance function	$E (W, R)$ : No ${E_{e x} (\cdot, \cdot, k)}_{k \in N}$ : No
Additivity	$R_{\infty}$ : Yes $C_{0}^{F B}$ : Unknown $C_{0}^{F B}$ for alphabet size 2: Yes $R_{e x}^{k}$ : No

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Boche, H.; Deppe, C. The Computability of the Channel Reliability Function and Related Bounds. Algorithms 2025, 18, 361. https://doi.org/10.3390/a18060361

AMA Style

Boche H, Deppe C. The Computability of the Channel Reliability Function and Related Bounds. Algorithms. 2025; 18(6):361. https://doi.org/10.3390/a18060361

Chicago/Turabian Style

Boche, Holger, and Christian Deppe. 2025. "The Computability of the Channel Reliability Function and Related Bounds" Algorithms 18, no. 6: 361. https://doi.org/10.3390/a18060361

APA Style

Boche, H., & Deppe, C. (2025). The Computability of the Channel Reliability Function and Related Bounds. Algorithms, 18(6), 361. https://doi.org/10.3390/a18060361

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Computability of the Channel Reliability Function and Related Bounds

Abstract

1. Introduction

2. Definitions and Basic Results

2.1. Basic Concepts of Computability Theory

2.2. Basic Concepts of Information Theory

2.3. Lower and Upper Bounds on the Reliability Function for the Typewriter Channel

2.4. Computable Channels and Computable Performance Functions

3. Results for the Rate Function $R_{\infty}$ and Applications on the Sphere Packing Bound

4. Computability of the Channel Reliability Function and the Sequence of Expurgation Bound Functions

5. Computability of the Zero-Error Capacity of Noisy Channels with Feedback

6. Behavior of the Expurgation-Bound Rates

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

The Computability of the Channel Reliability Function and Related Bounds

Abstract

1. Introduction

2. Definitions and Basic Results

2.1. Basic Concepts of Computability Theory

2.2. Basic Concepts of Information Theory

2.3. Lower and Upper Bounds on the Reliability Function for the Typewriter Channel

2.4. Computable Channels and Computable Performance Functions

3. Results for the Rate Function R ∞ and Applications on the Sphere Packing Bound

4. Computability of the Channel Reliability Function and the Sequence of Expurgation Bound Functions

5. Computability of the Zero-Error Capacity of Noisy Channels with Feedback

6. Behavior of the Expurgation-Bound Rates

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3. Results for the Rate Function $R_{\infty}$ and Applications on the Sphere Packing Bound