Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results

Salariseddigh, Mohammad Javad; Dabbabi, Ons; Deppe, Christian; Boche, Holger

doi:10.3390/fi16030078

Open AccessArticle

Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results^†

¹

Institute for Communications Engineering, Technical University of Munich (TUM), 80333 Munich, Germany

²

Federal Ministry of Education and Research, Hub 6G-Life, Technical University of Munich (TUM), 80333 Munich, Germany

³

Institute for Communications Technology, Technical University of Braunschweig, 38106 Braunschweig, Germany

⁴

Chair of Theoretical Information Technology, Technical University of Munich, 80333 Munich, Germany

^*

Author to whom correspondence should be addressed.

^†

This paper is an extended version of our paper published in IEEE Global Communications Conference (GLOBECOM 2023), Deterministic K-Identification For Binary Symmetric Channels, Saint-Malo, France, 23–28 April 2023.

Future Internet 2024, 16(3), 78; https://doi.org/10.3390/fi16030078

Submission received: 11 January 2024 / Revised: 14 February 2024 / Accepted: 20 February 2024 / Published: 26 February 2024

(This article belongs to the Special Issue Featured Papers in the Section Internet of Things)

Download

Browse Figures

Versions Notes

Abstract

Numerous applications of the Internet of Things (IoT) feature an event recognition behavior where the established Shannon capacity is not authorized to be the central performance measure. Instead, the identification capacity for such systems is considered to be an alternative metric, and has been developed in the literature. In this paper, we develop deterministic K-identification (DKI) for the binary symmetric channel (BSC) with and without a Hamming weight constraint imposed on the codewords. This channel may be of use for IoT in the context of smart system technologies, where sophisticated communication models can be reduced to a BSC for the aim of studying basic information theoretical properties. We derive inner and outer bounds on the DKI capacity of the BSC when the size of the goal message set K may grow in the codeword length n. As a major observation, we find that, for deterministic encoding, assuming that K grows exponentially in n, i.e.,

K = 2^{n κ}

, where

κ

is the identification goal rate, then the number of messages that can be accurately identified grows exponentially in n, i.e.,

2^{n R}

, where R is the DKI coding rate. Furthermore, the established inner and outer bound regions reflects impact of the input constraint (Hamming weight) and the channel statistics, i.e., the cross-over probability.

Keywords:

deterministic K-identification; capacity region; binary symmetric channel; Hamming distance; post Shannon communications; internet of things

1. Introduction

The Internet of Things (IoT) refers to a system of interconnected devices that communicate and share data with one another [1,2]. The IoT is first-class and the fastest growing area of technology, where its constituent is called a thing. These things are classified in three groups: people, machines and information (food, medicines, books, etc.). Examples include a driving car with built-in sensors monitoring vehicle health and driving performance, or a person with a heart monitor implant for efficient patient management, and can be very varied, including any natural or human-made objects that has sensors, processing/controlling ability, and can transfer information over a network using specific communication technologies. Some of the key challenges and possible research topics for IoT are highlighted in [3]. Moreover, in [4], different physical layer security techniques for IoT are studied.

Smart cities: IoT can be used in the context of smart cities [5], where it provides an urban network to connect devices such as sensors, lights, and meters, for the sake of data collection and analysis. The smart cities exploit state-of-the-art technologies such as cloud computing [6] and machine learning [7] to provide a better quality of government service, enhancing infrastructure, public utilities, and citizen services. In particular, in the context of smart mobility and transportation systems [8], IoT may provide opportunities for integrating control, communications, and date processing across a heterogeneous network of transportation systems. IoT applications can be extended to different aspects of such systems, including the infrastructure, vehicle, and user/driver. The interactions between such components give rise to inter- and intra-vehicular communication, smart traffic control, safety, logistics, user/vehicle control, electronic toll collection systems, etc. [9]. Specifically, a potential IoT application scenario for these contexts is exploiting sensors for the sake of environmental monitoring [10]. That is, in a wireless sensor network, a group of sensors which monitor the environment are expected to send the minimum amount of information to the decision center for the sake of performing an appropriate and reliable timely act.

Smart medical and health-care systems: Applications of IoT for medical and health-care purposes are referred to as the Internet of Medical Things (IoMT) [11,12]. In this context, the technology for creating a digitized healthcare system where the medical resources cooperate with others for providing health-care services is referred to as smart health-care. In particular, IoT devices may be used for enabling remote emergency notification systems and health monitoring. Such devices range from blood pH/pressure and heart rate monitors to more advanced devices capable of monitoring specialized implants, such as pacemakers, wristbands, or sophisticated hearing aids [11]. Moreover, a field related and concurrently expanding to the IoMT is the Internet of Bio-Nano Things (IoBNT) [13,14] which is the application of IoT for connecting bio-nano things inside the human body in order to provide a network of nano-scale and biological devices. A parallel developing and linked field to IoMT and IoBNT is molecular communication (MC), which provides platform, tools and techniques for establishing communications in the molecular scale [15,16].

1.1. Post-Shannon Communications for IoT

The classical information theory was established by Shannon in [17], where three levels of communications, including technical (reliable symbol transmission), semantic (message’s meaning transfer) and effectiveness (achieve goal/pragmatic aspect of message exchange) problems were defined. Shannon, in [17], considered solely the technical problem, which focuses on the accurate transmission of symbols. However, several applications for emerging sixth-generation (6G) or future-generation (XG) wireless communications/networking systems in the context of IoT demand to deal with the semantic and effectiveness aspects of the message. In fact, future XG systems fold the semantic of message and the goal of message communication into their design. This is required in these applications in order to fulfill certain performance features, including sustainability (robustness), latency, reliability, security, etc. Studying these new aspects of the message goes beyond the conventional Shannon paradigm/framework, and are referred to as post-Shannon communications (PSCs) [18]. For example, in goal/task-oriented communications [19], the success of execution for specific task (effectiveness problem) at the destination/receiver is the key concern, and is demanded by the transmitter.

In particular, a first discussion of the PSC for 6G can be found in [18]. The use of PSC for MC is studied in [20], in which the possible capabilities of MC for 6G is discussed for the first time. Also, a detailed discussion of the requirements for tactile internet (which refers to the data transfer in real-time (extremely low latency) in combination with high availability and reliability requirements) and 6G can be found in [21], in which the PSC is introduced to be of particular importance for several key areas of applications for 6G, wherein new communication scenarios, performance requirements and open questions for the PSC are discussed as well. Moreover, the wireless communication systems in 5G and beyond networks, which include reconfigurable intelligent surfaces (RISs) [22], deal with aspects such as localization, synchronization and beamforming design. These aspects in RISs often require use of the semantic metrics rather than the conventional Shannon metrics; cf. [23,24] for further details. Moreover, various applications in the context of smart medical and health-care systems for 6G networks require task accomplishment [20], and are needed to adapt the encoded signal depending to the specific application-driven requirements of the receiver.

1.2. IoT Needs and Impact of the Deterministic K-Identification

The evolving growth and development of technologies for IoT use cases have given rise to several applications where a reliable symbol transmission (the technical problem of Shannon) is less relevant. In particular, the 5G and 6G wireless communications systems in the horizon of IoT are expected to create new applications where the semantic and goal performing aspects of the messages are the key concern. Furthermore, these applications suffer other challenges, such as having difficulty coping with generation of randomness and working with sophisticated random number generators. Also, in some case, a strict criterion on the performance speed for recognition/identification of an event is imposed, or it is needed to deal with an increasing size of the search space. In the following, we expand on such challenges in more detail and suggest the K-identification problem as a promising approach for them.

Semantic and goal-oriented communications: Let us define the K-identification problem considered in this paper as follows: Assume that the message set is

ℳ = {1, 2, \dots, M}

, and message i is sent by the transmitter. Furthermore, assume an arbitrary subset of the message set with size K by

K

. In the technical problem setting (symbol transmission), the receiver is interested in determining exactly which message is sent by the transmitter, i.e., to reconstruct the sent message. However, in the K-identification setting, the receiver is only interested in determining whether or not the sent message belongs to the set

K

. In other words, the receiver decided

i \in K

or

i \notin K

without stating exactly which message is sent. Note that, in principle, identification should be guaranteed for any goal identification message set

K \subseteq ℳ

of size

| K | = K

, regardless of whether these identification message sets are intended for one or different receivers. In the K-identification problem, receiver seeks to perform a specific goal/task if its desired message sent at the transmitter, belongs to a set of K messages. Therefore, this problem may help to deliver the semantic aspects associated with the messages and can be adapted to the goal/task-oriented communications settings. That is, the K-identification problem can be a compelling candidate/answer to the IoT needs for applications defined in the context of PSC. These applications often ignore a reliable transmission of bits/symbols, and instead are alarm-triggered and demand to convey the semantic aspects of the messages. Potential applications of the K-identification problem for IoT systems are considered in [25].

Randomness generation/management: The original problem of K-identification proposed by Ahlswede in [26] considers employing randomness in the encoding module of a communication setup. That is, for each message at the transmitter, a unique distribution is assigned, which associates/maps the message to a codeword. This randomized mechanism for the K-identification problem allows for a remarkable gain in terms of the number of different messages (or/and their semantics/effects) that can be conveyed to the receiver, namely a double exponential behavior for the size of the message set; cf. [26] for details. Although in majority of use cases for IoT applications, such a double exponential behavior demand might be already real and steadily increasing, it has not necessarily been a focus point when launching an IoT device on the market. This occurs mostly because of cost and integration barriers. Specifically, in order to ensure standard realization of distributions in the encoding procedure, a true random number generator (TRNG) [27] should be embedded in IoT devices and utilized. Hardware-based TRNGs are often difficult to launch, manage and maintain for specific use cases [28]. These difficulties can be mitigated by exploiting deterministic codes in the system design for some of the applications. In addition, deterministic codes often have the advantage of simpler implementation, simulation [29,30] and explicit construction [31]. As a result, the deterministic K-identification (DKI) considered in this paper may be regarded a promising solution for several IoT applications that do not comprised randomness in their encoding part.

Performance speed: In the standard identification with deterministic encoding (DI) problem (i.e.,

K = 1

) [32,33], the receiver performs a series of comparisons between a given goal message and each element of the message set (one-to-one comparison). However, in the DKI problem, the receiver is capable of performing a one-to-set comparison, i.e., an inclusion test. In other words, the receiver is searching for a specific message within an arbitrary set of K messages (goal message set), and is able to declare reliably whether or not a specific message which is searching for is included in the goal message set. This feature for the DKI problem may be regarded as an advantage in terms of speed in the set-wise search, compared to the DI for identification-based IoT devices. In the following, we explain from a quantitative perspective that why the one-by-one comparison as made in the DI is slow, and why the simple inclusion test as made in the DKI is fast. In order to evaluate the search performance speed of K-identification against the standard identification, let us define the time complexity that is required in order to exhaust the entire collection of subsets of size K as a metric. Then, observe that the message set

ℳ = {1, \dots, M}

with size M has

(\binom{M}{K})

subsets of size K, referred to as the search space. Now, note that the total search space is the power set of the message set, i.e., the set of all subsets of the message set with size

2^{M}

. Therefore, ratio of the size of the search space to the size of the power set for the message set, converges exponentially to zero in the message size, M, i.e.,

\begin{matrix} \frac{(\binom{M}{K})}{2^{M}} \leq \frac{2^{M H (K / M)}}{2^{M}} = 2^{M H (K / M) - 1)} \overset{n \to \infty}{\to} 0, \end{matrix}

(1)

for

K \geq 1

and

M - K \geq 1

, where the inequality holds by ([34], p. 353), with

H (z) ≜ - z log (z) - (1 - z) log (1 - z)

, being the binary entropy function. On the other hand, for the DI problem the sequence of one-to-one comparisons for the asymptotic codeword lengths, n (i.e., very large message set size) trades a long delay on the receiver’s proficiency with an inverse polynomial order in M. More specifically, the receiver searches for a single message among M different messages; therefore, the ratio of the size of the search space to the size of the whole search space is

1 / M

, which tends to zero for increasing M.

Growing search space: Some of the envisioned IoT applications may need a K-identification task where size of the goal message set

K = K (n)

has to grow in n. For example, where it is required that the size of the goal message set, K, for which the inclusion test (search in a set) is conducted, remains a fixed percentage order of the size of the message set. Therefore, by growing codeword length, n, which implies a growing size of the message set, the corresponding goal message set also grows. To account for these cases, we consider a generalized identification model, whose parameter

K \geq 1

can grow exponentially in n. Possible implications of this observation in the context of IoT include locating an malfunctioned server within a network of K web servers; spotting/detecting a faulty node in a local partition of wireless sensor network with size K; and in data mining within the procedure of sorting data, where some algorithms need to know that a desired data are included to which set of element with size K.

1.3. Binary Symmetric Channel

A binary symmetric channel (BSC) in information/coding theory is one of the most well-known and fundamental models for communications channels where the input and output alphabets are binary, i.e.,

{0, 1}

. In this model, each symbol (bit) sent by the transmitter experiences a distortion (flipping); that is, the received symbol (bit) can be flipped with a cross-over probability of

p \in (0, 1)

, but is otherwise received correctly. In contrast to the simplicity of the BSC, many information theoretical problems related to this model are still being investigated in the literature. For example, studying the behavior of the decoding error probabilities and characterization of them as a function of the codeword length n, in the asymptotic for the entire region of coding rate R, which requires knowing the analytic function of the so-called channel reliability function (CRF) [35], is still unknown. In addition, the error exponents for a binary symmetric channel in several settings are not yet completely characterized; cf. [35,36] for further details. The K-identification problem considered in this work is the most generalized and difficult version of the identification problem [26]; therefore, it is rather evident that studying this topic for a general model may be exceedingly hard. However, we can obtain some insights into the effects of the size of goal messages, K, by restricting our investigations to a basic/simple frame of model, i.e., the BSC. More specifically, such information is a theoretical endeavor dedicated to the basic BSC model, which can be useful in the subsequent aspects.

Upgrade to advanced models: Often, studying an information theoretical problem begins with considering the most basic and simple abstract model. This allows the theorists to develop the required analytical tools and techniques in more straightforward manner and benefit the specific results as guides to the use and analysis of more advanced models. In other words, general/advanced models can often inherit/benefit analytical tools, techniques, and comprehensive steps that have been developed for the basic models. For example, studying the DI problem for a discrete memoryless channel (DMC) [32] was initiated/sparked by an earlier work in the literature for the BSC [33].

Error correction codes and modulation: The simplicity of such a basic model with a binary alphabet often is favorable for an explicit code construction problem or for employing modulation techniques. This advantage facilitates the procedure of cultivating novel coding methods. For example, the widely used polar transmission codes are adopted initially for a binary input memoryless channel [31]. Therefore, the simplicity of the BSC model allows experts to utilize it as a promising candidate for evaluation/analyzing the performance of future error correction DKI codes.

Information theoretical characteristics: Several advanced channel models for IoT applications can be simplified/specialized to a BSC. This allows information theorists to examine basic characteristics of such IoT systems (CRF, error exponent, critical rate, etc.) and acquire decent analytical insights needed for practical aspects such as modulation/detection design and explicit code construction. Therefore, studying the BSC effectively yields/suggest solutions for more advanced problems of IoT [37]. In addition, the BSC model is a useful model for studying network coding, which is an important technique in order to enhance the performance of a communication network [36]. Concrete modern scenarios in IoT systems that include the BSC model are telephone links, radio communication lines [37], implementation of noise aggregation methods for physical layer security [4], decision fusions for multi-route and multi-hop wireless sensor networks [38], and multi-hop networks [39].

1.4. Information Theoretical Analysis of BSC-Based IoT Systems

Theoretical advancements of communication channels for IoT systems modeled by BSC are helpful for characterization of their performance limits, which may be used in related system designs. For example, evaluation of explicitly constructed codes for such applications against such performance limit bounds may provide instructive recommendations/interpretations for the sake of efficient encoding/decoding procedures. In this context, for a given error probability and with no restriction imposed on the codeword length, the Shannon message transmission (TR) capacity of the BSC is studied in [17]. In [40,41,42,43], for a specified codeword length and a fix rate less than the TR capacity, the error probability for the optimal TR code is investigated. The problem of construction of optimum or at least good codes for TR problem with a given rate and codeword length is addressed in [40,44,45,46]. Furthermore, the TR capacity of the BSC is shown to be attained by Bernoulli input with

1 / 2

success probability, i.e.,

X \sim Bern (1 / 2)

[35]. In [41], random linear code for the achievability proof with an exponential decoding search is investigated.

However, in the research that is currently available, the BSC has mostly been studied for the TR problem. On the other hand, in [33], the DI for the BSC without input constraint is studied, where the lower bound on the DI capacity is established. In addition, in [32], the DI for the BSC with input constraint in a generalized context of the channel model, namely, DMCs is addressed and an extensive proof, dedicated for the BSC, was not provided. Based on the author’s information, for the BSC with input constraint, with the exception of this paper’s conference version [47], the ultimate performance limits for the deterministic K-identification (DKI) problem have not yet been examined in the literature.

1.5. Applications of the K-Identification Problem for IoT

The use of PSC for MC systems, whose objective is based on recognition of specific event, is studied in [20,48]. In the vision of IoT, the identities of the things are often required to be verified for each other. This identification task is needed in order to make sure that the things can address and reliably communicate with themselves. Consequently, the identification capacity [49] is the primary relevant quantitative metric in such systems, and the TR capacity [17] may not be the primary performance measure. In particular, for event-recognition, alarm-prompt or object-finding problems, where the receiver aims to recognize the occurrence of a specific event, determine an alarm, or realize the presence of an object, with respect to a set, in terms of a reliable Yes/No final decision, the so-called K-identification capacity [26] is the appropriate metric. For the K-identification problems, the receiver is focused on a subset of size K of the message set,

ℳ

, which is known as the goal message set. The recipient chooses a message at random, and confirms if it is part of the specified goal message set. The error requirements imposed on the associated K-identification codes guarantee that each inclusion test is reliable for every arbitrary choice of the goal message set.

In the context of IoT, specific instances of the K-identification problem may be found in the detection of damaged cells in a memory disk drive, where, e.g., a failure detector wants to know whether or not the corrupted cell is present in a group of cells; in lottery prize events, where, e.g., a person aims to determine whether a winner is among their favorite teams or where people seek to know if a specific lottery number is among their collection of numbers; in smart traffic management, where, e.g., one may be interested in finding to which group/set of streets a goal location belongs to. Additionally, K-identification might be used in health monitoring within the context of smart medical and health-care systems. For example, in a remote surgery [50], where the inclusion of a particular cancer or illness inside a goal group of K-cancers/diseases may be the communication goal. Finally, the K-identification problem may find applications in the generalized identification with decoding problem [26] in various IoT applications. Such a problem is an extension of the K-identification, wherein when the receiver identifies that the message belongs to set

K

, and it also identifies the message itself.

1.6. Contributions

In this paper, we address identification systems whose encoders are deterministic and their receiver is required to conduct the K-identification job, i.e., spotting an object/event/message within a set of goal objects/events/messages with size

K = 2^{n κ}

for some

κ \in [0, 1)

. We assume that the communication over n channel uses are independent of each other, and the noise is additive Bernoulli process. We formulate the problem of DKI over the BSC with and without Hamming weight input constraint. Our primary goal is to study the BSC’s DKI capacity region. This study specifically contributes the subsequent contributions:

◊: Generalized identification model: We examine the BSC, in which the size of the goal message set, K, may scale with the codeword length, n. As a consequence, this model incorporates the DI with $K = 1$ , and DKI with constant $K >$ 1. Therefore, we can confirm whether asymptotic codeword lengths allow for reliable identification, even when the goal message set grows in size, using our suggested generalized model. As far as is known by the authors’ knowledge, no previous research has been conducted on a generalized DKI model in the literature.
◊: Codebook scale: We prove that, for K-identification over the BSC with deterministic encoding, the codebook size grows in n, similarly to that of the DI problem ( $K = 2^{0} = 1$ ) [32,33] and the TR problem [17] over the same channel, namely exponentially in the codeword length n, i.e., ∼ $2^{n R}$ , where R is the DKI coding rate, even when the size of the goal message set grows exponentially in n, i.e., $K = 2^{n κ}$ , where $κ \in [0, 1)$ is the identification goal rate, and certain functions of the channel statistics and input restrictions set upper bounds on it. This result implies that one can extend the collection of goal messages for identification without compromising the codebook’s scalability.
◊: Capacity formula: We derive inner and outer bounds on the DKI capacity region for constant $K \geq 1$ and growing $K = 2^{n κ}$ , for the BSC with and without Hamming weight constraints. Our capacity bounds reflect the impact of the channels statistics, i.e., cross-over probability and the input constraint A in the optimal scale of the codebook size, i.e., $2^{n R}$ . In particular, in the coding procedure, we define a parameter $β \in (0, 1)$ , referred to as the distinction property of the codebook which adjust the Hamming distance property for the constructed codebook. Then, assuming a given codebook distinction, $β$ , a channel with asymptotic small cross-over probability (i.e., an almost perfect channel) causes the feasible range for the goal identification rate $κ$ to shrink; that is, the capability of the BSC for K-identification decreases, which is unfavorable. On the other hand, when the cross-over probability increases and converges to its maximum possible values, i.e., $ε \to 1 / 2$ (almost pure noisy channel), then the feasible range for $κ$ begins to enlarge favorably. This observation can be interpreted as follows: The channel noise can be exploited as an additional inherent source embedded in the communication setting for performing the K-identification task with a larger value of K. This observation is in contrast to previous results for DKI over the slow fading channel [51], or the DI for Gaussian and Poisson channels [32,48,52], where capacity bounds were shown to be independent of the input constraints or the channel parameters. We demonstrate that the suggested upper and lower bounds on attainable rates $(R, κ)$ are independent of K for constant K, whereas they are functions of the goal identification rate $κ$ for increasing goal message sets.
◊: Technical novelty: To obtain the proposed inner bound on the DKI capacity region, we address the input set imposed by the input constraints, and exploit it for an appropriate ball covering (overlapping balls with identical radius); namely, we consider covering of hyper balls inside a Hamming cube, whose Hamming radius grows in the codeword length n, i.e., ∼ $n β$ , for some $β \in (0, 1)$ upper bounded by a function of the channel statistic. We exploit a greedy construction similar as for the Gilbert bound method. While the radius of the small balls in the DI problem for the Gaussian channel with slow and fast fading [32], tends to zero as $n \to \infty$ , here, the radius similar to the DKI problem for the slow fading channel [51] grows in the codeword length n for asymptotic n. In general, the derivation of lower bound for the BSC is more complicated compared to that for the Gaussian [32] and Poisson channels with/out memory [48,52], and entails exploiting of new analysis and inequalities. Here, the error analysis in the achievability proof requires dealing with several combinatorial arguments and using of bounds on the tail for the cumulative distribution function (CDF) of the Binomial distribution. The DKI problem was recently investigated in [52] for a DTPC with ISI where the size of the ISI taps is assumed to scale as $L (n, l) = 2^{l log n}$ . In contrast to the findings in [52], where the attainable rate region of triple rates $(κ, l, R)$ for the Poisson channel with memory was derived, here, we study the DKI problem for a memoryless BSC, i.e., $L = 1$ , and the attainable rate region of pair rates $(κ, R)$ is established. Furthermore, while the method in the achievability proof of [52] is based on sphere packing, which includes an arrangement of non-overlapping spheres in the feasible input set. Here, we use a rather different approach called sphere/ball covering, which allows for the spheres/ball to overlap with each other. For the derivation of the outer bound on the DKI capacity region, it is assumed that a random series of code with diminishing error probabilities is provided. Then, for such a sequence, we prove that an one-to-one mapping between the message set and the set of the feasible input set (induced by the input constraint) can be established. Unlike the previous upper bound proof for DI over the DMC [32]; here, the proof for corresponding lemma is adopted in order to incorporate relevant set of the goal message sets, appropriately. Moreover, in the converse proof, similarly to [52], the method of proof by contradiction was utilized; that is, assuming that a certain property regarding the distance or number of the codewords is negated, we lead to a contradiction related to the sum of the sort I and sort II error probabilities. However, unlike [52], where a sub-linear function for the size of the goal message set was considered, i.e., $K (n, κ) = 2^{κ log n} = n^{κ}$ , here, our converse entails a faster function, namely $K (n, κ) = 2^{κ n}$ .

Notations: We use the subsequent notations throughout this paper: We use symbol ≜ for a definition. Alphabet sets are shown by blackboard bold letters

K, X, Y, Z \dots

. Random variables (RVs) are indicated by upper case letters

X, Y, Z, \dots

. Constants and values (realization) of RVs are specified by lower case letters

x, y, z, \dots

. Row vectors of size n, i.e.,

x = (x_{1}, \dots, x_{n})

and

y = (y_{1}, \dots, y_{n})

, are represented by lower case bold symbol x and y. The distribution of a RV X is specified by a probability mass function (pmf)

p_{X} (x)

over a finite set

𝒳

. The CDF of a Binomial RV is indicated by

B_{X} (x) ≜ Pr (X \leq x)

. All information quantities and logarithms are in base 2. Symbol

[[M]]

represents the set of all consecutive natural numbers from 1 to M. We indicate the modulo two addition operator by ⊕. The number of points for which the corresponding symbols for two sequences,

x_{1}

and

x_{2}

, are different is known as the Hamming metric (distance), i.e.,

d_{H} (x_{1}, x_{2}) ≜ \sum_{t = 1}^{n} δ (x_{i_{1}, t}, x_{i_{2}, t})

, where

δ (\cdot, \cdot)

is the Kronecker delta, defined as follows:

\begin{matrix} δ (x_{i}, x_{j}) = 〈 \begin{matrix} 1 & x_{i} \neq x_{j}, \\ 0 & x_{i} = x_{j} . \end{matrix} \end{matrix}

(2)

The Hamming cube is defined as the set of binary sequences with length n, and is denoted by

H^{n}

. The n-dimensional Hamming hyper ball of radius r for integers

n, r

such that

n \geq r \geq 1

, in the binary alphabet, centered at

x_{0} = (x_{0, t}) |_{t = 1}^{n}

, is defined as

\begin{matrix} B_{x_{0}} (n, r) = {x^{n} \in X^{n} : d_{H} (x, x_{0}) \leq r} . \end{matrix}

(3)

Specifically,

B_{x_{0}} (n, r)

for alphabet

X^{n} = H^{n}

, center

0 ≜ (0, \dots, 0)

and radius

r = n A (A \geq 0)

is given by

B_{0} (n, n A) = {x \in H^{n} : \sum_{t = 1}^{n} x_{t} \leq n A}

. The volume of the Hamming hyper ball

B_{x_{0}} (n, r)

in the q-ary alphabet is defined as the number of points that lie inside the ball, and is denoted by

Vol (B_{x_{0}} (n, r))

. The set of whole numbers is denoted by

N_{0} ≜ {0, 1, 2, \dots}

. The q-ary entropy function

H_{q} : [0, 1] \to R

for positive integer

q \geq 2

, is defined as

H_{q} (ε) ≜ x {log}_{q} (q - 1) - x {log}_{q} x - (1 - x) {log}_{q} (1 - x)

.

H_{q} (\cdot)

for

q = 2

, is denoted by

H (\cdot)

, and is defined as

H (ε) ≜ - ε log (ε) - (1 - ε) log (1 - ε)

. Throughout the paper, we denote the BSC with cross-over probability

ε \in (0, 1 / 2)

by

ℬ

.

1.7. Organization

This paper is structured as follows. Section 2 provides background information on the identification and K-identification problems, and reviews previous results on them. In Section 3, system model and fundamental definitions are established, and the background knowledge about DKI codes are provided. Section 4 introduces our primary results and contributions for the DKI capacity of the BSC. In the end, Section 5 include a summary and possible directions for more research.

2. Background on the Identification Problem

In the subsequent section, we give the background for the current work and establish the identification problem. Also, we motivate for the deterministic-encoder identification versus the well-known randomized-encoder identification (RI) scheme. In addition, we review relevant previous results on the DI, RI, DKI, and randomized K-identification (RKI) capacities for different channels.

2.1. Identification Problem

In the Shannon communication problem [17], a sender encodes its message in a manner that the receiver can perform a reliable reconstruction. That is, the receiver is interested in knowing which message was sent from the transmitter. In contrast, the coding design for the identification setting [49] is intended to conduct a different goal, namely to find out if a desired message was sent by the transmitter or not. Furthermore, we assume that prior to the communication, the transmitter is not informed of the message that the receiver seeks to identify.

Randomized identification: The identification problem (which has been studied in various setting of deterministic or randomized protocols, in the context of communication complexity; see [53,54]) in communication theory is initiated by Ahlswede and Dueck in [49], where a randomized encoder is employed to select the codewords. In this problem, the codewords are chosen based on their corresponding distribution, and the codebook size grows double-exponentially in the codeword length n, i.e., ∼

2^{2^{n R}}

[49], where R is the coding rate. This observation stands different from the TR problem, where the size growth for the codebook is only exponentially with the codeword length, i.e., ∼

2^{n R}

. The realization of explicitly constructed RI codes features high complexity, and is often challenging for the applications of MC in the context of IoBNT; cf. [48] for further details. However, in [55,56], explicit construction of RI codes using algebraic codes (Reed-Solomon) has been considered.

Deterministic identification: Although the remarkable properties of RI schemes for the codebook size may seem appealing for some applications, in several practical settings, using a huge amount of randomness may not be favorable. Examples include MC, where implementation in the nano-scaled environment is prohibitive [51], or in a pessimistic jamming scenario, where it is assumed that the radar jammer has access to the whole codebook [57]; therefore, using randomness results in extra expenses and does not guarantee a benefit. Additionally, deterministic codes typically offer advantages such as ease of implementation, simulation experimentation [29,30], and systematic construction [31]. The motivation of Ahlswede and Dueck to develop the RI problem [49] is probably traced back to the work of JáJá [33], who considered DI from a communication complexity perspective (an important observation regarding the behavior of the identification function has been well studied in communication complexity, where the out-performance of randomized protocols over the deterministic protocols (exponential gap between the two classes) for computing such a function is established; for instance, while the error-free deterministic complexity of the identification function is lower bounded by

log m

, where m is the length of message, for the randomized protocol and when

ε

error is allowed in computation of the identification function, only

O (log log m + 1 / ε)

bits suffices; see [54,58] for further details); that is, where the codewords are determined by a deterministic function from the messages. Moreover, it seems that Ahlswede and Dueck were inspired to show that employing randomness similar to what has been accomplished in the communication complexity field yields an advantage of exponential gap compared to the DI problem (a detailed comparison of codebook sizes in DI and RI problem over various channel models can be found in [48]) for the codebook size. In application cases where complexity is restricted, DI could be preferred over RI. For instance, in MC systems, where the development and deploying of a huge number of random sources (distributions) may not be clear.

K-identification scenario: In the standard DI or RI problems [32,49], the receiver aims to identify the occurrence of a single message, that is, the decoder at the receiver selects an arbitrary message from the message set referred to as the goal message, and then, by exploiting a decision rule (decoder), determines reliably whether or not this goal message is identical to the sent message. The identification problem can be extended in the subsequent sense: The receiver chooses a subset of K messages from the message set, called the goal message set (denoted by

K

) and, unlike the standard DI or RI problems, it checks whether or not the sent message is a member of

K

. This problem is called K-identification in the literature [26]. The goal message set selected by the receiver can be any arbitrary subset of the message set of size K, among the total

(\binom{M}{K})

such subsets.

The K-identification framework can be thought of as a generalization of DI or RI problems, in which the receiver’s single goal message is replaced with a collection of K goal messages, where

K \geq 1

. Therefore, the DKI for the special case where

K = 1

corresponds to the DI problem studied in [48,59]. Moreover, the K-identification problem is extended in [26] to generalized identification with decoding, where when the receiver identifies that the message belongs to set

K

, it also identifies the message itself. The K-identification problem, as considered in this paper, is different from a similar scheme called multiple object identification [60], where the sender’s data contains the information of K messages and the receiver’s objective is to identify whether or not a specific message belongs to set

K

. Here, it is assumed that the receiver does not know the set of objects selected by the sender.

2.2. Previous Results on DI Capacity

The DI problem for DMCs subject to an average constraint, is studied in [32] and a full characterization of capacity is established. Therein, the codebook size similar to that of the TR problem [17], is shown to grow exponentially in the codeword length, i.e., ∼

2^{n R}

[32]. Ahlswede and Cai studied the DI problem for the compound channels in [57]. Furthermore, recent observation for DI over continuous input alphabet channels including Gaussian channels with fast and slow fading [32], memoryless discrete-time Poisson channel (DTPC) [48], DTPC with inter-symbol interference (ISI) [52], and Binomial channel [59], revealed a new observation regarding the codebook size, namely, it scales super-exponentially in the codeword length, i.e., ∼

2^{(n log n) R}

, which is different than the standard exponential [32] and double exponential [49] behavior for DI and RI problems, respectively.

2.3. Previous Results on DKI Capacity

Ahlswede studied RKI for DMC in ([26] Th. 1), and showed that assuming

K = 2^{n κ}

, the set of all attainable pairs

(R, κ)

, where R is the RKI coding rate and

κ

is the goal identification rate, contains

\begin{matrix} \{(R, κ) : 0 \leq R, κ; R + 2 κ \leq C_{TR}\}, \end{matrix}

(4)

where

C_{TR}

is the TR capacity of the DMC. The DKI problem for the slow fading channels, denoted by

𝒢_{s l o w}

, in the presence of an average power constraint and assuming a codebook size of super-exponential scale, i.e.,

K (n, κ) = 2^{κ log n}

, is studied in [51], and the subsequent bounds on the DKI capacity are established:

\begin{matrix} \frac{1 - κ}{4} \leq C_{DKI} (𝒢_{s l o w}, M, K) \leq 1 + κ, \end{matrix}

(5)

for

0 \leq κ < 1

. As far as we know, there has not yet been any research performed in the literature on the DKI capacity of the BSC with input constraint, which is pertinent to IoT systems; hence, it is the primary emphasis of this study.

3. System Model and Preliminaries

This section presents the selected system model, and some preliminaries regarding DKI coding are established.

3.1. System Model

We target a communication setting, which is focused on the identification goal; that is, the objective of the decoder is defined as follows: Determine if the sent message belongs to a goal group of messages of size K. In order to do this, the transmitter and the receiver build (the suggested inner and outer bounds on the DKI capacity region functions, whether or not a particular code is utilized for the communication; however, in order to approach the capacity limits, appropriate, explicitly built codes could be needed), a coded communication channel over n, uses of the binary symmetric channel. We assume that the random variables (RVs)

X \in {0, 1}

and

Y \in {0, 1}

indicate model the input and output of the channel. Each binary input symbol is flipped with probability

0 < ε < 1 / 2

; see Figure 1. The stochastic flipping (the extreme cases of

ε = 0

or

ε = 1 / 2

result in

C_{TR} = 1

and

C_{TR} = 0

, respectively; hence, these cases are commonly excluded from the analysis) of the input symbol is modeled via an additive binary Bernoulli noise, i.e.,

Z \in {0, 1}

. Therefore, the input–output relation of channel reads:

Y = X \oplus Z

, where ⊕ indicate the modulo two addition. That is, the channel input/output

X / Y

are related as follows:

\begin{matrix} W (Y | X) = 〈 \begin{matrix} 1 - ε & Y = X, \\ ε & Y \neq X, \end{matrix} \end{matrix}

(6)

for all

X, Y \in {0, 1}

and

0 < ε < 1 / 2

.

Furthermore, it is assumed that the various channel uses are independent of one another and that the communication channel is memoryless. Therefore, the transition probability distribution for n channel uses is given by

\begin{matrix} W^{n} (y | x) = \prod_{t = 1}^{n} W (y_{t} | x_{t}) = ε^{d_{H} (x, y)} {(1 - ε)}^{n - d_{H} (x, y)}, \end{matrix}

(7)

where

x = (x_{1}, \dots, x_{n})

and

y = (y_{1}, \dots, y_{n})

stand for the sent codeword and received signal, respectively, and

d_{H} (\cdot)

denotes the Hamming distance. Observe that

d_{H} (x, y)

is a RV, and follows a Binomial distribution; see Remark 1. We assume that the codewords are restricted by an input constraint of the form

\frac{1}{n} \sum_{t = 1}^{n} x_{t} \leq A

, where

A > 0

constrain the Hamming weight over the entire n channel uses in each codeword normalized by the codeword length.

Memoryless property: In the standard modeling of the BSC, we assume that the channel is exploited at different time instances in an independent manner; that is, the communications of symbols at distinct time instances are statistically independent of each other. However, in the physical channels, such as telephone lines with impulse noise or slowly fading radio communications with binary alphabet, communication is usually dispersive and the channel exhibit memory [35,61]. Therefore, appropriate steps need to be take in order to ensure the orthogonality of the different channel uses. Some immediate approaches include applying interlacing or scrambling the symbols of a codeword; cf. [61] for further details. Therefore, in the analysis, we can assume that such methods can be applied to circumvent the effect of channel memory and assert statistical independence between different channel noise samples to ensure the memoryless property.

3.2. DKI Coding for the BSC

The definition of a DKI code for the BSC,

ℬ

, is given below.

Definition 1

(BSC DKI Code). An

(n, M (n, R), K (n, κ), e_{1}, e_{2})

-BSC-DKI code for a BSC $ℬ$ for integers

M (n, R)

and

K (n, κ)

, where n and R are the codeword length and coding rate, respectively, is defined as a system

(C, J_{K})

, which consists of a codebook

C = {c_{i}}_{i \in [[M]]} \subseteq H^{n}

, with

c_{i} = (c_{i, t}) |_{t = 1}^{n} \subseteq H^{n}

, such that

n^{- 1} \sum_{t = 1}^{n} c_{i, t} \leq A, \forall i \in [[M]]

and a decoder (We recall that the decoding sets for the DKI problem, similarly to that for the RI problem, may have in general intersection; however, to guarantee a vanishing sort II error probability for the asymptotic codeword lengths n, an optimal decoder may be defined in a way such that the size of such intersection regions becomes negligible)

J_{K} \subseteq H^{n}

, where

K

is an arbitrary subset (recall that the system (family) of all subsets of the set

[[M]]

, of size K, is

\{K \subseteq [[M]]; | K | = K\}

; note that

| {K \subseteq [[M]]; | K | = K} | = (\binom{M}{K})

, and the error requirements, required by the DKI code definition, apply to every possible choice of the set

K

with K arbitrary messages among all

(\binom{M}{K})

cases) of

[[M]]

with size K, see Figure 2 and Figure 3.

The encoder sends codeword

c_{i}

, given a message

i \in [[M]]

, and the decoder’s job is to solve a binary hypothesis: was

j \in K

a goal message that was sent or not? See Figure 3. There exist two sorts of errors that may happen:

◊: Sort I Error Event: Rejection of the actual message; $i \in K$ .
◊: Sort II Error Event: Acceptance of a wrong message; $i \notin K$ .

The associated error probabilities of the DKI code

(C, J)

read

\begin{matrix} P_{e, 1} (i, K) & = Pr (Y \in J_{K}^{c} | x = c_{i}) = 1 - \sum_{y \in J_{K}} W^{n} (y | c_{i}), \forall i \in K miss - identification, \end{matrix}

(8)

\begin{matrix} P_{e, 2} (i, K) & = Pr (Y \in J_{K}^{c} | x = c_{i}) = \sum_{y \in J_{K}} W^{n} (y | c_{i}), \forall i \notin K false identification, \end{matrix}

(9)

where, for every

e_{1}, e_{2} > 0

, fulfill the bounds

P_{e, 1} (i, K) \leq e_{1}, \forall i \in K

and

P_{e, 2} (i, K) \leq e_{2}, \forall i \notin K

, where

K \in {K \subseteq [[M]]; | K | = K}

is an arbitrary subset of

[[M]]

with size K.

Definition 2

(DKI Coding/Goal Identification Rates). The codebook size

M (n, R)

and the goal message set size

K (n, κ)

are sequences of non-decreasing monotonically functions in the codeword length n, with

R, κ

, and l indicating the DKI coding rate and the goal identification rate, respectively. In this work, we consider the subsequent functions:

\begin{matrix} M (n, R) = 2^{n R} and K (n, κ) = 2^{n κ} . \end{matrix}

(10)

Thereby, the DKI coding rate, R, and the goal identification rate, κ, are defined as follows (additionally, in the literature, other rate definitions for different communication settings are adopted; for example, in the RI [49] problem, the RI coding rate is defined as

(log log M) / n

, while in the TR [17] or DI [32] problems for a DMC, the TR and DI coding rates are given by

R = (log M) / n

.):

\begin{matrix} R = \frac{log M}{n}, κ = \frac{log K}{n} . \end{matrix}

(11)

Definition 3

(Attainable Rate Region). The pair of rates

(R, κ)

is called attainable if, for every

e_{1}, e_{2} > 0

and sufficiently large n, there exists an

(n, M (n, R), K (n, κ), e_{1}, e_{2})

-BSC-DKI code. Then, the set of all attainable rate pairs

(R, κ)

is referred to as the attainable rate region for the BSC,

ℬ

, and is denoted by

R_{DKI} (B, M, K)

.

Definition 4

(Capacity Region/Capacity). The operational DKI capacity region of the BSC,

ℬ

, is defined as the closure of all attainable rate triples

(R, κ)

(the closure of a set

A

consists of all points in

A

together with all limit points of

A

, where the limit point of

A

is a point x that can be approximated by the points of

A

; see [62] for further details), and is denoted by

C_{DKI} (B, M, K)

. For the standard identification (

K = 1

), the capacity region is specialized to a single point, also called the DI capacity which is the supremum of all attainable DI coding rates, R. The DI capacity is denoted by

C_{DI} (B, M)

.

Remark 1

(Distribution of Output Statistics). Assuming that the codeword

c_{i}

is sent and the channel output y is observed at the receiver, the number of cross-overs (flips) that occurs in the channel is given by

d_{H} (y, c_{i})

. Therefore, the probability that k cross-overs among the n channel uses occurs, follows a Binomial distribution with parameters n and ε as follows:

\begin{matrix} Pr (d_{H} (Y, c_{i}) = k) = (\binom{n}{k}) ε^{k} {(1 - ε)}^{n - k} . \end{matrix}

(12)

4. DKI Capacity Region of the BSC

In this section, we first present our main results, i.e., the inner and outer bounds on the attainable rates region for

ℬ

. Subsequently, we provide the detailed proofs.

4.1. Main Results

Our DKI capacity region theorem for the BSC, $ℬ$ , is stated below.

Theorem 1.

Let $ℬ$ indicate a BSC with cross-over probability

0 < ε < 1 / 2

, and let

β \in (0, β_{\max})

be an arbitrary constant, where

β_{\max} ≜ (4 ε) / (2 ε + 1)

. Further, let

H (p)

indicate the binary entropy function and the tangent line of

H (p)

in point ε be specified as follows:

T_{ε} (p) = H (ε) + (p - ε) \frac{d H (p)}{d p} |_{p = ε} .

Next, assume that $ℬ$ endows an exponential size for the codebook and the goal message set, i.e.,

M (n, R) = 2^{n R}

and

K (n, κ) = 2^{n κ}

, respectively, where the codewords are subject to the Hamming weight constraint of the form

n^{- 1} \sum_{t = 1}^{n} c_{i, t} \leq A, \forall i \in [[M]]

. Now, let us define the subsequent functions:

\begin{matrix} f_{1} (ε, β) ≜ & \frac{(1 - β / 2) ε - β / 4}{1 - β}, \end{matrix}

(13)

\begin{matrix} f_{2} (ε, β) ≜ & (1 - β / 2) ε + β / 4 . \end{matrix}

(14)

Next, let us define the inner and outer rate regions, i.e.,

R^{inn} (B)

and

R^{out} (B)

, respectively, as follows:

\begin{matrix} R^{inn} (B) ≜ ⋃_{β \in (0, β_{\max})} R_{β}^{inn} (B), \end{matrix}

(15)

where

\begin{matrix} R_{β}^{inn} (B) ≜ 〈 \begin{matrix} {(R, κ); 0 \leq R \leq H (A) - H (β), 0 \leq κ < min (κ_{UB}^{1}, κ_{UB}^{2})} & A < 1 / 2, \\ {(R, κ); 0 \leq R \leq 1 - H (β), 0 \leq κ < min (κ_{UB}^{1}, κ_{UB}^{2})} & A \geq 1 / 2, \end{matrix} \end{matrix}

(16)

with

\begin{matrix} κ_{UB}^{1} & ≜ T_{ε} (f_{1} (ε, β)) - H (f_{1} (ε, β)), \end{matrix}

(17)

\begin{matrix} κ_{UB}^{2} & ≜ T_{ε} (f_{2} (ε, β)) - H (f_{2} (ε, β)), \end{matrix}

(18)

and

\begin{matrix} R^{out} (B) ≜ 〈 \begin{matrix} {(R, κ); 0 \leq R \leq H (A), 0 \leq κ \leq H (A)} & A < 1 / 2, \\ {(R, κ); 0 \leq R \leq 1, 0 \leq κ \leq 1} & A \geq 1 / 2 . \end{matrix} \end{matrix}

(19)

Then, the DKI capacity region

C_{DKI} (B, M, K)

is bounded by

\begin{matrix} R^{in} (B) \subseteq C_{DKI} (B, M, K) \subseteq R^{out} (B) . \end{matrix}

(20)

Proof of Theorem 1.

The proof of Theorem 1 comprises two components, presented in Section 4.2 and Section 4.3, respectively, which are the inner and the outer bound proofs. □

Corollary 1

(DI Capacity of The BSC). The inner and outer bounds for the DKI capacity region of the BSC,

ℬ

, for an extreme case (standard identification) where the goal message set consists of only one message, i.e.,

K = 1

, recover the previous results for the BSC with Hamming constraint ([32] Ex. 1):

\begin{matrix} C_{DI} (B, M) & = \{\begin{matrix} H (A) & if A < 1 / 2, \\ 1 & if A \geq 1 / 2, \end{matrix} \end{matrix}

(21)

and the BSC without Hamming constraint ([33] Th. 3.1):

\begin{matrix} C_{DI} (B, M) = 1 . \end{matrix}

(22)

Proof.

The proof is obtained directly by placing

K = 1

into the upper bounds given in (17) and (18) in Theorem 1, and making further mathematical simplifications. In particular, we show that closure of the inner bound for

K = 1

coincides the outer bound. Therefore, a full characterization of the capacity region is yielded. We begin with the subsequent observation: The upper bounds provided in (17) and (18) for

K = 2^{n κ} = 1 (κ = 0)

tend to zero. That is,

\begin{matrix} R^{inn} (B) |_{κ = 0} & = ⋃_{β \in (0, β_{\max})} R_{β}^{inn} (B) |_{κ = 0} \end{matrix}

(23)

where

\begin{matrix} R_{β}^{inn} (B) |_{κ = 0} = 〈 \begin{matrix} {(R, κ); 0 \leq R \leq H (A) - H (β), κ = 0} & if A < 1 / 2, \\ {(R, κ); 0 \leq R \leq 1 - H (β), κ = 0} & if A \geq 1 / 2 . \end{matrix} \end{matrix}

(24)

Next, observe that the outer bound provided in (19) for

K = 2^{n κ} = 1 (κ = 0)

is given by

\begin{matrix} R^{out} (B) |_{κ = 0} ≜ 〈 \begin{matrix} {(R, κ); 0 \leq R \leq H (A), κ = 0} & if A < 1 / 2, \\ {(R, κ); 0 \leq R \leq 1, κ = 0} & if A \geq 1 / 2, \end{matrix} \end{matrix}

(25)

which is the closure of the inner bound. Therefore, since the closure of the inner bound region calculated in (24) coincides with the outer bound region given in (25), we obtain a closed form formula for the DI capacity of the BSC as follows:

\begin{matrix} C_{DI} (B, M) & = C_{DKI} (B, M, K = 1) = \{\begin{matrix} H (A) & if A < 1 / 2, \\ 1 & if A \geq 1 / 2, \end{matrix} \end{matrix}

(26)

where there is a Hamming constraint, and

\begin{matrix} C_{DI} (B, M) = 1, \end{matrix}

(27)

where there is no Hamming constraint. This concludes the proof of Corollary 1. □

Proof.

The proof of Theorem 1 comprises two components, presented in Section 4.2 and Section 4.3, respectively, which are the achievability and converse proofs. □

Here, we summarize some key findings from the proof of Theorem 1.

◊

Input constraint: Theorem 1 reveals an important observation regarding the impact of the input constraint (when it is effective, i.e.,

0 < A < 1 / 2

) on the inner and outer regions formulas for the DKI capacity. In contrast to previous results for DI over Gaussian channel [32] or DKI over slow fading channel [51], where the capacity bounds does not reflect the impact of the input constraint, our results for DKI over the BSC in this paper reflect the impact of the Hamming weight constraint on the inner and outer regions.

◊

Scale of codebook: The inner and outer bounds on the DKI capacity region given in Theorem 1 are valid in the standard scale for the codebook size, i.e.,

M = 2^{n R}

, where R is the coding rate. This result coincides the conventional behavior of the codebook size for TR [17] and DI [32] problems over the BSC. Other scales higher than the exponential for the codebook size of K-identification problem are reported in the literature; see Figure 4.

◊

Scale of goal message set: Theorem 1 unveils that the size of the set of the goal messages scales exponentially in the codeword length, i.e., ∼

2^{n κ}

. In particular, the result in Theorem 1 about size of the goal message set constitutes of the subsequent three cases in terms of K:

DI, $K = 1$ : In this scenario, the goal message set is a degenerate case; that is, $K = {i}$ , with $i \in [[M]]$ , and is equivalent to the standard identification setup ( $κ = 0$ ), where $| K | = K = 1$ . As a result, the identification setup in randomized regimes [49] and deterministic regimes [32] can be thought of as a particular instance of the K-identification that is examined in this work. See Corollary 1 for further details.
Constant $K > 1$ : The scenario where $κ \to 0$ as $n \to \infty$ is implied by a constant $K > 1$ . Our capacity bounds in Theorem 1 on the attainable rate pairs $(R, κ)$ are the same as those for $K = 1$ . That is, the result in this cases converge to those for $K = 1$ given in Corollary 1, for the asymptotic $n \to \infty$ .
Growing K: The fact that a trustworthy K-identification is still attainable, even in cases where K scales with the codeword length as ∼ $2^{n κ}$ for some $κ \in [0, 1)$ , is another significant finding of Theorem 1 ; see Figure 5.

We provide the inner bound proof in Section 4.2 and the outer bound proof in Section 4.3 as the proof of Theorem 1.

4.2. Inner Bound (Achievability Proof)

Before we provide the inner bound proof, we explain on our methodological approaches that are used here and expand on them. In particular, similar to other information theoretical problems, the derivation of the inner bound on the DKI capacity region, consists of the subsequent two main steps:

◊: Step 1 (rate analysis): First, we propose a greedy-wise method for codebook construction, which has a flavor similar to that observed in the classical approach of the Gilbert–Varshamov (GV) bound (the early introduction of such a bound in the literature is accomplished by Gilbert in [63]) for covering of overlapping balls embedded in the input set. More specifically, we introduce a codebook of exponential size in the codeword length n, which fulfills the input constraint and enjoys a Hamming distance property; namely, every pair of distinct codewords are separated by a certain distance. Moreover, we introduced a parameter $β$ in order to account/adjust such a distance. This step is particularly relevant in the sort II error analysis, as well as the derivation for the final lower bound on the identification coding rate. Additionally, we identify the whole range across which the parameter $β$ can change, which is needed to derive an analytical lower bound on the corresponding codebook size.
◊: Step 2 (error analysis): In the second part (error analysis), we show that the suggested codebook in the previous part is optimal, i.e., leads to an attainable rate pairs $(R, κ)$ . To this end, we begin with introducing a decision rule which is a distance decoder based on the Hamming metric, and would show that the associated errors of the sort I and the II probabilities vanish in the asymptotic codeword length, i.e., when $n \to \infty$ . Moreover, the error analysis for the sort II error probability determines the associated error exponent. As a result, the feasible region for the goal identification rate is obtained.

In the following, we confine ourselves to codewords that meet the subsequent condition:

n^{- 1} \sum_{t = 1}^{n} c_{i, t} \leq A

,

\forall i \in [[M]]

. Furthermore, we divide them into two cases:

◊: Case 1—with Hamming weight constraint: $A \leq 1$ , then the condition $n^{- 1} \sum_{t = 1}^{n} c_{i, t} \leq 1, i \in [[M]]$ is non-trivial in the sense that it induces a strict subset of the entire input set $H^{n}$ . We denote such subset by $B_{0} (n, n A)$ and is equivalent to ${‖ c_{i} ‖}_{1} \leq A$ .
◊: Case 2—without Hamming weight constraint: $A \geq 1$ , then each codeword belonging to the n-dimensional Hamming cube $H^{n}$ fulfilled the Hamming weight constraint, since $\frac{1}{n} \sum_{t = 1}^{n} c_{i, t} \leq 1 \leq A, i \in [[M]]$ . Therefore, we address the entire input set $H^{n} = {0, 1}^{n}$ as the possible set of codewords and attempt to exhaust it in a brute-force manner in the asymptotic, i.e., as $n \to \infty$ .

Observe that, within this case, we again divide into two cases:

$0 < A < 1 / 2$ .
$A \geq 1 / 2$ .

The argument for the need of such division is that the binary entropy function

H (\cdot)

is monotonic increasing in domain

0 < A < 1 / 2

and decreasing in domain

A \geq 1 / 2

. In the latter case, we can introduce an alternative Bernoulli process, which results in a larger volume space, and at the same time, it guarantees the Hamming weight constraint.

For the sub-case 1, i.e., where

0 < A < 1 / 2

, we restrict our considerations to an n-dimensional Hamming hyper ball with edge length A. We use a packing arrangement of overlapping hyper balls of radius

r_{0} = ⌊ n β ⌋

in an n-dimensional Hamming hyper ball

B_{0} (n, n A)

.

Lemma 1

(Space exhaustion). Let

R < H (A)

and let

β \in (0, β_{\max})

be an arbitrary positive constant referred to as the distinction property of the casebook.

Then, for sufficiently large codeword length n, there exists a codebook

C = {c_{i}}_{i \in [[M]]} \subseteq H^{n}

, with

c_{i} = (c_{i, t}) |_{t = 1}^{n} \subseteq H^{n}

, which consists of M sequences in the n-dimensional Hamming hyper ball

B_{0} (n, n A)

, such that the subsequent holds:

◊: Hamming distance property: $d_{H} (c_{i}, c_{j}) \geq ⌊ n β ⌋ + 1 \forall i, j \in [[M]]$ , where $i \neq j$ .
◊: Codebook size: the codebook size is at least $M \geq 2^{n (R - H (β))}$ .

Proof.

Recall that the minimum Hamming distance of a code

𝒞

is given by

\begin{matrix} d_{\min} ≜ min_{(i, j) \in [[M]] \times [[M]]} d_{H} (c_{i}, c_{j}) . \end{matrix}

(28)

We begin to obtain some codewords that fulfill the Hamming weight constraint, namely,

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} c_{t} \leq A . \end{matrix}

(29)

First, we generate a codeword

C \overset{i . i . d}{\sim} Bern (A)

(such a random generation should not be confused with a similar procedure as is accomplished in the encoding stage of the RI problem. While therein, each message is mapped to a codeword through a random distribution, here for the DI problem, we first solely restrict ourselves to generation of codewords through the Bernoulli distribution to guarantee the Hamming weight constraint, and employ them in the next procedure called the greedy construction up to an exhaustion. Then, after the exhaustion, we establish a deterministic mapping between the message set and the codebook; that is, each message is associated with a codeword. Further, in the RI problem, it is in general possible that two different messages are mapped to a common codeword; however, considering the DKI problem in here, there exists a one-to-one mapping between the set of messages and the set of codewords). Since

E [C_{t}] = A

, by the weak law of large numbers, we obtain

\begin{matrix} lim_{n \to \infty} Pr (| \frac{1}{n} \sum_{t = 1}^{n} C_{t} - A | \leq τ) = 1, \end{matrix}

(30)

where

τ > 0

is an arbitrary small positive. Therefore, for sufficiently large codeword length n, the event

| n^{- 1} \sum_{t = 1}^{n} C_{t} - A | \leq τ

occurs with probability 1, which implies that, for sufficiently large n, the subsequent event happens with probability one:

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} C_{t} \leq A + τ . \end{matrix}

(31)

Now, observe that since (31) holds for arbitrary values of

τ

, it implies that the subsequent condition for sufficiently large n, is fulfilled

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} C_{t} \leq A, \end{matrix}

(32)

which is the Hamming weight constraint, as required.

Next, we begin with the greedy procedure as follows: Let us denote the first codeword determined by the Bernoulli distribution by

c_{1}

, and assign it to message with index 1. Then, we remove all the sequences that have a Hamming distance of less or equal than

⌊ n β ⌋

from

c_{1}

. That is, we delete all the codewords that lie inside the Hamming ball with center

c_{1}

and radius

r = ⌊ n β ⌋

. Then, we generate a second codeword by the Bernoulli distribution, and repeat this procedure until all the sequences belonging to the feasible subspace, i.e., the Hamming hyper ball,

B_{0} (n, n A)

, are exhausted. Therefore, such a construction fulfills the property provided in Lemma 1 regarding the minimum Hamming distance of the code, i.e.,

\begin{matrix} d_{H} (c_{i}, c_{j}) \geq ⌊ n β ⌋ + 1 . \end{matrix}

(33)

In general, the volume of a Hamming ball of radius r, assuming that the alphabet size is q, is the number of codewords that it encompasses, and is given by ([64] see Ch. 1)

\begin{matrix} Vol (B_{x} (n, r)) = \sum_{i = 0}^{r} (\binom{n}{i}) {(q - 1)}^{i} . \end{matrix}

(34)

Let

B

denote the obtained ball covering after the exhaustion of the entire Hamming hyper ball

B_{0}

, i.e., an arrangement of M overlapping small hyper balls

B_{c_{i}} (n, r_{0})

, with radius

r_{0} = ⌊ n β ⌋

where

i \in [[M]]

, that cover the entire Hamming hyper ball,

B_{0} (n, n A)

, where their centers are coordinated inside the

B_{0} (n, n A)

, and the distance between the closest centers is

⌊ n β ⌋ + 1

; see Figure 6. As opposed to the standard ball packing observed in coding techniques [65], the balls here are neither necessarily entirely contained within the Hamming hyper ball, nor disjoint. That is, we only require that the centers of the balls are inside

B_{0} (n, n A)

and have a non-empty intersection with

B_{0} (n, n A)

, which is rather a ball covering problem.

Th ball covering

B

is called exhausted if no point within the input set,

B_{0} (n, n A)

, remains as an isolated point; that is, with the property that it does not belong to at least one of the small Hamming hyper balls. In particular, we use a covering argument that has a similar flavor as that observed in the GV bound ([66] Th. 5.1.7). Specifically, consider an exhausted packing arrangement of

\begin{matrix} ⋃_{i = 1}^{M (n, R)} B_{c_{i}} (n, ⌊ n β ⌋), \end{matrix}

(35)

balls with radius

r_{0} = ⌊ n β ⌋

embedded within the space

B_{0} (n, n A)

. According to the greedy construction, the center

c_{i}

of each small Hamming hyper ball, corresponds to a codeword. Since the volume of each hyper ball is equal to

Vol (B_{c_{i}} (n, r_{0}))

, the centers of all balls lie inside the space

B_{0} (n, n A)

, and the Hamming hyper balls overlap with each other, the total number of balls is bounded from below by

\begin{matrix} M \geq \frac{Vol (⋃_{i = 1}^{M} B_{c_{i}} (n, r_{0}))}{Vol (B_{c_{1}} (n, r_{0}))} \overset{(a)}{\geq} \frac{Vol (B_{0} (n, n A))}{Vol (B_{c_{1}} (n, r_{0}))} \overset{(b)}{\geq} \frac{\sum_{j = 0}^{⌊ n A ⌋} (\binom{n}{j})}{Vol (B_{c_{1}} (n, r_{0}))}, \end{matrix}

(36)

where

(a)

holds since the Hamming hyper balls may have in general intersection, and

(b)

follows by (34) with setting

q = 2

, since

⌊ n A ⌋ \leq n A

. Now, the bound in (36) can be further simplified as follows:

\begin{matrix} log M \geq log (\sum_{j = 0}^{⌊ n A ⌋} (\binom{n}{j}) / Vol (B_{c_{1}} (n, r_{0}))) \overset{(a)}{\geq} n H (A) + o (log n) - n H (β), \end{matrix}

(37)

where

(a)

exploits Lemma (A66) for setting radius

r = ⌊ n ε ⌋ = ⌊ n A ⌋

and

q = 2

, and (A76) with

r_{0} = ⌊ n ε ⌋ = ⌊ n β ⌋

. Now, we obtain

\begin{matrix} log M & \geq n H (A) + o (log n) - n H (β), \end{matrix}

(38)

where the dominant term has an order of n. Therefore, in order to obtain finite value for the lower bound on the DKI coding rate, R, (38) induces the scaling law of codebook size, M, to be

2^{n R}

. Hence, we obtain

\begin{matrix} R \geq \frac{1}{n} [n H (A) + o (log n) - n H (β)] = H (A) + \frac{o (log n)}{n} - H (β), \end{matrix}

(39)

which tends to

H (A) - H (β)

as

n \to \infty

.

Now, we proceed to the sub-case 2, i.e., where

A \geq 1 / 2

. In this case, instead of sticking to generation of codewords

\sim Bern (A)

, we generate the codewords according to Bernoulli process with success probability of

1 / 2

; that is,

C \overset{i . i . d}{\sim} Bern (1 / 2)

. Observe that the required Hamming weight constraint given in (29) is now met, since for

E [C_{t}] = 1 / 2

, we have

\begin{matrix} \frac{1}{n} \sum_{t = 1}^{n} c_{t} \leq 1 / 2 \leq A . \end{matrix}

(40)

Therefore, subsequent similar line of arguments as provided for the sub-case 1, we obtain the subsequent lower bound on the DKI coding rate, R,

\begin{matrix} R \geq \frac{1}{n} [n H (1 / 2) + o (log n) - n H (β)] = H (1 / 2) + \frac{o (log n)}{n} - H (β), \end{matrix}

(41)

which tends to

H (1 / 2) = 1

as

n \to \infty

. □

Lemma 2

(see [33], Claim 1). Let

R < 1

, and let

β \in (0, β_{\max})

be an arbitrary positive constant referred to as the distinction property of the casebook. Then, the entire Hamming cube

H^{n}

can be exhausted for the codebook in the asymptotic codeword length n, i.e., where

n \to \infty

. That is, for a sufficiently large n, we obtain

C = {c_{i}}_{i \in [[M]]} = H^{n}

, with

c_{i} = (c_{i, t}) |_{t = 1}^{n} \subseteq H^{n}

, which consists of M sequences in the n-dimensional Hamming hyper ball

B_{0} (n, n A)

, such that the subsequent holds:

◊: Hamming distance property: For every $i, j \in [[M]]$ , where $i \neq j$ , we have

$\begin{matrix} d_{H} (c_{i}, c_{j}) \geq ⌊ n β ⌋ + 1 . \end{matrix}$

(42)
◊: Codebook size: The codebook size is at least $M \geq 2^{n (R - H (β))}$ .

Proof.

Recall that the minimum Hamming distance of a code

𝒞

is given by

\begin{matrix} d_{\min} ≜ min_{(i, j) \in [[M]] \times [[M]]} d_{H} (c_{i}, c_{j}) . \end{matrix}

(43)

Next, we begin with the greedy procedure as follows: Let us denote the first codeword determined by the Bernoulli distribution by

c_{1}

, and assign it to message with index 1. Then, we remove all the sequences that have a Hamming distance of less or equal than

⌊ n β ⌋

from

c_{1}

. That is, we delete all the codewords that lie inside the Hamming ball with center

c_{1}

and radius

r = ⌊ n β ⌋

. Then, we generate a second codeword by the Bernoulli distribution and repeat this procedure until all the sequences are exhausted.

Let

B

denotes the obtained ball covering after the exhaustion of the entire input set

H^{n}

, i.e., an arrangement of M overlapping small hyper balls

B_{c_{i}} (n, r_{0})

, with radius

r_{0} = ⌊ n β ⌋

, where

i \in [[M]]

, which covers n-dimensional Hamming cube

H^{n}

, where their centers are coordinated inside

H^{n}

, and the distance between the closest centers is

⌊ n β ⌋ + 1

. As opposed to the standard ball packing observed in coding techniques [65], the balls here are neither necessarily entirely contained within the Hamming hyper ball, nor disjointed. That is, we only require that the centers of the balls are inside

H^{n}

, and have a non-empty intersection with

H^{n}

, which is rather a ball covering problem.

The ball covering

B

is called exhausted if no point within the input set;

H^{n}

, remains as an isolated point; that is, with the property that it does not belong to at least one of the small Hamming hyper balls. In particular, we use a covering argument that has a similar flavor as that observed in the GV bound ([66] Th. 5.1.7). Specifically, consider an exhausted packing arrangement of

\begin{matrix} ⋃_{i = 1}^{M (n, R)} B_{c_{i}} (n, ⌊ n β ⌋), \end{matrix}

(44)

balls with radius

r_{0} = ⌊ n β ⌋

embedded within the space

H^{n}

. According to the greedy construction, the center

c_{i}

of each small Hamming hyper ball corresponds to a codeword. Since the volume of each hyper ball is equal to

Vol (B_{c_{i}} (n, r_{0}))

, the centers of all balls lie inside the space

H^{n}

, and the Hamming hyper balls overlap with each other, the total number of balls is bounded from below by

\begin{matrix} M \geq \frac{Vol (⋃_{i = 1}^{M} B_{c_{i}} (n, r_{0}))}{Vol (B_{c_{1}} (n, r_{0}))} \overset{(a)}{\geq} \frac{Vol (H^{n})}{Vol (B_{c_{1}} (n, r_{0}))} \overset{(b)}{\geq} \frac{{| X |}^{n}}{Vol (B_{c_{1}} (n, r_{0}))}, \end{matrix}

(45)

where

(a)

holds since the Hamming hyper balls may have, in general, an intersection, and

(b)

follows, since

Vol (H^{n}) = |X^{n}| = {| X |}^{n}

. Now, the bound in (45) can be further simplified as follows

\begin{matrix} log M \geq log (\frac{{| X |}^{n}}{Vol (B_{c_{1}} (n, r_{0}))}) \overset{(a)}{\geq} n log | X | + o (log n) - n H (β) \overset{(b)}{\geq} n + o (log n) - n H (β), \end{matrix}

(46)

where

(a)

exploits Lemma (A76) with

ε = β

. Now, for

β \in (0, β_{\max})

being an arbitrary small positive constant, we obtain

\begin{matrix} log M \geq n + o (log n) - n H (β) = n (1 - H (β)) + o (log n), \end{matrix}

(47)

where the dominant term has an order of n. Therefore, in order to obtain finite value for the lower bound on the DKI coding rate, R, (38) induces the scaling law of codebook size, M, to be

2^{n R}

. Hence, we obtain

\begin{matrix} R \geq \frac{1}{n} [n (1 - H (β)) + o (log n)] = 1 - H (β) + \frac{o (log n)}{n}, \end{matrix}

(48)

which tends to

1 - H (β)

as

n \to \infty

. □

Given a message

i \in [[M]]

, transmit

x = c_{i}

.

Let us define

δ_{β} \neq 1 / 2

as follows:

\begin{matrix} δ_{β} = (1 - β / 2) ε + β / 4, \end{matrix}

(49)

which is referred to as the decoding threshold where

β \in (0, β_{\max})

is an arbitrary constant. Observe that given

0 < ε < 1 / 2

and (49), we obtain the subsequent bounds on the

δ_{β} :

\begin{matrix} ε < δ_{β} < (1 - β) ε + β / 2 . \end{matrix}

(50)

In order to recognize/identify whether message

j \in [[M]]

has been sent, the decoder at the receiver verifies whether or not the output of the channel y is included in the decoding set

J_{K} = ⋃_{j \in K} T_{j}

, with

\begin{matrix} T_{j} = \{y \in H^{n}; T (y, c_{j}) \leq ⌊ n δ_{β} ⌋\}, \end{matrix}

(51)

where

\begin{matrix} T (y, c_{j}) = d_{H} (y, c_{j}) ≜ \sum_{t = 1}^{n} δ_{β} (y_{t}, c_{j, t}), \end{matrix}

(52)

is known as the decoding metric assessed for the individual codeword

c_{j}

and the observation vector , with the Kronecker delta being

δ_{β} (\cdot, \cdot)

. In other words, given the channel output vector

y \in H^{n}

, the decoder indicates that the message j was sent if there is at least one

j \in K

, such that

d_{H} (y, c_{j}) \leq ⌊ n δ_{β} ⌋

. In the alternative scenario, wherein the inequality

d_{H} (y, c_{j}) > ⌊ n δ_{β} ⌋

applies for every index

j \in K

, the decoder determines that j was not sent.

Remark 2.

Adopted decoder For the achievability proof, we use a decoder that, given an output sequence y, states that if the output vector y is in the subsequent set, then the message

j \in K

was sent

\begin{matrix} ⋃_{j \in K} \{y \in H^{n}; d_{H} (y, c_{j}) \leq ⌊ n δ_{β} ⌋\}, \end{matrix}

(53)

where

δ_{β}

is a decoding threshold and

c_{j} = [c_{j, 1}, \dots, c_{j, n}]

is the codeword linked to message j. We notice that the decoder in (53) combines the elements of set

K

through a fundamental union operator. Such a simple operator may feature a penalty with respect to the error exponents for the sort I/II error probabilities or the obtained attainable rates. Therefore, we recall that in principle a more optimum decoder for the K-Identification scheme, which guarantees vanishing sort I/II error probabilities, might demand a more complicated algebraic operators between the realization of members for each specific set

K

, and entails advanced dependencies on the elements of set

K

.

In the subsequent, we examine the error probabilities of sort I and sort II. In particular, the sort I error analysis is less involved and exploiting known bounds related to the upper tail of the Binomial CDF we guarantee its vanishing. The sort II error analysis is more complicated, where we combines techniques from JáJá [33] and certain Hamming distance property for the binary alphabet. In addition, we exploit some bound on the Binomial CDF. Moreover, the error exponents yield the feasible range for the goal identification rate

κ

. Before we start the analysis, we introduce the subsequent parameter definitions and conventions: Fix

e_{1}, e_{2} > 0

and let

ζ_{0}, ζ_{1} > 0

be arbitrarily small constants. Further, let introduce the subsequent conventions:

$Y_{t} (i)$ is output of channel at time tconditioned that $x = c_{i}$ , i.e., $Y_{t} (i) = c_{i, t} \oplus Z_{t}$ .
The vector of symbols is $Y (i) ≜ (Y_{1} (i), \dots, Y_{n} (i))$ .

Sort I errors: This error event occur when the transmitter sends

c_{i}

, yet

y \notin J_{K}

for every

i \in K

. More specifically, the sort I error probability is given by

\begin{matrix} P_{e, 1} (i, K) = Pr (Y (i) \in J_{K}^{c}) = Pr (Y (i) \in {(⋃_{j \in K} T_{j})}^{c}) . \end{matrix}

(54)

In order to show that the probability term provided in (54) tends to zero for asymptotic codeword lengths, we show that this term is upper bounded by certain upper tail of the Binomial CDF. Next, employing existing bounds for this tail given in Appendix G, we establish an upper bound on such an upper tail which vanishes in the asymptotic. The extensive analysis for the sort I errors is provided in Appendix A.

Sort II errors: The sort II error event happens when

Y (i) \in T_{K}

while the transmitter sent

c_{i}

with

i \notin K

. Then, for each possible

(\binom{M}{K})

case of

K

, where

i \notin K

, the sort II error probability is given by

\begin{matrix} P_{e, 2} (i, K) = Pr (Y (i) \in T_{K}) = Pr (Y (i) \in ⋃_{j \in K} T_{j}) . \end{matrix}

(55)

To show that the probability term provided in (55) vanishes for asymptotic regime, we break this term into two new terms and address them separately. One of the terms is shown to vanish by exploiting the proof derived in the sort I error analysis. For the other term, using standard techniques we show that it corresponds to certain Binomial CDF. Then, employing some existing bounds on such Binomial CDF given in Appendix H, we assert an upper bound for it which tends to zero in the asymptotic. The detailed analysis for the sort II errors is provided in Appendix B.

Observe that considering the established lower bound on the DKI coding rate R and the established upper bound on the goal identification rate

κ

, as provided in (41) and (48) and (A60), means that we have shown for every

e_{1}, e_{2} > 0

and sufficiently large n, there exists an

(n, M (n, R), K (n, κ), e_{1}, e_{2})

-BSC-DKI code, such that the set

R_{DKI} (B, M, K)

of all attainable rate pairs

(R, κ)

contains

\begin{matrix} R_{DKI} (B, M, K) \supseteq R^{inn} (B) ≜ ⋃_{β \in (0, β_{\max})} R_{β}^{inn} (B), \end{matrix}

(56)

with

\begin{matrix} R_{β}^{inn} ≜ 〈 \begin{matrix} {(R, κ); 0 \leq R \leq H (A) - H (β), 0 \leq κ < min (κ_{UB}^{1}, κ_{UB}^{2})} & if A < 1 / 2, \\ {(R, κ); 0 \leq R \leq 1 - H (β), 0 \leq κ < min (κ_{UB}^{1}, κ_{UB}^{2})} & if A \geq 1 / 2, \end{matrix} \end{matrix}

(57)

where

κ_{UB}^{1}

and

κ_{UB}^{2}

are provided in (A58) and (A59), respectively.

Remark 3.

Methodology for establishing the feasible region of β Observe that, since the parameter β adjusts the radius of the hyper spheres used in the codebook construction, a trivial restriction on it would be as follows:

β \geq 0

. Next, employing the Hamming distance property of Lemma 1 and Lemma 2, β can not be greater or equal than 1; therefore, we conclude that

0 \leq β < 1

. Now, we exclude the boundary points

β = 0

, since it makes the upper bounds on the κ equal to zero (

κ < 0

), which is a contradiction since

κ \geq 0

. Next, we focus on the arguments of

T_{ε} (\cdot)

and

H (\cdot)

given in (A58) and (A59); see Figure 7. First, observe that the function

f_{2} (ε, β)

(cf. (17)) has no zero, and is monotonically increasing for

0 < β < 1

. Second, note that the function

f_{1} (ε, β)

(cf. (17)) is decreasing for

0 < β < 1

with a zero at

β_{\max} = (4 ε) / (2 ε + 1)

; therefore, the subsequent feasible interval for β is yielded:

0 < β < β_{\max} = (4 ε) / (2 ε + 1) .

Observe that the function

β_{\max} = (4 ε) / (2 ε + 1)

is continuous and monotonically increasing for domain

ε \in (0, 1 / 2)

. That is,

β_{\max}

tends to zero for asymptotic small β and tends to one for

β \to β_{\max}

arbitrary.

Remark 4.

Trade-off between goal identification rate and attainable DKI/RKI rate Our results in the achievability proof unveil a common behavior between the DKI and RKI problems; namely, for a given codeword length, there is a trade-off between the size of the goal message set and DKI/RKI codebook size. Specifically, considering the RKI problem for a DMC with zero sort I error probability (cf. (A65)), or obtained inner bound on the set of all attainable rate pairs

(R, κ)

for a DMC (cf. (4)), we deduce that if one allows for larger goal identification coding rate κ, subsequently a penalty on the upper bound for the attainable RKI rate, R, is incurred, and this upper bound would be decreased. A similar observation for the DKI problem as considered in this paper is found, namely, the same trade-off between attainable DKI coding rate R and goal identification rate κ exist. In particular, the calculated upper bounds provided in (16) on R and κ suggest that for asymptotic small

β \to 0

, while the upper bound on κ tends to zero (

f_{z} (ε, β) \to ε

for

z \in {1, 2}

), the upper bound on R is increased. On the other hand, in one allows that

β \to β_{\max}

arbitrary, then upper bounds on κ and R are increased and decreased, respectively.

Remark 5.

In the analysis for the sort II error probability, an upper bound is found which vanishes exponentially in the codeword length n, (cf. (A51)). This observation reveals that the fastest scales for the size of the goal message set

K (n, κ)

, which guarantees the vanishing of the sort II error probability, as

n \to \infty

is permitted to be defined as follows:

K (n, κ) = 2^{n κ}

. In other words, the upper bound on the sort II error probability is capable of being exploited for having a set of goal messages with exponential size.

4.3. Upper Bound (Converse Proof)

Before we start the converse proof, some comprehensive steps are explained: We show that the feasible input set (subset of the input sequences that fulfills the Hamming constraint) can be entirely exhausted for selection of the codewords. To this end, we establish an one-to-one mapping between the message and input sets. Hence, the number of messages

2^{n R}

is bounded by the size of the feasible input set. More specifically, depending on whether or not an effective Hamming weight constraint is imposed on the input of the channel, we divide it into two cases and address them separately. In particular, the converse proof for each case consists of the subsequent two main technical steps.

◊: Step 1: we show in Lemma 3 that for any attainable DKI rate whose error probabilities of sort I and sort II tends to zero as $n \to \infty$ , any pair of distinct messages are associated with different codewords.
◊: Step 2: exploiting Lemma 3, we acquire an upper bound for the DKI codebook size of a the BSC.

We begin with the below lemma on a DKI codebook size.

Lemma 3

(DKI codebook size). Consider a sequence of

(n, M (n, R), K (n, κ), e_{1}^{(n)}, e_{2}^{(n)})

-BSC-DKI codes

(C^{(n)}, J^{(n)})

, such that

e_{1}^{(n)}

and

e_{2}^{(n)}

tend to zero as

n \to \infty

. Then, given a sufficiently large n, the codebook

C^{(n)}

satisfies the subsequent property: two different messages

i_{1}, i_{2} \in [[M]]

cannot have the same codeword representing them; that is,

\begin{matrix} i_{1} \neq i_{2} \Rightarrow c_{i_{1}} \neq c_{i_{2}} . \end{matrix}

(58)

Proof.

Contrarily, suppose that there are two messages

i_{1}

and

i_{2}

, such that

i_{1} \neq i_{2}

, and

\begin{matrix} c_{i_{1}} = c_{i_{1}} = x^{n}, \end{matrix}

(59)

for some

x^{n} \in X^{n}

. Since

(C^{(n)}, J^{(n)})

forms a

(n, M (n, R), K (n, κ), e_{1}^{(n)}, e_{2}^{(n)})

-BSC-DKI code, as stated in Definition 1, it implies that for every possible choice (arrangement) of the goal message set

K \subseteq [[M]]

of size K, the upper bound on the sort I and sort II error probabilities, i.e.,

e_{1}^{(n)}

and

e_{2}^{(n)}

, respectively, tends to zero as n tends to infinity.

Remark 6.

Decoder in converse proof While we imposed a concrete structure on the decoding set

J_{K}

, in the achievability proof provided in Section 4.2, i.e., we set

J_{K} = ⋃_{i_{1} \in K} T_{i_{1}}

, the converse proof treats the decoding set

J_{K}

as a generic function.

Next, we review the definition of a BSC DKI code found in (1), and concentrate on the underlying presumptions about the characteristics of a particular series of BSC DKI codes

(C^{(n)}, J^{(n)})

found in Lemma 3. The subsequent property is endowed by such a code sequence with five parameters,

(n, M (n, R), K (n, κ), e_{1}^{(n)}, e_{2}^{(n)})

. For any overall/generic selection of the goal message, set

K \subseteq [[M]]

of size K, as n approaches to infinity, the upper bound on the sort I and sort II error probabilities, or

e_{1}^{(n)}

and

e_{2}^{(n)}

, respectively, tends to zero. That is,

\begin{matrix} lim_{n \to \infty} [P_{e, 1} (i_{1}, K) + P_{e, 2} (i_{2}, K)] = 0, \forall K \subseteq [[M]] . \end{matrix}

(60)

Next, we will represent a particular class of the goal message sets by

K (i_{1}, i_{2})

, where

i_{1} \in K

and

i_{2} \notin K

, i.e.,

\begin{matrix} K (i_{1}, i_{2}) ≜ \{K \subseteq [[M]]; | K | = K; i_{1} \in K, i_{2} \notin K\} . \end{matrix}

(61)

Observe that

| K (i_{1}, i_{2}) | \geq 1

; that is, there exists at least one arrangement

K^{'}

belonging to

K (i_{1}, i_{2})

, where

i_{1} \in K, i_{2} \notin K

. This is valid as the two messages

i_{1}

and

i_{2}

are different, i.e.,

i_{1} \neq i_{2}

, in accordance with Lemma 3. The sort I and sort II error probability, so have the subsequent upper bounds:

\begin{matrix} P_{e, 1} (i_{1}, K) = W^{n} {(J_{K}^{c} | x^{n} = c_{i_{1}})}_{i_{1} \in K} & \leq e_{1}^{(n)}, \\ P_{e, 2} (i_{2}, K) = W^{n} {(J_{K} | x^{n} = c_{i_{2}})}_{i_{2} \notin K} & \leq e_{2}^{(n)}, \end{matrix}

(62)

where

J_{K} \subseteq H^{n}

is the decoding set considered for the set of goal messages

K

. This leads to a contradiction, since

\begin{matrix} 1 & = W^{n} (J_{K}^{c} | x^{n}) + W^{n} (J_{K} | x^{n}) \\ = P_{e, 1} (i_{1}, K) + P_{e, 2} (i_{2}, K) \\ \leq e_{1}^{(n)} + e_{2}^{(n)}, \end{matrix}

(63)

where the last inequality exploits the definition of sort I/II error probabilities given in (8) and (9). Therefore,

e_{1}^{(n)} + e_{2}^{(n)} \geq 1

, which is a contradiction to (60).

Put differently, Lemma 3 asserts that every given sequence of BSC DKI codes

(C^{(n)}, T^{(n)})

has the below property: The upper limits on the sort I and sort II error probabilities disappear for an arbitrary (generic) choice of

K

of size

K (n, κ)

, meaning that

e_{1}^{(n)}

and

e_{2}^{(n)}

tend to zero as

n \to \infty

. Nevertheless, we demonstrate that there are specific options for

K

, shown by

K (i_{1}, i_{2})

, whose elements does not satisfy this property, namely,

e_{1}^{(n)}

and

e_{2}^{(n)}

do not disappear since the sum of the corresponding upper limits on the sort I and sort II errors is lower bounded by one. This observation is obviously contradictory, as the inequality presented in (59) does not hold. Hence, distinct messages

i_{1}

and

i_{2}

cannot share the same codeword, and there exist an one-to-one mapping between the message set

ℳ

and the codebook

𝒞

. This concludes the proof of Lemma 3. □

Lemma 3 states that every message has a distinct/unique codeword. As a result, the number of input sequences that meet the input restriction/constraint serves as the maximum number of messages. We divide in two cases, namely, where

0 < A < 1 / 2

and

1 / 2 \leq A < 1

. For the first case, we obtain the subsequent upper bound on the size of the DKI codebook:

\begin{matrix} 2^{n R} \leq |B_{0} (n, n A)| = | \{x \in H^{n} : 0 \leq \sum_{t = 1}^{n} x_{t} \leq n A\} | \overset{(a)}{\leq} 2^{n H (A)}, \end{matrix}

(64)

where

(a)

exploits the upper bound on the volume of the Hamming ball provided in Lemma A2 for

0 < A < 1 / 2

. Thereby, (64) implies

\begin{matrix} R \leq H (A) . \end{matrix}

(65)

On the other hand, for a given sequence of DKI code in the converse, the size of the goal message set

K

is always upper bounded by the size of the message set

ℳ

; that is,

2^{n κ} \leq 2^{n R}

gives

κ \leq R

. Therefore, exploiting (65), we obtain

\begin{matrix} κ \leq H (A) . \end{matrix}

(66)

Now, we proceed to calculate the upper bound on the size of the DKI codebook, where

1 / 2 \leq A < 1

. We argue that this case is equivalent to having a Hamming weight constraint of the form

A^{*} = 1 / 2

. That is, the codewords with constraint

\sum_{t = 1}^{n} x_{t} \leq n A^{*}

, where

A^{*} = 1 / 2

fulfilled the same constraint with

1 / 2 \leq A < 1

. The new Bernoulli input process has

1 / 2

success probability, i.e.,

X \sim Bern (1 / 2)

. Therefore, again employing Lemma A2 for the critical point

ε = 1 / 2

, we obtain

\begin{matrix} 2^{n R} \leq |B_{0} (n, n A^{*})| = | \{x \in H^{n} : 0 \leq \sum_{t = 1}^{n} x_{t} \leq n A^{*}\} | \leq 2^{n H (A^{*} = 1 / 2)}, \end{matrix}

(67)

which implies

\begin{matrix} R \leq H (A^{*} = 1 / 2) = 1 . \end{matrix}

(68)

In this instance, the size of the complete input set, i.e.,

{| X |}^{n}

, that is, the number of input sequences, is a maximum amount on the number of messages. Therefore, we can establish the subsequent upper bound on the size of the DKI codebook

2^{n R} \leq {| X |}^{n}

which, for

| X | = 2

, implies

\begin{matrix} R \leq \frac{1}{n} log {| X |}^{n} = 1 . \end{matrix}

(69)

Next, similar to the provided arguments for deriving (66), we obtain

\begin{matrix} κ \leq 1 . \end{matrix}

(70)

Observe that the established upper bound on the DKI coding rate R as provided in (65), (68) and (69) and implies that the set

R_{DKI} (B, M, K)

of all attainable rate pairs

(R, κ)

is contained as follows:

\begin{matrix} R_{DKI} (B, M, K) \subseteq R^{out} (B), \end{matrix}

(71)

where

\begin{matrix} R^{out} (B) ≜ 〈 \begin{matrix} {(R, κ); 0 \leq R \leq H (A), 0 \leq κ \leq H (A)} & if A < 1 / 2, \\ {(R, κ); 0 \leq R \leq 1, 0 \leq κ \leq 1} & if A \geq 1 / 2, \end{matrix} \end{matrix}

(72)

where

κ_{UB}^{1}

and

κ_{UB}^{2}

are provided in (A58) and (A59), respectively.

Thus, exploiting the fact that DKI capacity region is the closure of the set

R_{DKI}^{β} (B, M, K)

of all attainable rate pairs

(R, κ)

is contained as follows:

\begin{matrix} C_{DKI} (B, M, K) \subseteq R^{out} (B) . \end{matrix}

(73)

Thereby, the relations provided in (56) and (71) complete the proof of Theorem 1.

5. Future Directions and Summary

In this work, the deterministic K-identification problem for IoT systems was studied. The results obtained in this paper can serve as a model for tasks that are based on an event recognition within the context of future IoT applications. Specifically, we consider IoT systems that can be modeled by the binary symmetric channel. For this setup, we established inner and outer bounds on the DKI capacity region with/without the Hamming weight constraint for a codebook size of

M (n, R) = 2^{n R}

. Our results in this work regarding the DKI capacity for the BSC model unveiled that the conventional exponential scale of

2^{n R}

considered for the DI [32] and TR problems [17], is the appropriate scale for the codebook size of the DKI problem of the BSC with/without Hamming weight constraint. This observation is was proved by finding a suitable ball covering for an n-dimensional Hamming hyper ball or the entire input set in the same line of arguments as that for the basic Gilbert bound method. In particular, in the presence of a Hamming weight constraint A, we pack hyper balls with radius

⌊ n β ⌋

, for some

β \in (0, 1)

inside a larger Hamming hyper ball, which results in ∼

2^{n H (A)}

codewords. We remind you that the scale of the codebook for DKI over the BSC is lower than that for the DKI over slow fading channels [51] or the DI over Poisson channel with and without ISI [48,52]. Moreover, we find out that the BSC features an exponentially large set of the goal messages set, in the codeword length, n, i.e.,

2^{n κ}

; and characterize the entire feasible range on the goal identification rate

κ

as a function of the channels statistic

ε

and the Hamming constraint (for

0 < A < 1 / 2

).

For the converse part, a similar approach as our previous work for DI over the DMC [32] is followed. That is, for the case where a non-trivial Hamming weight constraint is present (

0 < A < 1

), we establish an one-to-one mapping between the message set and the feasible set induced by the Hamming weight constraint. In particular, we exploit the method of proof by the contradiction. Namely, we first assume that two generic different messages

i_{1}

and

i_{2}

share the common codewords, and then show that such an assumption leads to a contradiction regarding the sum of the error probabilities, i.e., we derive that the sum of the sort I and sort II error probabilities converges to one. Hence, the falsehood of the early assumption is guaranteed, and the total number of messages

M = 2^{n R}

is bounded by the size of the feasible input set, i.e.,

M \leq 2^{n H (A)}

. For the case where

A \geq 1

, (absent of a Hamming constraint), a similar line of argument can be applied in order to establish the one-to-one function.

There are numerous ways to expand upon the findings we have showcased in this manuscript. Some of the possible topics for the future research are as follows:

◊: Explicit code construction: In this paper, we mainly address the determination of basic performance constraints for the DKI for the BSC with/without Hamming weight constraint, where an explicit code construction was not investigated. That is, in the achievability proof, we only guarantee the existence of a code without suggesting a systematic method for construction of the code. Therefore, an important direction for research may be explicit construction of K-identification codes for the BSC and development of efficient encoding and low complexity decoding schemes. Furthermore, the efficiency of such concrete designs can be measured versus the information theoretical bounds derived in this paper.
◊: Generalized channel models: We consider in this work one of the simplest and most basic channel model, namely the BSC in the absence of channel state, memory, or feedback. Therefore, our result can be extended to a DMC (with or without memory/feedback), compound, and arbitrary varying channels, which are generalizations of the BSC. In particular, several realistic IoT scenarios modeled by the BSC feature memory to some extent and the effect of memory may not be made negligible in a straightforward manner. Therefore, the application of memoryless channels as conducted in this paper to these realistic instances may in general yields different capacity results. In addition, it may be possible to exploit the memory effect in terms of gaining more optimum inner and outer bounds on the DKI capacity, as well as the specification of the encoding and decoding modules; cf. [61,67,68,69] for detailed studies on the BSC models with memory.
◊: Multi-user and multi-antenna systems: The results in this paper study a point-to-point single user system, and might be extended to advanced scenarios proper for the future communication network settings including multiple-input multiple-output channels or multi-user channels, which are deemed to be more relevant in the complex IoT systems.
◊: Finite codeword length coding: The obtained bounds on the K-identification capacity region studied in this paper determine the performance limits of BSC with/without Hamming weight constraint when the codeword length can grow arbitrarily. However, in practical applications, the codeword length is finite, where there is no way to afford significant encoding/decoding delays. As a result, studying the non-asymptotic DKI capacity of the BSC is an interesting direction for future research.

Author Contributions

Conceptualization, M.J.S. and O.D.; methodology, M.J.S.; validation, C.D. and H.B.; formal analysis, M.J.S. and O.D.; resources, C.D. and H.B.; writing—original draft preparation, M.J.S.; writing—review and editing, M.J.S., O.D., C.D. and H.B.; visualization, M.J.S. and O.D.; supervision, H.B.; project administration, H.B.; funding acquisition, C.D. and H.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the German Federal Ministry of Education and Research (BMBF) within the 6G-life project grant number 16KISK002 (M.J.S.), the German Research Foundation (DFG) within the Gottfried Wilhelm Leibniz Prize grant number BO 1734/20-1 (H.B.), the BMBF within the national initiative for “Post-Shannon Communication (NewCom)” with the project “Basics, Simulation and Demonstration For New Communication Models” grant number 16KIS1003K (H.B.), the BMBF within the national initiative for “Post-Shannon Communication (NewCom)” with the project “Coding Theory and Coding Methods For New Communication Models” grant number 16KIS1005 (C.D.), the DFG within Germany’s Excellence Strategy grant number EXC-2111—390814868 and EXC-2092 CASA—390781972 (H.B.), the BMBF grant number 16KIS1005 (C.D.) and the DFG Project grant number DE1915/2-1 (C.D.), the BMBF in the program of “Souverän. Digital. Vernetzt.”, joint project 6G-life, project identification grant number 16KISK002.

Data Availability Statement

The data presented in this study are available in this article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The subsequent abbreviations are used in this manuscript:

IoT	Internet of Things
IoMT	Internet of Medical Things
pH	Potential Hydrogen
IoBNT	Internet of Bio-Nano Things
MC	Molecular Communications
6G	Sixth-Generation
PSC	Post-Shannon Communications
XG	Future-Generation
BSC	Binary Symmetric Channel
TR	Shannon’s Message Transmission
Bern	Bernoulli
DI	Deterministic Identification
DMC	Discrete Memoryless Channel
RV	Random Variable
Vol	Volume
DKI	Deterministic K-Identification
CDF	Cumulative Distribution Function
RI	Randomized Identification
RKI	Randomized K-Identification
DTPC	Memoryless Discrete-Time Poisson Channel
GSF	Gaussian Channel With Slow Fading
GV	Gilbert–Varshamov
ISI	Inter-Symbol Interference
CRF	Channel Reliability Function

Appendix A. Sort I Error Analysis

Consider the sort I error, i.e., the transmitter sends

c_{i}

, yet

y \notin J_{K}

for every

i \in K

. The sort I error probability is given by

\begin{matrix} P_{e, 1} (i, K) & = Pr (Y (i) \in J_{K}^{c}) = Pr (Y (i) \in {(⋃_{j \in K} T_{j})}^{c}) \\ \overset{(a)}{=} Pr (Y (i) \in ⋂_{j \in K} T_{j}^{c}) \overset{(b)}{\leq} Pr (Y (i) \in T_{i}^{c}) = Pr (T (Y (i), c_{i}) > ⌊ n δ_{β} ⌋), \end{matrix}

(A1)

where

(a)

follows by De Morgan’s law, i.e.,

{(⋃_{i \in K} T_{i})}^{c} = ⋂_{i \in K} T_{i}^{c}

and

(b)

holds since

⋂_{j \in K} T_{j}^{c} \subset T_{i}

. Now, observe that

\begin{matrix} Pr (T (Y (i), c_{i}) > ⌊ n δ_{β} ⌋) \overset{(a)}{=} Pr (d_{H} (Y (i), c_{i}) > ⌊ n δ_{β} ⌋) \overset{(b)}{=} \sum_{l = ⌊ n δ_{β} ⌋ + 1}^{n} (\binom{n}{l}) ε^{l} {(1 - ε)}^{n - l}, \end{matrix}

(A2)

where

(a)

follows by (52) and

(b)

holds by (12). In order to bound (A2), we proceed to apply the bound provided in (A86) given in Lemma A4: Observe that

\begin{matrix} \frac{l}{n} = \frac{⌊ n δ_{β} ⌋ + 1}{n} \overset{(a)}{>} \frac{n δ_{β}}{n} = δ_{β} \overset{(b)}{>} ε, \end{matrix}

(A3)

where

(a)

follows, since

x < ⌊ x ⌋ + 1

for real x and

(b)

holds by (50). On the other hand,

\begin{matrix} \frac{l}{n} = \frac{⌊ n δ_{β} ⌋ + 1}{n} \leq \frac{max ⌊ n δ_{β} ⌋ + 1}{n} \overset{(a)}{<} \frac{⌊ n max (ε + β (1 / 2 - ε)) ⌋ + 1}{n} \overset{(b)}{<} \frac{⌊ n / 2 ⌋ + 1}{n} \overset{n \geq 3}{<} 1, \end{matrix}

(A4)

where

(a)

follows by (50) and

(b)

holds since

ε + β (1 / 2 - ε)

is upper bounded by the boundary value of

ε

, i.e., where

ε = 1 / 2

. Observe that the last inequality in (A4) holds for sufficiently large n. Now, since the inequalities provided in (A3) and (A4) fulfill the conditions in Lemma A4, we employ Lemma A4 to establish the following lower bound on (A2) as follows

\begin{matrix} Pr (T (Y (i), c_{i}) > ⌊ n δ_{β} ⌋) \\ = \sum_{l = ⌊ n δ_{β} ⌋ + 1}^{n} (\binom{n}{l}) ε^{l} {(1 - ε)}^{n - l} \\ \leq [\frac{(⌊ n δ_{β} ⌋ + 1) (1 - ε)}{(⌊ n δ_{β} ⌋ + 1) (1 - ε) - [n - (⌊ n δ_{β} ⌋ + 1)] ε}] \cdot 2^{- n [T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n})]} . \end{matrix}

(A5)

Observe that the denominator in (A5) is always a strict positive term, since assuming we arrive to a trivial inequality as follows

\begin{matrix} (⌊ n δ_{β} ⌋ + 1) (1 - ε) & > [n - (⌊ n δ_{β} ⌋ + 1)] ε ⟺ \end{matrix}

(A6)

\begin{matrix} ⌊ n δ_{β} ⌋ + 1 - ε ⌊ n δ_{β} ⌋ - ε & > n ε - ε ⌊ n δ_{β} ⌋ - ε ⟺ \end{matrix}

(A7)

\begin{matrix} ⌊ n δ_{β} ⌋ + 1 & > n ε ⟺ \end{matrix}

(A8)

\begin{matrix} \frac{⌊ n δ_{β} ⌋ + 1}{n} & > ε, \end{matrix}

(A9)

which is already verified in (A3). Now, we proceed to find a simplified upper bound on the left hand side coefficient in the bracket given in (A5) as follows:

\begin{matrix} \frac{(⌊ n δ_{β} ⌋ + 1) (1 - ε)}{(⌊ n δ_{β} ⌋ + 1) (1 - ε) - [n - (⌊ n δ_{β} ⌋ + 1)] ε} \\ \overset{(a)}{=} \frac{(n δ_{β} + 1) (1 - ε)}{(⌊ n δ_{β} ⌋ + 1) - ε (⌊ n δ_{β} ⌋ + 1) - n ε + ε (⌊ n δ_{β} ⌋ + 1)} \\ \leq \frac{(n δ_{β} + 1) (1 - ε)}{(⌊ n δ_{β} ⌋ + 1) - n ε} \\ \overset{(b)}{\leq} \frac{(n δ_{β} + 1) (1 - ε)}{n δ_{β} - n ε}, \end{matrix}

(A10)

where

(a)

holds by exploiting

x \leq ⌊ x ⌋

for real x and simplifying the denominator by distributing

ε

over the bracket, and

(b)

follows, since

\begin{matrix} n δ_{β} < ⌊ n δ_{β} ⌋ + 1 ⟺ n δ_{β} - n ε < ⌊ n δ_{β} ⌋ + 1 - n ε ⟺ \frac{1}{n δ_{β} - n ε} > \frac{1}{⌊ n δ_{β} ⌋ + 1 - n ε} . \end{matrix}

(A11)

where the first inequality follows since

x < ⌊ x ⌋ + 1

for real x. Thereby, employing (A10) unto (A5), we obtain

\begin{matrix} Pr (|T (Y (i), c_{i})| > ⌊ n δ_{β} ⌋) & = \sum_{l = ⌊ n δ_{β} ⌋ + 1}^{n} (\binom{n}{l}) ε^{l} {(1 - ε)}^{n - l} \\ \leq \frac{(n δ_{β} + 1) (1 - ε)}{n δ_{β} - n ε} \cdot 2^{- n [T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n})]} \\ = \frac{(δ_{β} + \frac{1}{n}) (1 - ε)}{δ_{β} - ε} \cdot 2^{- n [T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n})]} \\ ≜ ζ_{1, n} . \end{matrix}

(A12)

Observe that the exponent of exponential term is always strictly positive, since for

ε \in (0, 1 / 2)

, the arguments of

T_{ε} (\cdot)

and

H (\cdot)

are strictly less than

1 / 2

. That is, we have the following

\begin{matrix} T_{ε} ((⌊ n δ_{β} ⌋ + 1) / n) > H ((⌊ n δ_{β} ⌋ + 1) / n) . \end{matrix}

(A13)

The argument is as follows:

\begin{matrix} \frac{l}{n} = \frac{⌊ n δ_{β} ⌋ + 1}{n} \leq \frac{max ⌊ n δ_{β} ⌋ + 1}{n} & \overset{(a)}{<} \frac{⌊ n max (ε + β (1 / 2 - ε)) ⌋ + 1}{n} \\ \overset{(b)}{<} \frac{⌊ n / 2 ⌋ + 1}{n} \overset{(c)}{\leq} \frac{n / 2 + 1}{n}, \end{matrix}

(A14)

which is strictly less than

1 / 2

in the asymptotic, i.e., as

n \to \infty

, where

(a)

and

(b)

follows by the same arguments given for (A4), and

(c)

follows since

⌊ x ⌋ \leq x

for real x.

Therefore, the difference for the evaluation of

T_{ε} (\cdot)

and

H (\cdot)

for a given fix argument is always a strict positive value; see Figure 7. Hence,

P_{e, 1} (i, K) \leq e_{1}, \forall i \in T_{K}

holds for sufficiently large n and arbitrarily small

e_{1} > 0

. Thereby, the sort I error probability satisfies

P_{e, 1} (i, K) \leq ζ_{1, n} \leq e_{1}

. This complete the analysis for the sort I error probability.

Appendix B. Sort II Error Analysis

In the following, we address sort II errors, i.e., when

Y (i) \in T_{K}

while the transmitter sent

c_{i}

with

i \notin K

. Then, for each possible

(\binom{M}{K})

cases of

K

, where

i \notin K

, the sort II error probability is given by

\begin{matrix} P_{e, 2} (i, K) & = Pr (Y (i) \in T_{K}) = Pr (Y (i) \in ⋃_{j \in K} T_{j}) \overset{(a)}{=} Pr (⋃_{j \in K} \{T (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋\}) \\ \overset{(b)}{=} Pr (⋃_{j \in K} \{d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋\}) \overset{(c)}{\leq} \sum_{j \in K} Pr (d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋) \\ \leq K \cdot Pr (d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋), \end{matrix}

(A15)

where

(a)

follows by (51),

(b)

holds by (52) and

(c)

follows by the union bound, i.e., the sum of each individual event’s probability sets an upper constraint on the probability of the union of events. Let us define the following events

\begin{matrix} F_{δ_{β}} (i) & ≜ \{Y \in H^{n}; d_{H} (Y (i), c_{i}) \leq ⌊ n δ_{β} ⌋\}, \end{matrix}

(A16)

\begin{matrix} F_{δ_{β}} (i, j) & ≜ \{Y \in H^{n}; d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋\} . \end{matrix}

(A17)

Next, employing the law of total probability with respect to the event

\{d_{H} (Y (i), c_{i}) \leq ⌊ n δ_{β} ⌋\}

, we establish an upper bound on

Pr (d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋)

given in (A15) as follows:

\begin{matrix} Pr (d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋) & \overset{(a)}{=} Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)) + Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}}^{c} (i)) \\ \overset{(b)}{\leq} Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)) + Pr (F_{δ_{β}}^{c} (i)) \\ \overset{(c)}{=} Pr (F_{δ_{β}} (i, j) \cap F_{i} (δ_{β})) + Pr (d_{H} (Y (i), c_{i}) > ⌊ n δ_{β} ⌋) \\ \overset{(d)}{\leq} Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)) + ζ_{1, n}, \end{matrix}

(A18)

where

(a)

holds by the law of total probability,

(b)

follows since

F_{i}^{c} (δ_{β}) \supset F_{δ_{β}} (i, j) \cap F_{i}^{c} (δ_{β})

,

(c)

holds by (A16), and

(d)

exploits (A12).

Now, we focus on the event

F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)

. Let

\begin{matrix} d ≜ d_{H} (c_{i}, c_{j}) \overset{(a)}{\geq} ⌊ n β ⌋ + 1, \end{matrix}

(A19)

where

(a)

follows by the assumption made in the code construction regarding the minimum Hamming distance; see Lemma 1 and (42). Now, without loss of generality, we may assume that the two sequence

c_{i}

and

c_{j}

differ in the first d symbols, i.e.,

\begin{matrix} c_{i} & = (c_{i_{1}}, c_{i_{2}}, \dots, c_{i_{d}}, c_{i_{d + 1}}, \dots, c_{i_{n}}) \\ c_{j} & = (c_{j_{1}}, c_{j_{2}}, \dots, c_{j_{d}}, c_{j_{d + 1}}, \dots, c_{j_{n}}) \\ y = (y_{1}, y_{2}, \dots, y_{d}, y_{d + 1}, \dots, y_{n}), \end{matrix}

(A20)

where y is the realization of vector

Y (i)

. Therefore, the

n - d

last symbols (bits) of

c_{i}

and

c_{j}

are identical. Observe that the event

\{d_{H} (Y (i), c_{i}) \leq ⌊ n δ_{β} ⌋\}

implies that the received vector y and

c_{i}

differ in p bits, where

p \leq ⌊ n δ_{β} ⌋

, i.e.,

\begin{matrix} d_{H} (y, c_{i}) = p \leq ⌊ n δ_{β} ⌋ . \end{matrix}

(A21)

Now, we assume that

p_{1}

bits out of the p bits happen in the first d bits, i.e.,

d_{H} {(y |}_{1}^{d}, c_{i} |_{1}^{d}) = p_{1}

, where

\begin{matrix} c_{i} {|_{1}^{d} ≜ (c_{i_{1}}, c_{i_{2}}, \dots, c_{i_{d}}) and y |}_{1}^{d} ≜ (y_{1}, y_{2}, \dots, y_{d}), \end{matrix}

(A22)

and

p_{2}

bits with

p_{2} = p - p_{1}

happens in last

n - d

bits, i.e.,

d_{H} {(y |}_{d + 1}^{n}, c_{i} |_{d + 1}^{n}) = p_{2}

, where

\begin{matrix} c_{i} {|_{d + 1}^{n} ≜ (c_{i_{d + 1}}, \dots, c_{i_{n}}) and y |}_{d + 1}^{n} ≜ (y_{d + 1}, \dots, y_{n}) . \end{matrix}

(A23)

Observe that since the symbols of sequences are bits, i.e., either 0 or 1; therefore,

d = d_{H} (c_{i}, c_{j})

implies that the two sequences

c_{i}

and

c_{j}

are complementary for the first d bits. Now, we infer that if the two sequences

{y |}_{1}^{d}

and

c_{i} |_{1}^{d}

differ in

p_{1}

, then

{y |}_{1}^{d}

and

c_{i} |_{1}^{d}

are identical in those

p_{1}

bits. Hence,

d_{H} {(y |}_{1}^{d}, c_{j} |_{1}^{d}) = d - p_{1}

.

Now, if we collect all the positions for which

{y |}_{1}^{n}

and

c_{j} |_{1}^{n}

differ, we obtain

\begin{matrix} d_{H} (y, c_{j}) = d_{H} {(y |}_{1}^{n}, c_{j} |_{1}^{n}) = d_{H} {(y |}_{1}^{d}, c_{j} |_{1}^{d}) + d_{H} {(y |}_{d + 1}^{n}, c_{j} |_{d + 1}^{n}) = d - p_{1} + p_{2} . \end{matrix}

(A24)

Observe that, since we restrict ourselves to the event

\begin{matrix} F_{δ_{β}} (i, j) \cap F_{i}^{c} (δ_{β}) ≜ \{d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋\} \cap \{d_{H} (Y (i), c_{i}) \leq ⌊ n δ_{β} ⌋\}, \end{matrix}

(A25)

\begin{matrix} d - p_{1} + p_{2} \leq ⌊ n δ_{β} ⌋ \Rightarrow p_{2} \leq ⌊ n δ_{β} ⌋ - d + p_{1} . \end{matrix}

(A26)

On the other hand, since

d_{H} (y, c_{j}) \leq ⌊ n δ_{β} ⌋

, we obtain

\begin{matrix} p \leq ⌊ n δ_{β} ⌋ \Rightarrow p_{1} + p_{2} \leq ⌊ n δ_{β} ⌋ \Rightarrow p_{2} \leq ⌊ n δ_{β} ⌋ - p_{1} . \end{matrix}

(A27)

Now, in order to calculate

Pr (d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋)

in (A15), we first fix

p_{1}

, and then sum up over all possible cases for the

p_{2}

, then we would have a second sum which runs for values of

p_{1}

from 0 to d. Observe that the

p_{2}

has two upper bounds given in (A26) and (A27); therefore, in the calculation, we restrict ourselves to the minimum of those two upper bounds. Let define

p_{2}^{UB} ≜ min \{⌊ n δ_{β} ⌋ - p_{1}, ⌊ n δ_{β} ⌋ - d + p_{1}\}

. Thereby,

\begin{matrix} Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)) & \overset{(a)}{\leq} \sum_{p_{1} = 0}^{d} (\binom{d}{p_{1}}) \cdot \sum_{p_{2} = 0}^{p_{2}^{UB}} (\binom{n - d}{p_{2}}) ε^{p_{1} + p_{2}} {(1 - ε)}^{n - (p_{1} + p_{2}) + d - d} \\ \overset{(b)}{=} [\sum_{p_{1} = 0}^{d} (\binom{d}{p_{1}}) ε^{p_{1}} {(1 - ε)}^{d - p_{1}}] \cdot [\sum_{p_{2} = 0}^{p_{2}^{UB}} (\binom{n - d}{p_{2}}) ε^{p_{2}} {(1 - ε)}^{n - d - p_{2}}], \end{matrix}

(A28)

where

(a)

holds since

p = p_{1} + p_{2}

, and

(b)

follows since every expression that is independent of the sum’s variable can be shifted left behind the inner sum. In

(b)

, we have added

0 = d - d

, to obtain the correct form for the two binomial distribution expressions. Now, observe that the first sum is the Binomial cumulative distribution function at point

x = d

and can be upper bounded by 1, i.e.,

\begin{matrix} \sum_{p_{1} = 0}^{d} (\binom{d}{p_{1}}) ε^{p_{1}} {(1 - ε)}^{d - p_{1}} & = Pr (p_{1} \leq d) = B_{X} {(x) |}_{x = d} = B_{X} (d) = 1 . \end{matrix}

(A29)

Now, let focus on the second sum in (A28), for which we establish an upper bound by maximizing

p_{2}^{UB}

through setting

p_{1} = ⌊ d / 2 ⌋

, i.e.,

\begin{matrix} \underset{p_{1}}{arg max} p_{2}^{UB} = ⌊ d / 2 ⌋ . \end{matrix}

(A30)

Therefore,

\begin{matrix} max p_{2}^{UB} & ≜ max [min \{⌊ n δ_{β} ⌋ - p_{1}, ⌊ n δ_{β} ⌋ - d + p_{1}\}] \\ = min \{⌊ n δ_{β} ⌋ - p_{1}, ⌊ n δ_{β} ⌋ - d + p_{1}\} |_{p_{1} = ⌊ d / 2 ⌋} \\ = \{⌊ n δ_{β} ⌋ - d / 2, ⌊ n δ_{β} ⌋ - d + ⌊ d / 2 ⌋\} \\ = \{⌊ n δ_{β} ⌋ - ⌊ d / 2 ⌋, ⌊ n δ_{β} ⌋ - (d - ⌊ d / 2 ⌋)\} \\ = ⌊ n δ_{β} ⌋ - d + ⌊ d / 2 ⌋, \end{matrix}

(A31)

where the last equality holds since by

⌊ d / 2 ⌋ \leq d / 2

for real

d / 2

, we obtain

d / 2 \leq d - ⌊ d / 2 ⌋

.

Now, we exploit the inequality (A95) given in Lemma A5 to obtain an upper bound for the second sum in (A28) as follows: First, we check whether the required condition in Lemma A5 are satisfied or not. Namely, we set

k = ⌊ n δ_{β} ⌋ - d + ⌊ d / 2 ⌋

and

n = n - d

. Now, we calculate their ratio as follows:

\begin{matrix} \frac{k}{n - d} = \frac{⌊ n δ_{β} ⌋ - d + ⌊ d / 2 ⌋}{n - d} & \overset{(a)}{\leq} \frac{n δ_{β} - d + d / 2}{n - d} \\ = \frac{n δ_{β} - d / 2}{n - d} = \frac{δ_{β} - (d / 2 n)}{1 - d / n} \overset{(b)}{<} \frac{δ_{β} - β / 2}{1 - β} ≜ τ, \end{matrix}

(A32)

where

(a)

holds since

⌊ x ⌋ \leq x

for real x and

(b)

holds by the following argument: we assume that

(b)

holds and assuming that

δ_{β} \neq 1 / 2

, we arrive at a trivial inequality, namely,

d > n β :

\begin{matrix} \frac{δ_{β} - (d / 2 n)}{1 - d / n} & < \frac{δ_{β} - β / 2}{1 - β} \Rightarrow \end{matrix}

(A33)

\begin{matrix} (δ_{β} - (d / 2 n)) (1 - β) & < (δ_{β} - β / 2) (1 - d / n) \Rightarrow \end{matrix}

(A34)

\begin{matrix} δ_{β} - β δ_{β} - (d / 2 n) + (β d / 2 n) & < δ_{β} - (δ_{β} d / n) - β / 2 + (β d / 2 n) \Rightarrow \end{matrix}

(A35)

\begin{matrix} β (1 / 2 - δ_{β}) & < (d / 2 n) - (δ_{β} d / n) \Rightarrow \end{matrix}

(A36)

\begin{matrix} β (1 / 2 - δ_{β}) & < (d / n) \cdot (1 / 2 - δ_{β}) \Rightarrow \end{matrix}

(A37)

\begin{matrix} n β & < d, \end{matrix}

(A38)

which can be deduced by assumptions of code construction given in (42), i.e.,

\begin{matrix} d_{H} (c_{i}, c_{j}) \geq ⌊ n β ⌋ + 1 \overset{(a)}{>} n β - 1 + 1 = n β, \end{matrix}

(A39)

where

(a)

holds, since

⌊ n β ⌋ > n β - 1

for real

n β

. Now, we exploit (50), to show that (A32) is upper bounded by

ε

as follows

\begin{matrix} δ_{β} < ε + β (1 / 2 - ε) \Rightarrow δ_{β} < ε + β / 2 - β ε \Rightarrow δ_{β} - β / 2 < ε (1 - β) \Rightarrow \frac{δ_{β} - β / 2}{1 - β} < ε . \end{matrix}

(A40)

Thereby, we apply safely Lemma A5 with parameters

j = p_{2}

,

k = p_{2}^{UB} ≜ ⌊ n δ_{β} ⌋ - d + ⌊ d / 2 ⌋

and

n = n - d

, and obtain

\begin{matrix} \sum_{p_{2} = 0}^{⌊ n δ_{β} ⌋ - d + ⌊ d / 2 ⌋} (\binom{n - d}{p_{2}}) ε^{p_{2}} {(1 - ε)}^{n - d - p_{2}} & \leq \frac{ε ((n - d) - k)}{ε (n - d) - k} \cdot 2^{n [H (\frac{k}{n - d}) - T_{ε} (\frac{k}{n - d})]} \\ \leq \frac{ε (1 - \frac{k}{n - d})}{ε - \frac{k}{n - d}} \cdot 2^{n [H (\frac{k}{n - d}) - T_{ε} (\frac{k}{n - d})]} . \end{matrix}

(A41)

Let us focus on the coefficient in (A41). In the following, assuming an upper bound for it, we arrive to a trivial inequality, therefore, the upper bound is valid.

\begin{matrix} \frac{ε (1 - \frac{k}{n - d})}{ε - \frac{k}{n - d}} & < \frac{ε (1 - τ)}{ε - τ} . \end{matrix}

(A42)

Observe that (A42) yield the following chain of expressions:

\begin{matrix} \frac{1 - \frac{k}{n - d}}{ε - \frac{k}{n - d}} & < \frac{1 - τ}{ε - τ} \Rightarrow \end{matrix}

(A43)

\begin{matrix} ε - τ - \frac{k ε}{n - d} + \frac{k τ}{n - d} & < ε - \frac{k}{n - d} - ε τ + \frac{k τ}{n - d} \Rightarrow \end{matrix}

(A44)

\begin{matrix} - τ - \frac{k ε}{n - d} & < - \frac{k}{n - d} - ε τ \Rightarrow \end{matrix}

(A45)

\begin{matrix} \frac{k}{n - d} (1 - ε) & < τ (1 - ε) \Rightarrow \end{matrix}

(A46)

\begin{matrix} \frac{k}{n - d} & < ε, \end{matrix}

(A47)

which is trivial, since it is already proved in (A32). Now, observe that for

0 < \frac{k}{n - d} < τ < ε

, the following holds

\begin{matrix} H (\frac{k}{n - d}) - T_{ε} (\frac{k}{n - d}) < H (τ) - T_{ε} (τ), \end{matrix}

(A48)

see Figure 7. Therefore, since

τ

always yield a smaller exponent, we obtain an upper bound on the sum in (A41) as follows

\begin{matrix} \sum_{p_{2} = 0}^{⌊ n δ_{β} ⌋ - d + ⌊ d / 2 ⌋} (\binom{n - d}{p_{2}}) ε^{p_{2}} {(1 - ε)}^{n - d - p_{2}} & \leq \frac{ε ((n - d) - k)}{ε (n - d) - k} \cdot 2^{n [H (\frac{k}{n - d}) - T_{ε} (\frac{k}{n - d})]} \\ \overset{(a)}{<} \frac{ε (1 - τ)}{ε - τ} \cdot 2^{n [H (\frac{k}{n - d}) - T_{ε} (\frac{k}{n - d})]} \\ \overset{(b)}{<} \frac{ε (1 - \frac{k}{n - d})}{ε - \frac{k}{n - d}} \cdot 2^{n [H (τ) - T_{ε} (τ)]} \\ ≜ ζ_{0, n}, \end{matrix}

(A49)

where

(a)

exploits (A42), and

(b)

follows by (A48). Thereby, recalling (A28) and employing (A29), we obtain

\begin{matrix} Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)) \leq 1 \cdot \sum_{j = 0}^{k} (\binom{n - d}{j}) ε^{j} {(1 - ε)}^{n - d - j} < \frac{ε (1 - τ)}{ε - τ} \cdot 2^{n [H (τ) - T_{ε} (τ)]} ≜ ζ_{0, n} . \end{matrix}

(A50)

Hence, recalling (A15) and (A18), we obtain

\begin{matrix} P_{e, 2} (i, K) \\ \leq K \cdot [Pr (d_{H} (Y (i), c_{j}) \leq ⌊ n δ_{β} ⌋)] \\ \leq K \cdot [Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)) + ζ_{1, n}] \\ = K \cdot [\frac{ε (1 - τ)}{ε - τ} \cdot 2^{n [H (τ) - T_{ε} (τ)]} + \frac{(δ_{β} + \frac{1}{n}) (1 - ε)}{δ_{β} - ε} \cdot 2^{- n [T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n})]}] \\ \overset{(a)}{=} 2^{n κ} \cdot [\frac{ε (1 - τ)}{ε - τ} \cdot 2^{- n [T_{ε} (τ) - H (τ)]} + \frac{(δ_{β} + \frac{1}{n}) (1 - ε)}{δ_{β} - ε} \cdot 2^{- n [T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n})]}] \\ = \frac{ε (1 - τ)}{ε - τ} \cdot 2^{- n [T_{ε} (τ) - H (τ) - κ]} + \frac{(δ_{β} + \frac{1}{n}) (1 - ε)}{δ_{β} - ε} \cdot 2^{- n [T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - κ]}, \end{matrix}

(A51)

which implies that both the exponential factors given in (A51) should yields strict positive exponents; that is, we obtain two separate upper bounds on the

κ

as follows:

\begin{matrix} κ < T_{ε} (τ) - H (τ) and κ < T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n}), \end{matrix}

(A52)

Therefore,

\begin{matrix} κ < min \{T_{ε} (τ) - H (τ), T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n})\} . \end{matrix}

(A53)

Now, we focus on the second argument in (A53), and provide the following asymptotic behavior:

\begin{matrix} lim_{n \to \infty} T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n}) & = T_{ε} (lim_{n \to \infty} \frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (lim_{n \to \infty} \frac{⌊ n δ_{β} ⌋ + 1}{n}), \end{matrix}

(A54)

where the equality holds, since

T_{ε} (\cdot)

and

H (\cdot)

are continuous functions of

δ_{β}

. Now, observe that since

⌊ n δ_{β} ⌋ - 1 < ⌊ n δ_{β} ⌋ \leq n δ_{β}

for real

n δ_{β}

, we obtain

\begin{matrix} lim_{n \to \infty} \frac{n δ_{β} - 1 + 1}{n} \leq lim_{n \to \infty} \frac{⌊ n δ_{β} ⌋ + 1}{n} \leq lim_{n \to \infty} \frac{n δ_{β} + 1}{n} & \Rightarrow \\ δ_{β} \leq lim_{n \to \infty} \frac{⌊ n δ_{β} ⌋ + 1}{n} \leq lim_{n \to \infty} δ_{β} + \frac{1}{n} & \overset{(a)}{\Rightarrow} lim_{n \to \infty} \frac{⌊ n δ_{β} ⌋ + 1}{n} = δ_{β}, \end{matrix}

(A55)

where

(a)

holds by the squeeze theorem. Thereby,

\begin{matrix} lim_{n \to \infty} T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n}) = T_{ε} (δ_{β}) - H (δ_{β}) . \end{matrix}

(A56)

Thus, recalling (A53), we obtain the subsequent upper bound on the goal identification rate

κ

:

\begin{matrix} κ & < min \{T_{ε} (τ) - H (τ), T_{ε} (\frac{⌊ n δ_{β} ⌋ + 1}{n}) - H (\frac{⌊ n δ_{β} ⌋ + 1}{n})\} \\ \overset{(a)}{=} min \{T_{ε} (\frac{δ_{β} - β / 2}{1 - β}) - H (\frac{δ_{β} - β / 2}{1 - β}), T_{ε} (δ_{β}) - H (δ_{β})\}, \end{matrix}

(A57)

where

(a)

follows from (A32) and (A56). Next, exploiting (49), we derive the arguments provided in (A57) as follows:

\begin{matrix} κ_{UB}^{1} & ≜ T_{ε} (f_{1} (ε, β)) - H (f_{1} (ε, β)) \end{matrix}

(A58)

\begin{matrix} κ_{UB}^{2} & ≜ T_{ε} (f_{2} (ε, β)) - H (f_{2} (ε, β)), \end{matrix}

(A59)

where

f_{1} (ε, β)

and

f_{2} (ε, β)

are given in (13) and (14). Thereby,

\begin{matrix} κ & < min (κ_{UB}^{1}, κ_{UB}^{2}) . \end{matrix}

(A60)

Therefore, recalling (A51), we obtain

\begin{matrix} P_{e, 2} (i, j) & \leq Pr (F_{δ_{β}} (i, j) \cap F_{δ_{β}} (i)) + Pr (d_{H} (Y (i), c_{i}) > ⌊ n δ_{β} ⌋) \\ \leq ζ_{0, n} + ζ_{1, n} \leq ζ_{0} + ζ_{1} \leq e_{2}, \end{matrix}

(A61)

hence,

P_{e, 2} (i, j) \leq e_{2}

holds for sufficiently large n and arbitrarily small

e_{2} > 0

.

Appendix C. Cover-Free Families

In this subsection, we provide some preliminaries about the concept of cover-free families and establish some basic and well-known results. Furthermore, we draw the connection between such concept and the RKI.

Definition A1

(r-cover-free family). Let pair

(X, F)

be a set system, where X is a set of points and

ℱ

is a set of subsets (blocks) of X. A set system

(X, F)

is called r-cover-free family, if for an arbitrary r distinct blocks

A_{1}, \dots, A_{r} \in F

and any other block

A_{0} \in F

, we have

\begin{matrix} A_{0} ⊄ ⋃_{i = 1}^{r} A_{i} . \end{matrix}

(A62)

The concept of r-cover-free families in the literature was first found in [70]. In the following, we introduce a well-known theorem in the literature, which established a power law decaying the lower and upper bounds on the size of cover-free families.

Theorem A1

(see [70]). Let

A ≜ {1, \dots, | A |}

and

ℱ

be the set of points and subsets, respectively, such that the set system

(A, F)

constitute a r-cover-free family. Then, let indicate the maximum size of

ℱ

over 𝒜 by

M (| A |, r)

. Now, we have

\frac{c_{1}}{r^{2}} \leq \frac{log M (| A |, r)}{| A |} \leq \frac{c_{2}}{r},

for some constants

c_{1}

and

c_{2}

.

Next, we present a theorem which establish an upper bound on the size of r-cover-free family as follows:

Theorem A2

(see [71]). Assume that set system

(A, F)

constitute a r-cover-free family where

A ≜ {1, \dots, | A |}

. Now, the maximum size of the r-cover-free family, i.e.,

| F |

, is upper bounded as follows:

\begin{matrix} \frac{log M (| A |, r)}{| A |} \leq k \cdot \frac{log r}{r^{2}}, \end{matrix}

(A63)

where k is a constant.

Next, we explain on the connection between the notion of r-cover-free families in the combinatorics and RKI for noiseless discrete memoryless channel found by Ahlswede in [26]: Let

a = | X |

,

r = a^{κ n}

,

| A | = a^{n}

, then the RI coding with 0-valued first type error, is upper bounded by:

\begin{matrix} R_{n} ≜ \frac{log log M (a^{n}, a^{κ n})}{n} \leq (1 - 2 κ) log a + o (1) . \end{matrix}

(A64)

Then, for a DMC with input alphabet of size

| X |

, we obtain

\begin{matrix} R \leq (1 - 2 κ) log | X | . \end{matrix}

(A65)

Therefore, for the binary input channel, i.e., where

| X | = 2

, we obtain

R \leq 1 - 2 κ

.

Appendix D. Lower Bound on the Volume of the Hamming Ball

Lemma A1

(see [72], Lem. 16.19). Let

n, q \geq 2

be positive integers and assume a real ε where

0 \leq ⌊ n ε ⌋ / n \leq 1 - 1 / q

. Then, volume of the Hamming ball in the q-ary alphabet is lower bounded as follows:

\begin{matrix} Vol (B_{x_{0}} (n, r)) ≜ \sum_{j = 0}^{⌊ n ε ⌋} (\binom{n}{j}) {(q - 1)}^{j} \geq q^{H_{q} (\frac{⌊ n ε ⌋}{n}) - o ({log}_{q} n)} . \end{matrix}

(A66)

Proof.

Observe that the Stirling approximation [73] gives the following bounds on

n!

:

\begin{matrix} \sqrt{2 n π} {(\frac{n}{e})}^{n} e^{λ_{1} (n)} \leq n! \leq \sqrt{2 n π} {(\frac{n}{e})}^{n} e^{λ_{2} (n)} . \end{matrix}

(A67)

Now, we have

\begin{matrix} (\binom{n}{⌊ n ε ⌋}) \\ = \frac{n!}{⌊ n ε ⌋! (n - ⌊ n ε ⌋)!} \\ > \frac{\sqrt{2 n π} \cdot {(\frac{n}{e})}^{n} \cdot e^{λ_{1} (n)}}{[\sqrt{2 ⌊ n ε ⌋ π} \cdot {(\frac{⌊ n ε ⌋}{e})}^{⌊ n ε ⌋} \cdot e^{λ_{1} (n)}] [\sqrt{2 (n (1 - \frac{⌊ n ε ⌋}{n})) π} \cdot {(\frac{n (1 - \frac{⌊ n ε ⌋}{n})}{e})}^{n (1 - \frac{⌊ n ε ⌋}{n})}]} \\ = [\frac{{(\frac{n}{e})}^{n}}{{(\frac{⌊ n ε ⌋}{e})}^{⌊ n ε ⌋} \cdot {(\frac{n (1 - \frac{⌊ n ε ⌋}{n})}{e})}^{n (1 - \frac{⌊ n ε ⌋}{n})}}] \cdot [\frac{e^{λ_{1} (n) - λ_{2} (⌊ n ε ⌋) - λ_{2} (n (1 - \frac{⌊ n ε ⌋}{n}))}}{\sqrt{2 π ⌊ n ε ⌋ (1 - \frac{⌊ n ε ⌋}{n})}}] \\ \overset{(a)}{=} [\frac{1}{{(\frac{⌊ n ε ⌋}{n})}^{⌊ n ε ⌋} \cdot {(1 - \frac{⌊ n ε ⌋}{n})}^{n (1 - \frac{⌊ n ε ⌋}{n})}}] \cdot [\frac{e^{⌊ n ε ⌋} \cdot e^{n (1 - \frac{⌊ n ε ⌋}{n})}}{e^{n}}] \cdot Res (n) \\ \overset{(b)}{=} \frac{Res (n)}{{(\frac{⌊ n ε ⌋}{n})}^{⌊ n ε ⌋} \cdot {(1 - \frac{⌊ n ε ⌋}{n})}^{n (1 - \frac{⌊ n ε ⌋}{n})}} \end{matrix}

(A68)

where

(a)

holds, since we let

\begin{matrix} Res (n) ≜ \frac{e^{λ_{1} (n) - λ_{2} (⌊ n ε ⌋) - λ_{2} (n (1 - \frac{⌊ n ε ⌋}{n}))}}{\sqrt{2 π ⌊ n ε ⌋ (1 - \frac{⌊ n ε ⌋}{n})}}, \end{matrix}

(A69)

and

(b)

holds, since

\begin{matrix} \frac{e^{⌊ n ε ⌋} \cdot e^{n (1 - \frac{⌊ n ε ⌋}{n})}}{e^{n}} = 1 . \end{matrix}

(A70)

Next, we proceed to bound the Hamming ball as follows: Observe that the volume of Hamming ball as provided in (A66) is lower bounded by the Binomial coefficient for the largest index, i.e.,

j = ⌊ n ε ⌋

. Therefore,

\begin{matrix} Vol (B_{x_{0}} (n, r)) & ≜ \sum_{j = 0}^{⌊ n ε ⌋} (\binom{n}{j}) {(q - 1)}^{j} \\ \geq (\binom{n}{⌊ n ε ⌋}) {(q - 1)}^{⌊ n ε ⌋} \\ > \frac{{(q - 1)}^{⌊ n ε ⌋}}{{(\frac{⌊ n ε ⌋}{n})}^{n ε} \cdot {(1 - \frac{⌊ n ε ⌋}{n})}^{n (1 - \frac{⌊ n ε ⌋}{n})}} \cdot Res (n) \\ = q^{{log}_{q} (\frac{{(q - 1)}^{⌊ n ε ⌋}}{{(\frac{⌊ n ε ⌋}{n})}^{⌊ n ε ⌋} \cdot {(1 - \frac{⌊ n ε ⌋}{n})}^{n (1 - \frac{⌊ n ε ⌋}{n})}}) + {log}_{q} Res (n)} \\ = q^{⌊ n ε ⌋ {log}_{q} (q - 1) - ⌊ n ε ⌋ {log}_{q} \frac{⌊ n ε ⌋}{n} - n (1 - \frac{⌊ n ε ⌋}{n}) {log}_{q} (1 - \frac{⌊ n ε ⌋}{n}) + {log}_{q} Res (n)} \\ = q^{n (\frac{⌊ n ε ⌋}{n} {log}_{q} (q - 1) - \frac{⌊ n ε ⌋}{n} {log}_{q} \frac{⌊ n ε ⌋}{n} - (1 - \frac{⌊ n ε ⌋}{n}) {log}_{q} (1 - \frac{⌊ n ε ⌋}{n}) + {log}_{q} Res (n))} \\ = q^{n H_{q} (\frac{⌊ n ε ⌋}{n}) + {log}_{q} Res (n)} . \end{matrix}

(A71)

Now, by letting

λ_{1} (n) = 0

and

λ_{2} (n) = 1 / (12 n)

, we obtain

\begin{matrix} Res (n) = \frac{e^{- \frac{1}{12 ⌊ n ε ⌋} - \frac{1}{n - ⌊ n ε ⌋}}}{\sqrt{2 π ⌊ n ε ⌋ (1 - \frac{⌊ n ε ⌋}{n})}} \overset{(a)}{\leq} \frac{e^{- \frac{1}{12 ⌊ n ε ⌋} - \frac{1}{n - ⌊ n ε ⌋}}}{\sqrt{2 π ⌊ n ε ⌋ (1 - ε)}} \overset{(b)}{=} K (ε) {⌊ n ε ⌋}^{- \frac{1}{2}} e^{- \frac{1}{12 ⌊ n ε ⌋} - \frac{1}{n - ⌊ n ε ⌋}}, \end{matrix}

(A72)

where

(a)

follows for sufficiently large n, since

⌊ n ε ⌋ \leq n ε

and

(b)

holds by setting

K (ε) ≜ \frac{1}{\sqrt{2 π (1 - ε)}}

. Therefore,

\begin{matrix} {log}_{q} Res (n) = {log}_{q} K (ε) - \frac{1}{2} {log}_{q} ⌊ n ε ⌋ - \frac{1}{12 ⌊ n ε ⌋} - \frac{1}{n - ⌊ n ε ⌋} = o ({log}_{q} n), \end{matrix}

(A73)

which implies that

\begin{matrix} lim_{n \to \infty} \frac{{log}_{q} Res (n)}{{log}_{q} n} = 0 . \end{matrix}

(A74)

Thereby,

\begin{matrix} Vol (B_{x_{0}} (n, r)) ≜ \sum_{j = 0}^{⌊ n ε ⌋} (\binom{n}{j}) {(q - 1)}^{j} \geq q^{n H_{q} (\frac{⌊ n ε ⌋}{n}) + o ({log}_{q} n)} . \end{matrix}

(A75)

□

Appendix E. Upper Bound on the Volume of the Hamming Ball

Lemma A2

(see [72], Lem. 16.19). Let integer

n \geq 1

and

0 < ε \leq 1 / 2

with

n > ⌊ n ε ⌋ \geq 1

. Then, volume of the Hamming ball in the binary alphabet is upper bounded as follows:

\begin{matrix} Vol (B_{x_{0}} (n, r)) ≜ \sum_{j = 0}^{⌊ n ε ⌋} (\binom{n}{j}) \leq 2^{n H (ε)}, \end{matrix}

(A76)

Proof.

Note that

0 < \forall ε \leq 1 / 2

, the logit function

H (ε) ≜ log (\frac{ε}{1 - ε})

is non-positive, i.e.,

\begin{matrix} H (ε) = log (\frac{ε}{1 - ε}) = log ε - log (1 - ε) \leq 0 . \end{matrix}

(A77)

Next, notice that for

i \in [0, ⌊ n ε ⌋]

we obtain the following:

\begin{matrix} i log ε + (n - i) log (1 - ε) \geq - n H (ε), \end{matrix}

(A78)

where

H (ε)

is the binary entropy function. Hence,

ε^{i} {(1 - ε)}^{n - i} \geq 2^{- n H (ε)}

. Now,

\begin{matrix} 1 = {(ε + (1 - ε))}^{n} = \sum_{i = 0}^{n} (\binom{n}{i}) ε^{i} {(1 - ε)}^{n - i} \geq \sum_{i = 0}^{⌊ n ε ⌋} ε^{i} {(1 - ε)}^{n - i} \geq 2^{- n H (ε)} \sum_{i = 0}^{⌊ n ε ⌋} (\binom{n}{i}) . \end{matrix}

(A79)

□

Therefore, we obtain

\begin{matrix} Vol (B_{x_{0}} (n, r)) ≜ \sum_{j = 0}^{⌊ n ε ⌋} (\binom{n}{j}) \leq 2^{n H (ε)} . \end{matrix}

(A80)

Appendix F. Bound on the Upper Tail of the Binomial Cumulative Distribution Function—Part 1

Lemma A3

(see ([35 Probl. 5.8-(c))). Let

0 < ε < 1

and

ε < \frac{k}{n} < 1

. Then,

\begin{matrix} (\binom{n}{k}) ε^{j} {(1 - ε)}^{n - k} \leq \sum_{j = k}^{n} (\binom{n}{j}) ε^{j} {(1 - ε)}^{n - j} \leq (\binom{n}{k}) ε^{k} {(1 - ε)}^{n - k} [\frac{k (1 - ε)}{k (1 - ε) - (n - k) ε}] . \end{matrix}

(A81)

Proof.

The proof for the lower bound is trivial and obvious. For proving the upper bound, we employ the provided hints given in ([35] p. 531) as follows: Observe that

\begin{matrix} (\binom{n}{j + 1}) = (\binom{n}{j}) (\binom{n - k}{k + 1}) < (\binom{n}{j}) (\binom{n - j}{j}), \end{matrix}

(A82)

and

\begin{matrix} (\binom{n}{k + m}) = (\binom{n}{k + m - 1}) (\binom{n - (k + m - 1)}{k + m - 1}) < (\binom{n}{k + m - 1}) (\binom{n - k}{k}), \end{matrix}

(A83)

Using the induction, we obtain

\begin{matrix} (\binom{n}{k + m}) < (\binom{n}{k}) {(\binom{n - k}{k})}^{m} . \end{matrix}

(A84)

Now, we sum over the variable j by using a geometric series. Next, we combine this results with the result of part

(a)

in the Problem 5.8 of [35], and we obtain the desired upper bound. That is,

\begin{matrix} \sqrt{\frac{n}{8 k (n - k)}} e^{n H (k / n) + k log ε + (n - k) log (1 - ε)} \leq \sum_{j = k}^{n} (\binom{n}{j}) ε^{j} {(1 - ε)}^{n - j} \\ < \sqrt{\frac{n}{2 π k (n - k)}} \cdot \frac{k (1 - ε)}{k (1 - ε) - (n - k) ε} \cdot e^{n H (k / n) + k log ε + (n - k) log (1 - ε)} . \end{matrix}

(A85)

□

Appendix G. Bound on the Upper Tail of the Binomial Cumulative Distribution Function—Part 2

Lemma A4.

Let

0 < ε < 1

and

ε < \frac{k}{n} < 1

. Then,

\begin{matrix} \sum_{j = k}^{n} (\binom{n}{j}) ε^{j} {(1 - ε)}^{n - j} \leq 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} [\frac{k (1 - ε)}{k (1 - ε) - (n - k) ε}] . \end{matrix}

(A86)

Proof.

Recall that the equation of the tangent line to the binary entropy function

H (δ_{β})

at the specific point

δ_{β} = ε

is given by

\begin{matrix} T_{ε} (δ_{β}) \\ \overset{(a)}{=} H (ε) + (δ_{β} - ε) \frac{d H (δ_{β})}{d δ_{β}} |_{δ_{β} = ε} \\ \overset{(b)}{=} H (ε) + (δ_{β} - ε) log (\frac{1 - ε}{ε}) \\ = H (ε) + (δ_{β} - ε) [log (1 - ε) - log ε] \\ \overset{(c)}{=} - ε log ε - (1 - ε) log (1 - ε) + δ_{β} log (1 - ε) - δ_{β} log ε - ε log (1 - ε) + ε log ε \\ = - ε log ε - log (1 - ε) + ε log (1 - ε) + δ_{β} log (1 - ε) - δ_{β} log ε - ε log (1 - ε) + ε log ε \\ = - log (1 - ε) + δ_{β} log (1 - ε) - δ_{β} log ε \\ = - log (1 - ε) + δ_{β} log (1 - ε) - δ_{β} log ε \\ = - δ_{β} log (ε) - (1 - δ_{β}) log (1 - ε), \end{matrix}

(A87)

where

(a)

holds by definition of a tangent line to a function at specific point,

(b)

follows since derivative of the entropy function reads the negative of the logit function, i.e.,

\begin{matrix} \frac{d H (δ_{β})}{d δ_{β}} = - logit (δ_{β}) ≜ - log (δ_{β} / (1 - δ_{β})), \end{matrix}

(A88)

for

0 < δ_{β} < 1

, and

(c)

holds by definition of the entropy function, i.e.,

\begin{matrix} H (ε) ≜ - ε log ε - (1 - ε) log (1 - ε) . \end{matrix}

(A89)

Therefore, exploiting (A87) we obtain,

\begin{matrix} T_{ε} (\frac{k}{n}) = - \frac{k}{n} log (ε) - (1 - \frac{k}{n}) log (1 - ε), \end{matrix}

(A90)

which implies

- n T_{ε} (\frac{k}{n}) = k log (ε) + (n - k) log (1 - ε)

. Thereby,

\begin{matrix} 2^{- n T_{ε} (\frac{k}{n})} = ε^{k} {(1 - ε)}^{n - k} . \end{matrix}

(A91)

Now, observe that the Binomial coefficient

(\binom{n}{k})

where

k \geq 1

and

n - k \geq 1

, can be upper bounded as follows ([34] see p. 353)

\begin{matrix} (\binom{n}{k}) \leq 2^{n H (\frac{k}{n})} . \end{matrix}

(A92)

Therefore,

\begin{matrix} \frac{k (1 - ε)}{k (1 - ε) - (n - k) ε} \cdot (\binom{n}{k}) ε^{k} {(1 - ε)}^{n - k} \overset{(a)}{\leq} \frac{k (1 - ε)}{k (1 - ε) - (n - k) ε} \cdot 2^{n H (\frac{k}{n})} \cdot ε^{k} {(1 - ε)}^{n - k} \\ \overset{(b)}{\leq} \frac{k (1 - ε)}{k (1 - ε) - (n - k) ε} \cdot 2^{n H (\frac{k}{n})} \cdot 2^{- n T_{ε} (\frac{k}{n})} = [\frac{k (1 - ε)}{k (1 - ε) - (n - k) ε}] \cdot 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]}, \end{matrix}

(A93)

where

(a)

holds by (A91), and

(b)

follows by exploiting (A91). Now, recalling (A86), we obtain

\begin{matrix} \sum_{j = k}^{n} (\binom{n}{j}) ε^{j} {(1 - ε)}^{n - j} \leq \frac{k (1 - ε)}{k (1 - ε) - (n - k) ε} 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} . \end{matrix}

(A94)

This completes the proof of Lemma A4. □

Appendix H. Bound on the Binomial Cumulative Distribution Function

Lemma A5

(see ([74] App. A)). Let

0 < ε < 1

and

k < n

with

\frac{k}{n} < ε

. Then,

\begin{matrix} \sum_{j = 0}^{k} (\binom{n}{j}) ε^{j} {(1 - ε)}^{n - j} \leq \frac{ε (n - k)}{ε n - k} \cdot 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} . \end{matrix}

(A95)

Proof.

Let us define

\begin{matrix} k^{'} ≜ n - k, \\ ε^{'} ≜ 1 - ε, \end{matrix}

(A96)

i.e.,

k \leftrightarrow k^{'}

and

ε \leftrightarrow ε^{'}

or equivalently

\begin{matrix} k & \leftrightarrow n - k, \\ ε & \leftrightarrow 1 - ε . \end{matrix}

(A97)

Now, observe that

\frac{k}{n} > ε \Rightarrow \frac{k^{'}}{n} < ε^{'}

.

Furthermore, by definition of the binary entropy function and its tangent line, we have

\begin{matrix} H (\frac{k}{n}) & = H (\frac{n - k}{n}), \end{matrix}

(A98)

and

\begin{matrix} T_{ε} (\frac{k}{n}) & = T_{1 - ε} (\frac{n - k}{n}), \end{matrix}

(A99)

where (A98) follows by (A89) and (A99) holds by (A90).

Now, applying the variable exchange of

j \leftrightarrow n - j

unto (A86), we obtain

\begin{matrix} \sum_{n - j = k}^{n - j = n} (\binom{n}{n - j}) ε^{n - j} {(1 - ε)}^{n - (n - j)} \leq 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} [\frac{k (1 - ε)}{k (1 - ε) - (n - k) ε}] . \end{matrix}

(A100)

Observe that, since the index of sum in (A86) runs form k to n, i.e.,

k \leq j \leq n

, in the new system, we have

k \leq n - j \leq n

, which is equivalent to

0 \leq j \leq n - k

. Further, the Binomial coefficient for

0 \leq j \leq n

fulfills the subsequent identity:

\begin{matrix} (\binom{n}{n - j}) = (\binom{n}{j}), \end{matrix}

(A101)

Thereby,

\begin{matrix} \sum_{j = 0}^{n - k} (\binom{n}{j}) ε^{n - j} {(1 - ε)}^{j} \leq 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} [\frac{k (1 - ε)}{k (1 - ε) - (n - k) ε}] . \end{matrix}

(A102)

Now, applying the exchange of variables given in (A97) unto (A102), we obtain

\begin{matrix} \sum_{j = 0}^{k} (\binom{n}{j}) {(1 - ε)}^{n - j} ε^{j} & \leq 2^{n [H (\frac{n - k}{n}) - T_{1 - ε} (\frac{n - k}{n})]} [\frac{(n - k) ε}{(n - k) ε - k (1 - ε)}] \\ = 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} [\frac{(n - k) ε}{(n - k) ε - k (1 - ε)}], \end{matrix}

(A103)

where the equality holds by (A98) and (A99). Therefore,

\begin{matrix} \sum_{j = 0}^{k} (\binom{n}{j}) {(1 - ε)}^{n - j} ε^{j} \leq 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} [\frac{(n - k) ε}{(n - k) ε - k (1 - ε)}] . \end{matrix}

(A104)

Now, we focus on the bracket in (A103), which can be simplified as follows:

\begin{matrix} \frac{(n - k) ε}{(n - k) ε - k (1 - ε)} = \frac{(\frac{n - k}{n}) ε}{(\frac{n - k}{n}) ε - \frac{k}{n} (1 - ε)} = \frac{ε - \frac{k}{n} ε}{ε - \frac{k}{n}} = \frac{ε (n - k)}{ε n - k}, \end{matrix}

(A105)

where the first equality follows by dividing both sides in the left side by factor n. Thereby,

\begin{matrix} \sum_{j = 0}^{k} (\binom{n}{j}) {(1 - ε)}^{n - j} ε^{j} \leq \frac{ε (n - k)}{ε n - k} \cdot 2^{n [H (\frac{k}{n}) - T_{ε} (\frac{k}{n})]} . \end{matrix}

(A106)

This completes the proof of Lemma A5. □

References

Li, S.; Xu, L.D.; Zhao, S. The Internet of Things: A Survey. Inf. Syst. Front. 2015, 17, 243–259. [Google Scholar] [CrossRef]
Da Xu, L.; He, W.; Li, S. Internet of Things in Industries: A Survey. IEEE Trans. Ind. Inform. 2014, 10, 2233–2243. [Google Scholar]
Stankovic, J.A. Research Directions For The Internet of Things. IEEE Internet Things J. 2014, 1, 3–9. [Google Scholar] [CrossRef]
Sun, L.; Du, Q. A Review of Physical Layer Security Techniques For Internet of Things: Challenges and Solutions. Entropy 2018, 20, 730. [Google Scholar] [CrossRef] [PubMed]
Batty, M.; Axhausen, K.W.; Giannotti, F.; Pozdnoukhov, A.; Bazzani, A.; Wachowicz, M.; Ouzounis, G.; Portugali, Y. Smart Cities of The Future. Eur. Phys. J. Spec. Top. 2012, 214, 481–518. [Google Scholar] [CrossRef]
Ray, P.P. An Introduction to Dew Computing: Definition, Concept and Implications. IEEE Access 2018, 6, 723–737. [Google Scholar] [CrossRef]
Jordan, M.I.; Mitchell, T.M. Machine Learning: Trends, Perspectives, and Prospects. Science 2015, 349, 255–260. [Google Scholar] [CrossRef]
Paiva, S.; Ahad, M.A.; Tripathi, G.; Feroz, N.; Casalino, G. Enabling Technologies For Urban Smart Mobility: Recent Trends, Opportunities and Challenges. Sensors 2021, 21, 2143. [Google Scholar] [CrossRef]
Mahmud, K.; Town, G.E.; Morsalin, S.; Hossain, M. Integration of Electric Vehicles and Management in The Internet of Energy. Renew. Sustain. Energy Rev. 2018, 82, 4179–4203. [Google Scholar] [CrossRef]
Fascista, A.; Coluccia, A.; Ravazzi, C. A Unified Bayesian Framework For Joint Estimation and Anomaly Detection in Environmental Sensor Networks. IEEE Access 2023, 11, 227–248. [Google Scholar] [CrossRef]
Gatouillat, A.; Badr, Y.; Massot, B.; Sejdić, E. Internet of Medical Things: A Review of Recent Contributions Dealing With Cyber-Physical Systems in Medicine. IEEE Internet Things J. 2018, 5, 3810–3822. [Google Scholar] [CrossRef]
da Costa, C.A.; Pasluosta, C.F.; Eskofier, B.; da Silva, D.B.; da Rosa Righi, R. Internet of Health Things: Toward Intelligent Vital Signs Monitoring in Hospital Wards. Med. Artif. Intell. 2018, 89, 61–69. [Google Scholar] [CrossRef]
Lee, C.; Koo, B.H.; Chae, C.B.; Schober, R. The Internet of Bio-Nano Things in Blood Vessels: System Design and Prototypes. J. Commun. Netw. 2023, 25, 222–231. [Google Scholar] [CrossRef]
Akyildiz, I.F.; Pierobon, M.; Balasubramaniam, S.; Koucheryavy, Y. The Internet of Bio-Nano Things. IEEE Commun. Mag. 2015, 53, 32–40. [Google Scholar] [CrossRef]
Nakano, T.; Eckford, A.W.; Haraguchi, T. Molecular Communication; Cambridge University Press: New York, NY, USA, 2013. [Google Scholar]
Farsad, N.; Yilmaz, H.B.; Eckford, A.; Chae, C.B.; Guo, W. A Comprehensive Survey of Recent Advancements in Molecular Communication. IEEE Commun. Surv. Tutor. 2016, 18, 1887–1919. [Google Scholar] [CrossRef]
Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Cabrera, J.A.; Boche, H.; Deppe, C.; Schaefer, R.F.; Scheunert, C.; Fitzek, F.H. 6G and the Post-Shannon Theory. In Shaping Future 6G Networks: Needs, Impacts, and Technologies; IEEE Press: Piscataway, NJ, USA, 2021; pp. 271–294. [Google Scholar]
Zhang, C.; Zou, H.; Lasaulce, S.; Saad, W.; Kountouris, M.; Bennis, M. Goal-Oriented Communications For The IoT and Application to Data Compression. IEEE Internet Things Mag. 2022, 5, 58–63. [Google Scholar] [CrossRef]
Schwenteck, P.; Nguyen, G.T.; Boche, H.; Kellerer, W.; Fitzek, F.H.P. 6G Perspective of Mobile Network Operators, Manufacturers, and Verticals. IEEE Netw. Lett. 2023, 5, 169–172. [Google Scholar] [CrossRef]
Fettweis, G.P.; Boche, H. 6G: The Personal Tactile Internet—And Open Questions for Information Theory. IEEE BITS Inf. Theory Mag. 2021, 1, 71–82. [Google Scholar] [CrossRef]
Liu, Y.; Liu, X.; Mu, X.; Hou, T.; Xu, J.; Di Renzo, M.; Al-Dhahir, N. Reconfigurable Intelligent Surfaces: Principles and Opportunities. IEEE Commun. Surv. Tutor. 2021, 23, 1546–1577. [Google Scholar] [CrossRef]
Fascista, A.; Keskin, M.F.; Coluccia, A.; Wymeersch, H.; Seco-Granados, G. RIS-Aided Joint Localization and Synchronization With a Single-Antenna Receiver: Beamforming Design and Low-Complexity Estimation. IEEE J. Sel. Top. Signal Process. 2022, 16, 1141–1156. [Google Scholar] [CrossRef]
Shi, J.; Chan, T.T.; Pan, H.; Lok, T.M. Reconfigurable Intelligent Surface Assisted Semantic Communication Systems. arXiv 2023, arXiv:2306.09650. [Google Scholar]
Torres-Figueroa, L.; Ferrara, R.; Deppe, C.; Boche, H. Message Identification for Task-Oriented Communications: Exploiting an Exponential Increase in the Number of Connected Devices. IEEE Internet Things Mag. 2023, 6, 42–47. [Google Scholar] [CrossRef]
Ahlswede, R. General Theory of Information Transfer: Updated. Discrete Appl. Math. 2008, 156, 1348–1388. [Google Scholar] [CrossRef]
Seyhan, K.; Akleylek, S. Classification of Random Number Generator Applications in IoT: A Comprehensive Taxonomy. J. Inf. Secur. Appl. 2022, 71, 103365. [Google Scholar] [CrossRef]
Hughes, J.P.; Diffie, W. The Challenges of IoT, TLS, and Random Number Generators in The Real World: Bad Random Numbers are Still With us and Are Proliferating in Modern Systems. Queue 2022, 20, 18–40. [Google Scholar] [CrossRef]
Brakerski, Z.; Kalai, Y.T.; Saxena, R.R. Deterministic and Efficient Interactive Coding From Hard-to-Decode Tree Codes. In Proceedings of the IEEE Symposium on Foundations of Computer Science, Durham, NC, USA, 16–19 November 2020; pp. 446–457. [Google Scholar]
Bocchino, R.L.; Adve, V.; Adve, S.; Snir, M. Parallel Programming Must be Deterministic by Default. Usenix HotPar 2009, 6, 1855591–1855595. [Google Scholar]
Arıkan, E. Channel Polarization: A Method For Constructing Capacity-Achieving Codes For Symmetric Binary-Input Memoryless Channels. IEEE Trans. Inf. Theory 2009, 55, 3051–3073. [Google Scholar] [CrossRef]
Salariseddigh, M.J.; Pereg, U.; Boche, H.; Deppe, C. Deterministic Identification Over Channels With Power Constraints. IEEE Trans. Inf. Theory 2022, 68, 1–24. [Google Scholar] [CrossRef]
JáJá, J. Identification is Easier Than Decoding. In Proceedings of the Annual Symposium on Foundations of Computer Science, Portland, OR, USA, 21–23 October 1985; pp. 43–50. [Google Scholar]
Cover, T.; Thomas, J. Elements of Information Theory; Wiley Series Telecomm.; John Wiley & Sons: New York, NY, USA, 1991. [Google Scholar]
Gallager, R.G. Information Theory and Reliable Communication; John Wiley & Sons, Inc.: New York, NY, USA, 1968. [Google Scholar]
Gamal, A.E.; Kim, Y.H. Network Information Theory; Cambridge University Press: New York, NY, USA, 2012. [Google Scholar]
MacKay, D.J. Information Theory, Inference and Learning Algorithms; Cambridge University Press: New York, NY, USA, 2003. [Google Scholar]
Zhang, G.; Chen, K.; Ma, C.; Reddy, S.K.; Ji, B.; Li, Y.; Han, C.; Zhang, X.; Fu, Z. Decision Fusion For Multi-Route and Multi-Hop Wireless Sensor Networks Over The Binary Symmetric Channel. Comput. Commun. 2022, 196, 167–183. [Google Scholar] [CrossRef]
Premkumar, K.; Chen, X.; Leith, D.J. Utility Optimal Coding For Packet Transmission Over Wireless Networks—Part I: Networks of Binary Symmetric Channels. In Proceedings of the 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 28–30 September 2011; pp. 1592–1599. [Google Scholar] [CrossRef]
Slepian, D. A Class of Binary Signaling Alphabets. Bell Syst. Tech. J. 1956, 35, 203–234. [Google Scholar] [CrossRef]
Elias, P. Coding For Noisy Channels. In Proceedings of the IRE WESCON Convention Record; 1955; Volume 2, pp. 94–104. Available online: https://cir.nii.ac.jp/crid/1570009750462156928 (accessed on 13 February 2024).
Elias, P. Coding For Two Noisy Channels. In Proceedings of the 3rd London Symposium in Information Theory, London, UK, September 1955; Available online: https://cir.nii.ac.jp/crid/1571417125336937088 (accessed on 13 February 2024).
Elias, P. List Decoding For Noisy Channels. In Proceedings of the IRE WESCON Convention Record, San Francisco, CA, USA, 20–23 August 1957; pp. 94–104. [Google Scholar]
Golay, M.J. Notes on Digital Coding. Proc. IEEE 1949, 37, 657. [Google Scholar]
Hamming, R.W. Error Detecting and Error Correcting Codes. Bell Syst. Tech. J. 1950, 29, 147–160. [Google Scholar] [CrossRef]
Reed, I.S. A Class of Multiple-Error-Correcting Codes and The Decoding Scheme. IEEE Trans. Inf. Theory 1954, 4, 38–49. [Google Scholar] [CrossRef]
Dabbabi, O.; Salariseddigh, M.J.; Deppe, C.; Boche, H. Deterministic K-Identification For Binary Symmetric Channel. arXiv 2023, arXiv:2305.04260. [Google Scholar]
Salariseddigh, M.J.; Jamali, V.; Pereg, U.; Boche, H.; Deppe, C.; Schober, R. Deterministic Identification For Molecular Communications Over The Poisson Channel. IEEE Trans. Mol. Biol. Multi-Scale Commun. 2023, 9, 408–424. [Google Scholar] [CrossRef]
Ahlswede, R.; Dueck, G. Identification Via Channels. IEEE Trans. Inf. Theory 1989, 35, 15–29. [Google Scholar] [CrossRef]
Kumar, S.; Marescaux, J. Telesurgery; Springer Science & Business Media: New York, NY, USA, 2008. [Google Scholar]
Spahovic, M.; Salariseddigh, M.J.; Deppe, C. Deterministic K-Identification For Slow Fading Channels. In Proceedings of the IEEE Information Theory Workshop (ITW), Saint-Malo, France, 23–28 April 2023; pp. 353–358. [Google Scholar] [CrossRef]
Salariseddigh, M.J.; Jamali, V.; Pereg, U.; Boche, H.; Deppe, C.; Schober, R. Deterministic K-Identification For MC Poisson Channel With Inter-Symbol Interference. IEEE Open J. Commun. Soc. 2024. [Google Scholar] [CrossRef]
Abu-Mostafa, Y.S. Complexity in Information Theory; Springer: New York, NY, USA, 1988. [Google Scholar]
Yao, A.C. Some Complexity Questions Related to Distributive Computing. In Proceedings of the Annual ACM Symposium on the Theory Computing, Atlanta, GA, USA, 30 April–2 May 1979; pp. 209–213. [Google Scholar]
Verdu, S.; Wei, V. Explicit Construction of Optimal Constant-Weight Codes For Identification Via Channels. IEEE Trans. Inf. Theory 1993, 39, 30–36. [Google Scholar] [CrossRef]
Günlü, O.; Kliewer, J.; Schaefer, R.F.; Sidorenko, V. Code Constructions and Bounds For Identification Via Channels. IEEE Trans. Commun. 2021, 70, 1486–1496. [Google Scholar] [CrossRef]
Ahlswede, R.; Cai, N. Identification Without Randomization. IEEE Trans. Inf. Theory 1999, 45, 2636–2642. [Google Scholar] [CrossRef]
Mehlhorn, K.; Schmidt, E.M. Las Vegas is Better Than Determinism in VLSI and Distributed Computing. In Proceedings of the 14th Annal ACM Symposium on Theory of Computation, San Francisco, CA, USA, 5–7 May 1982; pp. 330–337. [Google Scholar]
Salariseddigh, M.J.; Jamali, V.; Boche, H.; Deppe, C.; Schober, R. Deterministic Identification For MC Binomial Channel. In Proceedings of the IEEE International Symposium on Information Theory (ISIT), Taipei, Taiwan, 25–30 June 2023; pp. 448–453. [Google Scholar] [CrossRef]
Yamamoto, H.; Ueda, M. Multiple Object Identification Coding. IEEE Trans. Inf. Theory 2015, 61, 4269–4276. [Google Scholar] [CrossRef]
Kennedy, R.S. Finite-Sate Binary Symmetric Channels. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 1963. [Google Scholar]
Rudin, W. Principles of Mathematical Analysis; McGraw-Hill: New York, NY, USA, 1953. [Google Scholar]
Gilbert, E.N. A Comparison of Signalling Alphabets. Bell Syst. Tech. J. 1952, 31, 504–522. [Google Scholar] [CrossRef]
Richardson, T.; Urbanke, R. Modern Coding Theory; Cambridge University Press: New York, NY, USA, 2008. [Google Scholar]
Conway, J.H.; Sloane, N.J.A. Sphere Packings, Lattices and Groups; Springer: New York, NY, USA, 2013. [Google Scholar]
Van Lint, J.H. Introduction to Coding Theory; Springer Science & Business Media: New York, NY, USA, 1998; Volume 86. [Google Scholar]
Gilbert, E.N. Capacity of a Burst-Noise Channel. Bell Syst. Tech. J. 1960, 39, 1253–1265. [Google Scholar] [CrossRef]
Alexander, A.A.; Gryb, R.M.; Nast, D.W. Capabilities of The Telephone Network For Data Transmission. Bell Syst. Tech. J. 1960, 39, 431–476. [Google Scholar] [CrossRef]
Fontaine, A.B.; Gallager, R.G. Error Statistics and Coding For Binary Transmission Over Telephone Circuits. Proc. IRE 1961, 49, 1059–1065. [Google Scholar] [CrossRef]
Kautz, W.; Singleton, R. Nonrandom Binary Superimposed Codes. IEEE Trans. Inf. Theory 1964, 10, 363–377. [Google Scholar] [CrossRef]
Füredi, Z. On r-Cover-Free Families. J. Comb. Theory Ser. A 1996, 73, 172–173. [Google Scholar] [CrossRef]
Flum, J.; Grohe, M. Parameterized Complexity Theory; Texts in Theoretical Computer Science (An EATCS Series); Springer: New York, NY, USA, 2006. [Google Scholar]
Robbins, H. A Remark On Stirling’s Formula. Am. Math. Mon. 1955, 62, 26–29. [Google Scholar] [CrossRef]
Jeřábek, E. Dual Weak Pigeonhole Principle, Boolean Complexity, and Derandomization. Ann. Pure Appl. Log. 2004, 129, 1–37. [Google Scholar] [CrossRef]

Figure 1. Bit transition graph over a BSC. Each bit is flipped independently of other bits, with a cross-over probability of

ε \in (0, 1 / 2)

.

Figure 1. Bit transition graph over a BSC. Each bit is flipped independently of other bits, with a cross-over probability of

ε \in (0, 1 / 2)

.

Figure 2. System model for DKI communication setting over a BSC. Employing a deterministic encoder at the transmitter, the message i is mapped to the codeword

c_{i} = (c_{i, t}) |_{t = 1}^{n}

using a deterministic function. The decoder at the receiver is provided with an arbitrary goal message set K, and given the channel output

Y = (Y_{t}) |_{t = 1}^{n}

, it asks whether or not i belongs to

K

.

Figure 2. System model for DKI communication setting over a BSC. Employing a deterministic encoder at the transmitter, the message i is mapped to the codeword

c_{i} = (c_{i, t}) |_{t = 1}^{n}

using a deterministic function. The decoder at the receiver is provided with an arbitrary goal message set K, and given the channel output

Y = (Y_{t}) |_{t = 1}^{n}

, it asks whether or not i belongs to

K

.

Figure 3. A DKI configuration with

K = 4

and a goal message set

K = {2, 4, 5, 7}

is displayed. The channel’s output is located in the union of each individual decoder

T_{j}

(marked in blue) in the correct identification event, where j is a member of the goal message set. If the channel output is seen in the complement of the union of distinct decoders that the codeword’s index belongs to, a sort I error event takes place. When the transmitted codeword’s index does not not belong to

K

, and the channel output is recognized in the union of the individual decoders

T_{j}

, with

j \in K

, an error event of sort II occurs.

Figure 3. A DKI configuration with

K = 4

and a goal message set

K = {2, 4, 5, 7}

is displayed. The channel’s output is located in the union of each individual decoder

T_{j}

(marked in blue) in the correct identification event, where j is a member of the goal message set. If the channel output is seen in the complement of the union of distinct decoders that the codeword’s index belongs to, a sort I error event takes place. When the transmitted codeword’s index does not not belong to

K

, and the channel output is recognized in the union of the individual decoders

T_{j}

, with

j \in K

, an error event of sort II occurs.

Figure 4. Range of codebook sizes for various K-identification configurations. The codebook scale for DKI problem over the BSC coincide the conventional exponential behavior. But, aside from the standard exponential and double exponential code sizes [26] (RKI over DMC), a different non-standard codebook size is also observed for Gaussian channel with slow fading (GSF); namely, it grows super-exponentially in the codeword length n, i.e.,

2^{(n log n) R}

.

Figure 4. Range of codebook sizes for various K-identification configurations. The codebook scale for DKI problem over the BSC coincide the conventional exponential behavior. But, aside from the standard exponential and double exponential code sizes [26] (RKI over DMC), a different non-standard codebook size is also observed for Gaussian channel with slow fading (GSF); namely, it grows super-exponentially in the codeword length n, i.e.,

2^{(n log n) R}

.

Figure 5. Spectrum of goal message set sizes for different K-identification setups. The goal message set scale for DKI problem over the BSC grows exponentially in the codeword length. Additionally, the GSF channel represent a sub-linear scale, which is lower than the conventional exponential behavior. The scale of goal message set for the BSC is identical to its codebook scale, i.e., exponentially in the codeword length.

Figure 6. Illustration of an exhausted greedy-wise ball covering of an n-dimensional Hamming hyper ball

B_{0} (n, n A)

, where the union of the small balls of radius

r_{0} = ⌊ n β ⌋

cover a larger Hamming hyper ball. As the codewords are assigned to the center of each ball lying inside the an n-dimensional Hamming hyper ball

B_{0} (n, n A)

according to the greedy construction, the Hamming weight of a codeword is bounded by

n A

, as required.

Figure 6. Illustration of an exhausted greedy-wise ball covering of an n-dimensional Hamming hyper ball

B_{0} (n, n A)

, where the union of the small balls of radius

r_{0} = ⌊ n β ⌋

cover a larger Hamming hyper ball. As the codewords are assigned to the center of each ball lying inside the an n-dimensional Hamming hyper ball

B_{0} (n, n A)

according to the greedy construction, the Hamming weight of a codeword is bounded by

n A

, as required.

Figure 7. Depiction of the error exponent for a BSC. The tangent line of the binary entropy function

H (p)

in the cross-over probability point

0 < p = ε < 1 / 2

, calculated for

ε < p = δ < (1 - β) ε + β / 2

, marked in green, is denoted by

T_{ε} (δ)

. For a given cross-over probability

ε

, the difference between

T_{ε} (δ)

and

H (δ)

is referred to as the error exponent. For example, the upper bounds on the goal identification rate

κ

calculated in (A58) and (A59) are two different error exponents that are derived in the sort II error analysis. The minimum of these error exponents is the bottleneck for the rate

κ

, i.e., an eligible upper bound.

Figure 7. Depiction of the error exponent for a BSC. The tangent line of the binary entropy function

H (p)

in the cross-over probability point

0 < p = ε < 1 / 2

, calculated for

ε < p = δ < (1 - β) ε + β / 2

, marked in green, is denoted by

T_{ε} (δ)

. For a given cross-over probability

ε

, the difference between

T_{ε} (δ)

and

H (δ)

is referred to as the error exponent. For example, the upper bounds on the goal identification rate

κ

calculated in (A58) and (A59) are two different error exponents that are derived in the sort II error analysis. The minimum of these error exponents is the bottleneck for the rate

κ

, i.e., an eligible upper bound.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Salariseddigh, M.J.; Dabbabi, O.; Deppe, C.; Boche, H. Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results. Future Internet 2024, 16, 78. https://doi.org/10.3390/fi16030078

AMA Style

Salariseddigh MJ, Dabbabi O, Deppe C, Boche H. Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results. Future Internet. 2024; 16(3):78. https://doi.org/10.3390/fi16030078

Chicago/Turabian Style

Salariseddigh, Mohammad Javad, Ons Dabbabi, Christian Deppe, and Holger Boche. 2024. "Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results" Future Internet 16, no. 3: 78. https://doi.org/10.3390/fi16030078

APA Style

Salariseddigh, M. J., Dabbabi, O., Deppe, C., & Boche, H. (2024). Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results. Future Internet, 16(3), 78. https://doi.org/10.3390/fi16030078

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results †

Abstract

1. Introduction

1.1. Post-Shannon Communications for IoT

1.2. IoT Needs and Impact of the Deterministic K-Identification

1.3. Binary Symmetric Channel

1.4. Information Theoretical Analysis of BSC-Based IoT Systems

1.5. Applications of the K-Identification Problem for IoT

1.6. Contributions

1.7. Organization

2. Background on the Identification Problem

2.1. Identification Problem

2.2. Previous Results on DI Capacity

2.3. Previous Results on DKI Capacity

3. System Model and Preliminaries

3.1. System Model

3.2. DKI Coding for the BSC

4. DKI Capacity Region of the BSC

4.1. Main Results

4.2. Inner Bound (Achievability Proof)

4.3. Upper Bound (Converse Proof)

5. Future Directions and Summary

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A. Sort I Error Analysis

Appendix B. Sort II Error Analysis

Appendix C. Cover-Free Families

Appendix D. Lower Bound on the Volume of the Hamming Ball

Appendix E. Upper Bound on the Volume of the Hamming Ball

Appendix F. Bound on the Upper Tail of the Binomial Cumulative Distribution Function—Part 1

Appendix G. Bound on the Upper Tail of the Binomial Cumulative Distribution Function—Part 2

Appendix H. Bound on the Binomial Cumulative Distribution Function

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Deterministic K-Identification for Future Communication Networks: The Binary Symmetric Channel Results^†