Wiretap Channel with Information Embedding on Actions

Yin, Xinxing; Xue, Zhi

doi:10.3390/e16042105

by Xinxing Yin * and Zhi Xue

Electronic Engineering Department, Shanghai Jiao Tong University, Dongchuan Road 800, Shanghai 200240, China

* Author to whom correspondence should be addressed.

Entropy 2014, 16(4), 2105-2130; https://doi.org/10.3390/e16042105

Submission received: 11 November 2013 / Revised: 28 March 2014 / Accepted: 10 April 2014 / Published: 14 April 2014

Abstract: Information embedding on actions is a new channel model in which a specific decoder is used to observe the actions taken by the encoder and retrieve part of the message intended for the receiver. We revisit this model and consider a different scenario where a secrecy constraint is imposed. By adding a wiretapper in the model, we aim to send the confidential message to the receiver and keep it secret from the wiretapper as much as possible. We characterize the inner and outer bounds on the capacity-equivocation region of such a channel with noncausal (and causal) channel state information. Furthermore, the lower and upper bounds on the sum secrecy capacity are also obtained. Besides, by eliminating the specific decoder, we get a new outer bound on the capacity-equivocation region of the wiretap channel with action-dependent states and prove it is tighter than the existing outer bound. A binary example is presented to illustrate the tradeoff between the sum secrecy rate and the information embedding rate under the secrecy constraint. We find that the secrecy constraint and the communication requirements of information embedding have a negative impact on improving the secrecy transmission rate of the given communication link.

Keywords: information embedding; wiretap channel; action-dependent states; sum secrecy capacity

1. Introduction

In wireless communication systems, such as sensor networks, mobile networks and satellite communications, sensitive data is transferred through multiple hops. The nodes in the network usually need to take various types of actions to acquire the state of the network before transferring the packets. The state acquisition in the network often requires the exchange of control information, which uses physical resources within the system. For example, routers measure the network congestion levels via the transmission of probing packets; wireless transceivers evaluate the channel quality through training or feedback; and radio terminals switch among different operating modes, such as transmit, receive or idle. Based on these motivating observations, Weissman introduced channels with action-dependent states where the encoder in a point-to-point channel could take actions to affect the channel state information [1]. The capacity of such a channel where the channel inputs depended noncausally (and causally) on the channel states was determined. After Weissman’s publication, [2] investigated the channel model where both the channel encoder and decoder could probe the channel states and obtained the cost-constrained “probing capacity”. Different from this, [1–3] studied the degraded broadcast channel with causal action-dependent states where the transmitter sent two kinds of messages to two different receivers. The capacity region was derived. For the noncausal case, only inner and outer bounds on the capacity region were obtained [4]. Other extensions of the channel with action-dependent states can be seen in [5–10].

Recently, [11,12] explored information embedding on actions in the channel with noncausal action-dependent states; see Figure 1. In this new setup, an additional decoder was introduced to observe a function of the actions taken by the encoder. It tried to get part of the transmitted message. Actually, the idea of “information embedding” on actions in such a channel is related to the classical topic of information hiding (e.g., [13–17]) and could be explained by the following example. In communication networks, probing the congestion state requires sending training packets to the nearest router. Meanwhile, the router (the “recipient” of the actions) may need to obtain partial information, such as the header of the packet, to find the address of the intended receiver. Since the actions play the role of providing necessary information about the message for the router, it is natural to ask how much information could be embedded in the actions without affecting the system performance. [11] got the capacity-cost region and showed that the communication requirements of the action-cribbing decoder were generally in conflict with the goal of improving the efficiency of the communication link.

However, the above action-dependent channel models [1–12] considered no secrecy constraint, which is extremely important in communications. For instance, the broadcast nature of wireless networks gives rise to the hidden danger of information leakage to a malicious receiver when broadcasting the sensitive data and acquiring the state information. Recent works [18,19] studied the secure communication problems in channels with action-dependent states. [18] added a wiretapper to the model in [1] and got the inner and outer bounds on the capacity-equivocation region. The capacity-equivocation region is the set of all the achievable rate pairs (R, R_e), where R and R_e are the rates of the confidential message and the wiretapper’s equivocation about the message. [19] studied the effects of feedback on the secrecy capacity of the wiretap channel with action-dependent states, where the secrecy capacity is the maximum rate at which the message can be communicated in perfect secrecy.

From the perspective of secure communication, we consider a different communication model in Figure 2, i.e., the wiretap channel with information embedding on actions. In this setup, the transmitter aims to send the confidential message to the receiver and keep it secret from the wiretapper as much as possible. We use equivocation (i.e., the uncertainty about the confidential message) at the wiretapper to measure the level of information leakage. Meanwhile, like [11], a specific decoder is introduced to retrieve a portion of the confidential message (see m₁ in Figure 2). The specific decoder observes a function of the message-dependent actions, which affects the formulation of the channel states. Our work is novel in the sense that we consider the secrecy constraint in the information transmission, and we try to characterize how much information could be embedded in the actions without increasing the information leakage.

For the new channel model described above, this paper obtains the inner and outer bounds on the capacity-equivocation region of such a channel with noncausal (and causal) channel state information. Furthermore, the lower and upper bounds on the sum secrecy capacity are also attained. Through a special case where no message needs to be retrieved by the specific decoder, we get a new outer bound on the capacity-equivocation region of the wiretap channel with action-dependent states and prove that it is tighter than the existing outer bound. To illustrate the tradeoff between the sum secrecy rate and the information embedding rate under the secrecy constraint, we provide a binary example. It shows that the sum secrecy rate is reduced when the information embedding rate increases. We find that the secrecy constraint and the communication requirements of the specific decoder have a negative impact on improving the secrecy transmission rate of the given communication link.

The rest of the article is organized as follows. Section 2 describes the wiretap channel with information embedding on actions and outlines the main results. Section 3 discusses the results and presents a binary example. We conclude in Section 4 with a summary of the whole work and some future directions.

2. Channel Model and Main Results

The symbol notations and description of the channel model are presented in Subsection 2.1. Subsection 2.2 characterizes the inner and outer bounds on the capacity-equivocation region of the wiretap channel with information embedding on actions.

2.1. Symbol Notations and Channel Model

Throughout this paper, we use calligraphic letters, e.g., $\mathcal{X}$, $\mathcal{Y}$, to denote finite sets and $\|\mathcal{X}\|$ to denote the cardinality of the set $\mathcal{X}$. Uppercase letters, e.g., X, Y, are used to denote random variables taking values from finite sets, e.g., $\mathcal{X}$, $\mathcal{Y}$. The value of a random variable, X, is denoted by the lowercase letter, x. We use $Z_{i}^{j}$ to denote the (j − i + 1)-vector $(Z_{i}, Z_{i+1}, \ldots, Z_{j})$ of random variables for 1 ≤ i ≤ j and will always drop the subscript when i = 1. Moreover, we use X ~ p(x) to denote the probability mass function of the random variable, X. For X ~ p(x) and 0 ≤ ε ≤ 1, the set of the typical N-sequences x^N is defined as $T_{X}^{N}(\epsilon) = \{x^{N} : |\pi(x \mid x^{N}) - p(x)| \leq \epsilon p(x) \text{ for all } x \in \mathcal{X}\}$, where π(x|x^N) denotes the frequency of occurrences of letter x in the sequence, x^N (for more details about typical sequences, please refer to [23,24]). The set of the conditional typical sequences, e.g., $T_{Y \mid X}^{N}(\epsilon)$, follows similarly. In this paper, it is assumed that the base of the log function is two.
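The typicality condition above can be checked mechanically. The following is a minimal sketch in Python, assuming the distribution is supplied as a dictionary; the function name `is_typical` and the sample sequences are illustrative and not part of the paper.

```python
from collections import Counter

def is_typical(seq, pmf, eps):
    """Return True if seq is eps-typical with respect to pmf (dict: letter -> probability)."""
    n = len(seq)
    counts = Counter(seq)
    # A letter outside the alphabet makes the sequence atypical.
    if any(sym not in pmf for sym in counts):
        return False
    # Check |pi(x | x^N) - p(x)| <= eps * p(x) for every letter x of the alphabet.
    return all(abs(counts.get(x, 0) / n - p) <= eps * p for x, p in pmf.items())

pmf = {0: 0.75, 1: 0.25}
print(is_typical([0, 0, 0, 1, 0, 0, 1, 0], pmf, 0.1))  # empirical frequencies equal p(x)
print(is_typical([1, 1, 1, 1, 0, 0, 1, 0], pmf, 0.1))  # too many ones for this p(x)
```

Note that the bound scales with p(x), so a letter with p(x) = 0 must not occur at all in a typical sequence.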

The wiretap channel with information embedding on actions is depicted in Figure 2. We aim to send the confidential message (M₁, M₂) to the legitimate receiver through such a channel and keep it secret from the wiretapper as much as possible. Part of the message embedded in the actions needs to be retrieved by the specific decoder. We use equivocation at the wiretapper to measure the secrecy of the confidential message.

Concretely, the model of the wiretap channel with information embedding on actions is specified by $\{\mathcal{A}, \mathcal{B}, \mathcal{S}, f, p(s \mid a), \mathcal{X}, p(y, z \mid s, x), \mathcal{Y}, \mathcal{Z}\}$. To send the message (M₁, M₂), an action sequence, A^N(M₁, M₂), is first selected by the encoder. Then, the generation of the channel states, S^N, is affected by the actions, instead of by nature. The channel states, S^N, are generated through a discrete memoryless channel (DMC) $p(s^{N} \mid a^{N}(m_{1}, m_{2})) = \prod_{i=1}^{N} p(s_{i} \mid a_{i})$. The stochastic channel encoder, ϕ, is specified by a matrix of conditional probability distributions, ϕ(x^N|m₁, m₂, s^N). Note that $\sum_{x^{N}} \phi(x^{N} \mid m_{1}, m_{2}, s^{N}) = 1$, and ϕ(x^N|m₁, m₂, s^N) is the probability that the message (m₁, m₂) and the state sequence, s^N, are encoded as the channel input, x^N. When the state sequence, s^N, is known causally by the channel encoder, the channel encoder at time i is specified by ϕ_i(x_i|m₁, m₂, sⁱ), where x_i is the output of the channel encoder at time i and sⁱ = (s₁, s₂, …, s_i) is the sequence of channel states up to time i. When the channel encoder knows the state, s^N, in a noncausal manner, the channel encoder at time i is specified by ϕ_i(x_i|m₁, m₂, s^N).

The main channel is a DMC with discrete input alphabet $\mathcal{X} \times \mathcal{S}$ and output alphabet $\mathcal{Y}$. The channel is memoryless in the sense that $p(y^{N} \mid x^{N}, s^{N}) = \prod_{i=1}^{N} p(y_{i} \mid x_{i}, s_{i})$, where $y^{N} \in \mathcal{Y}^{N}$, $x^{N} \in \mathcal{X}^{N}$ and $s^{N} \in \mathcal{S}^{N}$. Decoder 1 observes the signal B^N as a deterministic function of the actions, A^N, i.e., B^N = f(A^N). It estimates part of the transmitted message. Decoder 1 is specified by $\psi_{1}: \mathcal{B}^{N} \to \mathcal{M}_{1}$. The output of Decoder 1 is $\hat{M}_{1}$. The probability of error of Decoder 1 is defined as $P_{e1} = \Pr\{\hat{M}_{1} \neq M_{1}\}$. The legitimate receiver decodes the message $(\hat{\hat{M}}_{1}, \hat{\hat{M}}_{2})$ by Decoder 2 (see Figure 2). Decoder 2 is specified by $\psi_{2}: \mathcal{Y}^{N} \to \mathcal{M}_{1} \times \mathcal{M}_{2}$. The probability of error of Decoder 2 is defined as $P_{e2} = \Pr\{(\hat{\hat{M}}_{1}, \hat{\hat{M}}_{2}) \neq (M_{1}, M_{2})\}$. The wiretap channel is also a DMC with transition probability $p(z^{N} \mid y^{N}) = \prod_{i=1}^{N} p(z_{i} \mid y_{i})$, where $z^{N} \in \mathcal{Z}^{N}$ is the observation of the wiretapper. The uncertainty of the message for the wiretapper is measured by $\lim_{N \to \infty} \Delta = \lim_{N \to \infty} \frac{H(M_{1}, M_{2} \mid Z^{N})}{N}$. In our model, the wiretap channel is assumed to be degraded from the main channel, i.e., X → Y → Z form a Markov chain.

Then, we give the definition of “achievable” and “sum secrecy capacity” as follows.

Definition 1

A rate triple (R₁, R₂, R_e) is said to be achievable for the model in Figure 2 if there exists a channel encoder-decoder, such that:

$\lim_{N \to \infty} \frac{\log \|\mathcal{M}_{1}\|}{N} = R_{1}$

(1)

$\lim_{N \to \infty} \frac{\log \|\mathcal{M}_{2}\|}{N} = R_{2}$

(2)

$\lim_{N \to \infty} \frac{H(M_{1}, M_{2} \mid Z^{N})}{N} \geq R_{e}$

(3)

$P_{e1} \leq \epsilon, \quad P_{e2} \leq \epsilon$

(4)

where ε is an arbitrarily small positive real number, (R₁, R₂) are the rates of the message (M₁, M₂) and R_e is the rate of equivocation. The capacity-equivocation region is defined as the convex closure of all achievable rate triples (R₁, R₂, R_e).
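To make the equivocation term in Equation (3) concrete, the sketch below evaluates H(M | Z^N)/N by brute-force enumeration for a toy code. It is only an illustration under simplifying assumptions that are not in the paper: the message is uniform, there are no states or actions, and the wiretapper sees the codeword directly through a BSC with crossover probability q. The function name `equivocation_rate` is hypothetical.

```python
import itertools
import math

def equivocation_rate(codebook, q):
    """H(M | Z^N) / N for a uniform message over `codebook` (equal-length binary tuples),
    assuming the wiretapper observes the codeword through a BSC(q)."""
    n = len(codebook[0])
    p_m = 1 / len(codebook)
    h_cond = 0.0
    for z in itertools.product((0, 1), repeat=n):
        # Joint probabilities p(m, z^N) for every message m.
        joint = []
        for c in codebook:
            d = sum(ci != zi for ci, zi in zip(c, z))  # Hamming distance to z
            joint.append(p_m * q ** d * (1 - q) ** (n - d))
        p_z = sum(joint)
        # Accumulate -sum p(m, z) log2 p(m | z).
        h_cond -= sum(pj * math.log2(pj / p_z) for pj in joint if pj > 0)
    return h_cond / n

repetition = [(0, 0, 0), (1, 1, 1)]
for q in (0.0, 0.1, 0.3, 0.5):
    print(q, equivocation_rate(repetition, q))
```

For this toy code the equivocation grows with q and reaches H(M)/N when q = 0.5, i.e., when the wiretapper's observation is independent of the message.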

Definition 2

The sum secrecy capacity is the maximum rate at which the confidential message can be sent to the receiver in perfect secrecy. The sum secrecy capacity:

$C_{s} = \max_{(R_{1}, R_{2}, R_{e} = R_{1} + R_{2}) \in \mathcal{R}} (R_{1} + R_{2})$

(5)

where $\mathcal{R}$ is the capacity-equivocation region.

Based on the definition in Equation (5), the sum secrecy capacity for the model in Figure 2 with noncausal action-dependent states is $C_{sn} = \max_{(R_{1}, R_{2}, R_{e} = R_{1} + R_{2}) \in \mathcal{R}_{n}} (R_{1} + R_{2})$, where $\mathcal{R}_{n}$ is the capacity-equivocation region of the noncausal case. Similarly, we can define the sum secrecy capacity of the causal case, $C_{sc}$.

2.2. Main Results

In this subsection, four theorems are presented. Theorems 1 and 2 give the inner and outer bounds on the capacity-equivocation region for the channel model in Figure 2 with noncausal action-dependent states. For the causal case, the inner and outer bounds are characterized in Theorems 3 and 4.

Theorem 1

An achievable rate-equivocation region of the wiretap channel with information embedding on actions when the states are noncausally known to the channel encoder is the set:

$\mathcal{R}_{in} = \left\{ (R_{1}, R_{2}, R_{e}) : \begin{array}{lr} R_{1} \leq H(B), & (6) \\ R_{1} + R_{2} \leq I(U; Y) - I(U; S \mid A), & (7) \\ R_{e} \leq R_{1} + R_{2}, & (8) \\ R_{e} \leq I(U; Y) - \max\{I(U; Z), I(U; S \mid A)\}, & (9) \\ R_{e} \leq H(A \mid Z) & (10) \end{array} \right\}$

where the joint distribution is $p(a, b, u, x, s, y, z) = p(z \mid y) p(y \mid x, s) p(x \mid u, s) p(s \mid u, a) p(u \mid a) p(a) 1_{\{b = f(a)\}}$, which indicates that (A, B, U) → (X, S) → Y → Z form a Markov chain.

Theorem 2

An outer bound on the capacity-equivocation region of the wiretap channel with information embedding on actions when the states are noncausally known to the channel encoder is the set:

$\mathcal{R}_{on} = \left\{ (R_{1}, R_{2}, R_{e}) : \begin{array}{lr} R_{1} \leq H(B), & (11) \\ R_{1} + R_{2} \leq I(U; Y) - I(U; S \mid A), & (12) \\ R_{e} \leq R_{1} + R_{2}, & (13) \\ R_{e} \leq I(U; Y) - I(V; Z \mid Q) - I(U; S \mid V) & (14) \end{array} \right\}$

where the joint distribution is $p(a, b, u, x, s, y, z) = p(z \mid y) p(y \mid x, s) p(x \mid u, s) p(q \mid v) p(v \mid u) p(a, u, s) 1_{\{b = f(a)\}}$, which indicates that (B, A, U, V, Q) → (X, S) → Y → Z and Q → V → U → Y → Z form Markov chains.

Comments

Theorems 1 and 2 are proven in Appendix A.
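To exhaust $\mathcal{R}_{in}$ and $\mathcal{R}_{on}$, it is enough to restrict $\mathcal{U}$, $\mathcal{Q}$ and $\mathcal{V}$ to satisfy:

$\begin{array}{l} \|\mathcal{U}\| \leq \|\mathcal{A}\| \|\mathcal{X}\| \|\mathcal{S}\| + 2 \\ \|\mathcal{Q}\| \leq \|\mathcal{A}\| \|\mathcal{X}\| \|\mathcal{S}\| \\ \|\mathcal{V}\| \leq \|\mathcal{A}\| \|\mathcal{X}\| \|\mathcal{S}\| (\|\mathcal{A}\| \|\mathcal{X}\| \|\mathcal{S}\| + 1) \end{array}$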

This can be easily proven by using the support lemma (see p. 310 in [25]).

Theorem 3

An achievable rate-equivocation region of the wiretap channel with information embedding on actions when the states are causally known to the channel encoder is the set:

$\mathcal{R}_{ic} = \left\{ (R_{1}, R_{2}, R_{e}) : \begin{array}{l} R_{1} \leq H(B) \\ R_{1} + R_{2} \leq I(U; Y) \\ R_{e} \leq R_{1} + R_{2} \\ R_{e} \leq I(U; Y) - I(U; Z) \\ R_{e} \leq H(A \mid Z) \end{array} \right\}$

(15)

where the joint distribution is $p(a, b, u, x, s, y, z) = p(z \mid y) p(y \mid x, s) p(x \mid u, s) p(s \mid a) p(u \mid a) p(a) 1_{\{b = f(a)\}}$, which indicates that (A, B, U) → (X, S) → Y → Z and U → A → S form Markov chains.

Theorem 4

An outer bound on the capacity-equivocation region of the wiretap channel with information embedding on actions when the states are causally known to the channel encoder is the set:

$\mathcal{R}_{oc} = \left\{ (R_{1}, R_{2}, R_{e}) : \begin{array}{lr} R_{1} \leq H(B), & (16) \\ R_{1} + R_{2} \leq I(U; Y), & (17) \\ R_{e} \leq R_{1} + R_{2}, & (18) \\ R_{e} \leq I(U; Y) - I(V; Z \mid Q) & (19) \end{array} \right\}$

where the joint distribution is $p(a, b, u, x, s, y, z) = p(z \mid y) p(y \mid x, s) p(x \mid u, s) p(q \mid v) p(v \mid u) p(a, u, s) 1_{\{b = f(a)\}}$, which indicates that (B, A, U, V, Q) → (X, S) → Y → Z and Q → V → U → Y → Z form Markov chains.

Comments

Theorems 3 and 4 are proven in Appendix B.
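To exhaust $\mathcal{R}_{ic}$ and $\mathcal{R}_{oc}$, it is enough to restrict $\mathcal{U}$, $\mathcal{Q}$ and $\mathcal{V}$ to satisfy:

$\begin{array}{l} \|\mathcal{U}\| \leq \|\mathcal{A}\| \|\mathcal{X}\| \|\mathcal{S}\| + 1 \\ \|\mathcal{Q}\| \leq \|\mathcal{A}\| \|\mathcal{X}\| \|\mathcal{S}\| \\ \|\mathcal{V}\| \leq (\|\mathcal{A}\| \|\mathcal{X}\| \|\mathcal{S}\|)^{2} \end{array}$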

This can be easily proven by using the support lemma (see p. 310 in [25]).

Further discussion about the theorems and the comparison with other existing results are given in Section 3.

3. Discussion and Example

In this section, we first calculate the sum secrecy capacities of the noncausal and causal cases. Then, we compare our results with some existing results and present a binary example to illustrate the tradeoff between the sum secrecy rate and the information embedding rate under the secrecy constraint.

3.1. Discussion

Corollary 1

The lower and upper bounds on the sum secrecy capacity of the model in Figure 2 with noncausal action-dependent states are:

$C_{ln} = \max_{p(x \mid u, s) p(u \mid s, a) p(a)} \min\{I(U; Y) - \max\{I(U; Z), I(U; S \mid A)\}, H(A \mid Z)\}$

(20)

and:

$C_{un} = \max_{p(x \mid u, s) p(q \mid v) p(v \mid u) p(u \mid s, a) p(a)} \min\{I(U; Y) - I(U; S \mid A), I(U; Y) - I(V; Z \mid Q) - I(U; S \mid V)\}$

(21)

respectively.

Proof

According to the definition in Formula (5), $\mathcal{R}_{in}$ and $\mathcal{R}_{on}$, we can easily get Equations (20) and (21).

Similarly, we have the following corollary for the causal case.

Corollary 2

The lower and upper bounds on the sum secrecy capacity of the model in Figure 2 with causal action-dependent states are:

$C_{lc} = \max_{p(u, a) p(x \mid u, s)} \min\{I(U; Y) - I(U; Z), H(A \mid Z)\}$

(22)

and:

$C_{uc} = \max_{p(x \mid s, u) p(q \mid v) p(v \mid u) p(u, a) p(a)} \left[ I(U; Y) - I(V; Z \mid Q) \right]$

(23)

respectively.

Proof

According to the definition in Formula (5), $\mathcal{R}_{ic}$ and $\mathcal{R}_{oc}$, we can easily get Equations (22) and (23).

According to [11], the capacity region of the model in Figure 1 (without the secrecy constraint) is:

$\mathcal{R}_{E} = \left\{ (R_{1}, R_{2}) : \begin{array}{lr} R_{1} \leq H(B), & (24) \\ R_{1} + R_{2} \leq I(U; Y) - I(U; S \mid A) & (25) \end{array} \right\}$

Then, the corresponding capacity is:

$C_{E} = \max_{p(u, a, x, s)} \left[ I(U; Y) - I(U; S \mid A) \right]$

(26)

Comparing Formula (26) with (20), we get $C_{ln} \leq C_{E}$, because $I(U; Y) - \max\{I(U; Z), I(U; S \mid A)\} \leq I(U; Y) - I(U; S \mid A)$ for every distribution. This implies that the secrecy constraint reduces the transmission rate of the communication link. Therefore, once the problem of information leakage is considered, the system designer has to trade off between the transmission rate and data security. Moreover, without the secrecy constraint, we have the following corollary.
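The comparison between Formulas (20) and (26) can be illustrated pointwise. The sketch below evaluates the two objectives (the quantities inside the maximizations) for a few made-up values of the information terms; the numbers and function names are purely hypothetical and do not come from the paper.

```python
def secrecy_objective(i_uy, i_uz, i_us_a, h_a_z):
    """Objective inside the maximization of Formula (20)."""
    return min(i_uy - max(i_uz, i_us_a), h_a_z)

def no_secrecy_objective(i_uy, i_us_a):
    """Objective inside the maximization of Formula (26)."""
    return i_uy - i_us_a

# Hypothetical values (in bits) of I(U;Y), I(U;Z), I(U;S|A), H(A|Z).
samples = [
    (0.8, 0.3, 0.1, 0.9),
    (0.8, 0.05, 0.2, 0.4),
    (0.6, 0.6, 0.0, 1.0),
]
for i_uy, i_uz, i_us_a, h_a_z in samples:
    with_secrecy = secrecy_objective(i_uy, i_uz, i_us_a, h_a_z)
    without_secrecy = no_secrecy_objective(i_uy, i_us_a)
    print(with_secrecy, without_secrecy, with_secrecy <= without_secrecy)
```

Since the inequality holds for every choice of the information terms, it also holds after maximizing over the input distributions.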

Corollary 3

Without considering the secrecy constraint (i.e., ignoring R_e in $\mathcal{R}_{in}$), we arrive at the results in [11].

Proof

This corollary is verified by setting R_e = 0 in $\mathcal{R}_{in}$.

When no message needs to be embedded in the actions, the model in Figure 2 reduces to the wiretap channel with action-dependent states [18]; see Figure 3. [18] gave an upper bound on the secrecy capacity of the model with noncausal states as:

$C_{dain} = \max_{p(u, v, q, a, x, s)} \min\{I(U; Y) - I(U; S \mid A), I(U; Y) - I(V; Z \mid Q)\}$

(27)

Substituting R₁ = 0 into $\mathcal{R}_{on}$, we get a new upper bound on the secrecy capacity of the model in Figure 3 with noncausal states. This new upper bound is:

$\begin{array}{rl} C_{un}^{'} & = \max_{(R, R_{e} = R) \in \mathcal{R}_{nd}} R \\ & = \max_{p(u, v, q, a, x, s)} \min\{I(U; Y) - I(U; S \mid A), I(U; Y) - I(V; Z \mid Q) - I(U; S \mid V)\} \end{array}$

(28)

where $\mathcal{R}_{nd}$ is the capacity-equivocation region of the model with noncausal states. Note that the difference between Equations (28) and (27) is the extra term, I(U; S|V). The cause of the emergence of this term is stated in detail at the end of Appendix A. Then, we give the following corollary.

Corollary 4

For the wiretap channel with noncausal action-dependent states shown in Figure 3, the new upper bound on the secrecy capacity satisfies $C_{un}^{'} \leq C_{dain}$.

Proof

From Formulas (27) and (28), the difference between $C_{un}^{'}$ and $C_{dain}$ is:

$\begin{array}{rl} \Lambda_{1} & = C_{dain} - C_{un}^{'} \\ & = \max \min\{I(U; Y) - I(U; S \mid A), I(U; Y) - I(V; Z \mid Q)\} \\ & \quad - \max \min\{I(U; Y) - I(U; S \mid A), I(U; Y) - I(V; Z \mid Q) - I(U; S \mid V)\}. \end{array}$

Since $I(U; S \mid V) \geq 0$, for every joint distribution the second term inside the minimum in (28) is no larger than the corresponding term in (27), so the maximum in (28) cannot exceed the maximum in (27). Hence Λ₁ ≥ 0, i.e., our new upper bound satisfies $C_{un}^{'} \leq C_{dain}$. Note that it is always desirable to find a smaller upper bound to approach the secrecy capacity.

Similarly, substituting R₁ = 0 into $\mathcal{R}_{oc}$, we can get an upper bound on the secrecy capacity of the model in Figure 3 with causal states, which is:

$\begin{array}{rl} C_{uc}^{'} & = \max_{(R, R_{e} = R) \in \mathcal{R}_{cd}} R \\ & = \max_{p(x \mid s, u) p(u, v, q, a, s)} \left[ I(U; Y) - I(V; Z \mid Q) \right] \end{array}$

(29)

where $\mathcal{R}_{cd}$ is the capacity-equivocation region of the model with causal states. $C_{uc}^{'}$ coincides with the upper bound on the secrecy capacity of the model in [18] with causal states.

Then, we study a special channel model where the “action” is removed, i.e., the wiretap channel with noncausal channel state information. In this model, the channel state is generated by nature. This model is a special case of the wiretap channel with information embedding on actions by eliminating the action encoder and the mapping, f. It is also a special case of the model in Figure 3 without action. Setting the random variable, A, in Equation (28) to be a constant, we get a new outer bound of the wiretap channel with noncausal channel state information as:

$C_{un}^{''} = \max_{p(u, v, q, x, s)} \min\{I(U; Y) - I(U; S), I(U; Y) - I(V; Z \mid Q) - I(U; S \mid V)\}.$

(30)

The outer bound in [20] was derived as $C_{dain}^{'} = \max_{p(u, v, q, x, s)} \min\{I(U; Y) - I(U; S), I(U; Y) - I(V; Z \mid Q)\}$. Comparing the two bounds, we see that $C_{un}^{''} \leq C_{dain}^{'}$. This is stated in the following corollary.

Corollary 5

For the wiretap channel with noncausal channel state information [20], the new upper bound on the secrecy capacity satisfies $C_{un}^{''} \leq C_{dain}^{'}$.

In addition, in the case of no actions, our model turns into a special case that was also studied in [21]. The comparison between our results and those in [21] for this case is stated as follows.

The achievable rate-equivocation region of the special case obtained from [21] is contained in that obtained from our results.
We also provide an outer bound for this special case.

[21] obtained an achievable rate region for the broadcast wiretap channel with the asymmetric side information, which was:

$\mathcal{R}_{I} = \left\{ (R_{1}, R_{2}) : \begin{array}{lr} R_{1} \leq I(U_{1}; Y_{1}, S_{1}) - \max(I(U_{1}; Z), I(U_{1}; S_{1}, S_{2})), & (31) \\ R_{2} \leq I(U_{2}; Y_{2}, S_{2}) - \max(I(U_{2}; Z), I(U_{2}; S_{1}, S_{2})), & (32) \\ R_{1} + R_{2} \leq I(U_{1}; Y_{1}, S_{1}) + I(U_{2}; Y_{2}, S_{2}) - I(U_{1}; U_{2}) - \max(I(U_{1}, U_{2}; Z), I(U_{1}, U_{2}; S_{1}, S_{2})) & (33) \end{array} \right\}$

We first give an equivalent expression of the achievable rate region $\mathcal{R}_{I}$ as follows.

$\mathcal{R}_{I}^{'} = \left\{ (R_{1}, R_{2}, R_{e1}, R_{e2}) : \begin{array}{lr} R_{1} \leq I(U_{1}; Y_{1}, S_{1}) - \max(I(U_{1}; Z), I(U_{1}; S_{1}, S_{2})), & (34) \\ R_{2} \leq I(U_{2}; Y_{2}, S_{2}) - \max(I(U_{2}; Z), I(U_{2}; S_{1}, S_{2})), & (35) \\ R_{1} + R_{2} \leq I(U_{1}; Y_{1}, S_{1}) + I(U_{2}; Y_{2}, S_{2}) - I(U_{1}; U_{2}) - \max(I(U_{1}, U_{2}; Z), I(U_{1}, U_{2}; S_{1}, S_{2})), & (36) \\ R_{e1} \leq R_{1}, & (37) \\ R_{e2} \leq R_{2}, & (38) \\ R_{e1} + R_{e2} \leq R_{1} + R_{2} & (39) \end{array} \right\}$

where $R_{e1}$ and $R_{e2}$ are the equivocation rates of the messages, m₁ and m₂, respectively. We can easily prove that $\mathcal{R}_{I}^{'}$ is equivalent to $\mathcal{R}_{I}$ by replacing Formulas (9), (10) and (11) in [21] accordingly by:

$\begin{array}{l} \lim_{N \to \infty} \frac{H(m_{1} \mid Z^{N})}{N} \geq R_{e1} - \epsilon \\ \lim_{N \to \infty} \frac{H(m_{2} \mid Z^{N})}{N} \geq R_{e2} - \epsilon \\ \lim_{N \to \infty} \frac{H(m_{1}, m_{2} \mid Z^{N})}{N} \geq R_{e1} + R_{e2} - \epsilon \end{array}$

Since information is embedded on the actions, the information embedding rate R₁ = 0 when no actions are imposed in our model. At the same time, the specific decoder (Decoder 1) is no longer needed. For this special case, we get its achievable rate-equivocation region from our results (by setting R₁ = 0 and A = const in $\mathcal{R}_{in}$) as:

$\mathcal{R}_{special} = \left\{ (R_{2}, R_{e}) : \begin{array}{lr} R_{2} \leq I(U; Y) - I(U; S), & (40) \\ R_{e} \leq R_{2}, & (41) \\ R_{e} \leq I(U; Y) - \max\{I(U; Z), I(U; S)\} & (42) \end{array} \right\}$

As stated in [21], by removing the receiver ($Y_{1} = \emptyset$) and the side information ($S_{2} = \emptyset$) in the model considered in [21], we also arrive at the special case. Removing R₁, $R_{e1}$, Y₁, U₁ and S₂ in $\mathcal{R}_{I}^{'}$, one has an achievable rate-equivocation region as:

$\mathcal{R}_{special}^{'} = \left\{ (R_{2}, R_{e2}) : \begin{array}{lr} R_{2} \leq I(U_{2}; Y_{2}) - \max(I(U_{2}; Z), I(U_{2}; S_{1})), & (43) \\ R_{e2} \leq R_{2} & (44) \end{array} \right\}$

For simplicity, we replace U₂, S₁, $R_{e2}$ and Y₂ by U, S, R_e and Y, respectively. We then show that $\mathcal{R}_{special}^{'} \subseteq \mathcal{R}_{special}$. For any rate pair $(R_{2}, R_{e}) \in \mathcal{R}_{special}^{'}$, from Equation (43),

$\begin{array}{rl} R_{2} & \leq I(U; Y) - \max(I(U; Z), I(U; S)) \\ & \leq I(U; Y) - I(U; S) \end{array}$

(45)

From Equations (43) and (44),

$\begin{array}{rl} R_{e} & \leq R_{2} \\ & \leq I(U; Y) - \max(I(U; Z), I(U; S)) \end{array}$

(46)

Therefore, $(R_{2}, R_{e}) \in \mathcal{R}_{special}$. This verifies $\mathcal{R}_{special}^{'} \subseteq \mathcal{R}_{special}$.

Moreover, we get an outer bound for the special case. The outer bound is:

$\mathcal{R}_{outer} = \left\{ (R_{2}, R_{e}) : \begin{array}{l} R_{2} \leq I(U; Y) - I(U; S) \\ R_{e} \leq R_{2} \\ R_{e} \leq I(U; Y) - I(V; Z \mid Q) - I(U; S \mid V) \end{array} \right\}$

It can be obtained directly from $\mathcal{R}_{on}$ by setting R₁ = 0 and A = const. Note that [21] did not provide an outer bound.

3.2. A Binary Example

We give an example of a binary symmetric channel with causal channel states. The channel model is shown in Figure 4. Let the main channel be a binary symmetric channel (BSC). Its crossover probability is affected by the channel states. The wiretap channel is also assumed to be a BSC with crossover probability q. More precisely, define:

$p(y \mid x, s = i) = \begin{cases} (1 - p)(1 - i) + p i, & \text{if } y = x \\ (1 - p) i + p (1 - i), & \text{otherwise} \end{cases}$

(47)

and:

$p(z \mid y) = \begin{cases} 1 - q, & \text{if } z = y \\ q, & \text{otherwise} \end{cases}$

(48)

where i ∈ {0, 1}, 0 ≤ p ≤ 1 and 0 ≤ q ≤ 1.

It is assumed that the channel from the action to the channel states is a BSC with crossover probability equal to α, where 0 ≤ α ≤ 1. In this example, the parameter, α, is fixed as 0.2 (other values of α could also be assumed). Similar to the arguments in [1,18,19], the maximum values of H(A|Z), I(U; Y) and I(U; Y) − I(U; Z) are achieved when $g_{1}: \mathcal{U} \to \mathcal{A}$ and $g_{2}: \mathcal{U} \times \mathcal{S} \to \mathcal{X}$ are deterministic mappings. We choose g₁ and g₂ as:

$\begin{array}{l} g_{1}(u = i) = i \\ g_{2}(u = i, s = j) = i + j \pmod{2} \end{array}$

where i, j ∈ {0, 1}. Let B ~ Bernoulli(β), where 0 ≤ β ≤ 1. Let the function, f, be a one-to-one mapping for simplicity. Here, we set:

$f : \begin{cases} 0 \to 0 \\ 1 \to 1 \end{cases}$

From the above conditions, we see that the random variables, B, A and U, share the same distribution. Then, the joint distribution $p(a, b, s, u, x, y, z) = p(z \mid y) p(y \mid x, s) p(x \mid u, s) p(s \mid a) p(a \mid u) p(u) 1_{\{b = f(a)\}}$ can be calculated. The joint distributions $p(u, y) = \sum_{a, b, s, x, z} p(a, b, s, u, x, y, z)$ and $p(u, z) = \sum_{a, b, s, x, y} p(a, b, s, u, x, y, z)$ can also be obtained. By some mathematical calculation and Theorem 3, we can get the maximum sum secrecy rate of the example with causal channel states for given p, q as:
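The joint distribution of the binary example can also be built by direct enumeration, which gives a way to evaluate I(U; Y), I(U; Z) and H(A|Z) numerically. The following Python sketch does this under the choices stated above (g₁(u) = u, g₂(u, s) = u + s mod 2, f the identity, so b = a is omitted); the helper names and the parameter values in the usage lines are illustrative assumptions, and the sketch is not claimed to reproduce the closed-form expressions of the paper.

```python
import itertools
import math

def joint(beta, alpha, p, q):
    """Return {(u, a, s, x, y, z): probability} for the binary example."""
    dist = {}
    for u, s, y, z in itertools.product((0, 1), repeat=4):
        a = u              # g1(u) = u
        x = (u + s) % 2    # g2(u, s) = u + s (mod 2)
        p_u = beta if u == 1 else 1 - beta
        p_s = 1 - alpha if s == a else alpha       # BSC(alpha) from action to state
        same = (1 - p) * (1 - s) + p * s           # Pr{y = x | s}, Eq. (47)
        p_y = same if y == x else 1 - same
        p_z = 1 - q if z == y else q               # Eq. (48)
        dist[(u, a, s, x, y, z)] = p_u * p_s * p_y * p_z
    return dist

def marginal(dist, idx):
    """Marginalize onto the coordinates listed in idx."""
    out = {}
    for key, prob in dist.items():
        sub = tuple(key[i] for i in idx)
        out[sub] = out.get(sub, 0.0) + prob
    return out

def entropy(pmf):
    return -sum(v * math.log2(v) for v in pmf.values() if v > 0)

def mutual_information(dist, i, j):
    return (entropy(marginal(dist, [i])) + entropy(marginal(dist, [j]))
            - entropy(marginal(dist, [i, j])))

# Coordinates: 0 = u, 1 = a, 2 = s, 3 = x, 4 = y, 5 = z.
d = joint(beta=0.5, alpha=0.2, p=0.1, q=0.1)
i_uy = mutual_information(d, 0, 4)
i_uz = mutual_information(d, 0, 5)
h_a_given_z = entropy(marginal(d, [1, 5])) - entropy(marginal(d, [5]))
print(i_uy, i_uz, h_a_given_z)
```

Because Z depends on the other variables only through Y, the enumeration always yields I(U; Z) ≤ I(U; Y), in line with the degradedness assumption.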

$R_{1} + R_{2} = \max_{0 \leq \beta \leq 1} \left\{ \frac{5}{2} [h(q * (\beta * p)) - h(\beta * p)] - \frac{3}{2} [h(p * q) - h(p)] \right\}$

(49)

under the constraint H(B) = h(β) ≥ R₁ and the secrecy constraint:

$R_{e} \leq \min\left\{ \begin{array}{l} \max_{0 \leq \beta \leq 1} \left\{ h(p * q) - h(p), \frac{5}{2} [h(q * (\beta * p)) - h(\beta * p)] - \frac{3}{2} [h(p * (0.2 * q)) - h(0.2 * q)] \right\}, \\ \max_{0 \leq \beta \leq 1} \left\{ h(\beta) - 1 + h(q * (\beta * p)), 1 - \frac{5}{2} [h(q * (\beta * p)) - h(\beta * p)] \right\} \end{array} \right\}$

(50)

where p*q = p+q−2pq and h(p) is the binary entropy function, i.e., h(p) = −p log p−(1−p) log(1−p).
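The two operations used in Equations (49) and (50) are straightforward to compute. A minimal Python sketch follows; the function names `h` and `conv` are illustrative. The binary convolution p*q is the overall crossover probability of two cascaded binary symmetric channels with crossover probabilities p and q.

```python
import math

def h(p):
    """Binary entropy function in bits, with h(0) = h(1) = 0."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def conv(p, q):
    """Binary convolution p * q = p + q - 2pq."""
    return p + q - 2 * p * q

print(h(0.5))            # entropy of a fair bit
print(conv(0.1, 0.2))    # crossover probability of BSC(0.1) followed by BSC(0.2)
print(h(conv(0.1, 0.2)) - h(0.1))
```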

The tradeoff between the sum secrecy rate and the information embedding rate under the secrecy constraint is shown in Figure 5. It can be seen that the sum secrecy rate is reduced when the equivocation rate, R_e, increases. In practical communication systems involving security, we always desire a bigger secure transmission rate when the extent of information leakage is at a reasonable level. Moreover, it can be seen that when the information embedding rate, R₁, goes up, the sum secrecy rate also decreases. This tells us that the communication requirements of Decoder 1 have a negative impact on improving the secrecy transmission rate of the given communication link.

4. Conclusions

This paper studies the wiretap channel with information embedding on actions. In this extended setup, the confidential message needs to be decoded only by the receiver and kept secret from the wiretapper as much as possible. Meanwhile, a specific decoder is introduced in the model to observe a function of the actions and wishes to decode part of the transmitted message. Our channel model is actually an extension of Ahmadi’s channel with information embedding [11] by considering the secrecy constraint. We get the inner and outer bounds on the capacity-equivocation region of such a channel with noncausal (and causal) channel states. The corresponding lower and upper bounds on the sum secrecy capacity are also obtained. Besides, through a special case, we get a new upper bound on the secrecy capacity of the wiretap channel with action-dependent states and show that it is tighter than the upper bound obtained in [18]. We also discuss the tradeoff between the sum secrecy rate and the information embedding rate under the secrecy constraint.

Some potential directions that are worthy of being explored are listed as follows.

In practical applications, the wiretapper may also wish to eavesdrop on the embedded information. In the example of communication networks, the information embedded in the packet for the next router may also be of interest to the eavesdropper. Our current setting does not consider the confidentiality of m₁ and m₂ separately, so this problem will be further explored.
Only inner and outer bounds on the capacity-equivocation region are obtained at present. We can try to find some special cases where the two bounds match. Moreover, if there exists a channel between A^N and B^N instead of a function, what will the capacity-equivocation region be?
Adaptive action means that the action sequence is generated by the message and the previous channel states, i.e., a_i(m, sⁱ⁻¹). Adaptive action is widely used in many applications, such as information hiding, digital watermarking and data storage in memory. It is valuable to study adaptive action in our model. It is already known from [10] that adaptive action is not useful in increasing the point-to-point channel capacity. We will study whether it influences the sum secrecy capacity of our channel model under the secrecy constraint.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grant No. 61171173, 60932003 and 61271220. The authors also would like to thank the anonymous reviewers for helpful comments.

Appendix

A. Proof of Theorems 1 and 2

In this section, Theorems 1 and 2 are proven. To prove Theorem 1, the methods in [18] are utilized, and we present a coding scheme for the model in Figure 2 with noncausal action-dependent states in Subsection A.1. The proof of Theorem 2 is given in Subsection A.2.

A.1. Proof of Theorem 1

We need to prove that any rate-equivocation triple (R₁, R₂, R_e) ∈ $ℛ_{in}$ is achievable. Similar to [18], two cases are considered. Since the channel states are noncausally known to the channel encoder, Gel’fand and Pinsker’s coding technique [22] will be used in the encoding process.

A.1.1. H(A|Z) ≥ I(U; Y ) − max{I(U; Z), I(U; S|A)}

In this case, we need to prove that any rate-equivocation triple (R₁, R₂, R_e) satisfying the following constraints is achievable.

\begin{array}{l} R_{1} \leq H (B) \\ R_{1} + R_{2} \leq I (U; Y) - I (U; S ∣ A) \\ R_{e} \leq R_{1} + R_{2} \\ R_{e} \leq I (U; Y) - max {I (U; Z), I (U; S ∣ A)} \end{array}

It is sufficient to show that the rate triples (R₁, R₂, R_e = I(U; Y ) − max{I(U;Z), I(U; S|A)}) are achievable. The coding scheme includes codebook generation, encoding and decoding. Then we give the equivocation analysis.

Codebook generation and encoding

Let R₁ = H(B) − τ₁ and R₁ + R₂ = I(U; Y) − I(U; S|A) − τ₂, where τ₁, τ₂ are fixed positive numbers. Since R_e ≤ R₁ + R₂, it is easy to get τ₂ ≤ max{I(U; S|A), I(U; Z)} − I(U; S|A). For each $m_{1} \in \{1, 2, \dots, 2^{N R_{1}}\}$, an independent and identically distributed (i.i.d.) codeword, b^N(m₁), is generated according to $p (b^{N}) = \prod_{i = 1}^{N} p (b_{i})$. Then, $2^{N R_{2}}$ action sequences a^N(m₁, m₂) are generated i.i.d. for each b^N(m₁) according to $p (a^{N} (m_{1}, m_{2}) ∣ b^{N} (m_{1})) = \prod_{i = 1}^{N} p (a_{i} ∣ b_{i})$, where $m_{2} \in \{1, 2, \dots, 2^{N R_{2}}\}$. For each a^N(m₁, m₂), we generate $‖ T ‖ = 2^{N (I (U; Y) - R_{1} - R_{2} - ɛ)}$ i.i.d. codewords u^N(m₁, m₂, t_b, t_u) according to $p (u^{N} (m_{1}, m_{2}, t_{b}, t_{u}) ∣ a^{N} (m_{1}, m_{2})) = \prod_{i = 1}^{N} p (u_{i} ∣ a_{i})$. These codewords are put into $‖ T_{b} ‖ = 2^{N (\max \{I (U; S ∣ A), I (U; Z)\} - I (U; Z) + ɛ^{'})}$ bins, such that each bin contains $‖ T ‖ / ‖ T_{b} ‖$ codewords. Note that t_b, t_u are the indexes of the bin and codeword, respectively. The codebook structure is shown in Figure A1.

To send the message (m₁, m₂) with the action sequence, a^N(m₁, m₂), and corresponding state sequence s^N, the encoder chooses a u^N(m₁, m₂, t_b, t_u) from the $‖ T ‖$ sequences, such that $(u^{N} (m_{1}, m_{2}, t_{b}, t_{u}), a^{N} (m_{1}, m_{2}), s^{N}) \in T_{U S ∣ A}^{N}$. If no such sequence exists, it picks (t_b, t_u) = (1, 1). Then, the input sequence of the channel is generated by $p (x^{N} ∣ u^{N}, s^{N}) = \prod_{i = 1}^{N} p (x_{i} ∣ u_{i}, s_{i})$.

Figure A1. Codebook structure.
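The layered codebook described above (b^N first, then a^N drawn given b^N, then binned u^N drawn given a^N) can be sketched on a toy scale. This is a minimal illustration, not the authors' construction: the names `draw` and `generate_codebook`, the explicit integer sizes and the example distributions are all assumptions introduced here, whereas in the proof the sizes grow exponentially in N and cannot be enumerated.

```python
import random

def draw(pmf, rng):
    """Sample one symbol from a dict {symbol: probability}."""
    r, acc = rng.random(), 0.0
    for sym, p in pmf.items():
        acc += p
        if r < acc:
            return sym
    return sym  # guard against floating-point round-off

def generate_codebook(n, num_m1, num_m2, num_bins, per_bin,
                      p_b, p_a_given_b, p_u_given_a, seed=0):
    """Toy layered codebook: b^n(m1), a^n(m1, m2), u^n(m1, m2, t_b, t_u).

    All sizes are small hypothetical integers standing in for the
    exponential sizes used in the proof.
    """
    rng = random.Random(seed)
    book = {}
    for m1 in range(num_m1):
        # One i.i.d. codeword b^n per embedded message m1.
        b = [draw(p_b, rng) for _ in range(n)]
        for m2 in range(num_m2):
            # Action sequence drawn letter by letter given b^n.
            a = [draw(p_a_given_b[bi], rng) for bi in b]
            # Auxiliary codewords drawn given a^n, indexed by (bin, position).
            u = {(tb, tu): [draw(p_u_given_a[ai], rng) for ai in a]
                 for tb in range(num_bins) for tu in range(per_bin)}
            book[(m1, m2)] = {"b": b, "a": a, "u": u}
    return book

# Illustrative binary distributions (assumed values).
toy = generate_codebook(
    n=8, num_m1=2, num_m2=2, num_bins=2, per_bin=3,
    p_b={0: 0.5, 1: 0.5},
    p_a_given_b={0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}},
    p_u_given_a={0: {0: 0.7, 1: 0.3}, 1: {0: 0.4, 1: 0.6}},
)
```

The sketch omits the encoder's joint-typicality search over the u^N candidates; it only shows how the three layers of indices are nested.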

Decoding and error probability analysis

Decoder 1 can decode the message, m₁, correctly, since R₁ ≤ H(B). The receiver tries to find a unique sequence $u^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}, {\hat{\hat{t}}}_{b}, {\hat{\hat{t}}}_{u})$, such that $(u^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}, {\hat{\hat{t}}}_{b}, {\hat{\hat{t}}}_{u}), a^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}), y^{N}) \in T_{U A Y}^{N}$. The decoding error probabilities satisfy P_e₁ ≤ ε and P_e₂ ≤ ε by arguments similar to those in [11,12]. We mainly focus on the analysis of equivocation.

Equivocation analysis

\begin{array}{l} H (M_{1}, M_{2} ∣ Z^{N}) = H (M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \\ = H (M_{1}, M_{2}, Z^{N}, U^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \\ = H (M_{1}, M_{2}, U^{N}) + H (Z^{N} ∣ M_{1}, M_{2}, U^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \\ \geq H (U^{N}) + H (Z^{N} ∣ U^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \end{array}

(51)

\begin{array}{l} \geq H (U^{N}) - I (U^{N}; Z^{N}) - H (T_{b}, U^{N} ∣ M_{1}, M_{2}, Z^{N}) \\ = H (U^{N}) - I (U^{N}; Z^{N}) - H (T_{b} ∣ M_{1}, M_{2}, Z^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}, T_{b}) \\ \geq I (U^{N}; Y^{N}) - I (U^{N}; Z^{N}) - H (T_{b}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}, T_{b}) \\ \geq N I (U; Y) - N I (U; Z) - H (T_{b}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}, T_{b}), \end{array}

(52)

where Equation (51) follows from the Markov chain (M₁, M₂) → U^N → Z^N, and Equation (52) holds because the codewords, u^N, are i.i.d. and the channels are discrete memoryless.

Next, we bound H(T_b) and H(U^N|M₁, M₂, Z^N, T_b). Since $‖ T_{b} ‖ = 2^{N (\max \{I (U; S ∣ A), I (U; Z)\} - I (U; Z) + ɛ^{'})}$, we have $H (T_{b}) \leq \log ‖ T_{b} ‖ = N (\max \{I (U; S ∣ A), I (U; Z)\} - I (U; Z) + ɛ^{'})$.

The explanation for bounding $\frac{1}{N} H (U^{N} ∣ M_{1}, M_{2}, Z^{N}, T_{b})$ is presented as follows. We first show that, given M₁, M₂ and T_b, the probability of error for Z^N to decode U^N satisfies P_e ≤ ν. Here, ν is small for sufficiently large N. Given the knowledge of M₁, M₂ and T_b, the total number of possible codewords of U^N is:

\begin{array}{l} \frac{‖ T ‖}{‖ T_{b} ‖} = \frac{2^{N (I (U; Y) - R_{1} - R_{2} - ɛ)}}{2^{N (max {I (U; S ∣ A), I (U; Z)} - I (U; Z) + ɛ^{'})}} \\ = \frac{2^{N (I (U; S ∣ A) + τ_{2} - ɛ)}}{2^{N (max {I (U; S ∣ A), I (U; Z)} - I (U; Z) + ɛ^{'})}} \\ \leq \frac{2^{N (max {I (U; S ∣ A), I (U; Z)} - ɛ)}}{2^{N (max {I (U; S ∣ A), I (U; Z)} - I (U; Z) + ɛ^{'})}} \end{array}

(53)

\begin{array}{l} = 2^{N (I (U; Z) - ɛ - ɛ^{'})} \\ \leq 2^{N I (U; Z)} \end{array}

(54)

where Equation (53) follows from τ₂ ≤ max{I(U; S|A), I(U; Z)} − I(U; S|A). Based on Equation (54), we can easily show that a unique codeword u^N(m₁, m₂, t_b, t_u) exists, such that $(u^{N} (m_{1}, m_{2}, t_{b}, t_{u}), z^{N}) \in T_{U Z}^{N}$ with high probability. This indicates that the probability of error for Z^N to decode U^N satisfies P_e ≤ ν. Therefore, by Fano’s inequality, we obtain:

\frac{1}{N} H (U^{N} ∣ M_{1}, M_{2}, Z^{N}, T_{b}) \leq \frac{1}{N} (1 + P_{e} \log (\frac{‖ T ‖}{‖ T_{b} ‖})) \leq ν^{'}

(55)

where ν′ is small for sufficiently large N.
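The counting in Equations (53) to (55) can be checked numerically on the exponent scale (all quantities divided by N, logarithms to base 2). The following is a small sketch under assumed values: the function names `per_bin_exponent` and `fano_bound` and every number in the usage lines are hypothetical and do not come from the paper.

```python
def per_bin_exponent(i_uy, i_uz, i_us_a, tau2, eps, eps_prime):
    """Exponent (1/N) log2 of the number of u^N codewords per bin.

    Mirrors Equations (53)-(54): total codewords per action sequence
    divided by the number of bins.
    """
    m = max(i_us_a, i_uz)
    # Constraint on tau2 that Equation (53) relies on.
    if not 0 <= tau2 <= m - i_us_a:
        raise ValueError("tau2 must satisfy 0 <= tau2 <= max{I(U;S|A), I(U;Z)} - I(U;S|A)")
    sum_rate = i_uy - i_us_a - tau2          # R1 + R2
    total = i_uy - sum_rate - eps            # equals I(U;S|A) + tau2 - eps
    bins = m - i_uz + eps_prime              # exponent of the number of bins
    return total - bins

def fano_bound(n, p_e, exponent):
    """Right-hand side of Equation (55): (1 + P_e * log2(count)) / N,
    where log2(count) = N * exponent."""
    return (1.0 + p_e * n * exponent) / n

# Assumed illustrative values.
exp_per_bin = per_bin_exponent(i_uy=0.9, i_uz=0.5, i_us_a=0.2,
                               tau2=0.1, eps=0.01, eps_prime=0.01)
bound = fano_bound(n=1000, p_e=0.01, exponent=exp_per_bin)
```

With these numbers the per-bin exponent stays below I(U; Z), which is the condition that lets the wiretapper resolve u^N within a known bin, and the Fano bound shrinks as N grows and P_e decreases.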

Substituting these two results into Equation (52) and utilizing Equation (3), we finish the proof of $lim_{N \to \infty} Δ \geq R_{e}$ for the model in Figure 2 with noncausal channel states.

A.1.2. H(A|Z) ≤ I(U; Y ) − max{I(U; Z), I(U; S|A)}

In this case, we need to prove that any rate-equivocation triple (R₁, R₂, R_e) satisfying the following constraints is achievable.

\begin{array}{l} R_{1} \leq H (B) \\ R_{1} + R_{2} \leq I (U; Y) - I (U; S ∣ A) \\ R_{e} \leq R_{1} + R_{2} \\ R_{e} \leq H (A ∣ Z) \end{array}

It is sufficient to show that the rate triples (R₁, R₂, R_e = H(A|Z)) are achievable. The coding scheme is as follows.

Codebook generation and encoding

Let $R_{1} = H (B) - τ_{1}^{'}$ and $R_{1} + R_{2} = I (U; Y) - I (U; S ∣ A) - τ_{2}^{'}$, where $τ_{1}^{'}, τ_{2}^{'}$ are fixed positive numbers. For each $m_{1} \in \{1, 2, \dots, 2^{N R_{1}}\}$, an independent and identically distributed (i.i.d.) codeword, b^N(m₁), is generated according to $p (b^{N}) = \prod_{i = 1}^{N} p (b_{i})$. Then, $2^{N R_{2}}$ action sequences a^N(m₁, m₂) are generated i.i.d. for each b^N(m₁) according to $p (a^{N} (m_{1}, m_{2}) ∣ b^{N} (m_{1})) = \prod_{i = 1}^{N} p (a_{i} ∣ b_{i})$, where $m_{2} \in \{1, 2, \dots, 2^{N R_{2}}\}$. For each a^N(m₁, m₂), we generate $‖ T ‖ = 2^{N (I (U; Y) - R_{1} - R_{2} - ɛ)}$ i.i.d. codewords u^N(m₁, m₂, t_u) according to $p (u^{N} (m_{1}, m_{2}, t_{u}) ∣ a^{N} (m_{1}, m_{2})) = \prod_{i = 1}^{N} p (u_{i} ∣ a_{i})$.

To send the message (m₁, m₂) with the action sequence, a^N(m₁, m₂), and corresponding state sequence s^N, the encoder chooses a u^N(m₁, m₂, t_u) from the $‖ T ‖$ sequences, such that $(u^{N} (m_{1}, m_{2}, t_{u}), a^{N} (m_{1}, m_{2}), s^{N}) \in T_{U S ∣ A}^{N}$. If no such sequence exists, it picks t_u = 1. Then, the input sequence of the channel is generated by $p (x^{N} ∣ u^{N}, s^{N}) = \prod_{i = 1}^{N} p (x_{i} ∣ u_{i}, s_{i})$.

Decoding and error probability analysis

Decoder 1 can decode the message, m₁, correctly, since R₁ ≤ H(B). The receiver tries to find a unique sequence $u^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}, {\hat{\hat{t}}}_{u})$, such that $(u^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}, {\hat{\hat{t}}}_{u}), a^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}), y^{N}) \in T_{U A Y}^{N}$. The decoding error probabilities satisfy P_e₁ ≤ ε and P_e₂ ≤ ε by arguments similar to those in [11,12].

Equivocation analysis

We need to prove $lim_{N \to \infty} Δ = lim_{N \to \infty} \frac{H (M_{1}, M_{2} ∣ Z^{N})}{N} \geq R_{e}$ . The methods in [18] are utilized.

lim_{N \to \infty} \frac{H (M_{1}, M_{2} ∣ Z^{N})}{N} = lim_{N \to \infty} \frac{H (A^{N} (M_{1}, M_{2}) ∣ Z^{N})}{N}

(56)

\begin{array}{l} = lim_{N \to \infty} \frac{N H (A ∣ Z)}{N} \\ = H (A ∣ Z) \\ \geq R_{e} \end{array}

(57)

where Equation (56) holds because A^N is a function of (M₁, M₂), and Equation (57) holds because the sequences A^N and X^N are generated i.i.d. and the channels are discrete memoryless.

We complete the proof of Theorem 1.
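The two cases of Subsections A.1.1 and A.1.2 amount to choosing which equivocation target the scheme aims for. As a hedged sketch, the selection can be written as a small function; `equivocation_corner` is a hypothetical name, and the inputs are assumed to be the single-letter quantities evaluated for one fixed input distribution.

```python
def equivocation_corner(h_a_z, i_uy, i_uz, i_us_a):
    """Return (case label, target R_e) for one fixed input distribution.

    Case A.1.1 applies when H(A|Z) >= I(U;Y) - max{I(U;Z), I(U;S|A)};
    otherwise case A.1.2 applies and the target is H(A|Z).
    """
    binning_target = i_uy - max(i_uz, i_us_a)
    if h_a_z >= binning_target:
        return "A.1.1", binning_target
    return "A.1.2", h_a_z

# Assumed illustrative values, not taken from the paper.
case, target = equivocation_corner(h_a_z=0.6, i_uy=0.9, i_uz=0.5, i_us_a=0.2)
```

In both branches the returned target equals the smaller of H(A|Z) and I(U; Y) − max{I(U; Z), I(U; S|A)}, so at the boundary between the cases the two schemes aim for the same equivocation.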

A.2. Proof of Theorem 2

In this subsection, we prove that all achievable rate triples (R₁, R₂, R_e) for the model in Figure 2 with noncausal channel states are contained in $ℛ_{on}$.

To prove the condition in Equation (11), we consider:

\begin{array}{l} R_{1} = lim_{N \to \infty} \frac{log ‖ M_{1} ‖}{N} \\ = lim_{N \to \infty} \frac{H (M_{1})}{N} \\ = lim_{N \to \infty} \frac{1}{N} [I (M_{1}; B^{N}) + H (M_{1} ∣ B^{N})] \\ \leq lim_{N \to \infty} \frac{1}{N} [I (M_{1}; B^{N}) + δ (P_{e 1})] \end{array}

(58)

\begin{array}{l} \leq lim_{N \to \infty} \frac{1}{N} [H (B^{N}) + δ (P_{e 1})] \\ = lim_{N \to \infty} \frac{1}{N} [\sum_{i = 1}^{N} H (B_{i} ∣ B^{i - 1}) + δ (P_{e 1})] \\ \leq lim_{N \to \infty} \frac{1}{N} [\sum_{i = 1}^{N} H (B_{i}) + δ (P_{e 1})] \end{array}

(59)

where Equation (58) is based on Fano’s inequality.

To prove the condition in Equation (12), we consider:

\begin{array}{l} R_{1} + R_{2} = lim_{N \to \infty} \frac{log (‖ M_{1} ‖ \cdot ‖ M_{2} ‖)}{N} \\ = lim_{N \to \infty} \frac{H (M_{1}, M_{2})}{N} \\ = lim_{N \to \infty} \frac{1}{N} [I (M_{1}, M_{2}; Y^{N}) + H (M_{1}, M_{2} ∣ Y^{N})] \\ \leq lim_{N \to \infty} \frac{1}{N} [I (M_{1}, M_{2}; Y^{N}) + δ (P_{e 2})] \end{array}

(60)

where Equation (60) is based on Fano’s inequality. Then, the mutual information I(M₁, M₂; Y^N) in Equation (60) is calculated as follows.

I (M_{1}, M_{2}; Y^{N}) = I (M_{1}, M_{2}; Y^{N}) - I (M_{1}, M_{2}; S^{N} ∣ A^{N})

(61)

\begin{array}{l} = \sum_{i = 1}^{N} [I (M_{1}, M_{2}; Y_{i} ∣ Y^{i - 1}) - I (M_{1}, M_{2}; S_{i} ∣ S_{i + 1}^{N}, A^{N})] \\ = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Y_{i} ∣ Y^{i - 1}) - I (S_{i + 1}^{N}, A^{N}; Y_{i} ∣ M_{1}, M_{2}, Y^{i - 1}) \\ - I (M_{1}, M_{2}, Y^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N}) + I (Y^{i - 1}; S_{i} ∣ M_{1}, M_{2}, S_{i + 1}^{N}, A^{N})] \\ = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Y_{i} ∣ Y^{i - 1}) - I (M_{1}, M_{2}, Y^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N})] \end{array}

(62)

\begin{array}{l} \leq \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Y^{i - 1}; Y_{i}) - I (M_{1}, M_{2}, Y^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N})] \\ = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Y^{i - 1}; Y_{i}) - I (M_{1}, M_{2}, Y^{i - 1}, S_{i + 1}^{N}, A^{N}; S_{i} ∣ A_{i}) + I (S_{i + 1}^{N}, A^{N}; S_{i} ∣ A_{i})] \\ = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Y^{i - 1}; Y_{i}) - I (M_{1}, M_{2}, Y^{i - 1}, S_{i + 1}^{N}, A^{N}; S_{i} ∣ A_{i})] \end{array}

(63)

= \sum_{i = 1}^{N} [I (U_{i}; Y_{i}) - I (U_{i}; S_{i} ∣ A_{i})]

(64)

In the above deduction, Equation (61) follows from the Markov chain (M₁, M₂) → A^N → S^N. Equation (62) follows from the identity $\sum_{i = 1}^{N} I (S_{i + 1}^{N}, A^{N}; Y_{i} ∣ M_{1}, M_{2}, Y^{i - 1}) = \sum_{i = 1}^{N} I (Y^{i - 1}; S_{i} ∣ M_{1}, M_{2}, S_{i + 1}^{N}, A^{N})$, which can be derived as in [1] and [18]. Equation (63) follows from the Markov chain $S_{i} \to A_{i} \to (S_{i + 1}^{N}, A^{i - 1}, A_{i + 1}^{N})$. Equation (64) follows from defining $U_{i} = (M_{1}, M_{2}, Y^{i - 1}, S_{i + 1}^{N}, A^{N})$.
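For readers who want the identity behind Equation (62) spelled out, one possible derivation is the standard Csiszár sum argument. The sketch below assumes that A^N is a deterministic function of (M₁, M₂), as used in Equation (56); the shorthand W is introduced here only for brevity and does not appear in the paper.

```latex
% W := (M_1, M_2, A^N) is shorthand introduced for this sketch only.
\begin{align*}
\sum_{i=1}^{N} I(S_{i+1}^{N}, A^{N}; Y_i \mid M_1, M_2, Y^{i-1})
 &= \sum_{i=1}^{N} \big[ I(A^{N}; Y_i \mid M_1, M_2, Y^{i-1})
    + I(S_{i+1}^{N}; Y_i \mid W, Y^{i-1}) \big] \\
 &= \sum_{i=1}^{N} I(S_{i+1}^{N}; Y_i \mid W, Y^{i-1})
    && \text{($A^N$ is a function of $(M_1, M_2)$)} \\
 &= \sum_{i=1}^{N} \sum_{j=i+1}^{N} I(S_j; Y_i \mid W, Y^{i-1}, S_{j+1}^{N})
    && \text{(chain rule)} \\
 &= \sum_{j=1}^{N} \sum_{i=1}^{j-1} I(Y_i; S_j \mid W, Y^{i-1}, S_{j+1}^{N})
    && \text{(swap the order of summation)} \\
 &= \sum_{j=1}^{N} I(Y^{j-1}; S_j \mid W, S_{j+1}^{N})
    && \text{(chain rule)}
\end{align*}
```

Renaming j to i and expanding W gives the right-hand side of the identity, $\sum_{i = 1}^{N} I (Y^{i - 1}; S_{i} ∣ M_{1}, M_{2}, S_{i + 1}^{N}, A^{N})$.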

The condition in Equation (13) is proven as follows.

\begin{array}{l} R_{e} \leq lim_{N \to \infty} Δ \\ = lim_{N \to \infty} \frac{H (M_{1}, M_{2} ∣ Z^{N})}{N} \\ \leq lim_{N \to \infty} \frac{H (M_{1}, M_{2})}{N} = R_{1} + R_{2} \end{array}

(65)

The condition in Equation (14) is proven as follows.

\begin{array}{l} H (M_{1}, M_{2} ∣ Z^{N}) = H (M_{1}, M_{2} ∣ Z^{N}) - H (M_{1}, M_{2} ∣ Y^{N}) + H (M_{1}, M_{2} ∣ Y^{N}) \\ = H (M_{1}, M_{2} ∣ Z^{N}) - H (M_{1}, M_{2}) + H (M_{1}, M_{2}) - H (M_{1}, M_{2} ∣ Y^{N}) + H (M_{1}, M_{2} ∣ Y^{N}) \\ = I (M_{1}, M_{2}; Y^{N}) - I (M_{1}, M_{2}; Z^{N}) + H (M_{1}, M_{2} ∣ Y^{N}) \\ \leq I (M_{1}, M_{2}; Y^{N}) - I (M_{1}, M_{2}; Z^{N}) + δ (P_{e 2}) \end{array}

(66)

From Equation (62), we have:

I (M_{1}, M_{2}; Y^{N}) = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Y_{i} ∣ Y^{i - 1}) - I (M_{1}, M_{2}, Y^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N})]

(67)

Similarly, we can get:

I (M_{1}, M_{2}; Z^{N}) = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Z_{i} ∣ Z^{i - 1}) - I (M_{1}, M_{2}, Z^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N})]

(68)

Substitute Equations (67) and (68) into Equation (66),

\begin{array}{l} H (M_{1}, M_{2} ∣ Z^{N}) \leq \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Y_{i} ∣ Y^{i - 1}) - I (M_{1}, M_{2}, Y^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N}) \\ - I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Z_{i} ∣ Z^{i - 1}) + I (M_{1}, M_{2}, Z^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N})] + δ (P_{e 2}) \\ = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Y_{i} ∣ Y^{i - 1}) - I (M_{1}, M_{2}, Z^{i - 1}, Y^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N}) \\ - I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Z_{i} ∣ Z^{i - 1}) + I (M_{1}, M_{2}, Z^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N})] + δ (P_{e 2}) \end{array}

(69)

\begin{array}{l} = \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Y_{i} ∣ Y^{i - 1}) - I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}; Z_{i} ∣ Z^{i - 1}) \\ - I (Y^{i - 1}; S_{i} ∣ M_{1}, M_{2}, Z^{i - 1}, S_{i + 1}^{N}, A^{N})] + δ (P_{e 2}) \\ \leq \sum_{i = 1}^{N} [I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Y^{i - 1}; Y_{i}) - I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Z^{i - 1}; Z_{i} ∣ Z^{i - 1}) \\ - I (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Y^{i - 1}; S_{i} ∣ M_{1}, M_{2}, Z^{i - 1}, S_{i + 1}^{N}, A^{N})] + δ (P_{e 2}) \end{array}

(70)

= \sum_{i = 1}^{N} [I (U_{i}; Y_{i}) - I (V_{i}; Z_{i} ∣ Q_{i}) - I (U_{i}; S_{i} ∣ V_{i})] + δ (P_{e 2})

(71)

where Equation (69) is from the Markov chain $S_{i} \to (S_{i + 1}^{N}, A^{N}, M_{1}, M_{2}, Y^{i - 1}) \to Z^{i - 1}$ and Equation (71) is from defining $U_{i} = (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Y^{i - 1}), V_{i} = (M_{1}, M_{2}, S_{i + 1}^{N}, A^{N}, Z^{i - 1})$ and Q_i = Zⁱ⁻¹.

To serve the single-letter characterization, let us introduce a time-sharing random variable, J, independent of all other random variables and uniformly distributed over {1, 2, …, N}. Set:

\begin{array}{l} U = (U_{J}, J), V = (V_{J}, J), Q = (Q_{J}, J) \\ A = A_{J}, B = B_{J}, S = S_{J}, X = X_{J}, Y = Y_{J}, Z = Z_{J} \end{array}

Then, substituting the above new random variables into Equations (59), (64) and (71), the conditions in Equations (11), (12) and (14) are verified. From the definition of the auxiliary random variables, the Markov chain Q → V → U → Y → Z is easily verified. This completes the proof of Theorem 2.
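The effect of the time-sharing variable J on a term such as $\frac{1}{N} \sum_{i} I (U_{i}; Y_{i})$ can be illustrated numerically: folding J into the auxiliary variable, U = (U_J, J), can only increase the mutual information with Y = Y_J, because I(U_J, J; Y_J) = I(J; Y_J) + I(U_J; Y_J | J). The sketch below uses randomly drawn per-letter distributions; the function names and the random pmfs are assumptions for illustration, and it covers only the I(U; Y) term, not the full set of conditions.

```python
import math
import random

def mutual_information(joint):
    """I(X;Y) in bits for a joint pmf given as {(x, y): probability}."""
    px, py = {}, {}
    for (x, y), p in joint.items():
        px[x] = px.get(x, 0.0) + p
        py[y] = py.get(y, 0.0) + p
    return sum(p * math.log2(p / (px[x] * py[y]))
               for (x, y), p in joint.items() if p > 0)

def random_joint(rng, nx=2, ny=2):
    """Random strictly positive joint pmf on {0..nx-1} x {0..ny-1}."""
    w = [rng.random() + 1e-3 for _ in range(nx * ny)]
    s = sum(w)
    return {(x, y): w[x * ny + y] / s for x in range(nx) for y in range(ny)}

def time_sharing_check(n=5, seed=1):
    """Compare (1/n) sum_i I(U_i;Y_i) with I((U_J, J); Y_J), J uniform."""
    rng = random.Random(seed)
    per_letter = [random_joint(rng) for _ in range(n)]
    average = sum(mutual_information(p) for p in per_letter) / n
    # U = (U_J, J): the time index becomes part of the auxiliary variable.
    merged = {((u, j), y): p / n
              for j, joint in enumerate(per_letter)
              for (u, y), p in joint.items()}
    return average, mutual_information(merged)

average, single_letter = time_sharing_check()
```

The second returned value is never smaller than the first, which is the direction needed when a per-letter average is upper bounded by a single-letter expression.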

We note that Dai et al. [18] obtained an upper bound on R_e for the model in Figure 3 as I(U; Y) − I(V; Z|Q). Comparing it with our bound on R_e, we notice that an extra term, I(U; S|V), is contained in our bound. The difference between our proof and the proof in [18] about the upper bounds is stated in detail as follows. Both proofs concentrate on bounding H(M₁, M₂|Z^N). In [18], H(M|Z^N) was considered, since no embedding information was imposed. From Equation (66),

H (M_{1}, M_{2} ∣ Z^{N}) \leq I (M_{1}, M_{2}; Y^{N}) - I (M_{1}, M_{2}; Z^{N}) + δ (P_{e 2})

(72)

The two terms in Equation (72) are calculated independently in [18] as:

I (M_{1}, M_{2}; Y^{N}) \leq \sum_{i = 1}^{N} [I (U_{i}; Y_{i}) - I (U_{i}; S_{i} ∣ A_{i})]

and:

I (M_{1}, M_{2}; Z^{N}) \geq \sum_{i = 1}^{N} [I (V_{i}; Z_{i} ∣ Q_{i}) - I (U_{i}; S_{i} ∣ A_{i})]

respectively. The term I(U_i; S_i|A_i) then cancels in [18] upon subtraction, which yields their upper bound. However, the weakness of calculating the two terms independently is that the interrelation between them is lost.

In our proof, we focus on calculating I(M₁, M₂; Y^N) − I(M₁, M₂; Z^N) jointly. The result is shown in Equation (71). The main difference in the proof steps between our work and [18] lies in Equations (69) and (70). In the above derivation, we can see that the extra term, I(U_i; S_i|V_i), originates from $I (M_{1}, M_{2}, Y^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N}) - I (M_{1}, M_{2}, Z^{i - 1}; S_{i} ∣ S_{i + 1}^{N}, A^{N})$.
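The step that turns this difference into the extra term can be written out explicitly. The following sketch starts from the form used in Equation (69), where the Markov chain stated after Equation (71) has already been used to add Z^{i−1} to the first term; the shorthand C_i is introduced here only for brevity, and U_i, V_i are as defined for Equation (71).

```latex
% C_i := (S_{i+1}^{N}, A^{N}) is shorthand introduced for this sketch only.
\begin{align*}
 & I(M_1, M_2, Z^{i-1}, Y^{i-1}; S_i \mid C_i) - I(M_1, M_2, Z^{i-1}; S_i \mid C_i) \\
 &\quad = I(Y^{i-1}; S_i \mid M_1, M_2, Z^{i-1}, C_i)
    && \text{(chain rule)} \\
 &\quad = I(M_1, M_2, S_{i+1}^{N}, A^{N}, Y^{i-1}; S_i \mid M_1, M_2, Z^{i-1}, S_{i+1}^{N}, A^{N})
    && \text{(add variables already in the conditioning)} \\
 &\quad = I(U_i; S_i \mid V_i)
\end{align*}
```

Because this quantity is subtracted in Equation (70), the resulting bound on R_e is smaller by I(U; S|V) than one that treats the two mutual information terms separately.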

B. Proof of Theorems 3 and 4

In this section, Theorems 3 and 4 are proven. To prove Theorem 3, the methods in [18] are utilized and a coding scheme for the model in Figure 2 with causal action-dependent states is provided in Subsection B.1. Subsection B.2 gives the proof of the outer bound on the capacity-equivocation region.

B.1. Proof of Theorem 3

We need to prove that any rate-equivocation triple (R₁, R₂, R_e) ∈ $ℛ_{ic}$ is achievable. Two cases are considered.

B.1.1. H(A|Z) ≥ I(U; Y ) − I(U; Z)

In this case, we need to prove that any rate-equivocation triple (R₁, R₂, R_e) satisfying the following constraints is achievable.

\begin{array}{l} R_{1} \leq H (B) \\ R_{1} + R_{2} \leq I (U; Y) \\ R_{e} \leq R_{1} + R_{2} \\ R_{e} \leq I (U; Y) - I (U; Z) \end{array}

It is sufficient to show that the rate triples (R₁, R₂, R_e = I(U; Y ) − I(U; Z)) are achievable. The coding scheme includes codebook generation, encoding and decoding. Then, we give the equivocation analysis.

Codebook generation and encoding

Let R₁ = H(B) − θ₁ and R₁ + R₂ = I(U; Y) − θ₂, where θ₁, θ₂ are fixed positive numbers. Since R_e ≤ R₁ + R₂, it is easy to get θ₂ ≤ I(U; Z). For each $m_{1} \in \{1, 2, \dots, 2^{N R_{1}}\}$, an independent and identically distributed (i.i.d.) codeword, b^N(m₁), is generated according to $p (b^{N}) = \prod_{i = 1}^{N} p (b_{i})$. Then, $2^{N R_{2}}$ action sequences a^N(m₁, m₂) are generated i.i.d. for each b^N(m₁) according to $p (a^{N} (m_{1}, m_{2}) ∣ b^{N} (m_{1})) = \prod_{i = 1}^{N} p (a_{i} ∣ b_{i})$, where $m_{2} \in \{1, 2, \dots, 2^{N R_{2}}\}$. For each a^N(m₁, m₂), we generate $‖ T_{u} ‖ = 2^{N (I (U; Y) - R_{1} - R_{2} - ɛ)}$ i.i.d. codewords u^N(m₁, m₂, t_u) according to $p (u^{N} (m_{1}, m_{2}, t_{u}) ∣ a^{N} (m_{1}, m_{2})) = \prod_{i = 1}^{N} p (u_{i} ∣ a_{i})$. Note that t_u is the index of codeword u^N.

To send the message (m₁, m₂) with the action sequence, a^N(m₁, m₂), and corresponding state sequence s^N, the encoder randomly chooses an index $t_{u}^{*} \in \{1, 2, \dots, ‖ T_{u} ‖\}$. Then, the input sequence of the channel is generated by $p (x^{N} ∣ u^{N} (m_{1}, m_{2}, t_{u}^{*}), s^{N}) = \prod_{i = 1}^{N} p (x_{i} ∣ u_{i}, s_{i})$.

Decoding and error probability analysis

Decoder 1 can decode the message, m₁, correctly, since R₁ ≤ H(B). The receiver tries to find a unique sequence $u^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}, {\hat{\hat{t}}}_{u})$, such that $(u^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}, {\hat{\hat{t}}}_{u}), a^{N} ({\hat{\hat{m}}}_{1}, {\hat{\hat{m}}}_{2}), y^{N}) \in T_{U A Y}^{N}$. It is easy to show that the decoding error probabilities satisfy P_e₁ ≤ ε and P_e₂ ≤ ε, so we omit the proof here. We mainly focus on the analysis of equivocation.

Equivocation analysis

\begin{array}{l} H (M_{1}, M_{2} ∣ Z^{N}) = H (M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \\ = H (M_{1}, M_{2}, Z^{N}, U^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \\ = H (M_{1}, M_{2}, U^{N}) + H (Z^{N} ∣ M_{1}, M_{2}, U^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \\ \geq H (U^{N}) + H (Z^{N} ∣ U^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) - H (Z^{N}) \end{array}

(73)

\begin{array}{l} = H (U^{N}) - I (U^{N}; Z^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) \\ \geq I (U^{N}; Y^{N}) - I (U^{N}; Z^{N}) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) \\ \geq N I (U; Y) - N I (U; Z) - H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) \end{array}

(74)

where Equation (73) follows from the Markov chain (M₁, M₂) → U^N → Z^N, and Equation (74) holds because the codewords, u^N, are i.i.d. and the channels are discrete memoryless. The conditional entropy, H(U^N|M₁, M₂, Z^N), is bounded as follows. Given the message (M₁, M₂), the number of candidate codewords U^N is $‖ T_{u} ‖ = 2^{N (I (U; Y) - R_{1} - R_{2} - ɛ)} = 2^{N (θ_{2} - ɛ)} \leq 2^{N (I (U; Z) - ɛ)}$. Hence, knowing the message, the wiretapper can decode U^N from Z^N with vanishing error probability, and by Fano’s inequality, as in Equation (55), $\frac{1}{N} H (U^{N} ∣ M_{1}, M_{2}, Z^{N}) \to 0$ as N → ∞. Substituting this result into Equation (74) and utilizing Equation (3), we finish the proof of $lim_{N \to \infty} Δ \geq R_{e}$ for the model in Figure 2 with causal channel states.

B.1.2. H(A|Z) ≤ I(U; Y ) − I(U; Z)

In this case, we need to prove that any rate-equivocation triple (R₁, R₂, R_e) satisfying the following constraints is achievable.

\begin{array}{l} R_{1} \leq H (B) \\ R_{1} + R_{2} \leq I (U; Y) \\ R_{e} \leq R_{1} + R_{2} \\ R_{e} \leq H (A ∣ Z) \end{array}

It is sufficient to show that (R₁, R₂, R_e = H(A|Z)) are achievable. The coding scheme is as follows.

Codebook generation and encoding

Let $R_{1} = H (B) - θ_{1}^{'}$ and $R_{1} + R_{2} = I (U; Y) - θ_{2}^{'}$, where $θ_{1}^{'}, θ_{2}^{'}$ are fixed positive numbers. For each $m_{1} \in \{1, 2, \dots, 2^{N R_{1}}\}$, an independent and identically distributed (i.i.d.) codeword, b^N(m₁), is generated according to $p (b^{N}) = \prod_{i = 1}^{N} p (b_{i})$. Then, $2^{N R_{2}}$ action sequences a^N(m₁, m₂) are generated i.i.d. for each b^N(m₁) according to $p (a^{N} (m_{1}, m_{2}) ∣ b^{N} (m_{1})) = \prod_{i = 1}^{N} p (a_{i} ∣ b_{i})$, where $m_{2} \in \{1, 2, \dots, 2^{N R_{2}}\}$. For each a^N(m₁, m₂), a corresponding codeword, u^N(m₁, m₂), is generated according to $p (u^{N} (m_{1}, m_{2}) ∣ a^{N} (m_{1}, m_{2})) = \prod_{i = 1}^{N} p (u_{i} ∣ a_{i})$. To send the message (m₁, m₂) with the action sequence, a^N(m₁, m₂), and corresponding state sequence s^N, the encoder selects the codeword, u^N(m₁, m₂). Then, the input sequence of the channel is generated by $p (x^{N} ∣ u^{N} (m_{1}, m_{2}), s^{N}) = \prod_{i = 1}^{N} p (x_{i} ∣ u_{i}, s_{i})$.

Decoding and error probability analysis

This step follows similarly from the first case in Subsection B.1.1.

Equivocation analysis

We need to prove $lim_{N \to \infty} Δ = lim_{N \to \infty} \frac{H (M_{1}, M_{2} ∣ Z^{N})}{N} \geq R_{e}$ . The methods in [18] are utilized.

lim_{N \to \infty} \frac{H (M_{1}, M_{2} ∣ Z^{N})}{N} = lim_{N \to \infty} \frac{H (A^{N} (M_{1}, M_{2}) ∣ Z^{N})}{N}

(75)

\begin{array}{l} = lim_{N \to \infty} \frac{N H (A ∣ Z)}{N} \\ = H (A ∣ Z) \\ \geq R_{e} \end{array}

(76)

where Equation (75) holds because A^N is a function of (M₁, M₂), and Equation (76) holds because the sequences, A^N and X^N, are generated i.i.d. and the channels are discrete memoryless.

We complete the proof of Theorem 3.

B.2. Proof of Theorem 4

In this subsection, we prove that all achievable rate triples (R₁, R₂, R_e) for the model in Figure 2 with causal channel states are contained in $ℛ_{on}$.

The conditions in Equations (16) and (18) follow in the same way as Equations (59) and (65). Therefore, we only show Equations (17) and (19) as follows.

To prove the condition in Equation (17), we consider:

\begin{array}{l} R_{1} + R_{2} = lim_{N \to \infty} \frac{log (‖ M_{1} ‖ \cdot ‖ M_{2} ‖)}{N} \\ = lim_{N \to \infty} \frac{H (M_{1}, M_{2})}{N} \\ = lim_{N \to \infty} \frac{1}{N} [I (M_{1}, M_{2}; Y^{N}) + H (M_{1}, M_{2} ∣ Y^{N})] \\ \leq lim_{N \to \infty} \frac{1}{N} [I (M_{1}, M_{2}; Y^{N}) + δ (P_{e 2})] \end{array}

(77)

\begin{array}{l} = lim_{N \to \infty} \frac{1}{N} [\sum_{i = 1}^{N} I (M_{1}, M_{2}; Y_{i} ∣ Y^{i - 1}) + δ (P_{e 2})] \\ \leq lim_{N \to \infty} \frac{1}{N} [\sum_{i = 1}^{N} I (M_{1}, M_{2}, Y^{i - 1}, S^{i - 1}; Y_{i}) + δ (P_{e 2})], \\ \leq lim_{N \to \infty} \frac{1}{N} [\sum_{i = 1}^{N} I (U_{i}; Y_{i}) + δ (P_{e 2})] \end{array}

(78)

where Equation (77) is based on Fano’s inequality and Equation (78) is from defining U_i = (M₁, M₂, Y ⁱ⁻¹, Sⁱ⁻¹).

Before proving the condition in Equation (19), we consider:

\begin{array}{l} I (M_{1}, M_{2}; Z^{N}) = \sum_{i = 1}^{N} I (M_{1}, M_{2}; Z_{i} ∣ Z^{i - 1}) \\ = \sum_{i = 1}^{N} I (M_{1}, M_{2}, Z^{i - 1}; Z_{i} ∣ Z^{i - 1}) . \\ = \sum_{i = 1}^{N} I (V_{i}; Z_{i} ∣ Q_{i}) \end{array}

(79)

where Equation (79) is from defining V_i = (M₁, M₂, Zⁱ⁻¹) and Q_i = Zⁱ⁻¹.

Then, utilizing Equations (78) and (79),

\begin{array}{l} H (M_{1}, M_{2} ∣ Z^{N}) = H (M_{1}, M_{2} ∣ Z^{N}) - H (M_{1}, M_{2} ∣ Y^{N}) + H (M_{1}, M_{2} ∣ Y^{N}) \\ = H (M_{1}, M_{2} ∣ Z^{N}) - H (M_{1}, M_{2}) + H (M_{1}, M_{2}) - H (M_{1}, M_{2} ∣ Y^{N}) + H (M_{1}, M_{2} ∣ Y^{N}) \\ = I (M_{1}, M_{2}; Y^{N}) - I (M_{1}, M_{2}; Z^{N}) + H (M_{1}, M_{2} ∣ Y^{N}) \\ \leq I (M_{1}, M_{2}; Y^{N}) - I (M_{1}, M_{2}; Z^{N}) + δ (P_{e 2}) \end{array}

(80)

\leq \sum_{i = 1}^{N} [I (U_{i}; Y_{i}) - I (V_{i}; Z_{i} ∣ Q_{i})] + δ (P_{e 2})

(81)

where Equation (80) is from Fano’s inequality.

To serve the single-letter characterization, let us introduce a time-sharing random variable, J, independent of all other random variables and uniformly distributed over {1, 2, …, N}. Set:

\begin{array}{l} U = (U_{J}, J), V = (V_{J}, J), Q = (Q_{J}, J), \\ A = A_{J}, B = B_{J}, S = S_{J}, X = X_{J}, Y = Y_{J}, Z = Z_{J} \end{array}

Then, substituting the above definitions into Equations (78) and (81), the conditions in Equations (17) and (19) are verified straightforwardly. From the definition of the auxiliary random variables, the Markov chains Q → V → U → Y → Z and U → A → S are easily verified. This completes the proof of Theorem 4.

Conflicts of Interests

The authors declare no conflict of interest.

Author Contributions

Xinxing Yin and Zhi Xue did the theoretical work and wrote this paper. All authors have read and approved the final manuscript.

References

1. Weissman, T. Capacity of channels with action-dependent states. IEEE Trans. Inf. Theory 2010, 56, 5396–5411.
2. Asnani, H.; Permuter, H.; Weissman, T. Probing capacity. IEEE Trans. Inf. Theory 2011, 57, 7317–7332.
3. Steinberg, Y.; Weissman, T. The degraded broadcast channel with action-dependent states. Proceedings of the IEEE International Symposium on Information Theory (ISIT), Boston, MA, USA, 1–6 July 2012; pp. 596–600.
4. Steinberg, Y. The degraded broadcast channel with non-causal action-dependent side information. Proceedings of the IEEE International Symposium on Information Theory (ISIT), Istanbul, Turkey, 7–12 July 2013; pp. 2965–2969.
5. Ahmadi, B.; Simeone, O. On channels with action-dependent states. 2012. arXiv:1202.4438. Available online: http://arxiv.org/abs/1202.4438 (accessed on 22 July 2012).
6. Asnani, H.; Permuter, H.; Weissman, T. To observe or not to observe the channel state. Proceedings of the Annual Allerton Conference on Communication, Control, and Computing, Monticello, IL, USA, 29 September–1 October 2010; pp. 1434–1441.
7. Kittichokechai, K.; Oechtering, T.J.; Skoglund, M. Capacity of the channel with action-dependent state and reversible input. Proceedings of the IEEE Swedish Communication Technologies Workshop (Swe-CTW), Stockholm, Sweden, 19–21 October 2011.
8. Kittichokechai, K.; Oechtering, T.J.; Skoglund, M. Coding with action-dependent side information and additional reconstruction requirements. 2012. arXiv:1202.1484. Available online: http://arxiv.org/abs/1202.1484 (accessed on 7 February 2012).
9. Choudhuri, C.; Mitra, U. Action dependent strictly causal state communication. 2012. arXiv:1202.0934. Available online: http://arxiv.org/abs/1202.0934 (accessed on 5 February 2012).
10. Choudhuri, C.; Mitra, U. How useful is adaptive action? Proceedings of the Global Communications Conference, Anaheim, CA, USA, 3–7 December 2012; pp. 2251–2255.
11. Ahmadi, B.; Asnani, H.; Simeone, O.; Permuter, H. Information embedding on actions. Proceedings of the IEEE International Symposium on Information Theory (ISIT), Istanbul, Turkey, 7–12 July 2013; pp. 186–190.
12. Ahmadi, B.; Asnani, H.; Simeone, O.; Permuter, H. Information embedding on actions. 2012. arXiv:1207.6084. Available online: http://arxiv.org/abs/1207.6084 (accessed on 25 July 2012).
13. Petitcolas, F.A.P.; Anderson, R.J.; Kuhn, M.G. Information hiding—A survey. Proc. IEEE 1999, 87, 1062–1078.
14. Moulin, P.; O’Sullivan, J.A. Information-theoretic analysis of information hiding. IEEE Trans. Inf. Theory 2003, 49, 563–593.
15. O’Sullivan, J.A.; Moulin, P.; Ettinger, J.M. Information theoretic analysis of steganography. Proceedings of the IEEE International Symposium on Information Theory (ISIT), Cambridge, MA, USA, 16–21 August 1998.
16. Zaidi, A.; Vandendorpe, L. Coding schemes for relay-assisted information embedding. IEEE Trans. Inf. Forensics Secur. 2009, 4, 70–85.
17. Zaidi, A.; Piantanida, P.; Duhamel, P. Broadcast- and MAC-aware coding strategies for multiple user information embedding. IEEE Trans. Signal Process. 2007, 55, 2974–2992.
18. Dai, B.; Vinck, A.J.H.; Luo, Y.; Tang, X. Wiretap channel with action-dependent channel state information. Entropy 2013, 15, 445–473.
19. Dai, B.; Vinck, A.J.H.; Luo, Y. Wiretap channel in the presence of action-dependent states and noiseless feedback. J. Appl. Math. 2013, 2013.
20. Dai, B.; Luo, Y. Some new results on the wiretap channel with side information. Entropy 2013, 14, 1671–1702.
21. Le Treust, M.; Zaidi, A.; Lasaulce, S. An achievable rate region for the broadcast wiretap channel with asymmetric side information. Proceedings of the 49th Annual Allerton Conference on Communication, Control, and Computing, Monticello, IL, USA, 28–30 September 2011; pp. 68–75.
22. Gel’fand, S.I.; Pinsker, M.S. Coding for channel with random parameters. Probl. Control Inf. Theory 1980, 9, 19–31.
23. Cover, T.M. Elements of Information Theory; Wiley: New York, NY, USA, 1991.
24. El Gamal, A.; Kim, Y. Network Information Theory; Cambridge University Press: New York, NY, USA, 2011.
25. Csiszár, I.; Körner, J. Information Theory: Coding Theorems for Discrete Memoryless Systems; Academic Press: London, UK, 1981.

Figure 1. Information embedding on actions [11].

Figure 2. Wiretap channel with information embedding on actions.

Figure 3. The wiretap channel with action-dependent states [18].

Figure 4. The binary symmetric channel with information embedding on actions.

Figure 5. The tradeoff between the sum secrecy rate and the information embedding rate under the secrecy constraint.

© 2014 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Yin, X.; Xue, Z. Wiretap Channel with Information Embedding on Actions. Entropy 2014, 16, 2105-2130. https://doi.org/10.3390/e16042105
