1. Introduction
In recent years, the application of semantic communication (SC) in wireless relay networks has received extensive attention. Numerous studies have been conducted on semantic information transmission, forwarding strategies, and optimized architectures to enhance the efficiency and reliability of communication.
Firstly, in the aspect of deep learning-driven semantic relay communication, Study Wang et al. [
1] proposed a two-way relay network based on deep joint source-channel coding (DeepJSCC-TWRN), focusing on optimizing the image transmission quality. In addition, Erkantarcı et al. [
2] investigated integrating advanced deep learning models such as ResNet-18, GoogLeNet, and enhanced GoogLeNet into the semantic forwarding (SF) framework in 6G networks to improve the performance of collaborative communication. Zhang et al. [
3] further proposed a task-oriented relay strategy, performing re-encoding and forwarding (RF) based on a deep neural network (DNN) to optimize the performance of classification tasks in device-edge collaborative inference systems.
In terms of semantic forwarding optimization, Luo et al. [
4] proposed a semantic communication scheme for wireless relay channels based on autoencoders (AESC) and designed a new semantic forwarding (SF) mode, which is particularly suitable for scenarios where there is a lack of common knowledge between the source node and the destination node. Tang et al. [
5] further proposed an on-demand semantic forwarding framework to enhance the flexibility and adaptability of semantic information transmission. Arda et al. [
6] studied collaborative semantic text communication and proposed two strategies, namely semantic lossy forwarding (SLF) and semantic predictive forwarding (SPF), to optimize the transmission performance under different channel conditions.
In terms of semantic relay architecture, Ma et al. [
7] designed an intelligent relay-assisted semantic communication system, combining two modes of amplify-and-forward (AF) and decode-and-forward (DF). By using an intelligent semantic relay (Sem-Relay) to continuously share the knowledge background with the transmitter and receiver, the transmission efficiency is improved. Mu et al. [
8] explored a new type of heterogeneous communication system that enables access points (APs) to simultaneously transmit semantic streams and bit streams to semantic users (S-users) and bit users (B-users). Yin et al. [
9] studied semantic communication (SC) in multi-hop relay networks for long-distance or high path loss attenuation conditions to enhance the reliability of long-distance information transmission. In addition, Yan et al. [
10] proposed the concept of semantic spectral efficiency (S-SE), providing a new measure for the transmission efficiency of semantic information.
In the development of semantic relay (SemRelay), Liu et al. [
11] proposed a SemRelay architecture equipped with a semantic receiver, specifically designed to assist in text transmission from a resource-rich base station (BS) to resource-constrained mobile devices. Hu et al. [
12] further expanded this architecture to support text transmission for multiple resource-constrained mobile devices. Zhao et al. [
13] proposed an uplink SemRelay-assisted wireless communication system, using the probability graph shared between the SemRelay and the base station (BS) to provide efficient semantic communication services for multiple users. Hu et al. [
14] regarded the SemRelay as an edge server, providing deep learning-based semantic communication (DeepSC) services for semantic users (SemUsers) with rich computing resources and traditional users (ConUsers) with limited resources.
In summary, these studies have promoted the application and development of semantic communication in wireless relay networks from multiple aspects such as deep learning methods, forwarding optimization, architecture design, and the evolution of SemRelay, laying the foundation for future efficient and intelligent semantic communication systems.
The DeepSC system Xie et al. [
15] proposed a deep learning-based semantic communication framework, but it did not analyze the outage probability in relay cooperation and multi-user scenarios. Although L-DeepSC Xie and Qin [
16] optimized lightweight semantic transmission, it is only applicable to direct connection links and does not consider the semantic conversion function of relay nodes. The SemRelay framework Hu et al. [
14] introduced semantic relay for the first time, but it did not quantify the impact of background knowledge differences on the outage probability.
There are two key limitations in the current research in the field of semantic relay communication: Firstly, most of the existing work focuses on the optimization of semantic rate, but ignores the modeling of the outage probability, a key reliability indicator. In particular, it fails to establish a theoretical correlation between the outage probability and semantic parameters (such as the matching degree of background knowledge, the symbol compression rate K, the computing power of the destination node, etc.), which leads to a blind spot in the performance evaluation of the system in actual complex channel environments. Existing models usually assume that the destination node (D) has sufficient computing power to support complex semantic decoding. However, in practical scenarios, the limited computing resources of node D may lead to the failure of semantic information parsing, and the dynamic coupling relationship between computing power and semantic decoding ability has not been quantitatively analyzed. Secondly, when traditional relay strategies (such as AF/DF) are directly applied to semantic transmission scenarios, their design does not fully consider the impact of semantic decoding failure on system reliability. For example, when there is a mismatch in background knowledge or semantic symbols are damaged, the simple amplification or forwarding of traditional relays will significantly reduce the fidelity of semantic information, thereby increasing the risk of system outage. These limitations make it difficult for existing methods to meet the requirements of 6G communication for highly reliable semantic transmission, and there is an urgent need to establish an outage probability analysis framework that integrates semantic features and channel dynamics.
This paper focuses on the relay semantic communication system and conducts an in-depth study of its outage probability performance under different conditions. It innovatively proposes a multi-mode intelligent relay design framework. Through four working modes of a single relay node (semantic collaborative decoding-forwarding, semantic adaptation forwarding, semantic enhancement forwarding, semantic-bit hybrid forwarding), this framework achieves the collaborative optimization of the matching degree of background knowledge, channel quality, and the computing power of the destination node. Based on the three-point two-line communication model (source node-relay node-destination node), the study combines the DeepSC model to achieve cross-layer collaboration of semantic encoding and decoding and channel encoding and decoding at the relay node. For complex scenarios such as limited computing power of the destination node, mismatch of background knowledge, and channel degradation, the closed-form expressions of the outage probability of the four relay modes are derived for the first time, and the coupling influence mechanism of semantic symbol redundancy, background knowledge differences, and computing power conversion efficiency on system reliability is systematically quantified. Theoretical analysis shows that:
In scenarios with small background knowledge differences, the semantic adaptation forwarding mode can significantly reduce the system outage probability;
When the computing power of the destination node is limited, the semantic-bit hybrid mode can effectively improve communication reliability by flexibly switching transmission strategies;
The reasonable configuration of semantic symbol redundancy plays a key role in maintaining the integrity of semantic information and resisting channel interference.
Verified by Monte Carlo simulation, in dynamic channel conditions and heterogeneous background knowledge scenarios, the dynamic switching mechanism of the multi-mode relay shows significant performance advantages compared with a single strategy, and the outage probability in different scenarios is greatly reduced. This study provides theoretical support for the reliable transmission and resource optimization of 6G semantic communication systems, reveals the potential value of the collaborative optimization of semantic parameters and channel dynamics, and lays an important theoretical foundation for the design and performance evaluation of future intelligent communication systems.
The organization of this paper is as follows: The second part establishes the system model. The third part deals with the outage probability performance of the proposed relay selection strategy and gives the exact and high signal-to-noise ratio approximation. The fourth part presents the simulation and analysis results, and the fifth part summarizes the full text and gives our conclusions.
Notation: and denote the sets of complex matrices and real matrices of size , respectively. Boldface variables represent matrices or vectors. indicates that the variable x follows a circularly symmetric complex Gaussian distribution with mean and covariance .
2. System Model
We consider a three-point two-line communication transmission model, as shown in
Figure 1, which includes the source node (
S), the relay node (
R), and the destination node (
D), with no direct link. The relay operates in half-duplex mode and uses DeepSC as the communication transmission model. Semantic and channel encoders and decoders are deployed at both
S and
R, where the semantics of the text can be effectively extracted through the Transformer model. It is assumed that the DeepSC transceivers are trained at a base station or cloud platform.
In this model, S inputs a sentence , where represents the L-th word in the sentence. The DeepSC transmitter consists of two parts: the semantic encoder and the channel encoder, which are responsible for extracting semantic information from s and ensuring its successful transmission over the physical channel. The sentence is input into the DeepSC transmitter and mapped into a semantic symbol vector , where , and is the length of the semantic symbol vector after transforming the sentence. We note that the length of changes with L, allowing more effective extraction of semantic information from sentences of varying lengths. In this model, K represents the average number of semantic symbols used for each word, and each semantic symbol can be directly transmitted over the communication medium.
The encoded symbol stream can be expressed as:
where
,
is the semantic encoder network with parameter set
, and
is the channel encoder with parameter set
.
At
R, the received signal is:
where
,
h represents the Rayleigh fading channel, which follows a
distribution, and
. The decoded signal can be expressed as:
where
is the recovered sentence,
is the channel decoder with parameter set
, and
is the semantic decoder network with parameter set
.
At D, depending on D’s computational power, either bit transmission or semantic transmission is employed. Specifically, if D has sufficient computational power to decode the semantic stream, semantic transmission is used over the link. If D lacks sufficient computational power, traditional bit transmission is used.
Both the
(Source-Relay) link and
(Relay-Destination) link employ Rayleigh fading channel transmission models. In a Rayleigh fading channel,
follows an exponential distribution with parameter
, that is:
For The signal-to-noise ratio of the Source-Relay Link(
), its expression is:
where
is the transmit power, and
is the noise power.
is a linear transformation of
, which is:
It can be derived that
follows an exponential distribution with parameter
, whose probability density function is:
where:
Similarly, we can derive the following for the
link: In a Rayleigh fading channel,
also follows an exponential distribution with parameter
, that is:
For The signal-to-noise ratio of the Relay-Destination Link(
), its expression is:
where
is the transmit power and
is the noise power.
is a linear transformation of
, which is:
It can be derived that
follows an exponential distribution with parameter
, whose probability density function is:
where:
Based on the computational power of D and the semantic-transmission-suitability of channel conditions, the channel transmission strategy falls into four distinct scenarios:
Both SR and RD links engage in semantic transmission, and S and D share identical background knowledge (): Here, D has ample storage and computational resources to decode the semantic information being transmitted. Moreover, its semantic encoding logic aligns with that of S, indicating shared . R, functioning much like a traditional decode-and-forward (DF) relay, decodes and re-encodes data semantically. This ensures the integrity and efficient conveyance of semantic information.
Both SR and RD links carry out semantic transmission, yet S and D possess different : In this case, D still has enough computational resources for semantic-information decoding. However, its background knowledge diverges from that of S. This disparity may cause discrepancies in semantic-information expression and comprehension. Therefore, R must undertake additional tasks, such as semantic-information conversion and adaptation, to enable D to correctly interpret the received semantic information.
Both SR and RD links perform semantic transmission. S and D share the same , but signal degradation impedes D’s decoding: Although D has sufficient computational resources and shares background knowledge with S, long-distance propagation and poor channel quality result in significant signal degradation. This prevents D from accurately decoding the semantic information. Consequently, R operates as a traditional amplify-and-forward (AF) relay, amplifying and forwarding the signal. In this situation, the system’s channel capacity is equivalent to that of a two-hop AF relay.
The SR link uses semantic transmission while the RD link employs bit transmission: In this scenario, S and R communicate using semantic transmission, whereas R and D rely solely on bit transmission. This model is suitable when D has limited computational resources. R must decode the semantic information and re-encode it as a bit stream for transmission to D. Here, the transmission performance of the RD link is determined by the bit-transmission channel capacity.
In summary, these four cases demonstrate the optimization strategies for transmission under different system conditions. By combining semantic transmission with traditional transmission methods, these strategies address various limitations, such as background knowledge matching, channel quality, and computational resources, providing a theoretical basis and practical guidance for efficient semantic transmission in relay communication systems.
Based on the above four transmission strategies, four types of working modes assumed by the relay node in different scenarios can be further summarized, namely semantic collaborative decoding-forwarding, semantic adaptation forwarding, semantic enhancement forwarding, and semantic-bit hybrid forwarding. Under different modes, the relay node has different functional focuses to adapt to various constraints such as the matching of background knowledge, channel quality, and computing resources, providing flexible solutions for achieving efficient semantic communication.
3. Performance Analysis
To evaluate the performance of semantic communication in text transmission, this paper uses semantic similarity as the primary performance metric. Specifically, the semantic similarity
is defined as:
where
represents the bidirectional encoder representations from Transformers (BERT) model, which has made significant improvements in state-of-the-art sentence embedding methods. The pre-trained Sentence-BERT model is used. Unlike other semantic evaluation metrics like BLEU, the BERT similarity can more accurately measure the semantic distance between two sentences. Notably, the range of semantic similarity values is
, where
indicates that the two sentences are identical, and
means they have no similarity.
Based on semantic similarity, a new performance metric called semantic rate is introduced in literature [
10] to measure the semantic information transmission rate achieved by DeepSC. Let
I represent the semantic units (suts), i.e., the average amount of semantic information contained in a sentence
s. Therefore, the semantic information for each semantic symbol can be expressed as
(units: suts/symbol). Recall that the symbol rate is equal to the transmission bandwidth
W. Thus, the effective semantic rate (units: suts/s) can be expressed as:
where
is a function dependent on the specific semantic communication system and physical channel conditions. The value of
can be obtained by running the DeepSC tool, with the mapping relationship shown in
Figure 2.
Semantic similarity has been proven to be a function of received signal-to-noise ratio (SNR)
and
K. For any given
K, the semantic similarity function
typically follows an “S” curve as SNR changes. Therefore, a generalized logistic regression method can be used to approximate it as a sigmoid function, expressed as:
where
,
,
, and
are constant coefficients dependent on
K.
Based on this, the channel capacity formulas for four scenarios can be derived:
3.1. Semantic Collaborative Decoding-Forwarding
where
W is the channel bandwidth,
I is the average amount of semantic information per sentence, measured in semantic units (suts),
K is the average number of semantic symbols per word in the original sentence,
L is the number of words in the sentence, and
is the semantic similarity.
The bit rate for transmission is:
where
is the conversion factor for encoding the source information into bits, indicating the average number of bits per word.
The capacity of the RD link is:
where the bit stream includes encoded semantic symbols, and
represents the average number of bits per semantic symbol using the traditional joint encoder. Thus, the effective semantic symbol rate on the RD link is:
The effective semantic bit rate on the SR link is:
Assuming the predefined channel capacity is
C, the signal-to-noise ratio (SNR) threshold is
, and the outage probability is:
Based on
, we can derive:
Substituting:
where
, we can obtain:
The SR link follows a Rayleigh channel model, so the probability
is:
Using the CDF for the Rayleigh distribution, this becomes:
Similarly, for
, we get:
and the probability:
Hence, the outage probability is:
3.2. Semantic Adaptation Forwarding
For the SR link, the effective semantic rate is:
which can be converted to the bit transmission rate:
For the RD link, the effective semantic rate is:
and the bit transmission rate is:
The overall communication capacity is determined by the minimum value of the two links:
Thus, the outage probability can be expressed as:
Using the probability properties, we get:
Substituting the expressions for
and
, we have:
3.3. Semantic Enhancement Forwarding
In a two-hop relay communication system, the channel capacity is:
The semantic channel capacity is derived as:
The bit rate for transmission is:
Based on this, the outage probability is:
Substituting the expression for
, we get:
Let
, and simplify:
Under high SNR conditions, where
and
, we can approximate:
and the final expression for the outage probability involves integrating the joint probability of two independent exponential random variables:
3.4. Semantic-Bit Hybrid Forwarding
In this case, the effective semantic rate for the SR link is:
and the bit rate is:
For the RD link, the bit transmission rate is:
The overall communication capacity is:
The outage probability is:
The probabilities for the SR and RD links are derived as:
and:
The final outage probability is:
After completing the theoretical derivations of the outage probabilities in various scenarios and systematically summarizing the key performance indicators and their mathematical expressions of different transmission strategies, this section will summarize and compare the analysis results of the outage probabilities in the four situations.
Table 1 details the expressions of the channel capacities of the SR link and the RD link in each scenario, the overall channel capacity model of the SD link, as well as the corresponding closed-form formulas for the outage probabilities.
4. Simulation and Results
This section presents the theoretical analysis of outage probability for the four proposed semantic communication systems and verifies it through Monte Carlo simulations. The experimental parameters are shown in
Table 2. In this simulation, the outage probability for four relay-based semantic communication models is tested under different values of
K and transmit power of the source.
This simulation is implemented on the MATLAB R2022a platform, making full use of its numerical calculation and stochastic process simulation capabilities. The core modules of the simulator include the channel model, semantic similarity mapping, and outage event determination.
The channel model adopts the Rayleigh fading channel, in which the channel gains and follow an exponential distribution, and the instantaneous signal-to-noise ratios (SNRs) and are calculated. Semantic similarity mapping calculates the semantic similarity based on the pre-trained Sentence-BERT model, and then approximates it using the Sigmoid function. The outage event determination checks, according to the outage probability formula derived in the paper, whether each transmission meets the capacity threshold C in sequence.
The Monte Carlo simulation runs 2000 independent iterations. Each iteration includes the generation of a complete channel state, semantic encoding and decoding, calculation of the transmission rate, and outage determination. This number of iterations balances computational efficiency while ensuring the stability of the statistical results. The final outage probability is obtained by calculating the ratio of the frequency of outage events to the total number of iterations.
4.1. First Case
In the BD channel, bit transmission is performed, so the SD link’s channel capacity is actually determined by the SR link. Since the signal-to-noise ratio (SNR)
is much greater than 1, no matter how the transmit powers of
S and
R are changed,
is approximately equal to 1, meaning the outage probability can be approximated to 0. The SD link’s channel capacity decreases as
K increases. As shown in
Figure 3.
4.2. Second Case
By adjusting the transmit power
P of
S and
R, the semantic similarity
between the SR and RD links can be modified, thereby altering the channel capacity of the SD and RD links. As the value of
P increases, the SNR of the link increases. At this point, the SD link’s channel capacity is determined by the link with the lower SNR. When
K exceeds 6, the outage probability decreases and approaches 0.As shown in
Figure 4.
4.3. Third Case
In this case, the outage probability decreases as
K increases. Since the relay performs an amplify-and-forward (AF) function, this case is consistent with a traditional three-node communication model. The outage probability decreases as the SNR increases. As shown in
Figure 5.
4.4. Fourth Case
In this case, the outage probability decreases as
K increases. As shown in
Figure 6.