Next Article in Journal
Naked-Eye Detection of Water Contaminants Enabled by Engineered Plasmonic Resonance in Hybrid Detector Systems
Previous Article in Journal
Statistical Modeling and Analysis of Similar Compound Interaction in Scientific Research
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On Rate Fairness Maximization for the Downlink NOMA with Improper Signaling and Imperfect SIC

1
School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
2
National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(18), 9970; https://doi.org/10.3390/app15189970
Submission received: 18 August 2025 / Revised: 4 September 2025 / Accepted: 6 September 2025 / Published: 11 September 2025
(This article belongs to the Section Electrical, Electronics and Communications Engineering)

Abstract

Non-orthogonal multiple access (NOMA) is a key enabler for 6G networks due to its efficient spectrum utilization, which is garnering significant attention among the Internet of Things (IoT) community. This paper investigates the benefits of the improper Gaussian signaling (IGS) technique on the max–min fairness of the downlink NOMA system under imperfect successive interference cancellation (SIC), where both of the users have the potential to adopt IGS. We first investigate fairness optimization under perfect SIC. In this case, the max–min optimization is solved by the alternate optimization algorithm, where the impropriety degree and power level are iteratively optimized. The closed-form solution for conventional proper Gaussian signaling is also obtained. Then, a deep Q network-based solution is considered for the rate fairness maximization of the downlink NOMA system under IGS and imperfect SIC. The simulations presented for the IGS-aided NOMA system support the analysis, illustrating that IGS can efficiently improve the fairness achievable rate compared to the conventional proper one.

1. Introduction

Non-orthogonal multiple access (NOMA) is a promising technique to achieve massive connectivity and higher spectral efficiency for various kinds of sophisticated applications [1,2,3,4,5,6], such as smart healthcare, remote driving, and virtual reality [7], in beyond 5G and 6G communication networks by allowing different users to access the same time and frequency resources with different power levels [8,9,10,11,12,13,14,15,16]. Typically, the user with weaker channel strength will be allocated more power, and its signal is recovered by directly treating the signals of other users as noise at the receiver end. On the other hand, the signal of the strong channel user is decoded by the successive interference cancellation (SIC) [17,18,19] process, in which the signal of the weak channel user is first detected and then removed from the received signal.
Proper Gaussian signaling (PGS), which is a kind of complex signaling with its in-phase and quadrature components uncorrelated and having equal variance, is a commonly used assumption in various wireless communication systems [20,21]. However, some recent results have proven that improper Gaussian signaling (IGS), where the real and imaginary parts of the signals are correlated or possess unequal variance, can achieve better system throughput and reliability, especially when there exists strong interference between users [22,23,24]. For instance, the achievable rate regions of improper signals were studied in terms of the Pareto boundary and the outer boundary by assuming cooperation between players [25]. However, that work focuses on perfect channel information. For the practical imperfect channel information case, the authors in [26] derived a robust design for the rate-region boundary in closed form, which is independent of the phase of the channels, where only one of the multiple users is allowed to transmit IGS. The above two works only consider the two-user situation. Relatedly, the pioneering work in [27] extends them to the multiuser case, where the precoding matrices of the K-user single-input single-output interference channels were optimized by minimizing the pairwise error probability and symbol error rate. In [28], the rate advantages of IGS were investigated in the interference channel with additive asymmetric hardware distortions by treating interference as noise. The joint design of transmit beamformers and reflecting coefficients of a reconfigurable intelligent surface was considered in [29] in order to maximize the geometric mean of the achievable rates by developing an efficient alternating descent algorithm via generating feasible points for the non-convex problem. To further improve the geometric mean of the data rates, the joint design of widely linear transmit beamformers integrated with programmable reflecting coefficients was also considered. The authors in [30] investigated different optimization frameworks for rate splitting in multiple-input multiple-output reconfigurable intelligent surface-based systems, including weighted sum rate maximization, total transmit power minimization, weighted minimum energy efficiency, and global energy efficiency maximization. Moreover, by integrating simultaneous wireless information and power transfer and hardware impairments with an intelligent reflecting surface [31], fairness maximization was considered by jointly optimizing the reflecting coefficients, beamforming, and power splitting coefficients, subject to the minimum harvested energy requirements and total power budget. A two-layer low-complexity iterative algorithm was proposed in [32] to solve the max–min achievable rate optimization problem, where the transmit beamforming vectors and reflecting phase shifts must be optimized in terms of the transmit power budget constraint.
Despite many reported solutions for IGS-based interference channels and relay networks, this paradigm has just emerged in NOMA systems. Transmit beamforming to maximize users’ minimum throughput under power constraints was investigated [33] in terms of orthogonal multiple access and the NOMA system by using IGS. Moreover, the advantage of IGS over PGS was highlighted in the context of the mixed information superposition technique for the general multi-cell multiuser NOMA network [34]. However, max–min optimization has not been characterized for the IGS-assisted NOMA system under imperfect successive interference cancellation (SIC). In this work, we first formulate the data rates of both users in terms of the impropriety degree under imperfect SIC for the NOMA system. Then, the alternate optimization algorithm is used to optimize the impropriety degrees and power level for the perfect SIC case. We also derive the closed-form solution for the PGS-based NOMA with perfect SIC. Furthermore, by virtue of the deep Q network (DQN), we optimize the impropriety degrees and power levels for the imperfect SIC case. The main contributions of this work are summarized as follows.
  • This work is a novel attempt to investigate max–min optimization for the IGS-assisted NOMA system via designing the impropriety degrees and power levels for both users regarding perfect and imperfect SIC. Conversely, [33,34] only considered the perfect SIC case, which is not practical.
  • The max–min optimization of the NOMA system employing IGS is formulated under perfect SIC. To solve this problem, the alternate optimization (AO) technique is utilized to iteratively optimize the impropriety degrees and power level.
  • The fairness optimization of the IGS-based NOMA system is formulated under imperfect SIC. To address this issue, a DQN-based suboptimal solution is proposed to learn the impropriety degrees and power levels of both users.
  • The simulation results confirm the effectiveness of the IGS-aided NOMA system and highlight the improvements introduced by IGS as compared to PGS.
The rest of this paper is organized as follows. Section 2 presents the system model of the IGS-based NOMA system under imperfect SIC. Section 3 optimizes the fairness maximization of the NOMA system with IGS and perfect SIC via the alternate optimization technique. Section 4 provides the DQN solution for the IGS-aided NOMA system under imperfect SIC. The simulation results and discussion are presented in Section 5. Finally, Section 6 concludes the paper.
Notation 1.
Lowercase letters, such as x, are used to denote scalars, and boldface letters, such as x, are used for column vectors. The statistical expectation and absolute value operators are denoted by E { · } and | · | , respectively.

2. System Model

2.1. Preliminaries for Improper Gaussian Random Variables

A zero-mean complex-valued random variable, x, with variance σ x 2 = E { | x | 2 } and complementary variance σ ˜ x 2 = E { x 2 } is called proper if σ ˜ x 2 = 0 ; otherwise, it is called improper. Furthermore, the variance σ x 2 and the complementary variance σ ˜ x 2 are a valid pair if and only if σ x 2 0 and | σ ˜ x 2 | σ x 2 . The impropriety degree of x is defined as
κ x = | σ ˜ x 2 | σ x 2 .
It is easy to verify that 0 κ x 1 [22,23].

2.2. System Description

This work investigates a two-user downlink NOMA system with a single antenna base station (BS), where a strong channel user (user 1) and a weak channel user (user 2) are served. The channel from the BS to the user i, denoted by h i , i = 1 , 2 , is modeled as a complex independent and identically distributed Gaussian random variable with zero mean and variance σ h i 2 , while the zero mean additive white Gaussian noise n i with variance σ n i 2 is assumed. We define Γ i = | h i | 2 / σ n i 2 as the channel-to-noise ratio (CNR) of the user i, which satisfies Γ 1 Γ 2 . The power allocated to user i is p i , which is bounded by P i , i = 1 , 2 . According to the protocol of NOMA, the strong channel user is allowed to transmit with a lower power, while the weak channel user is allocated a higher power, i.e.,  p 1 p 2 , and subsequently, P 1 P 2 , which operates the efficient SIC decoder. Specifically, the weak user is able to decode its transmitted symbol directly by treating the signal of the strong user as interference due to its limited power. On the other hand, the strong user has to decode the signal of the weak user and then remove it from the received signal by using the SIC technique in order to detect its own symbol. In practice, this SIC procedure is likely to be imperfect, which can be quantified by a factor λ , where λ = 1 refers to fully invalid SIC, while λ = 0 implies perfect one. In this way, the received signal after SIC of both users can be described as
y 1 = p 1 h 1 x 1 + λ p 2 h 2 x 2 + n 1 z 1 ,
and  
y 2 = p 2 h 2 x 2 + p 1 h 2 x 1 + n 2 z 2
where x 1 and x 2 are the transmitted signals of user 1 and user 2, respectively, subject to potential IGS with impropriety degrees κ 1 and κ 2 . The terms z 1 and z 2 are the interference-plus-noise elements of the strong user and the weak user, respectively. Following the analysis in [23], the achievable rate of user i can be described as
R i = 1 2 log 2 σ y i 4 | σ ˜ y i 2 | 2 σ z i 4 | σ ˜ z i 2 | 2 = log 2 σ y i 2 σ z i 2 R i , Proper + 1 2 log 2 1 σ y i 4 | σ ˜ y i 2 | 2 1 σ z i 4 | σ ˜ z i 2 | 2 R i , Improper ,
where σ y i 2 ( σ z i 2 ) and σ ˜ z i 2 ( σ ˜ z i 2 ) are the variance and complementary variance of y i ( z i ) , respectively, provided by
σ y 1 2 = E { y 1 y 1 * } = p 1 | h 1 | 2 + λ 2 p 2 | h 1 | 2 + σ n 1 2 ,
σ ˜ y 1 2 = E { y 1 2 } = h 1 2 p 1 κ 1 e j ϕ 1 + λ 2 h 1 2 p 2 κ 2 e j ϕ 2 ,
σ y 2 2 = E { y 2 y 2 * } = p 2 | h 2 | 2 + p 1 | h 2 | 2 + σ n 2 2 ,
σ ˜ y 2 2 = E { y 2 2 } = h 2 2 p 1 κ 1 e j ϕ 1 + h 2 2 p 2 κ 2 e j ϕ 2 ,
and
σ z 1 2 = E { z 1 z 1 * } = λ 2 p 2 | h 1 | 2 + σ n 1 2 ,
σ ˜ z 1 2 = E { z 1 2 } = λ 2 h 1 2 p 2 κ 2 e j ϕ 2 ,
σ z 2 2 = E { z 2 z 2 * } = p 1 | h 2 | 2 + σ n 2 2 ,
σ ˜ z 2 2 = E { z 2 2 } = h 2 2 p 1 κ 1 e j ϕ 1 ,
where ϕ 1 and ϕ 2 are the phases of the complementary variance of x 1 and x 2 , respectively. By substituting (5), (6), (9) and (10) into (4), we obtain the achievable rate of user 1 as
R 1 ( κ 1 , p 1 , κ 2 , p 2 , ϕ 1 , ϕ 2 )   = log 2 1 + | h 1 | 2 p 1 σ n 1 2 + λ 2 | h 1 | 2 p 2 R 1 , Proper + 1 2 log 2 1 σ y 1 4 | σ ˜ y 1 2 | 2 1 σ z 1 4 | σ ˜ z 1 2 | 2 R 1 , Improper ,     =   1 2 log 2 ( Γ 1 p 1 + Γ 1 p 2 λ 2 + 1 ) 2 Γ 1 2 | p 1 κ 1 + λ 2 p 2 κ 2 e j ( ϕ 2 ϕ 1 ) | 2 ( λ 2 Γ 1 p 2 + 1 ) 2 λ 4 Γ 1 2 p 2 2 κ 2 2 .
Similarly, after some algebraic manipulations, the achievable rate of the weak user can be calculated as
R 2 ( κ 1 , p 1 , κ 2 , p 2 , ϕ 1 , ϕ 2 )   = log 2 1 + | h 2 | 2 p 2 σ n 2 2 + | h 2 | 2 p 1 R 2 , Proper + 1 2 log 2 1 σ y 2 4 | σ ˜ y 2 2 | 2 1 σ z 2 4 | σ ˜ z 2 2 | 2 R 2 , Improper ,     =   1 2 log 2 ( Γ 2 p 1 + Γ 2 p 2 + 1 ) 2 Γ 2 2 | p 1 κ 1 + p 2 κ 2 e j ( ϕ 2 ϕ 1 ) | 2 ( Γ 2 p 1 + 1 ) 2 Γ 2 2 p 1 2 κ 1 2 ,
Note that, when the transmitted signals of both users adopt the conventional PGS, the data rates in (13) and (14) reduce to the standard one.

3. Fairness Maximization of the NOMA System with IGS Under Perfect SIC

In this section, we investigate the joint impropriety degree and power allocation design to achieve the max–min fairness for the IGS-assisted NOMA system with perfect SIC, which aims to provide fairness for both users. In this case, by setting λ = 0 , the achievable data rate of the strong user simplifies to
R 1 ( κ 1 , p 1 ) = 1 2 log 2 ( ( 1 κ 1 2 ) Γ 1 2 p 1 2 + 2 Γ 1 p 1 + 1 ) ,
and the max–min optimization can be formulated as
( P 1 ) : max κ 1 , p 1 , κ 2 , p 2 , ϕ 1 , ϕ 2 min { R 1 ( κ 1 , p 1 ) , R 2 ( κ 1 , p 1 , κ 2 , p 2 , ϕ 1 , ϕ 2 ) } s . t . ( C 1 ) : 0 p 1 p 2 P 2 , ( C 2 ) : 0 κ 1 , κ 2 1 , ( C 3 ) : 0 ϕ 1 , ϕ 2 π ,
where R 1 and R 2 refer to (15) and (14), respectively, (C1) represents the power constraint according to the NOMA principle, (C2) guarantees the validity of the impropriety degree, and (C3) reflects the phase constraints of the complementary variance for the transmitted signals. The constrained optimization in (P1) is obviously non-convex and involves difficulty finding the global solution.
In (14), evidently, the phases of the complementary variance of the transmitted signals of both users, i.e.,  x 1 and x 2 , are independent of the other unknown parameters, i.e.,  p i and κ i . Moreover, it is straightforward that R 2 in (14) is maximized when the term | p 1 κ 1 + p 2 κ 2 e j ( ϕ 2 ϕ 1 ) | 2 is minimized. In this way, we have
ϕ 2 ϕ 1 = π .
Observe, from the data rates in (13) and (14), we are able to find that the transmitted signal of the weak user, x 2 , does not interfere with the received signal of the strong user, y 1 . To this end, the optimal transmission strategy for the weak user is to transmit with its maximum power budget, P 2 .
From the above discussion, the solution to
min p 1 P 2 , 0 κ i 1 , i { 1 , 2 } | p 1 κ 1 + P 2 κ 2 e j π | 2 ,
can be calculated as
κ 2 = κ 1 p 1 P 2 .
Note that the above equation always holds since p 1 P 2 and 0 κ 1 1 . In this way, by substituting (19) into (14), the achievable rate of the weak user can be reformulated as
R 2 ( κ 1 , p 1 ) = 1 2 log 2 ( Γ 2 p 1 + Γ 2 P 2 + 1 ) 2 ( Γ 2 p 1 + 1 ) 2 Γ 2 2 p 1 2 κ 1 2 .
Now, the max–min fairness in (16) is simplified to
( P 2 ) : max κ 1 , p 1 min { R 1 ( κ 1 , p 1 ) , R 2 ( κ 1 , p 1 ) } s . t . ( C 4 ) : 0 p 1 P 2 , ( C 5 ) : 0 κ 1 1 .
Observe from (13) that it is easy to examine that R 1 ( κ 1 , p 1 ) is a monotonically decreasing function of κ 1 and a monotonically increasing function of p 1 , while R 2 ( κ 1 , p 1 ) is a monotonically increasing function of κ 1 and a monotonically decreasing function of p 1 . It is difficult to obtain the optimal solution to the optimization in (21). In this work, we proposed an alternate optimization algorithm to solve it by iteratively optimizing the impropriety degree and power allocation of the strong user, i.e.,  κ 1 and p 1 .

3.1. Optimal κ 1 Design for IGS-Based NOMA with Perfect SIC

With fixed p 1 , the max–min fairness in (21) can be reformulated as
( P 3 ) : max κ 1 min { R 1 ( κ 1 ) , R 2 ( κ 1 ) } s . t . ( C 4 )
This optimization also presents difficulty in obtaining the closed-form solution since R 1 ( κ 1 ) is a monotonically decreasing function of κ 1 , while R 2 ( κ 1 ) is a monotonically increasing function of κ 1 . In this sense, there must exist κ 1 * that satisfies
R 1 ( κ 1 * ) = R 2 ( κ 1 * ) ,
which can be determined by the well-known bisection method, as shown in Algorithm 1.   
Algorithm 1: Bisection-based algorithm for solving (23)
1 
  Initialization
2 
  Set κ 1 a and κ 1 b with R 1 ( κ 1 a ) < R 2 ( κ 1 b ) and the opposite case, respectively. Define the stopping criterion ε .
3 
  Repeat
4 
  Update κ 1 = ( κ 1 a + κ 1 b ) / 2
5 
  Compute R 1 ( κ 1 ) and R 2 ( κ 1 )
6 
  If  R 1 ( κ 1 ) < R 2 ( κ 1 )
7 
   κ 1 a = κ 1
8 
  Else κ 1 b = κ 1
9 
  EndIf
10 
Until  | R 1 ( κ 1 a ) R 2 ( κ 1 b ) | < ε

3.2. Optimal p 1 Design for IGS-Based NOMA with Perfect SIC

With the κ 1 in hand, we can rewrite the max–min fairness in (21) as
( P 4 ) : max p 1 min { R 1 ( p 1 ) , R 2 ( p 1 ) } s . t . ( C 5 )
Similar to the optimization in (22), it is also not feasible to obtain the closed-form solution. This is because R 1 ( p 1 ) is a monotonically increasing function of p 1 , while R 2 ( p 1 ) is a monotonically decreasing function of p 1 . We resort to finding p 1 * , which satisfies
R 1 ( p 1 * ) = R 2 ( p 1 * ) ,
which can also be solved by the bisection method in Algorithm 1.
The overall AO algorithm to solve the max–min fairness in (21) is summarized in Algorithm 2. It is straightforward that the optimization of the unknown parameters follows the sequence , κ 1 ( i ) , p 1 ( i ) , κ 1 ( i + 1 ) , p 1 ( i + 1 ) , , where i and i + 1 represent the number of iterations. Considering the iteration process, we have  
max p 1 ( i ) min { R 1 ( p 1 ( i ) ) , R 2 ( p 1 ( i ) ) } max κ 1 ( i ) min { R 1 ( κ 1 ( i ) ) , R 2 ( κ 1 ( i ) ) }
and
max κ 1 ( i + 1 ) min { R 1 ( κ 1 ( i + 1 ) ) , R 2 ( κ 1 ( i + 1 ) ) } max p 1 ( i ) min { R 1 ( p 1 ( i ) ) , R 2 ( p 1 ( i ) ) } .
Observing Equations (26) and (27), the convergence of Algorithm 2 can be guaranteed.   
Algorithm 2: The overall AO algorithm for solving (21)
1 
Initialization
2 
Set the total iteration number as T i .
3 
Repeat
4 
Compute κ 1 in (23) according to Algorithm 1.
5 
Compute p 1 in (25) according to Algorithm 1.
6 
Until the maximum iterative number T i .
7 
Output:  κ 1 and p 1 .

3.3. Optimal Solution for PGS-Based NOMA with Perfect SIC

When κ 1 = κ 2 = 0 and ϕ 1 = ϕ 2 = 0 , the corresponding achievable rates in (13) and (14) reduce to the standard PGS one, which can be described as
R 1 p ( p 1 ) = log 2 ( Γ 1 p 1 + 1 ) ,
and
R 2 p ( p 1 , p 2 ) = log 2 Γ 2 p 1 + Γ 2 p 2 + 1 Γ 2 p 1 + 1 ,
respectively. In this way, the max–min fairness for the conventional PGS NOMA system becomes
( P 5 ) : max p 1 min { R 1 p ( p 1 ) ) , R 2 p ( p 1 , p 2 ) ) } s . t . ( C 1 )
Similar to the analysis above in Equation (18), the optimal transmission strategy for the weak user is to allocate its power budget, P 2 , and, subsequently, the optimization in (30) simplifies to
( P 6 ) : max p 1 min { log 2 ( Γ 1 p 1 + 1 ) , log 2 ( Γ 2 p 1 + Γ 2 P 2 + 1 Γ 2 p 1 + 1 ) } s . t . ( C 4 )
This optimization is solved according to
log 2 ( Γ 1 p 1 + 1 ) = log 2 ( Γ 2 p 1 + Γ 2 P 2 + 1 Γ 2 p 1 + 1 )
or equivalently,
Γ 1 p 1 = Γ 2 P 2 Γ 2 p 1 + 1 .
After a few mathematical manipulations, the optimal power allocation for user 1 can be obtained as  
p 1 * = Γ 1 + Γ 1 2 + 4 Γ 1 Γ 2 2 P 2 2 Γ 1 Γ 2 .
In this way, the optimal closed-form solution to (30) can be provided by
max min { R 1 p , R 2 p } = log 2 2 Γ 2 Γ 1 + Γ 1 2 + 4 Γ 1 Γ 2 2 P 2 2 Γ 2 .

4. Fairness Maximization of the NOMA System with IGS Under Imperfect SIC

In this section, we consider the fairness maximization of the NOMA system with IGS and imperfect SIC. In (13) and (14), R i , i = 1 , 2 , can achieve its maximum when ϕ 2 = ϕ 1 + π . In this sense, the resulting data rates can be rewritten as
R 1 ( κ 1 , p 1 , κ 2 , p 2 ) = 1 2 log 2 ( Γ 1 p 1 + Γ 1 p 2 λ 2 + 1 ) 2 Γ 1 2 ( p 1 κ 1 λ 2 p 2 κ 2 ) 2 ( λ 2 Γ 1 p 2 + 1 ) 2 λ 4 Γ 1 2 p 2 2 κ 2 2 ,
and
R 2 ( κ 1 , p 1 , κ 2 , p 2 ) = 1 2 log 2 ( Γ 2 p 1 + Γ 2 p 2 + 1 ) 2 Γ 2 2 ( p 1 κ 1 p 2 κ 2 ) 2 ( Γ 2 p 1 + 1 ) 2 Γ 2 2 p 1 2 κ 1 2 .
Then, the objective of joint power allocation and impropriety degree optimization to maximize the minimum data rate can be described as
max κ 1 , p 1 , κ 2 , p 2 min { R 1 , R 2 } s . t . 0 p 1 p 2 P 2 , 0 κ 1 , κ 2 1 , R i R
where R 1 and R 2 refer to (36) and (37), respectively; R is the rate constraint of both users. The above optimization (38) is obviously non-convex and difficult to solve since the optimization variables, κ 1 , p 1 , κ 2 , p 2 , appear in both the nominators and denominators. To address this issue, the deep Q network (DQN) is introduced to solve the joint power allocation and impropriety degree optimization. This is because the max–min optimization can be expressed as a discrete action space, where value-based methods are more suitable and stable than policy gradient approaches. Moreover, DQN achieves higher sample efficiency through experience replay, making it a more practical choice [35].
Q learning is an off-policy and model-free reinforcement learning algorithm, which aims to maximize the action–value (Q) function Q π ( s , a ) under policy π , and its convergence to optimal can be guaranteed, where s and a represent the finite discrete set of state A and action 𝒮, respectively. The optimal Q-function Q ( s , a ) can be provided by
Q ( s , a )   = max E n = 1 γ n 1 r t + n | s t = s , a t = a ,   =   E { r + γ max a Q ( s , a ) | s t = s , a t = a } ,
where r , s , and a are the reward, state, and action, respectively, and  γ is the discount coefficient with 0 γ 1 , which determines the importance of future rewards. By means of immediate rewards, the update of the Q-function can be described as  
Q ( s , a ) = ( 1 α ) Q ( s , a ) + α [ r + γ max a Q ( s , a ) ] ,
where α ( 0 , 1 ) is the learning rate, which determines the balance between the speed of learning and convergence behavior; i.e., a larger α leads to a faster learning procedure yet results in non-convergence of learning procedure, and a smaller α leads to a slower learning procedure. However, the Q table becomes large when the numbers of actions and states are large, which causes great challenges in storage and reading, and the update of Q-function becomes no longer accurate. To overcome these challenges, a deep neural network (DNN) is introduced into Q learning to approximate the Q-function, referred to as DQN. By denoting the DNN parameters as θ , the DQN structure can be defined as Q ( s , a , θ ) . The training goal of DQN is to obtain the best values of the weights θ . Typically, we adopt the hyperbolic tangent function tanh ( · ) as the nonlinear activation function, indicated by
tanh ( x ) = e x e x e x + e x .
One key mechanism in DQN is the experience replay strategy, which stores the quadruple ( s , a , r , s ) in its memory pool. The neural network is trained by randomly sampling mini-batches of experience data in a first-in-first-out manner from this memory pool. Moreover, to enhance its stability, two NNs, i.e., the quasi-static target network and the local network, are introduced into DQN by updating the weights of target network as θ = θ for every T timeslot, while the weights of the local network θ are trained in each timeslot. We use the mean squared error (MSE) loss function in the training phase, indicated by
J ( θ ) = Ω ( r + γ max a Q ( s , a , θ ) Q ( s , a , θ ) ) 2 .
To solve the fairness maximization for the downlink NOMA system with improper Gaussian signaling and imperfect SIC in (38), the action space, state space, and the reward function are defined as follows.
1.
State space: The state space is defined as
s t = { p 1 , p 2 , κ 1 , κ 2 , r t 1 } ,
where r t 1 is the reward obtained in the last timeslot.
2.
Action space: The action space involves different levels of power and impropriety degrees. Let the power and impropriety degree levels uniformly distribute in [ 0 , P 2 ] and [ 0 , 1 ] , respectively. The corresponding quantized intervals are assumed to be Δ p and Δ i , and thus the power and impropriety degree levels can be calculated as N p = Δ p and N i = Δ i , respectively. In this way, the action space is defined as
A = { a 1 , a 2 , , a u , , a N 1 N 2 } ,
where
a u = [ p 1 , n 1 , κ l 1 , p 1 , n 2 , κ l 2 ] T ,
in which n 1 , n 2 = 1 , 2 , , N p , l 1 , l 2 = 1 , 2 , , N i . The adaptive ϵ -greedy strategy is used to conduct action selection, provided by
  • As time goes on, the action selection strategy becomes less inclined to take a random action from A . In this sense, the decreasing probability ϵ t at timeslot t, defined as 
    ϵ t = max ϵ m i n , ϵ 0 ( 1 t T + t T T ) ,
    is used in this work, where ϵ 0 , ϵ m i n , T, T , and t are the initial exploration probability, the minimum exploration probability, the number of training timeslots, the number of training episodes, and the time sample of the current training episode, respectively.
  • Take the action that maximizes the Q-value Q ( s , a , θ ) with probability 1 ϵ t .
3.
Reward function: The immediate achieved fairness data rate is defined as the instantaneous reward, provided by
r t + 1 = R 1 , i f R 1 < R 2 & p 1 < p 2 & R i R R 2 , i f R 1 > R 2 & p 1 < p 2 & R i R 0 , otherwise .
Note that the discount coefficient γ , the replay memory size N r , the learning rate α , the initial value of the adaptive ϵ -greedy strategy ϵ 0 , its minimum value ϵ m i n , the batch size N b , the weights of the local and the target DNN, θ and θ , the copy frequency of the target DNN weights N c , and the total number of training episodes T e are initialized before training. The total number of action spaces becomes larger when the quantized intervals, i.e.,  Δ p and Δ i , become smaller, which makes the design of neural network infeasible. In this sense, the output of neural network is designed to be the movement directions of the unknown parameters. Specifically, the outputs of 1 and 1 represent a decrease and an increase for the parameters to be optimized, respectively, while the output of 0 denotes remaining unchanged. The iterative DQN implementation for the fairness maximization of the downlink NOMA sytem with improper signaling and imperfect SIC is summarized in Algorithm 3.   
Algorithm 3: Rate fairness maximization for the downlink NOMA with IGS and imperfect SIC: the DQN solution
1 
Initialization
2 
Initialize ϵ 0 , ϵ m i n , N r , γ , N b , α , the weights of both DNNs as θ = θ , and  N c .
3 
for  t = 0 , 1 , , T do
4 
  for  t = 0 , 1 , , T do
5 
    Generate the decreasing probability ϵ t according to (46).
6 
    Generate a random number 0 < η < 1 .
7 
    if  η < ϵ
8 
      Select a random action a t ;
9 
    else
10 
      Calculate the action pair a t based on the output of the DNN.
11 
    end if
12 
    Calculate the reward r t via the chosen action a t , and then store the transition ( s t , a t , r t , s t + 1 ) in the replay memory.
13 
    Sample a random mini-batch of size N b from the replay memory and train the local DQN with the MSE loss function in (42).
14 
    if  t % N c = 0 then
15 
      Update the weights of the target DNN according to θ = θ .
16 
    end if
17 
  end for
18 
end for

5. Simulation Results

Numerical experiments were performed in order to verify the advantages of the proposed IGS-based signaling scheme for the downlink NOMA system in terms of max–min fairness. The signal-to-noise ratio (SNR) of each user was defined as p i / σ n i 2 , i = 1 , 2 .

5.1. Fairness Rate for the IGS-Based NOMA Under Perfect SIC

This subsection investigates the fairness achievable rate for the IGS-based NOMA system under perfect SIC in the MATLAB 2024b programming environment. We first simulated the fairness achievable rate for the IGS-assisted NOMA system as a function of Γ 2 against different levels of SNR in Figure 1. The fairness achievable rate was computed according to the alternate optimization method, as shown in Algorithm 2. The SNRs of both users were 13 dB and 15 dB, respectively. A considerable rate improvement can be observed from Figure 1 by using IGS instead of PGS. In Figure 1, it is obvious that a higher SNR results in a larger fairness achievable rate since a higher SNR implies a corresponding power budget.
We also investigated the fairness achievable rate for the IGS-assisted NOMA system as a function of SNR against different levels of Γ 2 in Figure 2. The channel-to-noise ratios were 0.5 and 0.8, respectively. As shown in Figure 2, a larger Γ 2 results in a greater fairness achievable rate for the IGS-based NOMA system, and it also highlights the superiority of the advanced IGS technique.

5.2. Fairness Rate for the IGS-Based NOMA Under Imperfect SIC

In this subsection, we investigate the fairness achievable rate for the IGS-based NOMA system under imperfect SIC in Python 3.9.19 with Pytorch 2.2.2 programming environment. The fully connected DNN in DQN consists of two hidden layers, each with 512 neurons. The replay memory size, the discount coefficient, the batch size, and the learning rate are set as N r = 1000 , γ = 0.9 , N b = 64 , and α = 10 4 , respectively. The copy frequency mapping the local DNN weights θ to the target DNN weights θ was set as N c = 10 . The training parameters are summarized in Table 1. We first compare the fairness rate for the IGS-based NOMA system with different combinations of Δ p , Δ i , and λ and different SNRs in Table 2. It is obvious that the achievable rate of Δ p = Δ i = 0.05 is almost the same as the case Δ p = Δ i = 0.01 , while it suffers different levels of rate performance attenuation for other cases. We set Δ p = Δ i = 0.05 in the following simulations to find a trade-off between fast convergence and fairness performance. The convergence behavior of the DQN in terms of training loss is illustrated in Figure 3. As indicated in Figure 3, it is suitable to set the number of training episodes as T e = 500 in order to balance training cost and fairness performance.
We also investigated the fairness achievable rate for IGS-based NOMA system as a function of λ for different SNRs under imperfect SIC in Figure 4. The advantage of the IGS over PGS for the fairness of the NOMA system is obviously observed from Figure 4, where a higher SNR results in a larger achievable rate. Moreover, the rate enhancement is highlighted when the imperfect SIC becomes larger. In the last set of simulations, the fairness achievable rate for IGS-based NOMA system as a function of λ for different Γ 1 was considered under imperfect SIC in Figure 5. It is straightforward that a higher value of Γ 1 results in a higher fairness rate. All the simulation results demonstrate that the advanced IGS technique has the potential to enhance the fairness rate performance of the IGS-based NOMA system over the PGS one.

6. Conclusions

In this work, we consider the fairness rate optimization of the downlink NOMA system employing the novel improper signaling technique with both users subjected to a potential IGS. We first formulate the max–min optimization, and then the alternate optimization algorithm is considered by iteratively optimizing the impropriety degrees and power level under perfect SIC. For the PGS case, the closed-form solution is also investigated. Moreover, a DQN-based solution to fairness optimization is applied to verify the effectiveness of the IGS-based downlink NOMA system under imperfect SIC. Numerical experiments show that the proposed IGS-based signaling scheme is able to achieve a higher fairness rate than the traditional PGS-based one.

Author Contributions

Conceptualization, H.C., M.Z., and R.S.; methodology, H.C.; writing—original draft preparation, H.C.; writing—review and editing, H.C., M.Z., and R.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China under Grants 62301271 and 62401290, the open research fund of National Mobile Communications Research Laboratory, Southeast University (No. 2025D13), and the Natural Science Research Start-up Foundation of Recruiting Talents of Nanjing University of Posts and Telecommunications under Grants No. NY222083 and No. NY222082.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wang, C.; You, X.; Gao, X.; Zhu, X.; Li, Z.; Zhang, C.; Wang, H.; Huang, Y.; Chen, Y.; Haas, H.; et al. On the Road to 6G: Visions, Requirements, Key Technologies, and Testbeds. IEEE Commun. Surv. Tutor. 2023, 25, 905–974. [Google Scholar] [CrossRef]
  2. Chafii, M.; Bariah, L.; Muhaidat, S.; Debbah, M. Twelve Scientific Challenges for 6G: Rethinking the Foundations of Communications Theory. IEEE Commun. Surv. Tutor. 2023, 25, 868–904. [Google Scholar] [CrossRef]
  3. Atzori, L.; Iera, A.; Morabito, G. The Internet of Things: A survey. Comput. Netw. 2010, 54, 2787–2805. [Google Scholar] [CrossRef]
  4. Zanella, A.; Bui, N.; Castellani, A.; Vangelista, L.; Zorzi, M. Internet of Things for smart cities. IEEE Internet Things J. 2014, 1, 22–32. [Google Scholar] [CrossRef]
  5. Zhong, Y.; Dutkiewicz, E.; Yang, Y.; Zhu, X.; Zhou, Z.; Jiang, T. Internet of mission-critical things: Human and animal classification-A device-free sensing approach. IEEE Internet Things J. 2018, 5, 3369–3377. [Google Scholar] [CrossRef]
  6. Zhao, H.; Tan, Y.; Guo, K.; Xia, W.; Xu, B.; Quek, T.Q.S. Client Scheduling for Multi-Server Federated Learning in Industrial IoT With Unreliable Communications. IEEE Internet Things J. 2024, 11, 16478–16490. [Google Scholar] [CrossRef]
  7. Ji, B.; Zhang, X.; Mumtaz, S.; Han, C.; Li, C.; Wen, H.; Wang, D. Survey on the internet of vehicles: Network architectures and applications. IEEE Commun. Stand. Mag. 2020, 4, 34–41. [Google Scholar] [CrossRef]
  8. Al-Dulaimi, O.M.K.; Al-Dulaimi, A.M.K.; Alexandra, M.O.; Al-Dulaimi, M.K.H. Strategy for Non-Orthogonal Multiple Access and Performance in 5G and 6G Networks. Sensors 2023, 23, 1705. [Google Scholar] [CrossRef]
  9. Wang, D.; Wu, M.; He, Y.; Pang, L.; Xu, Q.; Zhang, R. An HAP and UAVs Collaboration Framework for Uplink Secure Rate Maximization in NOMA-Enabled IoT Networks. Remote Sens. 2022, 14, 4501. [Google Scholar] [CrossRef]
  10. Liu, Y.; Gao, H.; Cheng, H.; Xia, Y.; Pei, W. Outage Performance Analysis of Improper Gaussian Signaling for Two-User Downlink NOMA Systems with Imperfect Successive Interference Cancellation. Entropy 2023, 25, 1172. [Google Scholar] [CrossRef]
  11. Vidal-Beltrán, S.; López-Bonilla, J.L. Improving Spectral Efficiency in the SCMA Uplink Channel. Mathematics 2021, 9, 651. [Google Scholar] [CrossRef]
  12. Sultana, A.; Woungang, I.; Anpalagan, A.; Zhao, L.; Ferdouse, L. Efficient resource allocation in SCMA-enabled device-to-device communication for 5G networks. IEEE Trans. Veh. Technol. 2020, 69, 5343–5354. [Google Scholar] [CrossRef]
  13. Yu, L.; Liu, Z.; Wen, M.; Cai, D.; Dang, S.; Wang, Y.; Xiao, P. Sparse code multiple access for 6G wireless communication networks: Recent advances and future directions. IEEE Commun. Stand. Mag. 2021, 5, 92–99. [Google Scholar] [CrossRef]
  14. Islam, S.M.R.; Avazov, N.; Dobre, O.A.; Kwak, K. Power-domain non-orthogonal multiple access (NOMA) in 5G systems: Potentials and challenges. IEEE Commun. Surv. Tutor. 2017, 19, 721–742. [Google Scholar] [CrossRef]
  15. Wang, P.; Ye, N.; Li, J.; Di, B.; Wang, A. Asynchronous Multi-User Detection for Code-Domain NOMA: Expectation Propagation Over 3D Factor-Graph. IEEE Trans. Veh. Technol. 2022, 71, 10770–10781. [Google Scholar] [CrossRef]
  16. Dai, L.; Wang, B.; Yuan, Y.; Han, S.; Chih-Lin, I.; Wang, Z. Non-orthogonal multiple access for 5G: Solutions, challenges, opportunities, and future research trends. IEEE Commun. Mag. 2015, 53, 74–81. [Google Scholar] [CrossRef]
  17. Liang, L.; Xie, S.; Li, G.Y.; Ding, Z.; Yu, X. Graph-based resource sharing in vehicular communication. IEEE Trans. Wirel. Commun. 2018, 17, 4579–4592. [Google Scholar] [CrossRef]
  18. Higuchi, K.; Kishiyama, Y. Non-orthogonal access with random beamforming and intra-beam SIC for cellular MIMO downlink. In Proceedings of the 2013 IEEE 78th Vehicular Technology Conference (VTC Fall), Las Vegas, NV, USA, 2–5 September 2013. [Google Scholar]
  19. Zeng, M.; Yadav, A.A.; Dobre, O.; Tsiropoulos, G.I.; Poor, H.V. Capacity comparison between MIMO-NOMA and MIMO-OMA with multiple users in a cluster. IEEE J. Sel. Areas Commun. 2017, 35, 2413–2424. [Google Scholar] [CrossRef]
  20. Neeser, F.D.; Massey, J.L. Proper complex random processes with applications to information theory. IEEE Trans. Inf. Theory 1993, 39, 1293–1302. [Google Scholar] [CrossRef]
  21. Schreier, P.J.; Scharf, L.L. Statistical Signal Processing of Complex-Valued Data: The Theory of Improper and Noncircular Signals; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar]
  22. Santamaria, I.; Crespo, P.M.; Lameiro, C.; Schreier, P.J. Information-theoretic analysis of a family of improper discrete constellations. Entropy 2018, 20, 45. [Google Scholar] [CrossRef]
  23. Zeng, Y.; Yetis, C.M.; Gunawan, E.; Guan, Y.L.; Zhang, R. Transmit optimization with improper Gaussian signaling for interference channels. IEEE Trans. Signal Process. 2013, 61, 2899–2913. [Google Scholar] [CrossRef]
  24. Javed, S.; Amin, O.; Shihada, B.; Alouini, M.-S. A Journey From Improper Gaussian Signaling to Asymmetric Signaling. IEEE Commun. Surv. Tutor. 2020, 22, 1539–1591. [Google Scholar] [CrossRef]
  25. Ho, Z.K.M.; Jorswieck, E. Improper Gaussian signaling on the two-user SISO interference channel. IEEE Trans. Wirel. Commun. 2012, 11, 3194–3203. [Google Scholar] [CrossRef]
  26. Soleymani, M.; Lameiro, C.; Santamaria, I.; Schreier, P.J. Robust Improper Signaling for Two-User SISO Interference Channels. IEEE Trans. Commun. 2019, 67, 4709–4723. [Google Scholar] [CrossRef]
  27. Nguyen, H.D.; Zhang, R.; Sun, S. Improper Signaling for Symbol Error Rate Minimization in K-User Interference Channel. IEEE Trans. Commun. 2015, 63, 857–869. [Google Scholar] [CrossRef]
  28. Soleymani, M.; Lameiro, C.; Santamaria, I.; Schreier, P.J. Improper Signaling for SISO Two-User Interference Channels With Additive Asymmetric Hardware Distortion. IEEE Trans. Commun. 2019, 67, 8624–8638. [Google Scholar] [CrossRef]
  29. Yu, H.; Tuan, H.D.; Dutkiewicz, E.; Poor, H.V.; Hanzo, L. Maximizing the Geometric Mean of User-Rates to Improve Rate-Fairness: Proper vs. Improper Gaussian Signaling. IEEE Trans. Wirel. Commun. 2022, 21, 295–309. [Google Scholar] [CrossRef]
  30. Soleymani, M.; Santamaria, I.; Jorswieck, E.A. Rate Splitting in MIMO RIS-Assisted Systems With Hardware Impairments and Improper Signaling. IEEE Trans. Veh. Technol. 2023, 72, 4580–4597. [Google Scholar] [CrossRef]
  31. Fang, J.; Zhang, C.; Wu, Q.; Li, A. Improper Gaussian Signaling for IRS Assisted Multiuser SWIPT Systems With Hardware Impairments. IEEE Trans. Veh. Technol. 2023, 72, 13024–13038. [Google Scholar] [CrossRef]
  32. Fang, J.; Zhang, C.; Wu, Q.; Zeng, Y.; Shi, Q. A Two-Layer Iterative Algorithm for Max-Min Rate Optimization in IRS Assisted Multiuser Systems with Improper Gaussian Signaling. IEEE Trans. Commun. 2024, 72, 7596–7610. [Google Scholar] [CrossRef]
  33. Tuan, H.D.; Nasir, A.A.; Nguyen, H.H.; Duong, T.Q.; Poor, H.V. Non-Orthogonal Multiple Access With Improper Gaussian Signaling. IEEE J. Sel. Top. Signal Process. 2019, 13, 496–507. [Google Scholar] [CrossRef]
  34. Nasir, A.A.; Tuan, H.D.; Nguyen, H.H.; Duong, T.Q.; Poor, H.V. Signal Superposition in NOMA With Proper and Improper Gaussian Signaling. IEEE Trans. Commun. 2020, 68, 6537–6551. [Google Scholar] [CrossRef]
  35. Miuccio, L.; Riolo, S.; Bennis, M.; Panno, D. Design of a Feasible Wireless MAC Communication Protocol via Multi-Agent Reinforcement Learning. In Proceedings of the 2024 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN), Stockholm, Sweden, 5–8 May 2024. [Google Scholar]
Figure 1. The fairness achievable rate for the IGS-assisted NOMA system as a function of Γ 2 against different levels of SNR under perfect SIC.
Figure 1. The fairness achievable rate for the IGS-assisted NOMA system as a function of Γ 2 against different levels of SNR under perfect SIC.
Applsci 15 09970 g001
Figure 2. The fairness achievable rate for the IGS-assisted NOMA system as a function of SNR against different levels of Γ 2 under perfect SIC.
Figure 2. The fairness achievable rate for the IGS-assisted NOMA system as a function of SNR against different levels of Γ 2 under perfect SIC.
Applsci 15 09970 g002
Figure 3. Loss convergence over training episodes.
Figure 3. Loss convergence over training episodes.
Applsci 15 09970 g003
Figure 4. The fairness achievable rate for IGS-based NOMA system as a function of λ for different SNRs.
Figure 4. The fairness achievable rate for IGS-based NOMA system as a function of λ for different SNRs.
Applsci 15 09970 g004
Figure 5. The fairness achievable rate for IGS-based NOMA system as a function of λ for different Γ 1 .
Figure 5. The fairness achievable rate for IGS-based NOMA system as a function of λ for different Γ 1 .
Applsci 15 09970 g005
Table 1. Training parameters for DQN.
Table 1. Training parameters for DQN.
ParametersValues
Size of each hidden layer512
Size of mini-batch64
The discount coefficient 0.9
The learning rate 10 4
OptimizerAdam
Number of hidden layers2
Activation function tanh ( · )
Size of replay memory1000
Table 2. Comparison of the fairness rate for the IGS-based NOMA system with different combinations of Δ p , Δ i , and λ .
Table 2. Comparison of the fairness rate for the IGS-based NOMA system with different combinations of Δ p , Δ i , and λ .
Δ p Δ i λ = 0.15 λ = 0.45
SNR = 12 dB SNR = 15 dB SNR = 12 dB SNR = 15 dB
0.1 0.1 1.69221.97111.31071.6433
0.1 0.05 1.75012.03771.36931.6904
0.05 0.1 1.74282.02541.35501.6838
0.05 0.05 1.83762.11371.44981.7751
0.01 0.01 1.83992.11871.45321.7792
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cheng, H.; Zhang, M.; Su, R. On Rate Fairness Maximization for the Downlink NOMA with Improper Signaling and Imperfect SIC. Appl. Sci. 2025, 15, 9970. https://doi.org/10.3390/app15189970

AMA Style

Cheng H, Zhang M, Su R. On Rate Fairness Maximization for the Downlink NOMA with Improper Signaling and Imperfect SIC. Applied Sciences. 2025; 15(18):9970. https://doi.org/10.3390/app15189970

Chicago/Turabian Style

Cheng, Hao, Min Zhang, and Ruoyu Su. 2025. "On Rate Fairness Maximization for the Downlink NOMA with Improper Signaling and Imperfect SIC" Applied Sciences 15, no. 18: 9970. https://doi.org/10.3390/app15189970

APA Style

Cheng, H., Zhang, M., & Su, R. (2025). On Rate Fairness Maximization for the Downlink NOMA with Improper Signaling and Imperfect SIC. Applied Sciences, 15(18), 9970. https://doi.org/10.3390/app15189970

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop