1. Introduction
Future 6G networks will support data rates of up to 1T bps while also facing the challenge of coping with a large number of mobile devices [
1]. However, limited radio spectrum resources make it difficult to meet the needs of explosive massive data transmission, and visible light communication (VLC) combines the advantages of lighting and communication with hundreds of high-frequency bandwidth spectrum that does not require certification [
2], alleviating the pressure of insufficient radio frequency (RF) spectrum. In addition, VLC has several advantages, such as high data rate, low power consumption, high security, and small latency [
3]. Therefore, VLC is considered to be a promising complementary technology in emerging wireless communications. Data transmission in VLC systems is achieved through intensity modulation on the transmitter side and direct detection on the receiver side, and as with other high-frequency communication technologies, signal propagation at this frequency is highly susceptible to obstacles. Therefore, data transmission between transmitter and receiver in VLC requires a line-of-sight (LOS) path. When there is a non-line-of-sight (NLOS) transmission between the VLC access point (AP) and the user access point, the performance of VLC will degrade sharply. The coverage area of VLC AP is limited, so transmission reliability has become the main concern of the VLC system. Conversely, in the case of NLOS transmission, RF systems can provide a wider coverage area [
4]. Therefore, VLC/RF heterogeneous networks have received great attention in recent years, and they have significantly improved in terms of sum rate [
5], spectral efficiency [
6], power consumption [
7], and energy efficiency [
8]. However, the introduction of VLC into RF systems faces the problem of access point allocation and resource allocation.
For VLC/RF heterogeneous systems, access point allocation and resource allocation are two important issues to be identified before the user transmits data and are also important factors affecting the transmission rate of the system. At present, most of the research on VLC/RF hybrid systems is mainly focused on power allocation (PA), and there are few related studies on how to reasonably allocate APs for users. Ma et al. [
9] developed a two-stage access point selection (APS) method based on a fuzzy logic algorithm to maximize energy efficiency under minimum rate requirements and power constraints. Obeed et al. [
10] proposed an iterative algorithm to assign APs to users and derived a new efficient algorithm to find the optimal dual variable through the correlation of dual variables and allocate the power of each AP to connected users to maximize the total capacity of the system and improve the fairness of the system. Pratamay et al. [
11] propose a bandwidth aggregation protocol to optimize the throughput of the system. However, under the condition of ensuring system stability, there are still some challenges in how to effectively and dynamically allocate AP, bandwidth, and transmit power and jointly optimize system performance. First, different types of APs can make system design difficult. Generally speaking, binary allocation variables and power allocation make the resource allocation problem become a mixed-integer nonlinear programming (MINP) problem, which is a non-convex problem with high computational complexity. Second, resource allocation should be carried out under random channel conditions; as a joint optimization problem, it is usually necessary to optimize multiple parameters at the same time, so the resource allocation optimization problem is affected by many factors and is difficult to solve.
In this paper, we investigate the performance of VLC/RF systems based on orthogonal frequency-division multiple access (OFDMA) technology. The system consists of multiple VLC APs and one RF AP working together to serve multiple users. Using the OFDMA scheme, resources can be divided into the time domain and frequency domain at the same time, and the divided time–frequency block is called a resource block (RB); for each resource block, the smallest resource block that is indivisible is called a resource unit (RU). OFDMA technology allows multiple users to share all subcarriers of a system, but each subcarrier is limited to one user in the same time slot. If other users need to use this subcarrier, they must queue up in the next time slot, which will reduce system efficiency. Allocating system resources to different users can improve the efficiency and flexibility of the entire communication system rather than just allocating subcarriers or time slots. In addition, OFDMA can flexibly select channels of different frequencies for users and adaptively allocate resources, thereby significantly improving the spectral efficiency and transmission rate of the system. At present, in the field of VLC, OFDMA research mainly focuses on spectral efficiency, and its combination with system transmission rate and stability has not been thoroughly studied. In this paper, in the indoor VLC/RF system based on OFDMA technology, Lyapunov optimization technology is used to transform the time averaging problem into a single time slot problem and divide it into three independent subproblems to solve the optimization problem in a single time slot. The simulation results show that the Lyapunov optimization algorithm does not require iteration, which can greatly improve the optimization speed. In addition, compared with the two traditional resource allocation algorithms, the Lyapunov optimization algorithm can significantly improve the average transmission rate of the system and ensure the stability of the system.
3. Optimization of the Problem Description
The goal of this article is to achieve the maximum transfer rate while the system remains stable. Within the time slot
t, the transmission rate of the entire system can be expressed as
To achieve the optimization goal of maximizing the average rate of the system, the above function is designed as a time-averaging expression. Therefore, the stochastic optimization problem is formulated as
where (28) is the system stability requirement; (29) ensures that in each time slot, the user can only be served by one AP; (30) ensures that for the RF AP, only one user can be served on each resource block; (31) represents the average power budget constraint of each AP; (32) ensures that each subchannel of the VLC AP can only serve one user; (33) ensures that for one VLC AP, each subcarrier can only be assigned to one user at a time; and (34) indicates that the transmit power of all APs is not negative and meets the actual conditions.
There are long-term constraints in the above problem, and to effectively handle long-term time-averaged power constraints, virtual queues in Lyapunov optimization can be used to convert (31) into queue stability constraints [
18]. Virtual queues are defined as
Z(
t) ≜ {
Z1(
t),···,
Zl(
t),···,
ZL(
t)},
Zl(
t) and evolved as follows:
It should be noted that there is no actual queue data in queue
Zl(
t), which is only proposed to satisfy the constraint (31). If the virtual queue
Zl(
t) is average-rate stable, constraint (31) automatically holds [
19].
Suppose the order vector of queue length in the system is Θ(
t) = [
Q(
t),
Z(
t)]. The Lyapunov function is defined as a measure of the total queue length for each time slot
t, denoted by the following equation:
In each time slot, the conditional Lyapunov drift is defined as
It can be observed that at each time slot, the Lyapunov function can control the final value of the Lyapunov drift by adjusting the final queue length. The purpose of this paper is to maximize the average system transfer rate, and the drift-penalty function term is defined as follows:
where
V is a non-negative constant parameter used to balance drift with reward. When the
V value increases, the system assigns a higher priority to maximize the transmission rate, that is, sacrifice the queue length; when the
V value decreases, the system prioritizes the queue length to maintain the stability of the system.
For each period, for a given
V ≥ 0, the upper bound of the drift-penalty function for any possible Θ(
t) is
where
C is a constant satisfying the following conditions:
According to the Lyapunov optimization method [
20], the network utility is optimized by minimizing the drift-penalty function term to minimize the queue length. Therefore, Problem (35) can be transformed into minimizing the upper bound of the Lyapunov drift-penalty function on each time slot, which is expressed as
The reformulated optimization problem (41) is a constrained mixed-integer programming problem where takes integer values, and and take continuous values. Therefore, the optimization problem (41) is decomposed into three relatively independent subproblems, and the corresponding optimization algorithm is proposed.
3.1. AP Allocation
The AP allocation variables
a(u,l)(
t) in the system can be obtained by solving the following subproblems:
It can be seen from (42) that the network selection between different users is independent. Therefore, Problem (42) can be decomposed into
U subproblems, and each subproblem is
where
e(u,l)(
t) = (
Qu(
t) +
V)v
(u,l). Since (43) is a linear optimization problem, its solution is as follows:
Under the above network selection strategy, e(u,l)(t) can be regarded as the congestion cost of the AP serving the user within the time slot t. Then, Equation (44) ensures that the user can connect to the AP with the smallest congestion cost.
3.2. VLC AP Joint Subcarrier and Power Allocation
According to the access point allocation variable
a(
t) obtained from subproblem (42), the joint subcarrier and power allocation subproblem in the VLC system can be solved continuously:
Since the selection of subcarriers is independent among different VLC APs, Problem (45) can be decomposed into |
LVLC| subproblems, and each subproblem is expressed as
where
Ul denotes the set of users connected to VLC AP. The VLC joint subcarrier and power allocation problem is a hybrid combinatorial optimization problem. To solve this problem, the Lagrange relaxation method can be used to transform the original integer programming mixed problem into a convex optimization problem. Relaxing
to
, the optimization problem is reformulated as
Since
is a convex function of
and
, think of it as a linear function of
. Therefore, the given optimization problem is a
K × |
Ul| linear combination of convex functions. Let
be the objective function and find the partial derivative of
for
:
According to the Karush–Kuhn–Tucker (KKT) conditions, the system can achieve optimal power allocation when the following conditions are satisfied:
where
is the multiplier for constructing the Lagrange function. To ensure that (51) holds, then
or
. The partial derivative of Formula (49) is 0, and the relationship between
and
can be obtained.
The result of power allocation will be used to obtain the subcarrier allocation. Substituting Equation (53) into Equation
, we obtain
where
Obviously, Problem (54) is a linear assignment problem, which can be decomposed into
k subproblems. Each subproblem is represented by the following equation:
For subcarrier
k, the optimal allocation is to obtain the user with the maximum
, and the solution can be obtained.
3.3. RF AP Joint Resource Block and Power Allocation
In RF communication systems, the RF AP joint resource block and power allocation subproblem can be expressed as
It can be seen from the above problems that the joint RB and power allocation problems are independent of each other on different time slots t. Therefore, the objective function of Problem (58) is a function of discrete variables
and continuous variables
, so the problem is also a mixed-integer nonlinear convex optimization problem, which can be solved by the Lagrange relaxation method. Firstly,
is relaxed to the continuous interval [0, 1], and a new variable
is defined. Problem (58) can be rewritten as
Let the objective function
and calculate the partial derivative of
:
According to the KKT conditions, the system achieves the optimal power allocation when the following conditions are satisfied.
where
is the multiplier for constructing the Lagrange function. To ensure that Equation (63) holds, then
or
. When the partial derivative of Equation (61) is 0, the relationship between
and
can be obtained.
The result of power allocation will be used for the allocation of RF resource blocks. Substituting Equation (63) into
, we can obtain
where
Obviously, Problem (66) is also a linear assignment problem. Therefore, it can be decomposed into
i subproblems, and each subproblem is expressed as
The objective of this problem is to maximize the
of all subcarriers. Therefore, for all subcarriers
i, the optimal allocation is to select the user with the largest
.
4. Analysis of Algorithm Simulation Results
In this section, we analyze the simulation results of the Lyapunov optimization algorithm and verify the performance of the proposed indoor VLC/RF system resource allocation scheme with OFDMA technology. Assuming that the room size is 8 m × 8 m × 3 m, as shown in
Figure 1, 4 VLC APs are installed on the ceiling, which are located at (3,2,3), (5,2,3), (3,5,3) and (5,5,3), respectively, and each AP has 25 subcarriers. The RF AP is located at (0,0,0), and it has four RBs. Other key simulation parameters of the system are shown in
Table 2.
To simplify the simulation, we consider the normalized spectrum bandwidth of VLC AP and RF AP, traffic arrival for each user device follows the Poisson distribution, the average traffic arrival rate is expressed in γ, the traffic arrival rate and RF channel conditions remain constant within a time slot and change on the time slot boundary, γ specific values will be given in the simulation. Since the spectral bandwidth is normalized and the unit of transmitted data volume is set to package, assuming that the average communication rate of the VLC AP subcarrier in the system is five packages and the average communication rate of an RF AP single RB is three packages, the average traffic arrival rate is limited to less than five packages.
Figure 4a shows the relationship between the penalty parameter V and the average queue length Q under different average traffic arrival rates γ. It can be observed that the Q value in the system increases with the increase in V, and as the average traffic arrival rate γ increases, the average queue length Q also increases.
Figure 4b shows the change in penalty parameter V and average transmission rate R under different average traffic arrival rates γ. It can be seen that the average transfer rate R increases with the increase in the V value because when the V value increases, the optimization degree of the algorithm on the average transmission rate increases. In addition, as the average traffic arrival rate γ value increases, the growth rate of the average transmission rate slows down because when the γ value is small, the amount of data transmitted by the system can meet the amount of data arrived. When the γ value is large, the system rejects additional packets to ensure data queue stability. From Little’s theorem [
21], since the average transmission delay is proportional to the average queue length, the average queue length can be used to describe the average delay, and in the Lyapunov optimization process, selecting the appropriate control parameter V can balance the relationship between the average transmission rate and the average transmission delay.
In addition, the instantaneous change in the system is simulated. Assuming that the user’s position is changing in each time slot, the instantaneous change in the user on 30 time slots is given.
Figure 5a–c depict the instantaneous transfer rate and queue length change for four users in the system when the penalty parameter V is valued at 50, 150, and 300. It can be observed that for each user, as the length of the user’s queue increases, so does the data transfer rate for that user. This is because the Lyapunov optimization algorithm makes the system provide a higher transfer rate to reduce the buffer queue, thus ensuring the stability of the entire system. Another conclusion is that the queue length in the system is longer when V = 300 is longer than when V = 50 and 150.
Figure 6 shows the change in average queue length Q and average transfer rate R with the number of users in the system using different algorithms. The Lyapunov optimization algorithm was compared with the minimized distance resource allocation (MD-RA) algorithm and the best channel state information resource allocation (BCSI-RA) algorithm [
22]. Obviously, from
Figure 6a, it can be seen that with the increase in the number of users, the average queue length of the system gradually increases compared to the MD-RA algorithm and BCSI-RA algorithm, the queue length under the Lyapunov optimization algorithm is smaller, and the growth rate is the slower, indicating that the system has better stability when using the Lyapunov algorithm. It can be seen from
Figure 6b that as the number of users in the system increases, the average transmission rate also increases, and the average transmission rate under the Lyapunov optimization algorithm is significantly higher than that of the MD-RA algorithm and BCSI-RA algorithm, and the average transmission rate increases significantly when the number of users is greater than six.
Figure 7 shows the average queue length vs. average transfer rate as a function of the number of users, taking into account the interrupt probability, and compares the performance of the system using the three algorithms. Suppose p represents the probability of LOS availability in VLC systems and generates a random number between 0 and 1; if this number is greater than p, then the LOS of VLC AP is available. Otherwise, LOS is not available. This study shows that the Lyapunov optimization algorithm can achieve higher transmission rates while ensuring that the average queue length of the system is low; that is, the algorithm can provide better system performance.
As shown in
Figure 8, as the penalty parameter V value increases, the average queue length Q and the average transmission rate R in the uplink communication link under different numbers of users also increase. This is because increasing the penalty parameter will cause the system to allocate more resources to the user, thereby optimizing the performance of the system. At the same time, as can be seen from
Figure 8a, as the number of users increases, the queue backlog in the system accelerates because the increase in the number of users leads to a decrease in the uplink communication rate, which affects the performance of the entire system. To balance the relationship between queue length and transmission rate, the appropriate penalty parameter V can be selected to make the system communication more stable.
Figure 8b also shows that the average transfer rate increases gradually as the number of users increases. Changes in both Q and R values indicate a decrease in the communication rate of the upstream RF link. This is because, with limited spectrum resources and base station processing power, users must share resources, resulting in network congestion and transmission delays, which in turn reduce the communication rate of the entire system.
Figure 9 shows the average queue length Q and average transmission rate R in the OFDMA-VLC/RF system and VLC/RF system as a function of the number of users. As shown in
Figure 9a, the queue extrusion speed in the VLC/RF system using OFDMA technology is significantly slower than that in the VLC/RF system without OFDMA technology, and the Q value change in the OFDMA-VLC/RF system is relatively stable, indicating that the system has strong stability. It can be observed from
Figure 9b that as the number of users increases, the average transmission rate of the system gradually increases, and when the number of users increases to 15, the average transmission rate of the OFDMA-VLC/RF system increases by 17 packages compared with the VLC/RF system, because the use of OFDMA technology increases the capacity of the entire communication system, thereby increasing the system transmission rate.
5. Conclusions
In this paper, the resource allocation problem in indoor OFDMA-VLC/RF systems is studied, and an allocation strategy based on the Lyapunov optimization algorithm is proposed. The purpose of this study is to solve the resource allocation problem by maximizing the average transmission rate of the system under the constraints of system stability and the average power budget:
(1) To find the optimal resource allocation strategy, we formulate a random optimization problem, which considers the stability constraints and average power budget of the system and takes the average transmission rate of the system as the optimization goal.
(2) Using the Lyapunov optimization algorithm, the stochastic optimization problem is decomposed into three independent subproblems, and their optimal solutions are given respectively. Through simulation, the effectiveness of the proposed resource allocation strategy is verified. The results show that this strategy significantly improves the average transmission rate of the system while maintaining the stability of the system. This means that our approach can improve system performance by making more efficient use of limited resources in indoor OFDMA-VLC/RF systems.
This study has important practical significance for improving the performance of indoor wireless communication systems and provides a valuable reference for further research. In the future, the results can meet the communication needs between IoT devices and provide efficient communication services. In the smart home system, this technology can realize the interconnection between devices. In addition, the technology can also support real-time interaction and immersive experience of virtual reality (VR) and augmented reality (AR) technologies, providing high-speed and low-latency communication services. In the field of health care, the resource allocation strategy based on this technology can provide high-speed and reliable communication services, support data transmission between medical devices and telemedicine services, and improve the efficiency and quality of health care. In short, the resource allocation strategy of indoor VLC/RF systems based on OFDMA technology and the Lyapunov optimization algorithm has the characteristics of high speed, high efficiency, and reliability in communication applications in various fields, which will promote the development of future communication technology.