Next Article in Journal
Wave Load Reduction and Tranquility Zone Formation Using an Elastic Plate and Double Porous Structures for Seawall Protection
Previous Article in Journal
A Cascaded Data-Driven Approach for Photovoltaic Power Output Forecasting
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Graph Neural Network-Based Beamforming Optimization for Multi-BS RIS-Aided Communication Systems

1
Department of Information and Communication Engineering, Sejong University, Seoul 05006, Republic of Korea
2
Department of Convergence Engineering for Intelligent Drone, Sejong University, Seoul 05006, Republic of Korea
3
Department of Computer Engineering, Sejong University, Seoul 05006, Republic of Korea
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(17), 2732; https://doi.org/10.3390/math13172732
Submission received: 30 July 2025 / Revised: 20 August 2025 / Accepted: 22 August 2025 / Published: 25 August 2025

Abstract

The optimization of beamforming in multi-base station (multi-BS) reconfigurable intelligent surface (RIS)-aided systems is a challenging task due to its high computational complexity. This paper first demonstrates that an optimized multi-BS system exhibits superior communication performance compared to a centralized large-scale single-BS system. To efficiently solve the complex beamforming problem in the multi-BS environment, this paper proposes a novel optimization solver based on a graph neural network (GNN) that models the physical structure of the system. Experimental results show that the proposed GNN solver finds solutions of higher quality, achieving a 42% performance increase with 45% less total computational complexity compared to a conventional iterative optimization method. Furthermore, when compared to other complex AI models such as transformer and Bi-LSTM, the proposed GNN achieves similar state-of-the-art performance while having less than 1% of the parameters and a fraction of the computational cost. These findings demonstrate that the GNN is a powerful, efficient, and practical solution for beamforming optimization in multi-BS RIS-aided systems, satisfying the demands for performance, computational efficiency, and model compactness.

1. Introduction

The advent of beyond-5G (B5G) and 6G communication systems has spurred research into innovative technologies to meet the ever-increasing demands for higher data rates, broader coverage, and improved energy efficiency. Reconfigurable intelligent surfaces (RISs) have emerged as a promising technology capable of intelligently manipulating the wireless propagation environment by controlling the phase shifts of a large number of passive reflecting elements [1,2,3]. An RIS-assisted system can create favorable propagation paths, mitigate interference, and extend coverage, all with minimal energy consumption thanks to the passive nature of its elements.
While initial research has predominantly focused on single-base station (single-BS) scenarios, this paradigm faces fundamental limitations in practical large-scale deployments. Single-BS systems, even those with large numbers of antennas, suffer from coverage holes at the cell edges and are vulnerable to link blockages by obstacles. In contrast, a distributed multi-base station (multi-BS) architecture offers significant advantages. By leveraging macroscopic diversity, multi-BS architectures can provide more robust and uniform coverage while mitigating the impact of shadowing and fading [4]. However, this transition also presents a formidable challenge in the form of joint active and passive beamforming optimization. The RIS must now coherently combine signals originating from multiple geographically distributed sources, creating a large-scale coupled optimization problem that is highly sensitive to inter-cell interference. The computational complexity of finding an optimal solution for this coordinated beamforming problem increases dramatically compared to the single-BS case [5].

1.1. Related Work

Traditional approaches to this problem often rely on iterative optimization techniques such as semidefinite programming (SDP) or alternating optimization (AO) [6,7,8]. While capable of finding high-quality solutions, these methods suffer from prohibitively high computational complexity, often scaling polynomially with the number of RIS elements or antennas. This renders them unsuitable for real-time applications where channel conditions change dynamically.
To overcome this complexity, artificial intelligence (AI) has been explored as a powerful alternative [9,10]. While the multi-BS system in this paper is a form of distributed network, it differs from other prominent architectures. Compared to cell-free massive MIMO [11], which uses a massive number of simple access points, the approach in this paper focuses on a scalable number of powerful base stations. Unlike reconfigurable distributed antenna and reflecting surface (RDARS) [12] or distributed RIS [13] approaches, which distribute reconfigurable or passive surfaces, this paper focuses on the challenge of coordinating distributed active base stations with a centralized RIS.
Despite these architectural differences, all distributed systems share the fundamental challenge of high-complexity joint optimization. This paper focuses on tackling this common computational bottleneck by proposing a novel GNN-based solver. Therefore, the contribution is not a competing architecture but rather an efficient optimization tool that could enhance the practicality of these systems and serve as a foundation for more complex future architectures such as multi-BS, multi-RIS/RDARS systems. Deep neural networks (DNNs), convolutional neural networks (CNNs), and long short-term memory (LSTM) networks have been applied to various wireless communication problems, demonstrating the ability to perform near-instantaneous inference after an offline training phase [14]. However, these architectures often treat the channel information as a generic vector or image, failing to exploit the underlying physical structure of the communication network. This leads to large and complex "black-box" models that require a massive number of parameters and may struggle to adapt to changes in network topology.
Graph neural networks (GNNs) offer a compelling solution to these inefficiencies [15]. A wireless network can be naturally represented as a graph, with its base stations, users, and RIS elements as nodes and the channels as edges. GNNs are specifically designed to operate on such graph-structured data, allowing them to explicitly model the network topology and the relationships between its components. This structural inductive bias makes GNNs a highly promising yet less explored candidate for the multi-BS RIS beamforming problem, with the potential to learn the complex channel interactions more efficiently and with significantly fewer parameters [16,17].

1.2. Contribution and Novelty of This Paper

This paper addresses the limitations of existing methods by proposing a novel and efficient optimization solver based on a GNN. To establish a clear motivation for this research, the paper first experimentally validates the superiority of an optimized multi-BS RIS system over a centralized large-scale single-BS system. The paper then demonstrates, using the GNN architecture itself, that using preprocessed SVD-based channel features is a more effective input strategy for AI solvers than using the full raw channel information. The primary contribution is the proposed GNN-based solver, which is novel in its application to the multi-BS RIS beamforming problem. Through comprehensive simulations, this solver is shown to not only outperform a conventional iterative optimization technique in both solution quality and total computational cost but also to achieve state-of-the-art performance with unparalleled efficiency compared to other AI architectures.

2. System Model and Problem Formulation

This section details the mathematical framework for the communication systems under investigation. First, the conventional single-BS architecture is presented in order to establish a baseline and highlight its inherent limitations. Then, the multi-BS architecture is introduced as a solution, followed by a detailed comparison of two distinct modeling approaches for this more complex system. Finally, based on the selected model, the signal transmission process and the beamforming optimization problem are formally defined.

2.1. Single-BS RIS-Aided System: Model and Limitations

The analysis begins with a conventional single-BS RIS-aided system, as depicted in Figure 1. In this setup, a single BS equipped with N t antennas serves a group of N u users via a shared RIS composed of N ris elements.
The channel from the BS to the RIS is denoted by H 1 C N ris × N t , while the channel from the RIS to the users is H r C N u × N ris . The RIS applies a phase shift matrix, Θ = diag ( e j θ 1 , , e j θ N ris ) to the incident signal. The end-to-end effective channel is then formulated as
H eff , single ( Θ ) = H r Θ H 1 .
Let the signal transmitted by the BS be x C N t × 1 and the signal received by the users be y C N u × 1 . The input–output relationship, including the additive white Gaussian noise (AWGN) vector n , is provided by
y 1 y N u = H eff , single x 1 x N t + n 1 n N u .
While this centralized architecture is straightforward to model, it suffers from fundamental limitations. The system’s performance is highly dependent on a single link, making it vulnerable to deep fading or physical blockages and often creating coverage holes at the cell edge. These limitations motivate the transition to a more distributed architecture.

2.2. Multi-BS RIS-Aided System: Architecture and Advantages

To overcome the limitations of the single-BS setup, this paper focuses on a distributed multi-BS system, as shown in Figure 2. In this architecture, K coordinated BSs, each with N t antennas, jointly serve the same user group through the shared RIS.
This distributed approach offers significant advantages over its centralized counterpart. By leveraging signals from multiple geographically dispersed locations, the system benefits from macroscopic diversity. This mitigates the impact of large-scale fading and shadowing, providing more robust and uniform coverage across the entire service area. However, the core challenge now lies in jointly optimizing the RIS phase shifts in order to coherently combine signals from these multiple sources, which requires a more sophisticated system model.

2.3. Channel Modeling Approaches for the Multi-BS System

To analyze the complex multi-BS system, the individual channels from all K BSs must be integrated into a unified multiple-input multiple-output (MIMO) model. Let H 1 , k C N ris × N t be the channel from the k-th BS to the RIS and  H r C N u × N ris be the channel from the RIS to the user group. Two primary and mathematically equivalent approaches for this integration are presented below.

2.3.1. The Computational Model (Horizontal Concatenation)

This model prioritizes computational efficiency by creating compact dense matrices. The aggregate BS-to-RIS channel H 1 , agg is formed by concatenating the individual channel matrices horizontally:
H 1 , agg = [ H 1 , 1 H 1 , 2 H 1 , K ] .
In this compact representation, the RIS-to-user channel is simply H r , while the RIS phase matrix is the original Θ . The effective channel is then derived by the sequential multiplication of these three matrices:
H eff = H r Θ H 1 , agg = [ H r Θ H 1 , 1 H r Θ H 1 , 2 H r Θ H 1 , K ] .

2.3.2. The Structural Model (Block-Diagonal)

This model preserves the modularity of the distributed system by creating large matrices that maintain the separation of each signal path. The BS-to-RIS channel H 1 is constructed as a large block-diagonal matrix:
H 1 = H 1 , 1 0 0 0 H 1 , 2 0 0 0 H 1 , K .
To align with this structure, the other matrices are expanded. The RIS-to-user channel H 2 is formed by repeating H r horizontally, while the RIS phase matrix T eff is formed by using the Kronecker product to apply Θ to each parallel path.
H 2 = [ H r H r H r ]
T eff = I K Θ
The effective channel is then calculated as H eff = H 2 T eff H 1 . The derivation below shows how the expanded matrices interact to form the final effective channel, which is identical to the one derived using the computational model.
H eff = [ H r H r ] Θ Θ H 1 , 1 H 1 , K = [ H r Θ H 1 , 1 H r Θ H 1 , 2 H r Θ H 1 , K ]

2.3.3. Model Selection

As the above derivations demonstrate, both models produce the identical mathematical result for the effective channel. Therefore, the choice between them is not about correctness but rather utility for the specific task. The objective of this paper is to design a multi-stage optimization algorithm that requires analyzing each BS–RIS link as a distinct subproblem. For this purpose, the block-diagonal model is adopted for several compelling reasons.
First, the proposed solution methodologies fundamentally rely on processing the individual channel matrices H 1 , k . The block-diagonal model provides a natural framework for this decomposition, whereas the monolithic representation of the computational model would require an artificial and counterintuitive slicing process.
Furthermore, this structural separation offers significant computational advantages in terms of scalability. A prime example is matrix inversion, which is common in advanced beamforming techniques. Inverting a large and dense ( K N t × K N t ) matrix has a complexity of O ( ( K N t ) 3 ) . In contrast, inverting the corresponding block-diagonal matrix only requires inverting K smaller ( N t × N t ) matrices, resulting in a much lower complexity of O ( K N t 3 ) . This implies that for achieving a large antenna array, it is far more computationally efficient to increase the number of distributed BSs (K) rather than concentrating a massive number of antennas ( N t ) at a single location. This aligns perfectly with the trend towards distributed and cell-free architectures. Therefore, for its superior logical alignment with the proposed modular optimization strategy and its critical advantages in computational scalability, the block-diagonal model is adopted throughout this paper.

2.4. Signal Transmission and Problem Formulation

Using the selected block-diagonal model, the end-to-end effective channel is denoted by H eff ( Θ ) . Let the aggregate signal vector transmitted from all K base stations be x and the received signal vector at the N u users be y . The fundamental input–output relationship of the wireless channel, including the AWGN vector n , is provided by
y = H eff ( Θ ) x + n .
The following block matrix equation explicitly shows how the signal x k from each BS is shaped by its unique channel path and subsequently combined, demonstrating that the final received signal is a superposition of signals from all K paths:
y 1 y N u = [ H r H r ] Θ Θ H 1 , 1 H 1 , K x 1 x K + n 1 n N u .
To transmit independent data streams, the signal x is generated by applying a linear precoding matrix P to a vector of data symbols s such that x = P s . Substituting this into the channel equation yields the complete end-to-end signal model:
y = H eff ( Θ ) P ( Θ ) s + n .
To mitigate multi-user interference, this paper employs zero-forcing (ZF) precoding, where the precoder is centrally computed using the pseudo-inverse of the effective channel:
P ( Θ ) = β H eff H ( Θ ) ( H eff ( Θ ) H eff H ( Θ ) ) 1
where β is a normalization factor to satisfy the total transmit power constraint.
The primary objective is to find the optimal RIS phase shift matrix Θ that maximizes the overall system performance. Under the ZF scheme, the postprocessing signal-to-noise ratio (SNR) is directly related to the condition of the effective channel. A higher channel gain (as captured by the Frobenius norm) leads to a better-conditioned channel matrix, which in turn results in a higher SNR. This directly improves system-level metrics such as the bit error rate (BER) and spectral efficiency (SE). Therefore, the optimization problem is formulated as maximizing the squared Frobenius norm of the effective channel:
max Θ H eff ( Θ ) F 2 subject to | Θ n , n | = 1 , n = 1 , , N ris .
This is a non-convex optimization problem with a unit-modulus constraint, which is generally difficult to solve optimally.

3. Beamforming Optimization Techniques

This section presents the different techniques used to solve the optimization problem defined in (13). First, several baseline heuristic methods are described to serve as benchmarks. Then, the proposed GNN-based solver is detailed.

3.1. Baseline Heuristic Techniques

Before presenting the specific multi-BS heuristics, this section first defines a powerful method based on singular value decomposition (SVD) for optimizing the RIS phase shifts for a single BS–RIS link. This method serves as a foundational building block for the more advanced heuristics. The specific implementation is formally described in Algorithm 1.
Algorithm 1: SVD-Based RIS Phase Shift Optimization for a Single Link
Input: H 1 C N ris × N t (BS→RIS),    H 2 C N u × N ris (RIS→User)
Perform singular-value decomposition on the RIS–BS covariance:
H 1 H 1 H = U Σ V H ,
   then extract the right singular matrix V
Select the first N u columns:
V opt = V ( : , 1 : N u )
Form the phase-mapping matrix and extract its diagonal entries:
M phase = V opt H 2 , v phase = diag M phase
Build the phase-shift matrix:
Θ opt = diag e j ( v phase )
Output: Θ opt (optimized RIS phase matrix)
With this foundational single-link optimization method established, three baseline heuristic techniques for the multi-BS problem are now introduced.
Technique 1 (Random Phase) serves as a performance lower bound. In this approach, the phase shifts for all N ris elements are chosen independently and uniformly at random from the interval [ 0 , 2 π ) .
Technique 2 (SVD-Based Phase Averaging) extends the SVD principle to the multi-BS problem. For each subsystem k, an optimal phase vector t k is computed using Algorithm 1. These K individually optimized phase vectors are then combined by calculating the vector sum of all t k and setting the final RIS phase to the angle of this resultant vector.
Technique 3 (Optimized Weighted Combination) improves upon simple averaging. This method also uses the SVD-based phase vectors t k as a starting point, but introduces a set of complex weights w k to create a weighted combination w k t k . A conventional iterative solver is then used to find the optimal weights { w k } * that maximize the final objective function.
However, these baseline techniques possess fundamental limitations. Simple heuristics such as random phase and averaging cannot guarantee a coherent superposition of signals. Even the more sophisticated iterative method faces two critical challenges: it is prone to becoming trapped in local optima in the non-convex landscape, and its high computational cost makes it unsuitable for real-time environments. These limitations create a clear need for a new solution paradigm, motivating the shift towards data-driven AI approaches.

3.2. Proposed GNN-Based Solver

To overcome the limitations of conventional methods, this paper proposes a solver based on a GNN. This approach is designed not merely to find a solution, but to learn how to intelligently synthesize a globally optimal RIS configuration from a set of individually optimized yet mutually conflicting sub-solutions.

3.2.1. Overall Architecture

The core idea is to treat the GNN as an “information synthesis engine.” The architecture of the proposed solver (illustrated in Figure 3) is designed around this principle. The process begins not with raw channel data but with a set of high-quality SVD-based features (“Per-BS Optimal Phases”). These features are then fed into a pipeline of two GCN layers, a mean pooling layer, and a decoder to synthesize the final, coherent RIS phase vector θ. To validate this feature-based approach, a “GNN (Direct)” variant, which uses the identical pipeline but takes raw channel data as input, was also evaluated for comparison.

3.2.2. Core Components and Mechanism

The solver’s effectiveness comes from the careful design of its core components, which are the static graph structure, the dynamic input features, and the mechanism that processes them. The GNN operates on a graph that models the physical topology of the RIS panel, where the N ris elements are the graph’s nodes. The connectivity between these nodes is defined by physical proximity.
This connection rule is translated into a 64 × 64 adjacency matrix A and its version with self-loops, A ˜ = A + I . The elements of A ˜ are formally defined based on the grid coordinates of the RIS elements. Let ( r k , c k ) be the row and column coordinates of the k-th element on the N ris × N ris grid. Then, each element of the matrix is provided by
( A ˜ ) i j = 1 , if max ( | r i r j | , | c i c j | ) 1 0 , otherwise .
This rule states that a connection exists only if two elements are immediate neighbors, including diagonally. As visualized in Figure 4, the resulting matrix is static, sparse, and channel-independent, serving as a fixed `blueprint’ that dictates how information flows between RIS elements.
While the graph structure is static and channel-independent, the information that flows across this fixed blueprint is dynamic. This information is captured in the GNN’s node feature matrix, denoted as F . The process begins with an initial feature matrix F ( 0 ) , where each node’s feature vector is the collection of optimal phase shifts suggested by each of the K base stations as derived via the SVD-based method. Each of these suggestions represents a locally optimal solution tailored to a single BS–RIS link, meaning that it is designed to maximize signal strength for its corresponding path while disregarding the impact on others.
However, these locally optimal solutions are mutually conflicting when considered for the global multi-BS system. Therefore, the fundamental task of the GNN is to learn the complex and nonlinear process of reconciling these conflicting inputs into a single coherent phase vector that is globally optimal. The GNN must learn to weigh the importance of each suggestion across different parts of the RIS, effectively finding a sophisticated compromise. This initial feature matrix, which artfully encapsulates the inherent challenge of the multi-BS problem, serves as the input to the first GCN layer. The core mechanism inside each GCN layer, known as neighborhood aggregation, then begins the iterative process of refining this information.
As conceptually visualized in Figure 5, the neighborhood aggregation process allows each node to update its state by gathering feature vectors from its physical neighbors. This intuitive process is efficiently implemented for all nodes simultaneously using the GCN propagation rule, where F ( l ) is the matrix of all node features at layer l and W ( l ) is a trainable weight matrix for that layer.
F ( l + 1 ) = σ A ^ F ( l ) W ( l )
The matrix multiplication A ^ F ( l ) is the mathematical embodiment of the parallelized neighborhood aggregation. The full architecture applies this mechanism sequentially. The initial features are passed through two GCN layers. The first layer performs a local reconciliation of the conflicting suggestions by aggregating information from immediate physical neighbors. The second layer then establishes a broader panel-wide consensus by allowing this refined information to propagate across a two-hop neighborhood. This two-layer design was chosen to balance the need for capturing sufficient spatial correlation against the risk of ‘over-smoothing’, which is a common issue where node features become indistinguishable in deeper GNNs. After this feature refinement, the resulting node features are summarized by a mean pooling layer, which distills the rich node-level information into a single global consensus vector. Finally, this global vector is mapped by the decoder to the final unified phase vector θ.

3.2.3. Cost Definition for AI Solvers

The efficiency of the AI solvers is evaluated using two key metrics based on floating-point operations (FLOPs), which provides a hardware-independent measure of computational complexity.
First, the total computational cost is defined as the total FLOPs consumed during the entire training process to find an optimized solution for the given problem instance. This is approximated as follows: Total FLOPs ≈ (Inference FLOPs) × 3 × (Number of Epochs), reflecting the computational load of backpropagation. This metric serves as a strong proxy for the total energy consumption required by the solver.
Second, the inference cost is the number of FLOPs required for a single forward pass of the model. This metric reflects the operational cost (latency and energy) if the model were deployed as a pretrained real-time solver.

4. Performance Evaluation

4.1. Experimental Setup

The simulation framework is defined by the system parameters in Table 1 and the AI model architectures detailed in Table 2. To create a stringent and realistic testbed, this paper models a non-line-of-sight (NLOS) environment using independent Rayleigh fading, assuming that direct BS–user links are fully blocked. This setup reflects a key challenge for future B5G/6G systems in which higher-frequency bands suffer from poor diffraction, making RIS-assisted communication essential in scattering-rich NLOS environments. All experiments are conducted on a single fixed channel realization in order to ensure a fair comparison of the optimization capabilities of different solvers on an identical problem instance. The solvers are evaluated on two primary fronts: communication performance, measured by Bit Error Rate (BER) and Spectral Efficiency (SE), and cost efficiency, measured by total computational cost (Total MFLOPs) and number of trainable parameters.

4.2. Results and Analysis

4.2.1. System Architecture Comparison: Single-BS vs. Multi-BS

First, the motivation for focusing on the multi-BS scenario is validated by comparing the fundamental performance of a distributed multi-BS system against a centralized single-BS counterpart. The performance in terms of both BER and SE is presented in Figure 6. The results for the multi-BS system consistently outperform the single-BS system across both metrics. Specifically, the optimized multi-BS system achieves a significant power gain for the same BER target and demonstrates substantially higher SE. This result highlights the advantage of macroscopic diversity and confirms that investigating optimization techniques for multi-BS systems is a crucial research direction.

4.2.2. Input Strategy for GNN Solvers: Direct vs. Feature-Based

To validate the core design choice of using preprocessed SVD-based features, this paper compares the ‘GNN (Proposed)’ and ‘GNN (Direct)’ models with the iterative solver (“Optimized”) as a baseline in Figure 7. The figure first confirms that both GNN solvers find higher-quality solutions than the iterative baseline. While both GNN variants achieve a similar high level of communication performance, the critical advantage of the proposed feature-based approach is its exceptional cost efficiency, requiring only a fraction of the parameters and computational cost of the “GNN (Direct)” variant, as detailed in Table 3. This ability to deliver comparable performance with significantly greater efficiency validates the feature-based strategy as the superior solution, which is consequently used for all subsequent AI model comparisons.

4.2.3. Performance Comparison of Feature-Based AI Solvers

The final communication performance of all feature-based AI models is presented and compared against the conventional iterative solver (“Technique 3”), which serves as a strong baseline. Figure 8 shows the communication performance in terms of both BER and SE as a function of transmit power. A key observation is that all AI-based solvers consistently outperform the iterative optimization baseline, achieving lower BER and higher SE. This indicates their superior ability to navigate the high-dimensional and non-convex solution space to find higher-quality solutions. Among the AI models, the GNN, transformer, and Bi-LSTM models form a top-tier performance group, achieving the best results that are closely clustered together.
The learning dynamics shown in Figure 9 offer deeper insights that go beyond the final performance metrics. The plot visually confirms that all AI models converge to a significantly higher objective value than the iterative baseline, which directly explains their superior communication performance. However, the most critical insight comes from comparing the convergence behavior of the different AI architectures. The proposed GNN demonstrates remarkable training stability, a highly desirable property for practical systems. This stability is visually evident in two key aspects of its convergence curve.
First, the standard deviation band across multiple training runs, represented in the figure by the shaded area, is exceptionally narrow for the “GNN (Proposed)” model. This contrasts sharply with the wider bands of the transformer and Bi-LSTM models, which indicates that the GNN’s training process is highly consistent, repeatable, and less sensitive to random initialization. In other words, it reliably reaches a high-quality solution regardless of minor variations in the training setup. Second, its convergence curve is notably smooth, indicating a stable and predictable learning trajectory without significant oscillations.
This desirable stability is a direct result of GNNs’ strong structural inductive bias. Unlike models such as transformer or Bi-LSTM, which treat the input as a generic sequence and must learn all relationships from scratch, GNNs are explicitly designed to leverage the underlying physical topology of the RIS panel, which is provided as a graph structure. This inherent understanding of which RIS elements are neighbors acts as a powerful regularizer, guiding the optimization process more efficiently and preventing it from exploring less plausible regions of the solution space. Therefore, while the final performance of the top-tier models is nearly indistinguishable, the GNN’s superior training stability makes it a more robust and dependable choice for practical systems. This observation reinforces the idea that factors other than final accuracy, such as training stability and computational cost, become the primary differentiators in model selection.

4.2.4. Comprehensive Cost Efficiency Analysis

To provide a holistic evaluation, the performance versus the computational and model costs is analyzed. Table 3 summarizes the key metrics for all solvers, including the final objective value, total computational cost, inference cost, and number of trainable parameters. The inference cost, measured in MFLOPs, serves as a hardware-agnostic proxy for crucial operational metrics such as inference latency and energy consumption.
The results presented in Table 3 reveal several key findings. The proposed GNN not only achieves a 42% higher objective value than the traditional solver but does so with 45% less total computational work, demonstrating the fundamental superiority of this AI-based approach.
Among the AI solvers, the proposed GNN stands out as the most efficient architecture. While achieving the highest objective value, it does so with orders of magnitude fewer parameters and less computational cost than complex models such as the transformer and Bi-LSTM. The “Performance per Parameter” metric, where the GNN scores an exceptional 4.69, quantitatively highlights this superior architectural efficiency. Furthermore, a cost–benefit analysis against the “GNN (Direct)” variant is telling; the proposed GNN achieves a higher objective value while using only 11% of the parameters and 8% of the inference FLOPs. This solidifies the proposed GNN as the most practical solution, since it delivers top-tier performance without the excessive overhead of larger or less efficient models, making it ideal for resource-constrained edge deployments.

5. Discussion

The results of this paper provide several key insights into the application of AI for RIS beamforming optimization.
First, the superiority of all AI-based solvers over the conventional iterative method suggests that data-driven approaches can more effectively navigate the complex non-convex solution space to find higher-quality solutions.
Second, the comparison among AI architectures reveals a clear tradeoff between model complexity and efficiency. While complex models such as the transformer and Bi-LSTM achieve high performance, they do so at a significant cost in terms of parameters and computations. This makes them less suitable for deployment in resource-constrained edge devices.
The proposed GNN strikes an optimal balance. Its remarkable efficiency stems from its inherent ability to exploit the underlying graph structure of the wireless system. This structural inductive bias allows the GNN to learn the essential relationships with unparalleled efficiency. This is quantitatively captured by the “Performance per Parameter” metric, where the GNN significantly outperforms all other models, as well as by its orders of magnitude lower total computational cost (Total MFLOPs).
This focus on computational efficiency is not merely a technical advantage but aligns with the broader goals of developing practical and sustainable AI solutions, often referred to as ‘Green Edge AI’ [18]. The total number of floating-point operations (FLOPs) serves as a strong and hardware-independent proxy for energy consumption. As such, the drastically lower computational cost of the GNN implies significant energy savings, making it a genuinely practical and sustainable solution for future networks.

Limitations and Future Work

This paper has several limitations that open up avenues for future research. The primary limitation is that the analysis is based on a single-channel realization. This approach was chosen in order to rigorously compare the optimization efficiency of different solvers on a fixed complex problem instance, rather than due to their generalization capabilities. Future work should focus on training the models on a large dataset of diverse channel realizations in order to evaluate and enhance their generalization capabilities across various user positions and mobility scenarios.
Furthermore, this paper assumes perfect channel state information (CSI). In practical systems, channel estimation errors are inevitable. Such inaccuracies would lead to a mismatch between the assumed channel and the actual channel, resulting in imprecise beamforming and residual inter-user interference, which would degrade the overall system performance in terms of BER and SE. Therefore, investigating the robustness of the proposed GNN under imperfect CSI is another critical direction for future work.
Finally, the GNN in this study was utilized as an optimization tool in a scenario where the training process itself finds the solution for a static problem. Adapting this efficient architecture into a pretrained general-purpose solver capable of near-instantaneous inference for dynamic channels remains a key objective. Such a pretrained model could potentially be updated for new network conditions without requiring full retraining, for instance through transfer learning or other fine-tuning techniques, which represents a promising direction for future investigation.

6. Conclusions

This paper has proposed a GNN-based solver for the complex reflective beamforming problem in multi-BS RIS-aided systems. The superiority of the multi-BS architecture over a centralized single-BS setup is first established. It is then demonstrated that AI-based solvers can find higher-quality solutions than conventional iterative optimization methods.
Through a comprehensive comparative analysis, this paper has shown that the proposed GNN is the most powerful and practical solution to the above problem. It surpasses the traditional optimization method in both performance and total computational efficiency. Furthermore, among various state-of-the-art AI architectures, the GNN delivers top-tier performance while also being exceptionally lightweight, requiring significantly fewer parameters and computational resources than complex models such as transformers and LSTMs. This remarkable efficiency stems from the inherent ability of GNNs to effectively model the graph structure of wireless communication systems. The findings in this paper suggest that GNNs are a highly promising paradigm for tackling complex optimization problems in future 6G networks.

Author Contributions

Conceptualization, S.-H.S. and H.-K.S.; methodology, S.-H.S.; software, S.-H.S. and S.-G.C.; validation, S.-H.S., S.-G.C., J.-H.Y., Y.-J.C., K.-C.T., M.-H.C., and Y.-G.J.; formal analysis, S.-H.S.; investigation, S.-H.S.; resources, H.-K.S.; data curation, S.-H.S.; writing—original draft preparation, S.-H.S.; writing—review and editing, H.-K.S. and Y.-H.Y.; visualization, S.-H.S.; supervision, H.-K.S. and Y.-H.Y.; project administration, H.-K.S.; funding acquisition, H.-K.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2020R1A6A1A03038540). This work was supported by Institute of Information & communications Technology Planning & Evaluation (IITP) under the metaverse support program to nurture the best talents (IITP-2025-RS-2023-00254529) grant funded by the Korea government (MSIT). This work was supported by the IITP (Institute of Information & Communications Technology Planning & Evaluation)-ITRC (Information Technology Research Center) grant funded by the Korea government (Ministry of Science and ICT) (IITP-2024-RS-2024-00437191). This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (No. RS-2023-00219051).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wu, Q.; Zhang, R. Towards Smart and Reconfigurable Environment: Intelligent Reflecting Surface Aided Wireless Network. IEEE Commun. Mag. 2020, 58, 106–112. [Google Scholar] [CrossRef]
  2. Basar, E.; Di Renzo, M.; De Rosny, J.; Debbah, M.; Alouini, M.S.; Zhang, R. Wireless Communications Through Reconfigurable Intelligent Surfaces. IEEE Access 2019, 7, 116753–116773. [Google Scholar] [CrossRef]
  3. Basar, E.; Alexandropoulos, G.C.; Liu, Y.; Wu, Q.; Jin, S.; Yuen, C.; Dobre, O.A.; Schober, R. Reconfigurable Intelligent Surfaces for 6G: Emerging Hardware Architectures, Applications, and Open Challenges. IEEE Veh. Technol. Mag. 2024, 19, 27–47. [Google Scholar] [CrossRef]
  4. Pan, C.; Ren, H.; Wang, K.; Xu, W.; Elkashlan, M.; Nallanathan, A.; Hanzo, L. Multicell MIMO Communications Relying on Intelligent Reflecting Surfaces. IEEE Trans. Wirel. Commun. 2020, 19, 5218–5233. [Google Scholar] [CrossRef]
  5. Guo, H.; Liang, Y.C.; Chen, J.; Larsson, E.G. Weighted Sum-Rate Maximization for Reconfigurable Intelligent Surface Aided Wireless Networks. IEEE Trans. Wirel. Commun. 2020, 19, 3064–3076. [Google Scholar] [CrossRef]
  6. You, C.; Zheng, B.; Zhang, R. Wireless Communication via Double IRS: Channel Estimation and Passive Beamforming Designs. IEEE Wirel. Commun. Lett. 2021, 10, 431–435. [Google Scholar] [CrossRef]
  7. Wu, Q.; Zhang, R. Beamforming Optimization for Wireless Network Aided by Intelligent Reflecting Surface With Discrete Phase Shifts. IEEE Trans. Commun. 2020, 68, 1838–1851. [Google Scholar] [CrossRef]
  8. Zhou, X.; Fang, T.; Mao, Y. Joint Active and Passive Beamforming Optimization for Beyond Diagonal RIS-Aided Multi-User Communications. IEEE Commun. Lett. 2025, 29, 517–521. [Google Scholar] [CrossRef]
  9. Di Renzo, M.; Zappone, A.; Debbah, M.; Alouini, M.S.; Yuen, C.; de Rosny, J.; Tretyakov, S. Smart Radio Environments Empowered by Reconfigurable Intelligent Surfaces: How It Works, State of Research, and The Road Ahead. IEEE J. Sel. Areas Commun. 2020, 38, 2450–2525. [Google Scholar] [CrossRef]
  10. Sun, H.; Liu, Y.; Al-Tahmeesschi, A.; Nag, A.; Soleimanpour, M.; Canberk, B.; Arslan, H.; Ahmadi, H. Advancing 6G: Survey for Explainable AI on Communications and Network Slicing. IEEE Open J. Commun. Soc. 2025, 6, 1372–1412. [Google Scholar] [CrossRef]
  11. Shi, E.; Zhang, J.; An, J.; Zhang, G.; Liu, Z.; Yuen, C.; Ai, B. Joint AP-UE Association and Precoding for SIM-Aided Cell-Free Massive MIMO Systems. IEEE Trans. Wirel. Commun. 2025, 24, 5352–5367. [Google Scholar] [CrossRef]
  12. Wang, J.; Ma, C.; Gong, S.; Yang, X.; Ma, S. Joint Beamforming Optimization and Mode Selection for RDARS-Aided MIMO Systems. IEEE Trans. Wirel. Commun. 2024, 23, 17557–17572. [Google Scholar] [CrossRef]
  13. Chen, W.; Wen, C.K.; Li, X.; Jin, S. Channel Customization for Joint Tx-RISs-Rx Design in Hybrid mmWave Systems. IEEE Trans. Wirel. Commun. 2023, 22, 8304–8319. [Google Scholar] [CrossRef]
  14. Taha, A.; Alrabeiah, M.; Alkhateeb, A. Enabling Large Intelligent Surfaces With Compressive Sensing and Deep Learning. IEEE Access 2021, 9, 44304–44321. [Google Scholar] [CrossRef]
  15. Wu, Z.; Pan, S.; Chen, F.; Long, G.; Zhang, C.; Yu, P.S. A Comprehensive Survey on Graph Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 4–24. [Google Scholar] [CrossRef] [PubMed]
  16. Shen, Y.; Shi, Y.; Zhang, J.; Letaief, K.B. Graph Neural Networks for Scalable Radio Resource Management: Architecture Design and Theoretical Analysis. IEEE J. Sel. Areas Commun. 2021, 39, 101–115. [Google Scholar] [CrossRef]
  17. Shen, Y.; Zhang, J.; Song, S.H.; Letaief, K.B. Graph Neural Networks for Wireless Communications: From Theory to Practice. IEEE Trans. Wirel. Commun. 2023, 22, 3554–3569. [Google Scholar] [CrossRef]
  18. Mao, Y.; Yu, X.; Huang, K.; Zhang, Y.J.A.; Zhang, J. Green Edge AI: A Contemporary Survey. Proc. IEEE 2024, 112, 880–911. [Google Scholar] [CrossRef]
Figure 1. The baseline single-BS RIS-aided system architecture.
Figure 1. The baseline single-BS RIS-aided system architecture.
Mathematics 13 02732 g001
Figure 2. The proposed multi-BS RIS-aided system architecture.
Figure 2. The proposed multi-BS RIS-aided system architecture.
Mathematics 13 02732 g002
Figure 3. High−level flowchart of the proposed feature−based GNN solver.
Figure 3. High−level flowchart of the proposed feature−based GNN solver.
Mathematics 13 02732 g003
Figure 4. The sparsity pattern of the adjacency matrix A ˜ as defined in (14).
Figure 4. The sparsity pattern of the adjacency matrix A ˜ as defined in (14).
Mathematics 13 02732 g004
Figure 5. The core mechanism of a GCN layer.
Figure 5. The core mechanism of a GCN layer.
Mathematics 13 02732 g005
Figure 6. Performance comparison between the single−BS and various multi−BS schemes.
Figure 6. Performance comparison between the single−BS and various multi−BS schemes.
Mathematics 13 02732 g006
Figure 7. Performance comparison of GNN solvers with different input strategies.
Figure 7. Performance comparison of GNN solvers with different input strategies.
Mathematics 13 02732 g007
Figure 8. Communication performance of all feature−based AI models against the optimized baseline.
Figure 8. Communication performance of all feature−based AI models against the optimized baseline.
Mathematics 13 02732 g008
Figure 9. Training progress of the AI models versus number of epochs.
Figure 9. Training progress of the AI models versus number of epochs.
Mathematics 13 02732 g009
Table 1. System simulation parameters.
Table 1. System simulation parameters.
ParameterValue
Number of BSs (K)4
Antennas per BS ( N t )8
Number of Users ( N u )8
Number of RIS elements ( N ris )64
Channel ModelRayleigh Fading
Modulation Scheme16-QAM
Transmit Power (P)0 to 20 dB
Noise ModelAWGN
Table 2. Core architectural parameters of evaluated AI models.
Table 2. Core architectural parameters of evaluated AI models.
ModelLayer/Input DetailsShape
GNN (Proposed)Input Features (SVD-based)( N ris , K × 2 )
GCN Layer 1 (32 units)( N ris , 32)
GCN Layer 2 (64 units)( N ris , 64)
Linear Decoder( N ris )
GNN (Direct)Input Features (Raw Channel)( N ris , ( K N t + N u ) × 2 )
GCN Layer 1 (128 units)( N ris , 128)
GCN Layer 2 (256 units)( N ris , 256)
Linear Decoder( N ris )
CNNInput Image (SVD-based)(8, 8, K × 2 )
Conv2D Layer 1 (16 filters, 3 × 3 kernel)(8, 8, 16)
Conv2D Layer 2 (32 filters, 3 × 3 kernel)(8, 8, 32)
Linear Layer (64 units)(64)
Linear Decoder( N ris )
Bi-LSTMInput Sequence (SVD-based)( N ris , K × 2 )
2-layer Bi-LSTM (128 hidden units)( N ris , 256)
Linear Decoder( N ris )
TransformerInput Sequence (SVD-based)( N ris , K × 2 )
Input Projection (Linear, 64 units)( N ris , 64)
Self-Attention Encoder (3 layers, 4 heads/layer)( N ris , 64)
Linear Decoder( N ris )
Common Training Hyperparameters
OptimizerAdam
Learning Rate 1 × 10 4 for GNN (Direct), 1 × 10 3 for all other models
SchedulerReduceLROnPlateau (patience = 20, factor = 0.5)
Training Epochs200
Table 3. Comprehensive analysis of performance and cost efficiency for all solvers.
Table 3. Comprehensive analysis of performance and cost efficiency for all solvers.
Solver MethodObjectiveTotal MFLOPsInf. MFLOPsParams. (K)Perf./Param.
Technique 3 (Optimized)2.155 ×   10 4 ∼266---
GNN (Proposed)3.077 ×   10 4 ∼1440.246.64.69
CNN3.071 ×   10 4 ∼2280.3812.12.54
Transformer3.071 ×   10 4 ∼30,28250.47848.20.04
Bi-LSTM3.068 ×   10 4 ∼20,77234.62553.00.06
GNN (Direct)2.958 ×   10 4 ∼18843.1459.80.49
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Seo, S.-H.; Choi, S.-G.; Yu, J.-H.; Choi, Y.-J.; Tong, K.-C.; Choi, M.-H.; Jung, Y.-G.; Song, H.-K.; You, Y.-H. Graph Neural Network-Based Beamforming Optimization for Multi-BS RIS-Aided Communication Systems. Mathematics 2025, 13, 2732. https://doi.org/10.3390/math13172732

AMA Style

Seo S-H, Choi S-G, Yu J-H, Choi Y-J, Tong K-C, Choi M-H, Jung Y-G, Song H-K, You Y-H. Graph Neural Network-Based Beamforming Optimization for Multi-BS RIS-Aided Communication Systems. Mathematics. 2025; 13(17):2732. https://doi.org/10.3390/math13172732

Chicago/Turabian Style

Seo, Seung-Hwan, Seong-Gyun Choi, Ji-Hee Yu, Yoon-Ju Choi, Ki-Chang Tong, Min-Hyeok Choi, Yeong-Gyun Jung, Hyoung-Kyu Song, and Young-Hwan You. 2025. "Graph Neural Network-Based Beamforming Optimization for Multi-BS RIS-Aided Communication Systems" Mathematics 13, no. 17: 2732. https://doi.org/10.3390/math13172732

APA Style

Seo, S.-H., Choi, S.-G., Yu, J.-H., Choi, Y.-J., Tong, K.-C., Choi, M.-H., Jung, Y.-G., Song, H.-K., & You, Y.-H. (2025). Graph Neural Network-Based Beamforming Optimization for Multi-BS RIS-Aided Communication Systems. Mathematics, 13(17), 2732. https://doi.org/10.3390/math13172732

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop