Joint User Scheduling and Hybrid Beamforming Design for Massive MIMO LEO Satellite Multigroup Multicast Communication Systems

In the satellite multigroup multicast communication systems based on the DVB-S2X standard, due to the limitation of the DVB-S2X frame structure, user scheduling and beamforming design have become the focus of academic research. In this work, we take the massive multi-input multi-output (MIMO) low earth orbit (LEO) satellite communication system adopting the DVB-S2X standard as the research scenario, and the LEO satellite adopts a uniform planar array (UPA) based on the fully connected hybrid structure. We focus on the coupling design of user scheduling and beamforming; meanwhile, the scheme design takes the influence of residual Doppler shift and phase disturbance on channel errors into account. Under the constraints of total transmission power and quality of service (QoS), we study the robust joint user scheduling and hybrid beamforming design aimed at maximizing the energy efficiency (EE). For this problem, we first adopt the hierarchical clustering algorithm to group users. Then, the semidefinite programming (SDP) algorithm and the concave convex process (CCCP) framework are applied to tackle the optimization of user scheduling and hybrid beamforming design. To handle the rank-one matrix constraint, the penalty iteration algorithm is proposed. To balance the performance and complexity of the algorithm, the user preselected step is added before joint design. Finally, to obtain the digital beamforming matrix and the analog beamforming matrix in a hybrid beamformer, the alternative optimization algorithm based on the majorization-minimization framework (MM-AltOpt) is proposed. Numerical simulation results show that the EE of the proposed joint user scheduling and beamforming design algorithm is higher than that of the traditional decoupling design algorithms.


Introduction
In recent years, LEO satellite communication systems have played an increasingly important role in wireless communication networks [1].However, facing the high-performance requirements of future wireless communication systems, such as higher spectrum efficiency (SE) and increased EE, the performance of LEO satellite communication systems needs to be improved [2].Applying the massive MIMO technology to LEO satellites is a good choice, and with the advantage of 5G technology [3] and the spatial multiplexing principle of MIMO technology [4], the performance of LEO satellite communication systems can be further improved.Meanwhile, by using the high-precision multiple beams generated by the massive MIMO technology and aggressive full frequency reuse scheme among beams, the performance of the communication system can be greatly improved.However, the full frequency reuse scheme will cause severe inter-beam interference [5], and adopting the beamforming design at the LEO transmitter side can efficiently manage it.In addition, the super-frame structure of multibeam satellite communications standards such as DVB-S2X [6] needs to apply the same beamformer to multiple users that share the same frame.
Sensors 2022, 22, 6858 2 of 31 Therefore, the multigroup multicasting principle can be used in the beamformer design.In this paper, we concentrate on the massive MIMO LEO satellite communication system forward link multigroup multicasting beamforming scheme [7].
In the beamforming design of the massive MIMO LEO satellite multigroup multicast communication system, the following issues need to be considered:

•
User scheduling: Due to that only a few users can be bound into a DVB-S2X frame, and there are a large number of active users in each multicast group, it is necessary to design the user scheduling algorithm.Meanwhile, it should be noted that the interference between scheduled users depends on the beamforming design, which in turn depends on the scheduled users in other beams.Therefore, user scheduling and beamforming design are coupled, and the joint design scheme of user scheduling and beamforming needs to be considered.

•
Channel errors: The LEO satellite has high orbital speed, which will produce a large Doppler shift and result in the channel phase deviation [8].Meanwhile, the factors such as distortion of high-frequency devices, expiration of the CSI and large propagation delay also can cause the channel phase disturbance.Therefore, it is difficult to obtain accurate channel state information (CSI) at the LEO satellite transmitter.Due to the existence of CSI errors, the designed beamforming vector does not match the actual CSI, resulting in the reduction in the receiving gain and signal to interference plus noise ratio (SINR) of the user terminal.Then, the QoS will not be guaranteed.Thus, it is of practical significance to study the robust user scheduling and beamforming design.

•
EE optimization: Due to the limited energy load of LEO satellites, to prolong the service life of the LEO satellite and improve the stability of the LEO satellite communications system, under the consideration of green communications and economic benefits, we need to pay attention to the EE optimization [9].

•
Beamforming scheme: In the beamforming architectures of the massive MIMO technology, although the digital beamforming design can significantly improve the SE, it would bring high hardware complexity and high power consumption.Although the hardware overhead of hybrid beamforming architecture based on full connection is slightly higher than that of the partial connection architecture, it can balance the hardware complexity and system performance, and has higher cost performance.Therefore, in this paper, we selected the hybrid beamforming technology based on the full connection structure [10].

Related Works and Main Contributions 2.1. Related Works
There is extensive literature regarding user scheduling and beamforming design in the wireless multigroup multicast communication system.In Ref. [11], based on the perfect CSI, taking the throughput maximization as the optimization objective the authors studied the precoding design of the multibeam satellite communication system, and proposed a decoupling scheme of the user scheduling and beamforming design.In Ref. [12], considering the influence of CSI errors and taking the minimizing transmission power as the optimization objective, the robust multigroup multicast transmission scheme of the multibeam satellite communication system was investigated, and the low complexity beamforming algorithm and the user grouping algorithm were proposed, but the length limit of the DVB-S2X frame was not considered.In Ref. [13], based on the perfect CSI and taking the maximizing SE as the optimization objective, the multigroup multicast transmission design scheme of the frame-based multibeam satellite communication system was investigated, and the authors proposed a joint design scheme of the user scheduling and beamforming.However, the influence of CSI errors was not considered, and the user grouping algorithm was simple.In Ref. [14], based on the perfect CSI, the authors studied the user scheduling problem of the multicast transmission in the high-throughput satellite communication system.The user scheduling was decoupled into intra-beam scheduling and inter-beam scheduling, and the correlation degree was calculated by using the equivalent CSI; therefore, the interaction of Sensors 2022, 22, 6858 3 of 31 intra-beam and inter-beam scheduling cannot be fully considered.In Ref. [15], the authors studied the user scheduling problem of the multibeam satellite communication system but did not consider the inter-beam interference caused by scheduling.In Ref. [16], based on the perfect CSI, the authors studied the joint scheduling and beamforming design problem for multiuser MISO downlink, and the message-based user grouping and scheduling algorithm was mainly proposed, but the impact of user grouping on system performance was not fully considered.In Ref. [17], the user scheduling and hybrid beamforming design of the massive MIMO orthogonal frequency division multiple access (OFDMA) communication system was studied, in scheme design.First, a joint design algorithm of user scheduling and analog beamforming was proposed; then, the digital beamforming matrix was solved by the weighted minimum mean-square error (WMMSE) algorithm.In Ref. [18], the authors studied the design of user scheduling and subcarrier allocation in the downlink of a massive MIMO OFDMA communication system and proposed a hybrid beamforming scheme.First, based on the optimal solution of digital beamforming, the analog beamforming matrix was obtained by a singular value decomposition algorithm.Then, the authors proposed an algorithm to solve the digital beamforming matrix and its corresponding scheduling users.
Most of the above research took the geosynchronous earth orbit (GEO) satellite or the terrestrial cellular network as the research object, and less used the LEO satellite.In addition, the optimization objectives mainly focused on maximizing SE and minimizing transmission power, and less on the EE optimization.Meanwhile, the analysis of CSI errors was insufficient, as some research considered the CSI errors, but did not analyze the influence of the Doppler shift.In terms of the user scheduling, the common idea was to adopt the decoupling scheme of user scheduling and beamforming design, without fully considering the coupling relationship between the user scheduling and the beamforming design.

Main Contributions
Inspired by the above research, we focus on the downlink transmission design of the massive MIMO LEO satellite multigroup multicast communication system.In scheme design, comprehensively considering the influence of CSI errors caused by the residual Doppler shift and the phase disturbance, we mainly investigate the robust joint user scheduling and hybrid beamforming design to maximize the system EE.Meanwhile, we take the constraints of the transmission power and QoS into account.The main works are summarized as follows:

•
We establish the downlink transmission system model and channel model of the massive MIMO LEO satellite multigroup multicast communication system and analyze the CSI errors.

•
Based on the CSI, we adopt a low complexity hierarchical clustering algorithm based on the Ward connection method to group users, which can lay a foundation for the joint user scheduling and beamforming design.

•
We establish the joint user scheduling and hybrid beamforming design problem model based on EE maximization, and binary variables are defined to represent whether the user is scheduled or not.Then, we transform the optimization problem into a Boolean fractional programming (BFP) problem, which is also a quadratic constraint quadratic programming (QCQP) form problem.

•
For the BFP problem in QCQP form, we invoke the quadratic transformation algorithm to handle the fractional programming form problem in the objective function.Meanwhile, the SDP algorithm is invoked to convert the objective function in QCQP form into a concave function, and some nonconvex constraints can be converted into linear constraints.In addition, we adopt the relaxation and penalty algorithm to deal with the Boolean constraint.Then, the optimization problem is equivalently transformed into a difference of convex (DC) programming problem.

•
For the DC programming problem, an iterative optimization algorithm based on the CCCP framework is proposed.For the rank-one matrix constraint introduced by the SDP algorithm, a penalty iterative algorithm is adopted.

•
For the solution of the digital beamforming matrix and the analog beamforming matrix in the hybrid beamformer, the MM-AltOpt algorithm is proposed.

System Model
As shown in Figure 1, we focus on the downlink of the massive MIMO LEO satellite multigroup multicast communication system, and the LEO satellite uses a UPA, which is composed of N = N x × N y antennas and L(L ≤ N) RF links and covers L multicast groups and K active users, where K ≥ L, K ≥ N. Let the multicast group covered by the lth beam be U l and the number of users in this multicast group be |U l |, assuming that the number of users that can be accommodated in each DVB-S2X frame is U s .It should be noted that each user terminal belongs to only one multicast group, i.e., U i ∩ U j = ∅, ∀i, j ∈ {1, . . . ,L}, i = j.Meanwhile, we assume that the user terminal is equipped with a single antenna capable of data stream demodulation.According to the DVB-S2X standard, in a transmission slot, multiple users' data in a multicast group are multiplexed into a specific forward error correction (FEC) codeword to provide services for more users.The service process is shown in Figure 2. linear constraints.In addition, we adopt the relaxation and penalty algorithm to deal with the Boolean constraint.Then, the optimization problem is equivalently transformed into a difference of convex (DC) programming problem.

•
For the DC programming problem, an iterative optimization algorithm based on the CCCP framework is proposed.For the rank-one matrix constraint introduced by the SDP algorithm, a penalty iterative algorithm is adopted.

•
For the solution of the digital beamforming matrix and the analog beamforming matrix in the hybrid beamformer, the MM-AltOpt algorithm is proposed.

System Model
As shown in Figure 1, we focus on the downlink of the massive MIMO LEO satellite multigroup multicast communication system, and the LEO satellite uses a UPA, which is composed of

 
, , 1,...., , Meanwhile, we assume that the user terminal is equipped with a single antenna capable of data stream demodulation.According to the DVB-S2X standard, in a transmission slot, multiple users' data in a multicast group are multiplexed into a specific forward error correction (FEC) codeword to provide services for more users.The service process is shown in Figure 2.
where the first term in (1) represents the expected received signal of the th k user in the th l multicast group, the second term represents the interference of other multicast groups and the third term represents the additive Gaussian white noise; f as the hybrid beamforming matrix, and is the hybrid beamforming vector of the th l multicast group.Therefore, (1) can be rewritten as: Due to the high orbital speed of LEO satellites and the long transmission delay, it is difficult to obtain the precise instantaneous CSI.To cope with this problem, we adopt the statistical CSI, and the channel vector between the LEO satellite and the th k user in the th l multicast group at instant t and frequency f can be modeled as follows [20]: ) is the UPA array response vector, which can be given by cos The received signal y k,l of the kth user in the lth multicast group can be expressed as: where the first term in (1) represents the expected received signal of the kth user in the lth multicast group, the second term represents the interference of other multicast groups and the third term represents the additive Gaussian white noise; h k,l ∈ C N×1 represents the channel vector of the kth user in the lth multicast group, F BB ∈ C L×L represents the digital beamforming matrix, F BB [:, l] ∈ C L×1 represents the digital beamforming vector of the lth multicast group, and F RF ∈ C N×L represents the analog beamforming matrix, where each element of F RF should meet the unit modulus element [19], i.e., (F RF ) i,j = 1.In addition, s l represents the signal of the multicast group U l , which meets the unit power constraint, i.e., E |s l | 2 = 1, n l ∼ CN(0, σ 2 ) represents the additive Gaussian white noise, which is related to the Boltzmann constant κ, system bandwidth B and the noise temperature T.
For the convenience of analysis, we set F ∈ C N×L = F RF F BB = [f 1 , f 2 , . . ., f L ] as the hybrid beamforming matrix, and f l ∈ C N×1 = F RF F BB [:, l] is the hybrid beamforming vector of the lth multicast group.Therefore, (1) can be rewritten as: Due to the high orbital speed of LEO satellites and the long transmission delay, it is difficult to obtain the precise instantaneous CSI.To cope with this problem, we adopt the statistical CSI, and the channel vector between the LEO satellite and the kth user in the lth multicast group at instant t and frequency f can be modeled as follows [20]: where f denotes the carrier frequency, a k,l,p , f d(k,l,p) ,τ k,l,p are the complex channel gain, Doppler shift and propagation delay, respectively, P k,l denotes the number of propagation paths and V k,l,p ∈ C N×1 is the UPA array response vector, which can be given by where ϕ x k,l,p , ϕ y k,l,p represent azimuth angle and pitch angle associated with the propagation path p of the kth user in the lth multicast group, respectively, λ denotes the wavelength and d represents the spacing of antenna elements, the value of which is usually λ/2 [5].
Note that the LEO satellite communication system is usually operated under the line of sight (LOS) transmission, and the channel vector can be modeled using the widely accepted Rician distribution model as follows: where represents Rician component, Tr(∑) = 1 and γ k,l represents the average channel power, which mainly includes the transmit antenna gain G leo , receiver antenna gain G ut and link power loss.The link power loss is mainly caused by the free space path loss LP f s and the atmospheric absorption loss LP at .Therefore, the average channel power γ k,l can be written as where LP f s can be given by LP f s = 20 log 10 (D k,l ) + log 10 ( f ) + log 10 (4π/c), c is the speed of light, D k,l represents the transmission distance, LP at is related to the carrier frequency, temperature T(h), pressure P(h) and humidity ρ(h), which can be given by LP at = h at h ut LP at ( f , T(h), P(h), ρ(h))dh, LP at ( f , T(h), P(h), ρ(h)) is the loss per meter, h ut is the user's height and h at is the atmosphere thickness.The specific calculation method of LP at can be found in the literature [21].
For the convenience of analysis, it is assumed that the parameters in the channel vector h k,l are constant within coherence time and change over time in a certain ergodic process.In (3), the Doppler shift f d(k,l,p) and the propagation delay τ k,l,p usually cause CSI errors.Next, we focus on analyzing the influence of propagation delay and Doppler shift on CSI errors.Doppler shift: In the LEO satellite communication systems, the Doppler shift is usually large, which is mainly composed of the Doppler shift f leo d(k,l,p) generated by the LEO satellite motion and the Doppler shift f ut d(k,l,p) generated by the users' motion [22].Since the transmission between the LEO satellite and user terminals is mainly under LOS, f leo d(k,l,p) of different transmission paths can be considered to be the same, and we omit the path index of f leo d(k,l,p) , i.e., f leo d(k,l,p) ) can be calculated using the LEO satellite ephemeris information and the location information of user terminals, as shown in Figure 3.
where µ(θ max ) = cos cos −1 r e r cos θ max − θ max , r e denotes the earth radius, r represents the distance between the LEO satellite track point and the earth center, (φ t − φ t 0 ) represents the angular distance of the earth's surface along the LEO satellite trajectory from instant t to instant t 0 , ω represents the angular velocity of the LEO satellite and c represents the speed of light.
where ( ) spectrum can be expressed as: The large Doppler shift in LEO satellite communication systems can make it difficult to receive correctly and result in the degradation of communication performance.In application, to mitigate the impact of Doppler shift, the solution of estimation and compensation is usually adopted [23].The Doppler shift estimation mainly includes two steps: coarse estimation and fine estimation, which can refer to the literature [24].When we obtain the estimated value of Doppler shift, e d f , the Doppler shift compensation of size cps e dd ff = can be implemented at the receivers.It should be noted that due to that the Doppler shift changes rapidly, in addition to the low SINR at the receivers and the limited pilot length in the DVB-S2X frame, the Doppler shift estimation is usually inaccurate, which can result in the incomplete compensation.Then, there would be the residual Doppler shift rsd d f , which can cause the sliding of channel phase.According to the Doppler shift estimation theory based the Cramer-Rao bound, the variance in the Doppler shift estimation can be expressed as the Cramer-Rao lower bound (CRLB) [25], i.e., The value of f ut d(k,l,p) in different transmission paths is different, which is mainly caused by the movement of user terminals and surrounding scatterers.The power spectrum of f ut d(k,l,p) follows the Jakes power spectrum model, and the normalized power spectrum can be expressed as: The large Doppler shift in LEO satellite communication systems can make it difficult to receive correctly and result in the degradation of communication performance.In application, to mitigate the impact of Doppler shift, the solution of estimation and compensation is usually adopted [23].The Doppler shift estimation mainly includes two steps: coarse estimation and fine estimation, which can refer to the literature [24].When we obtain the estimated value of Doppler shift, f e d , the Doppler shift compensation of size f cps d = f e d can be implemented at the receivers.It should be noted that due to that the Doppler shift changes rapidly, in addition to the low SINR at the receivers and the limited pilot length in the DVB-S2X frame, the Doppler shift estimation is usually inaccurate, which can result in the incomplete compensation.Then, there would be the residual Doppler shift f rsd d , which can cause the sliding of channel phase.According to the Doppler shift estimation theory based the Cramer-Rao bound, the variance in the Doppler shift estimation can be expressed as the Cramer-Rao lower bound (CRLB) [25], i.e., where N is the pilot length, T is the sampling time and SNR is the signal-to-noise ratio.
According to the properties of variance, after Doppler shift compensation of size f cps d , the variance of residual Doppler shift f rsd d is equal to the variance of f e d , i.e., The influence of residual Doppler shift on channel phase errors can be expressed as where ∆T represents the sum of the downlink propagation delay and the duration of the DVB-S2X frame.Therefore, the variance of φ f rsd d can be expressed as Propagation Delay: The orbital height of LEO satellites is about 300 km to 2000 km, and the long transmission distance can cause the larger propagation delay.Note that the influence of atmosphere on propagation delay mainly includes ionospheric delay and tropospheric delay [26].When the signal passes through the ionosphere, due to the refraction effect of the electromagnetic wave, the propagation path and speed of the signal will change.Meanwhile, the ionospheric delay is irregular, which is difficult to describe with a physical model.When the signal passes through the troposphere, the propagation speed, direction and path of the signal will change, which can result in propagation delay.The tropospheric delay is related to air pressure, air humidity and satellite elevation.The commonly used tropospheric delay correction model is given in the literature [27,28].The round-trip delay of the LEO satellite with an orbit altitude of 1200 km is about 20 ms.The long propagation delay will lead to the expiration of CSI, which can result in CSI errors, phase disturbance and other problems.To handle this problem, the delay compensation of size τ cps = βτ min + (1 − β)τ max , (0 ≤ β ≤ 1) is usually implemented at the receivers, and τ min k,l = min τ k,l,p represent the minimum propagation delay and the maximum propagation delay of the kth user in the lth multicast group, respectively.However, due to that the atmospheric propagation delay is irregular, the transmission delay cannot be fully compensated.Therefore, the incomplete delay compensation, expired CSI and distortion of high-frequency devices would cause the channel phase disturbance [29].Let the phase disturbance be φ τ , which follows a real-valued Gaussian distribution with mean zero and variance σ 2 φ τ , i.e., φ τ ∼ N 0, σ 2 φ τ .The channel error vector caused by φ τ can be expressed as v φ τ = [e jφ τ,1 , e jφ τ2 , . . ., e jφ τ,N ] T .
In conclusion, considering the influence of the residual Doppler shift and the phase disturbance, the relationship between the real channel vector h k,l and the estimated channel vector ĥk,l can be expressed as: where represents the Hadamard product.Let the channel phase of the kth user in the lth multicast group be θ k,l = [θ k,l,1 , θ k,l,2 , . . ., θ k,l,N ] T , which satisfies the uniform distribution between 0 ∼ 2π.Then, the real channel phase with phase errors at instant t 1 is as follows:

User Clustering
Before the joint user scheduling and hybrid beamforming design, it is necessary to group the active users within the coverage of the LEO satellite.Based on the CSI, the hierarchical clustering algorithm is adopted to group users [30].As shown in Figures 4 and 5, the hierarchical clustering algorithm adopts the bottom-up method, where each user initially forms a group, and then according to the similarity measurement function, the user groups which meet the similarity threshold constraint are combined until the desired number of groups is formed.
group the active users within the coverage of the LEO satellite.Based on the CSI, the hierarchical clustering algorithm is adopted to group users [30].As shown in Figures 4 and  5, the hierarchical clustering algorithm adopts the bottom-up method, where each user initially forms a group, and then according to the similarity measurement function, the user groups which meet the similarity threshold constraint are combined until the desired number of groups is formed.5, the hierarchical clustering algorithm adopts the bottom-up method, where each user initially forms a group, and then according to the similarity measurement function, the user groups which meet the similarity threshold constraint are combined until the desired number of groups is formed.We adopt the similarity measurement function among multicast groups based on the Ward connection method, i.e., where n i , n j represent the number of users of the group i and the group j, respectively, h eq i , h eq j represent the equivalent CSI of the group i and the group j, respectively, and dist(h eq i , h eq j ) represents the Euclidean distance between the vector h eq i and the vector h eq i , i.e., dist(h eq i , h eq j ) =

System Rate
Affected by CSI errors, both the ergodic communication rate and the ergodic SINR do not admit explicit expressions.To handle this challenge, the statistical average method is adopted to model the SINR and the communication rate.Therefore, the SINR and the communication rate of the kth user in the lth multicast group can be expressed as: where B denotes system bandwidth.Equations ( 17) and ( 18) are approximations with closed form, the feasibility of which have been discussed in detail in Refs.[31,32].

Problem Description
We take the system EE as the optimization objective, and the EE is defined as the ratio of the system communication rate to the total power consumption, which can be modeled as: where P total represents the total power consumption, denotes the transmission power of the LEO satellite, and P 0 denotes the inherent power consumption of the communication system.Let the Boolean variable η k,l ∈ {0, 1} indicate whether the kth user in the lth multicast group is served, η k,l = 1 and η k,l = 0 indicate that the user can be served and not served, respectively, and η = [η 1 , η 2 , . . . ,η L ], η l = η 1,l , η 2,l , . . ., η |U l |,L T .In conclusion, under the constraints of the transmission power and QoS, the problem of maximizing system EE can be modeled as: C 4 : C 5 : where SINR min l represents the minimum SINR of the lth multicast group and constraint C 3 represents that SINR min l should be greater than the minimum SINR constraint SINR 0 .Constraint C 4 limits the number of scheduled users in each multicast group to U s .

Joint User Scheduling and Hybrid Beamforming Design for Maximizing EE
In this section, we focus on the robust joint user scheduling and hybrid beamforming design strategy to maximize system EE.To handle the QCQP form problem and nonconvexity in optimization problem Q 1 , the SDP method is applied to make the optimization problem more tractable.Then, we transform the optimization problem Q 1 into a DC programming problem.To address the DC programming problem, we adopt the CCCP algorithm.Finally, a penalty iterative algorithm is adopted to handle the rank-one matrix constraint.

SDP Algorithm
It is worth noting that the objective function and constraints C 2 and C 5 in the problem Q 1 involve the quadratic form of the variable f l , therefore, Q 1 is the QCQP form problem. To handle this problem, we invoke the SDP algorithm, a new variable W l f l f H l is introduced, and the positive semidefinite matrix W l needs to meet the constraints of W l 0 and rank(W l ) = 1.Then, the problem Q 1 can be equivalent to: Similarly, the SINR and the communication rate of the kth user in the lth multicast group can be equivalently converted to: where H k,l ∈ C M×M is the instantaneous channel autocorrelation matrix of the kth user in the lth multicast group and H k,l ∈ C M×M is the long-term channel autocorrelation matrix.
The relationship between the two can be expressed as: where and Q τ k,l can be expressed as follows: Sensors 2022, 22, 6858 In (34), the diagonal elements of P f rsd d,k,l are all 1, and the elements in row i and column ,j , according to Similarly, E e Sensors 2022, 22, 6858 13 of 31 In (36), the diagonal elements of Q τ k,l are all 1, and the elements in row i and column j on the non-diagonal are E e jφ τ k,l ,i e −jφ τ k,l ,j = E e jφ τ k,l ,i E e −jφ τ k,l ,j , according to Similarly, E e jφ τ k,l

DC Programming
Since the constraint C 1 is a Boolean constraint and the constraint C 2 is a nonconvex constraint, the problem Q 2 is a nonconvex and nonsmooth combinatorial optimization problem.To handle this challenge, we can transform the problem Q 2 into a DC programming problem [33].Therefore, the relaxation variable ζ k,l is introduced as the lower bound of the SINR of the kth user in the lth multicast group, The problem Q 2 can be equivalently converted to: where the constraint C 2 can be equivalently converted to: To further express (42) in the form of DC programming, we introduce new function variables Γ k,l (W) and I k,l (W, ζ k,l ): Therefore, the constraint C 2 can be rewritten as: Sensors 2022, 22, 6858 14 of 31 In (45), Γ k,l (W l ) is the affine function of W, I k,l (W, ζ k,l ) is the concave function of W and ζ k,l .The transformed constraint C 2 is a typical DC constraint.
Similarly, the constraint C 8 can be equivalently converted into the following DC form: In (38), the objective function in the problem Q 3 is a fractional programming problem with the sum-of-ratios form.To handle this problem, we invoke the quadratic transformation algorithm [34] and convert the problem Q 3 into the following form: Tr(W l ) + P 0 , (47) where q is the introduced auxiliary variable, and Υ l (SINR min l ) is the introduced auxiliary function, which can be expressed as: In addition, for the nonsmooth combinatorial optimization problem caused by constraint C 1 , we invoke a relaxation and penalty algorithm.Firstly, we relax constraint C 1 into C 1 ⇒ 0 ≤ η k,l ≤ 1, ∀k, l .Meanwhile, to avoid the non-duality of the solution of η k,l caused by the relaxation, the penalty term P(η k,l ) = η k,l log η k,l + (1 − η k,l ) log(1 − η k,l ) is introduced into the objective function: let λ 1 > 0 be the penalty factor, and the problem Q 4 can be equivalently converted to:

CCCP Algorithm
From (51), ( 52) and (54), it can be seen that the problem Q 5 is a DC programming problem.To handle this challenge, the CCCP framework algorithm is a common method to solve the DC programming problem [35], which is an iterative framework including two operations: convexification and optimization.In the convexification step, by adopting the first-order Taylor expansion, the convex part of the objective function and the concave part of the constraint function can be linearized; then, the DC programming problem is transformed into a convex problem.It should be noted that the convex problem obtained from the convexification step provides a global lower bound for the original problem, and the optimization step is mainly to maximize the lower bound.Meanwhile, the performance of the CCCP algorithm is closely related to the initial point of the variables, but the equality constraint C 4 of the problem Q 5 limits the selection of the initial point.To find a feasible initial point, we substitute the constraint C 4 into the objective function and set the penalty factor λ 2 > 0.Then, the problem Q 5 can be equivalently converted to: be the estimated value of variables In iteration t, for the convex part ), we adopt the first-order Taylor expansion to replace it, which is reflected in line three of Algorithm 1.The first-order Taylor expansion of P(η k,l ) can be expressed as: ∇P η Similarly, in the constraint ) with its first-order Taylor expansion, which is reflected in line three of Algorithm 1.The first-order Taylor expansion of I k,l (W, ζ k,l ) can be expressed as: In the constraint Optimization: The optimization step is reflected in line nine of Algorithm 1.According to (57), (59) and (61), the problem Q 6 can be equivalently converted to For the problem Q 7 , the variables η k,l , SINR min can be updated by the iterative optimization.
Feasible Initial Point: It should be noted that the CCCP algorithm needs a feasible initial point to ensure that the algorithm converges to a stationary point, as the selection of the initial point can affect the performance of the CCCP algorithm.To find a better initial point, we adopt the following method, which is reflected in line one of Algorithm 1.

•
Initialize η , the following optimization problem are modeled: • If P FES is feasible, proceed to the next step, otherwise, update η k,l , 0 < δ < 1 and repeat step 2;

•
Based on the W (0) obtained in step 2, calculate the SINR of each user, i.e., SINR k,l , ∀k, l, and update η

Penalty Iteration Algorithm
It should be noted that the SDP algorithm brings the nonconvex and nonsmooth constraint, i.e., C 7 : rank(W l ) = 1.To solve the rank-one constraint, many existing research directly relaxes the rank-one constraint in the optimization step [36], and then judges whether the optimization solution {W l } L l=1 meets the rank-one constraint.If so, the eigenvalue decomposition (EVD) algorithm is directly adopted to obtain the hybrid beamforming vectors {f l } L l=1 according to W l = f l f H l , and if the optimization solution {W l } L l=1 does not meet the rank-one constraint, the Gaussian randomization algorithm (GRA) is usually adopted.The basic idea of the GRA is as follows: Firstly, a set of candi- , where G represents the number of the Gaussian randomization.Secondly, from the generated G-group candidate Gaussian vector pool, combined with the power redistribution among the multicast groups, a group of Gaussian vectors is selected as the optimal hybrid beamforming matrix to maximize the objective function in the problem Q 7 .It should be noted that in the case of the high-dimensional matrix, GRA has high complexity and large performance loss, resulting in poor availability.To this end, we adopt a feasible algorithm with the better performance: the penalty iteration algorithm.
According to the properties of the matrix, rank(W l ) = 1 is equivalent to Tr(W l ) − λ max (W l ) = 0. Therefore, the nonsmooth method is adopted to transform the constraint rank(W l ) = 1 in the problem Q 7 into the following form: where λ max (W l ) is the function of solving the maximum eigenvalue.It should be noted that for any positive semidefinite matrix W l 0, the inequality Tr(W l ) − λ max (W l ) ≥ 0 is always true, which means that the transformed constraint C 7 and Tr(W l ) − λ max (W l ) = 0 are equivalent.Then, we can obtain that the matrix W l has only one non-zero eigenvalue and can be given by where w l,max is the corresponding unit eigenvector.Therefore, the problem Q 7 can be converted into In the iterative calculation of the problem Q 8 , based on the obtained W l , if the value of Tr(W l ) − λ max (W l ) is small enough, the matrix W l can be considered to meet the rank-one constraint, which is reflected in line seven of Algorithm 1.Therefore, to make the value of Tr(W l ) − λ max (W l ) as small as possible, we adopt the penalty iteration algorithm and substitute the constraint C 7 into the objective function in the problem Q 8 .Therefore, the problem Q 8 can be converted into where Tr(W l ) + P 0 It should be noted that Tr(W l ) is an affine function and λ max (W l ) is nonsmooth, which can result in the non- smoothness of the objective function in the problem Q 9 .To handle the challenge, we replace λ max (W l ) with its first-order Taylor expansion.The subgradient of λ max (W l ) is ∂λ max (W l ) ∂W l = w l,max w H l,max and its first-order Taylor expansion can be expressed as follows, which is reflected in line eight of Algorithm 1.
where w l,max w H l,max , W We substitute (76) into the objective function in the problem Q 9 to replace λ max (W l ), and the problem Q 9 can be expressed as: In conclusion, the robust joint user scheduling and hybrid beamforming design algorithm for the massive MIMO LEO satellite multigroup multicast communication system is shown in Algorithm 1. , q (k=0) .57), ( 59), (61).4. Calculation q (k) , substitute q (k) into (77).5. Optimization step.

Let η, W, SINR min
Calculate the maximum eigenvalue λ max (W l ) of W (m) l and the corresponding eigenvector

Convergence
The effectiveness of the algorithm depends on its convergence.For the convergence of the CCCP algorithm, the convergence has been proven by [37].To prove the convergence of the penalty iteration algorithm, let the variable solution and objective function value of the optimization problem Q 10 be η, W, SINR min at the kth iteration.Therefore, the convergence can be proved as follows: , the proposed algorithm can iteratively converge to an optimal solution by setting a reasonable convergence threshold.

Complexity
The complexity of the algorithm directly affects its performance.In the algorithms adopted, the complexity of the hierarchical clustering algorithm can be calculated according to the connection algorithm, similarity measurement criteria and hierarchical grouping process, and the algorithm complexity can be expressed as O LK 2 N .The complexity of the joint user scheduling and hybrid beamforming design algorithm is closely related to the number of multicast groups and scheduling users.In addition, the number of optimization variables and constraints in the CCCP algorithm and the penalty iteration algorithm can also affect the complexity [38].In the problem Q 10 , the number of optimization variables According to the algorithm complexity, the proposed joint user scheduling and hybrid beamforming design algorithm has a strong timeliness in small dimensional communication systems.However, for large dimensional communication systems, such as the satellite communication system, the number of active users is usually large.According to the complexity analysis, with the increase in the total number of active users, the convergence speed of the algorithm would gradually slow down, and the complexity would gradually increase.Considering the characteristics of the LEO satellite communication system, the delay caused by the high complexity is unacceptable, which would affect the overall performance of the communication system.To handle this problem, considering the balance of the algorithm performance and complexity, before the joint user scheduling and hybrid beamforming design, we can appropriately reduce the system dimension by adding the user preselection step in the algorithm process.The user preselection step can reduce the total number of active users in each transmission, and then we carry out the joint user scheduling and hybrid beamforming design for preselected users.

User Preselection Algorithm
In the user preselection step, U l,p users in the lth multicast group are preselected as the user representatives, where U s < U l,p ≤ |U l |.The user preselection process is shown in Figure 6.The symbols (circles, squares and triangles) represent the users in different the multicast group.The red circles represent preselected users and scheduling users.
joint user scheduling and hybrid beamforming design for preselected users.

User Preselection Algorithm
In the user preselection step,   The selection of preselected users can affect the performance of the joint user scheduling and hybrid beamforming design algorithm, which depends on the preselected algorithm.In the beamforming design of the multigroup multicast communication system, the beamforming vector is oriented to multiple users in the multicast group.Therefore, in the process of user preselection, to maximize the receive gain of each user, i.e., h H k,l f l , the beamforming vector f l of the lth multicast group should be collinear with the users' channel vectors in the multicast group as far as possible.Therefore, the channel vectors of the preselected users in the same multicast group should also be strongly linearly correlated.Meanwhile, the interference among multicast groups should also be taken into account in the user preselection stage.To reduce the interference among multicast groups, the channel vectors of preselected users among different multicast groups should be orthogonal.Similarly, the beamforming vector of the multicast group should be orthogonal to the users' channel vectors in other multicast groups.
In conclusion, we adopt a low complexity user preselection algorithm, which can preselect orthogonal users among the different multicast groups and linearly correlated users in the same multicast group.The proposed algorithm is divided into two steps, as follows:

•
The first step: according to the orthogonal criterion [11], a user is preselected for each multicast group in turn, which is reflected in line three and line four of Step 1 in Algorithm 2;

•
The second step: based on the users of each multicast group selected in the first step, linearly correlated users are selected for each multicast group, which is reflected in line two and line three of Step 2 in Algorithm 2.
The specific preselection process of the two steps is as follows: Algorithm 2: User preselection algorithm.
Step 1: Orthogonal user preselection algorithm among the different multicast groups.Input: CSI. 1.Let Id (1) = Index max h k,l , ∀k, l , select the user with the largest channel gain, Id (1) is the index of the user.2. while l ≤ L, l = Id (1) 3. For all users in the lth multicast group, calculate in turn.
4. Id (l) = Index max Z k,l , ∀k ∈ l , the user with index Id (l) is the preselected orthogonal user of the lth multicast group.

end
Output: Orthogonal users among the different multicast groups.
Step 2: User preselection algorithm in each multicast group.Input: Orthogonal users among the different multicast groups, CSI. 1.For l = 1 : L 2. For other users in the lth multicast group except the orthogonal user preselected in step 1, calculate the linear correlation value between each user and the preselected orthogonal user of the multicast group in turn, i.e., 3. Based on the C k,l of users in each multicast group, select top U l,p − 1 largest users, plus the orthogonal users in step 1 as the preselected users of each multicast group.4. end 5. end Output: Preselected users for each multicast group.
After the user preselection, the joint user scheduling and hybrid beamforming design is for the preselected users.Therefore, the dimension of the LEO satellite communication system will be reduced, and the algorithm complexity will be reduced.Although the algorithm performance has a slight loss, compared with the decoupling design of user scheduling and beamforming, the performance is greatly improved.In conclusion, the joint user scheduling and hybrid beamforming design with the user preselection step is a better choice after balancing performance and complexity.

Solution of The Digital Beamforming Matrix and The Analog Beamforming Matrix
In this section, we aim to investigate the design of digital beamforming matrix F BB and analog beamforming matrix F RF in a hybrid beamformer.After obtaining W, we need to further solve F BB and F RF .The solution method of F BB and F RF can be divided into two steps:

•
The first step: we adopt the EVD algorithm to solve the hybrid beamforming matrix F from W.

•
The second step: we propose the MM-AltOpt algorithm to obtain F BB and F RF .

Solution of The Hybrid Beamforming Matrix
Before calculating F BB and F RF , it is necessary to obtain the hybrid beamforming matrix F. For the solution of F, we can adopt the EVD algorithm based on the previously obtained optimization variable W. According to the relationship between W and F, i.e., W l f l f H l , the solution of F can be modeled as follows: min where the hybrid beamforming vector f l can be given by where v l is the maximum eigenvalue of the matrix W l and u l is the maximum eigenvector of the matrix W l .

MM-AltOpt Algorithm: Solution of F BB and F RF
According to F = F RF F BB and (F RF ) i,j = 1, the solution of F BB and F BB can be modeled as a joint optimization problem with the power and constant modulus constraints, as follows: P 1 : min where ulus constraint, which is determined by the phase shifter in UPA, and the constraint C 2 represents the power constraint.
It is worth noting that the problem P 1 is a matrix decomposition problem with the constant modulus constraint and the equality constraint.The objective function is a nonconvex function of variables F BB and F RF , and the constraints C 1 and C 2 are also nonconvex.Meanwhile, it can be seen that when one of the two variables is given, the objective function is the convex function of the other variable.To solve the problem P 1 , we invoke the alternating optimization algorithm.The alternating optimization algorithm can decompose the multivariable joint optimization problem into multiple subproblems according to the partial convexity of the problem P 1 , and one of the variables can be iteratively solved by fixing the residual variables.
It should be noted that the nonconvexity of constraints is still a challenge.To this end, we first relax the constraint C 2 , and then use the scale factor to adjust the digital beamforming matrix F BB to meet the power constraint.Then, for the solution of the analog beamforming matrix F RF with the unit modulus constraint, the MM algorithm is adopted [33].

Solution of The Analog Beamforming Matrix Based on The MM Algorithm
According to the solution process of the alternating optimization algorithm, we first solve the analog beamforming matrix F RF based on the digital beamforming matrix F BB .Thus, the problem P 1 can be expressed as: where BB represents the estimated value of the digital beamforming matrix F BB at the nth iteration.Due to the unit module constraint of elements in F RF , the problem P 2 is a nonconvex optimization problem.
According to the MM framework theory, the key step is constructing a surrogate function of the objective function in the optimization problem [39].To construct the surrogate function, we decompose the matrix F by rows.According to the equivalence of the F-norm and L 2 -norm of the vector, the problem P 2 can be rewritten as: where F H i represents the ith row vector of the matrix F, and F H RF,i represents the ith row vector of the matrix F RF .
It should be noted that the third term 86) is a convex function term, which needs further conversion.According to the first-order Taylor expansion, H F RF,i can be converted into: where RF,i represents the estimated value of F RF,i at the qth iteration.According to the MM algorithm, the surrogate of H F RF,i can be expressed as follows: where X (n) is a positive semidefinite matrix and satisfies the constraint here, we let In conclusion, (89) can be further expressed as: According to (90), the surrogate function of the objective function in the problem P 3 can be expressed as: It is worth noting that the first and third terms of the objective function in the problem P 4 are constant terms, and the last term is independent of the variable F H RF,i .After ignoring the above three items, the problem P 4 can be converted to the following projection problem: RF,i .Therefore, the following closed form solution can be obtained for the problem P 5 , which is reflected in line three of the inner algorithm in Algorithm 3: where RF , which is reflected in line two of the inner algorithm in Algorithm 3.

Solution of The Digital Beamforming Matrix
Based on the analog beamforming matrix F RF obtained in the previous section, the solution problem of the digital beamforming matrix F BB can be modeled as follows: where RF represents the estimated value of the analog beamforming matrix F RF at the nth iteration.Due to the quadratic form and the convex equality constraint in the problem P 6 , the problem P 6 is a nonconvex QCQP form problem.
One of the ways to solve the problem P 6 is to relax the equality constraint into the inequality constraint, and then convert the problem P 6 into a convex minimization problem, which can be solved with the CVX toolbox, but the complexity of this method is high.To this end, based on the fact that the hybrid beamforming matrix F satisfies the power constraint, i.e., F 2 F = P T , the following closed form solution can be obtained for the problem P 6 , which is reflected in line two of the main algorithm in Algorithm 3: In conclusion, the MM-AltOpt algorithm for solving F BB and F RF can be described as the main algorithm and the inner algorithm, as follows: 3. Calculate F RF = e −jarg(C (q)T ) . 4. Set q = q + 1.

Results and Discussion
In this section, we evaluate the performance of the proposed joint user scheduling and hybrid beamforming design algorithm by numerical simulations.In the numerical simulations, we set the number of multicast groups to L = 7, which cover 150 active users, and set the SINR constraint threshold of each multicast group to SINR 0 = 1.To facilitate analysis, we assume that the CSI errors of different multicast groups are the same, which are expressed as σ 2 The value of P 0 can be calculated by [31].In addition, the system parameters used in the numerical simulations are shown in Table 1. Figure 7 shows the convergence trajectory of the EE of the massive MIMO LEO satellite multigroup multicast communication system, versus the number of iterations for different CSI errors, different numbers of preselected users and different scheduling algorithms.In this simulation, two groups of channel errors are set according to Refs.[23,29] φ τ = 5, the EE performance gain of the proposed robust algorithm is improved by 9.8% and 6.7%, respectively, compared with the traditional nonrobust algorithm.The system EE of the proposed joint user scheduling and hybrid beamforming design algorithm is higher than that of the decoupling design algorithm, and the more preselected users, the higher performance improvement.
Sensors 2022, 22, x FOR PEER REVIEW 26 of 32 Figure 7 shows the convergence trajectory of the EE of the massive MIMO LEO satellite multigroup multicast communication system, versus the number of iterations for different CSI errors, different numbers of preselected users and different scheduling algorithms.In this simulation, two groups of channel errors are set according to Refs.[23,29], i.e., 2       = , the EE performance gain of the proposed robust algorithm is im- proved by 9.8% and 6.7%, respectively, compared with the traditional nonrobust algorithm.The system EE of the proposed joint user scheduling and hybrid beamforming design algorithm is higher than that of the decoupling design algorithm, and the more preselected users, the higher performance improvement.Figure 8 compares the EE performance of the proposed algorithm and the traditional algorithm under different system parameters, versus different transmission power thresholds T P .It can be seen that with the increase in transmission power, the EE performance shows a trend of first rising and then falling.The reason is that the growth rate of the system rate is lower than that of the power consumption.Meanwhile, we can see that the EE of the joint user scheduling and hybrid beamforming design algorithm is higher than that of the decoupling design algorithm versus different transmission power thresholds  Figure 8 compares the EE performance of the proposed algorithm and the traditional algorithm under different system parameters, versus different transmission power thresholds P T .It can be seen that with the increase in transmission power, the EE performance shows a trend of first rising and then falling.The reason is that the growth rate of the system rate is lower than that of the power consumption.Meanwhile, we can see that the EE of the joint user scheduling and hybrid beamforming design algorithm is higher than that of the decoupling design algorithm versus different transmission power thresholds P T .Under the conditions of σ 2 f rsd d = 25, σ 2 φ τ = 10 and P T = 15 W, when U l,p /U s = 2, U l,p = 4 and U l,p /U s = 3, U l,p = 6, the EE performance gain of the proposed joint design algorithm is 28.41% and 45.19% higher than that of the traditional decoupling design algorithm.Figure 9 indicates the change trend of the SE of the massive MIMO LEO satellite multigroup multicast communication system versus different transmission power thresholds T P .It can be seen that the SE increases with the increase in the transmission power.Meanwhile, we can see that the SE of the joint user scheduling and hybrid beamforming design algorithm is higher than that of the decoupling design algorithm.In addition, the more preselected users, the higher the system SE.Compared with the traditional algorithm, the proposed robust joint design algorithm can obtain higher system SE at the same transmission power, and thus can improve the system EE.Under the conditions of

U U U
== , with the improvement of system SE performance, the EE performance gain of the proposed joint design algorithm is 26.16% and 37.85% higher than that of the traditional decoupling design algorithm.Figure 9 indicates the change trend of the SE of the massive MIMO LEO satellite multigroup multicast communication system versus different transmission power thresholds P T .It can be seen that the SE increases with the increase in the transmission power.Meanwhile, we can see that the SE of the joint user scheduling and hybrid beamforming design algorithm is higher than that of the decoupling design algorithm.In addition, the more preselected users, the higher the system SE.Compared with the traditional algorithm, the proposed robust joint design algorithm can obtain higher system SE at the same transmission power, and thus can improve the system EE.Under the conditions of σ 2 f rsd d = 25, σ 2 φ τ = 10 and P T = 30 W, when U l,p /U s = 2, U l,p = 4 and U l,p /U s = 3, U l,p = 6, with the improvement of system SE performance, the EE performance gain of the proposed joint design algorithm is 26.16% and 37.85% higher than that of the traditional decoupling design algorithm.
Figure 10 shows the SE comparison of different multicast groups.It can be seen that the SE of each multicast group of the proposed robust algorithm is higher than that of the nonrobust algorithm.Meanwhile, with the increase in the number of preselected users, the diversity of users increases, and the performance of the proposed joint user scheduling and hybrid beamforming design algorithm also improves.This is because with the increase in the number of preselected users, the range of users that can be scheduled and selected increases.By scheduling different users in each multicast group, the SE can be further improved.Meanwhile, with the improvement of the system SE, the system EE performance gain also increases, which verifies the effectiveness of the joint user scheduling and hybrid beamforming design algorithm.
proposed robust joint design algorithm can obtain higher system SE at the same transmission power, and thus can improve the system EE.Under the conditions of 2 == , with the improvement of system SE performance, the EE performance gain of the proposed joint design algorithm is 26.16% and 37.85% higher than that of the traditional decoupling design algorithm.Figure 10 shows the SE comparison of different multicast groups.It can be seen that the SE of each multicast group of the proposed robust algorithm is higher than that of the nonrobust algorithm.Meanwhile, with the increase in the number of preselected users, the diversity of users increases, and the performance of the proposed joint user scheduling and hybrid beamforming design algorithm also improves.This is because with the increase in the number of preselected users, the range of users that can be scheduled and selected increases.By scheduling different users in each multicast group, the SE can be further improved.Meanwhile, with the improvement of the system SE, the system EE performance gain also increases, which verifies the effectiveness of the joint user scheduling and hybrid beamforming design algorithm.Figure 11 shows the change trajectory of the system EE and SE versus the different s U .It can be seen that with the increase in s U , the system EE and SE show a downward trend.This is because the communication rate of each multicast group is constrained by the user with the worst SINR in the multicast group.With the increase in s U , if the users' channel vectors in the multicast group remain collinear, the EE and SE would remain unchanged.However, according to the rules of the user preselection and scheduling, with the increase in s U , the collinearity among users in the multicast group would decrease, which can result in the increase in interference and the decrease in the worst SINR in each multicast group.In other words, with the decrease in s U , the users' SINR will be improved.Therefore, with the improvement of SINR, the system EE performance gain also increases, as shown in Figure 11a.Under the condition of ,, /  Figure 11 shows the change trajectory of the system EE and SE versus the different U s .It can be seen that with the increase in U s , the system EE and SE show a downward trend.This is because the communication rate of each multicast group is constrained by the user with the worst SINR in the multicast group.With the increase in U s , if the users' channel vectors in the multicast group remain collinear, the EE and SE would remain unchanged.However, according to the rules of the user preselection and scheduling, with the increase in U s , the collinearity among users in the multicast group would decrease, which can result in the increase in interference and the decrease in the worst SINR in each multicast group.In other words, with the decrease in U s , the users' SINR will be improved.Therefore, with the improvement of SINR, the system EE performance gain also increases, as shown in Figure 11a.Under the condition of U l,p /U s = 2, U l,p = 4, when U s = 2, the EE performance gain is 21.05% higher than when U s = 4.In this simulation, we set three comparison algorithms, i.e., the optimal design algorithm, the alternating minimization algorithm based on the phase extraction (PE-Altmin) algorithm and the orthogonal matching pursuit (OMP) algorithm.The optimal design algorithm refers to the numerical simulation result of the hybrid beamforming matrix F .It can be seen that the system performance of the MM-AltOpt algorithm is slightly lower than that of the optimal design algorithm.Meanwhile, in Figure 12a, when  are the number of iterations of the corresponding algorithm.In conclusion, we can see that the complexity of the proposed MM-AltOpt algorithm is close to that of the other two algorithms, however, the system EE performance is higher, which can verify the effectiveness of the proposed algorithm.Figure 12 shows the performance comparison of different algorithms for solving F BB and F RF .In this simulation, we set three comparison algorithms, i.e., the optimal design algorithm, the alternating minimization algorithm based on the phase extraction (PE-Altmin) algorithm and the orthogonal matching pursuit (OMP) algorithm.The optimal design algorithm refers to the numerical simulation result of the hybrid beamforming matrix F. It can be seen that the system performance of the MM-AltOpt algorithm is slightly lower than that of the optimal design algorithm.Meanwhile, in Figure 12a, when P T = 10 W, we can see that the system EE performance gain of the proposed MM-AltOpt algorithm is improved by about 2% and 5%, respectively, compared with the PE-Altmin algorithm and the OMP algorithm.In addition, from the perspective of algorithm complexity, the complexities of the MM-AltOpt algorithm, PE-Altmin algorithm and OMP algorithm are O I MM N 3 + I Inner 2NL 2 + NL , O I PE N 3 + L 3 + NL and O I OMP L 4 N + L 2 + N 2 L 2 + 2L 3 , respectively, where I MM , I Inner , I PE and I OMP are the number of iterations of the corresponding algorithm.In conclusion, we can see that the complexity of the proposed MM-AltOpt algorithm is close to that of the other two algorithms, however, the system EE performance is higher, which can verify the effectiveness of the proposed algorithm.Figure 12 shows the performance comparison of different algorithms for solving BB F and RF F .In this simulation, we set three comparison algorithms, i.e., the optimal design algorithm, the alternating minimization algorithm based on the phase extraction (PE-Altmin) algorithm and the orthogonal matching pursuit (OMP) algorithm.The optimal design algorithm refers to the numerical simulation result of the hybrid beamforming matrix F .It can be seen that the system performance of the MM-AltOpt algorithm is slightly lower than that of the optimal design algorithm.Meanwhile, in Figure 12a, when

Conclusions
In this paper, we investigated the robust joint user scheduling and hybrid beamforming design scheme for the massive MIMO LEO satellite multigroup multicast communication system.The scheme design considered the limited transmission power of the LEO satellite and the requirement of QoS and analyzed the influence of residual Doppler shift and phase disturbance on CSI errors.On this basis, taking the system EE as the optimization objective, we focused on the robust joint user scheduling and hybrid beamforming design.To reduce the complexity of the algorithm, we proposed the user preselection step, which can significantly reduce the system complexity while ensuring the system performance.For the nonconvex problem of the objective function, we adopted the CCCP framework after transforming the optimization problem into the DC programming problem.For the rank-one constraint, we proposed the penalty iterative algorithm.Finally, to obtain the digital and analog beamforming matrices, we adopted the MM-AltOpt algorithm.
Numerical results indicated that the proposed algorithm can effectively improve the system EE.The EE performance gain of the proposed robust algorithm was improved by nearly 10% compared with the traditional nonrobust algorithm.Meanwhile, the EE performance gain of the proposed joint user scheduling and hybrid beamforming design algorithm was improved by nearly 40% compared with the traditional decoupling design algorithm.In conclusion, the robust joint user scheduling and hybrid beamforming design algorithm proposed in this paper can significantly improve the system EE performance.
RF links and covers L multicast groups and K active users, where , K L K N  .Let the multicast group covered by the th l beam be l U and the number of users in this multicast group be l U , assuming that the number of users that can be accommodated in each DVB-S2X frame is s U .It should be noted that each user terminal belongs to only one multicast group, i.e.,

Figure 1 .
Figure 1.Transmission model of the massive MIMO LEO satellite multigroup multicast communication system.

Figure 1 .
Figure 1.Transmission model of the massive MIMO LEO satellite multigroup multicast communication system.

Figure 2 .
Figure 2. Service process of the LEO communication satellite multigroup multicast transmission based on the DVB-S2X.The received signal , kl y of the th k user in the th l multicast group can be expressed

F
should meet the unit modulus element [19], i.e., ( ) represents the additive Gaussian white noise, which is related to the Boltzmann constant  , system bandwidth B and the noise temperature T .For the convenience of analysis, we set 12 = [ , ,..., ]

Figure 2 .
Figure 2. Service process of the LEO communication satellite multigroup multicast transmission based on the DVB-S2X.

Figure 3 .
Figure 3. Geometric diagram of LEO satellite's orbital motion relative to the user terminal.
earth radius, r represents the distance between the LEO satellite track point and the earth center, ( ) 0 tt  − represents the angular distance of the earth's surface along the LEO satellite trajectory from instantt to instant 0 t ,  represents the angular velocity of the LEO satellite and paths is different, which is mainly caused by the movement of user terminals and surrounding scatterers.The power spectrum of Jakes power spectrum model, and the normalized power

Figure 3 .
Figure 3. Geometric diagram of LEO satellite's orbital motion relative to the user terminal.

1 )
, and φ f rsd d follows a real-valued Gaussian distribu-

Figure 4 .
Figure 4. Schematic diagram of the hierarchical clustering algorithm.

Figure 5 .
Figure 5. Schematic diagram of multibeam coverage with 7 multicast groups.

Figure 4 .
Figure 4. Schematic diagram of the hierarchical clustering algorithm.

Figure 4 .
Figure 4. Schematic diagram of the hierarchical clustering algorithm.

Figure 5 .
Figure 5. Schematic diagram of multibeam coverage with 7 multicast groups.Figure 5. Schematic diagram of multibeam coverage with 7 multicast groups.

Figure 5 .
Figure 5. Schematic diagram of multibeam coverage with 7 multicast groups.Figure 5. Schematic diagram of multibeam coverage with 7 multicast groups.

l 2 ∀k 2 with 2 can
, l, we replace the concave function η k,l − SINR min l its first-order Taylor expansion, which is reflected in line 3 of Algorithm 1.The first-order Taylor expansion of η k,l − SINR min l be expressed as: and W(0) , calculate ζ (0) and SINR min, on the optimization solution{W l } L l=1

2 and λ 3 >
0 is the penalty factor, which is generally larger enough to ensure that a smaller value of Tr(W l ) − λ max (W l ) can be obtained.According to (74), the iterative calculation of the problem Q 9 can maximize function F η, W, SINR min l , ζ and minimize function Tr(W l ) − λ max (W l ).

is 2 L
∑ l=1 |U l | + 2L, the number of convex constraints is L ∑ l=1 |U l | and the number of linear constraints is L ∑ l=1 |U l | + 3L.Let the number of iterations in the penalty iteration algorithm and the CCCP algorithm be I p and I c , respectively.In conclusion, the overall complexity of the proposed algorithm is O LK 2 N + I p I c 2 L ∑ l=1 |U l | + 2L 2 L ∑ l=1 |U l | + 3L .

.Figure 6 .
Figure 6.The symbols (circles, squares and triangles) represent the users in different the multicast group.The red circles represent preselected users and scheduling users.

Figure 6 .
Figure 6.Design process of joint user scheduling and beamforming with the user preselection.

Figure 6 .
Figure 6.Design process of joint user scheduling and beamforming with the user preselection.

Algorithm 3 :
Design algorithm of the digital beamforming matrix and the analog beamforming matrix.Main algorithm: MM-AltOpt algorithm.Input: Hybrid beamforming matrix F, initial: F (n=0) RF ∈ F, iteration index n = 0, threshold ε 3 = 10 −3 , the solution of the objective function of the problem P 1 in the nth iteration is δ RFaccording to the inner algorithm.4. Set n = n + 1.5.endOutput: F RF ,F BB , normalize F BB = √ P T F RF F BB F F BB .Inner algorithm: Algorithm for solving the analog beamforming matrix.Input: Hybrid beamforming matrix F, F (n) BB , F (q=0) RF ∈ F, iteration index q = 0, threshold ε 4 = 10 −3 ,the solution of the objective function of the problem P 5 in the qth iteration is δ Figure7shows the convergence trajectory of the EE of the massive MIMO LEO satellite multigroup multicast communication system, versus the number of iterations for different CSI errors, different numbers of preselected users and different scheduling algorithms.In this simulation, two groups of channel errors are set according to Refs.[23,29], i.e., σ 2f rsd d = 25, σ 2 φ τ = 10 and σ 2 f rsd d = 20, σ 2 φ τ = 5.In addition, we set U s = 2 and P T = 50 W.Meanwhile, we set two different numbers of the preselected users, i.e., U l,p /U s = 2, U l,p = 4 and U l,p /U s = 3, U l,p = 6.It can be seen that the proposed robust algorithm has higher performance gain than the traditional nonrobust algorithm, which shows the effectiveness of the robust algorithm.Meanwhile, it can be seen that when σ 2 f rsd d = 25, σ 2 φ τ = 10 and σ 2 f rsd d = 20, and σ 2φ τ = 5, the EE performance gain of the proposed robust algorithm is improved by 9.8% and 6.7%, respectively, compared with the traditional nonrobust algorithm.The system EE of the proposed joint user scheduling and hybrid beamforming design algorithm is higher than that of the decoupling design algorithm, and the more preselected users, the higher performance improvement.
set two different numbers of the preselected users, i.e., == .It can be seen that the proposed robust algorithm has higher performance gain than the traditional nonrobust algorithm, which shows the effectiveness of the robust algorithm.Meanwhile, it can be seen that when2

Figure 7 .
Figure 7. Convergence trajectory of system EE relative to different CSI errors, different number of preselected users and different scheduling algorithms.

Figure 7 .
Figure 7. Convergence trajectory of system EE relative to different CSI errors, different number of preselected users and different scheduling algorithms.

Sensors 2022 , 32 Figure 8 .
Figure 8.Comparison of system EE of different algorithms with different transmission power thresholds T P .

Figure 8 .
Figure 8.Comparison of system EE of different algorithms with different transmission power thresholds P T .

Figure 9 .
Figure 9. Change trajectory of system SE versus different transmission power thresholds T P .

Figure 9 .
Figure 9. Change trajectory of system SE versus different transmission power thresholds P T .

Figure 10 .
Figure 10.Comparison of SE of different multicast groups.

,
the EE performance gain is 21.05% higher than when 4 s U = .

Figure 10 .
Figure 10.Comparison of SE of different multicast groups.

Figure 11 .
Figure 11.Change trajectory of system performance versus different s U . (a) Change trajectory of system EE versus different s U ; (b) change trajectory of system SE versus different s U .

Figure 12
Figure 12 shows the performance comparison of different algorithms for solving BB F see that the system EE performance gain of the proposed MM-AltOpt algorithm is improved by about 2% and 5%, respectively, compared with the PE-Altmin algorithm and the OMP algorithm.In addition, from the perspective of algorithm complexity, the complexities of the MM-AltOpt algorithm, PE-Altmin algorithm and OMP algorithm are

Figure 11 .
Figure 11.Change trajectory of system performance versus different U s .(a) Change trajectory of system EE versus different U s ; (b) change trajectory of system SE versus different U s .

Figure 11 .
Figure 11.Change trajectory of system performance versus different s U . (a) Change trajectory of system EE versus different s U ; (b) change trajectory of system SE versus different s U .
see that the system EE performance gain of the proposed MM-AltOpt algorithm is improved by about 2% and 5%, respectively, compared with the PE-Altmin algorithm and the OMP algorithm.In addition, from the perspective of algorithm complexity, the complexities of the MM-AltOpt algorithm, PE-Altmin algorithm and OMP algorithm are

Figure 12 .
Figure 12.Comparison of system performance of different algorithms for solving F BB and F RF .(a) Comparison of system EE of different algorithms for solving F BB and F RF ; (b) comparison of system SE of different algorithms for solving F BB and F RF .