1. Introduction
Due to the Internet’s widespread use and the rocket launch and satellite mass production technology, the LEO satellite network has grown to be a significant component of the mobile communication network. Recently, the Third Generation Partnership Project (3GPP) standards group is working on the convergence of terrestrial mobile communication network and satellite networks to support fifth generation (5G) networks [
1]. Currently, many companies have put forward their own plans to construct LEO satellite constellations. For example, SpaceX has launched more than 3000 satellites to provide Internet service to people on the ground. Among them, the latest batch of satellites are equipped with laser terminals to establish inter-satellite links (ISLs) with neighboring satellites. The system architecture of a LEO satellite network system, as shown in 
Figure 1a, consists of the earth stations (ESs), the satellite network, and the user terminals(UTs). Satellites that can establish links with ESs are called visible satellites and the communication links connecting ESs to their visible satellites are called feeder links. One critical function of satellites is to transfer data from satellite networks to ground core network through earth stations (ESs).
Compared with the traditional SISO(single-input single-output), MIMO communication technology has the advantages of higher transmission rate, spectrum utilization, and power efficiency [
2]. Recently, many researchers have proposed to use MIMO technology at Ka-band to build a high-speed feeder link between satellite and ES [
3]. Nevertheless, radio propagation at ka-band suffers from atmospheric attenuation mechanisms in the troposphere, such as rain absorption and scattering. Rain attenuation is a phenomenon relative to the rate and frequency of rainfall that can lead to increased path losses, decreasing coverage areas and thus deteriorating system performance. Consequently, the capacity of feeder links in different channel environments will be different. This will lead to inadequate utilization of feeder link bandwidth. For instance, some feeder links may be severely affected by regional rain attenuation and have their capacity reduced, while the the adjacent satellite feeder links still have spare/available bandwidth.
In this paper, we first analyze the capacity of feeder links at Ka-band. Then, we propose the Cooperative Data Downloading (CDD) scheme, in which the traffic converges to the satellites with feeder link through the ISLs. More exactly, the ISLs are used to assist in the data download process by scheduling packets between different feeder links, thus achieving better load balancing. A MFMC routing algorithm is presented to calculate routes for traffic demands. Two linear programs (LPs) are required by the routing algorithm, the first LP finds a maximum flow between the visible satellites, and the second LP minimizes the routing cost given a fixed maximum flow. The above process is repeated until all capacities of feeder links are completely utilized, or until all data from visible satellites can be downloaded to the ESs. In this case, the increased packet delay or packet loss rate caused by a long queue of data in those feeder links with insufficient bandwidth can be alleviated.
At present, many routing algorithms have been proposed to solve the problem of data transmission on LEO satellite networks. In 2001, E. Ekici et al. depicted the network structure of the satellite as a virtual logical topology [
4]. By introducing virtual topology, the high dynamic of satellite network can be masked when calculating the route, so that the lightweight goal of routing algorithm can be achieved. Using this work as a foundation, Q. Chen improved the traditional method and proposed a better link connection strategy between satellites to resolve the asynchronous switches caused by the phase difference between satellite orbit planes [
5]. Afterwards, J. Li etc. proposed a temporary netgrid model to describe the topology of LEO satellite networks, and utilized the shortest path calculation between grids instead of the shortest path calculation between satellite nodes to make the routing algorithm more efficient [
6]. Furthermore, aiming at the challenges of routing protocol convergence and network maintenance in software defined satellite networks, an elastic routing framework is proposed to establish a stable forwarding path for signaling in Ref. [
7]. To find the best nexthop for forwarding a packet in the LEO satellite network, Beatriz Soret et al. proposed a new routing protocol named GomHop in which the packet is forwarded to the target orbit as soon as possible [
8]. Nevertheless, none of these studies have ever taken into account the feeder links’ constrained capacity, which is a significant bottleneck for the throughput of the LEO constellation networks. In order to increase system throughput, we investigate in this paper how to employ the ISLs to fully use the feeder link.
To increase the throughput of the LEO satellite network, feeder link resources must be utilized effectively. In 1996, Gooley et al. first modeled the Satellite Range Scheduling Problem (SRSP) for time window allocation between satellites and ground stations, when satellites need to transmit data to the ground, and developed an automated scheduling module using mixed-integer programming [
9]. In 2004, L. Barbulescu et al. independently proposed the Single-Resource Range Scheduling (SiRRS) based method and the Multi-Resource Range Scheduling (MuRRS) based method after thoroughly examining the SRSP problem [
10]. ISLs-based methods for downloading data to ESs have been proposed in recent years. In Ref. [
11], Lv Tao et al. considered that the traditional satellite scheduling algorithms have very limited improvement on the system throughput due to the inconsistency between the amount of data to be downloaded and the contact time of the satellite. Therefore, a data downloading scheme based on ISLs is proposed to make the satellite download more data during the contact time. On this basis, X. Jia et al. proposed an iterative optimization strategy to calculate the data transmission demand between satellites according to the length of contact time of each satellite, and modeled the data offloading between satellites as a bipartite graph maximum matching problem in Ref. [
12]. In Ref. [
13], M. Zhang et al. focused on the energy constraints on satellite data transmission, and modeled the satellite network topology as a transmission graph that varies according to the power level of ISLs. An efficient data offloading algorithm was proposed based on the transmission graph in their research. Nevertheless, the majority of these studies are principally concerned with offloading data for remote sensing observations and cannot be adapted to LEO satellite network broadband communication due to a variety of constraints, including latency, priority, bandwidth, etc.
In this paper, we present the Cooperative Data Downloading (CDD) routing scheme to overcome the problem that a high quantity of user data must be downloaded and feeder link resources are not balanced in a large-scale broadband LEO satellite. Following is the primary creative contribution of this paper:
- As far as we know, we are the first to use ISLs to facilitate data download in the large-scale broadband LEO satellite network; In order to increase system throughput, traffic is sent through ISLs to satellites with substantial feeder link resources in our scheme; 
- Secondly, we model the topology of LEO satellites network and analyze the data transmission requirements between visible satellites; 
- Finally, we present a maximum-flow minimum-cost routing algorithm to calculate routes for traffic demands, and implemented our algorithm on satellite router. 
The remainder of the paper is structured as follows: In 
Section 2, the system model is presented along with the optimization target of maximizing feeder link bandwidth usage. In 
Section 3, we describe the ISLs-based collaborative data download scheme. In 
Section 4, we conduct a simulation experiment and analyze the numerical results. Finally, the paper is summarized in 
Section 5.
  2. System Model and Problem Formulation
In this paper, we mainly consider the data downloading problem in a polar orbit constellation. We consider the constellation with 
N orbits, where each orbit contains 
M satellites in the space segment. The angle between the orbital planes is 
. The LEO satellite network is modeled as a undirected weighted graph, where 
 is the set of satellites and 
 is the set of wireless links, including ISLs and feeder links. Let 
 denote the ISL established by satellites 
 and 
. Intra-orbit ISLs can always be maintained, however inter-orbit ISLs in the earth’s polar area will be momentarily disconnected when a satellite travels through the region and will be re-established after exiting the zone. There is an orbital seam between neighboring orbits because the satellites in these two orbits are moving in opposite directions. The relative speed of the satellites on both sides of the orbital seam is too fast, which leads to the high cost of link establishment [
14], hence this paper assumes that satellites on either side of the orbital seams do not establish.
Furthermore, two satellites in neighboring orbits will swap relative positions after traveling over the polar areas, as shown in 
Figure 2a. When the upper part of 
Figure 2a is rotated 180 degrees, the topology of the network translates to 
Figure 2b. In this way, the topology of the LEO satellite network can be modeled as a set two-dimensional mash structure. The dynamic nature of a network topology is principally driven by the establishment of inter-orbit ISLs, as satellites enter and exit the polar area, and by the interference that prevents laser terminals from establishing ISLs due to a high bit error rate.
Let  represent the set of visible satellites that are located above the ESs and can establish feeder links with the ESs. When the satellite  can establish the feeder link  with the ground station, the satellite will join the set V. When the feeder link of the satellite is disconnected, it will exit the set V. There are many ESs scattered around the globe. Each ESs is equipped with k MIMO antenna arrays, each of which can establish a feeder link to a visible satellite. Due to the fact that all satellites follow their own orbits, their contact time windows with ESs (i.e., the start and finish times of windows) are known. On account of that the satellite travels with period T, the satellite network topology is likewise regularly transformed with period T. This paper separates the satellite operating period into time slots of the same size and duration of , such that the timeline can be represented as 0, 1, 2, …, T. The network topology in each time slice is fixed so as to eliminate the influence of satellite network dynamics on routing algorithm.
  2.1. Feeder Link Model and Capacity
Compared with the traditional SISO, MIMO communication technology has the advantages of higher transmission rate, spectrum utilization, and power efficiency. Recently, many scholars proposed to use MIMO technology at Ka-band to construct high-speed feeder link between satellites and ESs. However, ka-band propagation is influenced by atmospheric attenuation mechanisms from the troposphere, which will result in a difference in feeder link capacity. In this section, we elaborate MIMO channel model at Ka-band and the impact of rain attenuation on feeder link capacity.
As shown in 
Figure 3, a MIMO system can be composed of 
 receiving antennas and 
 transmitting antennas between the satellite and the ES. Let 
 denote the channel matrix between the satellite and the ES. The input-output relation in a MIMO channel for constructing the feeder link can be expressed as [
15]: 
        where 
 is the average energy of the symbols. 
 is the 
 noise vector of independent ZMCSCG random variables, 
 is the 
 vector of signals transmitted from each antenna and 
 is the 
 vector of signals received by each antenna.
Some studies researched by other scholars have led to the “log-det” capacity formula in Ref. [
16]. In this paper, the channel capacity of a MIMO channel can be obtained from the channel matrix: 
In (2), 
 is the covariance matrix of the transmitted signal vector 
s and 
 represent the 
 identity matrix. Once the distribution of 
H is known, the maximum data rate 
C that the channel can transmit can be calculated. The element 
 of 
 denotes the channel parameter from the 
jth transmitting antenna to the 
ith receiving antenna. The propagation and scattering impacts are considered in this paper. For convenience, 
 is written as [
17].
        
        where 
 denotes path loss and 
 is a complex random variable, which denotes the fading coefficient induced by random multipath propagation or the incoherent scattering in the troposphere. When the signal of satellite feeder link passes through the atmosphere, it will be absorbed and scattered by oxygen molecules and water vapor condensates in the troposphere, resulting in atmospheric loss. The extreme rain attenuation will reduce feeder link channel capacity and cause bandwidth imbalance of feeder links.
  2.2. Problem Formulation
In the scenario studied in this paper, each visible satellite will receive data, which needs to be downloaded to the ES, denoted by 
. Given the download data rate of each visible satellite, a set of satellites and an ES, our challenge is to design a strategy that allows visible satellites to utilize ISLs to transfer data from satellites with high data loads to those with light loads, so that the bandwidth resources of feeder links can be fully used and the LEO system’s download throughput is maximized. Let 
 denotes the actual capacity of the feeder link of satellite 
, and it also denotes the maximum data rate that can be downloaded to the ES through this feeder link. The objective of our optimization is to minimize the packet loss rate due to feeder link congestion, which can be given by: 
        where
        
For simplicity, the following assumptions are made:
- The size of data packets delivered across all satellites is the same. Under this assumption, the amount of data can be expressed in terms of the number of packets; 
- Each satellite is equipped with four laser terminals to establish ISLs with four adjacent satellites in the same orbit and different orbits, and a wireless signal terminal to establish a feeder link with ESs. These links work in simplex mode. 
  3. The Proposed Downloading Scheme
  3.1. Overview of Our Solution
Our scheme for maximizing the satellite system throughput of downloading data is to transmit data between visible satellites via ISLs, so as to reduce the load of feeder links with insufficient bandwidth. In order to allocate the feeder link bandwidth resources, we must determine the amount of data rate this satellite can receive from others or need to transfer to others. To get the traffic demands, we calculate the sender and receiver of inter-satellite transmission. Then, a constrained MCMF routing algorithm is described to compute maximum-flow routing for all traffic demand needs inside a network’s capacity area subject to routing cost limitations. Two linear programs (LPs) are required by the routing algorithm. The first LP finds a maximum flow between the visible satellites, and the second LP minimizes the routing cost given a fixed maximum flow. After one round of calculations, some feeder lines may have capacity surpluses, but others may have data waiting to be downloaded. Therefore, the above process needs to be iterated until all data from visible satellites can be downloaded to the ES or all feeder links have no spare capacity.
  3.2. Calculating the Source and Sink of Intersatellite Transmission
To utilize the ISLs to aid the feeder link in downloading data, it is important to first define the traffic transmission requirements, mainly including the source satellite, destination satellite, and the rate of data transmission. Then, we can calculate the route between visible satellites according to these requirements.
The set of real numbers is denoted by . Let  represent the traffic demand matrix, where each element represents the required data transmission rate of two visible satellites. At the beginning of each time slice , each satellite updates the capacity and data transmission rate of each link, and the topology of the network changes along with it.
When 
, the satellite 
 can receive additional download data stream. Let 
 denote the rate of this stream and let 
 denote the set of such satellites. On the basis of the capacity of this satellite’s ISLs and feeder link, 
 can be calculated as follows: 
        where 
 denotes the input ISLs capacity of node 
 and 
 denotes the data transmission rate of the link 
.
When 
, satellite 
 needs to transfer data stream 
 to other visible satellites. Let 
 denote the rate of this stream and let 
 denote the set of such satellites. On the basis of the capacity of this satellite’s ISLs and feeder link, 
 can be calculated as follows: 
        where 
 denotes the output ISLs capacity of node 
 and 
 denotes the data transmission rate of the link 
.
We arrange the satellites in descending order based on the  and  of each satellite in the  and  set. The  between the two satellites is utilized to calculate the element of the source-to-destination demand flow rate . Consequently, matrices D of the traffic demand are obtained.
  3.3. The Minimum-Cost Maximum-Flow Algorithm
This section will introduce the MCMF routing algorithm. As shown in 
Figure 4, the two minimum cost paths between (9, 12) are 
 and 
, which are identified by the blue arrows and lines. The maximum flow allowed between (9, 12) in 
Figure 4 includes four paths, and the other maximum flow paths with larger costs are identified by black arrows and lines. The traditional multi-data stream maximum flow linear programming problem [
18] has the following three problems: 1. The possibility of data transmission on a shorter or longer path is the same, so the transmission cost cannot be minimized; 2. Because the transmission cost cannot be minimized, it may cause loopback; 3. On-board computing resources are limited, it is also difficult to solve the LPs problem in a timely manner even if in a limited network size.
In order to solve the above-mentioned problems, the distance bandwidth product (BD) is proposed as the link cost, allowing shorter paths to be planned for larger traffic, minimizing transmission costs, and preventing loopback. Secondly, each data stream’s computation is constrained by subgraph restrictions depending on energy and computational resources, thereby greatly reducing the computational complexity. Once a subgraph  is generated for every flow , the Constrained MCMF algorithm including two LPs can be formulated.
  3.4. The Maximum-Flow LP and the Minimum-Cost LP
In the satellite network topology map , we need to calculate the distance-constrained subgraph  through the traffic demand matrix . First, the Dijkstra algorithm is used to compute the shortest path from all satellites to other satellites in the visible area of an ES. Then, a set of candidate links and nodes is initialized to NULL. Let  denote the number of hops in the shortest path between the pair of source and destination . After that, the algorithm traverses other intermediate nodes in the area, using  to represent the number of hops of the shortest path of , and an appropriate hop threshold  can be taken based on the computing resources of the network. If the satellite v satisfies the inequality , the satellite v and its four ISLs as the element of the subgraph node set  and edge set . Once the subgraph for each data flow has been calculated, two linear programmings can be formulated to the MCMF algorithm.
By constraining the data flow distance between each pair of source/destination 
, we can obtain the matching subgraph. The linear programming issue of the greatest data flow of multiple sources and sinks constrained by the subgraph 
 is represented by Equation (
8).
   Let  denote the traffic rate of commodity c on link e. Let ,  and  denote the in/out overall traffic rate of ISLs and the traffic rate of the feeder link in satellite v, respectively. Let  denote the request flow c of the matrix D. Equation (8a)–(8c) require that the sum of all commodity traffic rates on a link e is less than the link capacity . Equation (8d)–(8f) are limited to the subgraphs for each commodity flow. Equation (8g) ensures the receive rate of each commodity at most . The maximum flow of multiple source-destination in the network can be attained by solving this LP.
To minimum the routing cost, the second linear programming problem 
 is represented by Equation (
9).
Let  denote the maximum flow of the commodity c obtained by solving the maximum flow LP, and . In the above formula, Equation (9a) requires the fixed rate of each commodity flow obtained by . The remaining resemble the preceding constraints. The objective of our function is to minimize the sum of flow rates  multiplied by link cost . In our scheme, the greedy approach is utilized to solve these two LPs in order to achieve the maximum flow and minimum cost routing.
  3.5. Iterating the Algorithm
After finishing a round of computations in the preceding sections, it is necessary to update and iterate the network topology because the link status changes, as shown in 
Figure 5. In order to reduce the iteration rounds and speed up the convergence speed, if the satellites in the 
 set 
, then the satellites exit the set 
, changing from a red node to a white node. If the satellites in the 
 set 
, they will also exit the set 
, changing from a blue node to a white node. The rest of the nodes of set 
 and 
 repeat the preceding calculation procedure until the number of nodes in set 
 or 
 is NULL, that means, the feeder resources are completely filled and the throughput reaches its maximum. The algorithm terminates the iterative process until the next time slot 
, at which point it will begin again.
The flow chart of this algorithm is shown in Algorithm 1.
        
| Algorithm 1: Downloading Algorithm | 
| ![Sensors 22 07986 i001 Sensors 22 07986 i001]() |