You are currently on the new version of our website. Access the old version .
EntropyEntropy
  • Article
  • Open Access

10 December 2023

A Loss Differentiation Method Based on Heterogeneous Ensemble Learning Model for Low Earth Orbit Satellite Networks

,
,
and
1
Communication and Network Laboratory, College of Information Engineering, Dalian University, Dalian 116622, China
2
School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China
*
Authors to whom correspondence should be addressed.

Abstract

In light of the high bit error rate in satellite network links, the traditional transmission control protocol (TCP) fails to distinguish between congestion and wireless losses, and existing loss differentiation methods lack heterogeneous ensemble learning models, especially feature selection for loss differentiation, individual classifier selection methods, effective ensemble strategies, etc. A loss differentiation method based on heterogeneous ensemble learning (LDM-HEL) for low-Earth-orbit (LEO) satellite networks is proposed. This method utilizes the Relief and mutual information algorithms for selecting loss differentiation features and employs the least-squares support vector machine, decision tree, logistic regression, and K-nearest neighbor as individual learners. An ensemble strategy is designed using the stochastic gradient descent method to optimize the weights of individual learners. Simulation results demonstrate that the proposed LDM-HEL achieves higher accuracy rate, recall rate, and F1-score in the simulation scenario, and significantly improves throughput performance when applied to TCP. Compared with the integrated model LDM-satellite, the above indexes can be improved by 4.37%, 4.55%, 4.87%, and 9.28%, respectively.

1. Introduction

Satellite communication has the advantages of long communication distance, wide coverage, and no geographical restrictions, and plays an irreplaceable role in current and future communication systems [1]. Satellite networks operate in a wireless environment that is highly susceptible to solar activity, cosmic particle radiation, noise interference, etc., resulting in a bit error rate ranging from 10−8 to 10−5. Moreover, the Earth’s atmosphere, ionosphere, and ice crystal rainfall layer also affect satellite-to-ground links, leading to higher bit error rates, especially in military scenarios where human interference can cause even higher rates exceeding 10−5. In traditional TCP, packet loss is considered as an indication of network congestion by default. Therefore, the congestion avoidance mechanism needs to be enabled, which reduces the congestion window. However, in satellite networks, apart from packet losses caused by congestion, it is crucial not to ignore packet losses caused by channel errors. If network congestion is the default, the congestion window will be reduced, resulting in inefficient utilization of network bandwidth and compromising network reliability.
In light of these issues, extensive research has been conducted by numerous scholars [2,3]. From a measurement acquisition perspective, packet loss differentiation can be categorized into explicit and implicit methods. Explicit loss differentiation primarily relies on the active queue management mechanism of routing nodes to determine the cache queue occupancy and provide feedback to the sender. This approach is more direct and accurate but requires support from intermediate nodes. Implicit loss differentiation mainly utilizes end-to-end round-trip time (RTT) [3], relative one-way trip time (ROTT) [4,5], packet inter-arrival time (IAT) [6], loss number [5], and other decision mechanisms to identify types of packet loss [7,8,9,10]. It does not require the support of nodes other than the sender and receiver, and can be implemented with minimal changes in the network.
Implicit loss differentiation can further be divided into threshold-based and learning algorithm-based loss differentiation methods. Packet loss differentiation based on threshold determination involves utilizing statistical analysis and calculation of feature parameter values, such as RTT, ROTT, IAT, etc., to characterize the network state. By comparing these feature parameters with predefined thresholds, the type of packet loss can be determined. Although this algorithm has low complexity, it heavily relies on experiential threshold settings. Additionally, relying solely on a single feature cannot achieve high determination accuracy and defining the relationship between multiple feature thresholds is challenging, which ultimately affects the accuracy of packet loss differentiation.
In machine learning-based packet loss discrimination algorithms, decisions are made using models that learn correlations between features. Among them, decision trees (DTs), random forests (RFs), artificial neural networks (ANNs), K-nearest neighbors (KNNs), naive Bayes (NB), and other single classification models have become the mainstream methods for packet loss differentiation. However, the accuracy of differentiation in these single classification models is significantly affected by the number and features of samples, and their accuracy may not reach optimal levels.
Ensemble learning is a classification method that combines results from multiple individual classifiers to obtain a final result, typically achieving better classification performance than using a single classifier alone while also offering greater diversity compared to homogeneous integration methods. Based on literature research findings, existing loss differentiation algorithms lack feature selection methods, individual classifier selection strategies, and effective ensemble strategy for accurate packet loss differentiation.
Therefore, we proposed a novel loss differentiation algorithm based on heterogeneous ensemble learning. The algorithm employs ROTT along with average ROTTs, ROTT deviations, and IAT statistics (including maximums, minimums, and averages) as features for loss differentiation. The Relief method, combined with mutual information techniques, is utilized for feature selection. Q statistics and double failure (DF) value are used for the selection of individual classifiers in ensemble learning, including least-squares support vector machine (LSSVM), DT, logistic regression (LR), and KNN algorithm. The integration strategy of the heterogeneous ensemble learning model is designed using the stochastic gradient descent (SGD) method, with dynamically adjusted voting weight vectors for each classifier to enhance classification accuracy.
The rest of this paper is organized as follows. We analyze the related works on loss differentiation algorithms from threshold determination-based loss differentiation and learning algorithm-based loss differentiation methods in Section 2. In Section 3, we employ the Relief and mutual information method to construct a feature set that distinguishes packet loss in LEO satellite networks. In Section 4, we propose a heterogeneous ensemble learning model for discriminating packet loss in satellite networks. In Section 5, the loss differentiation method based on heterogeneous ensemble learning (LDM-HEL), loss detection method (LDM)-satellite [11], and LLD method [12] are used as comparative experiments for simulation, which proves the superiority of the algorithm proposed in this paper. Finally, Section 6 concludes the paper.

3. Loss Differentiation Feature Selection for LEO Satellite Network

Utilizing machine learning techniques to identify the causes of packet loss, our first task is to extract relevant features from the time information of packet transmission.

3.1. Loss Differentiation Features for LEO Satellite Network

The ROTT is defined as the time difference between sending a packet at the sender and receiving it at the receiver. Accurate time synchronization of end-to-end satellite nodes is required to obtain this parameter. In congested areas, routers experience an increase in ROTT due to a large number of packets being queued in the cache for forwarding. When incoming packets exceed the cache capacity, packet loss occurs as they cannot enter the cache. However, wireless loss exhibits a random state and does not significantly affect the ROTT. Therefore, analyzing ROTT is crucial for identifying the causes of packet loss in satellite networks. Table 1 presents features related to end-to-end ROTT.
Table 1. The features regarding the end-to-end ROTT.
The IAT refers to the duration between two adjacent packets arriving at the receiver and plays a vital role in Biaz and mBiaz schemes. These schemes consider that if packet IAT falls within a certain range, bit errors are likely causing packet loss. If a packet arrives much earlier than expected, previous packets may have been discarded from buffer memory; if it arrives much later than anticipated, queue delay might have increased in buffers. In either case, network congestion can be attributed as the cause of packet loss. Table 2 outlines features regarding IAT.
Table 2. The features regarding IAT.
Furthermore, the concept of consecutive lost packets is introduced in Table 3.
Table 3. Packet loss number.
In machine learning applications, data classification accuracy and efficiency heavily rely on feature selection methods. Filtering, encapsulation, and embedding are commonly used techniques for feature selection. The filtering method is independent of the classifier, but it scores the features based on the difference or correlation between sample data and selects features by setting a scoring threshold or specifying the number of features. Consequently, the feature subset can be determined prior to classification, demonstrating excellent adaptability. In this study, we combine Relief and mutual information methods for feature selection.

3.2. Packet Loss Differentiation Feature Selection Method

The Relief algorithm represents a typical filtering approach. Its feature scoring and selection process is decoupled from classification algorithms, making it simple, convenient, and low in complexity. However, it is only suitable for calculating binary classification feature weights. The underlying principle is as follows: Firstly, positive samples and negative samples are segregated within the training dataset. For any given sample in the training dataset, one nearest-neighbor sample is selected from both same-class and different-class samples. If a particular feature attribute present in that sample exhibits greater similarity with its nearest neighbor from the same class compared to its nearest neighbor from a different class, it indicates a higher differentiation capability of that feature attribute. Consequently, larger feature scores or weights are assigned.
Let the training set be D = { ( x i , y i ) | x i R m , m > 0 , y i { 1 , 1 } , i = 1 , 2 , , n } , where n is the number of samples in the training set, and m is the number of feature attributes contained in each sample. The sample is x i = ( x i 1 , x i 2 , , x i k , , x i m ) . Define the difference between samples x i and x j on the kth attribute as shown in Equations (1) and (2).
If the feature is scalar:
diff ( k , x i , x j ) = { 0 ,             x i k = x j k 1 ,             x i k x j k
If the feature is numeric:
diff ( k , x i , x j ) = | x i k x j k max 1 l n { x l k } min 1 l n { x l k } |
If x i , nh is the homogeneous nearest neighbor of x i , and x i , nm is the heterogeneous nearest neighbor of x i , then the iterative calculation formula for the weight of the kth feature is as follows:
θ k ( i ) = θ k ( i 1 ) diff ( k , x i , x nh ) n + diff ( k , x i , x nm ) n , i = 1 , 2 , , n
After calculation, let θ k ( n ) = θ k , and subsequently the larger θ k denotes a stronger classification ability associated with the kth feature attribute. Each individual weight of each specific feature is calculated using Equation (3) to form a vector θ = ( θ 1 , θ 2 , , θ m ) T representing all respective weights. These weights can then be arranged in descending order to determine their relative importance as well as establish corresponding arrangement order for each respective featured attribute. This determines their overall significance.
The Relief algorithm solely ranks the importance of featured attributes without reflecting correlations among them. Therefore, we utilized the mutual information algorithm to analyze feature correlations, aiming to reduce redundancy between feature attributes and optimize the final feature selection results.
The concept of information entropy quantifies the level of uncertainty associated with the occurrence of each possible event within an information source, which is defined as:
H ( X ) = i p ( x i ) × log 2 p ( x i )
where p ( x i ) is the frequency of event x i . X = { x 1 , x 2 , , x n } is the information source containing all possible states.
The conditional information entropy of X with respect to Y quantifies the additional knowledge provided by X when Y is known for a pair of associated random variables:
H ( X | Y ) = i , j p ( x i , y j ) × log 2 p ( x i | y j )
where Y = { y 1 , y 2 , , y n } , p ( x i , y j ) is the joint probability density function of the random variable ( X , Y ) , and p ( x i | y j ) is the conditional probability that x i occurs when y j occurs.
The mutual information between random variables X and Y is given as follows.
I ( X ; Y ) = H ( X ) H ( X | Y ) = i , j p ( x i , y j ) × log 2 p ( x i , y j ) p ( x i ) p ( y j )

4. Packet Loss Differentiation Model Based on Heterogeneous Ensemble Learning

Considering the classification process, the performance of different classifiers may vary with changes in the dataset. Single classifiers often exhibit high error rates, while combining multiple classifiers can effectively reduce errors and enhance model generalization ability. Hence, this paper adopts an ensemble learning model. To ensure that individual classifiers within an ensemble classifier are “good but different”, it is common practice to employ diverse types of classifiers. Research has shown that heterogeneous classifiers tend to have higher diversity compared to homogeneous ones. Moreover, using heterogeneous ensembles can help mitigate potential biases resulting from inherent assumptions in each classification method and thus improve overall diversity. Consequently, six classification algorithms (LSSVM, DT, LR, KNN, BP neural network, and naive Bayes) were selected as initial individual classifiers for heterogeneous ensemble learning; subsequently simplified based on the principle of being “good but different”.
In this study, the Q-statistic and DF value were employed to investigate the disparities among the aforementioned six individual classifiers. Let C i and C j represent two distinct classifiers. N 11 denotes the number of samples correctly classified by both classifiers, while N 00 represents the number of misclassified samples by both classifiers. Additionally, N 10 indicates instances in which C i was accurately classified and C j was misclassified. N 01 represents cases in which C i was incorrectly classified and C j was correctly classified by the classifier. The Q-statistic and DF value are subsequently computed using the following formulas.
Q i j = N 11 N 00 N 10 N 01 N 11 N 00 + N 10 N 01
DF i j = N 00 N 11 + N 00 + N 10 + N 01
According to the aforementioned equations, the Q-statistic value ranges from −1 to 1, while the DF value ranges from 0 to 1. A smaller value indicates a higher level of diversity in this pair of individual classifiers.
In order to integrate outputs from various individual classifiers within a heterogeneous ensemble learning framework and enhance learner output effectiveness in the ensemble learning algorithm, voting methods such as majority voting or weighted voting are commonly employed. The majority voting method is ineffective in utilizing the complementary information of each individual classifier. Therefore, we propose an adaptive dynamic weighted ensemble method to dynamically adjust the voting weight of each individual classifier, aiming to enhance the generalization ability of the ensemble learning algorithm for the loss cause detection model. Figure 1 illustrates the heterogeneous ensemble learning model.
Figure 1. Packet loss differentiation model based on heterogeneous ensemble learning.
Let D = { ( x i , y i ) | x i R m , m > 0 , y i { 1 , 1 } , i = 1 , 2 , , n } be the subset of features generated using the feature selection algorithm, and the Sigmoid function is adopted as the default regression function for each individual classifier, i.e.,
( x ) = 1 1 + e θ T x
Suppose that the output of the ith individual classifier is h i ( x ) , then
h i ( x ) = { 1 ,                     ( x )   0.5 1 ,               ( x )   < 0.5  
The Sigmoid function is employed to regress the output as follows:
p ( w 1 , w 2 , , w M ) = 1 1 + e i = 1 M x D i w i   ×   h i ( x )
where w i ( i = 1 , , M ) is the output weight of each individual classifier, so for any x D , we define:
H ( x ) = { 1 ,                           p ( w 1 , w 2 , , w M ) 0.5 1 ,                     p ( w 1 , w 2 , , w M )   < 0.5  
To obtain the optimal weights ( w 1 , w 2 , , w M ) , we employ the cross-entropy loss function as the ensemble learning algorithm’s loss function, which is denoted as:
Loss = i = 1 n I ( H ( x i ) = y i ) log 2 p ( w 1 , , w M ) + I ( H ( x i ) y i ) log 2 ( 1 p ( w 1 , , w M ) )
where I ( ) is the indicator function, which is 1 if the condition is true and 0 if the condition is false.
The stochastic gradient descent method is employed to dynamically adapt the voting weight vector [27] and obtain the optimal weight. The resulting output is presented as follows:
H ^ ( x ) = { 1 ,                           p ( w 1 , w 2 , , w M ) 0.5 1 ,                     p ( w 1 , w 2 , , w M )   < 0.5  
The network packet loss discrimination algorithm proposed in this paper is referred to as the LDM-HEL algorithm. In the context of packet loss discrimination addressed in this study, channel error-induced packet loss is denoted as W, with samples belonging to this category represented by +1, while congestion-induced packet loss is denoted as C, and samples belonging to this category are represented by −1.

5. Simulation and Performance Analysis

In this study, we utilized STK and NS2 to construct the simulation system for the Iridium satellite network, enabling us to simulate network packet loss scenarios and generate a comprehensive dataset. Subsequently, feature selection was performed, followed by training of a heterogeneous ensemble learning model for packet loss differentiation to validate the feasibility and effectiveness of our proposed method. Additionally, optimal parameters for relevant algorithms were determined and their rationality in terms of parameter settings is verified. Finally, in order to assess the performance advantages of our proposed method, we compared it with the LDM-satellite [11] and LLD [12] loss differentiation models under identical conditions.

5.1. Experimental Evaluation Indicators

The experimental evaluation indicators in this study encompass ACCuracy (ACC), RECall (REC), PREcision (PRE), False Alarm Rate (FAR), F1-score, Area Under Curve (AUC), and Mean Absolute Error (MAE). Higher values of ACC, PRE, REC, F1-score, and AUC are indicative of superior classifier performance. The formulas for the aforementioned parameters are provided below.
A CC = TP + TN TP + TN + FP + FN ,   REC = TP TP + FN ,   PRE = TP TP + FP ,   FAR = FP TN + FP ,
F 1 - score = 2 × REC × PRE REC + PRE ,   MAE = 1 N × i = 1 N | y i y ^ i |
where y ^ i is the predicted value, y i is the actual value, and the parameters are from the following confusion matrix, as shown in Table 4.
Table 4. Confusion matrix.
The AUC represents the area under the receiver operating characteristic curve (ROC).

5.2. Dataset and Feature Selection Process

In order to acquire packet loss data in a satellite network and subsequently evaluate the performance of loss differentiation algorithms, we employed NS2 to simulate the occurrence of packet loss in a satellite network based on the Iridium constellation. Specifically, channel error-induced packet loss was observed at the last hop of the wireless link, while congestion-induced packet loss occurred at the bottleneck link. The network topology is illustrated in Figure 2.
Figure 2. Simulation topology of satellite network.
The network parameter configuration is as follows: The satellite-to-ground link has a bandwidth of 10 Mbps, while the inter-satellite link has a bandwidth of 25 Mbps. Background traffic consists of TCP flows generated by ON/OFF sources following a Pareto distribution with shape parameters α = 1.2 , and the means were, respectively, set to μ 1 = 500 ms and μ 2 = 1300 ms. The flow rate is set at 10 Mbps, packet size at 1000 bytes, and cache size at 50 packets. To induce congestion packet loss, initially there are two TCP flows present in the system, followed by the addition of N TCP flows every 10 s. Four cases are considered for N: N = 2, 4, 6, and 8. Additionally, three different bit error rates (1%, 3%, and 5%) are employed along with a tail-dropping queue management strategy to obtain the dataset on packet loss.
These data were then utilized in the Relief algorithm to derive the weight results for each feature, as presented in Table 5.
Table 5. Feature weight calculation results.
The importance ranking of each feature is derived from the aforementioned table. Subsequently, based on their respective feature weights, they are sequentially incorporated into the classification model in descending order. The resulting classification accuracy is then examined. Notably, when the feature reaches approximately the 14th position, all algorithms achieve their highest accuracy levels, as depicted in Figure 3. Consequently, following the Relief algorithm-based feature selection, only the initial 14 features were retained for further analysis in this study.
Figure 3. Recognition accuracy of feature subset.
The Relief algorithm is capable of ranking features, but it fails to capture the interdependence between them. In order to assess the relevance of two features in terms of their informativeness given each other, we opted for employing mutual information. Consequently, we utilized this method to investigate the correlation among 14 features selected by the Relief algorithm, and the corresponding results are presented in Table 6 below.
Table 6. Feature mutual information matrix.
In the table, the bold numbers 0.96 and 1 indicate a significant correlation coefficients between F 8 and F 5 with F 7 . To further streamline the features, these two features ( F 8 and F 5 ) were eliminated in subsequent model inputs, leaving only the remaining 12 features intact.

5.3. Classifier Performance Analysis

The dataset contains a total of 31,260 samples with bit error packet loss, out of which 23,267 samples were used for testing purposes. Additionally, there are 18,680 samples indicating congestion-related packet loss, among which 6000 were utilized for testing. The ensemble learning model was trained iteratively for a total of 1820 times. During the experiment, multiple independent repeated trials were conducted on the dataset and cross-validation was performed using test sets to obtain average detection results for each dataset in order to enhance the impartiality of experimental outcomes.
To optimize the performance of the heterogeneous ensemble model, an analysis was carried out on individual learners’ predictive abilities, resulting in accuracy measures such as recall rate precision and false alarm rate being obtained for each learner (Table 7).
Table 7. Hyperparameters of individual learners and their performance in the dataset.
Table 7 reveals that the NB exhibits a low accuracy and precision rate, along with a high false alarm rate of 0.557. This indicates that when the algorithm is employed for distinguishing packet loss, a significant number of congestion loss packets are misclassified as bit error loss packets, posing substantial congestion risks to the network. Consequently, the naive Bayes method was preliminarily excluded from consideration.
Moreover, in order to achieve more accurate prediction results, it becomes imperative to select individual classifiers with greater diversity. In this study, we utilized Q statistic value and DF value to quantify the dissimilarity among individual classifiers (as presented in Table 8 and Table 9).
Table 8. Q statistics among individual classifiers.
Table 9. DF values among individual classifiers.
Furthermore, it can be observed from Table 8 that the Q statistic value of the BP neural network is 0.9, ranking second-highest. Table 9 indicates that the DF value of the BP neural network is 0.128, which is the highest among all classifiers considered. Additionally, considering algorithm complexity and false alarm rate, the BP neural network does not perform optimally in discriminating data packet loss; hence, it was excluded from the final selection scheme for individual classifiers, leaving four types: LSSVM, DT, LR, and kNN.
To validate the performance enhancement of our proposed integrated detection method LDM-HEL compared to existing methods such as LDM-satellite [11], LLD [12], Spike method, Zigzag method, and ZBS method [4,5,6]; we plotted average performance results for four detection indicators (accuracy, precision, recall, and F1-score) across five different detection methods as shown in Figure 4.
Figure 4. Performance comparison between proposed LDM-HEL and the other five classifiers.
Firstly, as depicted in Figure 4, the proposed LDM-HEL, the LDM-Satellite, and the LLD method have shown significant performance enhancements compared to conventional classifiers. Moreover, the accuracy, precision, recall, and F1-score metrics of the proposed heterogeneous ensemble method surpass those of the other five classifiers, indicating its superior comprehensive detection performance and low false negative rate in effectively identifying bit errors and packet losses.
Analyzing the causes of the performance difference, it was found that the Spike schemes only consider one packet loss feature and need to set a threshold to determine the packet loss type. The setting of the threshold is related to the characteristics of the network and the experience of the manager, which affects the packet loss classification performance. The ZigZag and ZBS schemes consider multi-features, and the packet loss differentiation ability is improved, but its threshold also needs to be set reasonably. The LDM-satellite algorithm has better performance, but it uses a homogeneous integrated model, and its classification accuracy is not optimal. RTT is employed in the LLD algorithm. The change of RTT in the low-orbit satellite network not only reflects the network congestion, but also may be caused by the change of the distance between nodes, so it is easy to cause packet loss and misjudgment. Due to the heterogeneous individual classification model and the integration strategy based on stochastic gradient descent, the accuracy rate, accuracy rate, recall rate, and F1-score indexes of the proposed algorithm are higher than those of the other five classifiers, which indicates that the proposed heterogeneous integration detection method has good comprehensive detection performance and low false negatives, and can effectively detect packet loss caused by channel error.
Furthermore, employing the heterogeneous integration architecture proposed in this paper and adjusting the approach for generating final integration results by utilizing the voting method, ACC weighting method based on accuracy assessment, as well as the SGD-based weighting method for comparison purposes (as depicted in Figure 5), reveals that the SGD-based weighting significantly enhances accuracy, precision, recall, and F1-score measurements. This enhancement is attributed to the dynamic adaptive weighted integration method introduced in the integration process, which dynamically adjusts the voting weight of each base classifier, thereby improving the generalization ability of the multi-classifier integrated detection model.
Figure 5. Performance comparison of the three weighting algorithms.
The AUC values were calculated and compared with the LDM-satellite and LLD methods to validate the comprehensive advantages of the proposed method in detecting packet loss differentiation performance. As depicted in Figure 6, the AUC value of the proposed method is 0.976, surpassing that of the other two methods (0.883 and 0.921), thereby indicating its superior overall efficacy.
Figure 6. Comparison of AUC values of LDM-HEL, LDM-satellite, and LLD algorithms.
Moreover, the proposed heterogeneous integration architecture is utilized and the resulting integrations are combined using a voting method, ACC weighting method, and SGD-based weighting method, respectively. The ROC graph in Figure 7 illustrates that the AUC value of the SGD-based weighting method (0.976) surpasses those of the other two methods (0.927 and 0.916), indicating its superiority.
Figure 7. Comparison of AUC values of voting, ACC weighting, and SDG weighting.
In order to verify the performance of the proposed method compared with the existing learning model-based packet loss discrimination methods, this experiment uses the same sampled data set to detect packet loss differentiation. The LLD, LDM-satellite, and LDM-HEL methods were selected to assess the F1-score performance and training time of different differentiation methods under various training data ratios. As shown in Figure 8, among the three differentiation methods, the LDM-satellite and LDM-HEL methods show a gradual increase with the growth of the training data ratio, and generally, the F1-score of our LDM-HEL method is superior to the other two methods. Additionally, the comparison of training time is presented in Figure 9. The training time of the three packet loss differentiation methods increases with the rise in the training data ratio. The training time of the proposed method is shorter than LLR and slightly higher than LDM-satellite.
Figure 8. Comparison of F1-score performance of three packet loss differentiation models.
Figure 9. Comparison of training time of three kinds of packet loss differentiation models.

5.4. Network Performance Analysis

The packet loss differentiation algorithm proposed in this paper requires the cooperation of the receiver because it uses information such as ROTT and IAT at the receiver. In other words, the final packet loss differentiation is completed at the receiver, necessitating the receiver to provide feedback on the judgment information to the sender. To achieve this, the explicit loss notification (ELN) mechanism is employed. When the receiver determines that the cause of packet loss is congestion loss, the ELN flag bit of the TCP packet header is set to 0, and when the receiver determines that the cause of packet loss is bit error loss, the ELN flag bit of the TCP packet header is set to 1. Accordingly, the sender adopts the corresponding congestion control window adjustment strategy.
In the experiment, the link bit error rate was set to 10−1, and LDM-HEL, LDM-satellite, and TCP-RLLD [26] packet loss discrimination algorithms were applied to TCP New Reno. We evaluated the Goodput performance of the LDM-HEL algorithm and four baseline algorithms (LDM-Satellite, TCP-RLLD, New Reno, Westwood, LA, USA) at different end-to-end packet sending rates, as shown in Figure 10.
Figure 10. Comparison of Goodput rates between LDM-HEL, LDM-satellite, TCP-RLLD, Westwood, and New Reno.
The experimental results show that New Reno and Westwood do not exhibit significant Goodput improvement when the transmission rate increases. This is mainly due to their lack of an effective packet loss differentiation mechanism. In contrast, LDM-HEL demonstrates better performance. By employing a heterogeneous ensemble learning model, it comprehensively considers more network state characteristics, fully leverages the advantages of multiple classifiers, and enhances the model’s generalization ability. This results in more accurate packet loss differentiation, helping to avoid unnecessary congestion window reduction. Compared with the LDM-satellite and the TCP-RLLD algorithms, its throughput increased by an average of 9.28% and 19.73%, respectively.
We also compare the performance of each algorithm under different bit error rates, and the results are shown in Figure 11. We set the packet sending rate to 1250 packets/sec. In the case of a low packet loss rate, the Goodput of all protocols is relatively high. With the increase in bit error rate, the Goodput of each algorithm decreases obviously. However, LDM-HEL still exhibits a better Goodput.
Figure 11. Comparison of Goodput with different protocols for LEO.

6. Conclusions

This paper proposes a packet loss differentiation method based on heterogeneous ensemble learning, aiming to address the high bit error characteristics of satellite networks and the inability of traditional TCP protocols to differentiate between bit error loss and congestion loss. Firstly, we summarize the packet features used in existing packet loss differentiation algorithms, including ROTT, IAT, and the number of consecutive packet losses. Relief and mutual information algorithms are employed for feature selection. Then, a heterogeneous ensemble consisting of LSSVM, DT, LR, and kNN is utilized. In order to enhance the generalization ability of the heterogeneous ensemble learning algorithm, an adaptive dynamic weighted ensemble method is introduced to adjust the voting weight of each individual classifier. Simulation results demonstrate that LDM-HEL achieves higher accuracy and significantly improves throughput performance when applied to TCP.
In the future, we will apply LDM-HEL to real network traffic data for further evaluation. LDM-HEL can also be applied to other wireless networks or wired/wireless hybrid networks to distinguish packet loss, but it needs to select suitable packet features and effective individual classifiers according to network characteristics. In addition to link errors and congestion, packet loss caused by routing is also a problem worth studying. Finally, LDM-HEL is a packet loss discrimination method, and the next step is to improve the accuracy of packet loss discrimination by combining cross-layer information.

Author Contributions

Conceptualization, D.W.; data curation, C.G.; formal analysis, L.Y.; investigation, Y.X.; methodology, D.W.; project administration, L.Y.; resources, C.G.; software, C.G.; supervision, L.Y.; validation, D.W.; visualization, Y.X.; writing—original draft, D.W.; writing—review and editing, D.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partly financially supported through grants from the National Natural Science Foundation of China (U21B2003).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Adinoyi, A.; Aljamae, M.; Aljlaoud, A. The future of broadband connectivity: Terrestrial networks vs. satellite constellations. Int. J. Commun. Netw. Syst. Sci. 2022, 5, 53–66. [Google Scholar] [CrossRef]
  2. Zong, L.; Wang, H.; Du, W.; Zhao, C.; Luo, G.F. Optimizing the end-to-end transmission scheme for hybrid satellite and multihop networks. Neural Comput. Appl. 2023, 35, 3063–3074. [Google Scholar] [CrossRef]
  3. Sun, Y.; Ji, Z.; Wang, H. TFRC-Satellite: A TFRC variant with a loss differentiation algorithm for satellite networks. IEEE Trans. Aerosp. Electron. Syst. 2013, 49, 716–725. [Google Scholar] [CrossRef]
  4. Biaz, S.; Vaidya, N.H. Discriminating congestion losses from wireless losses using inter-arrival times at the receiver. In Proceedings of the 1999 IEEE Symposium on Application-Specific Systems and Software Engineering and Technology (ASSET), Richardson, TX, USA, 24–27 March 1999. [Google Scholar]
  5. Cen, S.; Cosman, P.C.; Voelker, G.M. End-to-end differentiation of congestion and wireless losses. IEEE ACM Trans. Netw. 2003, 11, 703–717. [Google Scholar]
  6. Tobe, Y.; Tamura, Y.; Molano, A.; Ghosh, S.; Tokuda, H. Achieving moderate fairness for UDP flows by path-status classification. In Proceedings of the 25th Annual IEEE Conference on Local Computer Networks, Tampa, FL, USA, 8–10 November 2000. [Google Scholar]
  7. Mascolo, S.; Casetti, C.; Gerla, M.; Sanadidi, M.Y.; Wang, R. TCP westwood: Bandwidth estimation for enhanced transport over wireless links. In Proceedings of the 7th Annual International Conference on Mobile Computing and Networking, Rome, Italy, 16–21 July 2001. [Google Scholar]
  8. Zong, L.; Wang, H.; Bai, Y.; Luo, G. Cross-regional transmission control for satellite network-assisted vehicular ad hoc networks. IEEE Trans. Intell. Transp. Syst. 2022, 23, 9692–9701. [Google Scholar] [CrossRef]
  9. Du, H.P.; Zheng, Q.; Zhang, W.; Yan, J. A packet loss differentiation algorithm for 4G-LTE network. J. Comput. Res. Dev. 2015, 52, 2684–2694. [Google Scholar]
  10. Cai, Y.; Xiong, H.; Yu, S.; Chen, M.; Zhou, X. D-OLIA: The packet loss differentiation based opportunistic linked-increases algorithm for MPTCP in wireless heterogeneous network. In Proceedings of the 2021 31st International Telecommunication Networks and Applications Conference (ITNAC), Sydney, Australia, 24–26 November 2021. [Google Scholar]
  11. Li, N.; Zhu, Q.; Deng, Z. LDM-Satellite: A new scheme for packet loss classification over LEO satellite network. China Commun. 2022, 19, 207–215. [Google Scholar] [CrossRef]
  12. Chen, Y.; Yan, J.; Zhang, Y.; Hummel, K.A. Differentiating losses in wireless networks: A learning approach. In Proceedings of the IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), New York, NY, USA, 2–5 May 2022. [Google Scholar]
  13. Samaraweera, N. Non-congestion packet loss detection for TCP error recovery using wireless links. IEE Proc.-Commun. 1999, 146, 222–230. [Google Scholar] [CrossRef]
  14. Jeyasekar, A.; Kasmir Raja, S.V.; Uthra Annie, R. Congestion avoidance algorithm using ARIMA(2,1,1) model-based RTT estimation and RSS in heterogeneous wired-wireless networks. J. Netw. Comput. Appl. 2017, 93, 91–109. [Google Scholar]
  15. Ramadan, W.; Dedu, E.; Dhoutaut, D.; Bourgeois, J. RELD, RTT ECN loss differentiation to optimize the performance of transport protocols on wireless networks. Telecommun. Syst. 2013, 52, 1797–1817. [Google Scholar] [CrossRef]
  16. Seddik-Ghaleb, A.; Ghamri-Doudane, Y.; Senouci, S.M. TCP WELCOME: TCP variant for wireless environment, link losses, and congestion packet loss models. In Proceedings of the 2009 First International Communication Systems and Networks and Workshops, Bangalore, India, 5–10 January 2009. [Google Scholar]
  17. Sarkar, N.I.; Ho, P.H.; Gul, S.; Zabir, S.M.S. TCP-LoRaD: A loss recovery and differentiation algorithm for improving TCP performance over MANETs in noisy channels. Electronics 2022, 11, 1479. [Google Scholar] [CrossRef]
  18. Zhao, H.; Dong, Y.; Li, Y. An End-to-End loss discrimination scheme for multimedia transmission over wireless IP networks. In Proceedings of the International Conference on Ad Hoc Networks, Niagara Falls, ON, Canada, 22–25 September 2009. [Google Scholar]
  19. Truchly, P.; Sith, M.; Repka, R. End-to-end packet loss differentiation algorithms and their performance in heterogeneous networks. In Proceedings of the 17th International Conference on Emerging eLearning Technologies and Applications, Starý Smokovec, Slovakia, 21–22 November 2019. [Google Scholar]
  20. Liu, J.; Matta, I.; Crovella, M. End-to-End inference of loss nature in a hybrid wired/wireless environment. In Proceedings of the WiOpt’03: Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks, Sophia Antipolis, France, 3–5 March 2003. [Google Scholar]
  21. Fonseca, N.; Crovella, M. Bayesian packet loss detection for TCP. In Proceedings of the IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies, Miami, FL, USA, 13–17 March 2005. [Google Scholar]
  22. Chen, Y.; Lu, L.; Yu, X. Adaptive method for packet loss types in IoT: A naive Bayes distinguisher. Electronics 2019, 8, 134. [Google Scholar] [CrossRef]
  23. Arif, M.; Qamar, U.; Riaz, A. A machine learning-based approach for improving TCP congestion detection mechanism in IoTs. In Proceedings of the International Conference on Frontiers of Information Technology, Islamabad, Pakistan, 12–13 December 2022. [Google Scholar]
  24. Deng, Q.; Cai, A. SVM-based loss differentiation algorithm for wired-cum-wireless networks. J. China Univ. Posts Telecommun. 2009, 16, 104–111. [Google Scholar] [CrossRef]
  25. Murugan, S.; Veeramanikandan, V. Performance of Packet Loss Differentiation Model by using Cuckoo Search Back-Propagation Neural Network (CSBPNN) for Mobile Ad-hoc Networks. In Proceedings of the 2020 International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India, 26–28 February 2020. [Google Scholar]
  26. Molia, H.K.; Kothari, A.D. TCP-RLLD: TCP with reinforcement learning based loss differentiation for mobile ad hoc networks. Wirel. Netw. 2023, 29, 1937–1948. [Google Scholar] [CrossRef]
  27. Shi, J.R.; Wang, D.; Shang, F.H.; Zhang, H.Y. Research advances on stochastic gradient descent algorithms. ACTA Autom. Sin. 2021, 47, 2103–2119. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.