1. Introduction
The compound annual growth rate (CAGR) for passenger transport is envisaged to be 3.6% between 2019 and 2043, while that of cargo transport is estimated at 3.2% over the same period [
1]. Such growth is expected to result in elevated air traffic, fuel consumption, and noise pollution [
2]. The environmental impact of gas emissions resulting from fuel consumption is highly detrimental. Considering the environmental consequences associated with fuel use, aviation authorities, airlines, aircraft manufacturers, and engine manufacturers have undertaken a series of key initiatives to address this concern [
3]. Aircraft fuel consumption is directly affected by dynamic equations governing flight performance, such as aerodynamic drag and thrust forces, engine performance, and flight regime. As a result, these equations constitute key parameters that influence optimal fuel usage. Since aircraft dynamics demonstrate nonlinear and underactuated characteristics, the related control problems become increasingly complex, thereby drawing significant attention in both academic research and industrial applications [
4]. Therefore, the stability, performance quality and robustness of flight control systems against disturbances and noise during operation are of critical importance. Recently, the field of aviation has made significant advancements by emphasizing the development of performance and control strategies. When considering the nonlinear dynamic behavior of aircraft, extensive research has been carried out to design control systems that ensure accurate trajectory tracking, asymptotic stability, and convergence. Aircraft control systems have been continuously evolving to meet the specific requirements of various operational environments and have been adapted accordingly by means of tailored solutions [
5].
Several studies have been carried out in the literature about the modeling of aircraft and the design of flight control. When comparing various control methods, it can be realized that the Proportional–Integral–Derivative (PID) [
6], Sliding Mode Control [
7], Backstepping [
8], Finite-Time Lyapunov Stability-based Control [
9], Passivity-based Control [
10], and Model Predictive Control (MPC) [
11] are most commonly employed. Moreover, the PID method has gained widespread attention from both industry and researchers owing to its basic structure and facility of application [
12]. Conventional flight control systems often employ PID controllers since these are simple and easy to integrate into hardware across diverse applications. Nevertheless, in realistic flight environments, PID controllers may not perform adequately under conditions that involve uncertainties and external disturbances, primarily due to the necessity of precisely tuning the controller coefficients. These effective control parameters are often determined by means of a trial-and-error process, which can be time-consuming and prone to inaccuracies. To tackle the challenges associated with manual tuning, researchers have developed various automated parameter optimization techniques, including modern metaheuristic and swarm intelligence-based algorithms, as well as reinforcement learning approaches [
13].
Metaheuristic algorithms have ensured effective solutions for several flight control problems, such as navigation, path planning, obstacle avoidance, fault-tolerant control, and flight stabilization. It could be specified that the most used metaheuristic algorithms for flight control optimization are Genetic Algorithms (GA), Harris Hawks Optimization (HHO) [
14], Particle Swarm Optimization (PSO) [
15], Grey Wolf Optimizer (GWO), and Artificial Bee Colony (ABC) optimization [
16,
17]. These algorithms are employed to optimize the control parameters of controllers since they have a key role in achieving stable and efficient flight performance. Numerous studies have demonstrated that these algorithms yield highly effective results in tuning controller parameters for flight control applications, which leads to significant improvements in system stability, response time, and trajectory tracking accuracy [
18].
Flight simulation software has emerged as a viable alternative to real flight testing in areas such as aircraft design, flight data generation, pilot training, and autopilot development. Flight simulators are preferred due to factors such as time loss, high cost, and potential safety risks related to real flight tests. Recent advancements in this field have enabled highly accurate simulations. These simulators enable opportunities to analyze the dynamic behavior of aircraft, introduce disturbances, observe parameter uncertainties, and visualize an aircraft’s three-dimensional motion [
19]. In the literature, flight simulators such as FlightGear, Microsoft Flight Simulator, and X-Plane are frequently utilized in autopilot development studies [
20]. Particularly, X-Plane becomes prominent for its flexibility and advanced capabilities due to offering users the ability to design custom aircraft tailored to specific needs. Additionally, the flight simulator certified by the Federal Aviation Administration (FAA) enhances the reliability of its simulation results. As a result, many companies such as Cessna, Cirrus, and Boeing, along with numerous researchers, prefer X-Plane for developing and testing control strategies [
21].
Thanks to data analysis, predictive methods are employed to develop equation-based models that obtain the relationships between variables within a system. For this purpose, techniques such as regression analysis, polynomial regression, support vector machines (SVMs), and decision trees have been commonly used in the literature to construct sets of equations, thereby transforming the behavior of the system into a mathematical model. However, solving these equations manually is often challenging and time-consuming. While various optimization methods have been proposed in the literature to solve such equations, they often find local rather than global optima. To tackle this limitation, recent studies have employed metaheuristic algorithms to solve these equations and achieve globally optimal solutions.
In this study, an attempt is made to minimize the fuel consumption of a Boeing 737-800 aircraft. To achieve this, Software-in-the-Loop (SIL) simulations are carried out by data exchange via UDP between MATLAB/Simulink and the X-Plane flight simulator. A PID speed controller is designed to maintain a determined altitude for the B737-800. A total of 1032 SIL simulations are performed by altering parameters such as altitude, speed, weight, wind speed and direction, as well as PID controller coefficients. These simulations are used to establish a dataset incorporating fuel consumption data belonging to the B737-800 aircraft. The collected data are modeled by applying artificial intelligence, metaheuristic algorithms and statistical methods to develop a fuel consumption equation based on the identified input variables. Based on these equations, the Decision Tree–Robust Multiple Regression–Harris Hawks Optimization Algorithm (DRHA), which is used for the first time in the literature, is proposed to determine the optimal PID controller coefficients.
1.1. Problem Statement
PID control is one of the main pillars of modern flight systems. To provide the stability and precision of aero vehicles such as aircraft and drones, it is commonly employed. Moreover, this algorithm can find deviations during flight and perform real-time corrections according to observational data. The accurate tuning of PID parameters is of high importance for achieving stable flight performance and ensuring precise responses to control commands for aero vehicles. Meteorological factors and flight dynamics have a significant impact on PID control. Since sudden changes in wind speed and direction can affect the stability of the aero vehicle, a PID control algorithm should be more sensitive and adaptive against these perturbations. Similarly, factors involving air density, temperature, and humidity can complicate the prediction of aerodynamic loads during flight. To overcome such changing conditions, it is essential to adaptively optimize PID parameters.
1.2. The Requirements of PID Optimization
The optimization of PID parameters is of high significance for flight safety and efficiency. Namely, inconveniently tuned PID parameters can result in excessive oscillations of control surfaces, delayed system responses, or even loss of control. Therefore, optimization of PID parameters benefits from maximizing flight performance and tailoring the control algorithm to the specific design characteristics of the aero vehicle. Moreover, the optimization of PID parameters plays a critical role not only in ensuring stability and precision but also in enhancing energy efficiency. A properly optimized control system minimizes power demand from the engines, thereby reducing energy consumption and extending the range of the aircraft. Consequently, the optimization of PID parameters is an integral component of flight systems in terms of both performance and cost-effectiveness.
1.3. The Scope of the Study
The constant changes in aerodynamic forces, environmental disturbances, and operational conditions during the fundamental flight phases, such as takeoff, climb, cruising, and landing, make it difficult for PID-based control systems to achieve consistent performance with fixed parameters in all situations. Based on this requirement, this study proposes a novel optimization framework that can maintain stable speed control while reducing fuel consumption, particularly during the cruising phase, for a Boeing 737-800 aircraft. In this study, PID parameters are considered not only in terms of classical control performance but also as decision variables affecting fuel consumption. For this purpose, aircraft weight, altitude, wind speed, wind direction, flight speed, and PID coefficients are evaluated together; thus, the control problem is structured as a single-objective optimization problem based on fuel minimization under realistic flight conditions.
The fundamental originality of the proposed approach lies in addressing the fuel minimization problem within a two-stage and integrated structure. In the first stage, a data-driven model representing fuel consumption dependent on flight and control variables was established by harnessing data obtained from Software-in-the-Loop simulations performed in the MATLAB/Simulink X-Plane environment. In the second stage, this model was used directly as an objective function, and the PID coefficients and speed values that ensure minimum fuel consumption were determined using an optimization algorithm. Thus, this study differs from studies in the literature that focus solely on PID tuning or only on fuel consumption estimation, offering a unique approach that combines simulation-based data generation, data-driven fuel modeling, and optimization-based control parameter determination processes in a single hybrid framework. In this context, the proposed method presents a systematic and feasible decision support structure that considers both fuel efficiency and control applicability for a Boeing 737-800.
3. Methods
In this study, the objective is to reduce the fuel consumption of a B737-800 aircraft flying at different flight conditions. To achieve this goal, a PID speed controller is designed. The equation of the speed controller is presented in Equation (1).
where
e(t) represents the difference between the reference speed and the actual speed.
The methodology section of this article consists of two main stages. In the first stage, methods used to develop a fuel consumption model based on the simulation data of the B737-800 aircraft are presented. This model is constructed as a function of the variables “KP”, “KI”, “KD”, “Weight”, “Wind Direction”, “Wind Speed”, “Aircraft Speed”, and “Altitude.” In the second stage, the methods for determining the controller parameters that yield minimum fuel consumption based on the developed model are described.
3.1. Modeling of the Fuel Consumption Equation
Machine learning, statistical methods, and a reinforced hybrid metaheuristic algorithm are employed to develop the fuel consumption equations. These methods are analyzed comparatively, and the equation yielding the best prediction is selected for the optimization stage. The ultimate goal of this study is to find the decision variables that minimize the fuel consumption function . In the first stage, a surrogate fuel consumption model of the form is obtained from the simulation data. In the second stage, this surrogate model is used as the objective function, and the decision variables and speed values are optimized to give the minimum fuel consumption under the relevant flight conditions.
3.2. Statistical Approaches
In this study, eight different statistical methods are employed for obtainment and comparison of prediction equations based on relevant variables. Multiple regression analysis is preferred for analyzing the effects of multiple independent variables on a dependent variable. There are several advantages of the method, such as the facility of application and interpretation. Additionally, the effect of each variable on the model can be clearly observed [
24]. Robust regression is included in the study as it reduces the influence of outliers, provides more reliable predictions, and prevents model degradation when outliers are present in the data [
25]. Polynomial regression is utilized in the analyses due to its ability to capture nonlinear patterns in the data, produce models that better fit the data as the degree increases, and approximate linear models by generating first-degree equations [
26,
27,
28]. Non-parametric regression is employed for its flexibility in making predictions without requiring model assumptions and for its capacity to produce models well-adapted to the data structure [
29]. On the other hand, ridge regression is selected as another method to address multicollinearity issues, improve model generalizability by shrinking coefficients, and prevent overfitting [
30]. Additionally, methods for variable selection are applied to identify significant variables and simplify models, enhancing model interpretability [
31]. A summary of these methods is provided in
Table 2.
3.3. Machine Learning Algorithms
Another method considered in the development of the fuel consumption equation in this study is artificial intelligence. Machine learning algorithms such as XGBoost, Random Forest, SVM, and K-Nearest Neighbor are utilized for generating prediction equations and compared with other methods. These techniques are selected in this study due to their distinct features and are employed as tools to achieve an optimal fuel consumption model. Among the machine learning algorithms used, XGBoost (Extreme Gradient Boosting) is chosen for its high prediction accuracy on large and complex datasets and its ability to prevent overfitting [
32]. The Random Forest algorithm is employed due to its capacity to combine multiple decision trees and to calculate the importance of each feature [
33]. A Support Vector Machine (SVM) is utilized for developing fuel consumption equations because it produces highly accurate models for nonlinear data, performs efficiently on high-dimensional datasets, and has mechanisms to prevent overfitting [
34]. K-Nearest Neighbor (KNN) is also adopted in the study for its ease of implementation, flexibility, and lack of assumptions about the underlying model [
35].
3.4. Decision Tree–Robust Multiple Regression–Harris Hawks Optimization Algorithm (DRHA)
In this study, three well-known methods are combined to propose a novel hybrid approach, used for the first time in the literature. The dataset is divided into small relational clusters utilizing decision trees. A general fuel consumption equation is developed through robust regression, and the error/deviation values of the fuel consumption equation for each cluster are optimized by employing the Harris Hawks Optimization algorithm. As a result, optimal fuel consumption equations are generated for each cluster. Using the same hybrid structure, PID controller parameters for each aircraft weight, wind speed, wind direction, and altitude value are optimized to minimize fuel consumption.
In the first stage of the method, the dataset was divided into more homogeneous sub-regions by harnessing a decision tree algorithm in order to more accurately represent the heterogeneous fuel consumption behavior observed in different flight regimes. The aim of this stage is not to establish completely independent regression models for each cluster, but to add cluster-specific correction terms to the global robust regression model. In this context, the initial dataset was defined as follows:
Here,
represents the input vector (Kp, Ki, Kd, weight, wind speed, wind direction, velocity, and altitude) and
represents the corresponding fuel consumption value. Based on this information, the dataset is divided into C discrete subsets corresponding to leaf nodes using the decision tree algorithm:
Each cluster represents a relational subregion defined by the regional decision boundaries formed by the decision tree. The splitting process was performed to minimize intra-node variance (mean-squared error criterion), thus ensuring a more homogeneous data distribution within each cluster.
In the first stage, a global fuel consumption model was obtained by utilizing the Robust Multiple Regression method based on the Huber loss function:
In Equation (12), represents the deviation/correction term for the relevant set.
When a new input vector arrives, this data is assigned to the appropriate cluster via the decision tree, and the prediction is made using the robust regression equation along with the error term of the relevant cluster.
The cluster-specific error terms
were optimized using the Harris Hawks Optimization (HHO) algorithm. This optimization process minimizes the following error function:
This structure allows for the preservation of global model stability without forming completely independent local regression models; however, prediction accuracy is improved by correcting for regional behavioral differences. The aim is to reduce biases arising from data heterogeneity and achieve higher R2 performance with lower MAE and MSE values.
In summary, the proposed DRHA framework consists of two phases. In the first phase, a surrogate fuel consumption model is established by employing robust multiple regression and cluster-specific error correction terms optimized with HHO, based on data obtained from the SIL dataset. In the second phase, this surrogate model is used directly as the objective function to determine the optimal decision variables (PID gains and speed value) that minimize fuel consumption under the given flight and meteorological conditions. Furthermore, a perturbation mechanism is integrated into the search process in the second phase to prevent early convergence and to prevent the algorithm from getting stuck in local optimum solutions. In this mechanism, if no improvement is observed after a certain number of iterations, the available best solution is perturbed in a controlled manner, thus redirecting the search process to different regions to avoid local optimums and improve solution quality. A flowchart of the method is shown in
Figure 5, and the related algorithm is presented in Algorithm 1.
| Algorithm 1: HHO Algorithm with perturbation mechanism [36,37]. |
|
| |
| |
|
| | |
| | |
| | |
| | | |
| | | |
| | | |
| | | | (14) |
| | | | |
| | | | |
| | | | | (15) |
| | | | | (16) |
| | | | |
| | | | | |
| | | | | |
| | | | | | (17) |
| | | | | | (18) |
| | | | | |
| | | | | |
| | | | | | (19) |
| | | | | |
| | | | | |
| | | | | Equations (20)–(23) |
| | | | | | (20) |
| | | | | | (21) |
| | | | | | (22) |
| | | | | | (23) |
| | | | | |
| | | | | |
| | | | | (24)–(26). |
| | | | | | (24) |
| | | | | | (25) |
| | | | | | (26) |
| | . |
In the application section of the study, a fuel consumption prediction is performed by using 1032 flight data samples obtained from X-Plane. Of the total data, 80% is allocated for training and 20% for testing. In this regard, a computer equipped with 16 GB of RAM, an AMD Ryzen 5 5600H processor, and Radeon Graphics operating at 3.30 GHz is employed. A hybrid algorithm combining the regression and machine learning methods described in the aforementioned
Section 3 is employed for the prediction task. Details of the application are illustrated in
Figure 6.
3.5. Performance Metrics of Regression Analysis
In order to assess the accuracy of both modeling methods, two distinct error metrics are measured: mean square error (MSE) and mean absolute error (MAE). These metrics help quantify the deviation between actual observations and model predictions. Additionally, the coefficient of determination (R
2) is determined for each model to evaluate goodness-of-fit, with values ideally nearing one [
38,
39]. The related formulas are provided in Equations (27)–(29)
where
,
and
denote real, predicted and mean values, respectively.
4. Result and Findings
In this study, the main goal is to minimize fuel consumption by controlling the speed of a Boeing 737-800 aircraft at different altitudes, aircraft weights, wind speeds and directions. The simulation studies are performed in MATLAB/Simulink, and the results are validated by using the X-Plane 11 flight simulator. For the simulation studies, parameters such as altitude, speed, aircraft weight, wind speed, and wind direction are altered along with the PID controller coefficients. The fuel consumption values of the Boeing 737-800, performing the cruise phase at different altitudes, are obtained from the X-Plane flight simulator. Each simulation is run for 100 s, resulting in a dataset comprising 1032 samples. Based on this data, the controller coefficients are optimized by employing a hybrid method (DRHA) that integrates the HHO metaheuristic algorithm and Robust Multiple Regression with artificial intelligence-based prediction models. Moreover, the optimized PID controller coefficients obtained from this process are updated, which demonstrates improvements in fuel consumption compared to manually tuned PID controller coefficients. The PID controller coefficient values obtained via optimization, along with the manual PID controller coefficients, are further tested in simulations extended to 600 s, and the results are compared. In the simulations, the initial fuel quantity is set to 8700 pounds. The simulations are carried out at altitudes of 32,000, 34,000, 36,000, and 38,000 feet and total aircraft weights of 100,000, 120,000, 140,000, and 160,000 pounds under both windy and calm conditions. In addition, the application studies regarding regression models were carried out by utilizing the NCSS 2025 (v25.0.1) program. The equations were obtained from the outputs of this program. The obtained results are presented in the figures as follows.
Among the evaluated methods, DRHA is found to yield the best results and is subsequently utilized in the second stage to determine the optimal PID parameters. Furthermore, the optimization process is carried out for 188 different combinations of altitude, aircraft weight, wind speed, and wind direction. The results are then validated by comparing them with simulation outcomes obtained from X-Plane, thereby testing the accuracy and effectiveness of the method. The related analysis results are presented in
Table 3.
When
Table 3 is examined, it is observed that the best Regression Model is “Robust Multiple Regression”, and this statistical model is used within the DRHA Algorithm. The results obtained according to this hybrid model are shown in the DRHA row of
Table 3. According to the results in
Table 3, the best fuel consumption model was generated using the DRHA algorithm. The generated prediction model is shown in Equation (30), as follows. The coefficients of Equation (30) were estimated by employing a Huber-loss-based Robust Multiple Regression approach rather than ordinary least squares.
In the next stage, a mathematical model was established utilizing this equation and optimized using the DRHA method. The basic mathematical structure of the model is shown below in Equation (31).
Based on the information above, details regarding the application of the two-stage solution method and the resulting gains are shown in
Figure 7.
The application is tested under different conditions, harnessing the solution architecture shown in
Figure 7, and the results obtained are shown comparatively in the graphs below.
The simulation results for an altitude of 32,000-feet are presented in
Figure 8. As can be seen from the figure, after 600 s of simulation, when compared to flight results with real-time X-Plane, fuel consumption is reduced by 0.754% for an aircraft weight of 100,000-pounds, 5.33% for 120,000-pounds, 0.527% for 140,000-pounds, and 0.251% for 160,000-pounds, thanks to the DRHA approach.
The simulation results for an altitude of 34,000-feet are presented in
Figure 9. As illustrated in the figure, after 600 s of simulation, fuel consumption is reduced by 0.99% for an aircraft weight of 100,000-pounds, 2.035% for 120,000-pounds, 1.030% for 140,000-pounds, and 0.731% for 160,000-pounds.
The simulation results for an altitude of 36,000-feet are presented in
Figure 10. As shown in the figure, after 600 s of simulation, fuel consumption is reduced by 0.071% for an aircraft weight of 100,000-pounds, 1.320% for 120,000-pounds, 1.276% for 140,000-pounds, and 2.388% for 160,000-pounds.
The simulation results for an altitude of 38,000-feet are presented in
Figure 11. As shown in the figure, after 600 s of simulation, fuel consumption is reduced by 0.920% for an aircraft weight of 100,000-pounds, 0.799% for 120,000-pounds, 2.852% for 140,000-pounds, and 0.296% for 160,000-pounds.
The simulation results of fuel consumption at different altitudes for an aircraft with a total weight of 100,000-pounds are presented in
Figure 12. In this study, the aircraft is tested under the influence of a 40-knot wind speed at a 0-degree wind direction. As shown in the figure, after 600 s of simulation, fuel consumption is reduced by 0.103% at an altitude of 32,000-feet, 1.234% at 34,000-feet, 0.714% at 36,000-feet, and 0.461% at 38,000-feet.
The test results under the influence of a 40-knot wind speed at a 90-degree wind direction on the aircraft are presented in
Figure 13. In the simulation studies, the aircraft weight is set to 100,000-pounds. As shown in the figure, after a 600 s simulation, fuel consumption is reduced by 0.620% at 32,000-feet, 2.811% at 34,000-feet, 0.685% at 36,000-feet, and 1.052% at a 38,000-feet altitude.
The test results under the influence of a 40-knot wind speed at a 180-degree wind direction on the aircraft are presented in
Figure 14. In the simulation studies, the aircraft weight is set to 100,000-pounds. As shown in the figure, after a 600 s simulation, fuel consumption is reduced by 0.684% at 32,000-feet, 0.586% at 34,000-feet, 0.579% at 36,000-feet, and 0.676% at a 38,000-feet altitude.
Based on the application results presented in
Figure 15, it could be observed that the DRHA algorithm successfully achieves minimum fuel consumption across different aircraft weights, altitudes, wind speeds, and wind directions. In the figure, the X-Plane simulation result demonstrates the best values (minimum fuel consumption) obtained for the relevant input variables as a result of real-time simulation performed on X-Plane. The optimal fuel consumption values obtained as a result of the DRHA algorithm are expressed in the “Optimal DRHA Result” column.
To evaluate the consistency of the DRHA-based optimum solution obtained in this study, the PID controller coefficients and velocity values found in the optimization phase were reapplied to the MATLAB/Simulink–X-Plane closed-loop Software-in-the-Loop (SIL) simulation environment. Thus, the results predicted by the developed fuel consumption equation were compared with the simulation results obtained again in the X-Plane environment under the same flight conditions. Therefore, the process performed here should not be considered an independent external validation with real flight data, but rather as a simulator-based validation/confirmation process based on retesting the solution found as a result of optimization in a high-accuracy original simulation environment.
When the results in
Figure 16 and
Table 4 are examined together, it is seen that there is a low level of error and deviation between the optimum results obtained with the DRHA algorithm and the validation results obtained again in the X-Plane SIL environment. The differences between the two results may stem from approximation errors in the developed fuel consumption equation, the nonlinear behavior of the system, and the fact that some dynamic effects represented in the X-Plane environment could not be fully reflected by the regression-based model. However, the low MAE, MSE, and percentage deviation values obtained indicate that the proposed method largely preserves the optimum solution not only in the mathematical model but also in the original high-accuracy simulation environment in which the data was generated. This finding demonstrates that the proposed approach can provide both computational efficiency and simulation consistency.
Lastly, when evaluating the algorithm in terms of solution time, it was determined that optimal values were obtained in less than 2 min in all scenarios.
To statistically analyze the proximity of the developed fuel consumption equation to real systems, the “Optimal DRHA Result” and “Validation of Optimal DRHA in Xplane” results were compared, and the obtained results are presented in
Table 4.
According to the results in
Table 4, when the optimal fuel consumption results obtained from the DRHA algorithm are compared with the data obtained from the actual system (X-Plane), low levels of error and deviation were detected between the two methods. The mean absolute error (MAE) ranged from “1.29” to “1.51”, while the mean square error (MSE) remained between “2.30” and “2.84”. The errors at this level could be assessed as relatively low and indicate a high degree of similarity between the modelled system and the actual data. This finding demonstrates that the results obtained from the prediction equation can be reliably used in practice.