Gas Turbine Exhaust Gas Temperature Prediction Under Variable Operating Loads and IGV Positions Using Tree-Based Ensemble Learning
Abstract
1. Introduction
- A modeling approach representing real operating conditions was developed. Unlike conventional approaches based only on steady-state or full-load data, the present study used actual operational data obtained under variable load regimes and different IGV positions. Thus, the dynamic behavior of the GT was modeled in a more realistic manner.
- A large-scale dataset consisting of 18,334 operating hours was used in the study. This comprehensive data structure enabled the reliable representation of GT behavior under different operating conditions.
- The combined effects of the operational and environmental parameters affecting EGT were comprehensively evaluated. GTPO, IGV, CIT, FGF, LHV, AP, and RH were considered together within the same model, thereby revealing the mutual and nonlinear effects of these variables on EGT.
- Tree-based ensemble learning algorithms were implemented within a systematic and comparative framework for EGT prediction. Bagged Trees, Random Forest, Gradient Boosting, XGBoost, LightGBM, and CatBoost models were evaluated using the same dataset; thus, the model that performed best in EGT prediction was identified in detail.
- LightGBM and XGBoost exhibited higher accuracy, less errors, and more stable residual distributions compared to the other methods. The LightGBM model was identified as the most successful method for EGT prediction, with an R2 of 0.9703 and an RMSE of 1.5280.
- The dynamic prediction capability of the LightGBM model was additionally validated during SFC-based load ramp-up operation, demonstrating its ability to accurately follow the actual EGT trend under rapidly varying GTPO and IGV conditions.
- The proposed approach provides a reliable decision-support tool for performance monitoring, sensor validation, fault detection, and predictive maintenance applications in GT.
2. System Description
2.1. GT System
- GTPO: An operating parameter representing the amount of power produced by the GT at a given operating condition and measured in MW.
- FGF: An operational variable representing the flow rate of the fuel supplied to the GT and measured in Sm3/h.
- CIT: An inlet condition and operational parameter representing the temperature of the air entering the compressor. Its unit is °C.
- LHV: The amount of usable energy obtained from fuel combustion without considering the latent heat of condensation of water vapor. It is expressed in kcal/Sm3.
- IGV: An operating parameter representing the opening ratio of the inlet guide vanes. The IGV position is defined as a percentage (%).
- AP: An environmental variable representing atmospheric pressure. Its unit is mbar.
- RH: An environmental parameter representing the ratio of the amount of water vapor in the atmosphere to the maximum amount of moisture that air at the same temperature can contain. It is defined as a percentage (%).
- EGT: A performance indicator representing the temperature of the exhaust gases leaving the turbine. Its unit is °C.
2.2. GT Exhaust System
3. Tree-Based Ensemble Models
3.1. Bagging Methods
3.1.1. Bagged Trees
3.1.2. Random Forest
3.2. Boosting Methods
3.2.1. XGBoost
3.2.2. LightGBM
3.2.3. CatBoost
3.3. Model Optimization and Validation
3.3.1. k-Fold Cross-Validation
3.3.2. Grid Search
4. Model Evaluation Metrics
4.1. MAE
4.2. MSE
4.3. RMSE
4.4. R2
5. Results
5.1. Dataset Description
5.2. Analysis Results
6. Discussion
7. Conclusions
- Tree-based ensemble learning methods are highly effective in modeling the nonlinear relationship between GT operating parameters and EGT. Among the tested models, LightGBM demonstrated the best overall prediction performance, achieving the highest R2 value of 0.9703, together with RMSE and MSE values of 1.5280 and 2.3347, respectively.
- After hyperparameter optimization, all models were compared using the same dataset, the same training–testing split, and the same evaluation metrics. This approach enabled a fairer and more reliable performance assessment among the models.
- GTPO, CIT, FGF, and IGV position were identified as the dominant parameters determining EGT. The inclusion of IGV position in the model contributes to a more accurate representation of actual GT behavior, particularly under variable-load and part-load operating conditions.
- Although the effects of environmental and fuel-related variables such as AP, RH, and LHV are relatively lower, these variables still play a meaningful role in EGT prediction and should not be neglected.
- The SHAP-based interpretability analysis confirmed the dominant influence of GTPO, CIT, FGF, and IGV variables on model predictions and showed that the directions of their effects on EGT are consistent with the physical operating behavior of the GT.
- Using 18,334 h of real operating data, the proposed model successfully predicted EGT under both full-load and part-load conditions.
- The LightGBM model was tested under transient load-varying conditions in SFC mode and closely followed the actual EGT trend during the load increase from approximately 253 MW to 400 MW.
- The use of the model for sensor validation and fault detection will improve reliability in turbine operation. In addition, it can be used as an effective decision-support tool for online monitoring and predictive maintenance.
- The applicability of the study for online and real-time monitoring in modern CCPPs provides a contribution aligned with the digitalization trend in the energy sector. In this respect, the study has a structure that can be directly integrated not only into academic research but also into industrial decision-support systems.
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| AP | Atmospheric Pressure |
| CatBoost | Categorical Boosting |
| CCPP | Combined Cycle Power Plant |
| CIT | Compressor Inlet Temperature |
| DCS | Distributed Control System |
| EGT | Exhaust Gas Temperature |
| FGF | Fuel Gas Flow |
| GT | Gas Turbine |
| GTPO | Gas Turbine Power Output |
| HRSG | Heat Recovery Steam Generator |
| IGV | Inlet Guide Vanes |
| LHV | Lower Heating Value |
| LightGBM | Light Gradient Boosting Machine |
| MAE | Mean Absolute Error |
| ML | Machine Learning |
| MSE | Mean Square Error |
| R2 | Coefficient of Determination |
| RH | Relative Humidity |
| RMSE | Root Mean Square Error |
| SC | Simple Cycle |
| SFC | Secondary Frequency Control |
| ST | Steam Turbine |
| TIT | Turbine Inlet Temperature |
| VGV | Variable Guide Vane |
| XGBoost | eXtreme Gradient Boosting |
References
- Ali, A.; Houda, M.; Waqar, A.; Khan, M.B.; Deifalla, A.; Benjeddou, O. A review on application of hydrogen in gas turbines with intercooler adjustments. Results Eng. 2024, 22, 101979. [Google Scholar] [CrossRef]
- Wang, Z.T.; Zhao, N.B.; Wang, W.Y.; Tang, R.; Li, S.Y. A Fault Diagnosis Approach for Gas Turbine Exhaust Gas Temperature Based on Fuzzy C-Means Clustering and Support Vector Machine. Math. Probl. Eng. 2015, 2015, 240267. [Google Scholar] [CrossRef]
- Hong, C.W.; Kim, J. Exhaust temperature prediction for gas turbine performance estimation by using deep learning. J. Electr. Eng. Technol. 2023, 18, 3117–3125. [Google Scholar] [CrossRef]
- Liu, J.; Long, Z.; Bai, M.; Zhu, L.; Yu, D. A comparative study on fault detection methods for gas turbine combustion systems. Energies 2021, 14, 389. [Google Scholar] [CrossRef]
- Tenango-Pirín, O.; Espinosa, S.; García, J.C.; Rodríguez, J.A. Analysis of gas turbine combustor exhaust emissions: Effects of transient inlet air pressure. Energy Technol. 2024, 12, 2301093. [Google Scholar] [CrossRef]
- Parapa, H.B.P. The Impact of Changes in Exhaust Temperature on the Power Output and Heat Rate of a Gas Turbine with a Capacity of 238 MW. INTEK J. Penelit. 2021, 8, 96. [Google Scholar] [CrossRef]
- Motamed, M.A.; Genrup, M.; Nord, L.O. Part-load thermal efficiency enhancement in gas turbine combined cycles by exhaust gas recirculation. Appl. Therm. Eng. 2024, 244, 122716. [Google Scholar] [CrossRef]
- Lee, J.H.; Kang, D.W.; Jeong, J.H.; Kim, T.S. Quantification of variations in the compressor characteristics of power generation gas turbines at partial loads using actual operation data. J. Mech. Sci. Technol. 2023, 37, 1509–1521. [Google Scholar] [CrossRef]
- Talah, D.; Bentarzi, H. Ambient temperature effect on the performance of gas turbine in the combined cycle power plant. Alger. J. Environ. Sci. Technol. 2023, 9, 3079–3085. [Google Scholar]
- González-Díaz, A.; Alcaráz-Calderón, A.M.; González-Díaz, M.O.; Méndez-Aranda, Á.; Lucquiaud, M.; González-Santaló, J.M. Effect of the ambient conditions on gas turbine combined cycle power plants with post-combustion CO2 capture. Energy 2017, 134, 221–233. [Google Scholar] [CrossRef]
- Farahani, A.S.; Kohandel, H.; Moradtabrizi, H.; Khosravi, S.; Mohammadi, E.; Ramesh, A. Power generation gas turbine performance enhancement in hot ambient temperature conditions through axial compressor design optimization. Appl. Therm. Eng. 2024, 236, 121733. [Google Scholar] [CrossRef]
- Liu, J.; Zhu, L.; Ma, Y.; Liu, J.; Zhou, W.; Yu, D. Anomaly detection of hot components in gas turbine based on frequent pattern extraction. Sci. China Technol. Sci. 2018, 61, 567–586. [Google Scholar] [CrossRef]
- Talebi, S.S.; Tousi, A.M. The effects of compressor blade roughness on the steady state performance of micro-turbines. Appl. Therm. Eng. 2017, 115, 517–527. [Google Scholar] [CrossRef]
- Long, Z.; Zhou, Z.; Suo, P.; Yao, P.; Bai, M.; Liu, J.; Yu, D. Gas turbine circumferential temperature distribution model for the combustion system fault detection. Eng. Fail. Anal. 2024, 158, 108032. [Google Scholar] [CrossRef]
- Kong, J.; Yu, W.; Chen, J.; Zhang, H. A Novel Power Prediction Model Based on the Clustering Modification Method for a Heavy-Duty Gas Turbine. Appl. Sci. 2025, 15, 432. [Google Scholar] [CrossRef]
- Purba, O.; Zhultriza, F. Inlet Guide Vane Tracking Effectiveness at Various Compressor Efficiency of Gas Turbine. In IOP Conference Series: Materials Science and Engineering; IOP Publishing: Bristol, UK, 2021; Volume 1096, p. 012088. [Google Scholar]
- Apostolidis, A.; Bouriquet, N.; Stamoulis, K.P. AI-based exhaust gas temperature prediction for trustworthy safety-critical applications. Aerospace 2022, 9, 722. [Google Scholar] [CrossRef]
- Liu, Z.; Karimi, I.A. Gas turbine performance prediction via machine learning. Energy 2020, 192, 116627. [Google Scholar] [CrossRef]
- Park, Y.; Choi, M.; Kim, K.; Li, X.; Jung, C.; Na, S.; Choi, G. Prediction of operating characteristics for industrial gas turbine combustor using an optimized artificial neural network. Energy 2020, 213, 118769. [Google Scholar] [CrossRef]
- Liu, S.; Zhou, N.; Song, C.; Chen, G.; Wu, Y. Exhaust gas temperature prediction of aero-engine via enhanced scale-aware efficient transformer. Aerospace 2024, 11, 138. [Google Scholar] [CrossRef]
- Zhou, W. Aero-engine exhaust gas temperature prediction based on adaptive disturbance quantum-behaved particle swarm optimization. Adv. Mech. Eng. 2022, 14, 16878132221119044. [Google Scholar] [CrossRef]
- Stephnie, A.C.; Osoka, D.E.E. Development of a Dynamic Neural Network Model for Multistep ahead Prediction of Exhaust Gas Temperature in Heavy Duty Gas Turbines. Saudi J. Eng. Technol. 2022, 7, 53–61. [Google Scholar]
- Brusa, A.; Grossi, A.; Lenzi, M.; Shethia, F.P.; Cavina, N.; Kitsopanidis, I. Modeling of Exhaust Gas Temperature at the Turbine Outlet Using Neural Networks and a Physical Expansion Model. Energies 2025, 18, 1721. [Google Scholar] [CrossRef]
- Ma, S.; Wu, Y.; Zheng, H.; Gou, L. A hybrid of NARX and moving average structures for exhaust gas temperature prediction of gas turbine engines. Aerospace 2023, 10, 496. [Google Scholar] [CrossRef]
- Du, J.; Zhang, Y.; García, M.M.; Spencer, A. A Novel Dynamic Surge Modeling Framework for Gas Turbines: Integration of Compressor Variable Geometry. Machines 2026, 14, 327. [Google Scholar] [CrossRef]
- Faqihi, B.; Ghaith, F. A comprehensive review and evaluation of heat recovery methods from gas turbine exhaust systems. Int. J. Thermofluids 2023, 18, 100347. [Google Scholar] [CrossRef]
- Büyükköse, A.O.; Aslan, A.; Çoban, K. Thermal Efficiency Impacts of Structural and Environmental Variables in Combined Cycle Plants: A Machine Learning Approach to Relocation Scenario. Case Stud. Therm. Eng. 2025, 75, 107123. [Google Scholar] [CrossRef]
- Zhu, R.; Ren, J.X.; Li, F.Q.; Zhang, H.D.; Tang, Y. Thermal and stress field analysis in heavy-duty gas turbine exhaust system. Adv. Mater. Res. 2012, 516, 688–691. [Google Scholar] [CrossRef]
- Pakmehr, M.; Costa, J.; Lu, G.; Behbahani, A. Optical exhaust gas temperature (EGT) sensor and instrumentation for gas turbine engines. In Proceedings of the NATO STO Meeting: Transitioning Gas Turbine Instrumentation from Test Cells to On-Vehicle Applications STO-M P-AVT-306At, Athens, Greece, 3–7 December 2018. [Google Scholar]
- Von Moll, A.; Behbahani, A.R.; Fralick, G.C.; Wrbanek, J.D.; Hunter, G.W. A review of exhaust gas temperature sensing techniques for modern turbine engine controls. In Proceedings of the 50th AIAA/ASME/SAE/ASEE Joint Propulsion Conference, Cleveland, OH, USA, 28–30 July 2014; p. 3977. [Google Scholar]
- Dutz, F.J.; Boje, S.; Orth, U.; Koch, A.W.; Roths, J. High-temperature profile monitoring in gas turbine exhaust-gas diffusors with six-point fiber-optic sensor array. Int. J. Turbomach. Propuls. Power 2020, 5, 25. [Google Scholar] [CrossRef]
- Sepiolo, D.; Ligęza, A. Towards explainability of tree-based ensemble models. A critical overview. In International Conference on Dependability and Complex Systems; Springer International Publishing: Cham, Switzerland, 2022; pp. 287–296. [Google Scholar]
- Zhu, M.; Li, Z.; Zhao, J.; Liu, X.; Liu, Y. TERM: Tree Ensemble Models for Interpretable Rule Mining. In International Conference on Web Information Systems Engineering; Springer: Singapore, 2024; pp. 367–382. [Google Scholar]
- Ngo, G.; Beard, R.; Chandra, R. Evolutionary bagging for ensemble learning. Neurocomputing 2022, 510, 1–14. [Google Scholar] [CrossRef]
- Li, Y.; Li, L.; Fang, Y.; Peng, H.; Ling, N. Bagged tree and ResNet-based joint end-to-end fast CTU partition decision algorithm for video intra coding. Electronics 2022, 11, 1264. [Google Scholar] [CrossRef]
- Saeed, M.S.; Mustafa, M.W.; Sheikh, U.U.; Jumani, T.A.; Mirjat, N.H. Ensemble bagged tree based classification for reducing non-technical losses in multan electric power company of Pakistan. Electronics 2019, 8, 860. [Google Scholar] [CrossRef]
- Ruiz-Abellon, M.D.C.; Gabaldón, A.; Guillamón, A. Load forecasting for a campus university using ensemble methods based on regression trees. Energies 2018, 11, 2038. [Google Scholar] [CrossRef]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning: With Applications in R; Springer: New York, NY, USA, 2013; Volume 103. [Google Scholar]
- Gao, Y.; Zhao, J.; Han, L. Quantifying the nonlinear relationship between block morphology and the surrounding thermal environment using random forest method. Sustain. Cities Soc. 2023, 91, 104443. [Google Scholar] [CrossRef]
- Zhou, Z.; Qiu, C.; Zhang, Y. A comparative analysis of linear regression, neural networks and random forest regression for predicting air ozone employing soft sensor models. Sci. Rep. 2023, 13, 22420. [Google Scholar] [CrossRef] [PubMed]
- Hassan, M.A.; Khalil, A.; Kaseb, S.; Kassem, M.A. Exploring the potential of tree-based ensemble methods in solar radiation modeling. Appl. Energy 2017, 203, 897–916. [Google Scholar] [CrossRef]
- Asadi, B.; Hajj, R. Prediction of asphalt binder elastic recovery using tree-based ensemble bagging and boosting models. Constr. Build. Mater. 2024, 410, 134154. [Google Scholar] [CrossRef]
- Niazkar, M.; Menapace, A.; Brentan, B.; Piraei, R.; Jimenez, D.; Dhawan, P.; Righetti, M. Applications of XGBoost in water resources engineering: A systematic literature review (Dec 2018–May 2023). Environ. Model. Softw. 2024, 174, 105971. [Google Scholar] [CrossRef]
- Zhang, P.; Jia, Y.; Shang, Y. Research and application of XGBoost in imbalanced data. Int. J. Distrib. Sens. Netw. 2022, 18, 15501329221106935. [Google Scholar] [CrossRef]
- Chen, C.; Zhang, Q.; Ma, Q.; Yu, B. LightGBM-PPI: Predicting protein-protein interactions through LightGBM with multi-information fusion. Chemom. Intell. Lab. Syst. 2019, 191, 54–64. [Google Scholar] [CrossRef]
- Sinha, B.B.; Ahsan, M.; Dhanalakshmi, R. LightGBM empowered by whale optimization for thyroid disease detection. Int. J. Inf. Technol. 2023, 15, 2053–2062. [Google Scholar] [CrossRef]
- Huang, G.; Wu, L.; Ma, X.; Zhang, W.; Fan, J.; Yu, X.; Zhou, H. Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions. J. Hydrol. 2019, 574, 1029–1041. [Google Scholar] [CrossRef]
- Dasilas, A.; Rigani, A. Machine learning techniques in bankruptcy prediction: A systematic literature review. Expert Syst. Appl. 2024, 255, 124761. [Google Scholar] [CrossRef]
- Teodorescu, V.; Obreja Brașoveanu, L. Assessing the validity of k-fold cross-validation for model selection: Evidence from bankruptcy prediction using random forest and XGBoost. Computation 2025, 13, 127. [Google Scholar] [CrossRef]
- Shams, M.Y.; Elshewey, A.M.; El-Kenawy, E.S.M.; Ibrahim, A.; Talaat, F.M.; Tarek, Z. Water quality prediction using machine learning models based on grid search method. Multimed. Tools Appl. 2024, 83, 35307–35334. [Google Scholar] [CrossRef]
- Aygun, H.; Dursun, O.O.; Dönmez, K.; Sahin, O.; Toraman, S. Prediction of performance characteristics of an experimental micro turbojet engine using machine learning approaches. Energy 2024, 313, 133997. [Google Scholar] [CrossRef]
- Xu, M.; Qiu, Y.; Khandelwal, M.; Kadkhodaei, M.H.; Zhou, J. Optimizing Random Forest with Hybrid Swarm Intelligence Algorithms for Predicting Shear Bond Strength of Cable Bolts. Machines 2025, 13, 758. [Google Scholar] [CrossRef]
- Ellahi, M.; Usman, M.R.; Arif, W.; Usman, H.F.; Khan, W.A.; Satrya, G.B.; Daniel, K.; Shabbir, N. Forecasting of wind speed and power through FFNN and CFNN using HPSOBA and MHPSO-BAACs techniques. Electronics 2022, 11, 4193. [Google Scholar] [CrossRef]
















| GT Model/Series | Siemens SGT5-8000H Series |
|---|---|
| Capacity | 401 MW |
| Frequency | 50 Hz |
| Speed | 3000 rpm |
| Application | Industrial-scale power generation |
| Fuel type | Natural gas |
| Reference ambient temperature | 15 °C |
| Reference RH | 70% |
| Reference AP | 1009 mbar |
| Compressor configuration | 13 stages; 3 VGVs + 1 IGV |
| Turbine configuration | 4 stages |
| Model | Optimized Hyperparameters |
|---|---|
| Bagged Trees | n_estimators = 500, max_samples = 1.0, max_features = 1.0, estimator max_depth = None, estimator min_samples_leaf = 1 |
| Random Forest | n_estimators = 500, max_depth = None, max_features = None, min_samples_split = 2 |
| Gradient Boosting | n_estimators = 500, learning_rate = 0.1, max_depth = 6, subsample = 0.8 |
| XGBoost | n_estimators = 500, learning_rate = 0.1, max_depth = 8, subsample = 0.9, colsample_bytree = 0.9 |
| LightGBM | n_estimators = 1000, learning_rate = 0.1, num_leaves = 50, subsample = 0.8, colsample_bytree = 0.8, importance_type = “split” |
| CatBoost | iterations = 1000, learning_rate = 0.1, depth = 8, l2_leaf_reg = 1, border_count = 64 |
| Min | Max | |
|---|---|---|
| GTPO (MW) | 239.34 | 420.11 |
| IGV (%) | 39.58 | 100.00 |
| CIT (°C) | −2.77 | 33.30 |
| FGF (Sm3/h) | 67,197.51 | 106,933.75 |
| LHV (kcal/Sm3) | 8155.91 | 9090.35 |
| AP (mbar) | 980.88 | 1019.04 |
| RH (%) | 10.59 | 100.00 |
| Variable | Pearson (r) | Spearman (ρ) | Difference (Δ) | Relationship Strength and Type |
|---|---|---|---|---|
| GTPO | −0.87 | −0.94 | 0.07 | Very strong negative |
| FGF | −0.87 | −0.91 | 0.04 | Very strong negative |
| IGV | −0.71 | −0.62 | 0.09 | Strong negative |
| CIT | 0.57 | 0.56 | 0.01 | Moderate positive |
| AP | −0.36 | −0.34 | 0.02 | Weak negative |
| RH | −0.24 | −0.25 | 0.01 | Weak negative |
| LHV | 0.10 | 0.10 | 0.00 | Weak positive |
| Model Type | RMSE | MSE | R2 | MAE |
|---|---|---|---|---|
| Bagged Trees | 1.6410 | 2.6927 | 0.9657 | 1.0854 |
| Random Forest | 1.6421 | 2.6966 | 0.9657 | 1.0857 |
| Gradient Boosting | 1.6097 | 2.5911 | 0.9670 | 1.1161 |
| XGBoost | 1.5335 | 2.3516 | 0.9701 | 1.0318 |
| LightGBM | 1.5280 | 2.3347 | 0.9703 | 1.0319 |
| CatBoost | 1.5818 | 2.5021 | 0.9683 | 1.0647 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Aslan, A. Gas Turbine Exhaust Gas Temperature Prediction Under Variable Operating Loads and IGV Positions Using Tree-Based Ensemble Learning. Machines 2026, 14, 630. https://doi.org/10.3390/machines14060630
Aslan A. Gas Turbine Exhaust Gas Temperature Prediction Under Variable Operating Loads and IGV Positions Using Tree-Based Ensemble Learning. Machines. 2026; 14(6):630. https://doi.org/10.3390/machines14060630
Chicago/Turabian StyleAslan, Asiye. 2026. "Gas Turbine Exhaust Gas Temperature Prediction Under Variable Operating Loads and IGV Positions Using Tree-Based Ensemble Learning" Machines 14, no. 6: 630. https://doi.org/10.3390/machines14060630
APA StyleAslan, A. (2026). Gas Turbine Exhaust Gas Temperature Prediction Under Variable Operating Loads and IGV Positions Using Tree-Based Ensemble Learning. Machines, 14(6), 630. https://doi.org/10.3390/machines14060630

