A Forest Fire Risk Prediction Framework Based on Machine Learning Models in the Greater Khingan
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Area
2.2. Materials
2.2.1. Historical Fire Point Datasets
2.2.2. Drivers of Forest Fires
2.3. Methods
2.3.1. Feature Selection Methods
Variance Inflation Factor
Recursive Feature Elimination
2.3.2. Model Construction
Logistic Regression Model
Random Forest Model
Support Vector Machine Model
2.3.3. Interpretation
2.3.4. Performance Evaluation Methods
Confusion Matrix Analysis
The Receiver Operating Characteristic Curve
3. Results
3.1. Analysis of Characteristic Factor Screening Results
3.2. Driving Factor Selection and Importance Analysis Based on RF and SVM
3.3. Accuracy Evaluation
3.4. Thematic Map of Forest Fire Risk Prediction
4. Discussion
4.1. Correlation Characteristics Between Different Driving Factors and Forest Fires
4.2. Comparative Analysis and Applicability of Different Models
4.3. Limitations and Future Perspectives
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Yuan, S.; Wang, M.; Shu, L.; Ma, Q.; Song, J.; Xiao, F.; Zhou, X.; Wang, J. Characteristics of Lightning Ignition and Spatial–Temporal Distributions Linked with Wildfires in the Greater Khingan Mountains. Fire 2025, 8, 474. [Google Scholar] [CrossRef]
- Wang, Y.; Liu, X.; Jin, H.; Zeng, X.; Zhang, X.; Kang, H.; Kang, S.; Li, Y.; Zhang, Q. Climate-induced permafrost degradation exerts species-specific impacts on pine and larch growth in the Da Xing’anling Mountains, Northeast China. Agric. For. Meteorol. 2025, 372, 110665. [Google Scholar] [CrossRef]
- Zhang, H.; Liu, X.; Ban, Q.; Shu, Y. Data source effects on forest fire prediction models in Northern China under multiple climate scenarios. Res. Sq. 2026, preprint (Version 1). [Google Scholar] [CrossRef]
- Liu, Y.; Ding, A. Contrasting trends of carbon emission from savanna and boreal forest fires during 1999–2022. Meteorol. Appl. 2024, 31, e2177. [Google Scholar] [CrossRef]
- Gao, C.; Shi, C.; Li, J.; Yuan, S.; Huang, X.; Zhang, Q.; Ma, Q.; Wu, G. Igniting lightning, wildfire occurrence, and precipitation in the boreal forest of northeast China. Agric. For. Meteorol. 2024, 354, 110081. [Google Scholar] [CrossRef]
- San-Miguel-Ayanz, J.; Moreno, J.M.; Camia, A. Analysis of large fires in European Mediterranean landscapes: Lessons learned and perspectives. For. Ecol. Manag. 2013, 294, 11–22. [Google Scholar] [CrossRef]
- Pausas, J.G.; Keeley, J.E. Wildfires and global change. Front. Ecol. Environ. 2021, 19, 387–395. [Google Scholar] [CrossRef]
- Rafaqat, W.; Iqbal, M.; Kanwal, R.; Song, W. Study of Driving Factors Using Machine Learning to Determine the Effect of Topography, Climate, and Fuel on Wildfire in Pakistan. Remote Sens. 2022, 14, 1918. [Google Scholar] [CrossRef]
- Guria, R.; Mishra, M.; Mohanta, S.; Paul, S. Forest fire probability zonation using dNBR and machine learning models: A case study at the Similipal Biosphere Reserve (SBR), Odisha, India. Environ. Sci. Pollut. Res. 2025, 32, 31375–31396. [Google Scholar] [CrossRef] [PubMed]
- Su, Z.; Hu, H.; Tigabu, M.; Wang, G.; Zeng, A.; Guo, F. Geographically weighted negative Binomial regression model predicts wildfire occurrence in the Great Xing’an Mountains better than negative Binomial Model. Forests 2019, 10, 377. [Google Scholar] [CrossRef]
- Partheepan, S.; Sanati, F.; Hassan, J. Modelling bushfire severity and predicting future trends in Australia using remote sensing and machine learning. Environ. Model. Softw. 2025, 188, 106377. [Google Scholar] [CrossRef]
- Rodrigues, M.; De la Riva, J. An insight into machine-learning algorithms to model human-caused wildfire occurrence. Environ. Model. Softw. 2014, 57, 192–201. [Google Scholar] [CrossRef]
- Jain, P.; Coogan, S.C.; Subramanian, S.G.; Crowley, M.; Taylor, S.; Flannigan, M.D. A review of machine learning applications in wildfire science and management. Environ. Rev. 2020, 28, 478–505. [Google Scholar] [CrossRef]
- Abohaia, Z.; Elkhouly, A.; Barachi, M.E.; Al-Khatib, O. Regional Prediction of Fire Characteristics Using Machine Learning in Australia. Fire 2025, 8, 330. [Google Scholar] [CrossRef]
- Wang, C.; Liu, H.; Xu, Y.; Zhang, F. A forest fire prediction framework based on multiple machine learning models. Forests 2025, 16, 329. [Google Scholar] [CrossRef]
- Sheriff, R.; Meer, M.S.; Aslam, R.W.; Said, Y. Machine learning-based forest fire susceptibility mapping using random forest and CART models. Rangel. Ecol. Manag. 2025, 102, 96–109. [Google Scholar] [CrossRef]
- Tian, X.; Shu, L.; Wang, M.; Zhao, F.; Chen, L. The fire danger and fire regime for the Daxing’anling region for 1987–2010. Procedia Eng. 2013, 62, 1023–1031. [Google Scholar] [CrossRef]
- Song, K.; Wang, Z.; Zhang, D.; Yu, S.; Zhang, T.; Wang, X.; Li, X.; Du, B. Elevational Differentiation in Earlywood and Latewood Density Responses of Pinus sylvestris var. mongolica to Climate in the Northern Greater Khingan Range. Forests 2026, 17, 99. [Google Scholar] [CrossRef]
- Rew, J.; Cho, Y.; Hwang, E. A Robust Prediction Model for Species Distribution Using Bagging Ensembles with Deep Neural Networks. Remote Sens. 2021, 13, 1495. [Google Scholar] [CrossRef]
- Gui, R.; Song, W.; Lv, J.; Lu, Y.; Liu, H.; Feng, T.; Linghu, S. Digital elevation model-driven river channel boundary monitoring using the natural breaks (Jenks) method. Remote Sens. 2025, 17, 1092. [Google Scholar] [CrossRef]
- Xuejiao, C.; Kaitong, Z.; Jiao, W.; Munan, W.; Ying, Q. Analysis of surface albedo responses to forest fires in the Great Xing’an Range, China. Remote Sens. Nat. Resour. 2025, 37, 212–219. [Google Scholar]
- Díaz-Vázquez, D.; Casillas-García, L.F.; Garcia-Gonzalez, A.; Montero, S.H.G.; Rubio, J.I.M.; Llamas, J.J.L.; Hernandez, M.S.G. Integrating Remote Sensing and machine learning for dynamic burn probability mapping in data-limited contexts. Remote Sens. Appl. Soc. Environ. 2025, 38, 101554. [Google Scholar] [CrossRef]
- Durlević, U.; Ilić, V.; Valjarević, A. Wildfire susceptibility mapping using deep learning and machine learning models based on multi-sensor satellite data fusion: A case study of Serbia. Fire 2025, 8, 407. [Google Scholar] [CrossRef]
- Breiman, L. Random forests, machine learning 45. Mach. Learn. 2001, 2, 199–228. [Google Scholar]
- Duan, J.; Hu, J.; Fu, Y.; Liu, Q.; Li, R.; Wang, Y. Estimation of fire counts and fire radiative power using satellite optical and microwave vegetation indices with random forest method. J. Geophys. Res. Atmos. 2025, 130, e2024JD041680. [Google Scholar] [CrossRef]
- Gigović, L.; Pourghasemi, H.R.; Drobnjak, S.; Bai, S. Testing a new ensemble model based on SVM and random forest in forest fire susceptibility assessment and its mapping in Serbia’s Tara National Park. Forests 2019, 10, 408. [Google Scholar] [CrossRef]
- Jalili, A.; Saleki, Z.; Luo, Y.; Pan, F.; Chen, A.X.; Draayer, J.P. Performance of various kernel functions for mass prediction with support vector machine. Eur. Phys. J. A 2025, 61, 143. [Google Scholar] [CrossRef]
- Singh, H.; Ang, L.-M.; Paudyal, D.; Acuna, M.; Srivastava, P.K.; Srivastava, S.K. A Comprehensive Review of Empirical and Dynamic Wildfire Simulators and Machine Learning Techniques used for the Prediction of Wildfire in Australia. Technol. Knowl. Learn. 2025, 30, 935–968. [Google Scholar] [CrossRef]
- Dai, S.; Zhang, J.; Huang, Z.; Zeng, S. Fire prediction and risk identification with interpretable machine learning. J. Forecast. 2025, 44, 1699–1715. [Google Scholar] [CrossRef]
- Bouzeraa, Y.; Bouchemal, N.; Djaaboub, S.; Hristov, G.; Zahariev, P. Machine Learning-Based Wildfire Susceptibility Mapping: A GIS-Integrated Predictive Framework. Appl. Sci. 2025, 15, 12188. [Google Scholar] [CrossRef]
- Fan, C.-L. Evaluation model for crack detection with deep learning: Improved confusion matrix based on linear features. J. Constr. Eng. Manag. 2025, 151, 04024210. [Google Scholar] [CrossRef]
- Chen, R.; Zhang, Y.; Li, Y.; Yebra, M.; Fan, C.; Zhang, H.; He, B. Probabilistic mapping of high-intensity forest fire potential via time series machine learning and remote sensing-informed fire spread simulations. Remote Sens. Environ. 2026, 334, 115233. [Google Scholar] [CrossRef]
- Ahmad, H.; Wu, Z.; Huang, H.; Muhammad, S.; Hayat, M.; Abbas, K.; Yang, X.; Shu, Z. A comparative evaluation of forest fire hazard vulnerability through geographic information system-based techniques. Front. For. Glob. Change 2025, 8, 1635041. [Google Scholar] [CrossRef]
- Smithson, M. The receiver operating characteristic area under the curve (or mean ridit) as an effect size. Psychol. Methods 2025, 30, 678. [Google Scholar] [CrossRef] [PubMed]
- Bhattarai, H.; Val Martin, M.; Sitch, S.; Yung, D.H.; Tai, A.P. Global patterns and drivers of climate-driven fires in a warming world. EGUsphere 2025, 2025, 1–28. [Google Scholar] [CrossRef]
- Gaboriau, D.M.; Remy, C.C.; Girardin, M.P.; Asselin, H.; Hély, C.; Bergeron, Y.; Ali, A.A. Temperature and fuel availability control fire size/severity in the boreal forest of central Northwest Territories, Canada. Quat. Sci. Rev. 2020, 250, 106697. [Google Scholar] [CrossRef]
- Lou, L.; Ma, W.; Cheng, P.; Liu, H.; Huang, Y. Climatic and Fuel Drivers of Lightning-Induced Forest Fire Burned Area in the Da Hinggan Ling Region, Northeast China. Remote Sens. 2026, 18, 657. [Google Scholar] [CrossRef]
- Pereira-Pires, J.E.; Aubard, V.; Ribeiro, R.A.; Fonseca, J.M.; Silva, J.M.; Mora, A. Fuel break vegetation monitoring with sentinel-2 ndvi robust to phenology and environmental conditions. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS; IEEE: Piscataway, NJ, USA, 2021; pp. 6264–6267. [Google Scholar]
- Anguita, D.; Ghio, A.; Greco, N.; Oneto, L.; Ridella, S. Model selection for support vector machines: Advantages and disadvantages of the machine learning theory. In Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN); IEEE: New York, NY, USA, 2010; pp. 1–8. [Google Scholar]
- Ramo, R.; Chuvieco, E. Developing a random forest algorithm for MODIS global burned area classification. Remote Sens. 2017, 9, 1193. [Google Scholar] [CrossRef]
- Pearson, R.K. Outliers in process modeling and identification. IEEE Trans. Control Syst. Technol. 2002, 10, 55–63. [Google Scholar] [CrossRef]







| Factor Category | Indicator Title (Abbreviation) | Resolution | Unit | Temporal Coverage | Data Source |
|---|---|---|---|---|---|
| Independent Variables | Elevation (DEM) | 30 m | m | 2000 | https://www.nasa.gov, accessed on 1 September 2025 |
| Slope | 30 m | ° | 2000 | https://www.nasa.gov, accessed on 1 September 2025 | |
| Aspect | 30 m | 2000 | https://www.nasa.gov, accessed on 1 September 2025 | ||
| NDVI | 1 km | 2001–2018 | https://www.earthdata.nasa.gov, accessed on 1 September 2025 | ||
| Precipitation (Prec) | 0.01° (~1 km) | mm | 2001–2018 | http://www.ncdc.ac.cn, accessed on 3 September 2025 | |
| Temperature (Tmp) | 0.01° (~1 km) | °C | 2001–2018 | http://www.ncdc.ac.cn, accessed on 3 September 2025 | |
| Pressure (Pres) | 0.01° (~1 km) | hPa | 2001–2018 | http://www.ncdc.ac.cn, accessed on 3 September 2025 | |
| Wind | 0.01° (~1 km) | m/s | 2001–2018 | http://www.ncdc.ac.cn, accessed on 3 September 2025 | |
| Relative Humidity (RH) | 0.01° (~1 km) | % | 2001–2018 | http://www.ncdc.ac.cn, accessed on 3 September 2025 | |
| Distance to Roads (DTR) | km | 2001–2018 | https://www.resdc.cn, accessed on 2 September 2025 | ||
| Distance to Residential Areas (DTRA) | km | 2001–2018 | https://www.resdc.cn, accessed on 2 September 2025 | ||
| Population (POP) | 1 km | persons/km2 | 2001–2018 | https://www.resdc.cn, accessed on 2 September 2025 | |
| GDP | 1 km | 104 yuan/km2 | 2001–2018 | https://www.resdc.cn, accessed on 2 September 2025 | |
| Dependent Variable | Fire point data | 1 km | Binary (0 = No fire, 1 = Fire) | 2001–2018 | https://www.nasa.gov, accessed on 1 September 2025 |
| Function Name | Formula | Meaning |
|---|---|---|
| Linear kernel function | ||
| Sigmoid kernel function | γ is the coefficient, r is the constant term | |
| Polynomial kernel function | γ is the coefficient, d is the degree, r is the constant term | |
| RBF kernel function | γ is the coefficient |
| Type | Explanation |
|---|---|
| True Positive (TP) | True fire point samples are correctly judged by the model |
| True Negative (TN) | True non-fire point samples are correctly judged by the model |
| False Positive (FP) | True fire point samples are misjudged by the model |
| False Negative (FN) | True non-fire point samples are misjudged by the model |
| VIF < 10 | |
|---|---|
| Feature factor | NDVI, GDP, Tmp, RH, Pres, DTR, DTRA, DEM, Slope |
| Variable Name | Estimation Coefficient | Standard Error | Chi-Square Value | Standardized Regression Coefficient |
|---|---|---|---|---|
| Intercept | 0.766 | 0.035 | 26.808 | −0.086 |
| NDVI | −0.489 | 0.374 | 45.235 | −1.464 |
| Tmp | 0.220 | 0.007 | 18.749 | 1.638 |
| Pres | 0.062 | 0.003 | 42.694 | 1.646 |
| DTR | 1.706 | 0.094 | 28.469 | 0.611 |
| DTRA | −3.522 | 0.577 | 37.194 | −0.193 |
| DEM | 0.005 | 0.0004 | 73.431 | 1.174 |
| Slope | −0.341 | 0.028 | 43.136 | −0.489 |
| Contribution Screening Results | |
|---|---|
| Characteristic factor | NDVI, GDP, POP, WIND, Tmp, RH, Pres, Prec, DTR, DTRA, DEM, Slope |
| Method | Confusion Matrix (TN/FP/FN/TP) | Accuracy | Precision | Recall | Specificity | F1-Score | AUC |
|---|---|---|---|---|---|---|---|
| Logistic | 943/429/283/1106 | 0.742 | 0.720 | 0.797 | 0.687 | 0.756 | 0.798 |
| RF | 1225/147/77/1312 | 0.919 | 0.899 | 0.945 | 0.893 | 0.921 | 0.966 |
| SVM | 1101/271/97/1292 | 0.867 | 0.827 | 0.930 | 0.802 | 0.875 | 0.929 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Li, H.; Zhang, J.; Yang, J.; Teng, C.; Luo, K.; Sun, K. A Forest Fire Risk Prediction Framework Based on Machine Learning Models in the Greater Khingan. Fire 2026, 9, 256. https://doi.org/10.3390/fire9060256
Li H, Zhang J, Yang J, Teng C, Luo K, Sun K. A Forest Fire Risk Prediction Framework Based on Machine Learning Models in the Greater Khingan. Fire. 2026; 9(6):256. https://doi.org/10.3390/fire9060256
Chicago/Turabian StyleLi, Heng, Jialong Zhang, Jingwen Yang, Chenkai Teng, Kai Luo, and Kaiping Sun. 2026. "A Forest Fire Risk Prediction Framework Based on Machine Learning Models in the Greater Khingan" Fire 9, no. 6: 256. https://doi.org/10.3390/fire9060256
APA StyleLi, H., Zhang, J., Yang, J., Teng, C., Luo, K., & Sun, K. (2026). A Forest Fire Risk Prediction Framework Based on Machine Learning Models in the Greater Khingan. Fire, 9(6), 256. https://doi.org/10.3390/fire9060256

