A Deep Random Forest Model with Symmetry Analysis for Hyperspectral Image Data Classification Based on Feature Importance

Jie Lian; Wei Feng; Qing Wang; Yuhang Dong; Gabriel Dauphin; Jian Bai

doi:10.3390/sym17122172

,

and

¹

The College of Computer, National University of Defense Technology, Changsha 410073, China

²

Beijing Institution of Remote Sensing Equipment, Beijing 100039, China

³

School of Information Mechanics and Sensing Engineering, Xidian University, Xi’an 710071, China

⁴

Xi’an Key Laboratory of Advanced Remote Sensing, Xi’an 710071, China

Symmetry2025, 17(12), 2172;https://doi.org/10.3390/sym17122172

This article belongs to the Section Computer

Version Notes

Order Reprints

Abstract

Hyperspectral imagery (HSI), as a core data carrier in remote sensing, plays a crucial role in many fields. Still, it also faces numerous challenges, including the curse of dimensionality, noise interference, and small samples. These problems severely affect the generalization ability and classification accuracy of traditional machine learning and deep learning algorithms. Existing solutions suffer from bottlenecks such as unknown cost matrices and excessive computational overhead. And ensemble learning fails to fully exploit the deep semantic features and feature importance relationships of high-dimensional data. To address these issues, this paper proposes a dual ensemble classification framework (DRF-FI) based on feature importance analysis and a deep random forest. This method integrates feature selection and two-layer ensemble learning. First, it identifies discriminative spectral bands through feature importance quantification. Then, it constructs a balanced training subset through random oversampling. Finally, it integrates four different ensemble strategies. Experimental results on three benchmark hyperspectral datasets demonstrate that DRF-FI exhibits outstanding performance across multiple datasets, particularly excelling in handling highly imbalanced data. Compared to traditional random forests, the proposed method achieves stable improvements in both overall accuracy (OA) and average accuracy (AA). On specific datasets, OA and AA were enhanced by up to 0.84% and 1.24%, respectively. This provides an effective solution to the class imbalance problem in hyperspectral images.

Keywords:

hyperspectral image; feature importance; deep random forest; ensemble learning

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.