Hyperspectral Band Selection for Ground Fuel Classification for Prescribed Fires

Karankot, Mahmad Isaq; Glenn, Ethan M.; Masood, Muhammad Umer; Zhou, Xiaobing; Whitaker, Bradley M.

doi:10.3390/rs18091440

Open AccessArticle

Hyperspectral Band Selection for Ground Fuel Classification for Prescribed Fires

by

Mahmad Isaq Karankot

^1,*

,

Ethan M. Glenn

¹,

Muhammad Umer Masood

²,

Xiaobing Zhou

²

and

Bradley M. Whitaker

¹

Electrical and Computer Engineering, Montana State University, Bozeman, MT 59717, USA

²

Geological Engineering Department, Montana Technological University, Butte, MT 59701, USA

^*

Author to whom correspondence should be addressed.

Remote Sens. 2026, 18(9), 1440; https://doi.org/10.3390/rs18091440

Submission received: 28 February 2026 / Revised: 23 April 2026 / Accepted: 28 April 2026 / Published: 6 May 2026

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

The comparative evaluation of five hyperspectral band-selection methods–PCA, SSEP, SRPA, and DRL and a clustering based baseline (K-Means Clustering-Based Band Selection: KMCBS)–for ground fuel classification in prescribed-fire environments.
Selected spectral bands enable machine learning and deep learning models (RF, SVM, KNN, and 3D-CNN) to achieve competitive classification accuracy across benchmark datasets and a UAV-based VNIR hyperspectral dataset collected after prescribed burns.

What are the implications of the main finding?

Effective band selection reduces hyperspectral dimensionality and improves computational efficiency for large UAV hyperspectral datasets.
The framework supports the scalable AI-driven monitoring of ground fuels and post-fire vegetation conditions using hyperspectral remote sensing.

Abstract

Hyperspectral image (HSI) analysis plays a central role in remote sensing tasks requiring fine-grained material discrimination, vegetation health assessment, and post-disturbance monitoring. Yet, the high dimensionality and strong spectral redundancy in HSIs often reduce the efficiency and reliability of machine learning models. These challenges are especially important in wildfire science and prescribed-fire monitoring, where spectral responses vary due to burn severity, char deposition, canopy structure, and early vegetation recovery. Benchmark datasets such as Indian Pines and Pavia University and others provide controlled environments for algorithms’ evaluation, but real-world post-fire forest conditions pose additional complexity. This study presents a unified and comprehensive evaluation of five dimensionality reduction strategies: Principal Component Analysis (PCA), Spatial–Spectral Edge Preservation (SSEP), Spectral-Redundancy Penalized Attention (SRPA), and a Deep Reinforcement Learning (DRL)-based selector together with a clustering based baseline, K-Means Clustering-Based Band Selection (KMCBS). These strategies are combined with classical machine learning and deep learning classifiers: Random Forest (RF), Support Vector Machines (SVMs), K-Nearest Neighbors (KNNs), and 3D Convolutional Neural Networks (3D-CNN). The full pipeline includes exploratory data analysis, preprocessing, patch-based spatial–spectral modeling, consistent train–validation protocols, and multi-dataset evaluation across Indian Pines, Pavia University, and a new custom VNIR hyperspectral dataset collected after prescribed burns at the Lubrecht Experimental Forest in Montana, USA. By systematically comparing statistical, edge-aware, attention-guided, and reinforcement learning-based band-selection strategies, this work identifies compact yet informative spectral subsets that enhance classification performance while reducing computational cost. Importantly, the inclusion of the Montana prescribed-burn dataset provides a unique real-world testbed for understanding band selection behavior in fire-affected forest environments. Overall, this study contributes a generalizable and extensible framework for HSI dimensionality reduction and classification, laying the groundwork for future applications in wildfire assessment, vegetation recovery monitoring, and remote sensing.

Keywords:

hyperspectral imaging; band selection; machine learning; prescribed fire

1. Introduction

Hyperspectral imaging (HSI) has emerged as a powerful remote sensing tool, enabling the fine-grained discrimination of materials, vegetation health assessment, and monitoring of environmental disturbances due to its hundreds of contiguous narrow spectral bands [1,2,3]. The rich spectral information supports applications ranging from land cover classification [4] and crop mapping [5] to soil analysis [6] and quality inspection in agriculture [7]. However, the high dimensionality of HSI data introduces significant challenges, including the “curse of dimensionality”, spectral redundancy, computational complexity, and reduced model generalization in machine learning pipelines [8,9].

Band selection is a critical preprocessing step in hyperspectral image (HSI) analysis, aimed at reducing spectral redundancy while preserving the most informative and discriminative spectral bands for downstream tasks such as classification, detection, and regression [2,9]. Hyperspectral dimensionality reduction is typically achieved through either feature extraction or band selection [1,2]. Feature extraction methods transform the original data into a new feature space by combining spectral information across all bands, which may improve compactness but often sacrifices the physical interpretability of the spectral signatures [10,11,12,13,14]. In contrast, band selection directly identifies a subset of original spectral bands, thereby preserving the inherent spectral meaning and facilitating domain-specific interpretation [1,15].

A wide range of hyperspectral band-selection methods have been proposed and can be broadly categorized into six groups: ranking-based, searching-based, clustering-based, sparsity-based, embedding learning-based, and hybrid approaches [1,2,3]. Ranking-based methods evaluate each band independently using criteria such as variance, entropy, mutual information, or correlation, offering high computational efficiency but often neglecting inter-band relationships [1,2,3]. Searching-based methods aim to identify optimal band subsets by explicitly optimizing an objective function, typically achieving improved classification performance at the cost of higher computational complexity [2,16]. Clustering-based methods group spectrally similar bands and select representative bands from each cluster, effectively reducing redundancy while maintaining spectral diversity [17,18]. Recent advances in this category include fast neighborhood grouping strategies that exploit coarse-to-fine spectral partitioning to capture contextual information across broad spectral ranges [18], and intergroup-difference-based ranking methods that explicitly account for redundancy across selected band groups rather than within individual clusters [19]. Graph-based methods extend these ideas by learning structural relationships among spectral bands through graph matrices, enabling joint exploitation of spatial and spectral information that single-domain methods overlook [20]. These structural and grouping perspectives are particularly relevant to the present study: the edge-preserving operators in SSEP are motivated by the same principle of retaining discriminative high-frequency spatial structure that graph-based methods capture spectrally, and the redundancy penalty in SRPA directly mirrors the inter group difference criterion of [19]. More recently, embedding learning-based methods have integrated band selection into model training, enabling task-driven selection through deep learning, attention mechanisms, and graph-based models [21,22,23]. Reinforcement learning-based approaches further extend this paradigm by formulating band selection as a sequential decision-making problem, allowing for adaptive and dynamic selection strategies [24,25,26,27].

From a theoretical perspective, the band selection problem can be formulated as selecting a subset of bands that maximizes task-specific performance while minimizing redundancy and computational cost [14]. Importantly, prior studies have shown that redundant and irrelevant spectral bands can degrade model performance, introduce instability, and propagate noise, particularly under limited training samples, highlighting the necessity of effective band-selection strategies in hyperspectral analysis [15,28]. While these methods have been extensively studied in general remote sensing applications, their effectiveness in wildfire and prescribed fire environments remains less explored.

These challenges are particularly acute in wildfire science and prescribed fire management, where spectral responses are influenced by burn severity, char deposition, canopy structure, and early vegetation recovery [29,30]. Wildfires pose a growing threat to ecosystems, communities, and economies in the western United States, with Montana experiencing particularly pronounced impacts due to its vast forested landscapes, dry continental climate, and increasing human development in the wildland–urban interface. Over the past two decades, Montana has seen a dramatic rise in wildfire activity: average annual acres burned have exceeded 300,000 since 2000—more than ten times the pre-1950 average of less than 30,000 acres—driven by warmer temperatures, prolonged droughts, fuel accumulation from historical fire suppression, and climate change-induced aridity [31,32]. Recent seasons illustrate variability but underscore the trend: in 2024, over 352,000 acres burned, while 2025 saw a milder year with approximately 76,000 acres (one of the fourth-lowest totals in the last 15 years), attributed to favorable weather but still highlighting ongoing risks [33,34].

Prescribed fires—carefully planned, low-intensity burns—serve as a critical proactive tool to mitigate these threats by reducing hazardous fuel loads, restoring ecological processes, improving wildlife habitat, recycling nutrients, and promoting resilient forests. In Montana, where historical natural fire regimes burned tens to hundreds of thousands of acres annually before suppression, prescribed burning helps counteract fuel buildup and supports forest health. State and federal programs (e.g., DNRC, USFS Helena-Lewis and Clark National Forest) have expanded efforts, with recent accomplishments including tens of thousands of acres treated annually and proposals to scale to approximately 40,000 acres per year through 2045 on national forests alone [35,36,37]. Despite these advances, challenges persist, including limited burn windows due to air quality regulations, smoke concerns, and the need for precise monitoring of fuel types and post-burn recovery.

Accurate ground fuel classification—distinguishing live vegetation, dead biomass, and species-specific conditions—is essential for planning effective prescribed burns, predicting fire behavior, and evaluating ecological outcomes. Hyperspectral remote sensing, with its fine spectral resolution, offers significant potential for this task, yet real-world applications in fire-affected Montana forests remain underexplored due to data scarcity and dimensionality issues [38,39,40].

Despite advances in HSI band selection for specific tasks [41,42], few studies integrate multiple strategies (e.g., statistical, edge-aware, attention-guided, and reinforcement learning-based) in a unified pipeline, especially for prescribed fire applications. Prior work has explored attention and edge-aware methods for burned vegetation classification [43] and self-supervised/DRL approaches for prescribed burn impact analysis [44], highlighting the potential of learning-based techniques in fire-affected scenes. However, systematic comparisons across benchmarks and real-world Unmanned Aircraft System (UAS) datasets remain limited, particularly for ground fuel mapping in controlled burns.

This study addresses these gaps by presenting a comprehensive evaluation of four band-selection strategies, along with a clustering-based baseline method (K-Means Clustering-Based Band Selection: KMCBS), spanning both classical and modern approaches. Specifically, we consider Principal Component Analysis (PCA) as a variance-based baseline [15], Spatial–Spectral Edge Preservation (SSEP) to incorporate spatial structural information [21], Spectral-Redundancy Penalized Attention (SRPA) for attention-driven and redundancy-aware selection [21,45], and Deep Reinforcement Learning (DRL) to model band selection as a sequential decision-making process [24,25,26,27]. These methods represent complementary perspectives on hyperspectral band selection, enabling a systematic analysis of the trade-offs between information preservation, redundancy reduction, and computational efficiency. These band-selection strategies are evaluated in combination with classical classifiers, including Random Forest, Support Vector Machines, and K-Nearest Neighbors, as well as deep learning models such as 3D Convolutional Neural Networks, combined with classical (Random Forest, Support Vector Machines, and K-Nearest Neighbors) and deep (3D Convolutional Neural Networks) classifiers. We apply a modular, reproducible pipeline to benchmark datasets (Indian Pines, Pavia University, Salinas, Botswana, and Kennedy Space Center) and a novel VNIR hyperspectral dataset collected via UAS over prescribed burn sites at the Lubrecht Experimental Forest, Montana, USA. This dataset captures pre-burn conditions in a controlled thin-burn plot, providing a unique real-world testbed for understanding band selection behavior in fire-prone forest environments.

2. Materials and Methods

The goal of this study is to systematically evaluate a unified and reproducible hyperspectral image (HSI) classification pipeline across both benchmark airborne datasets and a complex UAS-based visible–near-infrared (VNIR) dataset captured over prescribed burn sites in the Lubrecht Experimental Forest, a large outdoor forest research laboratory located in the Blackfoot River drainage, about 30 miles northeast of Missoula, Montana. To accomplish this, we designed a modular workflow consisting of four tightly integrated stages illustrated in process diagram Figure 1: (1) dataset preparation and exploratory data analysis (EDA), (2) noise detection and data cleaning, (3) band selection using classical and deep learning-based feature reduction techniques, and (4) classification using both traditional machine learning models and deep learning architectures. All components were implemented in a consistent, dataset-agnostic fashion so that the same pipeline could be applied to Indian Pines, Pavia University, Salinas, Botswana, KSC, and the Montana VNIR dataset without modification.

This section describes the materials, datasets, preprocessing steps, algorithms, model architectures, and evaluation design used in the study, presenting each part as a connected narrative that reflects the logical progression of the project.

2.1. Data

Hyperspectral imagery consists of hundreds of narrow and contiguous spectral bands, providing rich spectral information for material discrimination. However, this high spectral resolution also introduces significant redundancy and the well-known Hughes phenomenon (also known as the “curse of dimensionality”), motivating the need for dimensionality reduction and band selection prior to classification. Consequently, standardized preprocessing and band selection are widely adopted in hyperspectral image analysis pipelines [1,2,3].

We evaluate a unified hyperspectral classification framework using a diverse collection of datasets that span controlled benchmark scenes and a complex real-world Unmanned Aircraft System (UAS)-based acquisition. Specifically, experiments were conducted on five widely used benchmark hyperspectral datasets—Indian Pines, Pavia University, Salinas Valley, Botswana, and Kennedy Space Center [46]—as well as a custom visible–near-infrared (VNIR) hyperspectral dataset collected over prescribed burn sites in Montana, USA. The benchmark datasets represent a range of land cover types, spatial resolutions, sensors, and class distributions, and have been extensively used in prior hyperspectral classification and band selection studies, enabling meaningful comparison with the existing literature [8,9,47].

A summary of the benchmark datasets, including the sensor type, spatial dimensions, number of spectral bands before and after sensor-documented noisy band removal, number of classes, and spatial resolution, is provided in Table 1. The cleaned benchmark cubes served as inputs to the subsequent exploratory data analysis. Rather than treating each benchmark dataset independently, all were processed using the same unified and dataset-agnostic pipeline to ensure consistency across preprocessing, band selection, and classification stages. This design allows performance differences to be attributed to algorithmic choices rather than dataset-specific handling.

In contrast to the benchmark scenes, the Montana VNIR dataset represents a real-world post-fire forest environment acquired using a UAS platform, characterized by higher spectral noise, severe class imbalance, sparse ground truth, and complex canopy structure. Details of the acquisition parameters, preprocessing steps, and dataset-specific challenges for the Montana dataset are described separately in Section 2.1, as they differ substantially from those of the benchmark datasets and require specialized treatment.

Montana

The hyperspectral images used in this study were acquired on 11 May 2024 over a controlled thin-burn plot (330 m × 300 m) in the Lubrecht Experimental Forest, approximately 42 km east of Missoula, Montana, USA. Data collection was conducted using a Headwall co-aligned visible–near infrared (VNIR) and short-wave infrared (SWIR) imaging system, integrated with a FreeFly ALTA X heavy-duty Unmanned Aircraft System (UAS). Prior to the flight, the imaging system was calibrated using a certified Spectralon white diffuse reflectance standard to determine the optimal exposure time, frame period, and recommended flight speed. Based on this information, a flight plan was designed to maintain a nearly constant altitude relative to the terrain, thereby eliminating the need for terrain correction in the resulting imagery. Magnetometer calibration was also performed to ensure the accurate detection of the Earth’s magnetic field for reliable navigation. The precise location of the imaging system was recorded using a Trimble SPS585 global navigation satellite system (GNSS) unit for Real-Time Kinematic (RTK) surveying. Before executing the flight plan, the Inertial Measurement Unit (IMU) was initialized to track the aircraft’s movement and orientation. The hyperspectral imaging system captured 543 spectral band images—273 in the VNIR range (400–1000 nm) and 270 in the SWIR range (900–2500 nm). The data files acquired were preprocessed using Spectral View, the proprietary software provided by the manufacturer of the hyperspectral imaging system. The raw data, initially recorded in digital numbers (DNs), was corrected for noise and dark current using imagery collected prior to data acquisition. This process converted the DN values into spectral radiance (mW/(cm²·sr·µm)). The radiance data was then transformed into spectral reflectance using a calibrated reference tarp placed within the imaging scene during processing. Finally, the processed images were exported in GeoTIFF format to enable further analysis using standard image processing software.

2.2. Exploratory Data Analysis

Exploratory data analysis (EDA) was conducted prior to model training to examine radiometric distributions, assess spectral redundancy, and identify noisy or unstable bands. EDA plays a critical role in ensuring data integrity and preventing the propagation of artifacts into downstream band selection and the classification model [9,38].

2.3. EDA for Benchmark Hyperspectral Datasets

The objective of the benchmark dataset EDA was to standardize all preprocessing steps across the five classical hyperspectral scenes—Indian Pines, Pavia University, Salinas, Botswana, and Kennedy Space Center (KSC)—so that band-selection algorithms could be evaluated under consistent noise conditions. These scenes differ substantially in terms of their spatial dimensions, radiometric behavior, and class distributions, making a unified and statistically grounded EDA pipeline necessary. The goal of this pipeline was to verify data integrity, characterize radiometric distributions, identify low-SNR or water-absorption bands, and construct noise-free normalized cubes to serve as inputs for SSEP, SRPA, and DRL-based band selection as well as for classical and deep classifiers.

Each dataset was loaded from its .mat hyperspectral file and associated ground-truth (gt) .mat label mask using dataset-specific variable keys. Upon loading, the benchmark hyperspectral cubes exhibited the following dimensions: Indian Pines (145 × 145 × 200), PaviaU (610 × 340 × 103), Salinas (512 × 217 × 204), Botswana (1476 × 256 × 145) and KSC (512 × 614 × 176). The basic integrity diagnostics included the computation of global intensity minima and maxima, band-wise means, and band-wise standard deviations. Botswana displayed reflectance values ranging from 0 to 45,106, with an average band-wise mean of 1749.62 and an average standard deviation of 420.48. Indian Pines ranged from 955 to 9604, with an average band-wise mean of 2652.39 and an average standard deviation of 336.36. KSC ranged from –27 to 1244, with an average band-wise mean of 93.42 and an average standard deviation of 72.36. PaviaU showed intensities between 0 and 8000, with an average band-wise mean of 1389.12 and an average standard deviation of 716.43. Salinas ranged from –11 to 9207, with a corresponding mean band-wise mean of 1196.40 and an average standard deviation of 389.91. These statistics confirmed proper radiometric scaling and the absence of corruption in the raw reflectance values.

Ground-truth label distributions were analyzed to verify class presence and quantify class imbalance. Botswana contained 15 classes with pixel counts ranging from 95 to 314, against 374,608 background pixels. Indian Pines exhibited 16 classes, including extremely sparse categories (e.g., class 1 with only 46 pixels) and dense categories such as class 11 with 2455 pixels. KSC contained 13 classes with class populations ranging from 105 to 927 and background comprising 309,157 pixels. PaviaU included nine semantic classes, with some highly populated categories (e.g., class 2 with 18,649 pixels) and background containing 164,624 pixels. Salinas exhibited 16 classes, with major categories (e.g., class 8 with 11,271 pixels) and small categories (e.g., class 13 with 916 pixels). These results confirm a substantial class imbalance across all benchmark datasets, which should be considered in downstream learning.

To characterize spectral noise, each hyperspectral cube was reshaped into a matrix of size

N \times B

, where

N = H \cdot W

, enabling band-wise analysis across all spatial pixels. For each spectral channel b, we computed its minimum and maximum intensities

(m_{b}, M_{b})

, mean

μ_{b}

, standard deviation

σ_{b}

, dynamic range

R_{b} = M_{b} - m_{b}

, and zero-fraction:

f_{b}^{(0)} = \frac{1}{N} \sum_{i = 1}^{N} 1 [X_{i, b} = 0]

(1)

Bands were flagged as suspicious if they exhibited extremely low variance (i.e.,

σ_{b}

lying in the lowest 5% percentile of all variances), unusually small dynamic range (i.e.,

R_{b}

less than

10^{- 3}

of the global range), or high zero-fraction (i.e.,

f_{b}^{(0)} \geq 0.80

). Although none of the benchmark datasets contained high zero-fraction bands, each dataset presented a small set of low-variance bands that aligned with known water-absorption and sensor-instability regions. This EDA-based noisy band identification was performed in addition to the initial sensor-documented band exclusion summarized in Table 1. The heuristic procedure identified the following candidate noisy bands: Botswana—[111, 138–144]; Indian Pines—[102–104, 143–145, 195, 197–199]; KSC—[0–8]; PaviaU—[0–5]; and Salinas—[106–109, 146–148, 198, 201–203]. These empirically identified indices match previously documented noisy band lists in the hyperspectral imaging literature.

Based on these technical diagnostics and literature support, we removed the identified noisy bands from each dataset, yielding cleaned spectral dimensionalities of 137 bands (Botswana), 190 bands (Indian Pines), 167 bands (KSC), 97 bands (PaviaU), and 193 bands (Salinas). The spatial dimensions were unchanged, ensuring that only the spectral axis was compressed. This removal eliminated known water-absorption wavelengths and low–SNR regions while preserving all meaningful spectral–spatial structure required for downstream tasks.

After noisy band removal, each dataset was normalized per spectral channel using min–max scaling:

X_{i, b}^{norm} = \frac{X_{i, b} - m_{b}}{M_{b} - m_{b} + ε}

(2)

where

ε

prevents division by zero. This transformation places all spectral channels in the

[0, 1]

range, eliminates scale disparities among bands, and stabilizes distance metrics and gradients for machine learning models. The cleaned and normalized hyperspectral cubes and associated summary statistics were retained for downstream analysis.

The results of this benchmark EDA demonstrate that the unified pipeline not only confirms the structural and radiometric consistency of all datasets but also quantitatively identifies and removes noisy spectral regions using both heuristic and literature-validated criteria. By producing noise-controlled hyperspectral cubes with well–characterized statistics, the benchmark EDA ensures that subsequent band-selection and classification experiments are conducted under standardized, reproducible, and scientifically rigorous conditions.

2.4. EDA for the Montana UAV Dataset

The Montana VNIR dataset consists of a drone-acquired hyperspectral mosaic with 273 spectral bands and spatial dimensions of 10,706 × 8360 pixels. Before applying any band-selection algorithms or training classification models, we conducted a comprehensive exploratory data analysis (EDA) to (i) construct an accurate pixel-wise ground truth (GT), (ii) assess class distribution and label sparsity, and (iii) evaluate spectral-band radiometric quality and class-wise separability.

2.4.1. Ground-Truth Construction and Alignment Verification

Field plot measurements, provided as longitude–latitude points with species labels, were converted into a pixel-wise raster mask using a custom reprojection and rasterization pipeline. The GPS coordinates (EPSG:4326) were transformed into the coordinate reference system of the VNIR cube using rasterio.warp.transform, followed by conversion to pixel indices via the GeoTIFF affine transform. Each species was assigned a unique integer class ID (with zero reserved for background), and points within the VNIR footprint were written into a 2D mask saved as montana_gt.npy.

To validate GT–image alignment, we generated a pseudo-RGB composite generated using three high-SNR VNIR bands (R = 150, G = 100, and B = 50) and overlaid with the GT to verify geometric alignment. This procedure revealed and resolved an initial misalignment between labels and imagery. After reconstructing the GT using accurate reprojection, the overlay confirmed correct spatial correspondence between labeled pixels and canopy/burned regions. As shown in Figure 2, the corrected GT aligns well with canopy structures visible in the hyperspectral mosaic. This verification step is essential, as even small registration errors in UAS hyperspectral imagery can significantly degrade classification accuracy.

2.4.2. Class Distribution and Label Sparsity

The final GT map contains five forest species or condition classes: Dead Biomass, Dead Ponderosa Pine, Douglas Fir, Ponderosa Pine, and Western Larch. The labels are extremely sparse relative to the full image extent, and the class distribution is highly imbalanced. Specifically, we observe 287 pixels for Dead Biomass, 212 for Ponderosa Pine, 183 for Douglas Fir, 49 for Western Larch, and only 30 for Dead Ponderosa Pine. Such an imbalance necessitates the use of class-balanced sampling, focal-loss formulations, or class weighting during model training to avoid bias toward majority classes.

2.4.3. Band-Wise Radiometric Quality Analysis

Each of the 273 bands was evaluated using descriptive and noise-related statistics computed across the full spatial domain, including the mean, standard deviation, minimum, maximum, dynamic range, coefficient of variation (

σ / μ

), zero-fraction (proportion of pixels equal to zero), and a simple SNR proxy (

μ / (σ + ϵ)

).

The zero-fraction curve in Figure 3 reveals substantial sensor noise at the spectral extremes. Bands 1–40 show very high zero-fraction values (

0.60

–

0.68

), indicating weak short-wavelength sensor responsivity. A similar degradation is observed at the high-wavelength end (bands 240–273), where the detector suffers from end-of-range rolloff.

The standard deviation curve in Figure 4 complements this observation. Bands 40–130 exhibit stable but moderate variance, whereas bands 130–240 show the strongest spatial variability (≈0.014–

0.016

), representing the most informative spectral region for vegetation and burn discrimination. Beyond band 240, variance drops sharply again due to sensor noise.

Based on these EDA diagnostics, bands 1–40 and 250–273 were removed prior to downstream modeling, leaving 211 high-quality bands for analysis.

2.4.4. Radiometric Normalization and Clean Cube Creation

For each retained band, we applied robust percentile clipping (1st–99th percentile) followed by min–max normalization to the range

[0, 1]

. This process reduces the influence of shadows and extreme outliers, while mitigating cross-track illumination variation. The resulting cleaned VNIR cube serves as the standardized input for subsequent band-selection and classification experiments.

2.4.5. Class-Wise Spectral Signature Analysis

Using the cleaned cube and GT mask, we computed the mean spectral signature for each class by averaging spectral reflectance values across all labeled pixels:

{\bar{s}}_{c} (b) = \frac{1}{N_{c}} \sum_{{(x, y) : G T (x, y) = c}} I_{b} (x, y)

(3)

The resulting spectra showed that the three live conifer species (Douglas Fir, Ponderosa Pine, and Western Larch) share closely aligned spectral profiles, with only subtle differences. Dead biomass exhibits lower reflectance and altered mid-VNIR slopes, but separability is distributed across many bands. Unlike benchmark datasets (e.g., Indian Pines and Pavia), spectral contrast in this dataset is broad rather than concentrated, indicating that larger Top-K values (e.g., 60–100 bands) are necessary for effective band selection.

2.4.6. Montana EDA Insights

The Montana EDA demonstrates the following: (i) accurate GT construction and alignment are essential for reliable supervision, (ii) extreme class imbalance requires balanced sampling strategies, (iii) only the mid-VNIR range (bands 40–240) provides high-quality spectral information, (iv) short- and long-wavelength bands should be discarded due to sensor noise, and (v) the subtle, distributed spectral differences between forest classes motivate the use of larger band subsets and patch-based deep models.

These findings guided all subsequent preprocessing, band selection, and classification experiments.

2.5. Feature Selection (Band Selection)

To mitigate high dimensionality and spectral redundancy in hyperspectral images (HSIs), five band-selection techniques were employed: Principal Component Analysis (PCA), Spatial–Spectral Edge Preservation (SSEP), Spectral-Redundancy Penalized Attention Ranking (SRPA), and Deep Reinforcement Learning (DRL). Each method processes a pre-cleaned HSI data cube (height × width × bands, stored as .npy) and the corresponding ground-truth (GT) map (2D integer labels; .npy) after noise removal and normalization. These methods include unsupervised (PCA) and label-guided or supervised (SSEP, SRPA, and DRL) band-selection strategies, producing ranked band indices, scores, and visualizations for subsequent classification tasks. For the Montana UAS dataset, the EDA-cleaned cube additionally includes robust percentile clipping prior to normalization, as described in Section 2.4.

2.6. K-Means Clustering-Based Band Selection (KMCBS)

Clustering-based band selection represents one of the mainstream and widely adoptedapproaches in hyperspectral dimensionality reduction [18,19]. To provide a strong unsupervised baseline for comparison with the proposed and learning-based methods, we include KMCBS. The KMCBS method clusters the B spectral bands into K groups using the K-Means algorithm [48] and selects the band closest to each cluster centroid as the representative band for that group, yielding exactly K selected bands with minimal intra-subset redundancy. Formally, let each spectral band

b_{i} \in R^{H \times W}

be treated as a data point after normalization. K-Means partitions the B bands into K clusters

{C_{1}, \dots, C_{K}}

by minimizing intra-cluster variance. The representative band for cluster

C_{k}

is

b_{k}^{*} = arg min_{b_{i} \in C_{k}} {∥ {\tilde{b}}_{i} - μ_{k} ∥}_{2}

(4)

where

{\tilde{b}}_{i}

is the normalized band vector and

μ_{k}

is the cluster centroid. For large-scale datasets such as the Montana UAV VNIR cube, pixel subsampling of 50,000 pixels is applied prior to clustering to manage memory requirements.

2.7. Principal Component Analysis (PCA)-Based Band Selection

PCA is a classical unsupervised technique for variance-based band prioritization. We fit PCA on spectra from labeled pixels (GT > 0) to focus variance estimation on class-relevant regions. Full PCA is fitted (n_components = B, where B is the number of bands), yielding explained variance ratios (EVR) and component loadings. It is worth noting that, strictly speaking, PCA is a feature-extraction rather than a band-selection method, as it produces linearly transformed components that combine information across all original spectral bands rather than identifying a subset of original wavelengths. Consequently, PCA-selected components do not preserve the physical interpretability of individual spectral bands. PCA is included in this study as a classical variance-based dimensionality reduction baseline, providing a reference point against which supervised and learning-based band-selection strategies can be compared [3].

The score for each band b is computed as

\sum_{i = 1}^{M} {EVR}_{i} \times {({loading}_{i, b})}^{2}

(5)

We set

M = 30

as a stable trade-off that captures dominant variance while avoiding noise-dominated components; sensitivity to M was empirically minor in pilot runs [49].

2.8. Spatial–Spectral Edge Preservation (SSEP)

SSEP is a label-guided band ranking method that leverages spatial class boundaries derived from ground-truth annotations [50]. A reference binary edge map is derived from the GT using the Sobel gradient operator on label transitions. For each band, light Gaussian smoothing (

σ = 1.0

) was applied to reduce noise, followed by Sobel edge computation, thresholding at the 95th percentile, and binarization. The alignment with the GT edge map is quantified using the Dice coefficient:

Dice = \frac{2 \times | E_{band} \cap E_{GT} |}{| E_{band} | + | E_{GT} |}

(6)

Bands are ranked in descending order of these scores, favoring those that maintain sharp spatial–spectral edges [50]. The implementation of SSEP [43] in this study follows the methodology described in Algorithm 1.

Algorithm 1 SSEP Band Selection

Require:: HSI cube C ( $H \times W \times B$ ), GT map G ( $H \times W$ )
Ensure:: Ranked band indices order (descending scores)
1:: edge_GT ← binarize(Sobel(float(G))) ▹ Binary edges where gradient $> 0$
2:: scores ← zeros(B)
3:: for $b = 0$ to $B - 1$ do
4:: band_img ← $C [:, :, b]$
5:: smoothed ← Gaussian_filter(band_img, $σ = 1.0$ )
6:: edge_band ← Sobel(smoothed)
7:: thresh ← percentile(edge_band, 95)
8:: edge_bin ← (edge_band > thresh)
9:: scores[b] ← Dice(edge_bin, edge_GT)
10:: end for
11:: order ← argsort(scores)[::−1]
12:: return order, scores

2.9. Spectral-Redundancy Penalized Attention Ranking (SRPA)

SRPA employs a supervised attention mechanism to balance band informativeness and diversity. Small

5 \times 5

patches are extracted around labeled pixels (up to 4000 samples per dataset; when fewer labeled samples were available, as in the Montana UAS dataset, all labeled patches were used) and split into train/validation sets (80/20; stratified). A lightweight 3D CNN (Conv3D layers, max-pooling, global average pooling, and Squeeze-and-Excitation block) is trained for two epochs to obtain stable attention trends while keeping the band-ranking stage computationally lightweight. After training, the mean band-wise attention weights are inferred from the SE block. Redundancy is computed from the correlation matrix of flattened subsampled patches:

red (b) = \frac{1}{B - 1} \sum_{j \neq b} | corr (b, j) |

(7)

This term penalizes bands that are highly correlated with the remainder of the spectrum. Final scores are attention

- λ \times

redundancy (

λ = 0.3

). Bands are ranked in descending order, promoting discriminative yet non-redundant selections [51]. The complete SRPA procedure, including patch extraction, network training, attention aggregation, and redundancy penalization, is summarized in Algorithm 2.

Algorithm 2 SRPA Band Selection

Require:: HSI cube C ( $H \times W \times B$ ), GT map G, patch_size = 5, max_patches = 4000, $λ = 0.3$
Ensure:: Ranked band indices order (descending scores)
1:: Extract patches P ( $N \times 5 \times 5 \times B$ ) and labels y from labeled positions ( $N \leq$ max_patches)
2:: Split P, y → train/validation (80/20, stratified)
3:: Initialize 3D CNN with SE attention block
4:: Train model on train patches (2 epochs, cross-entropy loss, Adam)
5:: attn ← mean(SE outputs over validation) ▹ shape (B,)
6:: $X \leftarrow$ flatten(subsample(P)) ▹ ( $N_{flat} \times B$ )
7:: corr ← corrcoef(X)
8:: redundancy ← (sum( $| corr |$ , axis = 1) − 1)/( $B - 1$ )
9:: scores ← attn $- λ \times$ redundancy
10:: order ← argsort(scores)[::−1]
11:: return order, scores, attn, redundancy

2.10. Deep Reinforcement Learning (DRL)-Based Band Selection

To enable adaptive and dataset-agnostic band selection, we formulate spectral band selection as a sequential decision-making problem and solve it using Deep Reinforcement Learning. The objective is to incrementally construct compact band subsets that maximize downstream classification performance while discouraging redundant or unnecessarily large selections. The problem is modeled as a Markov Decision Process (MDP) and optimized using a Deep Q-Network (DQN), allowing the agent to explicitly reason wtih regard to long-term selection quality rather than relying on greedy or one-shot ranking strategies.

2.10.1. Markov Decision Process Formulation

Band selection is formulated as a finite-horizon MDP defined by the tuple

(S, A, P, R, γ)

, where

S

denotes the state space,

A

the action space, P the transition dynamics, R the reward function, and

γ \in (0, 1]

the discount factor.

State.

At decision step t, the state

s_{t} \in S

encodes the current band selection context and is defined as

s_{t} = [m_{t}, | S_{t} | / B, r_{t}],

(8)

where

S_{t}

is the set of selected bands up to step t, B denotes the number of spectral bands after EDA-based cleaning (dataset-specific; e.g.,

B = 211

for the Montana VNIR dataset),

m_{t} \in {0, 1}^{B}

is a binary mask indicating the selected bands, and

r_{t} \in R^{B}

represents per-band redundancy statistics computed from the EDA-cleaned hyperspectral cube. This state representation captures selection history, relative subset size, and spectral redundancy.

Action.

The action space consists of selecting one unselected spectral band:

a_{t} \in {b ∣ b \notin S_{t}} .

(9)

Actions corresponding to previously selected bands are masked to prevent reselection.

Transition.

Transitions are deterministic. Executing action

a_{t}

updates the selected set as

S_{t + 1} = S_{t} \cup {a_{t}}

, yielding the next state

s_{t + 1}

. An episode terminates when a predefined band budget K is reached.

Reward.

The reward function balances downstream classification performance and subset compactness. After selecting K bands, a lightweight classifier is trained using only the selected subset, and the terminal reward is defined as

R = Acc (S_{K})

(10)

where

Acc (S_{K})

is the validation accuracy of a lightweight Random Forest classifier trained on the selected band subset

(S_{K})

. The band budget K is controlled externally via the Top-K parameter, so no compactness penalty is required.

2.10.2. Deep Q-Network Architecture

The action-value function

Q (s, a)

is approximated using a fully connected neural network that maps the state representation to Q-values over candidate actions. The network consists of multiple dense layers with ReLU activations. A target network and an experience replay buffer are employed to stabilize training and mitigate overestimation bias.

2.10.3. Training Procedure

At each episode, the agent sequentially selects spectral bands until the budget K is reached. The terminal reward is computed using validation accuracy, and transitions

(s_{t}, a_{t}, R, s_{t + 1})

are stored in the replay buffer. The DQN is trained by minimizing the temporal-difference loss:

L = E [{(R + γ max_{a^{'}} Q_{target} (s_{t + 1}, a^{'}) - Q (s_{t}, a_{t}))}^{2}] .

(11)

An

ϵ

-greedy strategy is used to balance exploration and exploitation during training.

2.10.4. Dataset-Specific Handling

The DRL formulation is dataset-agnostic. The value of B and redundancy statistics are derived from the EDA-cleaned hyperspectral cube for each dataset. For the Montana UAV dataset, EDA cleaning includes percentile clipping and the removal of low-quality VNIR bands, as described in Section 2.4. No additional dataset-specific tuning is applied.

2.10.5. Evaluation and Stability

Due to the stochastic nature of reinforcement learning, each DRL experiment is repeated multiple times with different random seeds. Band rankings are obtained by averaging selection frequencies across runs, yielding stable and reproducible band importance estimates.

The DRL-based band-selection method enables the adaptive, sequential selection of informative spectral bands while explicitly accounting for subset compactness. By integrating EDA-derived statistics and downstream classification feedback into a unified MDP framework, the method provides a flexible alternative to classical and attention-based band-ranking strategies.

2.11. Classification Models

To evaluate the effectiveness of the selected hyperspectral bands, we employed a suite of classical machine learning classifiers and a deep learning model. These models were trained on the reduced-dimensionality data obtained from the band-selection methods (PCA, SSEP, SRPA, and DRL) as well as the full-band baseline. The classifiers include Random Forest (RF), K-Nearest Neighbors (KNN), Support Vector Machines (SVM), and a 3D Convolutional Neural Network (3D-CNN). These models were chosen for their proven efficacy in hyperspectral image classification tasks, balancing computational efficiency with performance in handling high-dimensional spectral–spatial data [47,52]. All models were implemented using scikit-learn for classical methods and PyTorch 2.0.0 for the deep model, with the hyperparameters tuned based on standard practices in the HSI literature [53].

2.11.1. Random Forest

RF is an ensemble learning method that constructs multiple decision trees during training and outputs the class that is the mode of the classes from individual trees. In our implementation, we used the RandomForest Classifier from scikit-learn with 200 estimators, utilizing all available CPU cores (n_jobs = −1) for parallel processing, and a fixed random state of zero for reproducibility. RF is particularly effective for HSI classification due to its ability to handle multicollinearity in spectral bands, and its robustness to overfitting [54]. The model was trained on flattened spectral features from labeled pixels or selected bands, without incorporating spatial information explicitly. This approach aligns with prior studies on hyperspectral data where RF serves as a strong baseline for pixel-wise classification [55].

2.11.2. KNN

The KNN classifier assigns a class to a query point based on the majority vote of its K-Nearest Neighbors in the feature space. We implemented KNN using scikit-learn’s KNeighborsClassifier with k = 5, employing the default Euclidean distance metric. This value of k was selected based on empirical performance in HSI datasets, where smaller neighborhoods help capture local spectral similarities while avoiding excessive noise sensitivity [56]. KNN is computationally simple and non-parametric, making it suitable for hyperspectral data with varying class distributions, as seen in post-fire fuel classification scenarios [57]. Like RF, it operates on pixel-based spectral features, leveraging the reduced band subsets to mitigate the curse of dimensionality.

2.11.3. SVM

SVM aims to find the optimal hyperplane that separates classes in a high-dimensional space, maximizing the margin between support vectors. Our SVM implementation used scikit-learn’s SVC with a radial basis function (RBF) kernel, regularization parameter C = 10, and gamma set to ‘scale’ (automatically computed as

1 / (n_{features} \times X . var ())

. This configuration is commonly used in HSI classification to handle non-linear separability in spectral data [58]. SVM’s strength lies in its effectiveness with small sample sizes and high-dimensional inputs, which is relevant for our prescribed-burn dataset where labeled pixels may be sparse [59]. The model was trained on the same pixel-wise features as RF and KNN, benefiting from band selection to reduce kernel computation overhead.

2.11.4. 3D-CNN

For all datasets, spatial–spectral patches of a fixed size were extracted from the EDA-cleaned hyperspectral cube prior to 3D-CNN training, ensuring consistent spatial context across band-selection methods. The 3D Convolutional Neural Network (3D-CNN) extends traditional CNNs by incorporating spectral depth as an additional dimension, enabling joint spatial–spectral feature extraction. Our model, implemented in PyTorch 2.0.0, consists of two 3D convolutional layers followed by batch normalization, ReLU activation, max pooling, adaptive average pooling, and a fully connected output layer. The input shape is (batch_size, 1, bands, patch_size, patch_size), where patches are centered on labeled pixels with a patch_size of five (as defined in the dataset configurations). The first convolutional layer uses 16 filters with a kernel size of (3, 3, 7) and padding to preserve dimensions, while the second uses 32 filters with (3, 3, 5). Training was performed over 20 epochs with Adam optimization (learning rate 1 ×

10^{- 3}

), cross-entropy loss, and a batch size of 32, monitoring validation accuracy to select the best model [60]. This architecture is tailored for HSI, capturing volumetric patterns in post-fire environments, and has shown superior performance over 2D CNNs in spectral–spatial tasks [61].

2.12. Class Imbalance-Handling Strategies

The Montana UAV VNIR dataset exhibits a severe class imbalance, with Dead Ponderosa Pine comprising only 30 labeled pixels. Three strategies are evaluated to address this. Class-weighted loss assigns inverse-frequency weights to each class during training, penalizing the misclassification of minority classes more heavily. For Random Forest, this is implemented via class_weight=‘balanced’, and for 3D-CNN via a weighted cross-entropy loss function. Focal loss [62] modifies standard cross-entropy by down-weighting easy examples and focusing learning on hard minority samples, using a focusing parameter

γ = 2.0

. SMOTE (Synthetic Minority Oversampling Technique) [63] generates synthetic training samples for minority classes by interpolating between existing samples in feature space, applied to pixel-level features for Random Forest classification.

2.13. Evaluation Metrics

The model performance was assessed using a comprehensive set of metrics to ensure balanced evaluation across imbalanced classes typical in HSI datasets. Overall accuracy (OA) measures the percentage of correctly classified pixels. Cohen’s Kappa coefficient accounts for chance agreement, providing a more robust measure of inter-class reliability [64]. Macro-averaged Precision, Recall, and F1-score were computed to evaluate performance across all classes equally, handling class imbalances in prescribed-burn scenarios where certain fuel types (e.g., char or recovering vegetation) may be underrepresented [65]. These metrics were calculated in percentage form for consistency with OA, using scikit-learn’s precision_score, recall_score, and f1_score functions with ‘macro’ averaging and zero_division = 0 to manage undefined cases. Additionally, confusion matrices were generated to visualize misclassifications. For deep models, the best validation accuracy during training was also logged as an indicator of convergence [66]. All metrics were derived from a 70/30 train–validation split, stratified by class to maintain distribution. All classifier hyperparameters were fixed to commonly used values from prior hyperspectral classification literature, and were kept identical across all band-selection methods to ensure a fair comparative evaluation. All the reported metrics were computed exclusively on held-out validation or test data and were not used during training or band selection stages.

2.14. Experimental Setup

Experiments were conducted on a unified pipeline implemented in Python 3.12, utilizing NumPy for data handling, scikit-learn for classical models, and PyTorch for deep architectures. Datasets included benchmark HSI scenes (Indian Pines, Pavia University, Salinas, Botswana, and KSC) and a custom VNIR hyperspectral dataset from prescribed burns at Lubrecht Experimental Forest, Montana, USA. Data preprocessing involved loading hyperspectral cubes and ground-truth maps as NumPy arrays, extracting labeled pixels or center patches (patch_size = 5 for patch-based models), and applying a 70/30 train–validation split with stratification (random_state = 0). For band selection, methods (PCA, SSEP, SRPA, and DRL) were applied to generate Top-K band subsets (k in [5, 10, 15, 20, 25, 30, 35, 40, 45, 50]), with the results compared against a full-band baseline. Training was performed on an NVIDIA GPU-enabled system where available, with CPU fallback for classical models. The results were logged to a CSV file tracking dataset, method, Top-K, model, and metrics. Cross-dataset evaluation ensured generalizability, particularly testing Montana data for real-world fire monitoring applicability [67]. The setup emphasizes reproducibility, with all paths and hyperparameters defined in a central configuration file. All experiments were repeated over multiple random seeds, and the results were averaged to reduce variance.

In addition to dataset-specific evaluations, we report a cross-dataset summary in which, for each band selection technique, the best-performing configuration is selected based on aggregated performance across all datasets. This analysis is intended to highlight general performance trends and preprocessing sensitivity rather than dataset-specific accuracy.

3. Results

3.1. Result Aggregation and Best-Configuration Identification

For each dataset, all evaluated configurations—defined by the combination of the band-selection technique (No Band Selection, PCA, SSEP, SRPA, and DRL), the number of selected bands (Top-K), and classifier (RF, SVM, KNN, and 3DCNN)—were executed using five independent runs with different random seeds. The performance is reported as the mean ± standard deviation for overall accuracy (OA), Cohen’s Kappa, and the macro-averaged F1 score.

This exhaustive evaluation produced approximately 400–415 configuration-level results per dataset. The complete per-configuration results for all datasets, band-selection methods, Top-K values, and classifiers—including all five independent runs—are provided as supplementary repository, available in the project GitHub repository at https://github.com/BMW-lab-MSU/hyperspectral-feature-selection-prescribed-fires/ (accessed on 20 December 2025). The supplementary repository include: (i) the raw result CSV files for all ten experimental runs (five EDA; five No-EDA) across all six datasets; (ii) the per-band selection score CSV files for all four methods under both EDA and No-EDA conditions for all datasets (40 files total); and (iii) an interactive HTML summary of Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8.

The best configurations were identified using a hierarchical selection criterion that prioritizes the (i) highest mean OA, (ii) highest mean Kappa, (iii) highest mean macro-F1, and (iv) lower OA standard deviation. Note that the best configurations are selected independently for EDA and No-EDA conditions; consequently, the optimal Top-K value may differ between the two settings for the same method, as each preprocessing pathway produces a distinct spectral input. The resulting best-per-technique summaries are reported in Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8.

3.2. Dataset-Specific Results

3.2.1. Indian Pines (Table 2)

On Indian Pines, DRL achieves the highest OA under both EDA (

89.23 \pm 2.29

%) and No-EDA (

89.69 \pm 1.29

%) conditions, using compact Top-10 and Top-5 band subsets respectively with a 3D-CNN classifier. The no-band selection baseline ranks second under EDA (87.97%), while SRPA and PCA occupy intermediate positions. SSEP yields the lowest OA under EDA (72.26%), notably below PCA (77.59%), reflecting its sensitivity to spectral noise in this spectrally complex scene.

The

Δ

columns reveal that EDA provides a minimal or slightly negative impact on OA for most methods on this dataset. DRL (

Δ OA = - 0.46

), SRPA (

Δ OA = - 2.60

), and PCA (

Δ OA = - 0.16

) all show marginally lower performance under EDA relative to No-EDA, while SSEP shows a small positive gain (

+ 0.10

%). These results suggest that on Indian Pines, EDA preprocessing does not confer a consistent OA advantage, though performance differences remain modest. DRL consistently achieves the highest macro-F1 (90.28% with EDA), indicating strong per-class discrimination.

3.2.2. Pavia University (Table 3)

Pavia University exhibits strong overall performance across all band-selection methods. DRL achieves the highest OA under both conditions (EDA:

98.75 \pm 0.36

%; No-EDA:

99.03 \pm 0.19

%), with compact Top-15 band subsets and a 3D-CNN classifier. SRPA ranks second under EDA (97.68%), followed closely by PCA (96.77%) and SSEP (96.52%), all using Top-30 3D-CNN configurations. The no-band selection baseline (94.00%) is substantially outperformed by all band-selection methods, highlighting the benefit of dimensionality reduction in this densely annotated urban scene.

EDA provides a moderate positive gain for SRPA (

Δ OA = + 0.73

) and SSEP (

Δ OA = + 1.96

), while DRL and PCA show marginally lower OA under EDA (

- 0.28

and

- 0.48

respectively). The consistently high Kappa values (>0.95 for DRL, SRPA, and PCA under EDA) confirm strong inter-class agreement beyond chance, with low standard deviations indicating stable performance across runs.

3.2.3. Salinas (Table 4)

Salinas achieves a uniformly high OA across all methods, reflecting its relatively favourable class separability. DRL produces the highest EDA OA (

95.02

%) using a Top-50 RF configuration, with the no-band selection baseline ranking second (94.98%). SRPA (94.19%) and PCA (94.10%) follow closely, while SSEP yields the lowest OA (93.22%) under EDA. The narrow spread across methods (less than 2% range) indicates that Salinas is a relatively easy dataset for all evaluated strategies.

The

Δ

values are uniformly small (within

\pm 0.5

%), indicating that EDA preprocessing has negligible impact on the best-achievable performance for this dataset. This behavior is consistent with Salinas having comparatively clean spectral characteristics and limited noisy band contamination, reducing the marginal benefit of explicit noise removal and normalization.

3.2.4. Botswana (Table 5)

Botswana demonstrates strong classification performance across all techniques. DRL achieves the highest OA under EDA (

95.15 \pm 1.54

%) using a Top-10 3D-CNN configuration, representing a notable margin over the next best method, the no-band selection baseline (92.10%). SRPA, PCA, and SSEP are closely clustered between 90–91%, with classical SVM classifiers dominating for these methods.

The

Δ

values for most methods are close to zero (within

\pm 0.31

%), indicating that EDA has minimal impact on the best-achievable performance for Botswana. This behavior is consistent with the dataset’s relatively clean spectral signatures and strong class separability. DRL shows a positive EDA gain (

Δ OA = + 1.36

%), suggesting that noise normalization modestly benefits the reinforcement learning agent’s band selection on this scene. SRPA shows a negative

Δ OA

(

- 1.94

%), attributable to run-to-run variance (std 4.31% under EDA vs. 2.93% without).

3.2.5. Kennedy Space Center (Table 6)

For the KSC dataset, DRL achieves the highest OA under EDA (

94.11 \pm 1.55

%) using a Top-10 3D-CNN configuration, followed by SRPA (

92.97 \pm 0.78

%) and the no-band selection baseline (92.20%). PCA (88.30%) and SSEP (85.81%) trail substantially, with SSEP yielding the lowest OA on this dataset.

The

Δ

values are predominantly negative across methods, indicating that EDA does not improve and in some cases slightly reduces the best-achievable performance on KSC. The most notable case is PCA, which achieves a higher OA under No-EDA using a 3D-CNN configuration (

93.53 \pm 0.80

%) than under EDA with RF (

88.30

%), a difference attributable to the change in the optimal classifier rather than EDA alone. DRL shows a small positive

Δ OA

(

+ 0.40

%), the only method to benefit from EDA on this dataset. These mixed results indicate that the impact of EDA on KSC is technique-dependent rather than uniformly beneficial.

3.2.6. Montana UAV VNIR (Table 7)

The Montana UAV VNIR dataset exhibits fundamentally different behavior compared to the benchmark datasets. All methods achieve substantially lower OA (51–58% under EDA versus 87–99% on benchmarks), and the standard deviations are markedly higher (up to

\pm 5.73

%), reflecting the reduced stability across runs due to limited labeled samples, class imbalance, and increased spectral noise from UAS acquisition conditions.

Under EDA, DRL achieves the highest OA (

58.19 \pm 4.97

%) using Top-15 3D-CNN, followed by SRPA (

57.09 \pm 5.73

%), SSEP (

55.53 \pm 2.60

%), and PCA (

55.95 \pm 4.50

%). However, these margins are small relative to standard deviations and should be interpreted with caution. Unlike the benchmark datasets, EDA provides positive

Δ

OA for four of five methods on Montana (DRL:

+ 3.28

%, SRPA:

+ 4.25

%, PCA:

+ 2.61

%, and SSEP:

+ 1.49

%), suggesting that noise removal and normalization are beneficial for this noisy real-world dataset. However, the high variability across runs indicates that these gains are not statistically robust, and EDA alone cannot compensate for insufficient labeled data or highly variable spectral distributions. Macro-F1 scores remain low (38–52% under EDA), confirming difficulty in discriminating among the five closely related forest species and condition classes.

3.3. Cross-Dataset Trends (Table 8)

Table 8 summarizes performance aggregated across all six datasets by averaging the per-dataset best-configuration mean OA, Kappa, and macro-F1 for each method. Although absolute performance differs substantially across datasets (reflected in the large standard deviations of ±15–17%), consistent method-level trends are apparent.

DRL achieves the highest aggregated OA under both EDA (

88.41 \pm 15.11

%) and No-EDA (

87.79 \pm 16.34

%) conditions, confirming its robustness across diverse hyperspectral scenes. SRPA and the no-band selection baseline perform similarly (≈85%) and rank second under EDA and No-EDA respectively, while PCA (≈84%) and SSEP (≈82%) trail. The large cross-dataset standard deviations reflect the substantial performance gap between the Montana UAV dataset (≈52–58%) and the benchmark datasets (≈87–99%), rather than high method instability within any single dataset.

Comparing EDA and No-EDA aggregated results, the differences are small for all methods (within 0.6% OA), suggesting that while EDA is beneficial for specific datasets and methods, its effect on aggregated cross-dataset performance is modest. DRL shows the largest positive EDA contribution (

+ 0.62

% aggregated OA), while EDA provides a marginally positive advantage for SSEP (

+ 0.32

% aggregated OA). These observations support the conclusion that EDA is a safe and generally non-degrading preprocessing step, with dataset-dependent rather than universal benefits.

3.4. Summary of Results

Across Table 2, Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8, several consistent findings emerge. First, DRL-based band selection achieves the highest OA on all six datasets under at least one preprocessing condition, confirming its effectiveness for task-driven spectral subset selection. Second, SRPA consistently ranks second on three of six datasets under EDA (Pavia University, KSC, and Montana), while the no-band selection baseline ranks second on Indian Pines, Salinas, and Botswana, demonstrating strong and stable performance as an attention-guided alternative. Third, SSEP and PCA yield lower and less stable performance, with SSEP in particular yielding the lowest OA on Indian Pines and KSC, and PCA performing poorly on Indian Pines relative to even the baseline. Fourth, the impact of EDA is strongly dataset-dependent: it provides consistent benefits on Pavia University and Montana, minimal impact on Salinas and Botswana, and mixed or negative effects on Indian Pines and KSC. Finally, the Montana dataset consistently underperforms relative to all benchmarks, with higher variance and a lower macro-F1, underscoring the challenges of real-world UAV hyperspectral classification under limited supervision.

3.5. Selected Wavelength Analysis for the Montana UAV Dataset

To address the spectral interpretability of the evaluated band-selection methods, Table 9 reports the wavelength ranges and dominant spectral regions selected by each method at its best-performing Top-K configuration for the Montana UAV VNIR dataset.

3.5.1. EDA Condition

Under EDA preprocessing, the four methods exhibit markedly different spectral selection strategies, reflecting their underlying algorithmic objectives. SSEP selects almost exclusively NIR bands (764–932 nm; 18 of 20 bands), consistent with its edge-preservation criterion: in this forest scene, class boundaries between live conifers, dead biomass, and dead Ponderosa Pine are sharpest in the NIR plateau, where canopy structural differences are most pronounced [29]. PCA concentrates on two clusters: Blue-Green (488–504 nm; six bands) and NIR (760–824 nm; twelve bands), reflecting variance maximization rather than class discriminability—high-variance wavelengths dominate regardless of their utility for species separation. DRL selects a more diverse set spanning Red (four bands; 636–696 nm), NIR (four bands; 801–916 nm), Green (five bands), and Red Edge (two bands; 707–742 nm), consistent with its task-driven optimization: by explicitly maximizing downstream classification accuracy, DRL identifies complementary spectral regions rather than concentrating on a single high-variance zone. SRPA produces the broadest selection (488–929 nm; 30 bands), distributing across NIR (ten), Green (6), Red (four), Red Edge (three), Blue (five), and Far NIR (two), reflecting its redundancy-penalization design which actively avoids correlated bands.

Across all four EDA methods, the NIR region of approximately 795–830 nm is consistently selected, representing the only wavelength range chosen by all four methods independently. This consensus band cluster aligns with the NIR reflectance plateau of vegetation, which is strongly modulated by leaf area index, canopy density, and cell structure—properties that differ substantially between live conifers (Douglas Fir, Ponderosa Pine, and Western Larch) and dead biomass classes in this post-fire environment [29].

3.5.2. No-EDA Condition

Under No-EDA conditions, the selection patterns shift considerably for some methods. DRL No-EDA (Top-10) retains a similar Red Edge and NIR focus (526–879 nm), suggesting that the most discriminative spectral structure for forest species is preserved even without preprocessing. SSEP No-EDA remains exclusively NIR (762–896 nm), confirming that the edge-based criterion consistently identifies NIR boundaries regardless of normalization. PCA No-EDA (Top-50) shifts heavily toward Blue wavelengths (400–430 nm; 23 bands), selecting the highest-variance region of the raw uncorrected cube—these short-wavelength bands exhibit large radiometric variance due to atmospheric scattering and sensor noise rather than vegetation signal, illustrating the limitation of variance-based selection on uncorrected data.

The contrast between PCA EDA (NIR-dominant) and PCA No-EDA (Blue-dominant) directly demonstrates the importance of EDA preprocessing for variance-based methods: without normalization, PCA is misled by noise-dominated spectral regions at the wavelength extremes.

The complete band-score CSV files for all methods, datasets, and conditions—including all 40 experimental run files and per-band scores for all six datasets—are available in the project GitHub repository at https://github.com/BMW-lab-MSU/hyperspectral-feature-selection-prescribed-fires/ (accessed on 20 December 2025).

3.6. Clustering-Based Band Selection (KMCBS)

Table 10 presents the classification performance of the K-Means Clustering-based band-selection method across all six datasets. The 3D-CNN classifier consistently achieves the highest OA across all datasets, confirming the trend observed for the other four band-selection methods in Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7. On benchmark datasets, KMCBS achieves competitive performance:

95.55 \pm 0.51 %

OA on Salinas,

95.90 \pm 1.32 %

on Botswana, and

94.68 \pm 0.71 %

on KSC. These results are generally within 1–3% of the best-performing learning-based methods (DRL and SRPA), demonstrating that clustering-based selection provides a strong unsupervised baseline. On Indian Pines and Pavia University, KMCBS achieves

85.86 %

and

96.96 %

OA respectively, trailing DRL by approximately 3% on both datasets. This gap is consistent with findings in the prior literature [18,19] suggesting that learning-based methods better capture non-linear spectral structure in heterogeneous scenes. On the Montana UAV VNIR dataset, KMCBS achieves

53.60 %

OA with 3D-CNN, comparable to DRL (

51.32 %

) and above the statistical classifiers, confirming that clustering-based selection retains discriminative spectral information even in challenging real-world conditions with class imbalance and sparse labels.

Across all datasets, RF achieves competitive OA relative to SVM and KNN, while KNN consistently underperforms—a trend consistent with the other band-selection methods. The optimal Top-K value varies by dataset and classifier, with smaller subsets (Top-5 to Top-15) performing best for 3D-CNN and larger subsets (Top-40 to Top-50) preferred by RF and SVM, reflecting the different spatial context exploitation of patch-based versus pixel-based classifiers.

Comparing KMCBS against the four band-selection strategies evaluated in Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7, several notable patterns emerge. On Salinas, Botswana, and KSC, KMCBS achieves the highest OA among all methods, with

95.55 %

,

95.90 %

, and

94.68 %

respectively—marginally outperforming DRL by

+ 0.53 %

,

+ 0.75 %

, and

+ 0.57 %

. This demonstrates that centroid-based clustering is sufficient to identify the most discriminative band subsets in scenes with moderate spectral complexity. On Indian Pines and Pavia University, DRL outperforms KMCBS by

3.37 %

and

1.79 %

OA respectively, reflecting the advantage of sequential learning-based selection in spectrally heterogeneous scenes where joint band interaction matters more than individual band representativeness. On the Montana UAV VNIR dataset, KMCBS (

53.60 %

) trails DRL (

58.19 %

) by

4.59 %

, which is the largest gap across all datasets—consistent with the noisy, imbalanced nature of the UAV acquisition where spectral variability is high and labeled samples are sparse. Overall, KMCBS ranks first among all methods on three of six datasets and remains within

3.4 %

OA of the best-performing method on all others, confirming that clustering-based band selection constitutes a competitive and computationally lightweight alternative to learning-based strategies, particularly on well-structured benchmark datasets.

3.7. Patch Size Sensitivity Analysis

To assess the sensitivity of the 3D-CNN classifier to spatial context scale, we evaluated patch sizes of

3 \times 3

,

5 \times 5

, and

7 \times 7

using the DRL band selector at its best-performing Top-K configuration on three representative datasets: Indian Pines, Pavia University, and Montana UAV VNIR. The results are reported as the mean ± standard deviation over five independent runs in Table 11.

On Indian Pines, performance increases modestly across patch sizes (

79.78 % \to 81.37 %

OA), though the larger

7 \times 7

patch exhibits notably higher run-to-run variance (

\pm 7.90 %

), suggesting that

5 \times 5

offers a more stable trade-off for this spectrally complex scene. On Pavia University, a consistent and stable improvement is observed with increasing patch size (

93.86 % \to 97.23 %

OA), reflecting the benefit of larger spatial context in this densely annotated urban scene. The Montana UAV VNIR dataset shows the strongest sensitivity to patch size: OA improves from

47.95 %

(

3 \times 3

) to

55.70 %

(

7 \times 7

), with macro-F1 increasing from

32.65 %

to

52.81 %

, indicating that larger patches capture more meaningful spatial structure in fire-affected forest environments where spectral contrast between classes is subtle and distributed. These results confirm that patch size is a dataset-dependent hyperparameter:

5 \times 5

remains a competitive and stable default across benchmarks, while larger patches (

7 \times 7

) provide consistent gains on spatially complex or real-world datasets such as Montana.

3.8. Ablation Study

To validate the contribution of core algorithmic components in SRPA and DRL, we conducted an ablation study comparing each method against a degraded variant with a key module removed. For SRPA, we compared the full method (

λ = 0.3

) against an attention-only variant with the redundancy penalty removed (

λ = 0

). For DRL, we compared sequential band selection against random band selection of the same Top-K subset size, serving as a lower-bound baseline. All experiments used the best-performing classifier and Top-K configuration from Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7, EDA preprocessing, and five independent runs. The results are reported in Table 12.

3.8.1. DRL Sequential Selection vs. Random Baseline

Across all six datasets, sequential DRL selection consistently outperforms random band selection, confirming that the policy network learns meaningful selection strategies beyond chance. Gains are most pronounced on Indian Pines (

+ 2.29 %

OA), Botswana (

+ 2.01 %

OA), and KSC (

+ 1.14 %

OA). On Pavia University, DRL achieves

98.55 \pm 0.53 %

OA compared to

97.70 \pm 0.64 %

for random selection. On the Montana UAV VNIR dataset, DRL provides a consistent gain (

+ 0.88 %

OA) even under limited supervision and class imbalance, confirming that learned sequential selection retains value in real-world noisy conditions.

3.8.2. SRPA Redundancy Penalty

The effect of the redundancy penalty (

λ = 0.3

) is dataset-dependent. On Pavia University and Botswana, the full SRPA method outperforms the attention-only variant by

+ 1.54 %

and

+ 1.99 %

OA respectively, confirming that redundancy penalization is beneficial when spectral bands exhibit strong inter-correlation. On KSC, the difference is negligible (

- 0.04 %

), indicating near-equivalent performance. However, on Indian Pines, removing the penalty yields notably higher OA (

85.54 %

vs.

77.26 %

), and on Salinas and Montana the attention-only variant performs marginally better. These results suggest that the SE block attention scores are strongly discriminative on most datasets, and the redundancy penalty may over-regularize in scenes where inter-band correlation is lower or spectral structure is more complex. This finding motivates future investigation into adaptive

λ

selection strategies tailored to dataset-specific spectral correlation structure.

3.9. Hyperparameter Sensitivity Analysis

To address the rationality of fixed hyperparameter choices in SSEP and SRPA, we conducted a systematic sensitivity analysis varying the Gaussian smoothing parameter

σ

in SSEP over {0.5, 1.0, 1.5, 2.0} and the redundancy penalty coefficient

λ

in SRPA over {0.1, 0.2, 0.3, 0.5, 0.7}. For each configuration, the best-performing classifier and Top-K setting from Table 2, Table 3, Table 4, Table 5, Table 6 and Table 7 were used under EDA preprocessing, with five independent runs per configuration. The results are reported in Table 13 and Table 14. As discussed in Section 2.9, the DRL reward function was implemented without a compactness penalty term, as the band budget K is controlled externally via the Top-K parameter; accordingly, DRL hyperparameter sensitivity is not applicable.

3.9.1. SSEP $σ$ Sensitivity

The optimal

σ

value is dataset-dependent. On Pavia University, Salinas, and Botswana,

σ = 1.5

yields the highest OA (

96.65 %

,

94.29 %

, and

92.86 %

respectively), while

σ = 2.0

performs best on Indian Pines (

78.54 %

) and KSC (

86.04 %

), and

σ = 0.5

is marginally optimal on Montana (

50.53 %

). The chosen default

σ = 1.0

is near-optimal on KSC (

Δ = - 0.09 %

) and Montana (

Δ = - 0.61 %

), but is suboptimal on Indian Pines (

Δ = - 7.13 %

), Pavia University (

Δ = - 8.29 %

), Salinas (

Δ = - 1.02 %

), and Botswana (

Δ = - 3.32 %

). These results indicate that

σ = 1.0

is a reasonable dataset-agnostic default for low-complexity datasets but that larger smoothing values better preserve class-boundary structure in spectrally complex or densely annotated scenes. The high variance observed at

σ = 0.5

on Pavia University (

\pm 10.01 %

) further confirms that aggressive smoothing destabilizes edge detection in high-resolution urban imagery.

3.9.2. SRPA $λ$ Sensitivity

The redundancy penalty coefficient

λ

shows stable behavior on three of six datasets: KSC and Montana both achieve their best OA at

λ = 0.3

(

Δ = 0.00 %

), and Salinas is near-optimal (

Δ = - 0.64 %

). However,

λ = 0.3

is suboptimal on Indian Pines (

Δ = - 7.40 %

; best at

λ = 0.1

), Pavia University (

Δ = - 1.41 %

; best at

λ = 0.5

), and Botswana (

Δ = - 1.17 %

; best at

λ = 0.2

). The monotonically decreasing trend on Indian Pines suggests that stronger redundancy penalisation progressively degrades performance, consistent with the ablation study finding in Section 3.7. In contrast, Pavia University benefits from stronger penalisation (

λ = 0.5

), reflecting higher inter-band correlation in this densely annotated urban scene.

3.9.3. Summary

Both

σ = 1.0

and

λ = 0.3

were selected as dataset-agnostic defaults consistent with the prior literature [50,51], and perform competitively on lower-complexity datasets. The sensitivity analysis reveals that these values are suboptimal for spectrally complex benchmark scenes, motivating future work on adaptive parameter selection strategies that account for dataset-specific spectral correlation structure and edge characteristics.

3.10. Class Imbalance Analysis

The Montana UAV VNIR dataset exhibits severe class imbalance, with Dead Ponderosa Pine comprising only 30 labeled pixels and Ponderosa Pine 212 pixels out of 761 total labeled samples. To assess the impact of imbalance-handling strategies on minority class recognition, we compare four approaches: baseline classification with no imbalance handling, class-weighted loss (RF and 3D-CNN), SMOTE synthetic oversampling (RF), and focal loss (3D-CNN) [62]. All experiments use DRL Top-15 band selection with EDA preprocessing and five independent runs. Per-class F1 scores are reported alongside OA and Kappa to assess minority class recognition directly.

SMOTE oversampling could not be applied in this experiment due to insufficient training samples for the most minority class after the 70/30 stratified split—Dead Ponderosa Pine yields approximately 21 training samples, which, combined with label encoding producing a sixth background class with zero samples, caused SMOTE to fail consistently. This outcome itself reflects a fundamental limitation of the Montana dataset for synthetic oversampling approaches.

The results are presented in Table 15. Overall accuracy remains stable across all strategies, ranging from

47.60 %

to

51.93 %

, confirming that imbalance handling does not substantially alter global classification performance on this dataset. The 3D-CNN classifier consistently outperforms RF across all strategies due to its ability to exploit spatial context through patch-based learning. Among 3D-CNN variants, focal loss achieves the highest OA (

51.93 %

) and macro F1 (

43.37 %

), with a marginal improvement in Dead Ponderosa Pine F1 (

41.55 %

) over the baseline (

41.05 %

). Class-weighted 3D-CNN slightly degrades Dead Ponderosa Pine F1 (

36.79 %

) relative to the baseline, suggesting that inverse-frequency weighting over-suppresses majority class features that are also diagnostic for minority class boundaries in this spectrally complex scene.

Notably, Ponderosa Pine achieves

0.00 %

F1 across all strategies and all runs. With only 212 total pixels and high spectral similarity to Douglas Fir, this class cannot be reliably learned from the available labeled samples regardless of imbalance strategy. This finding underscores a fundamental data limitation rather than a methodological shortcoming, and highlights the need for additional field data collection targeting this species in future campaigns.

4. Discussion

The results presented in this study highlight several important insights regarding hyperspectral band selection, preprocessing strategies, and model behavior across both benchmark and real-world datasets. Rather than focusing solely on absolute performance, this discussion interprets the observed trends in the context of dataset characteristics, model complexity, and the role of exploratory data analysis (EDA).

4.1. Effectiveness of Learning-Based Band Selection

Across all six evaluated datasets, learning-based band-selection methods—particularly Deep Reinforcement Learning (DRL)—achieved the highest classification performance. This consistent dominance suggests that DRL is effective at jointly optimizing spectral relevance and downstream classification performance, especially when sufficient labeled data and relatively stable spectral signatures are available. Unlike classical techniques such as PCA, which prioritize variance preservation, DRL explicitly optimizes task-specific objectives, leading to improved class discrimination as reflected by higher macro-F1 and Kappa scores.

The attention-based SRPA method ranked second on three of six datasets under EDA (Pavia University, KSC, and Montana), while the no-band selection baseline ranked second on the remaining three (Indian Pines, Salinas, and Botswana), demonstrating a strong balance between performance and stability. Compared to DRL, SRPA appears less sensitive to dataset size and noise, which may explain its competitive behavior even in more challenging settings. These observations support the growing body of literature advocating task-aware and learning-based band-selection methods over purely statistical approaches.

4.2. Limitations of Variance-Based Selection

SSEP yielded the lowest OA on four of six datasets (Indian Pines, Salinas, Botswana, and KSC), while PCA consistently underperformed learning-based methods, confirming that variance maximization alone is insufficient for hyperspectral classification. High-variance bands do not necessarily correspond to class-discriminative features, particularly in datasets with class imbalance or overlapping spectral signatures. The consistently poor performance of PCA across both EDA and No-EDA settings reinforces the need for supervised or task-driven band-selection strategies in practical hyperspectral applications.

It should also be noted that PCA is, strictly speaking, a feature extraction method rather than a true band-selection method: its output components are linear combinations of all input bands and do not correspond to specific measurable wavelengths. This limits the physical interpretability of PCA-selected features in spectroscopic applications, where the identity of selected wavelengths carries domain-specific meaning—for example, the Red Edge (∼700–730 nm) for vegetation stress or the NIR plateau (∼750–900 nm) [38] for canopy structure. In contrast, SSEP, SRPA, and DRL all select original spectral bands, preserving this interpretability and enabling the wavelength analysis presented in Section 3.5. PCA’s inclusion here is therefore as a variance-based baseline rather than a competing band-selection strategy, and its consistently lower classification performance relative to the supervised methods further underscores the insufficiency of variance maximization alone as a criterion for spectral subset selection in hyperspectral classification tasks.

4.3. Role of Exploratory Data Analysis

The impact of EDA varied across datasets and methods. On Pavia University, EDA provided consistent gains for SSEP (+1.96%) and SRPA (+0.73%), likely arising from improved feature normalization in this densely annotated urban scene. On Indian Pines, however, EDA effects were mixed or slightly negative for most methods (DRL: −0.46%; SRPA: −2.60%), suggesting that the spectral complexity of this scene does not benefit uniformly from the applied preprocessing. Notably, EDA provided the clearest and most consistent gains on the Montana UAV dataset, where noise removal and normalization improved OA for four of five methods (SRPA: +4.25%, DRL: +3.28%, PCA: +2.61%, and SSEP: +1.49%), though high run-to-run variance limits the statistical reliability of these gains.

In contrast, datasets with cleaner spectral characteristics, such as Botswana and Salinas, showed minimal performance differences between EDA and No-EDA. For most methods and datasets, EDA did not substantially degrade performance. The most notable exception is PCA on KSC, where EDA reduced OA by 5.23%, attributable to a change in the optimal classifier between conditions (No-EDA favors 3D-CNN at 93.53% versus EDA favoring RF at 88.30%) rather than a fundamental failure of preprocessing, indicating that it is generally a safe preprocessing step, though its benefits may be marginal when data quality is already high.

4.4. Challenges of Real-World UAV Hyperspectral Data

The Montana UAV VNIR dataset exhibited fundamentally different behavior compared to benchmark datasets. The overall classification performance was substantially lower, and learning-based methods did not consistently outperform simpler baseline or classical models. This divergence can be attributed to several real-world challenges, including limited labeled samples, increased spectral noise, class imbalance, and domain-specific variability introduced by UAV acquisition conditions.

In this setting, EDA provided numerical gains for four of five methods (SRPA: +4.25%, DRL: +3.28%, PCA: +2.61%, and SSEP: +1.49%); however, the high run-to-run variability (standard deviations up to ±5.73%) indicates these gains are not statistically robust, suggesting that preprocessing alone cannot compensate for insufficient supervision or highly variable data distributions. These findings underscore an important limitation of deep and reinforcement-based band-selection methods: while powerful, they remain sensitive to data quality and training signal strength.

4.5. Implications for Practical Deployment

Taken together, the results suggest that the choice of band-selection strategy should be guided by dataset characteristics rather than assumed universally optimal. Learning-based methods such as DRL and SRPA are highly effective on well-annotated benchmark datasets and structured scenes, but simpler methods may remain competitive in real-world scenarios with limited labels. The dataset-dependent impact of EDA further emphasizes the need for adaptive preprocessing pipelines rather than fixed workflows.

4.6. Clustering-Based Band Selection

The inclusion of KMCBS as a clustering-based baseline provides important context for interpreting the performance of the four primary band-selection strategies. The consistently small gap between KMCBS and DRL (1–

3 %

OA) suggests that spectral clustering captures a substantial portion of the discriminative information available in the selected band subsets. The advantage of learning-based methods such as DRL is most pronounced on spectrally heterogeneous datasets like Indian Pines, where the sequential selection policy learns to prioritise bands that jointly maximise classification accuracy rather than selecting individually representative bands. On simpler datasets such as Salinas and Botswana, the distinction between clustering- and learning-based selection diminishes, suggesting that spectral redundancy is the dominant factor in those scenes and that centroid-based selection is sufficient to capture it. These findings are consistent with prior comparative studies [18,19] which report competitive clustering performance on benchmark HSI datasets.

4.7. Class Imbalance in Real-World UAV Datasets

The class imbalance analysis on the Montana UAV VNIR dataset reveals that standard imbalance-handling strategies provide only marginal improvements in overall accuracy, though focal loss consistently achieves the highest macro F1 and Dead Ponderosa Pine F1 among all strategies evaluated. The persistent failure to classify Ponderosa Pine—with

0.00 %

F1 across all strategies—highlights a fundamental limitation beyond algorithmic design: with only 212 labeled pixels and high spectral overlap with Douglas Fir, no imbalance strategy can compensate for insufficient training data. These findings reinforce the need for targeted field data collection in future prescribed fire campaigns and suggest that active learning or semi-supervised approaches may be more appropriate than post hoc resampling when labeled samples are critically scarce.

4.8. Contextualisation Against Prior Work

To situate our results within the broader band selection literature, we compare the OA achieved by our best-performing method on Indian Pines and Pavia University against closely related prior work that uses the same datasets and similar supervised or learning-based band-selection strategies. On Indian Pines, our DRL implementation achieves 89.23% OA (Top-10, 3D-CNN, and a 70/30 split). For reference, Cai et al. [51] report

70.61 \pm 2.56

% OA (BS-Net-Conv, 25 bands, SVM, and 5% training) on the same dataset, while Mou et al. [25] report

78.26 \pm 1.54

% OA (DRL, 30 bands, SVM-RBF, and 10% training). On Pavia University, our DRL achieves

98.75 \pm 0.36

% OA (Top-15; 3D-CNN). Cai et al. [51] report 89.29% OA (BS-Net-FC, 15 bands, SVM, and 5% training) on this dataset, while Mou et al. [25] report

92.96 \pm 0.58

% OA (DRL, 30 bands, SVM-RBF, and 10% training). Direct numerical comparison across these studies should be interpreted cautiously, as our 70/30 stratified split provides substantially more training supervision than the 10% per-class protocol used in most of the literature, which inflates absolute OA. The consistent method-level trend—DRL outperforming statistical and attention-based methods—holds across both protocols, supporting the generalizability of the findings. No prior band selection study has evaluated methods on a post-fire UAV VNIR dataset of the type introduced here; the Montana results therefore represent a novel contribution without a direct literature analogue.

4.9. Future Research Directions

Future work will focus on improving the robustness of learning-based band-selection methods for real-world UAV hyperspectral data. Promising directions include incorporating label-efficient or self-supervised learning strategies, leveraging spatial context more effectively, and designing hybrid approaches that combine the stability of classical methods with the adaptability of learning-based models. Extending the evaluation framework to additional sensing modalities and ecological monitoring tasks will further support the generalization of these findings.

5. Conclusions

This work presented a comprehensive and reproducible evaluation of hyperspectral band-selection techniques across six datasets, including widely used benchmark scenes and a real-world UAV-based VNIR dataset. Consistent with the objectives stated in the Abstract, we systematically compared classical, attention-based, and reinforcement learning-based band-selection methods under both EDA and No-EDA settings using a unified experimental protocol.

By aggregating the results over five independent runs and selecting representative configurations based on the mean performance and stability, we ensured a fair and transparent comparison across techniques. The results demonstrate that learning-based band-selection methods consistently outperform classical approaches on benchmark datasets. In particular, Deep Reinforcement Learning (DRL) achieved the highest overall accuracy on all six datasets under EDA conditions, and the highest or joint-highest on all six under No-EDA, confirming its robustness across diverse hyperspectral scenes. The attention-based SRPA method also showed strong and stable performance, frequently ranking second and consistently outperforming PCA-based selection.

The effect of exploratory data analysis (EDA) was found to be strongly dataset-dependent. EDA improved best-achievable performance most consistently on Pavia University, where SSEP and SRPA showed positive

Δ

OA values (+1.96% and +0.73% respectively), and on Montana, where four of five methods benefited from noise removal and normalization. On Indian Pines, EDA effects were mixed, with DRL and SRPA showing small negative

Δ

OA values (−0.46% and −2.60%), while on KSC, PCA showed a notably larger negative

Δ

OA (−5.23%), attributable to a classifier change rather than preprocessing failure. On cleaner datasets such as Botswana and Salinas, the impact of EDA was marginal but non-degrading. These findings support the Abstract’s claim that preprocessing can enhance performance, but should not be assumed to be universally beneficial.

In contrast to benchmark datasets, the Montana UAV VNIR dataset exhibited substantially lower performance across all methods, with higher variability across runs. In this real-world setting, learning-based band-selection methods did not consistently outperform simpler baseline or classical approaches. EDA provided numerical OA gains for four of five methods on Montana (SRPA: +4.25%, DRL: +3.28%, PCA: +2.61%, and SSEP: +1.49%), but the high run-to-run variability across runs (standard deviations up to ±5.73%) indicates that these gains are not statistically robust, and preprocessing alone cannot compensate for insufficient labeled data or highly variable spectral distributions. This observation highlights the limitations of deep and reinforcement-based methods when applied to noisy data with limited labeled samples, and directly supports the Abstract’s emphasis on the challenges of real-world hyperspectral analysis.

Overall, this study confirms that learning-based band selection—particularly DRL—offers significant advantages for hyperspectral image classification when data quality and supervision are sufficient. At the same time, the observed variability across datasets underscores the importance of aligning band-selection strategies, preprocessing choices, and model complexity with dataset characteristics. Future work will focus on improving robustness for real-world UAV hyperspectral data, incorporating label-efficient and self-supervised learning strategies, and extending the proposed evaluation framework to additional sensing modalities and ecological applications. The inclusion of a clustering-based band selection baseline (KMCBS) demonstrated that K-Means centroid selection achieves competitive performance within 1–

3 %

OA of the best learning-based methods, confirming the value of both classical and modern approaches to spectral dimensionality reduction. Class imbalance analysis on the Montana dataset demonstrated that focal loss provides the most consistent minority class improvements, though the critically limited labeled samples for Ponderosa Pine highlight the need for expanded field data collection in future prescribed fire monitoring campaigns.

Author Contributions

Methodology, M.I.K.; investigation, M.I.K.; resources, M.I.K. and E.M.G.; writing—original draft preparation, M.I.K.; writing—review and editing, M.I.K.; visualization, M.I.K., E.M.G.; data collection, M.U.M. and X.Z.; and supervision, B.M.W. All authors have read and agreed to the published version of the manuscript.

Funding

This material is based upon work supported in part by the National Science Foundation EPSCoR Cooperative Agreement OIA-2242802. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Data Availability Statement

The hyperspectral datasets, complete experimental pipeline, and source code associated with this study are publicly available via Zenodo and the project GitHub repository at the following links https://doi.org/10.5281/zenodo.18796586 (accessed on 20 December 2025) and https://github.com/BMW-lab-MSU/hyperspectral-feature-selection-prescribed-fires/ (accessed on 20 December 2025).

Acknowledgments

Computational efforts were performed on the Tempest High Performance Computing System, operated and supported by University Information Technology Research Cyberinfrastructure (RRID:SCR 026229) at Montana State University.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

HSI	Hyperspectral Imaging
VNIR	Visible–Near Infrared
SWIR	Short-Wave Infrared
UAV	Unmanned Aerial Vehicle
UAS	Unmanned Aircraft System
EDA	Exploratory Data Analysis
GT	Ground Truth
PCA	Principal Component Analysis
SSEP	Spatial–Spectral Edge Preservation
SRPA	Spectral-Redundancy Penalized Attention
DRL	Deep Reinforcement Learning
DQN	Deep Q-Network
MDP	Markov Decision Process
RF	Random Forest
SVM	Support Vector Machine
KNN	K-Nearest Neighbors
3D-CNN	Three-Dimensional Convolutional Neural Network
OA	Overall Accuracy
F1	F1-Score
SNR	Signal-to-Noise Ratio
RTK	Real-Time Kinematic
GNSS	Global Navigation Satellite System
IMU	Inertial Measurement Unit
SE	Squeeze-and-Excitation
KMCBS	K-Means Clustering-Based Band Selection

References

Sun, W.; Du, Q. Hyperspectral band selection: A review. IEEE Geosci. Remote Sens. Mag. 2019, 7, 118–139. [Google Scholar] [CrossRef]
Sawant, S.S.; Prabukumar, M. A survey of band-selection techniques for hyperspectral image classification. J. Spectr. Imaging 2020, 9, a5. [Google Scholar] [CrossRef]
Patro, R.N.; Subudhi, S.; Biswal, P.K.; Dell’acqua, F. A review of unsupervised band-selection techniques: Land cover classification for hyperspectral earth observation data. IEEE Geosci. Remote Sens. Mag. 2021, 9, 72–111. [Google Scholar] [CrossRef]
Lou, C.; Al-qaness, M.A.; AL-Alimi, D.; Dahou, A.; Abd Elaziz, M.; Abualigah, L.; Ewees, A.A. Land use/land cover (LULC) classification using hyperspectral images: A review. Geo-Spat. Inf. Sci. 2025, 28, 345–386. [Google Scholar] [CrossRef]
Tan, Y.; Gu, J.; Lu, L.; Zhang, L.; Huang, J.; Pan, L.; Lv, Y.; Wang, Y.; Chen, Y. Hyperspectral band selection for crop identification and mapping of agriculture. Remote Sens. 2025, 17, 663. [Google Scholar] [CrossRef]
Sharma, N.A.; Kumar, K.; Chand, R.R.; Kabir, M.A. Utilizing hyperspectral imaging with machine learning techniques for soil analysis. In Computational Intelligence Based Hyperspectral Image Analysis and Applications; Springer: Cham, Switzerland, 2025; Volume 2, pp. 117–143. [Google Scholar]
Yang, C.; Guo, Z.; Fernandes Barbin, D.; Dai, Z.; Watson, N.; Povey, M.; Zou, X. Hyperspectral Imaging and Deep Learning for Quality and Safety Inspection of Fruits and Vegetables: A Review. J. Agric. Food Chem. 2025, 73, 10019–10035. [Google Scholar] [CrossRef]
Tejasree, G.; Agilandeeswari, L. An extensive review of hyperspectral image classification and prediction: Techniques and challenges. Multimed. Tools Appl. 2024, 83, 80941–81038. [Google Scholar] [CrossRef]
Chutia, D.; Bhattacharyya, D.; Sarma, K.K.; Kalita, R.; Sudhakar, S. Hyperspectral remote sensing classifications: A perspective survey. Trans. GIS 2016, 20, 463–490. [Google Scholar] [CrossRef]
Harsanyi, J.C.; Chang, C.I. Hyperspectral image classification and dimensionality reduction: An orthogonal subspace projection approach. IEEE Trans. Geosci. Remote Sens. 1994, 32, 779–785. [Google Scholar] [CrossRef]
Lunga, D.; Prasad, S.; Crawford, M.M.; Ersoy, O. Manifold-learning-based feature extraction for classification of hyperspectral data: A review of advances in manifold learning. IEEE Signal Process. Mag. 2013, 31, 55–66. [Google Scholar] [CrossRef]
Miao, X.; Gong, P.; Swope, S.; Pu, R.; Carruthers, R.; Anderson, G.L. Detection of yellow starthistle through band selection and feature extraction from hyperspectral imagery. Photogramm. Eng. Remote Sens. 2007, 73, 1005–1015. [Google Scholar]
Dópido, I.; Villa, A.; Plaza, A.; Gamba, P. A quantitative and comparative assessment of unmixing-based feature extraction techniques for hyperspectral image classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2012, 5, 421–435. [Google Scholar] [CrossRef]
Sun, W.; Halevy, A.; Benedetto, J.J.; Czaja, W.; Liu, C.; Wu, H.; Shi, B.; Li, W. UL-Isomap based nonlinear dimensionality reduction for hyperspectral imagery classification. ISPRS J. Photogramm. Remote Sens. 2014, 89, 25–36. [Google Scholar] [CrossRef]
Bajcsy, P.; Groves, P. Methodology for hyperspectral band selection. Photogramm. Eng. Remote Sens. 2004, 70, 793–802. [Google Scholar] [CrossRef]
Sawant, S.S.; Manoharan, P.; Loganathan, A. Band selection strategies for hyperspectral image classification based on machine learning and artificial intelligent techniques–Survey. Arab. J. Geosci. 2021, 14, 646. [Google Scholar] [CrossRef]
MartÍnez-UsÓMartinez-Uso, A.; Pla, F.; Sotoca, J.M.; García-Sevilla, P. Clustering-based hyperspectral band selection using information measures. IEEE Trans. Geosci. Remote Sens. 2007, 45, 4158–4171. [Google Scholar] [CrossRef]
Wang, Q.; Li, Q.; Li, X. A fast neighborhood grouping method for hyperspectral band selection. IEEE Trans. Geosci. Remote Sens. 2020, 59, 5028–5039. [Google Scholar] [CrossRef]
Li, S.; Peng, B.; Fang, L.; Zhang, Q.; Cheng, L.; Li, Q. Hyperspectral band selection via difference between intergroups. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5503310. [Google Scholar] [CrossRef]
Li, S.; Liu, Z.; Fang, L.; Li, Q. Structural Graph Learning Method for Hyperspectral Band Selection. Int. J. Remote Sens. 2024, 45, 6719–6743. [Google Scholar] [CrossRef]
Lorenzo, P.R.; Tulczyjew, L.; Marcinkiewicz, M.; Nalepa, J. Hyperspectral band selection using attention-based convolutional neural networks. IEEE Access 2020, 8, 42384–42403. [Google Scholar] [CrossRef]
Liu, Y.; Jiang, S.; Liu, Y.; Mu, C. Spatial feature enhancement and attention-guided bidirectional sequential spectral feature extraction for hyperspectral image classification. Remote Sens. 2024, 16, 3124. [Google Scholar] [CrossRef]
Zhao, X.; Ma, J.; Wang, L.; Zhang, Z.; Ding, Y.; Xiao, X. A review of hyperspectral image classification based on graph neural networks. Artif. Intell. Rev. 2025, 58, 172. [Google Scholar] [CrossRef]
Feng, J.; Li, D.; Gu, J.; Cao, X.; Shang, R.; Zhang, X.; Jiao, L. Deep reinforcement learning for semisupervised hyperspectral band selection. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5501719. [Google Scholar] [CrossRef]
Mou, L.; Saha, S.; Hua, Y.; Bovolo, F.; Bruzzone, L.; Zhu, X.X. Deep reinforcement learning for band selection in hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5504414. [Google Scholar] [CrossRef]
Guo, Y.; Wang, Q.; Hu, B.; Qian, X.; Ye, H. Two-Stage Unsupervised Hyperspectral Band Selection Based on Deep Reinforcement Learning. Remote Sens. 2025, 17, 586. [Google Scholar] [CrossRef]
Wang, M.; Zhang, H.; Yin, B.; Chen, M.; Liu, W.; Ye, Z. An adaptive evolutionary-reinforcement learning algorithm for hyperspectral band selection. Expert Syst. Appl. 2024, 251, 123937. [Google Scholar] [CrossRef]
Landgrebe, D. Hyperspectral image data analysis. IEEE Signal Process. Mag. 2002, 19, 17–28. [Google Scholar] [CrossRef]
Jaime, X.A.; Angerer, J.P.; Yang, C.; Tolleson, D.R.; Fuhlendorf, S.D.; Wu, X.B. Effects of Prescribed Fire on Spatial Patterns of Plant Functional Traits and Spectral Diversity Using Hyperspectral Imagery from Savannah Landscapes on the Edwards Plateau of Texas, USA. Remote Sens. 2025, 17, 3873. [Google Scholar] [CrossRef]
Mambile, C.; Leo, J.; Kaijage, S. Deep Learning Models for Forest Fire Prediction: Insights into Feature Selection for Climate-Resilient Forestry. J. Sustain. For. 2026, 45, 1–30. [Google Scholar] [CrossRef]
Frontline Wildfire Defense. Live Montana Fire Map and Tracker|Montana Wildfire Statistics. 2025. Available online: https://www.frontlinewildfire.com/montana-wildfire-map/ (accessed on 10 January 2026).
McWethy, D. As Wildfires Increase Across the West, Preparedness Is Essential. Bozeman Daily Chronicle, Guest Column, 2025. Available online: https://www.bozemandailychronicle.com/opinions/guest_columnists/dave-mcwethy-as-wildfires-increase-across-the-west-preparedness-is-essential/article_ddc6af24-e99d-11ef-a951-2fd1c3b47315.html (accessed on 10 January 2026).
Montana Department of Natural Resources and Conservation (DNRC). Looking Back on Montana’s 2025 Wildfire Season. 2025. Available online: https://www.kpax.com/news/firewatch/looking-back-on-montanas-2025-wildfire-season (accessed on 10 January 2026).
Montana Department of Natural Resources and Conservation. Current Fire Information. 2025. Available online: https://www.mtfireinfo.org/pages/current-fire-info (accessed on 10 January 2026).
USDA Forest Service, Northern Region. Helena-Lewis and Clark National Forest Prescribed Fire Plan and Related Projects. Technical Report, USDA Forest Service. Forestwide Plan for 40,000 Acres of Prescribed Burns Annually Through 2045 to Reduce Fuels and Restore Fire Regimes. 2025. Available online: https://www.mtpr.org/montana-news/2025-07-25/forest-service-plans-expansive-prescribed-fire-project-in-montana (accessed on 10 January 2026).
Montana Department of Natural Resources and Conservation (DNRC). State and Private Forestry Fact Sheet Montana 2025. 2025. Available online: https://apps.fs.usda.gov/nicportal/temppdf/sfs/naweb/MT_std.pdf (accessed on 10 January 2026).
USDA Forest Service, Northern Region. Prescribed Burns in 2023: Northern Region Accomplishments; Technical Report; USDA Forest Service: Washington, DC, USA, 2023. [Google Scholar]
Hennessy, A.; Clarke, K.; Lewis, M. Hyperspectral classification of plants: A review of waveband selection generalisability. Remote Sens. 2020, 12, 113. [Google Scholar] [CrossRef]
Magalhães, A.H.; Magalhães, H.A.; Yehia, H.C. Hyperspectral Image Synthesis from RGB Images Applied to Wildfire Detection. In Computational Intelligence Based Hyperspectral Image Analysis and Applications; Springer: Cham, Switzerland, 2025; Volume 2, pp. 65–96. [Google Scholar]
Mani, A.; Chen, X.; Gorbachev, S.; Yan, J.; Dixit, A.; Sun, Y.; Yan, Z.; Wu, J.; Deng, J.; Jiang, X.; et al. A Comprehensive Hyperspectral Image Dataset for Forest Fire Detection and Classification. Sci. Data 2025, 13, 92. [Google Scholar] [CrossRef]
Cheng, M.F.; Mukundan, A.; Karmakar, R.; Valappil, M.A.E.; Jouhar, J.; Wang, H.C. Modern Trends and Recent Applications of Hyperspectral Imaging: A Review. Technologies 2025, 13, 170. [Google Scholar] [CrossRef]
Baek, S.; Kim, W. Review on hyperspectral remote sensing of tidal zones. Ocean Sci. J. 2025, 60, 3. [Google Scholar] [CrossRef]
Karankot, M.I.; Whitaker, B.M.; Zhou, X.; Masood, M.U. Attention and edge-aware band selection for efficient hyperspectral classification of burned vegetation. In Proceedings of the 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP); IEEE: Piscataway, NJ, USA, 2025; pp. 1–6. [Google Scholar]
Karankot, M.I.; Glenn, E.M.; Whitaker, B.M. Hyperspectral band selection via self-supervised and reinforcement learning for prescribed burn impact analysis. In Proceedings of the SPIE Future Sensing Technologies 2025; SPIE: Singapore, 2025; Volume 13710, pp. 133–140. [Google Scholar]
Liu, J.; Lan, J.; Zeng, Y.; Luo, W.; Zhuang, Z.; Zou, J. Explainability Feature Bands Adaptive Selection for Hyperspectral Image Classification. Remote Sens. 2025, 17, 1620. [Google Scholar] [CrossRef]
Graña, M.; Veganzones, M.A.; Ayerdi, B. Hyperspectral Remote Sensing Scenes. Available online: https://www.ehu.eus/ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes (accessed on 5 September 2025).
Li, S.; Song, W.; Fang, L.; Chen, Y.; Ghamisi, P.; Benediktsson, J.A. Deep Learning for Hyperspectral Image Classification: An Overview. IEEE Trans. Geosci. Remote Sens. 2019, 57, 6690–6709. [Google Scholar] [CrossRef]
McQueen, J.B. Some methods of classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability; SPIE: Singapore, 1967; pp. 281–297. [Google Scholar]
Mounika, K.; Aravind, K.; Yamini, M.; Navyasri, P.; Dash, S.; Suryanarayana, V. Hyperspectral image classification using SVM with PCA. In Proceedings of the 2021 6th International Conference on Signal Processing, Computing and Control (ISPCC); IEEE: Piscataway, NJ, USA, 2021; pp. 470–475. [Google Scholar]
Torres, R.M.; Yuen, P.W.; Yuan, C.; Piper, J.; McCullough, C.; Godfree, P. Spatial spectral band selection for enhanced hyperspectral remote sensing classification applications. J. Imaging 2020, 6, 87. [Google Scholar] [CrossRef]
Cai, Y.; Liu, X.; Cai, Z. BS-Nets: An End-to-End Framework for Band Selection of Hyperspectral Image. IEEE Trans. Geosci. Remote Sens. 2020, 58, 1969–1984. [Google Scholar] [CrossRef]
Melgani, F.; Bruzzone, L. Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans. Geosci. Remote Sens. 2004, 42, 1778–1790. [Google Scholar] [CrossRef]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Ham, J.; Chen, Y.; Crawford, M.; Ghosh, J. Investigation of the random forest framework for classification of hyperspectral data. IEEE Trans. Geosci. Remote Sens. 2005, 43, 492–501. [Google Scholar] [CrossRef]
Cover, T.; Hart, P. Nearest neighbor pattern classification. IEEE Trans. Inf. Theor. 2006, 13, 21–27. [Google Scholar] [CrossRef]
Tu, B.; Wang, J.; Kang, X.; Zhang, G.; Ou, X.; Guo, L. KNN-based representation of superpixels for hyperspectral image classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 4032–4047. [Google Scholar] [CrossRef]
Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
Camps-Valls, G.; Bruzzone, L. Kernel-based methods for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 2005, 43, 1351–1362. [Google Scholar] [CrossRef]
Goodfellow, I.; Bengio, Y.; Courville, A.; Bengio, Y. Deep Learning; MIT Press: Cambridge, UK, 2016; Volume 1. [Google Scholar]
Chen, Y.; Lin, Z.; Zhao, X.; Wang, G.; Gu, Y. Deep Learning-Based Classification of Hyperspectral Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 2094–2107. [Google Scholar] [CrossRef]
Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision; IEEE: Piscataway, NJ, USA, 2017; pp. 2980–2988. [Google Scholar]
Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
Cohen, J. A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
Sokolova, M.; Lapalme, G. A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 2009, 45, 427–437. [Google Scholar] [CrossRef]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Piscataway, NJ, USA, 2016; pp. 770–778. [Google Scholar] [CrossRef]
Veraverbeke, S.; Dennison, P.; Gitas, I.; Hulley, G.; Kalashnikova, O.; Katagis, T.; Kuai, L.; Meng, R.; Roberts, D.; Stavros, N. Hyperspectral remote sensing of fire: State-of-the-art and future perspectives. Remote Sens. Environ. 2018, 216, 105–121. [Google Scholar] [CrossRef]

Figure 1. Unified workflow for hyperspectral image classification, illustrating data preparation, exploratory data analysis, band selection, and classification stages applied consistently across all datasets.

Figure 2. Pseudo-RGB composite (R = 150, G = 100, and B = 50) with ground-truth labels overlaid, used to validate geometric alignment between field-derived annotations and the hyperspectral mosaic.

Figure 3. Per-band zero fraction across all 273 VNIR channels. High zero-fraction in bands 1–40 and 240–273 indicates significant sensor noise and low detector responsivity.

Figure 4. Per-band standard deviation for all VNIR channels. Bands 130–240 exhibit the strongest signal variability, while low and high wavelengths show weak or noisy responses.

Table 1. Summary of hyperspectral datasets used in this study.

Dataset	Sensor	Spatial Dim. (H × W)	Bands (Orig./Cleaned)	Classes	Spatial Res. (m)
Indian Pines	AVIRIS	145 × 145	224/200	16	20
Pavia University	ROSIS	610 × 340	103/103	9	1.3
Salinas	AVIRIS	512 × 217	224/204	16	3.7
Botswana	Hyperion	1476 × 256	242/145	14	30
KSC	AVIRIS	512 × 614	224/176	13	18

Table 2. Classification performance with and without exploratory data analysis (EDA) on Indian Pines. The results are reported as the mean ± standard deviation over five independent runs. The best OA per condition is shown in bold.

Δ

= EDA − No-EDA.

Table 2. Classification performance with and without exploratory data analysis (EDA) on Indian Pines. The results are reported as the mean ± standard deviation over five independent runs. The best OA per condition is shown in bold.

Δ

= EDA − No-EDA.

Technique	Configuration	OA (%)	Kappa	F1 (%)
With EDA
No Band Selection	All\|SVM	$87.97 \pm 0.00$	$0.862 \pm 0.000$	$87.61 \pm 0.00$
PCA	Top 50\|RF	$77.59 \pm 0.00$	$0.742 \pm 0.000$	$67.48 \pm 0.00$
SSEP	Top 50\|RF	$72.26 \pm 0.00$	$0.678 \pm 0.000$	$58.69 \pm 0.00$
SRPA	Top 50\|RF	$77.11 \pm 0.00$	$0.736 \pm 0.000$	$73.13 \pm 0.00$
DRL	Top 10\|3D-CNN	$89.23 \pm 2.29$	$0.877 \pm 0.026$	$90.28 \pm 2.04$
Without EDA and Performance Difference (Δ = EDA − No-EDA)
No Band Selection	All\|SVM	$88.00 \pm 0.00$	$0.862 \pm 0.000$	$88.63 \pm 0.00$
PCA	Top 50\|RF	$77.76 \pm 0.00$	$0.743 \pm 0.000$	$66.81 \pm 0.00$
SSEP	Top 50\|RF	$72.16 \pm 0.00$	$0.677 \pm 0.000$	$59.13 \pm 0.00$
SRPA	Top 50\|RF	$79.71 \pm 0.00$	$0.766 \pm 0.000$	$76.26 \pm 0.00$
DRL	Top 5\|3D-CNN	$89.69 \pm 1.29$	$0.882 \pm 0.014$	$90.33 \pm 0.83$
ΔOA/ΔKappa/ΔF1
No Band Selection		$- 0.03$ / $- 0.000$ / $- 1.01$
PCA		$- 0.16$ / $- 0.001$ / $+ 0.67$
SSEP		$+ 0.10$ / $+ 0.001$ / $- 0.44$
SRPA		$- 2.60$ / $- 0.030$ / $- 3.13$
DRL		$- 0.46$ / $- 0.005$ / $- 0.04$

Table 3. Classification performance with and without EDA on Pavia University. The results are mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Table 3. Classification performance with and without EDA on Pavia University. The results are mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Technique	Configuration	OA (%)	Kappa	F1 (%)
With EDA
No Band Selection	All\|SVM	$94.00 \pm 0.00$	$0.920 \pm 0.000$	$92.44 \pm 0.00$
PCA	Top 30\|3D-CNN	$96.77 \pm 0.70$	$0.957 \pm 0.009$	$95.66 \pm 1.02$
SSEP	Top 30\|3D-CNN	$96.52 \pm 0.45$	$0.954 \pm 0.006$	$95.92 \pm 0.51$
SRPA	Top 30\|3D-CNN	$97.68 \pm 0.69$	$0.969 \pm 0.009$	$97.13 \pm 0.37$
DRL	Top 15\|3D-CNN	$98.75 \pm 0.36$	$0.984 \pm 0.005$	$98.46 \pm 0.18$
Without EDA and Performance Difference (Δ = EDA − No-EDA)
No Band Selection	All\|SVM	$94.21 \pm 0.00$	$0.923 \pm 0.000$	$92.68 \pm 0.00$
PCA	Top 45\|3D-CNN	$97.25 \pm 0.53$	$0.964 \pm 0.007$	$96.10 \pm 0.53$
SSEP	Top 40\|3D-CNN	$94.56 \pm 0.54$	$0.929 \pm 0.007$	$93.31 \pm 1.84$
SRPA	Top 25\|3D-CNN	$96.95 \pm 1.40$	$0.960 \pm 0.018$	$96.81 \pm 0.91$
DRL	Top 15\|3D-CNN	$99.03 \pm 0.19$	$0.987 \pm 0.003$	$98.87 \pm 0.20$
ΔOA/ΔKappa/ΔF1
No Band Selection		$- 0.21$ / $- 0.003$ / $- 0.24$
PCA		$- 0.48$ / $- 0.006$ / $- 0.44$
SSEP		$+ 1.96$ / $+ 0.026$ / $+ 2.61$
SRPA		$+ 0.73$ / $+ 0.009$ / $+ 0.32$
DRL		$- 0.28$ / $- 0.004$ / $- 0.41$

Table 4. Classification performance with and without EDA on Salinas. The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Table 4. Classification performance with and without EDA on Salinas. The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Technique	Configuration	OA (%)	Kappa	F1 (%)
With EDA
No Band Selection	All\|RF	$94.98 \pm 0.00$	$0.944 \pm 0.000$	$97.45 \pm 0.00$
PCA	Top 30\|3D-CNN	$94.10 \pm 1.35$	$0.934 \pm 0.015$	$97.11 \pm 0.69$
SSEP	Top 50\|RF	$93.22 \pm 0.00$	$0.924 \pm 0.000$	$96.24 \pm 0.00$
SRPA	Top 45\|RF	$94.19 \pm 0.00$	$0.935 \pm 0.000$	$96.94 \pm 0.00$
DRL	Top 50\|RF	$95.02 \pm 0.00$	$0.944 \pm 0.000$	$97.48 \pm 0.00$
Without EDA and Performance Difference (Δ = EDA − No-EDA)
No Band Selection	All\|RF	$95.02 \pm 0.00$	$0.944 \pm 0.000$	$97.50 \pm 0.00$
PCA	Top 50\|RF	$93.97 \pm 0.00$	$0.933 \pm 0.000$	$96.73 \pm 0.00$
SSEP	Top 50\|RF	$93.23 \pm 0.00$	$0.924 \pm 0.000$	$96.25 \pm 0.00$
SRPA	Top 50\|RF	$94.30 \pm 0.00$	$0.937 \pm 0.000$	$96.80 \pm 0.00$
DRL	Top 10\|3D-CNN	$95.51 \pm 0.93$	$0.950 \pm 0.011$	$97.78 \pm 0.42$
ΔOA/ΔKappa/ΔF1
No Band Selection		$- 0.04$ / $- 0.000$ / $- 0.05$
PCA		$+ 0.13$ / $+ 0.001$ / $+ 0.39$
SSEP		$- 0.01$ / $- 0.000$ / $- 0.01$
SRPA		$- 0.11$ / $- 0.001$ / $+ 0.14$
DRL		$- 0.49$ / $- 0.005$ / $- 0.30$

Table 5. Classification performance with and without EDA on Botswana. The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Table 5. Classification performance with and without EDA on Botswana. The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Technique	Configuration	OA (%)	Kappa	F1 (%)
With EDA
No Band Selection	All\|SVM	$92.10 \pm 0.00$	$0.914 \pm 0.000$	$92.96 \pm 0.00$
PCA	Top 50\|SVM	$91.18 \pm 0.00$	$0.904 \pm 0.000$	$92.35 \pm 0.00$
SSEP	Top 40\|SVM	$90.15 \pm 0.00$	$0.893 \pm 0.000$	$90.99 \pm 0.00$
SRPA	Top 15\|3D-CNN	$91.42 \pm 4.31$	$0.907 \pm 0.047$	$91.67 \pm 4.38$
DRL	Top 10\|3D-CNN	$95.15 \pm 1.54$	$0.947 \pm 0.017$	$95.53 \pm 1.42$
Without EDA and Performance Difference (Δ = EDA − No-EDA)
No Band Selection	All\|SVM	$92.10 \pm 0.00$	$0.914 \pm 0.000$	$92.96 \pm 0.00$
PCA	Top 50\|SVM	$90.87 \pm 0.00$	$0.901 \pm 0.000$	$91.96 \pm 0.00$
SSEP	Top 50\|SVM	$90.26 \pm 0.00$	$0.894 \pm 0.000$	$91.20 \pm 0.00$
SRPA	Top 10\|3D-CNN	$93.35 \pm 2.93$	$0.928 \pm 0.032$	$93.82 \pm 2.82$
DRL	Top 15\|3D-CNN	$93.78 \pm 6.00$	$0.933 \pm 0.065$	$94.06 \pm 5.93$
ΔOA/ΔKappa/ΔF1
No Band Selection		$0.00$ / $0.000$ / $- 0.01$
PCA		$+ 0.31$ / $+ 0.003$ / $+ 0.39$
SSEP		$- 0.10$ / $- 0.001$ / $- 0.21$
SRPA		$- 1.94$ / $- 0.021$ / $- 2.15$
DRL		$+ 1.36$ / $+ 0.015$ / $+ 1.47$

Table 6. Classification performance with and without EDA on Kennedy Space Center (KSC). The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Table 6. Classification performance with and without EDA on Kennedy Space Center (KSC). The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA.

Technique	Configuration	OA (%)	Kappa	F1 (%)
With EDA
No Band Selection	All\|RF	$92.20 \pm 0.00$	$0.913 \pm 0.000$	$88.37 \pm 0.00$
PCA	Top 30\|RF	$88.30 \pm 0.00$	$0.870 \pm 0.000$	$81.46 \pm 0.00$
SSEP	Top 45\|RF	$85.81 \pm 0.00$	$0.842 \pm 0.000$	$80.05 \pm 0.00$
SRPA	Top 10\|3D-CNN	$92.97 \pm 0.78$	$0.922 \pm 0.009$	$87.33 \pm 1.66$
DRL	Top 10\|3D-CNN	$94.11 \pm 1.55$	$0.934 \pm 0.017$	$90.11 \pm 2.51$
Without EDA and Performance Difference (Δ = EDA − No-EDA)
No Band Selection	All\|RF	$92.97 \pm 0.00$	$0.922 \pm 0.000$	$88.97 \pm 0.00$
PCA	Top 20\|3D-CNN	$93.53 \pm 0.80$	$0.928 \pm 0.009$	$87.95 \pm 1.47$
SSEP	Top 45\|RF	$86.06 \pm 0.00$	$0.844 \pm 0.000$	$80.43 \pm 0.00$
SRPA	Top 20\|3D-CNN	$94.11 \pm 0.78$	$0.934 \pm 0.009$	$89.73 \pm 2.33$
DRL	Top 5\|3D-CNN	$93.71 \pm 1.24$	$0.930 \pm 0.014$	$88.89 \pm 2.87$
ΔOA/ΔKappa/ΔF1
No Band Selection		$- 0.77$ / $- 0.009$ / $- 0.60$
PCA		$- 5.23$ / $- 0.058$ / $- 6.48$
SSEP		$- 0.26$ / $- 0.003$ / $- 0.38$
SRPA		$- 1.13$ / $- 0.013$ / $- 2.39$
DRL		$+ 0.40$ / $+ 0.004$ / $+ 1.22$

Table 7. Classification performance with and without EDA on Montana UAV VNIR. The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA. High standard deviations reflect limited labeled samples, class imbalance, and spectral noise inherent to UAS acquisition.

Table 7. Classification performance with and without EDA on Montana UAV VNIR. The results are the mean ± std over five runs. The best OA per condition is in bold.

Δ

= EDA − No-EDA. High standard deviations reflect limited labeled samples, class imbalance, and spectral noise inherent to UAS acquisition.

Technique	Configuration	OA (%)	Kappa	F1 (%)
With EDA
No Band Selection	All\|RF	$51.09 \pm 0.00$	$0.291 \pm 0.000$	$37.59 \pm 0.00$
PCA	Top 20\|3D-CNN	$55.95 \pm 4.50$	$0.355 \pm 0.066$	$45.56 \pm 9.11$
SSEP	Top 50\|3D-CNN	$55.53 \pm 2.60$	$0.348 \pm 0.038$	$44.99 \pm 6.55$
SRPA	Top 30\|3D-CNN	$57.09 \pm 5.73$	$0.374 \pm 0.081$	$47.53 \pm 10.72$
DRL	Top 15\|3D-CNN	$58.19 \pm 4.97$	$0.392 \pm 0.069$	$51.96 \pm 9.82$
Without EDA and Performance Difference (Δ = EDA − No-EDA)
No Band Selection	All\|3D-CNN	$53.68 \pm 1.34$	$0.317 \pm 0.020$	$39.17 \pm 2.37$
PCA	Top 50\|3D-CNN	$53.33 \pm 2.12$	$0.315 \pm 0.029$	$40.28 \pm 2.69$
SSEP	Top 25\|3D-CNN	$54.04 \pm 1.90$	$0.324 \pm 0.024$	$39.70 \pm 1.91$
SRPA	Top 40\|RF	$52.84 \pm 0.00$	$0.312 \pm 0.000$	$39.67 \pm 0.00$
DRL	Top 20\|3D-CNN	$54.91 \pm 0.95$	$0.337 \pm 0.015$	$45.44 \pm 3.34$
ΔOA/ΔKappa/ΔF1
No Band Selection		$- 2.59$ / $- 0.026$ / $- 1.59$
PCA		$+ 2.61$ / $+ 0.040$ / $+ 5.28$
SSEP		$+ 1.49$ / $+ 0.024$ / $+ 5.29$
SRPA		$+ 4.25$ / $+ 0.063$ / $+ 7.87$
DRL		$+ 3.28$ / $+ 0.055$ / $+ 6.52$

Table 8. Cross-dataset aggregated performance comparison with and without EDA. For each band selection technique, the per-dataset best-configuration mean OA, Kappa, and macro-F1 are averaged across all six datasets. The reported values are the mean ± std computed over the six per-dataset values, reflecting cross-dataset variability rather than run-to-run variance. The best aggregated OA per condition is in bold.

Technique	OA (%)	Kappa	F1 (%)
With EDA (Aggregated Mean across 6 Datasets)
No Band Selection	$85.70 \pm 16.21$	$0.808 \pm 0.253$	$81.16 \pm 26.23$
PCA	$83.98 \pm 15.25$	$0.794 \pm 0.228$	$79.94 \pm 20.18$
SSEP	$82.39 \pm 15.29$	$0.772 \pm 0.232$	$76.06 \pm 24.78$
SRPA	$85.08 \pm 15.44$	$0.807 \pm 0.227$	$82.29 \pm 19.19$
DRL	$88.41 \pm 15.11$	$0.846 \pm 0.225$	$87.30 \pm 17.67$
Without EDA (Aggregated Mean across 6 Datasets)
No Band Selection	$86.08 \pm 15.81$	$0.812 \pm 0.249$	$81.32 \pm 26.72$
PCA	$84.60 \pm 16.35$	$0.796 \pm 0.252$	$77.86 \pm 26.96$
SSEP	$82.07 \pm 15.06$	$0.768 \pm 0.231$	$75.12 \pm 25.77$
SRPA	$85.21 \pm 16.99$	$0.806 \pm 0.252$	$82.18 \pm 22.19$
DRL	$87.79 \pm 16.34$	$0.833 \pm 0.255$	$83.15 \pm 26.82$

Table 9. Selected spectral bands for the Montana UAV VNIR dataset under EDA and No-EDA conditions, for each band-selection method at its best-performing Top-K configuration (from Table 7). Wavelengths are derived from the 273-band VNIR cube spanning 400–1000 nm (≈2.2 nm per band). Band counts per spectral region are shown in parentheses. EDA-condition indices are mapped to original wavelengths by adding the 40-band EDA offset (488 nm start). ^† SRPA No-EDA scores are available only for the 400–625 nm range owing to a file truncation; complete scores are provided in the supplementary repository.

Method	Condition	Top-K	WL Range (nm)	Dominant Spectral Regions (Band Count)
With EDA
DRL	EDA	15	493–916	Green (5), Red (6), NIR (4)
PCA	EDA	20	488–824	Blue (6), Green (2), NIR (12)
SSEP	EDA	20	764–932	NIR (20)
SRPA	EDA	30	488–929	Blue (5), Green (6), Red (7), NIR (12)
Without EDA
DRL	No-EDA	10	526–879	Red (7), NIR (3)
PCA	No-EDA	50	400–804	Blue (23), Red(9), NIR (18)
SSEP	No-EDA	15	762–896	NIR (15)
SRPA	No-EDA	40	404–623	Blue (7), Green (10), Yellow-Orange (21), Red (2) ^†

Table 10. KMCBS classification results. Mean OA (%) ± std over five independent runs. Best Top-K per classifier per dataset shown.

Dataset	Classifier	Top-K	OA (%)	Kappa	F1 (%)
Indian Pines	RF	50	$84.87 \pm 0.56$	$0.826$	$83.10$
	SVM	50	$84.38 \pm 0.89$	$0.821$	$83.66$
	KNN	20	$73.67 \pm 0.42$	$0.698$	$64.58$
	3D-CNN	10	$85.86 \pm 1.99$	$0.839$	$83.49$
Pavia University	RF	40	$93.26 \pm 0.07$	$0.910$	$92.07$
	SVM	45	$94.22 \pm 0.13$	$0.923$	$92.76$
	KNN	40	$92.41 \pm 0.12$	$0.898$	$91.17$
	3D-CNN	10	$96.96 \pm 0.34$	$0.960$	$96.75$
Salinas	RF	40	$95.04 \pm 0.14$	$0.945$	$97.49$
	SVM	50	$91.99 \pm 0.12$	$0.911$	$95.66$
	KNN	5	$90.35 \pm 0.06$	$0.892$	$94.91$
	3D-CNN	15	$95.55 \pm 0.51$	$0.951$	$97.92$
Botswana	RF	45	$91.20 \pm 0.70$	$0.905$	$91.94$
	SVM	45	$92.72 \pm 0.76$	$0.921$	$93.51$
	KNN	45	$91.84 \pm 0.81$	$0.912$	$92.62$
	3D-CNN	10	$95.90 \pm 1.32$	$0.956$	$96.07$
KSC	RF	45	$91.99 \pm 0.90$	$0.911$	$88.08$
	SVM	50	$91.36 \pm 0.78$	$0.904$	$86.61$
	KNN	15	$90.24 \pm 1.08$	$0.891$	$85.29$
	3D-CNN	5	$94.68 \pm 0.71$	$0.941$	$90.60$
Montana UAV VNIR	RF	25	$48.03 \pm 1.68$	$0.233$	$33.79$
	SVM	30	$49.00 \pm 1.22$	$0.239$	$32.41$
	KNN	5	$47.07 \pm 1.34$	$0.220$	$34.78$
	3D-CNN	10	$53.60 \pm 3.06$	$0.319$	$46.78$

Bold indicates the best OA per dataset across all classifiers.

Table 11. Patch size sensitivity analysis. The results are the mean ± std over five independent runs using DRL band selection and 3D-CNN classifier under EDA preprocessing. The best OA per dataset is shown in bold.

Dataset	Patch Size	Top-K	OA (%)	Kappa	F1 (%)
Indian Pines	$3 \times 3$	10	$79.78 \pm 3.06$	$0.770 \pm 0.034$	$79.54 \pm 4.36$
	$5 \times 5$	10	$81.01 \pm 4.99$	$0.786 \pm 0.053$	$81.01 \pm 3.79$
	$7 \times 7$	10	$81.37 \pm 7.90$	$0.790 \pm 0.087$	$84.68 \pm 5.07$
Pavia University	$3 \times 3$	15	$93.86 \pm 2.82$	$0.918 \pm 0.039$	$94.07 \pm 2.57$
	$5 \times 5$	15	$95.42 \pm 2.90$	$0.939 \pm 0.040$	$95.09 \pm 2.44$
	$7 \times 7$	15	$97.23 \pm 2.93$	$0.963 \pm 0.040$	$97.04 \pm 2.42$
Montana UAV VNIR	$3 \times 3$	15	$47.95 \pm 2.00$	$0.231 \pm 0.033$	$32.65 \pm 5.27$
	$5 \times 5$	15	$52.54 \pm 1.99$	$0.302 \pm 0.029$	$42.74 \pm 2.25$
	$7 \times 7$	15	$55.70 \pm 2.37$	$0.356 \pm 0.039$	$52.81 \pm 2.98$

Table 12. Ablation study results. Mean ± std over five independent runs. The full method is compared against degraded variant with the core module removed. The best OA per dataset and method pair is shown in bold.

Dataset	Method	Variant	Classifier	Top-K	OA (%)	Kappa	F1 (%)
Indian Pines	SRPA	Full ( $λ = 0.3$ )	RF	50	$77.26 \pm 0.70$	$0.738$	$71.30$
	SRPA	No penalty ( $λ = 0$ )	RF	50	$85.54 \pm 0.73$	$0.834$	$83.29$
	DRL	Full sequential	3D-CNN	10	$88.03 \pm 2.12$	$0.864$	$89.24$
	DRL	Random selection	3D-CNN	10	$85.74 \pm 4.24$	$0.837$	$83.33$
Pavia University	SRPA	Full ( $λ = 0.3$ )	3D-CNN	30	$96.81 \pm 2.08$	$0.958$	$95.45$
	SRPA	No penalty ( $λ = 0$ )	3D-CNN	30	$95.27 \pm 3.10$	$0.939$	$95.10$
	DRL	Full sequential	3D-CNN	15	$98.55 \pm 0.53$	$0.981$	$98.10$
	DRL	Random selection	3D-CNN	15	$97.70 \pm 0.64$	$0.970$	$96.89$
Salinas	SRPA	Full ( $λ = 0.3$ )	RF	45	$94.12 \pm 0.11$	$0.934$	$96.87$
	SRPA	No penalty ( $λ = 0$ )	RF	45	$94.75 \pm 0.06$	$0.942$	$97.39$
	DRL	Full sequential	RF	50	$94.98 \pm 0.10$	$0.944$	$97.44$
	DRL	Random selection	RF	50	$94.74 \pm 0.11$	$0.941$	$97.35$
Botswana	SRPA	Full ( $λ = 0.3$ )	3D-CNN	15	$90.65 \pm 2.37$	$0.899$	$91.10$
	SRPA	No penalty ( $λ = 0$ )	3D-CNN	15	$88.66 \pm 3.18$	$0.877$	$89.07$
	DRL	Full sequential	3D-CNN	10	$94.83 \pm 1.37$	$0.944$	$95.02$
	DRL	Random selection	3D-CNN	10	$92.82 \pm 2.75$	$0.922$	$92.91$
KSC	SRPA	Full ( $λ = 0.3$ )	3D-CNN	10	$92.95 \pm 1.68$	$0.922$	$87.44$
	SRPA	No penalty ( $λ = 0$ )	3D-CNN	10	$92.99 \pm 0.75$	$0.922$	$87.02$
	DRL	Full sequential	3D-CNN	10	$93.61 \pm 1.56$	$0.929$	$87.29$
	DRL	Random selection	3D-CNN	10	$92.47 \pm 2.78$	$0.916$	$87.32$
Montana UAV VNIR	SRPA	Full ( $λ = 0.3$ )	3D-CNN	30	$49.82 \pm 2.09$	$0.252$	$37.97$
	SRPA	No penalty ( $λ = 0$ )	3D-CNN	30	$50.18 \pm 2.18$	$0.262$	$38.67$
	DRL	Full sequential	3D-CNN	15	$51.32 \pm 3.37$	$0.282$	$41.24$
	DRL	Random selection	3D-CNN	15	$50.44 \pm 3.14$	$0.277$	$41.45$

Table 13. SSEP

σ

sensitivity analysis. Mean OA (%) ± std over five independent runs. The best value per dataset shown in bold. † denotes the default value used in main experiments (

σ = 1.0

).

Table 13. SSEP

σ

sensitivity analysis. Mean OA (%) ± std over five independent runs. The best value per dataset shown in bold. † denotes the default value used in main experiments (

σ = 1.0

).

Dataset	$σ = 0.5$	$σ = {1.0}^{†}$	$σ = 1.5$	$σ = 2.0$
Indian Pines	$74.09 \pm 0.52$	$71.41 \pm 0.59$	$72.83 \pm 0.47$	$78.54 \pm 0.63$
Pavia University	$91.63 \pm 10.01$	$88.36 \pm 5.68$	$96.65 \pm 0.37$	$96.54 \pm 0.83$
Salinas	$93.55 \pm 0.06$	$93.27 \pm 0.09$	$94.29 \pm 0.06$	$93.51 \pm 0.12$
Botswana	$89.48 \pm 0.57$	$89.54 \pm 0.63$	$92.86 \pm 0.61$	$92.78 \pm 0.54$
KSC	$85.36 \pm 0.46$	$85.95 \pm 0.46$	$85.86 \pm 0.56$	$86.04 \pm 0.21$
Montana UAV	$50.53 \pm 3.84$	$49.91 \pm 2.46$	$49.56 \pm 2.43$	$49.56 \pm 2.59$

Table 14. SRPA

λ

sensitivity analysis. Mean OA (%) ± std over five independent runs. The best value per dataset is shown in bold. † denotes the default value used in main experiments (

λ = 0.3

).

Table 14. SRPA

λ

sensitivity analysis. Mean OA (%) ± std over five independent runs. The best value per dataset is shown in bold. † denotes the default value used in main experiments (

λ = 0.3

).

Dataset	$λ = 0.1$	$λ = 0.2$	$λ = {0.3}^{†}$	$λ = 0.5$	$λ = 0.7$
Indian Pines	$84.66 \pm 0.78$	$79.33 \pm 0.69$	$77.26 \pm 0.70$	$74.24 \pm 0.57$	$74.44 \pm 0.77$
Pavia University	$95.95 \pm 0.69$	$95.52 \pm 1.52$	$95.70 \pm 2.33$	$97.11 \pm 0.83$	$94.83 \pm 0.62$
Salinas	$94.76 \pm 0.05$	$94.26 \pm 0.11$	$94.12 \pm 0.11$	$93.88 \pm 0.16$	$93.65 \pm 0.17$
Botswana	$89.97 \pm 8.81$	$91.53 \pm 2.73$	$90.36 \pm 5.43$	$87.67 \pm 7.17$	$88.92 \pm 3.32$
KSC	$91.84 \pm 1.36$	$90.76 \pm 3.11$	$92.24 \pm 1.15$	$87.41 \pm 6.37$	$88.24 \pm 9.71$
Montana UAV	$50.26 \pm 1.06$	$49.47 \pm 2.50$	$50.88 \pm 1.64$	$50.09 \pm 1.99$	$50.35 \pm 2.64$

Table 15. Class imbalance analysis on Montana UAV VNIR dataset. Mean over five independent runs. DRL Top-15 bands; EDA condition. DB = Dead Biomass, DPP = Dead Ponderosa Pine, DF = Douglas Fir, PP = Ponderosa Pine, and WL = Western Larch.

Strategy	Classifier	OA (%)	Kappa	F1-Mac (%)	F1-DB (%)	F1-DPP (%)	F1-DF (%)	F1-PP (%)	F1-WL (%)
Baseline	RF	$47.69$	$0.231$	$34.12$	$67.21$	$11.30$	$19.80$	$0.00$	$30.66$
Class-weighted	RF	$47.60$	$0.229$	$34.27$	$66.94$	$14.49$	$17.90$	$0.00$	$32.20$
Baseline	3D-CNN	$51.23$	$0.281$	$42.04$	$67.88$	$41.05$	$18.25$	$0.00$	$35.94$
Class-weighted	3D-CNN	$51.32$	$0.296$	$43.18$	$68.70$	$36.79$	$30.09$	$0.00$	$27.28$
Focal loss	3D-CNN	$51.93$	$0.286$	$43.37$	$67.74$	$41.55$	$22.41$	$0.00$	$39.39$

Bold indicates the best OA per dataset across all classifiers.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Karankot, M.I.; Glenn, E.M.; Masood, M.U.; Zhou, X.; Whitaker, B.M. Hyperspectral Band Selection for Ground Fuel Classification for Prescribed Fires. Remote Sens. 2026, 18, 1440. https://doi.org/10.3390/rs18091440

AMA Style

Karankot MI, Glenn EM, Masood MU, Zhou X, Whitaker BM. Hyperspectral Band Selection for Ground Fuel Classification for Prescribed Fires. Remote Sensing. 2026; 18(9):1440. https://doi.org/10.3390/rs18091440

Chicago/Turabian Style

Karankot, Mahmad Isaq, Ethan M. Glenn, Muhammad Umer Masood, Xiaobing Zhou, and Bradley M. Whitaker. 2026. "Hyperspectral Band Selection for Ground Fuel Classification for Prescribed Fires" Remote Sensing 18, no. 9: 1440. https://doi.org/10.3390/rs18091440

APA Style

Karankot, M. I., Glenn, E. M., Masood, M. U., Zhou, X., & Whitaker, B. M. (2026). Hyperspectral Band Selection for Ground Fuel Classification for Prescribed Fires. Remote Sensing, 18(9), 1440. https://doi.org/10.3390/rs18091440

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Hyperspectral Band Selection for Ground Fuel Classification for Prescribed Fires

Highlights

Abstract

1. Introduction

2. Materials and Methods

2.1. Data

Montana

2.2. Exploratory Data Analysis

2.3. EDA for Benchmark Hyperspectral Datasets

2.4. EDA for the Montana UAV Dataset

2.4.1. Ground-Truth Construction and Alignment Verification

2.4.2. Class Distribution and Label Sparsity

2.4.3. Band-Wise Radiometric Quality Analysis

2.4.4. Radiometric Normalization and Clean Cube Creation

2.4.5. Class-Wise Spectral Signature Analysis

2.4.6. Montana EDA Insights

2.5. Feature Selection (Band Selection)

2.6. K-Means Clustering-Based Band Selection (KMCBS)

2.7. Principal Component Analysis (PCA)-Based Band Selection

2.8. Spatial–Spectral Edge Preservation (SSEP)

2.9. Spectral-Redundancy Penalized Attention Ranking (SRPA)

2.10. Deep Reinforcement Learning (DRL)-Based Band Selection

2.10.1. Markov Decision Process Formulation

2.10.2. Deep Q-Network Architecture

2.10.3. Training Procedure

2.10.4. Dataset-Specific Handling

2.10.5. Evaluation and Stability

2.11. Classification Models

2.11.1. Random Forest

2.11.2. KNN

2.11.3. SVM

2.11.4. 3D-CNN

2.12. Class Imbalance-Handling Strategies

2.13. Evaluation Metrics

2.14. Experimental Setup

3. Results

3.1. Result Aggregation and Best-Configuration Identification

3.2. Dataset-Specific Results

3.2.1. Indian Pines (Table 2)

3.2.2. Pavia University (Table 3)

3.2.3. Salinas (Table 4)

3.2.4. Botswana (Table 5)

3.2.5. Kennedy Space Center (Table 6)

3.2.6. Montana UAV VNIR (Table 7)

3.3. Cross-Dataset Trends (Table 8)

3.4. Summary of Results

3.5. Selected Wavelength Analysis for the Montana UAV Dataset

3.5.1. EDA Condition

3.5.2. No-EDA Condition

3.6. Clustering-Based Band Selection (KMCBS)

3.7. Patch Size Sensitivity Analysis

3.8. Ablation Study

3.8.1. DRL Sequential Selection vs. Random Baseline

3.8.2. SRPA Redundancy Penalty

3.9. Hyperparameter Sensitivity Analysis

3.9.1. SSEP σ Sensitivity

3.9.2. SRPA λ Sensitivity

3.9.3. Summary

3.10. Class Imbalance Analysis

4. Discussion

4.1. Effectiveness of Learning-Based Band Selection

4.2. Limitations of Variance-Based Selection

4.3. Role of Exploratory Data Analysis

4.4. Challenges of Real-World UAV Hyperspectral Data

4.5. Implications for Practical Deployment

4.6. Clustering-Based Band Selection

4.7. Class Imbalance in Real-World UAV Datasets

4.8. Contextualisation Against Prior Work

4.9. Future Research Directions

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

3.9.1. SSEP $σ$ Sensitivity

3.9.2. SRPA $λ$ Sensitivity