1. Introduction
Hyperspectral images are widely used in various of geographical analysis [
1,
2,
3]. They refer to remote sensing images with dozens or hundreds of band channels [
4,
5,
6]. Benefitting from the large number of bands, hyperspectral images can be used to detect subtle differences in ground features [
7]. For the same reason, hyperspectral images are widely used in geography [
8], medicine [
9] and mineral exploration [
10]. However, the large number of bands may sometimes lead to poor results [
11] and long processing times [
12,
13]. In this case, reducing hyperspectral image redundancy using dimensionality reduction techniques may be expected [
14,
15].
Hyperspectral image dimensionality reduction includes two categories of methods: feature extraction and feature selection [
16]. Feature extraction compresses the hyperspectral information into the first few bands through data transformation [
17]. Feature selection, also known as band selection, refers to selecting representative bands on the original dataset as the output bands using a particular method [
18]. Since band selection can preserve the original spectral and physical information without destroying the hyperspectral data, scholars have researched it extensively [
19,
20,
21].
When structural similarity (SSIM) was first proposed, it was applied to measure the similarity between the original image and the distorted image [
22]. Up to now, SSIM has been used in a diverse range of applications, including image classification, image matching and relative radiometric normalization. In 2011, Gao et al. [
23] employed CW-SSIM for image classification. In 2017, Renieblas et al. [
24] utilized SSIM for image quality assessments in radiological images. In 2021, Moghimi et al. [
25] conducted research on SSIM-based radiometric normalization for multitemporal satellite images.
Among all band selection methods, SSIM-based methods are popular because SSIM can better measure the similarity between bands. Initially, SSIM was used as an index to evaluate the selected bands after band selection in most situations and was not applied in the band selection process [
26,
27]. Jia et al. [
28] were the first to apply SSIM to band selection processing; they used SSIM as a measure to select the most representative bands from a hyperspectral dataset. In 2020, Ghorbanian et al. [
29] first used SSIM to structure an SSIM matrix between all the bands, then divided the dataset with the k-means method and finally selected the highest cumulative SSIM band in each. In 2021, Xu et al. [
30] proposed a similarity-based ranking structural similarity (SR-SSIM) band selection method, which was inspired by Rodriguez et al. [
31]. SR-SSIM uses SSIM to measure the similarity between bands, thereby calculating the similarity and dissimilarity of each band with other bands and selecting the required number of bands with both high similarity and high dissimilarity to the output bands.
The above band selection methods use SSIM to measure the similarity between bands better, but a problem still needs to be solved. The above methods calculate the SSIM of any two bands to form the SSIM matrix. As the number of bands in the dataset increases, the number of calculations presents a factorial upwards trend. Among them, through experiments, it has been found that SR-SSIM takes more than 99% of the time to calculate SSIM, and the increase in the number of calculations will undoubtedly increase the calculation time of the method.
Addressing the problem of the runtime of existing SSIM methods being too long, this paper uses the band subspace partition method to improve these SSIM methods. The band subspace partition divides the hyperspectral dataset into a certain number of subdatasets for which the intersection is an empty set and the union is a complete set according to the band number. Different subdatasets are each treated as independent datasets, and band selection is performed; calculating the SSIM between bands of different subdatasets is not necessary, thereby reducing the method operation time. According to the band subspace partition method, this paper proposes an improved band selection method to the SR-SSIM method, called enhance similarity-based ranking structural similarity (E-SR-SSIM). First, the improved band subspace partition method is used to dynamically divide the dataset into a corresponding number of subdatasets; second, the modified SR-SSIM method is used for all subdatasets, and the most representative band is selected from each subdataset. The contributions of this paper are as follows:
(1) This paper improves the SR-SSIM method. Compared with traditional SSIM methods calculating all the SSIM between bands, the improved SR-SSIM method reduces the number of SSIM calculations through the use of the band subspace partition method, thereby reducing the runtime.
(2) The band subspace partition method is improved and the partition points of adjacent subdatasets are dynamically adjusted. Compared with the original band subspace method, our proposed method overcomes the problem of ignoring the change in partition points and divides the band subspace more scientifically.
The contents of the rest of this article are as follows. In
Section 2, the method proposed in this paper and the experimental details are introduced. The method is composed of band subspace partition and subspace band selection. The experimental details include the used datasets, the experimental parameters and the evaluation indicators. In
Section 3, we demonstrate the classification effect and time efficiency of E-SR-SSIM and SR-SSIM under the four datasets. We analyse the reason why the band subspace partition affects the time efficiency. In
Section 4, we quantitatively analyse the reduction in SSIM in the band subspace partition from a mathematical perspective and validate the mathematical derivation through experiments. We discuss the classification performance of the improved band subspace partition method compared with the original method on four datasets and analyse the reasons. In
Section 5, we summarize this paper and identify the directions for improvement in the future.
5. Conclusions
This paper proposes a band selection method, E-SR-SSIM, based on band subspace partition to solve the problem of existing band selection methods calculating many SSIMs, resulting in an excessively long runtime. The method first divides the dataset into subdatasets and dynamically adjusts the partition points according to the situation for each subdataset. Then, the SR-SSIM method is used to modify the similarity threshold formula for each subdataset to select the most representative band. Using the RF classifier for supervised classification on the Indian Pines, Salinas and KSC public datasets, the following three conclusions can be drawn: (1) The classification effect of the E-SR-SSIM method is roughly the same as that of the SR-SSIM method. (2) Compared with SR-SSIM, the E-SR-SSIM method can effectively reduce the method’s runtime while ensuring the classification effect. (3) The improved band subspace partition method is compared with the original method. The classification effect can be improved slightly and can maintain specific stability.
Although the E-SR-SSIM method reduces the runtime of the method while ensuring the classification effect compared with the SR-SSIM method, some shortcomings still need improvement. First, due to the number of bands in a subdataset needing to be greater than or equal to 3, the E-SR-SSIM could not select any number of bands. In subsequent work, the band sub-space partition method of the E-SR-SSIM should be improved. The new band subspace partition method could select any number of bands easily. Second, when the dataset has lots of noise, such as the Botswana dataset, the E-SR-SSIM would compulsorily divide the dataset into many subdatasets, ignoring the situation of the dataset. As a result, the classification effect of the E-SR-SSIM is inferior to that of the SR-SSIM. In other words, E-SR-SSIM is susceptible to noise bands. In subsequent work, the E-SR-SSIM should enhance its robustness to noise bands. Third, how to pre-process the dataset without discarding the robustness of the method is the object of our future work. Fourth, E-SR-SSIM needs to be compared with some recently developed band selection methods to illustrate its advantages. Fifth, the influence of different numbers of training samples on the classification effect of E-SR-SSIM should be analysed in future work and E-SR-SSIM should be improved based on those experimental results.