Figure 1.
Structure of pyramid pooling module.
Figure 1.
Structure of pyramid pooling module.
Figure 2.
Structure of Inception module.
Figure 2.
Structure of Inception module.
Figure 3.
Structure of residual connection.
Figure 3.
Structure of residual connection.
Figure 4.
Schematic of dilated convolution.
Figure 4.
Schematic of dilated convolution.
Figure 5.
Schematic of spectral dilated convolution.
Figure 5.
Schematic of spectral dilated convolution.
Figure 6.
Structure of multiple spectral resolution module.
Figure 6.
Structure of multiple spectral resolution module.
Figure 7.
Schematic of Multiple Spectral Resolution 3D Convolutional Neural Network.
Figure 7.
Schematic of Multiple Spectral Resolution 3D Convolutional Neural Network.
Figure 8.
The training procedure of our method.
Figure 8.
The training procedure of our method.
Figure 9.
Changes of the overall accuracy of predictions on the four datasets, using 10% samples for training and different patch sizes.
Figure 9.
Changes of the overall accuracy of predictions on the four datasets, using 10% samples for training and different patch sizes.
Figure 10.
Results of the BO dataset using 10% samples for training and patch size.
Figure 10.
Results of the BO dataset using 10% samples for training and patch size.
Figure 11.
Results of the IP dataset using 10% samples for trainingand patch size.
Figure 11.
Results of the IP dataset using 10% samples for trainingand patch size.
Figure 12.
Results of the SA dataset using 10% samples for training and patch size.
Figure 12.
Results of the SA dataset using 10% samples for training and patch size.
Figure 13.
Results of the PU dataset using 10% samples for training and patch size.
Figure 13.
Results of the PU dataset using 10% samples for training and patch size.
Figure 14.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the BO dataset using 10% samples for training and patch size.
Figure 14.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the BO dataset using 10% samples for training and patch size.
Figure 15.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the IP dataset using 10% samples for training and patch size.
Figure 15.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the IP dataset using 10% samples for training and patch size.
Figure 16.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the SA dataset using 10% samples for training and patch size.
Figure 16.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the SA dataset using 10% samples for training and patch size.
Figure 17.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the PU dataset using 10% samples for training and patch size.
Figure 17.
The confusion matrix between the predicted results of MSR-3DCNN and ground truth on the PU dataset using 10% samples for training and patch size.
Table 1.
Summary of the characteristics of the IP, SA, PU, and BO datasets.
Table 1.
Summary of the characteristics of the IP, SA, PU, and BO datasets.
Description | Datasets |
---|
IP | SA | PU | BO |
---|
Sensor | AVIRIS | AVIRIS | ROSIS | Hyperion |
Pixels | | | | |
GSD | 20 m/pixel | 3.7 m/pixel | 1.3 m/pixel | 30 m/pixel |
Spectral range | 0.4 to 2.5 µm | 0.4 to 2.5 µm | 0.43 to 0.86 µm | 0.4 to 2.5 µm |
Original band | 224 | 224 | 103 | 242 |
Processed band | 200 | 204 | 103 | 145 |
Ground truth | 10,249 | 54,129 | 42,776 | 3248 |
Class | 16 | 16 | 9 | 14 |
Mode | Airborne | Airborne | Airborne | Satellite |
Table 2.
Overall Accuracy (%) of the predictions by contextual deep CNN, HybridSN, DBMA, Fast 3D CNN, MSRN, and MSR-3DCNN using different patch size and the fuse the prediction results of these predictions obtained by sum rule.
Table 2.
Overall Accuracy (%) of the predictions by contextual deep CNN, HybridSN, DBMA, Fast 3D CNN, MSRN, and MSR-3DCNN using different patch size and the fuse the prediction results of these predictions obtained by sum rule.
Datasets | Patch Size | Models |
---|
Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
---|
BO | | 67.37 | 97.26 | 97.30 | 98.50 | 95.42 | 98.80 |
IP | 60.65 | 96.41 | 98.03 | 97.04 | 95.83 | 98.10 |
SA | 99.82 | 99.65 | 99.91 | 99.88 | 99.94 | 99.96 |
PU | 97.60 | 99.28 | 99.58 | 99.37 | 99.84 | 99.62 |
BO | | 74.38 | 99.15 | 98.87 | 97.95 | 86.97 | 98.60 |
IP | 64.23 | 97.39 | 98.60 | 97.47 | 91.14 | 97.56 |
SA | 99.95 | 99.82 | 99.95 | 99.81 | 99.96 | 99.97 |
PU | 97.60 | 99.00 | 99.64 | 99.60 | 99.85 | 99.60 |
BO | | 79.21 | 99.59 | 99.08 | 97.95 | 91.93 | 99.15 |
IP | 72.44 | 97.77 | 97.08 | 98.51 | 97.21 | 97.30 |
SA | 99.95 | 99.79 | 99.91 | 99.93 | 99.92 | 99.99 |
PU | 98.38 | 99.60 | 99.54 | 99.19 | 99.84 | 99.72 |
BO | Fusion | 85.94 | 99.08 | 99.97 | 99.45 | 91.21 | 99.62 |
IP | 73.05 | 98.33 | 98.68 | 98.35 | 98.23 | 98.41 |
SA | 99.97 | 99.93 | 99.95 | 99.91 | 99.97 | 99.98 |
PU | 98.73 | 99.59 | 99.72 | 99.52 | 99.90 | 99.76 |
Table 3.
F1 Score (%) for Each Class, OA (%), AA (%), and Kappa Coefficient Values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the BO datasets.
Table 3.
F1 Score (%) for Each Class, OA (%), AA (%), and Kappa Coefficient Values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the BO datasets.
Class | Training | Test | Models |
---|
Total | 324 | 2924 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
1 | 27 | 243 | 83.48 | 100.00 | 99.38 | 99.18 | 99.79 | 99.38 |
2 | 10 | 91 | 62.82 | 100.00 | 98.38 | 100.00 | 94.30 | 99.45 |
3 | 25 | 226 | 93.05 | 99.12 | 100.00 | 98.91 | 100.00 | 100.00 |
4 | 21 | 194 | 95.31 | 93.27 | 100.00 | 98.98 | 99.48 | 99.74 |
5 | 27 | 242 | 54.95 | 92.48 | 99.17 | 95.91 | 96.03 | 95.54 |
6 | 27 | 242 | 45.82 | 91.86 | 99.59 | 97.73 | 94.17 | 96.45 |
7 | 26 | 233 | 78.87 | 100.00 | 99.79 | 99.36 | 100.00 | 100.00 |
8 | 20 | 183 | 82.58 | 100.00 | 99.46 | 100.00 | 99.19 | 99.19 |
9 | 31 | 283 | 53.55 | 98.41 | 99.65 | 96.73 | 98.57 | 97.75 |
10 | 25 | 223 | 51.69 | 99.33 | 84.75 | 99.55 | 90.42 | 99.55 |
11 | 30 | 275 | 75.27 | 99.45 | 90.16 | 100.00 | 85.80 | 99.10 |
12 | 18 | 163 | 46.93 | 99.08 | 97.31 | 99.38 | 94.77 | 99.38 |
13 | 27 | 241 | 45.52 | 99.79 | 98.10 | 99.59 | 91.20 | 99.59 |
14 | 10 | 85 | 30.51 | 75.91 | 97.70 | 89.17 | 93.83 | 100.00 |
OA | 67.37 | 97.26 | 97.30 | 98.50 | 95.42 | 98.80 |
AA | 64.91 | 95.86 | 97.53 | 97.86 | 95.47 | 98.89 |
Kappa coefficient | 0.6458 | 0.9703 | 0.9707 | 0.9837 | 0.9503 | 0.9870 |
Training time | 19.35 | 17.66 | 50.02 | 14.28 | 51.86 | 224.18 |
Test time | 0.74 | 0.73 | 1.99 | 0.48 | 1.54 | 7.09 |
Table 4.
F1 Score (%) for each class OA (%), AA (%), and Kappa coefficient values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the IP dataset.
Table 4.
F1 Score (%) for each class OA (%), AA (%), and Kappa coefficient values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the IP dataset.
Class | Training | Test | Models |
---|
Total | 1024 | 9225 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
1 | 5 | 41 | 22.64 | 100.00 | 80.00 | 100.00 | 42.31 | 89.19 |
2 | 143 | 1285 | 27.84 | 95.92 | 97.38 | 96.71 | 93.22 | 97.74 |
3 | 83 | 747 | 22.14 | 94.30 | 98.26 | 92.62 | 92.59 | 97.88 |
4 | 24 | 213 | 60.96 | 95.35 | 98.61 | 88.31 | 81.67 | 95.63 |
5 | 48 | 435 | 88.30 | 97.93 | 97.92 | 97.83 | 98.27 | 99.08 |
6 | 73 | 657 | 81.01 | 99.70 | 98.46 | 98.94 | 98.79 | 99.00 |
7 | 3 | 25 | 27.59 | 100.00 | 100.00 | 100.00 | 96.15 | 100.00 |
8 | 48 | 430 | 92.76 | 100.00 | 99.08 | 100.00 | 97.07 | 99.08 |
9 | 2 | 18 | 12.90 | 68.97 | 53.33 | 80.00 | 28.57 | 97.14 |
10 | 97 | 875 | 24.38 | 92.04 | 97.83 | 96.86 | 97.20 | 97.33 |
11 | 245 | 2210 | 71.90 | 96.22 | 97.88 | 97.58 | 97.13 | 98.43 |
12 | 59 | 534 | 30.79 | 93.79 | 97.13 | 95.26 | 94.47 | 96.16 |
13 | 20 | 185 | 80.00 | 100.00 | 99.73 | 99.73 | 99.73 | 98.93 |
14 | 126 | 1139 | 91.01 | 99.08 | 99.56 | 99.17 | 98.53 | 99.39 |
15 | 39 | 347 | 43.12 | 97.51 | 98.40 | 96.75 | 95.51 | 97.35 |
16 | 9 | 84 | 42.75 | 96.47 | 96.39 | 95.29 | 94.12 | 93.57 |
OA | 60.65 | 96.41 | 98.03 | 97.04 | 95.83 | 98.10 |
AA | 50.31 | 94.45 | 93.05 | 94.77 | 85.78 | 96.40 |
Kappa coefficient | 0.5447 | 0.9591 | 0.9775 | 0.9663 | 0.9523 | 0.9784 |
Training time | 39.09 | 33.45 | 126.40 | 30.99 | 117.89 | 667.04 |
Test time | 1.51 | 1.33 | 5.23 | 1.02 | 3.80 | 19.30 |
Table 5.
F1 Score (%) for each class, OA (%), AA (%), and Kappa Coefficient values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the SA dataset.
Table 5.
F1 Score (%) for each class, OA (%), AA (%), and Kappa Coefficient values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the SA dataset.
Class | Training | Test | Models |
---|
Total | 5412 | 48,717 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
1 | 201 | 1808 | 99.75 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
2 | 372 | 3354 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
3 | 197 | 1779 | 99.92 | 99.97 | 100.00 | 100.00 | 100.00 | 100.00 |
4 | 139 | 1255 | 99.29 | 99.84 | 99.48 | 99.68 | 100.00 | 99.76 |
5 | 268 | 2410 | 99.50 | 99.92 | 99.73 | 99.83 | 100.00 | 99.88 |
6 | 396 | 3563 | 99.97 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
7 | 358 | 3221 | 99.92 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
8 | 1127 | 10,144 | 99.86 | 99.30 | 99.89 | 99.79 | 99.87 | 99.98 |
9 | 620 | 5583 | 99.92 | 99.89 | 100.00 | 99.98 | 100.00 | 99.96 |
10 | 328 | 2950 | 99.46 | 99.66 | 99.83 | 99.81 | 99.95 | 99.92 |
11 | 107 | 961 | 99.84 | 99.84 | 100.00 | 99.95 | 100.00 | 100.00 |
12 | 193 | 1734 | 99.80 | 99.91 | 99.97 | 99.97 | 100.00 | 99.97 |
13 | 91 | 825 | 100.00 | 99.88 | 99.94 | 100.00 | 100.00 | 99.82 |
14 | 107 | 963 | 99.53 | 99.69 | 99.84 | 99.64 | 99.95 | 99.84 |
15 | 727 | 6541 | 99.79 | 98.95 | 99.82 | 99.69 | 99.79 | 99.97 |
16 | 181 | 1626 | 99.88 | 99.88 | 100.00 | 100.00 | 99.94 | 100.00 |
OA | 99.82 | 99.65 | 99.91 | 99.88 | 99.94 | 99.96 |
AA | 99.81 | 99.78 | 99.89 | 99.89 | 99.97 | 99.95 |
Kappa coefficient | 0.9979 | 0.9961 | 0.9990 | 0.9986 | 0.9993 | 0.9996 |
Training time | 174.42 | 153.89 | 650.92 | 144.36 | 614.32 | 3476.82 |
Test time | 7.76 | 6.33 | 25.18 | 4.37 | 17.30 | 100.28 |
Table 6.
F1 Score(%) for each class, OA (%), AA (%), and Kappa Coefficient values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the PU dataset.
Table 6.
F1 Score(%) for each class, OA (%), AA (%), and Kappa Coefficient values for the entire dataset, training time (s), and test time (s) using patch size and 10% samples for training on the PU dataset.
Class | Training | Test | Models |
---|
Total | 4277 | 38,499 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
1 | 663 | 5968 | 95.93 | 99.31 | 99.62 | 99.46 | 99.86 | 99.72 |
2 | 1865 | 16,784 | 99.93 | 99.98 | 99.99 | 99.95 | 100.00 | 99.98 |
3 | 210 | 1889 | 94.63 | 95.23 | 98.20 | 96.22 | 98.85 | 97.28 |
4 | 306 | 2758 | 94.58 | 99.53 | 98.92 | 99.40 | 99.98 | 99.51 |
5 | 134 | 1211 | 99.92 | 99.92 | 99.63 | 100.00 | 100.00 | 99.96 |
6 | 503 | 4526 | 99.42 | 99.94 | 99.93 | 99.87 | 100.00 | 99.99 |
7 | 133 | 1197 | 94.16 | 99.29 | 99.58 | 99.83 | 99.96 | 100.00 |
8 | 368 | 3314 | 92.94 | 96.54 | 98.51 | 96.89 | 99.12 | 98.36 |
9 | 95 | 852 | 88.85 | 99.47 | 98.94 | 99.88 | 99.94 | 99.11 |
OA | 97.59 | 99.28 | 99.58 | 99.37 | 99.84 | 99.62 |
AA | 94.98 | 98.64 | 99.07 | 98.98 | 99.72 | 99.13 |
Kappa coefficient | 0.9681 | 0.9904 | 0.9945 | 0.9917 | 0.9979 | 0.9949 |
Training time | 137.17 | 124.86 | 526.00 | 113.83 | 485.23 | 2747.81 |
Test time | 5.44 | 4.53 | 20.80 | 3.66 | 13.88 | 75.55 |
Table 7.
Standardized normal test statistic (Z) for the BO dataset using patch size and 10% samples for training.
Table 7.
Standardized normal test statistic (Z) for the BO dataset using patch size and 10% samples for training.
| Class 2 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
---|
Class 1 | |
---|
contextual deep CNN | 0.00 | −29.56 | −29.58 | −30.17 | −28.60 | −30.32 |
HybridSN | 29.56 | 0.00 | −0.08 | −5.69 | 4.38 | −5.42 |
DBMA | 29.58 | 0.08 | 0.00 | −3.42 | 5.53 | −4.54 |
Fast 3D CNN | 30.17 | 5.69 | 3.42 | 0.00 | 8.22 | −1.67 |
MSRN | 28.60 | −4.38 | −5.53 | −8.22 | 0.00 | −9.08 |
MSR-3DCNN | 30.32 | 5.42 | 4.54 | 1.67 | 9.08 | 0.00 |
Table 8.
Standardized normal test statistic (Z) for the IP dataset using patch size and 10% samples for training.
Table 8.
Standardized normal test statistic (Z) for the IP dataset using patch size and 10% samples for training.
| Class 2 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
---|
Class 1 | |
---|
contextual deep CNN | 0.00 | −57.44 | −58.72 | −57.94 | −56.96 | −58.78 |
HybridSN | 57.44 | 0.00 | −9.93 | −5.09 | 4.00 | −10.77 |
DBMA | 58.72 | 9.93 | 0.00 | 6.58 | 13.19 | −0.69 |
Fast 3D CNN | 57.94 | 5.09 | −6.58 | 0.00 | 9.21 | −8.00 |
MSRN | 56.96 | −4.00 | −13.19 | −9.21 | 0.00 | −13.79 |
MSR-3DCNN | 58.78 | 10.77 | 0.69 | 8.00 | 13.79 | 0.00 |
Table 9.
Standardized normal test statistic (Z) for the SA dataset using patch size and 10% samples for training.
Table 9.
Standardized normal test statistic (Z) for the SA dataset using patch size and 10% samples for training.
| Class 2 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
---|
Class 1 | |
---|
contextual deep CNN | 0.00 | 6.39 | −5.29 | −3.64 | −6.79 | −8.43 |
HybridSN | −6.39 | 0.00 | −10.22 | −9.99 | −11.79 | −12.09 |
DBMA | 5.29 | 10.22 | 0.00 | 2.66 | −1.82 | −3.54 |
Fast 3D CNN | 3.64 | 9.99 | −2.66 | 0.00 | −3.69 | −5.98 |
MSRN | 6.79 | 11.79 | 1.82 | 3.69 | 0.00 | −1.81 |
MSR-3DCNN | 8.43 | 12.09 | 3.54 | 5.98 | 1.81 | 0.00 |
Table 10.
Standardized normal test statistic (Z) for the PU dataset using patch size and 10% samples for training.
Table 10.
Standardized normal test statistic (Z) for the PU dataset using patch size and 10% samples for training.
| Class 2 | Contextual Deep CNN | HybridSN | DBMA | Fast 3D CNN | MSRN | MSR-3DCNN |
---|
Class 1 | |
---|
contextual deep CNN | 0.00 | −25.32 | −27.46 | −26.17 | −29.41 | −27.91 |
HybridSN | 25.32 | 0.00 | −8.39 | −3.36 | −14.76 | −11.00 |
DBMA | 27.46 | 8.39 | 0.00 | 6.02 | −8.85 | −1.41 |
Fast 3D CNN | 26.17 | 3.36 | −6.02 | 0.00 | −13.34 | −7.73 |
MSRN | 29.41 | 14.76 | 8.85 | 13.34 | 0.00 | 7.98 |
MSR-3DCNN | 27.91 | 11.00 | 1.41 | 7.73 | −7.98 | 0.00 |