Figure 1.
Flowchart of the proposed dual-branch Fourier-mixing transformer-based target detector (DBFTTD). In the training stage, data generation provides spectral pairs, and . Then, two spectral sequences in each spectral pair pass through the dual-branch Fourier-mixing transformer network at the same time. , and represent the prior spectrum, target training sample with label , and background training sample with label , respectively. In the testing stage, represents the test spectral sequence in the given HSI, which is paired with the prior target spectrum and the fed into the well-trained network to get the final detection result.
Figure 1.
Flowchart of the proposed dual-branch Fourier-mixing transformer-based target detector (DBFTTD). In the training stage, data generation provides spectral pairs, and . Then, two spectral sequences in each spectral pair pass through the dual-branch Fourier-mixing transformer network at the same time. , and represent the prior spectrum, target training sample with label , and background training sample with label , respectively. In the testing stage, represents the test spectral sequence in the given HSI, which is paired with the prior target spectrum and the fed into the well-trained network to get the final detection result.
Figure 2.
Illustration of training data generation in the proposed DBFTTD. For a given HSI (with p pixels and n bands) and the single prior spectrum , p target training samples and p background training samples are obtained. The target training samples are obtained by a simple yet efficient dropout strategy, which is applied to along the spectral dimension. Based on the background dominant assumption, all spectral sequences in the given HSI are considered as background training samples .
Figure 2.
Illustration of training data generation in the proposed DBFTTD. For a given HSI (with p pixels and n bands) and the single prior spectrum , p target training samples and p background training samples are obtained. The target training samples are obtained by a simple yet efficient dropout strategy, which is applied to along the spectral dimension. Based on the background dominant assumption, all spectral sequences in the given HSI are considered as background training samples .
Figure 3.
San Diego-1 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 3.
San Diego-1 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 4.
San Diego-2 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 4.
San Diego-2 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 5.
Urban-1 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 5.
Urban-1 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 6.
Urban-2 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 6.
Urban-2 scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 7.
Muufl Gulfport scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 7.
Muufl Gulfport scene and detection maps of different comparison methods. (a) False color image, (b) Ground truth, (c) SAM, (d) ACE, (e) CEM, (f) E-CEM, (g) HTD-IRN, (h) SFCTD, (i) TSCNTD, (j) Ours.
Figure 8.
ROC curves of different comparison methods on the five data sets.
Figure 8.
ROC curves of different comparison methods on the five data sets.
Figure 9.
Comparison of detection maps without skip-layer connection and with skip-layer connection for five data sets. (a) San Diego-1. (b) San Diego-2. (c) Urban-1. (d) Urban-2. (e) Muufl Gulfport. First row: without skip-layer connection. Second row: with skip-layer connection.
Figure 9.
Comparison of detection maps without skip-layer connection and with skip-layer connection for five data sets. (a) San Diego-1. (b) San Diego-2. (c) Urban-1. (d) Urban-2. (e) Muufl Gulfport. First row: without skip-layer connection. Second row: with skip-layer connection.
Figure 10.
Comparison of ROC curves without skip-layer connection and with skip-layer connection for five data sets.
Figure 10.
Comparison of ROC curves without skip-layer connection and with skip-layer connection for five data sets.
Figure 11.
Comparison of detection maps with attention module and Fourier-mixing module for five data sets. (a) San Diego-1. (b) San Diego-2. (c) Urban-1. (d) Urban-2. (e) Muufl Gulfport. First row: attention-based. Second row: Fourier-based.
Figure 11.
Comparison of detection maps with attention module and Fourier-mixing module for five data sets. (a) San Diego-1. (b) San Diego-2. (c) Urban-1. (d) Urban-2. (e) Muufl Gulfport. First row: attention-based. Second row: Fourier-based.
Figure 12.
Comparison of ROC curves with attention module and Fourier-mixing module for five data sets.
Figure 12.
Comparison of ROC curves with attention module and Fourier-mixing module for five data sets.
Figure 13.
The parameters sensitivity analysis of the number of the filter ensembles for four data sets. The default ensemble number in our proposed DBFTTD is 4.
Figure 13.
The parameters sensitivity analysis of the number of the filter ensembles for four data sets. The default ensemble number in our proposed DBFTTD is 4.
Figure 14.
AUC values of the DBFTTD with different dropout rate for data augmentation for four data sets. The default dropout rate in our proposed DBFTTD is 0.1.
Figure 14.
AUC values of the DBFTTD with different dropout rate for data augmentation for four data sets. The default dropout rate in our proposed DBFTTD is 0.1.
Table 1.
AUC Values’ Comparison for San Diego-1.
Table 1.
AUC Values’ Comparison for San Diego-1.
Methods | | | | | | | |
---|
SAM | 0.9968 | 0.0253 | 0.2428 | 0.9714 | 1.2395 | 1.2142 | 9.5843 |
ACE | 0.9685 | 0.0018 | 0.2862 | 0.9667 | 1.2547 | 1.2530 | 162.6307 |
CEM | 0.9869 | 0.0738 | 0.3596 | 0.9131 | 1.3465 | 1.2727 | 4.8725 |
E-CEM | 0.8918 | 0.0140 | 0.4219 | 0.8778 | 1.3137 | 1.2997 | 30.1171 |
HTD-IRN | 0.9797 | 0.0153 | 0.4399 | 0.9644 | 1.4195 | 1.4042 | 28.7873 |
SFCTD | 0.9906 | 0.0794 | 0.6613 | 0.9111 | 1.6519 | 1.5724 | 8.3249 |
TSCNTD | 0.9387 | 0.0076 | 0.5412 | 0.9311 | 1.4799 | 1.4723 | 70.9253 |
ours | 0.9976 | 0.0025 | 0.6868 | 0.9951 | 1.6845 | 1.6819 | 270.4094 |
Table 2.
AUC Values’ Comparison for San Diego-2.
Table 2.
AUC Values’ Comparison for San Diego-2.
Methods | | | | | | | |
---|
SAM | 0.9929 | 0.0365 | 0.3818 | 0.9564 | 1.3747 | 1.3382 | 10.4689 |
ACE | 0.9934 | 0.0060 | 0.5207 | 0.9874 | 1.5141 | 1.5081 | 87.2144 |
CEM | 0.9971 | 0.1862 | 0.6831 | 0.8109 | 1.6802 | 1.4940 | 3.6683 |
E-CEM | 0.7693 | 0.0097 | 0.3134 | 0.7597 | 1.0827 | 1.0730 | 32.4054 |
HTD-IRN | 0.9983 | 0.0100 | 0.6010 | 0.9883 | 1.5993 | 1.5893 | 59.9830 |
SFCTD | 0.9945 | 0.0063 | 0.7385 | 0.9882 | 1.7329 | 1.7266 | 116.4748 |
TSCNTD | 0.9951 | 0.0024 | 0.8179 | 0.9927 | 1.8130 | 1.8106 | 342.2259 |
ours | 0.9971 | 0.0021 | 0.8958 | 0.9950 | 1.8929 | 1.8908 | 422.5377 |
Table 3.
AUC Values’ Comparison for Urban-1.
Table 3.
AUC Values’ Comparison for Urban-1.
Methods | | | | | | | |
---|
SAM | 0.9948 | 0.0323 | 0.4567 | 0.9625 | 1.4514 | 1.4192 | 14.1468 |
ACE | 0.9288 | 0.0022 | 0.3029 | 0.9266 | 1.2317 | 1.2295 | 137.6682 |
CEM | 0.9404 | 0.1094 | 0.5064 | 0.8309 | 1.4468 | 1.3373 | 4.6277 |
E-CEM | 0.8815 | 0.0059 | 0.3324 | 0.8756 | 1.2139 | 1.2080 | 56.6269 |
HTD-IRN | 0.9659 | 0.0164 | 0.3341 | 0.9495 | 1.2999 | 1.2836 | 20.3938 |
SFCTD | 0.9918 | 0.0455 | 0.7086 | 0.9463 | 1.7004 | 1.6549 | 15.5873 |
TSCNTD | 0.9813 | 0.0253 | 0.7123 | 0.9560 | 1.6936 | 1.6683 | 28.1335 |
ours | 0.9961 | 0.0006 | 0.6762 | 0.9955 | 1.6723 | 1.6717 | 1146.1186 |
Table 4.
AUC Values’ Comparison for Urban-2.
Table 4.
AUC Values’ Comparison for Urban-2.
Methods | | | | | | | |
---|
SAM | 0.9990 | 0.0051 | 0.2908 | 0.9940 | 1.2898 | 1.2847 | 57.3511 |
ACE | 0.9922 | 0.0034 | 0.3379 | 0.9888 | 1.3301 | 1.3267 | 99.0909 |
CEM | 0.9979 | 0.2003 | 0.6014 | 0.7976 | 1.5993 | 1.3990 | 3.0028 |
E-CEM | 0.8646 | 0.0047 | 0.3808 | 0.8600 | 1.2454 | 1.2408 | 81.8946 |
HTD-IRN | 0.9993 | 0.0747 | 0.7514 | 0.9246 | 1.7508 | 1.6761 | 10.0594 |
SFCTD | 0.9955 | 0.0383 | 0.9149 | 0.9572 | 1.9104 | 1.8721 | 23.8999 |
TSCNTD | 0.9954 | 0.0825 | 1.0964 | 0.9128 | 2.0917 | 2.0092 | 13.2860 |
ours | 0.9994 | 0.0015 | 0.8260 | 0.9979 | 1.8253 | 1.8239 | 561.8707 |
Table 5.
AUC Values’ Comparison for Muufl Gulfport.
Table 5.
AUC Values’ Comparison for Muufl Gulfport.
Methods | | | | | | | |
---|
SAM | 0.5439 | 0.2970 | 0.2945 | 0.2469 | 0.8383 | 0.5413 | 0.9914 |
ACE | 0.9374 | 0.0381 | 0.2761 | 0.8993 | 1.2135 | 1.1754 | 7.2435 |
CEM | 0.9865 | 0.4029 | 0.6575 | 0.5835 | 1.6440 | 1.2410 | 1.6318 |
E-CEM | 0.6763 | 0.2729 | 0.3037 | 0.4034 | 0.9800 | 0.7071 | 1.1127 |
HTD-IRN | 0.9749 | 0.1940 | 0.7865 | 0.7809 | 1.7614 | 1.5674 | 4.0545 |
SFCTD | 0.9880 | 0.1026 | 0.7245 | 0.8854 | 1.7125 | 1.6099 | 7.0620 |
TSCNTD | 0.9826 | 0.0162 | 0.7155 | 0.9664 | 1.6980 | 1.6818 | 44.1370 |
ours | 0.9939 | 0.0010 | 0.8314 | 0.9929 | 1.8254 | 1.8244 | 839.8182 |
Table 6.
PD under FAR = 0.01 of different comparison methods on five data sets.
Table 6.
PD under FAR = 0.01 of different comparison methods on five data sets.
Method | San Diego-1 | San Diego-2 | Urban-1 | Urban-2 | Muufl Gulfport |
---|
SAM | 0.9189 | 0.7896 | 0.8970 | 0.9886 | 0.0000 |
ACE | 0.8730 | 0.9476 | 0.8223 | 0.8978 | 0.4461 |
CEM | 0.8961 | 0.9476 | 0.8518 | 0.9319 | 0.9484 |
E-CEM | 0.6045 | 0.4036 | 0.4348 | 0.5796 | 0.2416 |
HTD-IRN | 0.8657 | 0.9649 | 0.1045 | 0.9886 | 0.6543 |
SFCTD | 0.7619 | 0.8951 | 0.7476 | 0.8864 | 0.8662 |
TSCNTD | 0.6130 | 0.9310 | 0.4348 | 0.9886 | 0.4461 |
ours | 0.9415 | 0.9662 | 0.9109 | 1.0000 | 0.9486 |
Table 7.
Inference time (in milliseconds) comparison of our attention-free DBFTTD and the attention-based variant on the four data sets.
Table 7.
Inference time (in milliseconds) comparison of our attention-free DBFTTD and the attention-based variant on the four data sets.
Method | San Diego-1 | San Diego-2 | Urban-1 | Urban-2 |
---|
Ours | 215.79 ms | 216.37 ms | 244.94 ms | 246.91 ms |
Attention Based | 335.41 ms | 336.45 ms | 355.99 ms | 363.08 ms |