Figure 1.
The variety of disparities in remote sensing stereo pairs of the US3D dataset.
Figure 1.
The variety of disparities in remote sensing stereo pairs of the US3D dataset.
Figure 2.
The architecture of the proposed edge-sense bidirectional pyramid network: the input stereo pairs are concatenated by the normal and reverse pairs. As the network is trained, the forward and backward occlusion-aware maps are generated to reduce the deformation by masking the occluded pixel, and the structure information is extracted from the input pairs, acting as an edge-sense prior for second-order smoothness loss.
Figure 2.
The architecture of the proposed edge-sense bidirectional pyramid network: the input stereo pairs are concatenated by the normal and reverse pairs. As the network is trained, the forward and backward occlusion-aware maps are generated to reduce the deformation by masking the occluded pixel, and the structure information is extracted from the input pairs, acting as an edge-sense prior for second-order smoothness loss.
Figure 3.
Reconstruction of the cost volume with the stacked feature maps, which has the size of , covering disparities that range from to , where F denotes the number of feature maps and H and W are the height and width of the input images, respectively.
Figure 3.
Reconstruction of the cost volume with the stacked feature maps, which has the size of , covering disparities that range from to , where F denotes the number of feature maps and H and W are the height and width of the input images, respectively.
Figure 4.
Bidirectional disparity maps. (a) The left image. (b) The disparity map based on the left image. (c) The right image. (d) The disparity map based on the right image.
Figure 4.
Bidirectional disparity maps. (a) The left image. (b) The disparity map based on the left image. (c) The right image. (d) The disparity map based on the right image.
Figure 5.
Bidirectional occlusion-aware maps.
Figure 5.
Bidirectional occlusion-aware maps.
Figure 6.
The examples of left images that contain occluded regions, the tile numbers of these images (from left to right) are: JAX-068-014-013, JAX-168-022-005, JAX-260-022-001, OMA-288-036-038, OMA-212-038-033.
Figure 6.
The examples of left images that contain occluded regions, the tile numbers of these images (from left to right) are: JAX-068-014-013, JAX-168-022-005, JAX-260-022-001, OMA-288-036-038, OMA-212-038-033.
Figure 7.
The comparison of different networks. Results in (a) are generated from DenseMapNet while (b) illustrates the results from PSMNet and Bidir-EPNet in detail. The columns from left to right represent Left image, PSMNet, Bidir-EPNet, ROI of PSMNet, and ROI of Bidir-EPNet, respectively.
Figure 7.
The comparison of different networks. Results in (a) are generated from DenseMapNet while (b) illustrates the results from PSMNet and Bidir-EPNet in detail. The columns from left to right represent Left image, PSMNet, Bidir-EPNet, ROI of PSMNet, and ROI of Bidir-EPNet, respectively.
Figure 8.
The generalization of different networks. The columns from left to right represent Left image, DenseMapNet, PSMNet, and Bidir-EPNet, respectively.
Figure 8.
The generalization of different networks. The columns from left to right represent Left image, DenseMapNet, PSMNet, and Bidir-EPNet, respectively.
Figure 9.
The examples of left images that contain textureless regions, the tile numbers of these images (from left to right) are: JAX-214-011-004, JAX-269-013-025, JAX-236-002-001, JAX-269-018-013.
Figure 9.
The examples of left images that contain textureless regions, the tile numbers of these images (from left to right) are: JAX-214-011-004, JAX-269-013-025, JAX-236-002-001, JAX-269-018-013.
Figure 10.
The comparison of different networks results in (a) are generated from DenseMapNet, whereas (b) illustrates the results from PSMNet and Bidir-EPNet in detail. The columns from left to right represent Left image, PSMNet, Bidir-EPNet, ROI of PSMNet, and ROI of Bidir-EPNet, respectively.
Figure 10.
The comparison of different networks results in (a) are generated from DenseMapNet, whereas (b) illustrates the results from PSMNet and Bidir-EPNet in detail. The columns from left to right represent Left image, PSMNet, Bidir-EPNet, ROI of PSMNet, and ROI of Bidir-EPNet, respectively.
Figure 11.
The generalization of different networks. The columns from left to right represent Left image, DenseMapNet, PSMNet, and Bidir-EPNet, respectively.
Figure 11.
The generalization of different networks. The columns from left to right represent Left image, DenseMapNet, PSMNet, and Bidir-EPNet, respectively.
Figure 12.
Different settings for parameters and .
Figure 12.
Different settings for parameters and .
Figure 13.
From left to right: Left image, weight maps of smooth term with , , and with a threshold of 0.3.
Figure 13.
From left to right: Left image, weight maps of smooth term with , , and with a threshold of 0.3.
Figure 14.
The comparison of different loss function. The columns from left to right denote: Left images, only bidirectional unsupervised loss, bidirectional, and smoothness loss without edge previously, the final loss.
Figure 14.
The comparison of different loss function. The columns from left to right denote: Left images, only bidirectional unsupervised loss, bidirectional, and smoothness loss without edge previously, the final loss.
Table 1.
The datasets configurations for experiments: JAX represents Jacksonville and OMA denotes Omaha.
Table 1.
The datasets configurations for experiments: JAX represents Jacksonville and OMA denotes Omaha.
Stereo Pair | Mode | Patch Size | Number of Images: Training/Testing | Usage |
---|
JAX | RGB | | 1500/346 | Training and Testing |
OMA | RGB | | –/2086 | Testing |
Table 2.
The comparison of averaged endpoint error (EPE) (pixel) and fraction of erroneous pixels (D1) (%). The best results are labeled as bold.
Table 2.
The comparison of averaged endpoint error (EPE) (pixel) and fraction of erroneous pixels (D1) (%). The best results are labeled as bold.
Networks | JAX | OMA |
---|
EPE | D1 | EPE | D1 |
---|
DenseMapNet | 1.8657 | 15.34 | 1.6251 | 14.84 |
PSMNet | 1.3644 | 9.51 | 1.5122 | 9.15 |
Bidir-EPNet | 1.2764 | 8.03 | 1.4899 | 8.96 |
Table 3.
The comparison of EPE and D1 of occluded regions. The best results are labeled as bold.
Table 3.
The comparison of EPE and D1 of occluded regions. The best results are labeled as bold.
Tile Number | DenseMapNet | PSMNet | Bidir-EPNet |
---|
EPE | D1 | EPE | D1 | EPE | D1 |
---|
JAX-068-014-013 | 3.59 | 32.1 | 1.31 | 9.20 | 1.14 | 7.49 |
JAX-168-022-005 | 1.40 | 9.88 | 0.89 | 3.54 | 0.80 | 3.27 |
JAX-260-022-001 | 2.40 | 20.6 | 0.94 | 4.16 | 0.91 | 3.55 |
OMA-288-036-038 | 1.40 | 10.0 | 0.91 | 4.54 | 0.88 | 4.37 |
OMA-212-038-033 | 1.32 | 9.54 | 1.22 | 7.14 | 0.98 | 3.78 |
Table 4.
The comparison of EPE and D1 of occluded regions in OMA. The best results are labeled as bold.
Table 4.
The comparison of EPE and D1 of occluded regions in OMA. The best results are labeled as bold.
Tile Number | DenseMapNet | PSMNet | Bidir-EPNet |
---|
EPE | D1 | EPE | D1 | EPE | D1 |
---|
OMA-276-040-038 | 2.72 | 31.6 | 3.11 | 33.6 | 1.84 | 17.3 |
OMA-172-041-036 | 1.69 | 14.6 | 1.62 | 10.9 | 1.20 | 7.03 |
OMA-225-031-027 | 3.07 | 37.3 | 4.33 | 41.2 | 1.48 | 10.3 |
OMA-247-034-038 | 1.49 | 8.47 | 2.05 | 9.12 | 0.71 | 2.14 |
OMA-248-036-027 | 2.23 | 17.9 | 1.60 | 9.67 | 0.96 | 3.65 |
OMA-203-041-038 | 1.85 | 18.1 | 1.64 | 10.0 | 1.29 | 6.41 |
Table 5.
The comparison of EPE and D1 of textureless regions. The best results are labeled as bold.
Table 5.
The comparison of EPE and D1 of textureless regions. The best results are labeled as bold.
Tile Number | DenseMapNet | PSMNet | Bidir-EPNet |
---|
EPE | D1 | EPE | D1 | EPE | D1 |
---|
JAX-214-011-004 | 2.27 | 23.31 | 1.13 | 5.81 | 1.04 | 4.72 |
JAX-269-013-025 | 1.67 | 15.76 | 1.01 | 5.96 | 0.92 | 4.68 |
JAX-236-002-001 | 1.32 | 11.18 | 1.02 | 7.33 | 0.98 | 6.52 |
JAX-269-018-013 | 2.69 | 29.27 | 1.63 | 13.71 | 1.45 | 11.12 |
Table 6.
The comparison of EPE and D1 of occluded regions in OMA. The best results are labeled as bold.
Table 6.
The comparison of EPE and D1 of occluded regions in OMA. The best results are labeled as bold.
Tile Number | DenseMapNet | PSMNet | Bidir-EPNet |
---|
EPE | D1 | EPE | D1 | EPE | D1 |
---|
OMA-134-004-008 | 1.72 | 15.9 | 1.67 | 14.7 | 1.22 | 9.27 |
OMA-176-031-026 | 1.34 | 10.2 | 0.96 | 6.90 | 0.85 | 6.86 |
OMA-181-027-028 | 0.91 | 1.45 | 0.98 | 1.48 | 0.86 | 1.35 |
OMA-211-033-031 | 2.09 | 18.1 | 1.31 | 6.10 | 1.11 | 4.32 |
OMA-251-043-041 | 6.66 | 45.1 | 4.29 | 23.5 | 1.92 | 8.74 |
OMA-292-039-003 | 1.07 | 3.58 | 1.76 | 3.38 | 1.38 | 2.93 |
Table 7.
The comparison of EPE (pixel) with different loss weights. The best result is labeled as bold.
Table 7.
The comparison of EPE (pixel) with different loss weights. The best result is labeled as bold.
Loss Weight | EPE(Pixel) |
---|
| | |
1.0 | 0.0 | 0.0 | 1.36 |
1.0 | 0.1 | 0.3 | 1.33 |
1.0 | 0.3 | 0.5 | 1.30 |
1.0 | 0.5 | 0.8 | 1.27 |
1.0 | 0.8 | 0.9 | 1.29 |
1.0 | 1.0 | 1.0 | 1.29 |