Simplified High-Performance Cost Aggregation for Stereo Matching
Abstract
:Featured Application
Abstract
1. Introduction
- (1)
- We propose an aggregation algorithm for stereo matching that significantly simplifies computation without sacrificing matching performance. The aggregation weights can be shared between different scene images with the same resolution.
- (2)
- To provide a higher matching accuracy, we integrate the algorithm with a multi-scale scheme to exploit the spatial distribution of texture that can achieve improved performance with a minor increase in computational efforts.
2. Methods
2.1. Traditional Local and Non-Local Stereo Matching Algorithms
2.2. The Proposed Aggregation Method
2.2.1. The First Step: Horizontal Aggregation
2.2.2. The Second Step: Vertical Aggregation
2.2.3. The Proposed Scheme: Integration of the Texture-Independent Scheme with a Cross-Scale Cost Aggregation Algorithm
3. Results
- End-to-end real-time stereo matching network proposed in [25], denoted as RTSMNet.
- Matching algorithm based on a combination of the adaptive support weight with iterative guided filter and the sum of gradient matching [15], denoted as ISM.
- Sparse representation over a learned discriminative dictionary for stereo matching [4], denoted as DDL.
- Stereo matching algorithm based on two-phase adaptive optimization of ad-census and gradient fusion [8], denoted as TPAO.
- Stereo matching algorithm based on per pixel difference adjustment, iterative guided filter, and graph segmentation [14], denoted as IGF.
- Local stereo matching using adaptive cross-region-based guided image filtering with orthogonal weights [16], denoted as ACR-GIF-OW.
- Hierarchical guided-image-filtering for stereo matching [20], denoted as HGIF.
- Stereo matching with fusing adaptive support weights [21], denoted as FASW.
- The proposed scheme, which is an integration with a cross-scale cost aggregation algorithm as described in the last section.
4. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Nomenclature
a constant matrix used in deriving the proposed aggregation cost of the first step consisting of the parameter λ. | |
a constant matrix used in deriving the proposed aggregation cost of the second step consisting of the parameter λ. | |
B | a matrix consisting of elements of . |
the general form of an aggregated matching cost. | |
an updated disparity cost volume to reduce the noise effects in images. | |
the final aggregated matching cost that integrates the proposed algorithm with a cross-scale cost aggregation scheme described in [20,30]. | |
the primary matching cost used in this paper. | |
the proposed aggregation cost defined in the first step. | |
the proposed aggregation cost defined in the second step. | |
a matching cost calculated by the Hamming distance of the census transform. | |
the matching cost of the k-th scale layer, each aggregated by the proposed scheme. | |
the estimated matching cost of the k-th scale layer, each aggregated by an integrated scheme consisting of the proposed scheme and a cross-scale cost aggregation algorithm described in [20,30]. | |
a matching cost calculated by the truncated absolute difference of the gradient. | |
D | a matrix consisting of elements of . |
d | the estimated disparity of a pixel. |
the disparity map based on an image pair. | |
a function to calculate the Hamming Distance. | |
an independent objective function of image rows. | |
, | the left and right images, respectively. |
an objective function for the cross-scale scheme proposed in [20,30]. | |
, | objective functions representing the least squared difference between the proposed aggregation cost and the primary matching cost in the horizontal and vertical directions, respectively. |
K | the number of pairs in the cross-scale cost aggregation. Each image pair is down-sampled from the original pair. |
L | a path determined by the minimum spanning tree technique [17]. |
M | the vertical resolution of . |
min | a function to calculate the minimum value between several amounts. |
N | the horizontal resolution of . |
a set of 2 horizontal neighbors. | |
p, q | the location of a pixel. The symbols are interchangeably used as parameters. |
the Census transformation. | |
an independent objective function of image columns. | |
the general form of aggregation weights. | |
the aggregation weight of the weighted guided image filter. | |
the aggregation weight based on the minimum spanning tree technique [17]. | |
Greek symbols | |
α | a weighting constant. |
β | a shaping parameter. |
γ | a constraining factor. |
ε | a small constant introduced to avoid division by zero. |
λ | a normalization factor. |
μ | the mean value function. |
the variance function. | |
Ω | the supporting region centered at a pixel. |
Mathematical Operator | |
, | the gradients of intensity in the horizontal and vertical directions, respectively. |
References
- Scharstein, D.; Szeliski, R. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 2002, 47, 7–42. [Google Scholar] [CrossRef]
- Boykov, Y.; Veksler, O.; Zabih, R. Fast approximate energy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell. 2001, 23, 1222–1239. [Google Scholar] [CrossRef] [Green Version]
- Hirschmuller, H. Stereo processing by semiglobal matching and mutual information. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 30, 328–341. [Google Scholar] [CrossRef]
- Yin, J.; Zhu, H.; Yuan, D.; Xue, T. Sparse representation over discriminative dictionary for stereo matching. Pattern Recognit. 2017, 71, 278–289. [Google Scholar] [CrossRef]
- Hallek, M.; Boukamcha, H.; Mtibaa, A.; Atri, M. Dynamic programming with adaptive and self-adjusting penalty for real-time accurate stereo matching. J. Real-Time Image Proc. 2022, 19, 233–245. [Google Scholar] [CrossRef]
- Lu, Z.; Wang, J.; Li, Z.; Chen, S.; Wu, F. A Resource-Efficient Pipelined Architecture for Real-Time Semi-Global Stereo Matching. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 660–673. [Google Scholar] [CrossRef]
- Yoon, K.J.; Kweon, I.S. Adaptive support-weight approach for correspondence search. IEEE Trans. Pattern Anal. Mach. Intell. 2006, 28, 650–656. [Google Scholar] [CrossRef]
- Liu, H.; Zhang, H.; Nie, X.; He, W.; Luo, D.; Jiao, G.; Chen, W. Stereo matching algorithm based on two-phase adaptive optimization of AD-Census and gradient fusion. In Proceedings of the Conference on Real-time Computing and Robotics, Xining, China, 15–19 July 2021; pp. 726–731. [Google Scholar]
- He, K.; Sun, J.; Tang, X. Guided image filtering. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1397–1409. [Google Scholar] [CrossRef]
- Hosni, A.; Rhemann, C.; Bleyer, M.; Rother, C. Fast cost-volume filtering for visual correspondence and beyond correspondence search. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 504–511. [Google Scholar] [CrossRef]
- Li, Z.; Zheng, J.; Zhu, Z.; Yao, W.; Wu, S. Weighted guided image filtering. IEEE Trans. Image Process. 2014, 24, 120–129. [Google Scholar]
- Hong, G.S.; Kim, B.G. A local stereo matching algorithm based on weighted guided image filtering for improving the generation of depth range images. Displays 2017, 49, 80–87. [Google Scholar] [CrossRef]
- Yang, Q.; Ji, P.; Li, D.; Yao, S.; Zhang, M. Fast stereo matching using adaptive guided filtering. Image Vis. Comput. 2014, 32, 202–211. [Google Scholar] [CrossRef]
- Hamzah, R.A.; Ibrahim, H.; Hassan, A.H.A. Stereo matching algorithm based on per pixel difference adjustment, iterative guided filter and graph segmentation. J. Vis. Commun. Image Represent. 2017, 42, 145–160. [Google Scholar] [CrossRef]
- Hamzah, R.A.; Kadmin, A.F.; Hamid, M.S.; Ghani, S.F.A.; Ibrahim, H. Improvement of stereo matching algorithm for 3D surface reconstruction. Signal Process. Image Commun. 2018, 65, 165–172. [Google Scholar] [CrossRef]
- Kong, L.; Zhu, J.; Ying, S. Local stereo matching using adaptive cross-region-based guided image filtering with orthogonal weights. Math. Probl. Eng. 2021, 2021, 1–20. [Google Scholar] [CrossRef]
- Yang, Q. Stereo matching using tree filtering. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 834–846. [Google Scholar] [CrossRef]
- Çı˘gla, C.; Alatan, A.A. An efficient recursive edge-aware filter. Signal Process. Image Commun. 2014, 29, 998–1014. [Google Scholar] [CrossRef]
- Zhu, C.; Chang, Y.Z. Efficient stereo matching based on pervasive guided image filtering. Math. Probl. Eng. 2019, 2019, 3128172. [Google Scholar] [CrossRef]
- Zhu, C.; Chang, Y.Z. Hierarchical guided-image-filtering for efficient stereo matching. Appl. Sci. 2019, 9, 3122. [Google Scholar] [CrossRef] [Green Version]
- [Wu, W.; Zhu, H.; Yu, S.; Shi, J. Stereo matching with fusing adaptive support weights. IEEE Access 2019, 7, 61960–61974. [Google Scholar]
- Nguyen, V.D.; Nguyen, H.V.; Jeon, J.W. Robust stereo data cost with a learning strategy. IEEE Trans. Intell. Transp. Syst. 2017, 18, 248–258. [Google Scholar] [CrossRef]
- Godard, C.; Aodha, O.M.; Brostow, G.J. Unsupervised monocular depth estimation with left-right consistency. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 6602–6611. [Google Scholar]
- Chen, S.; Zhang, J.; Jin, M. A simplified ICA-based local similarity stereo matching. Vis. Comput. 2021, 37, 411–419. [Google Scholar] [CrossRef] [Green Version]
- Chang, J.R.; Chen, Y.S. Pyramid stereo matching network. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 19–21 June 2018; pp. 5410–5418. [Google Scholar]
- Xie, Y.; Zheng, S.; Li, W. Feature-guided spatial attention upsampling for real-time stereo matching network. IEEE MultiMedia. 2021, 28, 38–47. [Google Scholar] [CrossRef]
- Tian, M.; Yang, B.; Chen, C.; Huang, R.; Huo, L. HPM-TDP: An efficient hierarchical PatchMatch depth estimation approach using tree dynamic programming. ISPRS 2019, 155, 37–57. [Google Scholar] [CrossRef]
- Zabih, R.; Woodfill, J. Non-parametric local transforms for computing visual correspondence. In Proceedings of the Third European Conference on Computer Vision, Stockholm, Sweden, 2–6 May 1994; pp. 151–158. [Google Scholar]
- Venetis, I.E.; Kouris, A.; Sobczyk, A.; Gallopoulos, E.; Sameh, A. A direct tridiagonal solver based on Givens rotations for GPU architectures. Parallel Comput. 2015, 49, 101–116. [Google Scholar] [CrossRef]
- Zhang, K.; Fang, Y.; Min, D.; Sun, L.; Yang, S.; Yan, S.; Tian, Q. Cross-scale cost aggregation for stereo matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23-28 June 2014; pp. 1590–1597. [Google Scholar]
- Scharstein, D.; Hirschmüller, H.; Kitajima, Y.; Krathwohl, G.; Nesic, N.; Wang, X.; Westling, P. High-resolution stereo datasets with subpixel-accurate ground truth. In Proceedings of the German Conference on Pattern Recognition (GCPR 2014), Münster, Germany, 12–15 September 2014; pp. 1–12. [Google Scholar]
- Moritz, M.; Geiger, A. Object Scene Flow for Autonomous Vehicles. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 8–10 June 2015; pp. 1–10. [Google Scholar]
- Rani, E.F.I.; Pushparaj, T.L.; Raj, E.F.I. Escalating the resolution of an urban aerial image via novel shadow amputation algorithm. Earth Sci. Inform. 2022, 15, 905–913. [Google Scholar] [CrossRef]
Resolution | Adirondack | ArtL | Jadeplant | Motorcycle | MotorcycleE | Piano | PianoL | Pipes |
M | 496 | 277 | 497 | 497 | 497 | 481 | 481 | 485 |
N | 718 | 347 | 659 | 741 | 741 | 707 | 707 | 735 |
Resolution | Playroom | Playtable | PlaytableP | Recycle | Shelves | Teddy | Vintage | |
M | 476 | 463 | 462 | 486 | 497 | 450 | 480 | |
N | 699 | 680 | 681 | 720 | 738 | 375 | 722 |
Test Sets | RTSMNet | ISM | DDL | TPAO | IGF | ACR-GIF-OW | HGIF | FASW | Proposed |
---|---|---|---|---|---|---|---|---|---|
Adirondack | 4.5 | 15.5 | 6.5 | 7.1 | 14.9 | 7.9 | 5.7 | 6.8 | 4.8 |
ArtL | 11.5 | 16.5 | 13.1 | 13.2 | 15.9 | 10.1 | 10.9 | 10.5 | 9.7 |
Jadeplant | 43.0 | 25.1 | 18.8 | 16.1 | 24.2 | 17.4 | 19.7 | 19.0 | 17.4 |
Motorcycle | 8.9 | 10.0 | 9.0 | 6.8 | 9.6 | 6.8 | 8.5 | 6.8 | 7.6 |
MotorcycleE | 8.9 | 11.0 | 7.7 | 5.9 | 10.5 | 6.8 | 8.5 | 6.3 | 6.7 |
Piano | 13.3 | 23.5 | 14.8 | 17.9 | 24.1 | 19.1 | 14.0 | 13.8 | 14.3 |
PianoL | 25.2 | 32.6 | 26.6 | 26.7 | 32.1 | 34.2 | 29.3 | 25.8 | 27.2 |
Pipes | 13.8 | 15.5 | 11.6 | 10.1 | 15.1 | 9.4 | 9.9 | 9.4 | 9.1 |
Playroom | 18.0 | 21.6 | 16.2 | 17.6 | 21.5 | 18.7 | 15.2 | 16.7 | 16.3 |
Playtable | 10.6 | 39.6 | 18.4 | 27.3 | 39.6 | 24.3 | 16.7 | 20.8 | 16.6 |
PlaytableP | 6.7 | 20.9 | 10.6 | 12.5 | 20.9 | 11.4 | 11.0 | 10.2 | 10.3 |
Recycle | 6.8 | 15.0 | 9.1 | 9.1 | 14.4 | 9.5 | 8.3 | 8.1 | 8.7 |
Shelves | 22.2 | 34.4 | 37.8 | 38.6 | 36.0 | 38.4 | 31.6 | 32.6 | 33.6 |
Teddy | 9.4 | 7.2 | 6.2 | 6.2 | 7.0 | 6.1 | 5.6 | 5.2 | 6.3 |
Vintage | 39.3 | 34.4 | 27.3 | 27.5 | 34.6 | 25.6 | 27.0 | 23.9 | 26.5 |
Weighted Average | 14.7 | 19.3 | 13.6 | 13.9 | 19.1 | 14.0 | 12.9 | 12.5 | 12.4 |
Test Sets | RTSMNet | ISM | DDL | TPAO | IGF | ACR-GIF-OW | HGIF | FASW | Proposed |
---|---|---|---|---|---|---|---|---|---|
Adirondack | 6.2 | 18.4 | 8.8 | 11.7 | 17.6 | 12.5 | 8.0 | 8.7 | 7.1 |
ArtL | 15.5 | 25.6 | 22.8 | 24.5 | 24.8 | 23.4 | 21.2 | 20.8 | 20.7 |
Jadeplant | 47.8 | 36.8 | 32.4 | 28.4 | 35.9 | 31.3 | 33.0 | 32.3 | 31.8 |
Motorcycle | 11.7 | 14.9 | 13.4 | 14.1 | 14.5 | 14.9 | 12.6 | 10.9 | 11.9 |
MotorcycleE | 11.7 | 15.9 | 11.8 | 13.1 | 15.3 | 14.6 | 12.9 | 10.5 | 11.2 |
Piano | 16.8 | 27.6 | 19.2 | 22.9 | 28.1 | 23.8 | 18.1 | 18.2 | 18.4 |
PianoL | 28.4 | 36.3 | 30.5 | 30.9 | 35.7 | 37.9 | 33.1 | 29.5 | 30.7 |
Pipes | 21.7 | 26.2 | 22.7 | 22.8 | 25.7 | 21.8 | 21.5 | 20.2 | 20.7 |
Playroom | 25.0 | 29.7 | 25.1 | 26.7 | 29.4 | 28.4 | 23.8 | 25.5 | 25.0 |
Playtable | 13.6 | 42.9 | 23.3 | 33.1 | 42.9 | 30.5 | 21.2 | 25.0 | 22.6 |
PlaytableP | 8.8 | 25.9 | 14.3 | 19.2 | 25.9 | 18.5 | 14.3 | 13.4 | 14.4 |
Recycle | 7.8 | 17.5 | 11.4 | 12.7 | 17.0 | 13.6 | 10.3 | 10.2 | 11.2 |
Shelves | 23.7 | 35.5 | 39.1 | 39.9 | 37.0 | 40.3 | 32.6 | 34.0 | 34.7 |
Teddy | 11.2 | 12.0 | 11.8 | 12.7 | 11.8 | 12.3 | 11.2 | 10.9 | 11.7 |
Vintage | 41.1 | 37.7 | 31.5 | 32.2 | 37.9 | 30.1 | 31.0 | 28.7 | 30.8 |
Weighted Average | 18.0 | 24.9 | 19.5 | 21.1 | 24.6 | 21.6 | 18.7 | 18.2 | 18.4 |
Test Sets | ACR-GIF-OW | HGIF | Proposed |
---|---|---|---|
Adirondack | 215 | 31 | 10 |
ArtL | 34 | 7 | 1 |
Jadeplant | 374 | 87 | 21 |
Motorcycle | 191 | 37 | 10 |
MotorcycleE | 192 | 36 | 10 |
Piano | 169 | 27 | 7 |
PianoL | 171 | 26 | 7 |
Pipes | 197 | 37 | 11 |
Playroom | 211 | 31 | 11 |
Playtable | 169 | 29 | 9 |
PlaytableP | 167 | 30 | 9 |
Recycle | 199 | 30 | 8 |
Shelves | 195 | 29 | 8 |
Teddy | 68 | 14 | 4 |
Vintage | 499 | 98 | 25 |
Algorithms | Non-Occ (%) | All (%) | Non-Occ (Pixels) | All (Pixels) |
---|---|---|---|---|
ISM | 8.88 | 10.01 | 1.87 | 2.07 |
AGF | 8.59 | 9.73 | 1.77 | 1.99 |
MST | 23.27 | 24.41 | 3.47 | 4.15 |
HGIF | 6.57 | 7.78 | 1.35 | 1.61 |
FASW | 6.89 | 8.12 | 1.31 | 1.45 |
Proposed | 6.30 | 7.48 | 1.30 | 1.58 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhu, C.; Chang, Y.-Z. Simplified High-Performance Cost Aggregation for Stereo Matching. Appl. Sci. 2023, 13, 1791. https://doi.org/10.3390/app13031791
Zhu C, Chang Y-Z. Simplified High-Performance Cost Aggregation for Stereo Matching. Applied Sciences. 2023; 13(3):1791. https://doi.org/10.3390/app13031791
Chicago/Turabian StyleZhu, Chengtao, and Yau-Zen Chang. 2023. "Simplified High-Performance Cost Aggregation for Stereo Matching" Applied Sciences 13, no. 3: 1791. https://doi.org/10.3390/app13031791
APA StyleZhu, C., & Chang, Y.-Z. (2023). Simplified High-Performance Cost Aggregation for Stereo Matching. Applied Sciences, 13(3), 1791. https://doi.org/10.3390/app13031791