Accurate 3D Terrain Reconstruction for Multi-View Thermal Infrared Images with Small Intersection Angles
Highlights
- An affine-initialized RPC framework enables stable 3D reconstruction from bidirectional whisk-broom thermal infrared imagery with small intersection angles.
- A hierarchical, longitude/latitude-first RPC optimization improves numerical stability and mitigates height-error propagation under weak stereo geometry.
- Reliable 3D terrain reconstruction can be achieved without access to confidential rigorous sensor models.
- The framework offers a practical solution for DEM generation from low-texture thermal infrared stereo imagery under weak stereo geometry.
Abstract
1. Introduction
1.1. Motivation
1.2. Related Work
1.3. Contribution
- (1)
- We propose a two-stage, affine-initialized RPC reconstruction framework, in which a fast affine-based 3D estimator provides stable initial values for subsequent RPC iterative refinement. This design reduces sensitivity to initialization under weak stereo geometry while maintaining computational efficiency.
- (2)
- We develop an enhanced local affine preprocessing strategy by partitioning large whisk-broom images into overlapping sub-images for rapid estimation. By exploiting locally smooth imaging geometry, this strategy mitigates distortions introduced by affine linearization and helps alleviate boundary discontinuities in the initialization stage.
- (3)
- We propose a hierarchical coordinate optimization strategy for RPC refinement, where longitude and latitude are optimized before altitude. This decoupling addresses the ill-conditioning of altitude estimation under small intersection angles and improves the numerical stability of the refinement process.
2. Materials and Methods
2.1. Overall Framework
2.2. Rapid Estimation of Geographic Coordinates Based on an Affine Model
2.2.1. Local Affine-Based Initial 3D Approximation
2.2.2. Global Geographic Coordinate Upgrading
2.3. Iterative Coordinate Optimization Using the RPC Model
| Algorithm 1. Iterative Geocoordinate Optimization. | |
| Input: initial geographic coordinates threshold Output: Optimized coordinates 1: repeat | |
| 2: | ←solve the error equation by Equation (11) |
| 3: | ←add the corrections to initial values |
| 4: until and , | |
| 5: repeat | |
| 6: | ←solve the error equation by Equation (11) |
| 7: | ←add the corrections to initial values |
| 8: until , | |
| 9: result | |
2.4. Accuracy Assessment
3. Results
3.1. Experimental Data
3.2. Qualitative Comparison of Different 3D Scenes
3.3. Quantitative Assessment of Reconstructed 3D Scenes
4. Discussion
4.1. Applicability of Existing RPC-Based Methods
4.2. Overall Accuracy Performance and Interpretation
4.3. Influence of Terrain and Key Parameters
4.4. Limitations and Future Work
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
| Scene Number | MAE (m) | Median Error (m) | RMSE (m) | Mean Error (m) |
|---|---|---|---|---|
| 1 | 21.35 | −3.18 | 30.32 | −4.94 |
| 2 | 24.27 | 5.32 | 31.89 | 1.41 |
| 3 | 38.23 | −2.68 | 47.79 | −4.68 |
| 4 | 30.82 | −2.41 | 52.05 | −11.08 |
| 5 | 39.42 | −1.34 | 50.61 | 1.77 |
| 6 | 36.02 | −1.25 | 50.37 | −5.26 |
| 7 | 14.08 | 0.70 | 19.89 | 1.30 |
| 8 | 27.28 | 4.99 | 38.10 | 7.27 |
| 9 | 19.81 | −1.68 | 28.70 | −4.41 |
| 10 | 27.67 | −5.27 | 32.71 | −1.55 |
| 11 | 11.34 | −2.68 | 35.48 | −6.92 |
| 12 | 3.74 | −0.19 | 5.41 | −0.81 |
| 13 | 5.15 | −0.10 | 7.77 | −0.27 |
| 14 | 6.88 | 1.23 | 8.96 | 0.79 |
References
- Ganci, G.; Cappello, A.; Neri, M. Data Fusion for Satellite-Derived Earth Surface: The 2021 Topographic Map of Etna Volcano. Remote Sens. 2023, 15, 198. [Google Scholar] [CrossRef]
- Colverd, G.; Takami, J.; Schade, L.; Bot, K.; Gallego-Mejia, J.A. Tomographic SAR Reconstruction for Forest Height Estimation. arXiv 2024, arXiv:2412.00903v2. [Google Scholar] [CrossRef]
- Petrović, I.; Sečnik, M.; Hočevar, M.; Berk, P. Vine Canopy Reconstruction and Assessment with Terrestrial Lidar and Aerial Imaging. Remote Sens. 2022, 14, 5894. [Google Scholar] [CrossRef]
- Li, Z.; Ji, S.; Fan, D.; Yan, Z.; Wang, F.; Wang, R. Introduction of 3D Information of Buildings from Single-View Images Based on Shadow Information. ISPRS Int. J. Geo-Inf. 2024, 13, 62. [Google Scholar] [CrossRef]
- Guo, H.D.; Liang, D.; Chen, F.; Sun, Z.C.; Liu, J. Big Earth Data Facilitates Sustainable Development Goals. Bull. Chin. Acad. Sci. 2021, 36, 874–884. [Google Scholar] [CrossRef]
- Schönberger, J.L.; Frahm, J.-M. Structure-from-Motion Revisited. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 4104–4113. [Google Scholar]
- Zhang, X.; Pan, H.; Zhou, S.; Zhu, X. Self-Calibration Strip Bundle Adjustment of High-Resolution Satellite Imagery. Remote Sens. 2024, 16, 2196. [Google Scholar] [CrossRef]
- Bullinger, S.; Bodensteiner, C.; Arens, M. 3D Surface Reconstruction from Multi-Date Satellite Images. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. 2021, XLIII-B2-2021, 313–320. [Google Scholar] [CrossRef]
- Hirschmüller, H.; Scholten, F.; Hirzinger, G. Stereo Vision Based Reconstruction of Huge Urban Areas from an Airborne Pushbroom Camera (HRSC). In Pattern Recognition: 27th DAGM Symposium; Springer: Berlin, Germany, 2005; pp. 58–66. [Google Scholar] [CrossRef]
- Sivakumar, V.; Kumar, B.; Srivastava, S.K.; Krishna, B.G.; Srivastava, P.K.; Kiran Kumar, A.S. DEM Generation for Lunar Surface using Chandrayaan-1 TMC Triplet Data. J. Indian Soc. Remote Sens. 2012, 40, 551–564. [Google Scholar] [CrossRef]
- Dong, Q.; Gao, X.; Cui, H.; Hu, Z. Robust Camera Translation Estimation via Rank Enforcement. IEEE Trans. Cybern. 2022, 52, 862–872. [Google Scholar] [CrossRef]
- Facciolo, G.; de Franchis, C.; Meinhardt-Llopis, E. Automatic 3D Reconstruction from Multi-Date Satellite Images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
- Gao, J.; Liu, J.; Ji, S. Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching. arXiv 2021, arXiv:2109.11121. [Google Scholar] [CrossRef]
- Hartley, R.I.; Saxena, T. The Cubic Rational Polynomial Camera Model; Technical Report; G.E. Corporate R&D and CMA Consulting: Niskayuna, NY, USA, 2001. [Google Scholar]
- Tao, C.V.; Hu, Y. 3D reconstruction methods based on the rational function model. Photogramm. Eng. Remote Sens. 2002, 68, 705–714. [Google Scholar]
- Seo, D.U.; Park, S.Y. 3D Reconstruction from Multi-view Google Earth Satellite Stereo Images by Generating Virtual RPC based on 3D Homography-based Georeferencing. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2023, 48, 1075–1080. [Google Scholar] [CrossRef]
- Grodecki, J.; Dial, G. Block Adjustment of High-Resolution Satellite Images Described by Rational Polynomials. Photogramm. Eng. Remote Sens. 2003, 69, 59–68. [Google Scholar] [CrossRef]
- Noh, M.-J.; Howat, I.M. Automated stereo-photogrammetric DEM generation at high latitudes: Surface Extraction with TIN-based Search-space Minimization (SETSM) validation and demonstration over glaciated regions. GISci. Remote Sens. 2015, 52, 198–217. [Google Scholar] [CrossRef]
- d’Angelo, P.; Reinartz, P. DSM based orientation of large stereo satellite image blocks. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2012, 39, 209–214. [Google Scholar] [CrossRef]
- Jacobsen, K. Analysis and correction of systematic height model errors. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2016, 41, 333–339. [Google Scholar] [CrossRef]
- Singh, M.K.; Gupta, R.D.; Snehmani; Bhardwaj, A.; Ganju, A. Effect of sensor modelling methods on computation of 3-D coordinates from Cartosat-1 stereo data. Geocarto Int. 2015, 31, 506–526. [Google Scholar] [CrossRef]
- Zheng, E.; Wang, K.; Dunn, E.; Frahm, J.-M. Minimal Solvers for 3D Geometry from Satellite Imagery. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017. [Google Scholar]
- Goossens, R.; Schmidt, M.; Menz, G. High resolution DEM and ortho-photomap generation from TERRA-ASTER data—Case study of Morocco. In Geoinformation for European-Wide Integration; Benes, T., Ed.; Millpress: Rotterdam, The Netherlands, 2003; pp. 19–24. [Google Scholar]
- Wang, P.; Shi, L.; Chen, B.; Hu, Z.; Qiao, J.; Dong, Q. Pursuing 3-D Scene Structures with Optical Satellite Images from Affine Reconstruction to Euclidean Reconstruction. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5632214. [Google Scholar] [CrossRef]
- de Franchis, C.; Meinhardt-Llopis, E.; Michel, J.; Morel, J.-M.; Facciolo, G. An Automatic and Modular Stereo Pipeline for Pushbroom Images. ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci. 2014, II-3, 49–56. [Google Scholar] [CrossRef]
- Stucker, C.; Schindler, K. ResDepth: A Deep Residual Prior For 3D Reconstruction from High-resolution Satellite Images. ISPRS J. Photogramm. Remote Sens. 2022, 183, 560–580. [Google Scholar] [CrossRef]
- Mao, Y.; Chen, K.; Zhao, L.; Chen, W.; Tang, D.; Liu, W.; Wang, Z.; Diao, W.; Sun, X.; Fu, K. Elevation Estimation-Driven Building 3D Reconstruction from Single-View Remote Sensing Imagery. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5608718. [Google Scholar] [CrossRef]
- Ma, W.; Wen, Z.; Wu, Y.; Jiao, L.; Gong, M.; Zheng, Y.; Liu, L. Remote Sensing Image Registration with Modified SIFT and Enhanced Feature Matching. IEEE Geosci. Remote Sens. Lett. 2017, 14, 3–7. [Google Scholar] [CrossRef]
- Fischler, M.A.; Bolles, R.C. Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography. Commun. ACM 1981, 24, 381–395. [Google Scholar] [CrossRef]
- Bu, P.; Wang, H.; Dou, Y.; Wang, Y.; Yang, T.; Zhao, H. Weighted omnidirectional semi-global stereo matching. Signal Process. 2024, 220, 109439. [Google Scholar] [CrossRef]
- Hu, A.; Li, A.; Jin, X.; Zou, D. ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement. arXiv 2025, arXiv:2504.07418. [Google Scholar] [CrossRef]
- Tomasi, C.; Kanade, T. Shape and Motion from Image Streams under Orthography: A Factorization Method. Int. J. Comput. Vis. 1992, 9, 137–154. [Google Scholar] [CrossRef]
- Shi, R.; Zhang, Z.; Qiu, X.; Ding, C. A Novel Gradient Descent Least-Squares (GDLS) Algorithm for Efficient Gridless Line Spectrum Estimation with Applications in Tomographic SAR Imaging. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5208313. [Google Scholar] [CrossRef]
- Gao, D.; Li, P.Z.X.; Sze, V.; Karaman, S. GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians. IEEE Robot. Autom. Lett. 2025, 10, 2774–2781. [Google Scholar] [CrossRef]









| Parameter | Value |
|---|---|
| Orbit | 505 km |
| Bands | 8~10.5 μm |
| 10.3~11.3 μm | |
| 11.5~12.5 μm | |
| Swath | 300 km |
| Spatial resolution | 30 m |
| Revisit period | 11 days |
| maximum intersection angle | 6.5° |
| minimum intersection angle | 0.57° |
| Scene Number | Terrain | Intersection Angle (°) | Region |
|---|---|---|---|
| 1 | Mountain | 6.5 | Fuxin County, Liaoning Province |
| 2 | Fuxin County, Liaoning Province | ||
| 3 | 3.45 | Jinzhou City, Liaoning Province | |
| 4 | Zhenping County, Henan Province | ||
| 5 | 0.57 | Fuxin City, Liaoning Province | |
| 6 | Lushi County, Henan Province | ||
| 7 | Hill | 6.5 | Kazuo County, Liaoning Province |
| 8 | 3.45 | Luohe City, Henan Province | |
| 9 | Nanyang City, Henan Province | ||
| 10 | 0.57 | Dengzhou City, Henan Province | |
| 11 | Chifeng City, Neimenggu Province | ||
| 12 | Plain | 6.5 | Zhumadian City, Henan Province |
| 13 | 3.45 | Tanghe County, Henan Province | |
| 14 | 0.57 | Jiamusi City, Heilongjiang Province |
| Terrain | Intersection Angle (°) | Affine RMSE (m) | Final RMSE (m) | Reduction (%) |
|---|---|---|---|---|
| Mountain | 0.57 | 85.53 | 50.37 | 41.10 |
| Hill | 3.45 | 42.74 | 28.70 | 32.85 |
| Plain | 6.5 | 8.32 | 5.41 | 34.98 |
| Scene Number | Terrain | Intersection Angle | RMSE (m) | Mean Error (m) | (%) | (%) |
|---|---|---|---|---|---|---|
| 1 | Mountain | 6.5° | 30.32 | −4.94 | 93.99 | 98.54 |
| 2 | 31.89 | 1.41 | 93.49 | 98.97 | ||
| 3 | 3.45° | 47.79 | −4.68 | 79.48 | 94.36 | |
| 4 | 52.05 | −11.08 | 88.06 | 94.10 | ||
| 5 | 0.57° | 50.61 | 1.77 | 78.48 | 91.82 | |
| 6 | 50.37 | −5.26 | 81.17 | 91.97 | ||
| Mean | 43.84 | 4.86 | 85.78 | 94.96 | ||
| 7 | Hill | 6.5° | 19.89 | 1.30 | 98.58 | 99.94 |
| 8 | 3.45° | 38.10 | 7.27 | 89.57 | 95.56 | |
| 9 | 28.70 | −4.41 | 96.37 | 98.82 | ||
| 10 | 0.57° | 32.71 | −1.55 | 93.11 | 98.29 | |
| 11 | 35.48 | −6.92 | 91.84 | 98.21 | ||
| Mean | 27.63 | 3.29 | 95.21 | 98.86 | ||
| 12 | Plain | 6.5° | 5.41 | −0.81 | 100 | 100 |
| 13 | 3.45° | 7.77 | −0.27 | 100 | 100 | |
| 14 | 0.57° | 8.96 | 0.79 | 100 | 100 | |
| Mean | 7.72 | 0.51 | 100 | 100 |
| Scene Number | Intersection Angle | Terrain | RMSE (m) | Mean Error (m) | (%) | (%) |
|---|---|---|---|---|---|---|
| 1 | 6.5° | Mountain | 30.32 | −4.94 | 93.99 | 98.54 |
| 2 | 31.89 | 1.41 | 93.49 | 98.97 | ||
| 7 | Hill | 19.89 | 1.30 | 98.58 | 99.94 | |
| 12 | Plain | 5.41 | −0.81 | 100 | 100 | |
| Mean | 18.99 | 1.71 | 97.21 | 99.49 | ||
| 3 | 3.45° | Mountain | 47.79 | −4.68 | 79.48 | 94.36 |
| 4 | 52.05 | −11.08 | 88.06 | 94.10 | ||
| 8 | Hill | 38.10 | 7.27 | 89.57 | 95.56 | |
| 9 | 28.70 | −4.41 | 96.37 | 98.82 | ||
| 13 | Plain | 7.77 | −0.27 | 100 | 100 | |
| Mean | 34.88 | 5.54 | 90.53 | 96.73 | ||
| 5 | 0.57° | Mountain | 50.61 | 1.77 | 78.48 | 91.82 |
| 6 | 50.37 | −5.26 | 81.17 | 91.97 | ||
| 10 | Hill | 32.71 | −1.55 | 93.11 | 98.29 | |
| 11 | 35.48 | −6.92 | 91.84 | 98.21 | ||
| 14 | Plain | 8.96 | 0.79 | 100 | 100 | |
| Mean | 35.63 | 3.26 | 88.92 | 96.06 |
| Tile Size (px × px) | MAE (m) | Median Error (m) | RMSE (m) | Affine Time (min) | RPC Time (min) | Total Time (min) |
|---|---|---|---|---|---|---|
| 25 × 25 | Fail (insufficient correspondences; RPC not executed) | |||||
| 50 × 50 | 29.73 | −0.76 | 47.53 | 5.93 | 36.04 | 41.97 |
| 100 × 100 | 39.26 | −2.9 | 56.96 | 1.57 | 45.67 | 47.24 |
| 200 × 200 | 54.29 | 2.42 | 72.94 | 0.42 | 46.50 | 46.92 |
| k | MAE (m) | Median Error (m) | RMSE (m) | Affine Time (min) | RPC Time (min) | Total Time (min) |
|---|---|---|---|---|---|---|
| 8 | 30.07 | −0.77 | 47.90 | 5.93 | 36.52 | 42.45 |
| 10 | 29.73 | −0.76 | 47.53 | 5.93 | 36.04 | 41.97 |
| 12 | 29.46 | −0.73 | 47.27 | 5.93 | 36.17 | 42.10 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Xu, Y.; Liang, Q.; Guo, J.; Du, X.; Wu, C.; Li, X.; Chen, F. Accurate 3D Terrain Reconstruction for Multi-View Thermal Infrared Images with Small Intersection Angles. Remote Sens. 2026, 18, 681. https://doi.org/10.3390/rs18050681
Xu Y, Liang Q, Guo J, Du X, Wu C, Li X, Chen F. Accurate 3D Terrain Reconstruction for Multi-View Thermal Infrared Images with Small Intersection Angles. Remote Sensing. 2026; 18(5):681. https://doi.org/10.3390/rs18050681
Chicago/Turabian StyleXu, Yixuan, Quan Liang, Junhong Guo, Xinwang Du, Chao Wu, Xiaoyan Li, and Fansheng Chen. 2026. "Accurate 3D Terrain Reconstruction for Multi-View Thermal Infrared Images with Small Intersection Angles" Remote Sensing 18, no. 5: 681. https://doi.org/10.3390/rs18050681
APA StyleXu, Y., Liang, Q., Guo, J., Du, X., Wu, C., Li, X., & Chen, F. (2026). Accurate 3D Terrain Reconstruction for Multi-View Thermal Infrared Images with Small Intersection Angles. Remote Sensing, 18(5), 681. https://doi.org/10.3390/rs18050681

