1. Introduction
Multi-view images captured by aerial or unmanned aerial vehicle (UAV) platforms have become a major source of data in 3D city modeling projects [
1,
2,
3,
4]. To alleviate occlusions and increase observation redundancy, in the last decade, images captured by different cameras, such as vertical and oblique views in multi-camera systems, are combined to product photo-realistic 3D models with better geometry quality and textures [
5,
6,
7].
Because the accuracy of modeling image distortions and orientations directly affects the product quality in subsequent image-processing steps such as dense image matching, 3D mesh generation, and texture mapping, numerous methods have been developed to recover EOPs and IOPs (including lens distortion parameters) [
8,
9,
10,
11,
12].
Among existing photogrammetry research and engineering practices, self-calibrated bundle adjustment (SCBA), through which IOPs and EOPs are simultaneously estimated according to image tie points, is an effective method for decreasing re-projection errors in 2D space and intersection errors in 3D space [
13,
14,
15,
16,
17]. Different from the traditional laboratory or field calibration processes, self-calibration (SC) methods treat IOP calibration as part of routine photogrammetric procedures in every project through bundle adjustment (BA), whether the cameras have been pre-calibrated or not [
18].
In recent years, terrestrial images captured by hand-held cameras or mobile mapping platforms have been integrated with aerial views through structure-from-motion and multi-view stereo pipelines to produce better 3D maps and models [
19,
20,
21,
22]. Due to large differences in viewpoint and scale and possible illumination conditions, automatic feature matching for cross-platform images is non-trivial work [
21]. The numbers and distributions of cross-platform tie points are not as favorable as those for inner-platform images. Hence, in 3D modeling applications that integrate images captured by aerial and terrestrial platforms, images taken in the same platform are often first aligned through BA using only inner-platform tie points. Then, aerial and terrestrial images are co-registered using a cross-platform involving BA [
19,
20,
21]. Although IOPs should be refined during the inner-platform BA, it remains unclear whether the IOPs in the cross-platform BA should be fixed.
On one hand, the IOPs are recovered through SCBA with inner-platform tie points, and the images used in the cross-platform remain unchanged; thus, the IOPs in the second BA should be physically the same as those in the first BA. IOP fixation could reduce the number of unknown parameters and stabilize the calculation of the nonlinear least square problem. On the other hand, according to SCBA theory, refining the IOPs in the cross-platform BA may mathematically improve the modeling quality of the image formatting process and thus enhance the co-registration quality between the aerial and terrestrial images.
Both strategies seem reasonable. Hence, to investigate the optimal SC strategy for the BA of aerial–terrestrial integrated images, four aerial–terrestrial BA settings were experimentally compared and analyzed in this study. According to the experimental results, recommendations on the integration of aerial and terrestrial images blocks in BA are provided.
The remainder of this paper is organized as follows:
Section 2 reviews the existing work on SCBA.
Section 3 introduces the four plausible SCBA strategies and the experimental datasets and procedure.
Section 4 reports the experimental results and analysis.
Section 5 presents the discussion, conclusions, and future perspectives.
2. Related Works
The geometry quality of image-based 3D mapping products largely relies on the precision of the recovered image IOPs and EOPs [
23]. In traditional photogrammetry engineering, the IOPs of metric cameras are first calibrated in the laboratory or field before image capture [
9,
13,
24]. After image collection, EOPs are recovered through BA according to image correspondence and a few ground control points [
4,
25,
26]. Previous investigations have proved that when a sufficiently accurate camera model is used, the 3D mapping inaccuracy related to systematic errors in IOPs is negligible [
27]; however, this is not the case in close-range photogrammetry [
28,
29].
With the rapid development of UAVs, consumer-level cameras are widely used in small- or clustered-area survey tasks [
30,
31,
32,
33,
34,
35]. Compared with traditional aerial photogrammetry, which collects images with only vertical views, adding oblique cameras could not only result in better façade information but also favor geometry measuring accuracy due to the larger intersection angles between overlapped images [
6,
36,
37]. This advantage is more noticeable for UAV photogrammetry since the flight plans are far more flexible [
38,
39]. While image collection is relatively easy, a rigorous laboratory or field camera calibration process is often neglected. Moreover, the images are often captured by unprofessional operators with inferior geometry networks. Furthermore, the sensor stability may be imperfect, as 3D mapping and modeling are not the main purposes of the camera design.
Hence, in most UAV photogrammetry applications, SCBA is commonly adopted to refine the IOPs to improve the 3D accuracy of object mapping [
3,
26,
32,
40]. In standard SCBA, the IOPs are treated as unknown parameters with initial observations; moreover, the IOPs, EOPs, and 3D coordinates of tie points during the BA process are refined according to the collinearity equation [
18]. Regarding the sensor stability under different temperatures and humidities, SCBA is often conducted in every image block, because the images are collected under various conditions.
Apart from the conventional focal length and the location of principal points, additional parameters are used to describe the distortions that occur between 3D points and their locations in 2D images, because the image formatting process is not a perfect perspective transformation [
24]. As pointed out in previous works [
41], there are two major categories of additional parameters: the physical and mathematical models [
13,
41,
42]. The physical models simulate the systematic errors caused by optics, while the mathematical models approximate the simulation process through algebra expansion.
The Brown model [
13] is the most widely used model in close-range photogrammetry, and it has been incorporated into numerous image-processing packages and commercial softwares in United States, China, and Russia [
43,
44,
45]. Although SCBA implementation with additional parameters may improve geometric accuracy in practice, the correlation between parameters may weaken the BA process [
18]. Moreover, to obtain satisfactory results, a large number and good distribution of image tie points are required [
46].
In recent years, 3D environmental modeling applications have combined images captured by airborne cameras and terrestrial platforms [
20,
21,
39]. Images captured by a singular platform and processed through BA also require cross-platform BA for accurate co-registration in 3D mapping and modeling tasks [
19].
In addition to the viewing perspective and image scales, images obtained by aerial and terrestrial platforms also vary in other aspects. First, the networks for aerial images are often more stable, because aerial views have more image connectivity than terrestrial views. Moreover, owing to the differences in looking directions and scenes, terrestrial images might have weak textures at the corners or edges. Thus, the distributions of automatic tie points are often more unsymmetrical than for aerial datasets.
However, it is still unclear whether it is best to re-compute the IOPs during the BA of aerial and terrestrial images. On one hand, the SCBA are already adopted in the BA, which align images captured by the same platform; the IOPs can be treated as stable because the images remain unchanged [
19]. Moreover, IOP fixation can reduce the number of unknown parameters, which may stabilize the EOP calculation process and possibly enhance the estimation accuracy. Furthermore, the additional cross-platform tie points that align aerial and terrestrial images can result in uneven tie point distribution [
39]. Because more tie points (mainly the cross-platform tie points) are incorporated into the BA process, the image geometric network is varied. From a mathematical viewpoint, the integrated refinement of both IOPs and EOPs may improve the recovery of the image formatting process and thereby improve the co-registration between aerial and terrestrial images.
To investigate the optimal SC strategy for the BA of aerial–terrestrial integrated images after the inner-platform SCBA, four SC strategies for the BA were compared and analyzed using real datasets.
5. Discussion
According to the experimental results and statistical analysis,
Table 12 summarizes the network stability, tie point distribution uniformity, and best SC strategies for the seven tested datasets.
Because the tie point distribution in aerial images is fairly even, the BA of aerial and terrestrial images may not require a second round of SC for aerial cameras. Moreover, fixing the IOPs of aerial cameras (which have already been refined) during the BA could reduce the number of unknown parameters.
However, this is not the case for terrestrial cameras. Owing to the different shooting conditions, the tie point coverage in terrestrial images is insufficient to regain the physical distortion parameters through theoretical lens calibration. Thus, herein, the IOPs obtained in the first round of the SCBA of terrestrial images inadequately represented the real IOPs of the terrestrial cameras. Hence, the IOPs of terrestrial images can be refined in the second-round BA that combines both aerial and terrestrial images if the tie points have relatively even distribution and large format coverage.
Considering this assumption, the FA_RT method will provide the best co-registration results for the Stadthaus, Verwaltung, and Lohnhalle datasets. However, the minimum rRMSE values acquired through the FA_FT strategy belonged to the Stadthaus dataset, presumably because the networks of aerial images in the Stadthaus dataset were not stable. Therefore, refining the IOPs of terrestrial cameras may degrade the EOPs of aerial images and result in suboptimal cross-platform co-registration accuracy.
According to the experimental results and above analysis, some suggestions regarding the SC strategies in the BA of aerial and terrestrial image blocks are offered. First, for aerial images, with better tie point distribution than terrestrial images, it is better to fix the IOPs in the cross-platform BA. Second, for most cases, fixing the IOPs of terrestrial images in the second-round BA will improve the co-registration accuracy. Third, if the tie point distribution in terrestrial images is relatively even and comprehensive and the networks of aerial images are reasonably stable, refining the IOPs of terrestrial cameras in the BA may yield the best results.
Future tests will investigate the effect of SC strategies on mathematical lens distortion parameters such as the Fourier SC additional parameters [
34] in cross-platform image BA. Moreover, automatic optimal SC strategy selection methods related to cross-platform image orientation should be developed and investigated, to build unified precision 3D mapping references for multi-platform photogrammetry.