1. Introduction
Hyperspectral imagery is a powerful tool that provides narrow bands data of observation targets, enabling the extraction of more detailed information about the physical and chemical properties of targets compared to multispectral imagery. The fine spectral imagery data cubes generated from hyperspectral imagery offer a platform for the quantitative analysis of the characteristics of observation targets [
1]. With the advancements in computer hardware and efficient processing techniques, hyperspectral imagery has been widely used in various fields, such as precision agriculture [
2,
3,
4], land resources management [
5,
6], water resource analysis [
7,
8], and natural disaster analysis [
9].
In recent decades, hyperspectral imaging has been primarily conducted using spaceborne and airborne platforms. Spaceborne sensors offer more stable working conditions, fixed revisit periods, and higher efficiency. On the other hand, airborne sensors provide greater flexibility in observation planning, higher ground sample distance (GSD), and signal-to-noise ratio (SNR) [
10]. Recently, unmanned aerial vehicles (UAVs) equipped with lightweight hyperspectral instruments have gained popularity due to their low operating costs and high flexibility [
11]. UAV-borne instruments typically offer the highest GSD due to the low flight height of the UAVs. Consequently, they are well-suited for local observation tasks as compared to airborne sensors. In contrast, airborne advanced hyperspectral imagers (AHIs) have longer operation times and higher payloads, making them more advantageous for certain observation tasks.
Georeferencing is a fundamental step for hyperspectral imagery analysis. Two categories of georeferencing methods exist, direct and indirect georeferencing. Direct georeferencing utilizes Global Navigation Satellite System (GNSS) and Inertial Measurement Units (IMU) to acquire the position and orientation of each image frame [
12]. In contrast, indirect georeferencing involves the extraction of image feature points, measurement of ground control points (GCPs), and bundle adjustment for position and orientation of each frame [
13,
14,
15].
Accurate calibration of boresight is crucial to ensure the precision of direct georeferencing [
16]. The imaging system’s high-precision GNSS and IMU sensors enable the projection of image data onto geographic coordinates without the need for costly ground control points (GCPs) [
12]. However, errors in the extrinsic parameters between the imaging subsystem and IMU are inevitable during long-term usage and transformation of the hyperspectral imaging system, with the boresight error representing the rotational component of these errors. While displacement errors can be limited by mechanical constraints, boresight errors are difficult to constrain. The resulting extrinsic parameter errors at each moment of AHI operation introduces georeferencing mismatches, posing obstacles for successive applications [
17].
Boresight calibration is easier to perform in a 2D focal plane array camera than in a push broom hyperspectral imager, which is because the former captures one 2D image per frame with a rigid projective relationship between all pixels [
18]. The relationship between corresponding points from two images is described by epipolar geometry, which can be utilized with feature extraction methods, such as SIFT [
19,
20], SURF [
21], or ORB [
22], to find corresponding feature pairs. The boresight error between the camera and IMU can then be solved by the equation
on Euclidean group [
23]. This problem is referred to as hand–eye calibration in robotics [
24]. In comparison, a pushbroom hyperspectral imager captures one 1D image per frame [
25], and the 2D hyperspectral image is generated by georeferencing the 1D image with data from the IMU sensor. Therefore, epipolar geometry is not applicable.
To obtain correct orientation parameters in the imaging process, constraints such as the position of photographed points or epipolar geometry are necessary. Muller et al. [
26] minimized the difference between GCPs and image points using collinearity equation linearized by a Taylor series and an iterative least squares fit. With the help of DEM data and 5 GCPs, the final mismatches are around 1 pixel. Lenz et al. [
27] used the DEM and automatically extracted tie points to calibrate boresight in a mountainous area. They minimized the RMS error over the distance among corresponding tie points with Fletcher’s modified Levenberg–Marquard Algorithm [
28], achieving final RMS residuals of 2∼4 pixels with the SURF [
21] feature extraction method. Habib et al. [
29] proposed three strategies for boresight calibration, achieving the lowest RMSE values with the third strategy, which improves the intersection of conjugate light rays corresponding to tie points. The third strategy demonstrated RMSE values of 2 pixels perpendicular to flight direction and about 1 pixel parallel to flight direction in six datasets.
Plane-based methods have been employed for Lidar boresight correction. Skaloud and Lichti [
30] used least-squares adjustment with direct-georeferencing equations and common plane points to determine Lidar boresight angles and other parameters. Ref. [
31] introduced a point-to-plane model to enhance accuracy. However, unlike Lidar data, depth information cannot be directly obtained from image data, making it challenging to correct camera boresight angles using plane constraints.
Traditional boresight calibration methods rely on costly GCPs and calibration flights, which are not suitable for large-area observation tasks. To address the limitations of traditional methods and improve calibration accuracy, we propose a novel boresight calibration method that utilizes open DEM datasets with the coplanarity constraint. The coplanarity constraint is achieved by extracting features with similar disparity to retrieving their relative height from the DEM based on the direct georeferencing location. This approach also reduces the influence of shadows and buildings on relative height retrieval. Finally, we use an optimization method to find the optimal solution among overlapping areas with the relative height derived from DEM. The Airborne Multi-Modality Imaging Spectrometer (AMMIS) is an advanced hyperspectral imaging system developed by the Shanghai Institute of Technical Physics, Chinese Academy of Sciences, with the purpose of facilitating the efficient and extensive acquisition of hyperspectral data cubes. We validated the proposed method on the dataset collected by the AMMIS hyperspectral imager and open DEM datasets. The results demonstrated that the proposed method significantly improves the calibration accuracy, particularly in cases of low-resolution DEM data, and exhibits better consistency across different open DEM datasets.
5. Discussion
This study explores the use of relative height data from DEM datasets to improve the accuracy of boresight calibration in large-area aerial remote sensing imagery, with a focus on the roll angle. The variability in average elevation in such imagery can lead to deviations from the ground truth, which can be mitigated by incorporating elevation data from open DEM datasets. To address the low accuracy and resolution of absolute elevation data, we employ two strategies to enhance calibration accuracy. Firstly, we utilize the coplanarity constraint to filter ground-level image points and decrease the uncertainty of the height obtained from the DEM dataset. Secondly, we incorporate relative height data into the calibration process by minimizing the reprojection error with relative height of overlapping area. This approach provides more precise constraints than the method using the averaging elevation. Furthermore, the absolute elevation error has a minor impact on our approach, as demonstrated by the consistency between the results obtained from different datasets, as presented in
Table 11.
Theconfiguration of opposite flight direction between adjacent strips is a crucial factor that affects the accuracy of boresight calibration. When opposite strips are used, the global minimum exists between adjacent overlapping areas, which can serve as a good initial state for determining the correct boresight angles based on relative height data derived from DEM data. This approach involves using the midpoints of corresponding point pairs to retrieve their elevation from the DEM data. Ifthe flight direction is the same between adjacent strips, the midpoint will deviate from the true coordinates at large boresight angles, leading to inaccuracies in the calibration process.
The horizontal resolution of the DEM dataset is a critical factor that affects calibration accuracy, as it has a significant impact on the standard deviation of the DEM elevation data. While the use of relative elevation data can somewhat alleviate the dependency on the accuracy of the DEM, low-resolution DEM datasets are still observed to decrease calibration accuracy. Additionally, since the DEM datasets were generated before the AHI images, the variation in the ground targets over time can adversely affect the calibration accuracy, as seen in
Figure 6 and
Figure 7. Combining high-precision 3D point cloud data with calibration could be a promising approach to further improve accuracy.