GNSS-RTK Adaptively Integrated with LiDAR/IMU Odometry for Continuously Global Positioning in Urban Canyons

: Global Navigation Satellite System Real-time Kinematic (GNSS-RTK) is an indispensable source for the absolute positioning of autonomous systems. Unfortunately, the performance of the GNSS-RTK is signiﬁcantly degraded in urban canyons, due to the notorious multipath and Non-Line-of-Sight (NLOS). On the contrary, LiDAR/inertial odometry (LIO) can provide locally accurate pose estimation in structured urban scenarios but is subjected to drift over time. Considering their complementarities, GNSS-RTK, adaptively integrated with LIO was proposed in this paper, aiming to realize continuous and accurate global positioning for autonomous systems in urban scenarios. As one of the main contributions, this paper proposes to identify the quality of the GNSS-RTK solution based on the point cloud map incrementally generated by LIO. A smaller mean elevation angle mask of the surrounding point cloud indicates a relatively open area thus the correspondent GNSS-RTK would be reliable. Global factor graph optimization is performed to fuse reliable GNSS-RTK and LIO. Evaluations are performed on datasets collected in typical urban canyons of Hong Kong. With the help of the proposed GNSS-RTK selection strategy, the performance of the GNSS-RTK/LIO integration was signiﬁcantly improved with the absolute translation error reduced by more than 50%, compared with the conventional integration method where all the GNSS-RTK solutions are used.


Introduction
Location awareness is one of the fundamental tasks in autonomous systems [1][2][3][4]. Global Navigation Satellite System (GNSS) plays an irreplaceable role in global positioning. The differential carrier-phase-based GNSS positioning technique, referred to as Real-Time Kinematic (RTK), could achieve centimeter-level localization precision provided that the environment is open [5,6]. Since the development of RTKLIB [7], even commercial-grade receivers are supported to generate fixed positioning solutions. However, essentially as a kind of Time of Arrival (TOA) based localization technique, GNSS is highly vulnerable to signal reflection and blockage, namely multi-path and None Line of Sight (NLOS) [8] receptions. According to our recent open-sourced UrbanNav dataset in [9], the positioning error of the GNSS-RTK can easily be larger than 10 m in typical urban scenarios of Hong Kong via a commercial level u-blox M8T GNSS receiver. In other words, the GNSS-RTK is significant for providing globally referenced positioning but the accuracy is not guaranteed in urban canyons, with a strong reliance on the degree of urbanization [10].
Intuitively, to enhance the performance of GNSS-RTK standalone-based positioning, multi-sensor fusion is a promising approach with the complementary properties of heterogeneous sensors taken into consideration [11][12][13]. The integration of GNSS and Inertial fact that the existing LIO can provide accurate relative positioning in a short period which is shown in our previous work [33], can we only periodically select the good GNSS-RTK for effective fusion? More importantly, the performance of the GNSS-RTK relies heavily on the environment structures and the LIO can effectively reconstruct the surroundings using 3D point clouds. Inspired by this exciting feature and to fully leverage the merits of multi-sensors, this paper proposes an adaptive GNSS-RTK/LIO integration scheme by selecting the reliable GNSS-RTK solutions based on the incrementally generated 3D point clouds map from LIO. Therefore, during the GNSS-RTK/LIO integration, the divergence caused by abnormal GNSS solutions is eradicated from the source. Meanwhile, smoothness and continuity are ensured by the LIO. The drift of the 3D point cloud map is corrected by the selected reliable GNSS-RTK solutions. In conclusion, the contributions of this paper are highlighted as follows: • This paper proposed a GNSS-RTK availability assessment method, during which, unreliable GNSS measurements rejection is performed through mean elevation angle mask evaluation exploiting the registered 3D map that was incrementally constructed by LIO based on our previous work in [33]. It is worth mentioning that the mean elevation angle in this paper is defined as that generated by the buildings around the GNSS receiver rather than the satellites. • This paper proposed a GNSS-RTK/LIO integration scheme based on factor graph optimization. Concretely, the globally referenced positioning from reliable GNSS-RTK and relative pose estimation are integrated using the FGO [24] which is free of abnormal GNSS-RTK solutions.

•
The effectiveness of the proposed GNSS-RTK availability assessment and the adaptive sensor fusion is validated on three typical urban canyons datasets collected in Hong Kong [9].
The rest of the paper is structured as followed: In Section 2 materials and methods are presented. Concretely, an overview of the framework with pre-definitions of important mathematical symbols is introduced in Section 2.1. The GNSS-RTK availability assessment method which is the so-called GNSS-RTK selection procedure is described in Section 2.2. The adaptive sensor fusion method would be discussed in detail in Section 2.3. What is more, the experimental results are displayed in Section 3, along with discussions in Section 4. Conclusions and prospects are given in Section 5.

Overview of the Proposed Framework
The overview of the proposed framework is depicted in Figure 1. The boxes painted in blue are inherited from our previous work LC-LIO in [33] where we developed a factor graph-based LIO scheme. The yellow ones represent the main contributions in this paper. As receiving the 3D point clouds and the IMU ego-motion measurements, the LiDAR factor and IMU factor are constructed via scan-matching [43] and IMU pre-integration [44] respectively, and then jointly optimized in the sliding window based local FGO. A globally consistent map consisting of registered 3D points is incrementally generated with each local optimization accomplished. For more details about the LIO adopted in this paper, the full paper [33] is recommended. The GNSS raw data produced by the rover and station are firstly processed by RTKLIB [7] to obtain GNSS-RTK solutions. For each of the epochs, the surrounding local map is extracted from the global map. The mean elevation angle mask could be calculated according to the 3D point cloud map, after which GNSS-RTK availability is assessed. Given a predefined threshold, those with heavy mean elevation angle masks, indicating encirclement by skyscrapers, towering trees, etc., are supposed to be unreliable and excluded. Others with small mean elevation angle masks are in relatively open areas and are assumed to be reliable. A global FGO is executed for the fusion of LiDAR factor, IMU factor and GNSS-RTK factor created by reliable GNSS-RTK. A continuous and accurate georeferenced position is finally achieved. The framework of the proposed method that adaptively integrate GNSS-RTK [7] with LiDAR/IMU [33]. 1 denotes an example of the global point cloud map produced by LiDAR scan matching during local FGO. 2 denotes an example of the local map for mean elevation angle mask calculation. 3 demonstrates the mean elevation angle mask for GNSS-RTK availability assessment. The meaning of the annotation on the mask would be introduced in detail in Section 3.2.1.
Before diving into the expanded introduction of the proposed method, the coordinates involved are clarified as follows:

•
The LiDAR frame l {·}: Originated at the geometric center of the LiDAR.

•
The IMU frame b {·}: Originated at the geometric center of the IMU.

•
The GNSS measurement frame g {·}: Originated at the GNSS measurement with the orientation of the axes consistent with that of the LiDAR frame.

•
The local world frame w {·}: Coincide with the initial LiDAR frame.

•
The east-north-up (ENU) frame n {·}: Originated at the initial vehicle position with the x-axis, y-axis, and z-axis pointing to the east, north, and up respectively.
In this paper, matrices are denoted in upper-case and bold letters. Vectors are denoted in lowercase and bold letters. Variable scalars are denoted as lower-case and italic letters. Poses under different frames could be transferred via multiplication rules defined in the 3D Special Euclidean group (SE(3)) [45]. Concretely, n T w represents the transformation matrix from the local world frame to the ENU frame. Following the Equation (1), the i-th map points w f i ∈ R 3 under the world frame would be transformed to the ENU frame and denotes as n f i :

GNSS-RTK Availablity Assessment
In this section, the mean elevation angle mask generation and the accompanying GNSS-RTK availability assessment are presented.
The first step is the local map extraction. Each GNSS-RTK resolved by RTKLIB [7] under the Earth-centered, Earth-fixed (ECEF) coordinate system is converted to the ENU frame through GeographicLib [46]. As shown in Figure 2a, the local map around it is extracted from the global map. The map is originally expressed under the local world frame and then converted to the ENU frame for convenience. What is more, in the "east" and "north" directions, the local map is specified by fixed Euclidean distance parameters. While in the "up" direction, all the points are included to completely reflect the density and the altitude of the obstacles. For computational efficiency, points lower than 1 m in the local map are removed. The second step is a sub-region division which takes the evenness and the robustness into consideration. A total of 360 degrees around the GNSS rover is divided into 36 subregions according to the azimuth angle, for example, the point with an azimuth angle of smaller than 10 degrees belongs to the first subregion, and that bigger than 10 degrees but smaller than 20 degrees belongs to the second subregion. It is worth mentioning that the reliability of the GNSS measurements is not yet clear until the assessment is finished. Therefore, the chronologically closest pose estimated by local FGO is exploited instead of quantifying the relative distance between the GNSS rover and the scanned objects. Regarding the k-th GNSS-RTK solution n g k = n x g k , n y g k , n z g k T , its correspondent position generated by local FGO is denoted as ng k = n xg k , n yg k , n zg k T . For a point n f i = n x f i , n y f i , n z f i T on the extracted local map, its azimuth angle is denoted as az f i is calculated as: Its elevation angle el f i is defined as: Considering all the points in one subregion, their elevation angles are sorted among which the upper quartile is taken as the elevation angle of this subregion. Then the mean elevation angle of all the subregions denoted as el g k , is taken as the mean elevation angle mask of the GNSS RTK measurement g n k . Provided a threshold el th , the GNSS-RTK availability assessment is finally achieved by: The effect of different thresholds is evaluated and discussed in the section of the experiment result.

Adaptive GNSS-RTK/LIO Fusion
The sensor fusion framework is constructed to estimate the motion states of the sensor carrier. Methods for such state estimation problems are extensively investigated which could be categorized as filter-based and non-linear optimization-based [45]. The filterbased method is typically a predictor and corrector architecture subjecting to the first-order Markov hypothesis [47]. The current state to be estimated is first predicted based on the last state and then corrected according to the motion model and the measurement model respectively [48].
The non-linear optimization-based method is typically a non-linear least-square problem. Concretely, the state estimation problem could be formulated as a Maximum A Posteriori (MAP) problem and then be converted to a Maximum likelihood Estimation (MLE) via Bayes' Theorem [49,50]. With the assumption of the Gaussian noise model, through negative logarithmic transformation, it is finally expressed as a nonlinear leastsquares problem [24]. Each known measurement, based on which the state estimation is performed, could be expressed as a function of the states to be estimated. The leastsquares term, commonly called a residual term, is the Mahalanobis distance between the known measurements and its approximation derived from the states. The residuals are expected to be small and thus provide constraints to the states. The objective function during the non-linear optimization is the sum of all the residuals. A group of states that minimizes the objective function is considered as the best solution for the estimator [24]. As shown in our recent work [41], compared with the filter-based estimator such as EKF, the non-linear optimization-based ones can better employ the correlation between consecutive epochs of measurements simultaneously. Furthermore, the computational efficiency of non-linear optimization-based methods is enhanced by the sliding-window manner [51] which involves states over some time rather than all the states during an optimization.
The non-linear optimization-based state estimation framework could be interpreted as a factor graph, namely FGO. States to be estimated are denoted as vertexes. Residuals constraining the states are expressed as edges and called factors in the graph. The FGO aims at finding a group of states that conform to all edges best [24].
The factor graph of the adaptive GNSS-RTK/LIO fusion procedure, that is the global FGO module in Figure 2, is displayed in Figure 3. The states to be estimated in one window is the 6D pose at the frequency of estimations from LIO represented as where w is the window size, n R k ∈ Special Orthogonal Group (SO (3)) [45] is the relative rotation and n p k ∈ R 3 is the absolute translation. The n R k could be expressed in Lie algebra [52] for derivation calculation as: ϕ k ∈ R 3 is the correspondent Lie algebra of n R k . (·) ∧ is the skew-symmetric matrix of ϕ k . exp(·) represents the exponential transformation namely Rodrigues's Formula [53] between the two patterns. The LiDAR factor constructed from scan-matching provides relative motion constraints. The motion increment derived from scan-matching is taken as the known measurements functioning as a reference, while that computed via states to be estimated are considered as observed measurements. The factor is designed to minimize the difference between them as follows: in which n T l k are solved by scan-matching. The IMU factor derived from pre-integration is formulated in a similar way. The motion increment provided by IMU pre-integration and that derived from the states to be estimated are considered known measurements and observed measurements respectively. The difference between them namely the IMU factor is defined as below: (3) is the rotation and translation increment pre-integrated by IMU raw data with the bias corrected by local FGO. It is converted from the IMU frame to the ENU frame.
The adaptively added GNSS factor once a GNSS-RTK solution is judged to be reliable by the GNSS-RTK availability assessment module directly restricts the position part of the state as follows: where n g j is the known measurement of the position provided by GNSS-RTK. The difference between the position to be estimated and n g j is expected to be small. When the commerciallevel GNSS receiver is exploited to generate location measurements under urban canyons, only 2D residuals respectively at the "earth" and the "north" direction are fused in the global optimization make more sense given the large error at the "up" direction. In short, the full objective function of the global optimization could be expressed as the sum of all the residuals in a window: In which (·) 2 is the Cauchy Kernel [54] for outlier suppression expressed as The control parameter c is set to 1 in our implementation [51]. C L,k is the covariance matrix weighting the confidence of the factor and is supposed to be constant in our implementation. C B,k is the covariance matrix for the IMU factor and is iteratively derived during the pre-integration according to the covariance propagation rule used when a linear transformation is performed on a random variable [44]. C G,j is the covariance matrix for the adaptively added GNSS-RTK factor and is assumed to be constant due to no further information quantitatively indicating the quality of the solution is taken into account such as the Ambiguity Dilution of Precision (ADOP) and the Position Dilution of Precision (PDOP) [55], among which the ADOP is based on the determinant of the covariance matrix of the least-squares ambiguities and defined as follows [56]: where Qâ is the covariance matrix of the ambiguities and n is its order. The estimated states could be derived as: Levenberg-Marquardt (L-M) algorithm [57] implemented in Ceres Solver [58] is utilized to solve the non-linear optimization problem during which the residuals are minimized. The states in a window are updated iteratively until convergence. As for the i-th iteration, the update procedure is as follows: in which, λ is the damping factor of the L-M algorithm automatically set according to the reduction of the residuals to control the step of the states. "⊗" denotes that for each state x k,i+1 = x k,i ∆x k,i . r represents the residual of each factor shaping to a column vector as follows: For brevity, the r L,(·) , r B,(·) , and r G,j n with j n ∈ [0, w) denotes the residual of the LiDAR factor, the IMU factor, and the adaptively added GNSS-RTK factor respectively. J is the jacobian matrix of r referring to X expressed as follows: Fortunately, the residual of each factor is only related to one or two states rather than most of the states as shown in Figure 3 [59]. In other words, the matrix J and J T J are both sparse. Therefore, computational efficiency is guaranteed.

Experimental Setup
Sensor setups: To validate the effectiveness of the proposed method, the experiment is conducted on three datasets collected in typical urban canyons in Hong Kong. As shown in Figure 4a,b, the Velodyne HDL-32E was employed to collect 3D point clouds at a frequency of 10 Hz. The Xsens Ti-10 IMU was employed to collect acceleration and angular velocity at a frequency of 200 Hz. A commercial-level u-blox GNSS receiver was used to collect the raw GNSS measurements, based on which the GNSS-RTK solution was further calculated by RTKLIB [7]. Besides, the NovAtel SPAN-CPT, a GNSS (GPS, GLONASS, and Beidou) RTK/INS (fiber-optic gyroscopes, FOG) integrated navigation system was used to provide the ground truth. In the SPAN-CPT, the high-end INS are integrated with GNSS to ensure continuous position estimation during GNSS outage. The NovAtel Inertial Explorer is also exploited to deliver precise positioning results. The ground truth provided by the SPAN is reliable and precise enough for evaluation. All the data were collected and synchronized using ROS [60]. The extrinsic parameters among all the sensors were calibrated in advance.  Table 1 summarizes the three datasets on which the framework proposed in this paper is verified, namely UrbanNav-HK-Data20190428, UrbanNav-HK-Deep-Urban-1, and UrbanNav-HK-Data20210714, abbreviated as urban-1, urban-2, and urban-3 respectively in the following. The first two data are contained in our recently open-sourced UrbanNav dataset [9]. As shown in Figure 5a, urban-1 is collected at Tsim Sha Tsui (an urban scenario with buildings and trees in Hong Kong) with a total length of 2.01 km. As shown in Figure 5b, urban-2 is collected at Whampoa (an urban scenario with dense buildings in Hong Kong) from which the last 2.37 km is selected for the experiment. As shown in Figure 5c, urban-3 is collected from Whampoa to Tsim Sha Tsui with a much longer length of 4.3 km. Table 1. The information of the three datasets used for experimental verification in this paper.

Dataset Length (km) Urbanization U-Blox RECEIVER
UrbanNav-HK-Data20190428 [9] (urban-1) [ The settings specified in the RTKLIB are listed in Table 2. The settings specified in the RTKLIB when processing the raw GNSS measurements from u-blox to achieve GNSS-RTK solutions for the three datasets are listed in Table 2 respectively.

Evaluation metrics:
The validity of the GNSS-RTK availability assessment method and the adaptive GNSS-RTK/LIO fusion pipeline for positioning are investigated. The 3D absolute translation error (ATE) implemented in EVO [61] is defined as follows: In which, n p k,est ∈ R 3 and n p k,gt ∈ R 3 represents the estimated position and the ground truth of the k-th epoch respectively. · 2 denotes the 2-norm. e k is the ATE of n p k,est . The quantitative comparison of the estimated position from different pipelines is performed by comparing the Root Mean Square Error (RMSE) of ATE to the total traveled length.
Evaluated methods: To verify the contributions of the proposed method in this paper, we evaluated the following methods.

•
Method 1: This indicates the conventional GNSS/LIO fusion method [34,42,62] that directly integrates all the received GNSS-RTK measurements, among which LIO-SAM [42] is open-sourced. In the fusion method proposed in this paper, the threshold for GNSS availability would be set as 90 • equivalently. To be more convincing, the performance of LIO-SAM [42] is also evaluated. Besides the above-mentioned positioning results of the various integrated pipelines, the positioning accuracy of alone GNSS-RTK measurements and LIO are also shown for exhaustive performance analysis. It is worth mentioning that the LIO here represents the loosely coupled LiDAR-inertial fusion method implemented in our previous work [33] which is different from that in the tightly coupled framework LIO-SAM [42].

Experimental Results in Urban Canyon 1 3.2.1. Results of GNSS-RTK Availability Assessment
To evaluate the proposed GNSS-RTK availability assessment method, the ATE of the u-blox measurement and the corresponding mean elevation angle mask of the surrounding local map is depicted in Figure 6a with three representative places annotated. The mean elevation angle mask is defined as that of a fixed-size local map with 50 m (Euclidean distance) both forward and backward of the rover. Thus, u-blox epochs within 50 m from the start and the end have no corresponding mean elevation angle masks. Figure 6b displays streetscapes in these three places. Figure 6c shows the elevation angle mask at each subregion of the local point cloud map 360 degrees around the place. It is worth mentioning that the direction from 0 • to 360 • on the outermost circle, namely the azimuth angle defined by Equation (2), refers to the position of the 3D LiDAR sensor, thus depending on the scanning pattern of the LiDAR at the correspondent epoch. As for Velodyne HDL-32E, the 0 • direction at different epochs changes continuously for all-around detection. It is not equal to the forward direction of the vehicle. For consistency, the directions on each elevation angle mask are labeled clockwise with the 0 • direction upward. The elevation angles ( • ) are depicted by concentric circles increasing from the outside to the inside. The 1 place has skyscrapers almost on three sides with more than half part of the sky mask reaching up to 50 • shown in Figure 6b1,c1, respectively. As displayed in Figure 6a, the calculated mean elevation angle mask and the ATE of the u-blox are consequently large, which implies the GNSS-RTK solution is unavailable. What is more, at the 2 place as shown in Figure 6b2, there is an open crossroad. Consistent with the scenario, the estimated mean elevation angle mask illustrated in the bottom panel of Figure 6a,c2, and the ATE of the u-blox illustrated in the top panel of Figure 6a, are notably small meaning that the GNSS-RTK solution tends to be reliable. However, at the 3 place shown in Figure 6b3, it is open near Victoria Harbor. The u-blox measurement is significantly accurate, seen in the top panel of Figure 6a. Unexpectedly, the elevation angle mask is large, with more than one-third reaching up to 40 • , as shown in Figure 6c3, due to the tall trees detected in the point cloud map.

Positioning Results Comparison
To validate the effectiveness of the adaptive GNSS-RTK/LIO fusion method for continuous position estimation, the trajectories generated by the adaptive integrated pipelines with thresholds el th for the GNSS-RTK availability assessment equals 90 • , 35 • , and 15 • are presented respectively in Figure 7a-c. All the u-blox RTK solutions processed by RTK-LIB [7], among which no fixed ones were reported, with a mean elevation angle mask smaller than el th are fused with LiDAR and IMU measurements as Equation (11). When el th equals 90 • , as the mean elevation angle mask must be smaller than 90 • , all the developed u-blox RTK solutions are integrated. Unavailable solutions with large errors would destroy the estimation, while the smaller el th rejects more unavailable ones, and therefore make more accurate position estimation. As shown in Figure 7a-c, the smaller el th is, the better its corresponding trajectory overlaps with the ground truth. The box plot in Figure 7d further shows the precision of the three trajectories quantitatively.  Table 3 shows the precision of the positioning results from alone GNSS-RTK measurements, the LIO, the LIO-SAM [42], and the three adaptive integrated pipelines quantitatively. The ATE of the pure GNSS-RTK is quite large with an RMSE of 42.02 m. The LIO achieves the best precision with the RMSE of 3.99 m as marked in bold font. The RMSE of ATE of the position estimated by the traditional fusion method LIO-SAM [42] is about 19.89 m. Among the adaptive integrated GNSS-RTK/LIO methods, when el th equals 90 • , the RMSE of ATE is 25.33 m. In LIO-SAM [42], only when the covariance of the pose generated by LIO exceeds a preset threshold, which means the reliability of the LIO decreases, the GNSS-RTK solutions without assessment are included for state optimization and correction. The proposed fusion pipeline with el th equals 90 • fused all the received GNSS-RTK solutions which would be more than that used in LIO-SAM [42]. Considering that the GNSS-RTK solution on this data has a large error, the positioning accuracy of the proposed pipeline with el th equals 90 • is lower than that of LIO-SAM [42] consequently. By decreasing the el th to 35 • , a 25% improved RMSE of 18.88 m is achieved. With el th set to 15 • the next best precision is obtained.

Experimental Results in Urban Canyon 2 3.3.1. Results of GNSS-RTK Availability Assessment
To challenge the performance of the proposed method, the other dataset is further evaluated on a denser urban scenario, the urban-2. The ATE of the u-blox measurement and the correspondent mean elevation angle mask of the surrounding local map is depicted in Figure 8a. A similar trend can also be seen in that large mean elevation angle masks are accompanied by u-blox solutions with large ATE and vice versa. In detail, at the 1 place seen in Figure 8b1, the footbridge in the front of the vehicle and skyscrapers on both sides result in poor GNSS solutions with large ATE. More than half of the corresponding sky mask of the surrounding point cloud reaches up to 60 • calculated as the Equation (3) shown in Figure 8c1. Then, the 2 place, displayed in Figure 8b2, is open near the bay which leads to accurate GNSS solutions with small ATE. Conforming to the scene, most elevation angles depicted in the mask in Figure 8c2 are smaller than 30 • .

Positioning Results Comparison
The effectiveness of the adaptive GNSS-RTK/LIO fusion method for continuous position estimation is further validated on urban-2. The trajectories generated by the adaptive integrated pipelines with thresholds el th for the GNSS-RTK availability assessment equals 90 • , 35 • , 20 • , and 15 • are presented respectively in Figure 9a-d. Different from urban-1, fixed u-blox RTK solutions processed by RTKLIB [2] are reported on urban-2. Among the fixed ones, the corresponding mean elevation angle mask of which is smaller than el th , are fused with LiDAR and IMU measurements as Equation (11) in our experiments. It is noteworthy that when fixed solutions are developed, the surroundings are relatively empty. The mean elevation angle mask of such places would be relatively small. Specifically, on urban-2 only one fixed solution has a mean elevation angle mask larger than 35 • . Therefore, trajectories estimated by pipelines with el th set to 90 • and 35 • achieves similar accuracy as plotted in Figure 9a,b. As el th decreasing, the percentage of available GNSS RTK solutions to integrate with LiDAR and IMU measurements increases. As a consequence, the estimated trajectory achieves better accuracy. As shown in Figure 9c,d, the estimated trajectories are in better agreement with the ground truth. The box plot in Figure 9d compares the precision of the four trajectories quantitatively.  Table 4 shows the precision of the positioning results from alone GNSS-RTK measurements, the LIO, and the four adaptive integrated pipelines quantitatively. Overall speaking, the positioning results on urban-2 are more accurate compared to that on urban-1 thanks to fixed solutions. The ATE of the pure GNSS-RTK is still large with an RMSE of 15.75 m. The LIO achieves the next best precision with the RMSE of 1.80 m. The RMSE of ATE of the position estimated by LIO-SAM is improved compared with GNSS RTK alone but still large due to the reliability of the fused GNSS RTK solutions is not proved. Among the four adaptive integrated pipelines, when el th equals 90 • and 35 • , comparable positioning precision is achieved. The RMSE of ATE is about 8.6 m. By decreasing the el th to 20 • , the RMSE is significantly improved by 50% to 4.86 m. The pipeline with el th set to 15 • achieves the best precision with the RMSE of 1.738 m as marked in bold font in Table 2. To perform extensive validation of the proposed method, the experiment is implemented on the urban-3 with a much longer length. The ATE of the u-blox measurement and the correspondent mean elevation angle mask of the surrounding local map is depicted in Figure 10a. The mean elevation angle masks and the ATE of the u-blox solutions are roughly correlated. When the ATE of the u-blox solution becomes large, the corresponding mean elevation angle mask would become large following and vice versa. In detail, at the 1 place seen in Figure 10b1, the overpasses over the vehicle result in poor GNSS solutions with large ATE. More than half of the corresponding sky mask of the surrounding point cloud reaches up to 60 • calculated as the Equation (3) shown in Figure 10c1. Then the 2 place, displayed in Figure 10b2, is relatively open with low objects on both sides which leads to accurate GNSS solutions with small ATE. Conforming to the scene, most elevation angles depicted in the mask in Figure 10c2 are smaller than 30 • . Around the 3 place, which is near an open crossing, the ATE of the u-blox solutions is small due to the unblocked transmission of the signal as shown in the top panel of Figure 10a. However, the mean elevation angle mask calculated from the point cloud is particularly large as shown in the bottom panel of Figure 10a. The false evaluation is caused by the trees over the vehicle and the tall double-decker bus around detected by the LiDAR presented in Figure 10b3. Most elevation angles of the surrounding points are larger than 60 degrees as plotted in the sky mask in Figure 10c3.

Positioning Results Comparison
The effectiveness of the adaptive GNSS-RTK/LIO fusion method for continuous position estimation is validated on urban-3. The trajectories generated by the adaptive integrated pipelines with thresholds el th for the GNSS-RTK availability assessment equals 90 • , 35 • , 25 • , and 15 • are presented respectively in Figure 11a-d. Fixed u-blox RTK solutions processed by RTKLIB [2] are reported on urban-3 as on urban-2. Among the fixed ones, the corresponding mean elevation angle mask of which is smaller than el th , are fused with LiDAR and IMU measurements as Equation (11) in our experiments. As el th decreasing, the estimated trajectory achieves better accuracy which would be in better agreement with the ground truth. The box plot in Figure 11d compares the precision of the four trajectories produced by the adaptive fusion pipelines quantitatively. Figure 11. (a-d) plots the trajectory generated by the adaptive fusion pipelines with the thresholds el th for the GNSS-RTK availability assessment set as 90 • , 35 • , 20 • and 15 • respectively on urban-3. The black trajectory is the ground truth. The ATE of each estimated position is mapped on the trajectory referring to the color bar with the same scale on the right, on which the largest and the smallest tick label represents the maximum and the minimum ATE respectively while the middle one represents the midrange of them. (e) shows the boxplot of the ATE on the whole trajectory produced by the four pipelines. Table 5 shows the precision of the positioning results from alone GNSS-RTK measurements, the LIO, the LIO-SAM [42], and the four adaptive integrated pipelines quantitatively. Overall speaking, the ATE of the positioning results on urban-3 are large. The RMSE of ATE of the pure GNSS-RTK reaches up to 22.23 m. The LIO achieves the next best precision with the RMSE of 11.43 m suffering from accumulated drift. Among the four adaptive integrated pipelines, when el th varies from 90 • to 15 • , the RMSE of ATE is gradually improved. The precision of the LIO-SAM is comparable with the proposed method with el th equal 90 • , both of which are categorized as conventional fusion methods. The pipeline with el th set to 15 • achieves the best precision with the RMSE of 10.02 m as marked in bold font in Table 3.

Performance of GNSS-RTK Availability Assessment
The GNSS-RTK Availability Assessment method proposed in the paper functions as selecting available GNSS-RTK solutions under urban canyons for further fusion with LIO. The exclusion of unavailable GNSS-RTK solutions with potential large errors guarantees the convergence and the accuracy of the optimization-based fusion. The availability assessment is implemented by evaluating the mean elevation angle mask of the buildings surrounding the GNSS-RTK solution as Equation (3). It benefits from the 3D range information perceived by LiDAR and the accurate point cloud map produced by LiDAR/inertial odometry. A large mean elevation angle mask indicates tall objects around. While a small one indicates open areas.
Ideally, under urban scenarios, a large mean elevation angle mask implies deep urban canyons and thus the ATE of the GNSS-RTK measurements should be large due to signal blockage or reflection. Such as the three places denoted as " 1 " in Figures 6, 8 and 10.
Oppositely, a small mean elevation angle mask denotes unblocked areas and the ATE of the GNSS-RTK measurements is expected to be small. Such as the three places denoted as " 2 " in Figures 6, 8 and 10. Most cases shown in the experiments on the three datasets are in line with the above law. Hence, available GNSS-RTK solutions could be selected according to the mean elevation angle mask. The correctness of the GNSS-RTK availability assessment via calculating the mean elevation angle mask of the point cloud map is validated.
However, false GNSS-RTK availability assessment under relatively open areas would occur affected by tall and dense trees, etc. Such as the two places denoted as " 3 " in Figures 6 and 10. These trees detected by the LiDAR over the vehicle would derive a large mean elevation angle mask while their influence on GNSS positioning accuracy is not that great. Such false assessment would miss really available GNSS-RTK solutions occasionally. To avoid this, semantic annotation of the 3D point map would be conducive to eliminate the influence of the tall objects nearby which will be one of our future works.

•
The continuous and smooth GNSS-RTK positioning under urban canyons with LIO.
Under deep urban canyons, the positioning results of the GNSS-RTK are likely to be discontinuous between epochs due to poor signal transmission [63]. The ATE of the GNSS-RTK measurements is consequently large, as shown in the first row of Tables 1-3. With the assistance of LIO, the positioning accuracy of the integrated pipelines is improved by more than 50% compared with that of the GNSS-RTK alone as shown in the last three rows of Table 3 and the last four rows of Tables 4 and 5. What is more, the visualized trajectory is smooth and continuous as shown in Figures 7c, 9d and 11d. In conclusion, through integration with LIO, the continuous and smooth positioning of GNSS-RTK under deep urban canyons is realized.

•
The accumulated drift alleviation of LIO with GNSS-RTK.
As local sensors, LiDAR and IMU could only provide relative motion estimations. Therefore, accumulated drift is unavoidable. Theoretically, in the fusion with the absolute positioning solutions from GNSS-RTK, the drift would be corrected. However, in our experiments on the three typical urban scenarios, the original GNSS-RTK solutions provided by the commercial level receivers are notably poor with tens of meters error as shown in the first row of Tables 3-5. For urban-1, no fixed GNSS-RTK solutions are provided. The RMSE of the ATE of the LIO is even slightly better than the best adaptive fusion positioning results, as shown in the second row and the last row of Table 3. On urban-2 and urban-3, with more fixed GNSS-RTK solutions developed, the RMSE of the ATE of the best adaptive fusion positioning results are better compared with that of LIO. The accumulated drift of LIO would be more alleviated as the quality of the GNSS-RTK measurements is improved. GNSS measurements refinement after selection would be a promising strategy for a further breakthrough.

•
The superiority of the proposed adaptive GNSS-RTK/LIO integration.
The proposed adaptive GNSS-RTK/LIO integration pipeline periodically performs a non-linear optimization-based fusion of GNSS-RTK and LIO, once the GNSS-RTK solution is assessed as available by the GNSS-RTK availability assessment module. As shown in the last three rows of Table 3 and the last four rows of Tables 4 and 5, the best precision achieved with the adaptive fusion pipeline is improved by more than 75% compared with that of the traditional method. The superiority of the proposed method is validated.
By lowering the threshold for the assessment, more unavailable GNSS-RTK solutions would be rejected. The ATE of the positioning results is continuously improved. Moreover, the visualized trajectories are smoother as plotted in Figure 7a-c, Figures 9a-d and 11a-d. The effectiveness of the GNSS-RTK availability assessment is further demonstrated. Theoretically, the pipeline with the smallest threshold always achieves the best precision. However, in our experiments in the deep urban canyons with dense skyscrapers, when the threshold is smaller than 15 • there is hardly any GNSS RTK solutions could be selected as available. In other words, 15 • is the best threshold for the proposed method tested on deep urban canyons such as our UrbanNav dataset. When running on less urban dataset, the best threshold for the proposed method would be lower than 15 • .
Nevertheless, the best precision achieved by the adaptive fusion method is still at meter-level due to the poor GNSS measurements provided by commercial level receivers in urban canyons. Deeper sensor fusion to correct the GNSS measurement model would lead to enhancement.

Conclusions and Prospects
This paper proposes a GNSS-RTK availability assessment method exploiting the registered 3D point map provided by LiDAR. Further integration of the available GNSS solutions, LiDAR, and IMU measurements are implemented, namely adaptive sensor fusion pipeline, to perform absolute position estimation under deep urban for autonomous navigation. The effectiveness of the GNSS-RTK availability assessment via mean elevation angle mask calculation of the local point cloud map is firstly verified according to the ATE of the GNSS measurements. For example, in most cases, a large mean elevation angle mask could be achieved under a sheltered environment. The GNSS would be assumed to be unavailable. Correspondingly, the ATE of the GNSS measurements tends to increase. With the help of the proposed GNSS-RTK selection method, poor GNSS solutions with large errors are excluded before the fused positioning. The trajectories estimated by such an adaptive fusion scheme are investigated quantitatively and qualitatively. During the experiments, different percentages of available GNSS solutions are leveraged as the threshold for availability assessment is tuned. The precision of the estimated trajectory is gradually improved as the percentage increases. In summary, the capabilities of the GNSS-RTK availability assessment module to avoid unavailable GNSS solutions and the adaptive fusion method to continuously estimate the position are demonstrated. However, the proposed method can be significantly challenged when the long-term GNSS interruption occurs. Specifically, the locally generated map will not be reliable for the GNSS-RTK selection due to the accumulated drift from the LiADR/inertial odometry. Integrating more sensors to mitigate the potential drift of the LiADR/inertial odometry will be one of the possible future research directions.
For future studies, we would further improve the accuracy of the position estimation from meter levels to centimeter levels. The semantic map [64][65][66] with labeled objects would contribute to avoiding false GNSS unavailability assessment under open areas. Because the tall trees and other nearby objects which lead to large mean elevation angle masks could be noted and removed. Moreover, it can be seen from the integrated positioning results on the three urban canyons, that the achieved best positioning precisions are still meter levels. This is because all the GNSS-RTK solutions provided by the commercial level GNSS receiver are inaccurate due to the dense building structures. Therefore, GNSS solutions refinement via measurement model correction during deeper sensor fusion would be promising [67,68]. Last but not least, the extrinsic parameters among different sensors make great sense [69]. Online calibration and adjustment are expected to be helpful [70,71]. Funding: This research was funded by Riemann Laboratory, Huawei Technologies grant number ZGD2. The APC was also funded by Riemann Laboratory, Huawei Technologies. This research was also supported by the Faculty of Engineering, The Hong Kong Polytechnic University under the project "Perception-based GNSS PPP-RTK/LVINS integrated navigation system for unmanned autonomous systems operating in urban canyons". This research was also supported by the Guangdong Natural Science Foundation (2021A1515110771).