1. Introduction
In recent years, multi-source hyperspectral data have played an increasingly important role in applications such as environmental monitoring, precision agriculture, and geological exploration [
1,
2,
3]. With their continuous narrow bands, high spectral resolution, and relatively fine spatial resolution, hyperspectral sensors provide rich spectral information that supports both quantitative inversion and qualitative identification of land surface features. However, variations in spectral response functions, radiometric calibration accuracy, and atmospheric correction procedures across different satellite platforms and sensors introduce inconsistencies, which in turn affect cross-sensor data fusion and joint applications. Therefore, evaluating the consistency of multi-source hyperspectral data is essential for ensuring interoperability, improving quantitative retrieval accuracy, and enhancing the comparability of derived products [
4,
5].
The German Environmental Mapping and Analysis Program (EnMAP) is one of the most advanced hyperspectral satellite missions, delivering Level-2A (L2A) surface reflectance products that undergo rigorous radiometric, geometric, and atmospheric corrections [
6]. These products comply with internationally accepted quality standards and have been widely used in fine-scale environmental monitoring, vegetation parameter retrieval, and mineral identification [
7,
8,
9]. To ensure high-quality data, the EnMAP team has established a comprehensive calibration and validation framework covering the full L1–L2 processing chain, supported by quarterly mission reports and global quality assessments [
10,
11]. Furthermore, cross-validation practices promoted by ESA and NASA have compared EnMAP data with those from PRISMA [
12], DESIS [
13], Sentinel-2 [
14], and airborne hyperspectral sensors [
15,
16], often employing pseudo-invariant calibration sites (PICSs) and vicarious calibration to assess inter-mission radiometric comparability [
17]. In addition, recent guidelines from CEOS/WGCV and community initiatives such as HYPSTAR and LANDHYPERNET emphasize automated ground networks and standardized uncertainty reporting to support multi-mission interoperability [
18].
China has also made rapid progress in spaceborne hyperspectral observation, successfully launching several operational satellites with hyperspectral capabilities, including ZY-1 02D [
19], GF5-02 [
20], and ZY-1 02E [
21]. GF5-02, equipped with the Advanced Hyperspectral Imager (AHSI), has undergone on-orbit radiometric performance assessments and uncertainty analyses, with cross-validation conducted at RadCalNet sites such as Baotou and Dunhuang to support its quantitative applications [
20,
22,
23]. ZY-1 02D has advanced efficient algorithms for reflectance retrieval, denoising, and rapid processing, and has been cross-calibrated with GF-5 AHSI to improve radiometric accuracy [
24]. Application-driven validation efforts have used near-contemporaneous ZY-1 02D/02E observations for soil and ecological parameter retrieval, and for environmental impact monitoring in wetlands, salinization areas, and lithological mapping [
25,
26].
Meanwhile, both Germany and China have carried out a series of multi-sensor cross-validation and interoperability studies to assess hyperspectral consistency. The EnMAP mission has been systematically compared with PRISMA, DESIS, and pseudo-invariant site analyses [
15,
16,
27]. These efforts have established a mature cross-mission framework emphasizing harmonized quality assessment. In contrast, Chinese hyperspectral missions such as GF5-02 and ZY-1 02D have focused on domestic calibration and vicarious validation at sites such as Dunhuang, Baotou, and Gobi [
19,
20,
22,
23].
Despite these advances, several limitations remain. First, inter-mission harmonization between Chinese and European hyperspectral systems is still restricted by differences in atmospheric correction algorithms (e.g., Python(version 2.7)-based atmospheric correction [
6] (PACO) for EnMAP vs. Fast Line-of-sight Atmospheric Analysis of Spectral Hypercubes [
28] (FLAASH) for GF5-02), radiometric reference standards, and spectral response characterization. Second, discrepancies in radiometric scaling, spectral resampling, and preprocessing workflows often lead to residual differences in surface reflectance, even for identical targets. These challenges emphasize the need to establish a standardized, globally consistent verification framework to enhance inter-mission comparability and improve quantitative interoperability across hyperspectral satellite systems.
Building upon these considerations, this study uses EnMAP L2A products [
27,
29,
30] as reference data and evaluates GF5-02 AHSI observations acquired in synchronous or near-synchronous conditions over representative land cover types in China. Land cover classes such as vegetation, water bodies, and rocks, which are widely representative across application domains and exhibit diverse spectral characteristics, serve as ideal targets for cross-sensor consistency assessments [
24,
31,
32]. Multiple evaluation metrics, including spectral angle, root mean squared error, and correlation coefficient, are employed to systematically assess surface reflectance consistency. By comparing reflectance results across diverse land cover types, this study not only quantifies cross-sensor spectral agreement but also identifies sensor-specific strengths and limitations under different surface conditions. The results provide insights for algorithm optimization and support the broader interoperability and application of Chinese hyperspectral data in international contexts.
3. Results
3.1. Reflectance Comparison and Spectral Consistency
Figure 2 presents representative reflectance spectra and difference plots derived from GF5-02 AHSI and EnMAP L2A products across four typical land cover types, excluding the spectral regions affected by strong water vapor absorption. Overall, the reflectance spectra from the two sensors show consistent shapes, with major absorption features well aligned.
For minerals, grasslands, and desert surfaces, the spectral trends from the visible to shortwave infrared (SWIR) regions match closely, with smooth curves and minimal deviations. In particular, mineral absorption features beyond 2000 nm are highly consistent between the two datasets, which is crucial for lithological mapping and alteration mineral detection.
When the spectra are further analyzed by wavelength sub-ranges, we observe that in the visible domain (400–700 nm) the deviations remain below 0.02 reflectance units for most surfaces, reflecting stable radiometric calibration. In the near-infrared (700–1300 nm), vegetation exhibits slightly larger differences due to sensitivity to chlorophyll absorption and canopy structure, but the overall correlation remains above 0.96. In the SWIR domain (1300–2500 nm), both desert and mineral sites preserve strong absorption alignments, particularly around 2200 nm and 2300 nm, demonstrating the ability of both sensors to capture diagnostic mineral features.
By contrast, for low-reflectance targets such as water bodies, discrepancies become more pronounced, especially in the near-infrared (NIR) region, where GF5-02 reflectance tends to be systematically higher than EnMAP by approximately 0.005–0.01 reflectance units. This bias mainly arises from the different atmospheric correction algorithms applied to the two datasets. Both FLAASH (used for GF5-02) and PACO (used for EnMAP) are based on MODTRAN radiative transfer calculations but differ in their retrieval and compensation strategies. FLAASH estimates the water vapor column using a look-up table (LUT)-based approach that relates absorption-to-reference band radiance ratios, assuming a Lambertian surface and simplified adjacency treatment. This empirical scaling can lead to slight overcorrection in low-reflectance environments. In contrast, PACO applies a pixel-wise Atmospheric Pre-corrected Differential Absorption (APDA) [
38] method, combining dense-dark-vegetation (DDV) aerosol retrieval [
39] and adjacency-effect compensation through spatial convolution with scene geometry and DEM information. These distinctions lead PACO to produce marginally lower reflectance values over water surfaces, consistent with the observed offset.
3.2. Quantitative Metrics
To further quantify the consistency, four metrics—SA, RMSE, RRMSE, and R—are calculated from 30 validation samples across the four land cover types (
Table 4). To ensure the results are not affected by residual atmospheric absorption, spectral bands strongly influenced by water vapor—centered around 1400 nm and 1900 nm—are excluded from the analysis, while all remaining valid bands within the 400–2500 nm range are used for quantitative evaluation.
The results show that GF5-02 and EnMAP reflectance are strongly correlated, particularly for desert and grassland targets. For these two surface types, the correlation coefficients reach 0.9721 and 0.9649, respectively, while the corresponding spectral angles remain below 0.08 rad, indicating a high degree of spectral similarity across the full wavelength range. These findings suggest that both sensors capture the dominant reflectance features of arid soils and vegetated canopies with high fidelity, which is essential for quantitative applications such as biomass estimation and surface albedo retrieval.
The Eastern Tianshan mineral area exhibits the highest correlation (R = 0.9765), reflecting excellent consistency even in complex geological environments with diverse lithologies and strong absorption features in the SWIR region. In particular, diagnostic absorption bands beyond 2000 nm are nearly identical between the two sensors, confirming the capability of GF5-02 to support mineralogical mapping and lithological discrimination at a level comparable to EnMAP. This level of agreement demonstrates that GF5-02 can provide reflectance information comparable to EnMAP for geological mapping purposes. However, the reliable detection of subtle absorption features also depends on additional factors such as spectral calibration accuracy, radiometric stability, signal-to-noise ratio, and the sensitivity of atmospheric correction to parameter uncertainties, which warrant further investigation in future studies.
By contrast, water bodies present significantly weaker agreement. The spectral angle increases sharply to 0.3423 rad (≈19.61°), the correlation coefficient decreases to 0.8214, and the relative RMSE rises to nearly 48%. These discrepancies are especially pronounced in the near-infrared region, where EnMAP reflectance is systematically higher than that of GF5-02. A plausible explanation lies in the different atmospheric correction algorithms applied to the two datasets: GF5-02 uses FLAASH, whereas EnMAP employs PACO, each adopting distinct parameterizations for water vapor absorption and adjacency-effect compensation. FLAASH relies on a LUT-based retrieval of water vapor using the ratio of absorption to reference radiances, assuming a Lambertian surface and simplified adjacency correction, which may lead to slight overestimation of reflectance over dark targets. In contrast, PACO applies an APDA method combined with explicit adjacency-effect correction through spatial convolution. In addition, EnMAP applies a detector non-linearity correction in its radiometric calibration chain to mitigate response curvature at low radiance levels, whereas GF5-02 employs a linear calibration model based on laboratory and vicarious field calibration without explicit non-linearity terms. This difference in calibration strategy may further contribute to the reflectance offset observed over low-signal surfaces such as water bodies. Furthermore, the inherently low signal-to-noise ratio (SNR) and the limited effective spectral range of water-leaving reflectance (primarily below 900 nm) amplify the relative impact of radiometric noise, atmospheric retrieval errors, and sensor-specific response differences. Heterogeneity in the Qinghai validation sites, where saline crusts and shallow water coexist, may further contribute to residual discrepancies.
These results underscore that while GF5-02 and EnMAP achieve strong interoperability over medium- to high-reflectance surfaces such as minerals, desert, and vegetation, low-reflectance targets like water remain challenging. Addressing these challenges will require refined atmospheric correction strategies and potentially harmonized inter-sensor processing chains to improve the reliability of aquatic applications.
3.3. Regression Analysis
Regression analysis is performed by averaging 30 spectra per land cover type and plotting band-wise scatterplots between GF5-02 and EnMAP reflectance (
Figure 3). For consistency, bands located within strong atmospheric water-vapor absorption regions near 1400 nm and 1900 nm are excluded prior to regression to ensure that the comparison reflects true sensor and algorithmic differences rather than atmospheric artifacts. The scatterplots show that most data points are distributed close to the 1:1 reference line, with regression slopes close to unity and small intercepts, indicating overall consistency.
For grassland, desert, and mineral targets, the coefficients of determination (R
2) all exceed 0.94, reflecting excellent linear agreement between GF5-02 and EnMAP reflectance. The scatterplots (
Figure 3) show that most points are closely aligned with the 1:1 reference line, with regression slopes ranging from 0.95 to 1.15 and intercepts close to zero. This indicates that not only the spectral shapes but also the absolute reflectance values of GF5-02 are highly consistent with those of EnMAP, even across heterogeneous vegetation canopies and spectrally complex mineral assemblages. Such agreement provides confidence that GF5-02 data can be directly applied to biophysical parameter retrieval and mineralogical mapping without introducing significant cross-sensor biases.
In contrast, water bodies yield a markedly lower R2 value of 0.680, and the scatter distribution shows a systematic departure from the 1:1 line, particularly in the near-infrared region where reflectance values approach zero. This suggests that small absolute deviations in low-reflectance conditions can lead to disproportionately large relative errors, thereby reducing linear agreement. The observed bias is consistent with the reflectance offset noted earlier and is likely attributable to differences in atmospheric correction strategies, adjacency effects from surrounding bright saline crusts, and the intrinsic sensitivity of water reflectance to sensor noise and illumination conditions.
To further examine the wavelength-dependent consistency,
Table 5 summarizes the correlation coefficients between GF5-02 and EnMAP reflectance within three major spectral sub-ranges (VIS: 400–700 nm; NIR: 700–1100 nm; SWIR: 1100–2500 nm), excluding bands affected by strong atmospheric absorption near 1400 nm and 1900 nm. The results show that the correlation remains high across all land-cover types in the VIS and SWIR regions (typically R > 0.95), indicating excellent radiometric consistency and stable cross-sensor spectral alignment. In the NIR domain, the correlation decreases slightly (R ≈ 0.91–0.94) for most surfaces and drops markedly for water (R = 0.6964), reflecting the higher sensitivity of this region to canopy structure, lower SNR, and residual water-vapor effects. These findings confirm that while both sensors achieve robust agreement for high- and medium-reflectance surfaces across the full spectral range, low-reflectance targets—particularly aquatic environments—remain more susceptible to atmospheric retrieval uncertainties and noise amplification.
Taken together, these regression results confirm that GF5-02 and EnMAP exhibit high agreement in spectral shape, correlation, and regression performance for most land cover types, especially for medium- and high-reflectance surfaces such as grasslands, deserts, and minerals. The high correlations observed in the VIS and SWIR regions further demonstrate the reliability of both sensors for quantitative retrieval of surface and mineralogical parameters. However, noticeable discrepancies remain in the NIR and aquatic environments, where the inherently low signal levels, limited effective spectral range, and higher sensitivity to atmospheric and adjacency effects reduce the overall consistency. These findings highlight the wavelength-dependent nature of cross-sensor differences and underline the need for harmonized atmospheric correction pipelines and targeted optimization strategies when applying GF5-02 data to aquatic and other low-reflectance environments.
3.4. Validation with Ground Measurements in the Eastern Tianshan
To further validate the consistency of GF5-02 hyperspectral data, two images acquired over the Eastern Tianshan region were analyzed, with acquisition dates of 2 March 2022 and 28 April 2023. Both images cover three historical in situ spectral measurement sites, where ground-based spectral measurements were originally collected on 31 July 2011. Although there is a temporal gap between the ground campaigns and the satellite acquisitions, the selected sites are dominated by exposed lithologies with highly stable spectral characteristics. Unlike vegetation or soils, these mineral assemblages show minimal seasonal or interannual variability, making them reliable reference targets for long-term sensor validation.
Ground-based surface reflectance data are obtained from field measurements conducted in 2011 using an Analytical Spectral Devices FieldSpec-3 [
40] spectroradiometer (Analytical Spectral Devices, Incorporated, Boulder, CO, USA). The instrument covers the 350–2500 nm spectral range with a spectral sampling interval of 1 nm and a full width at half maximum (FWHM) of 3–10 nm. Measurements are performed under clear-sky, near-nadir conditions around local solar noon to minimize illumination variability. Each reflectance spectrum represented the mean of several scans collected over a 1 m × 1 m homogeneous area, with radiometric calibration using a Spectralon
® reference panel (Labsphere Incorporated, North Sutton, NH, USA) before each measurement series.
The three validation sites represent distinct lithologies (
Table 6): (1) quartz–muscovite schist (identified as garnet–biotite–quartz schist by thin-section analysis), (2) quartz porphyry (composed mainly of quartz, K-feldspar, and minor sericite), and (3) silicified and altered quartz porphyry (altered dacite).
As shown in
Figure 4, the reflectance curves derived from GF-5B closely match the in situ spectra across the entire wavelength range, capturing both the overall shape and the main absorption features.
Quantitative comparisons between GF5-02 reflectance and the in-situ spectra are conducted using spectral angle (rad), RMSE, relative RMSE (%), and R. To ensure spectral consistency with the satellite observations, the ground spectra are convolved with the spectral response functions (SRFs) of GF5-02 using the same Gaussian-weighted integration method described in
Section 2.3.1. The results are summarized in
Table 7.
The analysis indicates that:
Spectral consistency—The reflectance curves of all three validation sites closely match the in situ measurements, showing consistent absorption features and overall spectral trends.
Quantitative agreement—All validation sites achieved correlation coefficients above 0.96, with RMSE values below 0.1, confirming strong consistency between GF-5B and ground-based spectra.
Temporal stability—Comparisons between the two acquisition dates demonstrate stable sensor performance, as the spectral features, magnitudes, and trends remain highly consistent, underscoring the reliability of GF-5B during its on-orbit operation.
4. Discussion
The cross-validation analysis demonstrates that GF5-02 exhibits high radiometric consistency with EnMAP across a wide range of land-cover types, particularly for medium- and high-reflectance surfaces such as grasslands, deserts, and mineral-rich regions. The observed spectral correlations (R > 0.95 in VIS and SWIR) and small spectral angles (<0.08 rad) indicate that the GF5-02 reflectance products maintain both spectral shape fidelity and absolute radiometric agreement with the EnMAP L2A reference. These findings are consistent with previous cross-sensor intercomparisons, such as between PRISMA and EnMAP, which reported similar levels of agreement after atmospheric correction and spectral resampling [
15]. This high consistency supports the use of GF5-02 data for quantitative biophysical parameter retrieval, mineral mapping, and cross-sensor time-series analyses. Moreover, the robust spectral alignment between the two sensors confirms the reliability of their radiometric calibration frameworks and the physical soundness of the FLAASH and PACO atmospheric correction algorithms when applied to stable terrestrial surfaces.
Despite the overall agreement, discrepancies become more pronounced in the near-infrared (NIR) and water scenes, where reflectance is intrinsically low and signal-to-noise ratios (SNRs) decrease significantly. The reduced correlation for aquatic environments (R = 0.6964 in the NIR) highlights the sensitivity of dark targets to atmospheric correction uncertainties, adjacency effects, and detector response non-linearities. EnMAP applies a non-linearity correction to compensate for radiometric response curvature at low signal levels, whereas GF5-02 relies on a linear calibration model based on vicarious field data. This calibration difference, combined with varying retrieval strategies for water vapor and adjacency compensation (FLAASH vs. PACO), likely contributes to the observed offsets.
A key strength of this study lies in its comprehensive multi-scene validation framework, integrating spectral and spatial evaluations to assess cross-sensor interoperability under diverse environmental conditions. The results provide quantitative evidence that GF5-02 can complement EnMAP for multi-sensor hyperspectral analyses. Nevertheless, several limitations should be acknowledged. First, temporal acquisition differences—though limited to a few days—may still introduce slight changes in illumination or atmospheric state, particularly over aquatic surfaces. Second, due to the lack of dedicated laboratory non-linearity characterization for GF5-02, reflectance deviations at low radiance levels remain a source of uncertainty. Future work should focus on (i) harmonized atmospheric correction pipelines for aquatic environments, (ii) dedicated SNR and BRDF characterization campaigns, and (iii) improved uncertainty propagation analyses following the QA4EO metrological framework [
41]. These efforts would strengthen the quantitative interoperability of hyperspectral satellite constellations and advance the traceability of surface reflectance products for climate and Earth system monitoring.
5. Conclusions
In this study, EnMAP L2A reflectance products were used as reference data to systematically evaluate the surface reflectance consistency of GF5-02 across four representative land cover types: minerals, grassland, desert, and inland water bodies.
The results show that for high- and medium-reflectance surfaces (e.g., minerals, grassland, and desert), GF5-02 and EnMAP exhibit highly consistent spectral shapes, with strong correlations (R > 0.96) and small spectral angles (<0.08 rad), demonstrating robust interoperability and applicability for quantitative remote sensing. However, for low-reflectance targets such as water, the consistency decreases significantly, reflected in larger spectral angles, lower correlation coefficients, and higher RRMSE values. This discrepancy likely arises from differences in atmospheric correction models and sensor responses, compounded by the inherently low signal levels and the limited effective spectral range of water-leaving reflectance.
Additionally, the ground-based validation in the Eastern Tianshan confirmed the reliability and stability of GF5-02 reflectance products, further supporting their suitability for long-term quantitative applications.
Overall, GF5-02 and EnMAP exhibit strong agreement across most land cover types, supporting their combined use in quantitative remote sensing and cross-sensor studies. Nevertheless, further methodological improvements in atmospheric correction and radiometric harmonization are still required to enhance consistency in challenging environments, particularly over water and other low-reflectance surfaces.