1. Introduction
Gaofen-5 01A (GF-5A) is a key component of China’s civilian High-Resolution Earth Observation System, designed to support comprehensive environmental monitoring and resource assessment. Launched on 8 December 2022, the satellite carries three primary payloads: the Advanced Hyperspectral Imager (AHSI), the Environmental Trace Gas Monitoring Instrument (EMI), and the Wide-swath Thermal Infrared Imager (WTI) [1]. WTI employs a whiskbroom cross-track scanning configuration and provides an exceptionally wide swath of 1500 km with a ground sampling distance of 100 m across four longwave infrared bands. This combination of wide coverage and multiple thermal-window channels enables enhanced spatiotemporal observations for applications such as land surface temperature (LST), sea surface temperature (SST), drought monitoring, and urban thermal environment assessment [2]. Thermal infrared (TIR) remote sensing retrieves key parameters—such as land surface temperature, surface emissivity, and atmospheric characteristics—by measuring longwave radiance signals [3]. These products are widely used in climate diagnostics, surface energy budget analysis, disaster monitoring, and environmental assessment. Such applications require radiometric calibration that is accurate, temporally stable, and traceable across long-term satellite missions. Historical sensors such as the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER), MODIS, and Landsat have demonstrated the importance of maintaining consistent and comparable TIR records [4,5,6,7].
TIR sensors are typically equipped with an onboard radiometric calibration subsystem. For GF-5A WTI, two temperature-controlled internal blackbodies are configured as the primary radiance reference sources. The high-temperature blackbody is maintained at an elevated operating temperature through dedicated heaters and thermal control components, whereas the low-temperature blackbody, in addition to the same types of components as the high-temperature blackbody, requires a heat dissipation surface and a heat pipe to regulate its temperature at a lower level. During routine operational imaging, the instrument periodically observes the onboard blackbodies for calibration, enabling on-orbit calibration coefficients to be updated and applied to the digital number (DN)-to-radiance conversion for product generation. Nevertheless, onboard calibration alone cannot fully eliminate long-term radiometric drift; residual systematic biases may still arise from stray-light contamination, orbit-driven thermal-environment perturbations, limitations in thermal control and thermometry stability, and instrument aging [8,9]. Therefore, vicarious calibration remains necessary to provide an independent, traceable external pathway for verifying and, when required, correcting the onboard-calibrated radiometric response [10,11,12].
Common approaches for TIR vicarious calibration include lake- or water-body–based methods using uniform temperature/radiance targets, as well as radiance-based methods employing ground-based or airborne radiometers. For example, the U.S. Geological Survey (USGS) Earth Resources Observation and Science (EROS) Center has established a comprehensive on-orbit calibration and validation framework for the Landsat 8 Thermal Infrared Sensor (TIRS), which demonstrates sub-Kelvin radiometric stability under favorable observing conditions and thereby enhances the consistency of long-term thermal data records [13]. Additionally, studies over Qinghai Lake have conducted vicarious calibration and uncertainty evaluation for Fengyun-4A (FY-4A) Advanced Geostationary Radiation Imager (AGRI) TIR channels, quantifying the brightness temperature biases and stability characteristics of individual bands [14].
To enhance calibration frequency and repeatability without substantially increasing costs, the Radiometric Calibration Network (RadCalNet), established under the Committee on Earth Observation Satellites (CEOS) Working Group on Calibration and Validation (WGCV), has provided an operational service framework for radiometric calibration in the visible-to-shortwave infrared (VNIR–SWIR) spectral region. This framework integrates standardized automated sites with unified processing chains and enables cross-mission and cross-sensor radiometric-consistency assessments [15,16]. Meanwhile, methodologies based on automated stations and standardized processing pipelines have already been operationalized. For instance, the automated reflectance-based method designed for the Landsat-8 Operational Land Imager (OLI) significantly reduces field deployment requirements while maintaining timely and robust calibration updates [17]; similarly, a ground-based radiance-based method for the Sentinel-2A MultiSpectral Instrument (MSI) has demonstrated a long-term radiometric-stability evaluation framework driven by automated observations [18]. Using the Baotou Desert automated calibration site, time-series vicarious calibration of the Ziyuan-3 (ZY-3) multispectral camera (MUX) has also verified the effectiveness of the automated workflow and its capability to close the uncertainty budget [19].
However, automated vicarious calibration systems for the thermal infrared domain remain limited. Existing lake-based campaigns are typically short in duration, seasonally restricted, and difficult to maintain year-round. Furthermore, automated networks such as RadCalNet do not currently extend to the TIR spectral region, where radiance is more sensitive to atmospheric variability and surface emissivity. This gap highlights the need for a persistent, automated, traceable TIR calibration framework suitable for long-term operational missions.
Several recent efforts have extended automated vicarious calibration to the TIR domain. Over radiometrically homogeneous lake surfaces, automated buoys, radiometers, and uncrewed surface vehicles (USV) have been used to reduce manual intervention and increase temporal sampling. For example, USV-based observations over Lake Qinghai have enabled multi-scene radiometric consistency assessments for the FY-4A/AGRI TIR channels [14]. Similarly, the Ziyuan-1 02E (ZY-1-02E) Infrared Spectrometer sensor (IRS) has been calibrated using coordinated ground measurements from a Fourier transform infrared (FTIR) spectrometer and an SI-111 infrared thermometer (Apogee Instruments, Inc., Logan, UT, USA), achieving an accuracy better than 0.6 K under diverse surface and atmospheric conditions [20]. In addition, Hu et al. established a lake-based ground-to-satellite synchronous calibration workflow for the Sustainable Development Goals Satellite-1 (SDGSAT-1) Thermal Infrared Spectrometer (TIS), utilizing high-resolution ground radiometer observations and atmospheric radiative transfer simulations to quantify systematic brightness temperature biases (approximately 0.3–1.1 K) across the three TIR bands, confirming the feasibility of routine on-orbit calibration and performance monitoring enabled by automated lake-based observation sites [21].
Despite these advances, lake-based calibration campaigns are limited in operational continuity. Their reliance on water bodies also restricts spatial scalability and limits radiometric coverage across gain states, because the limited dynamic range of water-surface temperature cannot adequately span the sensor’s full gain range. In contrast, the extensive Gobi regions of northwestern China offer naturally stable, spatially uniform, and low-humidity surfaces, making them ideal pseudo-invariant calibration sites (PICS) for thermal infrared sensors. However, no operational, year-round, multi-site automated calibration system has yet been established over these Gobi surfaces. Addressing this gap is essential for enabling persistent, traceable, and high-frequency vicarious calibration of modern TIR satellite missions.
To address this gap and to develop a long-term, low-maintenance, and traceable TIR calibration framework, we deployed automated ground-based observation systems at three Gobi sites in October 2023 and initiated a dedicated field experiment for GF-5A WTI. The framework integrates continuous contact surface-temperature measurements, field-measured emissivity spectra, radiosonde atmospheric profiles, and MODTRAN v5.2 radiative-transfer simulations to construct a complete surface–atmosphere–sensor radiometric chain. Using this chain, top-of-atmosphere radiances are computed, collocated with satellite observations, and used to derive on-orbit calibration coefficients. Although previous studies have investigated TIR vicarious calibration, no multi-site, year-round automated system has been established over Gobi pseudo-invariant calibration surfaces. The present work provides the first implementation of such a system and demonstrates its feasibility for routine TIR calibration.
This study proposes a long-term and traceable framework to support operational TIR vicarious calibration. We establish three year-round Gobi calibration sites selected under PICS-candidate criteria and deploy an automated ground-based observation system, whose accuracy is verified through metrological traceability and radiometric closure. By integrating ground observations with IGRA radiosonde profiles and MODTRAN v5.2 radiative-transfer simulations, we implement a traceable “surface–atmosphere–satellite” radiometric chain to derive on-orbit calibration coefficients for GF-5A/WTI in each band and gain mode. The resulting calibration is further validated by independent on-orbit samples and a comprehensive uncertainty budget.
The remainder of this paper is organized as follows: Section 2 provides detailed descriptions of the study area, data sources, measurement systems, and calibration methods. The calibration results are presented in Section 3. Section 4 discusses the findings, and Section 5 concludes the study.
3. Results
3.1. Accuracy Verification of Ground-Based Temperature Measurements
To evaluate the performance of the ground-based contact temperature sensor across different thermal transition regimes, two field comparison experiments were conducted under warming (20 September 2024, 11:00–12:30 local time) and cooling (27 September 2024, 22:00–23:30 local time) conditions. In each experiment, the ground-based contact sensor was compared synchronously with a JM standard thermometer traceable to the National Institute of Metrology of China. The temperature difference time series ΔT, defined in Equation (1) as the contact-sensor reading minus the JM reference reading, was used to characterize the measurement bias between the two sensors.
Figure 6 presents the synchronous temperature measurements and the corresponding bias series. The two temperature profiles exhibit high temporal coherence, indicating consistent thermal response behavior between the contact sensor and the JM reference. During the warming experiment, surface temperature increased from approximately 28.6 °C to 36.6 °C, with instantaneous deviations ΔT generally confined within a narrow band around zero; the corresponding bias statistics (mean bias, standard deviation, and RMSE) confirm the close agreement. During the cooling experiment, temperature decreased from 8.57 °C to 5.77 °C, with somewhat larger but still small deviations. Both experiments indicate that the bias remains small throughout. Such behavior suggests that the contact sensor and the JM standard thermometer maintain robust agreement under both heating and cooling conditions. The slight positive bias may arise from the thermal interface between the probe and the surface: higher contact pressure or marginally thicker thermal paste may cause the contact probe to sense heating slightly earlier than the standard thermometer.
Given the intrinsic single-point measurement uncertainty of the contact sensor (approximately ±0.1 °C) and the experiment-derived statistics, the observed deviation is far smaller than typical uncertainties in thermal infrared inversion or on-orbit radiometric calibration. Thus, the accuracy is sufficient for validating TIR products and evaluating retrieval algorithms.
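The bias statistics used throughout this comparison (mean bias, standard deviation, RMSE) can be sketched as follows; the function name and the sample values are illustrative assumptions, not the actual field data:

```python
import numpy as np

def bias_metrics(t_contact, t_reference):
    """Mean bias, standard deviation, and RMSE between two
    synchronized temperature series (same units, e.g. degrees C)."""
    d = np.asarray(t_contact, dtype=float) - np.asarray(t_reference, dtype=float)
    bias = d.mean()                  # systematic offset
    std = d.std(ddof=0)              # random scatter about the mean bias
    rmse = np.sqrt((d ** 2).mean())  # total deviation; rmse^2 == bias^2 + std^2
    return bias, std, rmse

# Synthetic warming-phase readings (values are illustrative only)
contact = [28.61, 30.15, 32.40, 34.88, 36.62]
standard = [28.57, 30.10, 32.33, 34.80, 36.55]
bias, std, rmse = bias_metrics(contact, standard)
```

Note that with the population standard deviation (ddof=0), the identity RMSE² = Bias² + Std² holds exactly, which is useful for cross-checking reported statistics.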
To independently assess the suitability of contact temperature measurements as reference inputs in radiative-transfer calculations, a radiometric-closure experiment was performed. This experiment evaluates whether temperatures derived from FTIR-based radiometric inversion are consistent with direct contact measurements under controlled surface and atmospheric conditions. Turbo FT radiance spectra and contact temperatures were collected simultaneously, and the ISSTES algorithm was applied to retrieve the radiometric temperature [31]. The radiometric consistency metric, defined in Equation (2) as the difference between the contact measurements and the ISSTES-derived temperatures, was then used to quantify their agreement.
The experiments covered a high-temperature case (approximately 40 °C, 20 September 2024) and a low-temperature case (approximately 7 °C, 26 September 2024). As shown in Figure 7, the deviations between the contact and ISSTES-retrieved temperatures are generally confined within a small range, and the overall statistics indicate that the contact temperatures are consistent with the ISSTES-retrieved radiometric temperatures at a level well within the accuracy requirements for thermal infrared calibration in this study.
Across both dynamic thermal transitions and radiometrically retrieved temperature ranges, the contact sensors demonstrate high consistency. This accuracy level is fully sufficient for thermal infrared calibration requirements and supports on-orbit and field-based radiometric calibration as well as emissivity algorithm validation. Based on these results, the contact temperature measurements are adopted as the reference temperature input for subsequent surface temperature retrieval and calibration procedures.
3.2. Analysis of Surface Temperature Variations
In October 2023, the automated ground measurement systems were successfully deployed at the three calibration sites of Dunhuang, Dachaidan, and Golmud, and stable data acquisition has since been achieved. Analysis of the 2024 contact temperature time series enables an assessment of the long-term suitability of the three Gobi sites for persistent vicarious calibration. The annual thermal dynamics provide insight into the stability, representativeness, and radiometric diversity required for calibrating thermal infrared sensors across their full dynamic range.
As shown in Figure 8, all three sites exhibit a typical seasonal cycle characterized by low temperatures in winter and high temperatures in summer. A sustained high-temperature plateau lasting approximately two months is observed in summer, while maximum winter temperatures do not exceed 20 °C. The annual warmest period occurs in late July, and the coldest temperatures appear in January (Dachaidan) and December (Dunhuang and Golmud), consistent with regional radiative climate characteristics. These seasonal behaviors enable seasonal calibration strategies, with high-temperature calibration in summer and low-temperature calibration in winter.
Due to planned maintenance and system switching, data gaps occur between 1 November and 15 November 2024; however, the overall coverage and continuity remain sufficient for statistical analysis and do not affect the interpretation of seasonal patterns or thermal extremes. The annual temperature dynamic range is large across all sites, with Dunhuang spanning 87.3 °C, Dachaidan 95.3 °C, and Golmud 89.2 °C. This broad thermal range ensures that the three sites collectively cover the full spectrum of surface temperatures from cold to hot throughout the year. Consequently, ample calibration samples can be obtained to support dynamic-range extension and long-term drift monitoring of thermal infrared sensors, providing continuous and traceable ground-based reference measurements for radiometric calibration.
3.3. Radiometric Calibration Results
GF-5A WTI imagery acquired from 1 February 2024 to 31 July 2024 was selected for calibration analysis (182 days in total). During this period, approximately six WTI scenes per day were available over the calibration sites, yielding 1092 candidate scenes. Following the satellite–ground matchup and quality-control procedure described in Section 2.2.3, 903 unsuitable scenes were excluded, and a total of 189 valid overpass samples remained across the three calibration sites.
This dataset provides both a sufficiently wide radiometric dynamic range and a representative distribution of multiple gain states, meeting the requirements of linear regression–based calibration. Notably, GF-5A/WTI employs multiple gain modes within a given band, and gain switching introduces gain-dependent offsets and response variations. Consequently, the DN–radiance relationship is not globally continuous across gain modes, and pooling samples from different gains in a single regression would bias the fitted coefficients. Therefore, we perform gain-stratified regression and derive the calibration coefficients for each gain mode using the linear model in Equation (7).
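The gain-stratified regression described above can be sketched as follows, assuming a linear model L = a·DN + b fitted independently per gain mode; the sample tuples, mode labels, and function name are hypothetical, not the actual matchup data:

```python
import numpy as np
from collections import defaultdict

def gain_stratified_fit(samples):
    """Fit a separate linear model L = a*DN + b for each gain mode.

    `samples` is an iterable of (gain_mode, dn, radiance) tuples.
    Pooling gain modes in one regression would bias the coefficients,
    so each mode is fitted on its own subset.
    """
    grouped = defaultdict(list)
    for gain, dn, rad in samples:
        grouped[gain].append((dn, rad))
    coeffs = {}
    for gain, pairs in grouped.items():
        dn = np.array([p[0] for p in pairs], dtype=float)
        rad = np.array([p[1] for p in pairs], dtype=float)
        a, b = np.polyfit(dn, rad, deg=1)  # slope, intercept
        coeffs[gain] = (a, b)
    return coeffs

# Illustrative samples: two gain modes with different linear responses
samples = [("Z2", 100, 6.0), ("Z2", 200, 8.0), ("Z2", 300, 10.0),
           ("Z3", 100, 3.0), ("Z3", 200, 4.0), ("Z3", 300, 5.0)]
coeffs = gain_stratified_fit(samples)
```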
Figure 9 presents the on-orbit calibration results for WTI channels B1–B4 under different gain modes (Z2, Z3, Z4). For each band–gain combination, linear regression was performed using all valid matched samples, and the regression equation and corresponding coefficient of determination (R²) were obtained; a larger R² indicates a stronger linear correspondence between DN and radiance. Overall, all channel–gain combinations exhibit clear and statistically significant linear responses, with consistently high coefficients of determination and RMSE values ranging from 0.12 to 0.24. These results demonstrate that the on-orbit response of GF-5A WTI is well described by a first-order linear model across its operational dynamic range, with no evidence of nonlinear distortion.
From a channel-by-channel perspective, distinct patterns emerge in sample size n and residual characteristics (RMSE) across gain states. Band 1 under gain mode Z2 exhibits the best overall performance (RMSE = 0.18), indicating a favorable signal-to-noise ratio and sufficient radiance coverage, making it the most stable channel in the current calibration framework. Band 2 demonstrates excellent cross-gain consistency between Z2 and Z3, with identical coefficients of determination and RMSE values of 0.12 and 0.24, respectively. This suggests that transitioning between gain states does not introduce systematic response deviations and that the linear relationship remains portable and internally consistent.
Bands 3 and 4 also maintain strong linearity under the Z3/Z4 modes (Band 3: RMSE = 0.16/0.23; Band 4: RMSE = 0.14/0.23). The slightly higher residuals observed in the high-gain modes are primarily attributable to the limited number of samples and the broader distribution of operating radiance levels, reflecting the influence of sample size and radiance coverage on regression stability rather than any intrinsic nonlinearity of the sensor.
In summary, all WTI channels exhibit highly consistent linear behavior across gain states, with strong model fits and well-controlled residuals, meeting the requirements of on-orbit radiometric calibration. To further evaluate cross-gain coherence and the applicability of the derived calibration coefficients, the next section analyzes error characteristics in overlapping gain regions and validation results from independent imagery.
4. Discussion
4.1. Selection of Calibration Sites
To ensure that automated ground-based observation systems provide long-term, high-quality data for satellite radiometric calibration and validation, calibration sites must satisfy several core criteria, including spatial homogeneity, temporal stability, flat terrain, and low cloud cover. The overarching principle is to prioritize surface types with highly stable radiometric properties and minimal anthropogenic disturbance—such as desert and gravel–gobi surfaces—and to include regions that span different thermal regimes so that sensor performance can be evaluated under diverse surface temperature conditions. Spatially uniform regions with minimal topographic variation and weak spatial gradients in reflectance/emissivity help improve the geometric consistency between satellite and ground observations, whereas persistent low cloudiness and low aerosol loading ensure high ratios of valid overpasses and robust temporal continuity.
For candidate site selection, this study builds upon the preliminary site inventory of Hu et al. (2020), which identified 32 Pseudo-Invariant Calibration Sites (PICS) [25]. A regional search and field surveys were conducted across the arid to semi-arid regions of northwestern China. To further refine site suitability, we used the MODIS MOD09GA cloud-mask product from 2001–2020 to derive 20-year clear-sky statistics for all PICS candidates, using the annual mean clear-sky fraction averaged over the 20-year period. MOD09GA provides two sets of cloud-related flags (cloud state and internal cloud algorithm flag). Based on these, two clear-sky indicators were defined:
Clear-Sky Probability 1: fraction of pixels within the 10 km × 10 km window that satisfy cloud state = 0 or internal cloud algorithm flag = 0;
Clear-Sky Probability 2: fraction of pixels within the 10 km × 10 km window that satisfy cloud state = 0 and internal cloud algorithm flag = 0.
These metrics jointly characterize the long-term clear-sky availability under “cloud-free/low-cloud” (Clear-Sky Probability 1) and “strictly cloud-free” (Clear-Sky Probability 2) conditions.
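A minimal sketch of the two indicators, assuming the MOD09GA flags have already been decoded into integer arrays for the 10 km × 10 km window (the array and function names are hypothetical):

```python
import numpy as np

def clear_sky_probabilities(cloud_state, internal_cloud_flag):
    """Compute the two clear-sky indicators over a window of pixels.

    `cloud_state` and `internal_cloud_flag` are decoded integer flag
    arrays for one window; 0 means 'clear' in both flag sets.
    """
    clear_state = (np.asarray(cloud_state) == 0)
    clear_internal = (np.asarray(internal_cloud_flag) == 0)
    p1 = np.mean(clear_state | clear_internal)  # cloud-free / low-cloud (OR)
    p2 = np.mean(clear_state & clear_internal)  # strictly cloud-free (AND)
    return p1, p2

# Illustrative 4-pixel window
state = np.array([0, 0, 1, 0])
internal = np.array([0, 1, 0, 1])
p1, p2 = clear_sky_probabilities(state, internal)
```

The OR condition yields the looser Clear-Sky Probability 1, while the AND condition yields the stricter Clear-Sky Probability 2, so p1 ≥ p2 always holds.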
To meet the spatial homogeneity requirements, we further calculated the mean slope and slope standard deviation within a 10 km × 10 km window around each candidate site using SRTM digital elevation model (DEM) data, providing quantitative constraints on terrain flatness and spatial uniformity.
As shown in Table 2, the three selected sites—Golmud, Dachaidan, and Dunhuang—perform well in terms of spatial homogeneity and clear-sky frequency. Although Dunhuang exhibits slightly higher cloudiness, its low slope and high spatial uniformity make it an excellent candidate. Golmud and Dachaidan, both located within typical gravel–gobi regions, show consistently high clear-sky probabilities and therefore provide highly favorable conditions for long-term, stable ground-based calibration observations.
4.2. Spatial Uniformity of Surface Thermal Infrared Emission
To evaluate the spatial uniformity of surface thermal infrared emission over the candidate calibration regions, thermal infrared band images from the GF-5A WTI instrument were analyzed within a 2 km × 2 km area centered on each candidate site. Spatial radiometric uniformity was quantified using the coefficient of variation (CV), a dimensionless statistic widely used in calibration-site characterization studies [25], defined as CV = σ/μ, where σ represents the standard deviation of digital numbers (DN) within the region of interest and μ denotes the corresponding mean DN value. A smaller CV indicates a more spatially homogeneous thermal radiance field.
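The CV statistic can be sketched as follows; the region-of-interest values are illustrative only:

```python
import numpy as np

def coefficient_of_variation(dn_window):
    """CV = sigma / mu over the DN values of the region of interest.
    A smaller CV indicates a more homogeneous radiance field."""
    dn = np.asarray(dn_window, dtype=float)
    return dn.std(ddof=0) / dn.mean()

# Illustrative 2x2 DN window
roi = np.array([[1000, 1010],
                [990, 1000]])
cv = coefficient_of_variation(roi)  # fraction; multiply by 100 for percent
```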
As summarized in Table 3, all three candidate calibration sites exhibit excellent spatial radiometric uniformity, with CV values consistently below 2% across all thermal infrared bands. The Dunhuang site shows the lowest CV values among the three regions for both daytime (0.3%–0.9%) and nighttime (0.18%–0.63%) scenes, indicating superior radiance homogeneity. The Golmud site ranks second (daytime 0.6%–1.6%, nighttime 0.42%–0.98%), while the Dachaidan site displays slightly higher CV values but remains well within acceptable uniformity thresholds (daytime 0.6%–1.8%, nighttime 0.3%–1.1%).
Across all sites, nighttime CV values are systematically lower than their daytime counterparts, suggesting that surface thermal emission fields are more stable under weak or absent solar heating. Taken together, the three regions exhibit favorable characteristics—uniform surface composition, minimal temporal variability, flat terrain, and a high frequency of clear-sky conditions—supporting their suitability for establishing persistent vicarious calibration fields and meeting the radiometric accuracy requirements of high-resolution thermal infrared sensors.
4.3. Analysis of Calibration Coefficients
To accommodate calibration requirements across different gain modes within the same spectral band, calibration coefficients were derived independently for each mode. During the calibration process, it was observed that DN values differ across gain states even when corresponding to the same physical radiance L.
To compare calibration consistency across gain settings under equivalent radiance levels, this study defines the overlap region as the radiance interval between the minimum radiance observed in a lower-gain mode and the maximum radiance observed in the adjacent higher-gain mode. Within this interval, radiance residuals were computed using the calibration coefficients listed in Table 4, following ΔL = (a_i^j · DN + b_i^j) − L, where a_i^j and b_i^j denote the calibration slope and intercept of the i-th spectral band under the j-th gain mode and L is the reference radiance. For each overlap interval, three statistical metrics—systematic bias (Bias), standard deviation (Std), and root-mean-square error (RMSE)—were used to characterize the error distribution and evaluate whether different gain settings maintain consistent radiometric responses under identical radiance conditions.
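As a minimal sketch of this overlap-region evaluation, the snippet below computes Bias, Std, and RMSE of calibrated-minus-reference radiance for samples inside a given overlap interval; the linear model L = a·DN + b, the sample values, and the function name are illustrative assumptions rather than the paper's implementation:

```python
import numpy as np

def residual_stats(samples, a, b, lo, hi):
    """Bias / Std / RMSE of calibrated-minus-reference radiance for
    samples whose reference radiance falls inside the overlap [lo, hi].
    Each sample is (dn, reference_radiance); L = a*DN + b is assumed."""
    res = np.array([(a * dn + b) - rad
                    for dn, rad in samples if lo <= rad <= hi])
    bias = res.mean()
    std = res.std(ddof=0)
    rmse = np.sqrt((res ** 2).mean())
    return bias, std, rmse

# Illustrative: one gain mode evaluated over the overlap interval [4.0, 6.0];
# the first and last samples fall outside the interval and are excluded.
samples = [(100, 3.0), (200, 4.1), (300, 5.0), (400, 5.9), (500, 7.0)]
bias, std, rmse = residual_stats(samples, a=0.01, b=2.0, lo=4.0, hi=6.0)
```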
Figure 10 illustrates the radiometric consistency across gain modes for each spectral band within their respective overlap intervals. These results provide a systematic basis for evaluating whether different gain configurations maintain comparable statistical behavior and response stability under identical radiance conditions, thereby testing the robustness of the calibration model across the instrument’s operational dynamic range. A few extreme samples are present in the overlap region; they are retained because they reflect gain-state behavior near the overlap boundary.
Across-band comparisons reveal distinct cross-mode behaviors. For Band 2, gain mode Z2 shows a clear statistical advantage in the overlap region: its RMSE, Std, and Bias are 0.1695, 0.0182, and 0.1685, respectively, all lower than the corresponding values for Z3 (0.2145, 0.0595, and 0.2060). Under the current calibration model and sample distribution, this indicates that the calibration samples for Band 2 provide denser coverage on the low-radiance side (e.g., nighttime or low-temperature conditions), so that the linear fit and noise characteristics in this radiance interval are better constrained. Since Z2 is typically used for lower-radiance observations, the present dataset suggests that low-brightness samples exert a stronger constraint on the Z2 calibration parameters, leading to smaller apparent errors and higher stability for this gain mode under the current calibration setup.
For Band 3, the overlap-region statistics show a different pattern: Z4 outperforms Z3 across all three metrics, with the RMSE decreasing from 0.2788 to 0.1971, the Std from 0.1232 to 0.0436, and the Bias from 0.2501 to 0.1923. Under the current calibration model and dataset, this indicates that the available calibration samples in the overlap region are more concentrated in the higher-radiance range (e.g., daytime or stronger surface emission), so that the high-radiance responses of Z4 are more fully constrained by the data. This feature suggests that, for Band 3 under the present calibration model and sample conditions, the high-radiance samples provide a more pronounced constraint on the Z4 fit, with system noise and potential nonlinearity being more effectively suppressed for this gain mode in the overlap interval.
Band 4 exhibits behavior similar to Band 3 within the overlap region. With only a small difference in dispersion (Std differing by about 0.006), Z4 achieves smaller Bias and RMSE than Z3. Under the present sample size and calibration procedure, this indicates that Z4 yields smaller fitting errors for the higher-radiance samples in the Band 4 overlap region.
Overall, the overlap-region statistics for the three bands jointly reflect the behavior of the current calibration parameters across different radiance levels. For Band 2, the better error statistics of Z2 indicate that, within the present dataset, nighttime or low-radiance samples provide more effective constraints for this gain mode. For Bands 3 and 4, the smaller errors of Z4 suggest that, under the current calibration model and sample distribution, high-radiance samples in the overlap regions exert stronger constraints on the corresponding fits. Taken together, these results indicate that, for GF-5A WTI under the present calibration framework and sample conditions, consistent responses and good error convergence are achieved across different radiance regimes.
4.4. Validation of Calibration Coefficients
To assess the applicability and radiometric accuracy of the derived calibration coefficients under real observational conditions, thermal infrared imagery acquired by the GF-5A WTI instrument from 1 August to 10 September 2024 was used for validation. For each scene, the WTI spectral response functions, the MODTRAN-derived atmospheric radiative-transfer components (including upward transmittance and path radiance), and in situ surface temperature and emissivity measurements were combined to compute the band-equivalent at-sensor radiance, which was then converted to the simulated brightness temperature T_sim using the Planck function. In parallel, the satellite-side brightness temperature T_sat was obtained by applying the calibration coefficients derived in Section 3.3 to the corresponding WTI observations to compute the at-sensor radiance, followed by inversion of the Planck function. Comparison of the two yields the brightness-temperature deviation ΔT_B = T_sat − T_sim.
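The brightness-temperature conversion can be sketched with a monochromatic Planck inversion at an effective band wavelength; this is a simplification of the band-integrated inversion performed with the WTI spectral response functions, and the radiance and wavelength values below are illustrative only:

```python
import math

# Physical constants for Planck's law (SI units)
H = 6.62607015e-34   # Planck constant, J*s
C = 2.99792458e8     # speed of light, m/s
KB = 1.380649e-23    # Boltzmann constant, J/K

def radiance_to_bt(radiance, wavelength_um):
    """Invert Planck's law at a single effective wavelength.

    `radiance` is spectral radiance in W / (m^2 * sr * um); this
    monochromatic form approximates the band-integrated inversion.
    """
    lam = wavelength_um * 1e-6   # um -> m
    l_si = radiance * 1e6        # per-um -> per-m
    c1 = 2.0 * H * C ** 2
    c2 = H * C / KB
    return c2 / (lam * math.log(1.0 + c1 / (lam ** 5 * l_si)))

# Deviation between satellite-retrieved and simulated brightness temperature
bt_sat = radiance_to_bt(9.0, 11.0)   # illustrative at-sensor radiance
bt_sim = radiance_to_bt(8.8, 11.0)   # illustrative simulated radiance
delta_bt = bt_sat - bt_sim
```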
After removing samples affected by cloud contamination or unstable surface temperature conditions, 201 out of 246 candidate samples were excluded, and the remaining 45 valid cases were retained for quantitatively assessing the stability and accuracy of the calibration coefficients under on-orbit conditions.
As shown in Figure 11, the brightness temperature deviations for the four thermal infrared bands exhibit strong consistency across the 45 valid samples. The deviations are small for all bands, and the maximum RMSE does not exceed 0.76 K, indicating that the calibration results achieved in this study are both accurate and reliable. The error distributions for each channel are relatively compact, further demonstrating that the calibration model remains stable and that the retrieval uncertainties are well controlled.
From a per-band perspective, although slight differences exist in deviation magnitude and distribution shape, the overall behavior is coherent across bands. Band 1 shows a nearly symmetric deviation distribution, indicating a stable radiometric response without obvious systematic shifts. Band 2 exhibits a small negative bias and achieves the smallest RMSE among all bands, suggesting that this channel currently attains the highest calibration accuracy. Band 3 presents a small positive bias with tightly clustered residuals and limited random dispersion. Band 4 behaves similarly to Band 3, consistent with a stable linear response and controlled random errors.
Overall, the mean absolute deviation and the average RMSE across all four channels remain well below 1 K. These results demonstrate that the calibration coefficients derived in this study are highly reliable and applicable under on-orbit observation conditions. Furthermore, after applying the calibration coefficients, the GF-5A WTI instrument maintains strong radiometric consistency and effective noise control across all channels.
4.5. Uncertainty Analysis
The uncertainty of brightness temperature estimation in this study primarily originates from four contributing factors: (1) contact temperature measurement, (2) surface emissivity measurement, (3) atmospheric profile acquisition, and (4) radiative transfer modeling using MODTRAN v5.2. These components collectively determine the overall accuracy of the surrogate radiometric calibration.
(1) Uncertainty of Contact Temperature Measurement
To quantify the measurement uncertainty of the contact temperature sensor, synchronous heating–cooling comparison experiments were conducted using a JM standard thermometer traceable to national metrology institutes. The results show that the RMSE between the contact thermometer and the standard reference thermometer lies within 0.173–0.177 K. This RMSE already includes practical factors such as thermal coupling between the probe and the target surface and differences in response time. Therefore, the RMSE is treated as the standard uncertainty of the contact temperature sensor under the conditions of this study:

$u_{T} \approx 0.18\ \text{K}$
(2) Uncertainty of Surface Emissivity Measurement
Surface emissivity was measured in the field using a portable Fourier-transform infrared spectrometer. As defined by Equation (5), emissivity uncertainty arises primarily from radiance measurement errors and temperature measurement errors propagated through the emissivity formulation, together with the algorithmic uncertainty of the emissivity retrieval. When these effects are propagated through the surface–atmosphere–sensor radiative transfer chain, the resulting brightness temperature uncertainty is less than 0.5 K. Therefore, the emissivity-related uncertainty is taken as:

$u_{\varepsilon} = 0.5\ \text{K}$
(3) Uncertainty of Atmospheric Profile Data
Atmospheric correction in this study relies on radiosonde profiles from the Integrated Global Radiosonde Archive (IGRA). These profiles undergo operational quality control and exhibit good temporal and spatial consistency [29,30]. Under cold and dry conditions, radiosonde relative humidity biases are typically within 5–10%. Previous studies show that a 10% relative increase in precipitable water vapor (PWV) leads to a brightness temperature deviation of approximately 0.13 K [32]. As the study area is located in an arid to semi-arid Gobi region with generally low and slowly varying PWV, the atmospheric-profile uncertainty is taken as:

$u_{\text{atm}} = 0.13\ \text{K}$
(4) Uncertainty of the Radiative Transfer Model (MODTRAN)
The uncertainty of the radiative transfer model mainly arises from band-model approximations, absorption-line parameter uncertainties, and numerical-integration discretization. Within the 8–14 μm atmospheric window, the inherent MODTRAN v5.2 model error is generally below 2%. Therefore, we apply a ±2% relative perturbation to the MODTRAN-simulated band-equivalent TOA radiance for each channel and retrieve the corresponding brightness temperature using the inverse Planck relation. The brightness–temperature differences before and after perturbation are then averaged to quantify this uncertainty term.
Based on our dataset, the mean brightness–temperature uncertainties induced by MODTRAN v5.2 are 0.9 K (Band 1), 0.9 K (Band 2), 1.2 K (Band 3), and 1.3 K (Band 4). Accordingly, the MODTRAN v5.2-related brightness-temperature uncertainty is taken conservatively as the largest band value:

$u_{\text{RTM}} = 1.3\ \text{K}$
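The perturbation-and-inversion procedure described above can be sketched with a monochromatic Planck inversion. This is a simplification: the study uses band-equivalent radiances weighted by each channel's spectral response, whereas the sketch below evaluates Planck's law at a single assumed band-center wavelength; the 10.5 µm wavelength and 300 K scene temperature are illustrative values only:

```python
import math

C1 = 1.191042e-16  # first radiation constant for spectral radiance, W·m^2·sr^-1
C2 = 1.4387769e-2  # second radiation constant, m·K

def planck_radiance(wl_m, temp_k):
    """Spectral radiance B(λ, T) in W·m^-3·sr^-1 (wavelength in meters)."""
    return C1 / (wl_m ** 5 * (math.exp(C2 / (wl_m * temp_k)) - 1.0))

def inverse_planck(wl_m, radiance):
    """Brightness temperature (K) from monochromatic radiance: inverse Planck."""
    return C2 / (wl_m * math.log(C1 / (wl_m ** 5 * radiance) + 1.0))

def modtran_bt_uncertainty(wl_m, toa_radiance, rel_err=0.02):
    """Mean |ΔT_b| induced by a ±rel_err relative radiance perturbation."""
    t0 = inverse_planck(wl_m, toa_radiance)
    t_hi = inverse_planck(wl_m, toa_radiance * (1.0 + rel_err))
    t_lo = inverse_planck(wl_m, toa_radiance * (1.0 - rel_err))
    return 0.5 * (abs(t_hi - t0) + abs(t_lo - t0))

# Example: a 2% radiance error near a 300 K scene at 10.5 µm
wl = 10.5e-6
L = planck_radiance(wl, 300.0)
dT = modtran_bt_uncertainty(wl, L, rel_err=0.02)  # ≈ 1.3 K
```

At longwave-infrared wavelengths and terrestrial temperatures, a 2% radiance error maps to roughly 1–1.3 K of brightness temperature, consistent in magnitude with the band values quoted above.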
(5) Total Uncertainty
The combined standard uncertainty of the at-sensor brightness temperature derived via the radiative transfer chain is computed using the root-sum-of-squares (RSS):

$u_{\text{total}} = \sqrt{u_{T}^{2} + u_{\varepsilon}^{2} + u_{\text{atm}}^{2} + u_{\text{RTM}}^{2}}$

Substituting the adopted values $u_{T} \approx 0.18\ \text{K}$, $u_{\varepsilon} = 0.5\ \text{K}$, $u_{\text{atm}} = 0.13\ \text{K}$, and $u_{\text{RTM}} = 1.3\ \text{K}$ yields:

$u_{\text{total}} \approx 1.41\ \text{K}$
Overall, brightness temperature retrievals based on the radiative transfer model exhibit a conservative combined standard uncertainty of approximately 1.41 K under the conditions of this study. Among the individual contributors, the MODTRAN-related uncertainty dominates the total error budget, followed by surface emissivity uncertainty, while uncertainties from contact temperature measurements and atmospheric profiles are comparatively minor. This indicates that even in dry, low-water-vapor environments, radiative transfer modeling uncertainty can be a primary contributor to the absolute brightness–temperature uncertainty when propagated in the brightness–temperature domain.
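The RSS combination can be verified numerically. The snippet below takes the emissivity and atmospheric terms quoted in this section (0.5 K and 0.13 K), the contact-thermometer term rounded to ≈0.18 K, and the MODTRAN term at its largest band value (1.3 K); these last two are assumptions consistent with the stated 1.41 K total:

```python
import math

def combined_standard_uncertainty(components):
    """Root-sum-of-squares combination of independent uncertainty terms (K)."""
    return math.sqrt(sum(u * u for u in components))

u_contact = 0.18  # contact temperature measurement (K)
u_emiss = 0.50    # surface emissivity (K)
u_atm = 0.13      # atmospheric profile (K)
u_rtm = 1.30      # MODTRAN radiative transfer model, largest band value (K)

u_total = combined_standard_uncertainty([u_contact, u_emiss, u_atm, u_rtm])
# u_total ≈ 1.41 K; the RTM term alone contributes the bulk of the budget
```

Because the components add in quadrature, the largest term (here the radiative-transfer model) dominates: removing the two smallest terms would change the total by only a few hundredths of a kelvin.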
5. Conclusions
In response to the need for long-term stability and high-frequency calibration of TIR satellite sensors, this study developed a continuous, automated ground-based observation system centered on Gobi pseudo-invariant calibration sites and dedicated in situ observations. The framework has been validated using the GF-5A WTI instrument through comprehensive analyses of calibration-coefficient stability, inter-gain consistency, and on-orbit brightness–temperature comparisons.
Three calibration sites located in arid to semi-arid Gobi regions were selected through comprehensive screening based on terrain flatness, cloud-occurrence probability, and TIR spatial uniformity. Using metrology-traceable reference thermometers together with dual validation from Turbo FT emissivity measurements and ISSTES-based temperature–emissivity separation, the deployed automated ground system was demonstrated to provide a reliable, traceable surface-temperature reference suitable for long-term TIR radiometric calibration.
By integrating high-accuracy emissivity measurements, IGRA atmospheric-sounding profiles, and MODTRAN v5.2-based radiative-transfer modeling, an end-to-end “surface–atmosphere–sensor” calibration chain was established, enabling routine updates of on-orbit gain and offset parameters at a cadence compatible with operational practice and supporting a physically consistent, traceable radiometric link between ground measurements and satellite observations.
Results derived from GF-5A WTI indicate that the linear calibration coefficients exhibit highly stable responses across all bands and gain states, with consistently high coefficients of determination. Overlap-region analyses further show that inter-gain brightness–temperature differences (bias and RMSE) are generally well constrained. Band-dependent behaviors are also observed: Band 2 calibration benefits from low-radiance samples (nighttime or low-temperature conditions), whereas Bands 3 and 4 achieve improved model fitting under high-radiance conditions (daytime or warm-surface cases). Independent on-orbit image validation confirms that the RMSE for all bands remains below 0.76 K, which is smaller than the conservatively estimated total uncertainty of approximately 1.41 K derived from the full radiative chain.
Overall, the proposed continuous ground-based surrogate calibration approach provides GF-5A WTI with a low-maintenance, traceable, and operationally practical on-orbit calibration solution. The methodology offers a reproducible technical pathway for quality assurance, long-term stability assessment, and cross-sensor or multi-mission consistency analysis for future high-resolution thermal infrared satellite missions.