Next Article in Journal
Three-Dimensional Reconstruction and Geometric Morphology Analysis of Lunar Small Craters within the Patrol Range of the Yutu-2 Rover
Next Article in Special Issue
Effective Improvement of the Accuracy of Snow Cover Discrimination Using a Random Forests Algorithm Considering Multiple Factors: A Case Study of the Three-Rivers Headwater Region, Tibet Plateau
Previous Article in Journal
Interframe Saliency Transformer and Lightweight Multidimensional Attention Network for Real-Time Unmanned Aerial Vehicle Tracking
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Physics-Statistical Coupled Paradigm for Retrieving Integrated Water Vapor Content Based on Artificial Intelligence

1
School of Surveying and Geo-Informatics, Shandong Jianzhu University, Jinan 250100, China
2
Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing 100081, China
3
School of Physics and Electronic-Engineering, Ningxia University, Yinchuan 750021, China
4
National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China
5
Department of Watershed Sciences, Utah State University, Logan, UT 84322, USA
6
Department of Civil and Environmental Engineering and Water Resources Research Center, University of Hawaii at Manoa, Honolulu, HI 96822, USA
7
School of Public Administration and Law, Northeast Agricultural University, Harbin 150006, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Remote Sens. 2023, 15(17), 4250; https://doi.org/10.3390/rs15174250
Submission received: 26 July 2023 / Revised: 24 August 2023 / Accepted: 25 August 2023 / Published: 30 August 2023

Abstract

:
Retrieval of integrated water vapor content (WVC) from remote sensing data is often ill-posed because of insufficient observational information. There are many factors that cause WVC changes, which yield instability in the accuracy of many traditional algorithms. To overcome this problem, we developed a novel fully-coupled paradigm for the robust retrieval of WVC from thermal infrared remote sensing data. Through the derivation of the physical radiative transfer equation, we determined two conditions that need to be satisfied for the deep learning retrieval paradigm of WVC. The first condition is that the input parameters and output parameters of the deep learning need to be able to build a complete set of solvable equations in theory. The second condition is that, if there is a strong relationship between input parameters and output parameters, it can be directly retrieved. If it is a weak relationship, we need to use prior knowledge to improve the portability and accuracy of the algorithm. The training and test data of deep learning is composed of representative solutions of physical methods and solutions of statistical methods. The representative solutions of the physical methods were obtained from the physical forward model, and the statistical solutions were obtained from multi-source data which can compensate for the defect that the physical model cannot simulate mixed pixels. MODIS L1B data was used for case analysis of paradigm retrieval, and the analysis indicated that four thermal infrared bands were usually needed as the input parameters of deep learning and the integrated water vapor content as the output parameter. When land surface temperature and emissivity were taken as prior knowledge, the root-mean-square error (RMSE) of the retrieved WVC was 0.07 g/cm2. The optimal accuracy RMSE was 0.27 g/cm2. When there was a strong correlation between input parameters and output parameters, i.e., if there were two bands that were very sensitive to WVC in the band combination, high-precision retrieval could also be achieved without prior knowledge. All the analyses show that the paradigm of deep learning coupling physics and statistics can accurately retrieve WVC, which is a significant improvement on the traditional method and solves the problem of lack of physical interpretation of deep learning.

Graphical Abstract

1. Introduction

Temporal and spatial variations of integrated water vapor content (WVC) influence the transfer of solar radiation, thus affecting many weather and climate change processes [1]. For example, variation in WVC affects global heat circulation, water circulation, and radiation energy balance [2]. WVC is the most influential variable in the atmosphere on solar radiation. In the process of radiative transfer, integrated water vapor content absorbs and releases surface long-wave radiation, and latent heat through phase transformation, which impacts air and ground temperatures. Retrieval of WVC plays an important role in drought monitoring [3,4], climate change [5], land surface temperature retrieval [6], and other studies.
Common methods of estimating WVC include radiosonde technology, the Global Navigation Satellite System (GNSS), and remote sensing techniques. Traditionally, the study of WVC has mainly relied on radiosonde technology [6], where WVC is determined through profiling measurements of temperature, relative humidity, and total pressure. However, radiosonde methods often have high operating costs, low sounding frequency, and large distances between launch points relative to the spatial variation of WVC [7], and therefore are only practical for small-scale use. In recent years, gradual improvements in the GNSS have increased its potential for WVC detection. A GNSS signal experiences tropospheric wet delay when it travels through the neutral atmosphere between satellite and receiver, and can achieve mm-scale accuracy [8,9,10]. This method is based on the upward detection of water vapor from the surface and is not affected by complex surface radiation and reflection information. Using GNSS, continuous WVC monitoring is possible from fixed stations on the earth’s surface [11,12]. GNSS monitoring of WVC is not limited by weather [13] and is widely used in the evaluation of water vapor products [14,15,16]. Unfortunately, the distribution of detectors is often uneven and cannot accurately resolve gradient changes of water vapor on the horizontal scale, which limits its utility. Satellite remote sensing can quickly and widely obtain large-scale spatial information, which can improve the spatio–temporal analysis of WVC at regional scale [17]. Based on the absorption characteristics of water vapor, remote sensing uses visible-near infrared (VNIR) [18,19], thermal infrared (TIR) [20] bands and microwave bands [15,21,22] for the estimation of WVC. The rapid development of remote sensing technology has yielded an abundance of data and methods for retrieving WVC.
Statistical, physical, and machine learning methods are used to obtain WVC through remote sensing. Many statistical methods rely on the regression of brightness temperature (BT) and WVC. This has the advantage of simplicity, requiring only a few parameters, but the physical meaning is not very clear and the models are often not highly portable. Through the remote sensing method, we cannot accurately obtain all the parameters affecting WVC retrieval. In order to reduce this ill-posed problem and improve the accuracy, principal component regression [23,24], empirical regression [23], Bayes [25,26], and other methods have been proposed. The surface types in different regions are different, so the retrieval method obtained by using statistical methods is mainly suitable for local regions. Although the physical method is based on energy balance and has high retrieval accuracy [27,28], it requires many parameters. Due to the insufficient information for remote sensing observation, the number of unknowns is more than the number of equations, which forms an “ill-conditioned” problem. In order to reduce the unknowns, thermal infrared remote sensing usually used the split window algorithm (two thermal infrared bands) to retrieve the WVC [29,30,31], which was also affected by the surface type. The principle of this method is that, under the premise that integrated water vapor content has spatial invariance in a specific region [32], using the linear approximation of the Planck function to the brightness temperature difference, linear approximation is achieved of the relationship between brightness temperature difference and absorption of WVC. However, this requires that the WVC is not too high and that the land surface specific emissivity does not vary much between bands. To improve the accuracy of integrated water vapor content retrieval, Liu et al. [20] and Hu et al. [33] added a water vapor band to the traditional split window algorithm and proposed a three-band algorithm, which improved the retrieval accuracy and reduced the uncertainty of land surface emissivity under dry atmospheric conditions (i.e., WVC < 2 g/cm2). Machine learning methods often use neural networks to obtain a relationship between WVC and various input parameters [34,35,36]. The accuracy of these methods depends on the accuracy of the training and testing datasets. The advantage of machine learning is that it can handle nonlinear calculations to obtain an approximation of the optimal solution. However, it is difficult for these methods to give a specific explanation of physical relations between input variables and output parameters, so they are often regarded as “black boxes”.
In order to overcome the shortcomings of traditional methods, this study constructs a retrieval paradigm of water vapor content based on deep learning (DL) coupled physical and statistical methods. Firstly, the basic conditions for determining the DL paradigm were derived based on physical methods, and then the representative solutions of the physical methods were obtained using forward physical models and the solutions of statistical methods were obtained using multi-source data. Finally, DL used the solutions of physical and statistical methods as training and test data to achieve the purpose of coupling, and analyzed whether the retrieval paradigm needs to use land surface temperature (LST) and land surface emissivity (LSE) as prior knowledge to improve the retrieval accuracy and algorithm portability.

2. Methodology

The DL method is essentially the same as other methods utilized in order to improve the accuracy of target parameters. If we want to obtain a general inversion paradigm, we must prove that the output parameters of DL can be uniquely determined by the input parameters. To construct a WVC retrieval paradigm of DL with physical interpretation, we used the thermal radiation energy balance equation to determine the relationship between the input and output parameters of DL, and ensured that a complete set of retrieval equations can be constructed theoretically between the input and output parameters. The retrieval paradigm is shown in Figure 1. Firstly, based on the derivation of the energy radiation balance equation, it is theoretically determined how many equation sets (physical method) need to be constructed to solve the integrated water vapor content, that is, how many input parameters need to be determined for DL, and then used the forward model MODTRAN simulation to obtain the representative solution of the physical method. Secondly, the forward model simulates pure pixels, and representative solutions are needed for large-scale mixed pixels. Based on the derivation of the physical model, we built a fuzzy statistical method, and used the multi-source data to obtain a reliable statistical method solution as a supplement to the physical method solution. Thirdly, the solutions of physical and statistical methods formed the training and testing database of DL, thus realizing the purpose of coupling physical and statistical methods of DL. Fourthly, the information for the TIR window band under clear sky conditions is mainly affected by LST and LSE, and the retrieval accuracy of LST and LSE is high. Therefore, in order to improve the accuracy of WVC retrieval in the TIR window band and the portability of the algorithm, we can take the LST and LSE as a priori knowledge. Here we used two sets of models for comparative analysis, one of which added LST and LSE as prior knowledge, and the other group had no prior knowledge. Fifthly, we used site observation data for precision verification and cross validation with MODIS products.

3. Materials and Methods

3.1. Data

The retrieval paradigm of deep learning coupling physics and statistical methods proposed in this study is realized by composing the representative solutions of physical methods and statistical methods into a training and testing database, so the acquisition of both physical methods and statistical methods solutions (WVC) is the key. We used the classical atmospheric radiation transfer model MODTRAN to simulate the thermal radiation transfer in order to obtain the physical method solution, and used the multi-source data to obtain the statistical method solution, so as to make up for the defect that it is difficult to simulate the mixed pixel with the former model.

3.1.1. Simulated Data

MODTRAN is a medium spectral resolution radiative transfer model, which is widely used and has high accuracy [37]. It can simulate the entire radiative transfer process, by setting a variety of land-surface, atmospheric, and instrumentation parameters. Through radiative transport simulations, every transport parameter in the forward process of a single thermal infrared band is known. For this study, atmospheric WVC is the target parameter of the physical method (the solution of the equation). The mid-latitude summer atmospheric reference model is selected as the input model of MODTRAN. For the required parameters, we used 17 surface types, including mainly soil, vegetation, water and rock. Other parameter settings are as follows: (1) surface temperature: the training data range is 280–325 K, with a step interval of 2 K; the test data range is 281–323 K, with a step interval of 3 K; (2) integrated water vapor content: the training data range is 0.2–4.5 g/cm2, with a step interval of 0.3 g/cm2; the test data range is 0.3–3.8 g/cm2, and the step interval is 0.4 g/cm2; (3) the observation angle range of the satellite ranges from 0 to 65°, with step intervals of 3°. Based on the different features of MODIS bands 27, 28, 29, 31, and 32, the model was set up and iteratively simulated to obtain the brightness temperature at satellite. The various parameters of the simulation process are known, so we can obtain solutions for physical methods under different conditions, which can form a part of the deep learning training and testing data. In addition, when the observation angle is relatively large, it is necessary to exclude some simulated values because the water vapor on the oblique path is already large (>5 g/cm2) and there is not much information obtained through thermal infrared remote sensing.

3.1.2. Remote Sensing Data

MODIS remote sensing data was selected as case analysis data, which is a new generation of “union of imagery and spectrum” optical remote sensing instruments. The program consists of two polar-orbiting satellites of the EOS (Earth Observation System) in the United States, called ‘Terra’ and ‘Aqua’, and can provide day and night images of any point on the earth’s surface every 1–2 days, with a spatial resolution of 1 km at nadir, and 5 km at the edge of the scan [38]. MODIS has lower instrument noise and narrower spectral response functions (SRF) [39], and can provide high quality data. The sensor has 36 bands (0.62–14.38 μm), with multiple TIR bands, so that multiple combination analysis can be performed. The MODIS sensor also has multiple TIR bands suitable for WVC retrieval, including band 27 (centered at 6.72 μm), which is located near the center of the water vapor absorption region, monitoring radiation from the upper atmosphere. Band 28 (centered at 7.33 μm) and band 29 (centered at 8.55 μm) are closer to the wing of the absorption zone, which enables the sensor to detect radiation from the lower layer [20]. Band 31 (centered at 11.03 μm) and band 32 (centered at 12.02 μm) are in the atmospheric window, which is mainly affected by water vapor content. Furthermore, MOD/MYD11 products include LST and LSE, and the error of LST is typically 1 K under stable atmospheric conditions, with LSE products under 0.02 [40]. Due to the different emissivity of different surface types, the TIR long-wave radiation is quite different [41]. Therefore, in order to obtain the surface type information, we took the band emissivity as prior knowledge. The MOD/MYD05 product contains WVC products retrieved by the VNIR ratio method and the TIR split window algorithm, and the accuracy of MOD/MYD05 has been widely validated [42,43,44].

3.1.3. WVC Assimilation Product

ERA5 data is fifth-generation atmospheric reanalysis data for the global climate, established by the European Center for Medium-Range Weather Forecasts (ECMWF) [45]. Developed from ERA-Interim data, ERA5 data is assimilated with worldwide observations to form a complete and consistent global dataset. It provides many meteorological elements, including hourly integrated water vapor content with a spatial resolution of 0.25°. Since the release of the ERA5 reanalysis data, many researchers have tested for accuracy [16,46]. The accuracy of the DL algorithm mainly depends on the training and test data. To improve the accuracy of the collected data, we used ERA5 data as the reference calibration data. The data can only be used when the accuracy of different data products is consistent.

3.1.4. Ground Observations Data

Ground-observed WVC data are also important complementary datasets. In situ WVC data from 2020 was used to build a training/testing database and evaluated existing datasets and new products. These data were obtained from Suomi Net, which is a real-time national Global Positioning System (GPS) network. A series of stations has been set up in North America. There are about 800 stations in total, and each station uses GPS signals to retrieve integrated water vapor content every 30 min, with an accuracy of 1–2 mm [47]. Table 1 lists the ground-based and satellite datasets used in this study.

3.1.5. Data Processing

The accuracy of WVC retrieval depends on the quality of training and testing datasets. The forward physical model is used mainly to obtain the representative solution of the physical method. The parameters of the model simulation are determined, and the data accuracy is very high, but it cannot represent all the real situations, so it is necessary to supplement the solution of the statistical method. The solution of the statistical method comes from multi-source data, so to ensure the consistency of the parameters of the statistical solution it is necessary to ensure that the collected data are synchronized in time and space. To ensure the quality of collected MODIS data products, we have eliminated invalid data due to cloud coverage and other reasons according to the quality control documents of MODIS (including MOD/MYD05 and MOD/MYD11). With ERA5 data as a reference, all data were collected at a resolution of 0.25°, and the time was consistent. The data will be selected only when the accuracy of the two integrated water vapor content products is consistent.

3.2. Methods

In order to make the algorithm for retrieving WVC based on DL have physical significance and form a general paradigm, here we derive the physical and statistical methods based on the radiative transfer energy balance equation, and elaborate the coupling mechanism of the formation of the paradigm.

3.2.1. The Physical Method

The long wave heat radiation transfer process is shown in Figure 2. The derivation of the physical method for WVC retrieval is based on the thermal radiance of the ground and its transfer from the ground through the atmosphere to the remote sensor. Generally speaking, the ground is not a blackbody, and ground emissivity has to be considered for computing the thermal radiance emitted by the ground. The atmosphere has important effects on the received radiance at remote sensor level.
The thermal infrared band is mainly affected by the WVC, which attenuates the thermal radiation energy emitted from the ground. Under clear sky conditions, surface energy radiation changes with different surface types. LST affects the change in near-surface temperature. The absorption of heat by integrated water vapor content will change the atmospheric profile temperature, which will affect the integrated water vapor content saturation. In addition, due to the continuous exchange of energy in the earth–atmosphere system, the vertical density distribution of the atmosphere is also uneven. The upward and downward transmissions are different. Therefore, atmospheric radiation is often expressed in two forms (upward and downward). The radiance value of the sensor at a certain angle can be expressed as Equation (1).
B i T i = ε i B i T s τ i W V C , θ + R i ( θ ) + ( 1 ε i ) τ i ( W V C , θ ) R i ( θ )
where T i is the BT at the satellite of the band i , B i ( T i ) is the radiation received by the sensor, B i T s represents the ground radiation, T s is the LST, τ i ( W V C , θ ) is the atmospheric transmittance in band i at the observation angle θ, ε i is the LSE in the band i , and R i ( θ ) and R i ( θ ) are the downward and upward atmospheric radiances, respectively.
In Franc and Cracknell [14], the upward atmospheric radiation R i ( θ ) and downward atmospheric radiation R i ( θ ) were expressed as:
R i ( θ ) = 0 Z B i T z τ i ( θ , z , Z ) z d z
R i ( θ ) = 2 0 π 2 h 0 B i T z τ i ( θ , z , 0 ) z cos θ sin θ d z d θ
where T z is the atmospheric temperature at height z , Z is the sensor height, θ is the upward direction of atmospheric radiation, and τ i ( θ , z , Z ) represents the upward atmospheric transmittance from height z to the sensor. θ is the downward direction of atmospheric radiation, h is the atmospheric height, and τ i ( θ , z , 0 ) represents the downward atmospheric transmittance from height z to the surface. Qin et al. [48] used the median value theorem to process the upward and downward atmospheric radiation. With this we obtain
R i ( θ ) = ( 1 τ i W V C , θ ) B i ( T a )
R i ( θ ) = ( 1 τ i W V C , θ ) B i ( T a )
where T a and T a are the effective average atmospheric temperature upward and downward respectively, B i T a and B i T a represents the effective average atmospheric radiation of T a and T a in band i . Next, Qin [48] found that substituting T a for T a had no significant impact on accuracy. Therefore, Equation (1) can be written as,
B i T i = ε i B i T s τ i ( W V C , θ ) + ( 1 + ( 1 ε i ) τ i ( W V C , θ ) ) 1 τ i ( W V C , θ ) B i T a
Equation (6) indicates that the radiative transfer formula of each band has four unknowns ( τ i , T s , T a , ε i ). For N bands, there will be 2N + 2 unknowns ( τ and ε of N bands, T s and T a ). We need to reduce variables according to the geophysical relationship between them. The atmospheric transmittance of the TIR band is mainly affected by the integrated water vapor content and other gases (O), and the transmittance can be expressed as Equation (7).
τ i ( W V C , θ ) = f ( W V C , O )
The change in thermal infrared band transmittance is mainly affected by the change in integrated water vapor content, because other gas components in the atmosphere are relatively stable. Therefore, the transmittance of the TIR band is mainly a function of integrated water vapor content change. The emissivity of different bands is different, but the emissivity curve of surface type is stable. Therefore, as long as the ground surface type (G) is determined, the emissivity change curve can be determined. Therefore, the emissivity of different bands is a function of the surface type, which can be expressed by Equation (8).
ε i = f ( G )
From the above derivation and analysis, Equation (6) can be simplified into four unknowns (WVC, LST, G, and T a ). To solve the equations, at least four TIR bands are needed to construct the equations. Mao, et al. [49] found that there was an approximate restrictive relationship between the average atmospheric action temperature ( T a ) and the brightness temperature ( T i ) at satellite, as shown in Equation (9).
T a A + B × T i
where A and B are constants that are limited by geographical conditions. However, the constraint relation cannot be determined exactly, which leaves uncertainty in the solution by traditional methods. If the calculation method is optimized by deep learning, the equation can be reduced to three unknowns (WVC, LST and G), which means that the equation set can be constructed with only three bands at least, but the accuracy may be slightly affected.
In summary, the number of unknowns in Equation (6) has been reduced to four (WVC, LST, G, and T a ). In mathematical equations, if the only solution is found, the number of equations is greater than or equal to n (here there are four unknowns). Therefore, when using physical methods to retrieve WVC, at least four TIR bands are needed to construct the equations. Since high-precision LST and LSE products can be retrieved, we used LST and LSE as prior knowledge to improve the stability and accuracy of the WVC inversion algorithm. In addition, in the previous derivation, we found that, if we use the relationship between the average effective temperature of the atmosphere and the BT at satellite which ensures that the BT information on the satellite mainly comes from the atmosphere, we can also use only three TIR bands to construct the retrieval equation. The above analysis was the process that we used, utilizing physical logic reasoning to determine the minimum number of input parameters required by DL.

3.2.2. The Statistical Method

Theoretically, if we can find all the representative solutions of physical methods in the real world, we can directly use the representative solutions of physical methods as the training and test database for DL to perform retrieval optimization calculations. In fact, the forward model MODTRAN cannot simulate all situations, especially the mixed pixel situation. Therefore, we need to collect other representative solutions, which we call statistical solutions. This statistical method also needs to meet the basic conditions of physical methods, that is, at least four or three bands of BT as input parameters and of WVC as output parameters, and also needs to obtain the corresponding LST and LLSE information. Therefore, the fuzzy function of the statistical method can be expressed as Equation (10).
W V C = i = 1 n Π f ( T i ) + b
where f ( T i ) is a polynomial function of BT, n   ( n 3 ) is the number of TIR bands used for retrieval, Π is an empirical coefficient, which can be determined by the contribution of the infrared band in WVC retrieval, and b is a constant. Statistical methods are essentially the same as physical methods, and the solutions with statistical methods are obtained from multi-source data. The statistical solution is a beneficial supplement to the physical method solution, especially in the case of mixed pixels.

3.2.3. Physics-Statistical Coupling Deep Learning Based on Prior Knowledge

The logical derivation of physical methods is to determine that there is a unique mathematical function relationship between the input parameters and the output parameters of DL. The statistical method is to supplement the lack of representativeness of the simulation solution of the physical forward model. We combined the solutions of physical methods and statistical methods into the training and testing database of DL, thus realizing the purpose of coupling physical and statistical methods. The DNN can model complex nonlinear relationships between input and output variables [50]. The DNN structure is shown in Figure 3; a complete DNN includes an input layer, an output layer, and a number of hidden layers. The activation function can not only introduce nonlinear expression ability, but also improve the robustness of the model and alleviate the problem of gradient disappearance. In this study, we chose the common function ‘Sigmoid’ as the activation function of DNN (shown in Equation (11), where x is the output value of the previous hidden layer node). The representation of the function in the neural network is carried out by a group of independent neurons, which learn from the input during the training phase. After all patterns had been presented, the interconnecting synaptic weight of each neuron was adjusted [51]. The DNN constructed in this study adopted Kalman filter technology, which greatly improves the convergence speed in the learning stage and the separability of nonlinear boundary problems. Moreover, the training time was greatly shortened and the precision of the neural network was improved [52].
f ( x ) = 1 1 + e x

3.2.4. Physics-Statistical Coupled Deep Learning Retrieval Scheme

A specific scheme for retrieving WVC using the PSP–DL model was presented. Firstly, we used MODTRAN simulated atmospheric parameters (training data: 126,720, test data: 59,400) to verify the availability of the physical method, and obtained the theoretical accuracy of WVC retrieval and the optimal scheme for MODIS retrieval of atmospheric water vapor. Secondly, the accuracy of the statistical method was verified by adding reliable WVC data (training data: 25,805, test data: 11,803). Radiometric calibration, geometric correction and atmospheric correction were used to preprocess MODIS images to obtain more accurate spectral information, and to obtain the brightness temperature of MODIS and the WVC of the corresponding image.
According to the band design purpose of MODIS and the position of the band center wavelength, we have determined a representative band combination. Two combination models were constructed (Table 2). The first was a combination with prior knowledge (BT in thermal infrared bands 27, 28, 29, 31 and 32, and LST and LSE in bands 29, 31 and 32). The second was the combination without prior knowledge (BT in thermal infrared bands 27, 28, 29, 31 and 32).

4. Results and Validation

4.1. Theoretical Accuracy Validation and Analysis

We demonstrated the solution (retrieval) of the physical method (Equation (6)) using simulation data and DL. The information obtained by satellite sensors includes surface radiation and reflection information, which is mainly determined by the type of surface. Therefore, we also used the prior knowledge of LST and LSE to solve Equation (6) of WVC, i.e., used accurate and reliable prior knowledge with high-precision BTs as the input from DL to retrieve WVC. We have carried out inversion and comparative analysis on the combination with and without prior knowledge and selected the mean absolute error (MAE), root mean square error (RMSE) and R2, which can represent the inversion accuracy, to find the best WVC inversion combination. The DL retrieval result with prior knowledge combination is shown in Table 3, Table 4 and Table 5 (Model 1) and the retrieval combination without prior knowledge is shown in Table 6, Table 7 and Table 8 (Model 2).
For model 1, WVC retrieval accuracy was good and stable using four and five bands (results shown in Table 3, Table 4 and Table 5). By comparing these results, it was found that the retrieval accuracy using band 27 was better than that using band 28. This is likely to be due to the strong absorption zone of integrated water vapor content in band 27, which can include more water vapor information for the whole layer. The highest accuracy was obtained with the 27, 28, 31, and 32 band combination, which yielded an MAE = 0.05 g/cm2, RMSE = 0.07 g/cm2, and R2 = 0.993, when the number of hidden layers was seven and the number of nodes per layer was 800. Through analysis, WVC retrieval accuracy of the band combination 27, 28, 29, 31, and 32 was similar to retrieval with band 27, 28, 31 and 32 with the use of prior knowledge. Therefore, data redundancy and retrieval band quality should also be taken into account.
Model 2 was designed to compare the effect of using prior knowledge on the retrieval accuracy, and used the same band combinations as model 1, but without prior knowledge input, and the retrieval process was the same as for model 1. In order to find the retrieval results with the highest accuracy, different combinations of hidden layers and number of neurons were tried (results shown in Table 6, Table 7 and Table 8). Compared with Table 3 and Table 6, the minimum MAE of WVC retrieval using LST and LSE and BTs of bands 27, 29, 31 and 32 as input parameters was 0.07 g/cm2, and the minimum MAE of retrieval error using only BTs of bands 27, 29, 31 and 32 as input parameters was 0.16 g/cm2, which shows that the increase of prior knowledge reduces MAE by 0.09 g/cm2. Compared with Table 4 and Table 7, when bands 28, 29, 31 and 32 were combined, the MAE with prior knowledge was 0.11 g/cm2 lower than that without prior knowledge. It can be seen from Table 8 that, when the number of hidden layers was 6 and the number of nodes in each layer was 500, the minimum retrieval error was the combination of bands 27, 28, 31 and 32, MAE = 0.06 g/cm2, RMSE = 0.08 g/cm2, R2 = 0.997. From the comparison between Table 5 and Table 8, it is indicated that, for the combination with 27, 28, 31 and 32, the error with prior knowledge was only 0.01 g/cm2 lower than that without prior knowledge. The main reason is that the BT of bands 27 and 28 is very sensitive to the integrated water vapor content and is little affected by the ground. Therefore, for the TIR band whose band combination of BT at satellite is greatly affected by the ground, the retrieval accuracy can be greatly improved by using prior knowledge. On the contrary, for the TIR band combination that is less affected by the ground and mainly affected by the integrated water vapor content, the ground prior knowledge may not be required.
The accuracy of WVC retrieved by DL for different combinations was generally higher than that of traditional methods. To better evaluate the error distribution of different model combinations, we have analyzed the error distribution, as shown in Figure 4 and Figure 5. The error of model 1 was relatively small, the error fluctuation range was small, and the retrieval result was stable. Due to the high accuracy of retrieval using prior knowledge, data points basically overlap. Taking 2 g/cm2 as the center, the retrieval error increases at both ends, mainly because the amount of training and test data at both ends was relatively small. From the analysis, we can see that the retrieval error using prior knowledge was more stable and smaller. When there are high-precision LST and LSE products, prior knowledge is a better choice. However, if there is no LST and LSE or their accuracy cannot be guaranteed, prior knowledge can also be omitted. In addition, although the accuracy of the retrieval combination of five thermal bands was slightly higher than that of four bands, considering the instrument band setting, we recommend using four bands. In practical application, we should consider the availability, accuracy and redundancy of data according to the thermal infrared band setting of the sensor. In the future, the instrument band design can be optimized according to our analysis.

4.2. Practical Validation and Analysis

The simulation data mainly represent pure pixels, so to make the retrieval results more representative, we supplemented with corresponding high-precision statistical data. The retrieval calculation and accuracy evaluation, after the simulation data were supplemented with statistical data, were similar to the analysis in Section 4.1 and will not be repeated here. So far, the method for retrieving integrated water vapor content using LST and LSE as prior knowledge and DL coupled physical and statistical methods has been completed. MODIS image of southern North America was selected as the case study, and the study scope is shown as Figure 6. The study area is located within 14°–49°N and 66°–124°W, and features a variety of land surface types, including deserts, basins, plains, hills, plateaus, and mountains. The terrain is relatively high in average elevation in the west and low in the east. The continental topography is characterized by north–south mountain ranges on both east and west coasts. Temperate maritime climate with abundant rainfall and Mediterranean climate are characteristic of the coastal mountains over much of the west coast. The windward slope is affected by the cold ocean current, keeping temperatures cooler, while the Mediterranean climate has less rain in the high-temperature period and more rain in the low-temperature period. The central plain of the United States has a temperate continental climate, and the precipitation changes dramatically with the seasons. The eastern and southern regions have a subtropical monsoon humid climate, and are the sites of many ground observations and the prior knowledge information used in this study.

4.2.1. Cross Validation

Two MODIS images were selected for the PSP–DL method, to demonstrate the retrieval of WVC. One was a MODIS VNIR integrated water vapor content product image dated 10 August 2020, and the other was a MODIS TIR integrated water vapor content image from the same day. The training database used MOD/YD 021KM image data and corresponding WVC products, and the image data ensured that there were valid data every month. The band combination with the best accuracy was selected for verification, namely the BTs of 27, 28, 31, and 32 thermal infrared bands, with LST and LSE used as input parameters of PSP–DL.
To comprehensively evaluate the accuracy of the PSP–DL method, we compared the WVC results retrieved by PSP–DL with MODIS water vapor products. Figure 7 and Figure 8 show that the WVC results retrieved by PSP–DL were consistent with the general trends of the WVC products, and that WVC increased gradually from northwest to southeast, consistent with the distribution of climate regions. However, the accuracy of retrieval was low in relatively dry and humid environments. The main reason here is that there is no continuous data at both ends of the function curve when using DL to solve the function curve of the solution of the equation system, so the accuracy at both ends is slightly lower.
Figure 9 and Figure 10 are the results of image cross-validation, which show a good overall spatial comparison between different products. Further analysis revealed that the combination of using four bands with prior knowledge was better than the combination using four bands alone. The error between the two WVC products and the WVC retrieved by the PSP–DL method was mostly between (−0.5 g/cm2, 0.5 g/cm2). Compared to the MODIS-derived WVC, the highest MAE and RMSE of WVC retrieved by DL method were 0.22 g/cm2 and 0.28 g/cm2. The values with poor retrieval accuracy were largely at the boundaries of cloud pixels, because some of these pixels contain cloud. The central valley of the northern coastal mountains was affected by the subtropical high-pressure belt, and the summer was characteristically hot and dry. The area was mainly irrigated agriculture, and irrigation promotes evapotranspiration, which increased atmospheric humidity during the growing season and summer. The change of LSE after irrigation can lead to a decrease in LST retrieval accuracy, which will reduce the retrieval accuracy of WVC. Comparative analysis showed that our algorithm has a high consistency with other products, and had certain advantages over those other products, such as situational adaptation through the supplementing of high-precision samples. To further improve the accuracy, we can build training and testing databases for different regions and add more high-precision data.

4.2.2. In Situ Validation

WVC estimates obtained by GPS are considered the most reliable [53], although there are temporal and spatial matching problems between remote sensing images and validation data. When the satellite transit time was closest to the GPS observation time, the relevant data was collected. In space, GPS is the point observation data of the surface, and there is a gap between the WVC within several kilometers corresponding to a pixel of a remote sensing image. To ensure data accuracy, GPS data were used only when the WVC values from the GPS observation points were close to ERA5 and MODIS data. We selected 255 observation points with a variety of surface types in southern North America, covering the period from January to December 2020, and extracted the pixels of the inversion corresponding to the latitude and longitude of the observation stations. As shown in Figure 11, the MAE of WVC retrieved by the combination of “LST and LSE + 4BTs” and ground synchronous data was 0.23 g/cm2, with RMSE = 0.27 g/cm2 and R2 = 0.921. The MAE of WVC retrieved by combination of “4BTs” and ground synchronous data was 0.25 g/cm2, with RMSE = 0.31 g/cm2, and R2 = 0.894. It can be seen from the analysis that adding prior knowledge did not significantly improve the retrieval accuracy, but the retrieval accuracy was more stable. Theoretically, the retrieval algorithm is more transplantable with ground prior knowledge. When there are bands that are very sensitive to the integrated water vapor content (such as bands 27 and 28), we can obtain relatively high retrieval accuracy without using LST and LSE as prior knowledge. If the satellite sensor has no TIR band that is very sensitive to the integrated water vapor content, it is more appropriate to use LST and LSE as prior knowledge to retrieve the integrated water vapor content.

5. Discussion

Sensitivity analysis is necessary in many applications where knowledge of probable WVC estimation due to possible errors in parameter determination is generally desired. In order to evaluate the impact of parameter errors on WVC retrieval, we need to perform a sensitivity analysis for our algorithm. We utilized Equation (12) to compute the probable WVC estimation error as follows.
Δ W V C = W V C x + Δ x W V C x
For our algorithm, the land surface radiation is the main influence factor for the accuracy of WVC. In this study, we utilized the MODTRAN simulation to analyze the sensitivity of LST and LSE. The retrieval results of WVC were obtained by changing the LST error (−1.5 K–1.5 K) and LSE error (−0.02–0.02). Δ W V C represents the difference between real WVC and retrieved WVC. In order to understand the sensitivity of prior knowledge to WVC, RMSE was selected as an evaluation index to analyze the changes of the two parameters. Figure 12a is the WVC error when we change the LST at the same step size (0.5 K). Figure 12b is the WVC error when we change the LSE at the same step size (0.005); here we made the sensitivity analysis for band 29, 31, 32 and change band 29/31/32 synchronously, with the same step size.
As shown from Figure 12 that the highest accuracy of the retrieval WVC is in the real LST or LSE, WVC error changes between 0.07–0.72 g/cm2 when the LST error change is between −1.5 K to 1.5 K. Compared with the real LST, the larger the deviation, the larger the error of WVC retrieval. The error of WVC retrieval was largest (RMSE is about 0.72 g/cm2) when the LST error was about −1.5 K, underestimating LST can lead to even greater errors, and the retrieval error of WVC increased suddenly when the absolute value of LST error was greater than 1 K. Figure 12b is WVC error when we change the band 29, 31, 32 emissivity (LSE) at the same error step size, and the error of WVC retrieval was the largest (RMSE is about 0.28 g/cm2) when the LSE error was about −0.02. WVC error changed between 0.07–0.28 g/cm2 when the LSE error changed between −0.02 to 0.02. The error of WVC retrieval was the largest (RMSE is about 0.28 g/cm2) when the LSE error was about −0.02. Similarly, the larger the LSE deviation, the greater the WVC retrieval error, and underestimating LSE can lead to larger errors. In order to evaluate the sensitivity of the algorithm to LSE, we also analyzed the change of emissivity error of each band. When the emissivity of band 29 changed between −0.02 and 0.02, the WVC error changed between 0.07 and 0.26 g/cm2. When the emissivity of band 31 changed between −0.02 and 0.02, the WVC error changed between 0.07 and 0.24 g/cm2. When the emissivity of band 32 changed between −0.02 and 0.02, the WVC error changed between 0.07 and 0.20 g/cm2. In general, it can be shown from Figure 12 that when the LST error was within ±1 K and the emissivity was within ±0.01, the retrieval error of WVC did not change much. The emissivity of the thermal infrared band has little change, so the algorithm is not very sensitive to an error of prior knowledge under the condition of large scale pixels. Deep learning coupled physics and statistical methods requires a certain amount of time in obtaining representative solutions from physical and statistical methods, as well as in deep learning training. However, once a deep learning model is trained, its speed is very fast and it has significant advantages over traditional methods. The biggest advantage of this method is that we can establish different training and testing databases for different regions and seasons to further improve accuracy.

6. Conclusions

In this study, a new fully coupled paradigm is developed for robust retrieval of WVC from thermal infrared remote sensing data. The retrieval paradigm proposed by us is that a complete set of closed equations can be constructed between the input parameters and output parameters of deep learning in theory, which can be determined by physical logical reasoning through the radiation energy balance equation. If there is a strong correlation between input parameters and output parameters, deep learning can be directly used for retrieval. If there is a weak correlation between input parameters and output parameters, it is necessary to add prior knowledge to improve the retrieval accuracy of the output parameters. If we know a large number of representative solutions of the physical method, we can use deep learning to obtain the curve function of the solution through training. Physical model simulations provide us with the opportunity to obtain solutions for physical methods, so deep learning can replicate physical methods. Physical methods cannot describe all situations, and we can supplement solutions involving statistical methods with multi-source data.
The effectiveness of the PSP–DL was evaluated through simulations and experiments, and proven by obtaining satisfactory results. For the simulated data, the model with highest accuracy of the retrieved WVC yielded an MAE = 0.05 g/cm2, RMSE = 0.07 g/cm2, and R2 = 0.993. Prior knowledge was found to improve the retrieval accuracy and reduce the number of required infrared bands. After comparing different band combinations, band 27, 28, 31, and 32 can obtain the optimal result, and it was also found that, if the band combination contains two bands that are very sensitive to the integrated water vapor content, high-precision retrieval can be achieved without prior knowledge. Reliable GPS site data were used for verification of the PSP–DL method, resulting in an MAE = 0.23 g/cm2, RMSE = 0.27 g/cm2, and R2 = 0.921. The retrieval accuracy was slightly lower in very dry or wet environments, but this trend can be alleviated through the use of prior knowledge. In future applications, data availability, accuracy, and redundancy should be considered. Results from the case study of a part of North America showed that the WVC retrieved by the PSP–DL were consistent with the overall trends of two commonly-used WVC products.

Author Contributions

Methodology, Software, Validation, Formal analysis, Investigation, Data Curation, Writing—Original Draft, Writing—Review and Editing, R.M.; Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data Curation, Writing—Original Draft, Writing—Review, Editing, Project administration and Funding acquisition, K.M.; Software, Validation, Formal analysis, Investigation and Data Curation, J.S.; Resource, Formal analysis & Investigation, J.N. and S.M.B.; Resource and Editing, F.M.; Resource and Editing, G.D. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported by the Second Tibetan Plateau Scientific Expedition and Research Program (STEP)—“Dynamic monitoring and simulation of water cycle in Asian water tower area” (No. 2019QZKK0206), the National Key R&D Program of China (no. 2021YFD1500101), the Open Fund of the State Key Laboratory of Remote Sensing Science (no. OFSLRSS202201), Ningxia Science and Technology Department Flexible Introduction talent project (no. 2021RXTDLX14) and Fengyun Application Pioneering Project (no. FY-APP-2022.0205).

Data Availability Statement

Data openly available in a public repository.

Acknowledgments

The authors would like to thank the University Corporation for Atmospheric Research (UCAR) for providing Suominet Global Positioning System (GPS) site data, the NASA Earth Observing System Data and Information System for providing the MODIS data, and the ECMWF for providing the fifth-generation climate reanalysis data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Trenberth, K.E.; Fasullo, J.; Smith, L. Trends and variability in column-integrated atmospheric water vapor. Clim. Dyn. 2005, 24, 741–758. [Google Scholar] [CrossRef]
  2. Padmanabhan, S.; Reising, S.C.; Vivekanandan, J.; Iturbide-Sanchez, F. Retrieval of atmospheric water vapor density with fine spatial resolution using three-dimensional tomographic inversion of microwave brightness temperatures measured by a network of scanning compact radiometers. IEEE Trans. Geosci. Remote Sens. 2009, 47, 3708–3721. [Google Scholar] [CrossRef]
  3. Leng, P.; Song, X.N.; Li, Z.L.; Ma, J.W.; Zhou, F.C.; Li, S. Bare surface soil moisture retrieval from the synergistic use of optical and thermal infrared data. Int. J. Remote Sens. 2014, 35, 988–1003. [Google Scholar] [CrossRef]
  4. Martínez-Fernández, J.; González-Zamora, A.; Sánchez, N.; Gumuzzio, A.; Herrero-Jiménez, C.M. Satellite soil moisture for agricultural drought monitoring: Assessment of the SMOS derived soil water deficit index. Remote Sens. Environ. 2016, 177, 277–286. [Google Scholar] [CrossRef]
  5. Mikhaylov, A.; Moiseev, N.; Aleshin, K.; Burkhardt, T. Global climate change and greenhouse effect. Entrep. Sustain. Issues 2020, 7, 2897–2913. [Google Scholar] [CrossRef]
  6. Mao, K.B.; Zhang, C.Y.; Shi, J.C.; Wang, X.M.; Guo, Z.H.; Li, C.S.; Dong, L.X.; Wu, M.X.; Sun, R.J.; Wu, S.L.; et al. The paradigm theory and judgment conditions of geophysical parameter retrieval based on artificial intelligence. Smart Agric. 2023, 5, 161–171, (in Chinese with English abstract). [Google Scholar] [CrossRef]
  7. Mao, K.B.; Shen, X.Y.; Zuo, Z.Y. An advanced radiative transfer and neural network scheme and evaluation for estimating water vapor content from MODIS data. Atmosphere 2017, 8, 139. [Google Scholar] [CrossRef]
  8. Wang, J.H.; Zhang, L.Y.; Dai, A.; Van Hove, T.; Van Baelen, J. A near-global, 2-hourly data set of atmospheric precipitable water from ground-based GPS measurements. J. Geophys. Res. 2007, 112, D11107. [Google Scholar] [CrossRef]
  9. Ortiz de Galisteo, J.P.; Toledano, C.; Cachorro, V.; Torres, B. Improvement in PWV estimation from GPS due to the absolute calibration of antenna phase center variations. GPS Solut. 2010, 14, 389–395. [Google Scholar] [CrossRef]
  10. Liu, J.N.; Shi, C.; Huang, J.F.; Zheng, G.; Zhang, R.; Haase, J.S.; Lou, Y.D.; Zhang, W.X. The use of ground-based GPS precipitable water measurements over China to assess radiosonde and ERA-Interim moisture trends and errors from 1999 to 2015. J Clim. 2017, 30, 7643–7667. [Google Scholar] [CrossRef]
  11. Rocken, C.; Ware, R.; Van Hove, T.; Solheim, F.; Alber, C.; Johnson, J.; Bevis, M.; Businger, S. Sensing atmospheric water vapor with the global positioning system. Geophys. Res. Lett. 1993, 20, 2631–2634. [Google Scholar] [CrossRef]
  12. Sun, D.L.; Yu, Y.Y.; Fang, L.; Liu, Y.L. Toward an operational land surface temperature algorithm for GOES. J. Appl. Meteorol. Climatol. 2013, 52, 1974–1986. [Google Scholar] [CrossRef]
  13. Bevis, M.; Businger, S.; Chiswell, S.; Herring, T.A.; Anthes, R.A.; Rocken, C.; Ware, R.H. GPS meteorology: Mapping zenith wet delays onto precipitable water. J. Appl. Meteorol. 1994, 33, 379–386. [Google Scholar] [CrossRef]
  14. Franc, G.B.; Cracknell, A.P. Retrieval of land and sea surface temperature using NOAA-11 AVHRR data in north-eastern Brazil. Int. J. Remote Sens. 1994, 15, 1695–1712. [Google Scholar] [CrossRef]
  15. Du, J.; Kimball, J.S.; Jones, L.A. Satellite microwave retrieval of total precipitable water vapor and surface air temperature over land from AMSR2. IEEE Trans. Geosci. Remote Sens. 2015, 53, 2520–2531. [Google Scholar] [CrossRef]
  16. Zhang, Y.L.; Cai, C.S.; Chen, B.Y.; Dai, W.J. Consistency evaluation of precipitable water vapor derived from ERA5, ERA-Interim, GNSS, and radiosondes over China. Radio Sci. 2019, 54, 561–571. [Google Scholar] [CrossRef]
  17. Wang, Y.Q.; Shi, J.C.; Wang, H.; Feng, W.L.; Wang, Y.J. Physical statistical algorithm for precipitable water vapor inversion on land surface based on multi-source remotely sensed data. Sci. China Earth Sci. 2015, 58, 2340–2352. [Google Scholar] [CrossRef]
  18. Frouin, R.; Deschamps, P.; Lecomte, P. Determination from space of atmospheric total water vapor amounts by differential absorption near 940 nm: Theory and airborne verification. J. Appl. Meteorol. Climatol. 1990, 29, 448–460. [Google Scholar] [CrossRef]
  19. Kaufman, Y.J.; Gao, B.C. Remote sensing of water vapor in the near IR from EOS/MODIS. IEEE Trans. Geosci. Remote Sens. 1992, 30, 871–884. [Google Scholar] [CrossRef]
  20. Liu, H.L.; Tang, S.H.; Hu, J.Y.; Zhang, S.L.; Deng, X.B. An improved physical split-window algorithm for precipitable water vapor retrieval exploiting the water vapor channel observations. Remote Sens. Environ. 2017, 194, 366–378. [Google Scholar] [CrossRef]
  21. Mao, K.B.; Shi, J.C.; Li, Z.L.; Tang, H.J. An RM-NN algorithm for retrieving land surface temperature and emissivity from EOS/MODIS data. J. Geophys. Res. 2007, 112, 5455–5467. [Google Scholar] [CrossRef]
  22. Wang, J.R.; Wilheit, T.T.; Chang, L.A. Retrieval of total precipitable water using radiometric measurements near 92 and 183 GHz. J. Appl. Meteorol. 1989, 28, 146–154. [Google Scholar] [CrossRef]
  23. He, J.; Liu, Z.Z. Refining MODIS NIR atmospheric water vapor retrieval algorithm using GPS-derived water vapor data. IEEE Trans. Geosci. Remote Sens. 2021, 59, 3682–3694. [Google Scholar] [CrossRef]
  24. Tan, H.B.; Mao, J.T.; Chen, H.H.; Chan, P.W.; Wu, D.; Li, F.; Deng, T. A study of a retrieval method for temperature and humidity profiles from microwave radiometer observations based on principal component analysis and stepwise regression. J. Atmos. Ocean. Technol. 2011, 28, 378–389. [Google Scholar] [CrossRef]
  25. Rieder, M.J.; Kirchengast, G. Water vapor profiling with SSM/T-2 data employing an extended optimal estimation approach. Microw. Remote Sens. Atmos. Environ. 1998, 3503, 153–164. [Google Scholar] [CrossRef]
  26. Rieder, M.J.; Kirchengast, G. Physical-statistical retrieval of water vapor profiles using SSM/T-2 sounder data. Geophys. Res. Lett. 1999, 26, 1397–1400. [Google Scholar] [CrossRef]
  27. Varade, D.; Dikshit, O. Improved assessment of atmospheric water vapor content in the himalayan regions around the Kullu valley in India using Landsat-8 data. Water Resour. Res. 2019, 55, 462–475. [Google Scholar] [CrossRef]
  28. Zhang, X.Y.; Pang, J. A comparison between atmospheric water vapour content retrieval methods using MSG2-SEVIRI Thermal-IR data. Int. J. Remote Sens. 2015, 36, 5075–5086. [Google Scholar] [CrossRef]
  29. Dalu, G. Satellite remote sensing of atmospheric water vapour. Int. J. Remote Sens. 2007, 7, 1089–1097. [Google Scholar] [CrossRef]
  30. Li, Z.L.; Jia, L.; Su, Z.; Wan, Z.; Zhang, R. A new approach for retrieving precipitable water from ATSR2 split-window channel data over land area. Int. J. Remote Sens. 2003, 24, 5095–5117. [Google Scholar] [CrossRef]
  31. Ren, H.Z.; Du, C.; Qin, Q.M.; Liu, R.Y.; Meng, J.J.; Li, J. Atmospheric Water Vapor Retrieval from Landsat 8 and Its Validation. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium (Igarss), Quebec City, QC, Canada, 13–18 July 2014; pp. 3045–3048. [Google Scholar]
  32. Andersen, H.S. Estimation of precipitable water vapour from NOAA-AVHRR data during the Hapex Sahel experiment. Int. J. Remote Sens. 2007, 17, 2783–2801. [Google Scholar] [CrossRef]
  33. Hu, J.; Tang, S.; Liu, H.; Min, M. An Operational Precipitable Water Vapor Retrieval Algorithm for Fengyun-2F/VLSSR Using a Modified Three-Band Physical Split-Window Method. J. Meteorol. Res. 2019, 33, 276–288. [Google Scholar] [CrossRef]
  34. Gangwar, R.K.; Mathur, A.K.; Gohil, B.S.; Basu, S. Neural network based retrieval of atmospheric temperature profileusing AMSU-A observations. Int. J. Atmos. Sci. 2014, 2014, 763060. [Google Scholar] [CrossRef]
  35. Mao, K.B.; Li, H.T.; Hu, D.Y.; Wang, J.; Tang, H.J. Estimation of water vapor content in near-infrared bands around 1 μm from modis data by using RM-NN. Opt. Express 2010, 18, 9542–9554. [Google Scholar] [CrossRef]
  36. Ma, X.W.; Yao, Y.B.; Zhang, B.; Qin, Y.X.; Zhang, Q.; Zhu, H. An improved MODIS NIR PWV retrieval algorithm based on an artificial neural network considering the land-cover types. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5622412. [Google Scholar] [CrossRef]
  37. Laurent, M.; Thome, K.; Cattrall, C. MODTRAN-based retrieval of column water vapor from solar transmittance. In Proceedings of the Optical Science and Technology, the SPIE 49th Annual Meeting, Denver, CO, USA, 2–6 August 2004; Volume 5546, pp. 115–125. [Google Scholar] [CrossRef]
  38. Wolfe, R.E.; Roy, D.P.; Vermote, E. MODIS land data storage, gridding, and compositing methodology: Level 2 grid. IEEE Trans. Geosci. Remote Sens. 1998, 36, 1324–1338. [Google Scholar] [CrossRef]
  39. Justice, C.O.; Townshend, J.R.G.; Vermote, E.F.; Masuoka, E.; Wolfe, R.E.; Saleous, N.; Roy, D.P.; Morisette, J.T. An overview of MODIS land data processing and product status. Remote Sens. Environ. 2002, 83, 3–15. [Google Scholar] [CrossRef]
  40. Hulley, G.C.; Hook, S.J. Generating consistent land surface temperature and emissivity products between ASTER and MODIS data for earth science research. IEEE Trans. Geosci. Remote Sens. 2011, 49, 1304–1315. [Google Scholar] [CrossRef]
  41. Wang, M.; He, G.; Zhang, Z.; Wang, G.; Long, T. NDVI-based split-window algorithm for precipitable water vapour retrieval from Landsat-8 TIRS data over land area. Remote Sens. Lett. 2015, 6, 904–913. [Google Scholar] [CrossRef]
  42. Bennouna, Y.S.; Torres, B.; Cachorro, V.E.; de Galisteo, J.P.O.; Toledano, C. The evaluation of the integrated water vapour annual cycle over the Iberian Peninsula from EOS-MODIS against different ground-based techniques. Q. J. R. Meteorol. Soc. 2013, 139, 1935–1956. [Google Scholar] [CrossRef]
  43. Liu, H.L.; Tang, S.H.; Zhang, S.L.; Hu, J.Y. Evaluation of MODIS water vapour products over China using radiosonde data. Int. J. Remote Sens. 2015, 36, 680–690. [Google Scholar] [CrossRef]
  44. Merrikhpour, M.H.; Rahimzadegan, M. Improving the algorithm of extracting regional total precipitable water vapor over land from MODIS images. IEEE Trans. Geosci. Remote Sens. 2017, 55, 5889–5898. [Google Scholar] [CrossRef]
  45. Hersbach, H.; Bell, B.; Berrisford, P.; Hirahara, S.; Horányi, A.; Muñoz-Sabater, J.; Nicolas, J.; Peubey, C.; Radu, R.; Schepers, D.; et al. The ERA5 global reanalysis. Q. J. R. Meteorol. Soc. 2020, 146, 1999–2049. [Google Scholar] [CrossRef]
  46. Bai, J.N.; Lou, Y.D.; Zhang, W.X.; Zhou, Y.Z.; Zhang, Z.Y.; Shi, C. Assessment and calibration of MODIS precipitable water vapor products based on GPS network over China. Atmos. Res. 2021, 254, 105504–105514. [Google Scholar] [CrossRef]
  47. Ware, R.H.; Fulker, D.W.; Stein, S.A.; Anderson, D.N.; Avery, S.K.; Clark, R.D.; Droegemeier, K.K.; Kuettner, J.P.; Minster, J.B.; Sorooshian, S. SuomiNet: A real–time national GPS network for atmospheric research and education. Bull. Am. Meteorol. Soc. 2000, 81, 677–694. [Google Scholar] [CrossRef]
  48. Qin, Z.H.; Dall’Olmo, G.; Karnieli, A.; Berliner, P. Derivation of split window algorithm and its sensitivity analysis for retrieving land surface temperature from NOAA-advanced very high resolution radiometer data. J. Geophys. Res. Atmos. 2001, 106, 22655–22670. [Google Scholar] [CrossRef]
  49. Mao, K.B.; Qin, Z.H.; Shi, J.C.; Gong, P. A practical split-window algorithm for retrieving land-surface temperature from MODIS data. Int. J. Remote Sens. 2007, 26, 3181–3204. [Google Scholar] [CrossRef]
  50. Abrahart, R.J.; See, L.M. Neural network modelling of non-linear hydrological relationships. Hydrol. Earth Syst. Sci. 2007, 11, 1563–1579. [Google Scholar] [CrossRef]
  51. Wang, H.; Mao, K.B.; Yuan, Z.J.; Shi, J.C.; Cao, M.M.; Qin, Z.H.; Duan, S.B.; Tang, B.H. A method for land surface temperature retrieval based on model-data-knowledge-driven and deep learning. Remote Sens. Environ. 2021, 265, 112665–112684. [Google Scholar] [CrossRef]
  52. Tzeng, Y.C.; Wen-Liang, K.; Fung, A.K. A dynamic learning neural network for remote sensing applications. IEEE Trans. Geosci. Remote Sens. 1994, 32, 1096–1102. [Google Scholar] [CrossRef]
  53. Liu, Z.Z.; Li, M.; Zhong, W.K.; Wong, M.S. An approach to evaluate the absolute accuracy of WVR water vapor measurements inferred from multiple water vapor techniques. J. Geodyn. 2013, 72, 86–94. [Google Scholar] [CrossRef]
Figure 1. A novel physics-statistical coupled deep learning paradigm for retrieving integrated water vapor content.
Figure 1. A novel physics-statistical coupled deep learning paradigm for retrieving integrated water vapor content.
Remotesensing 15 04250 g001
Figure 2. Simplified diagram of radiative transfer modeling relating LST, LSE and WVC.
Figure 2. Simplified diagram of radiative transfer modeling relating LST, LSE and WVC.
Remotesensing 15 04250 g002
Figure 3. DNN and schematic of neural network operation. ( n is the number of input parameters, x i , j are the input parameters of the neural network, and WVC is the output parameter. w i , j is the weight of each neuron and σ is the bias. a i , j is the parameter input to the next layer of neurons after the activation function of the previous layer of neurons, indicates multiplication).
Figure 3. DNN and schematic of neural network operation. ( n is the number of input parameters, x i , j are the input parameters of the neural network, and WVC is the output parameter. w i , j is the weight of each neuron and σ is the bias. a i , j is the parameter input to the next layer of neurons after the activation function of the previous layer of neurons, indicates multiplication).
Remotesensing 15 04250 g003
Figure 4. Boxplot of absolute error of retrieval result. The left side of the connecting line shows the errors from model 1, while the right side shows the errors from model 2.
Figure 4. Boxplot of absolute error of retrieval result. The left side of the connecting line shows the errors from model 1, while the right side shows the errors from model 2.
Remotesensing 15 04250 g004
Figure 5. Retrieval results of PSP–DL. WVCr shows retrieval results, WVCt is the MODTRAN simulation data. (ac) are scatter plots of model 1 retrieval results and simulation data, (a) is the combination of prior knowledge and bands 27, 29, 31 and 32, (b) is the combination of prior knowledge and bands 28, 29, 31 and 32, (c) is the combination of prior knowledge and bands 27, 28, 31 and 32. (a′c′) are scatter plots of model 1 retrieval results and simulation data, (a′) is the combination of bands 27, 29, 31 and 32, (b′) is the combination of bands 28, 29, 31 and 32, (c′) is the combination of bands 27, 28, 31 and 32, (d) is the error distribution of simulated data and retrieval data.
Figure 5. Retrieval results of PSP–DL. WVCr shows retrieval results, WVCt is the MODTRAN simulation data. (ac) are scatter plots of model 1 retrieval results and simulation data, (a) is the combination of prior knowledge and bands 27, 29, 31 and 32, (b) is the combination of prior knowledge and bands 28, 29, 31 and 32, (c) is the combination of prior knowledge and bands 27, 28, 31 and 32. (a′c′) are scatter plots of model 1 retrieval results and simulation data, (a′) is the combination of bands 27, 29, 31 and 32, (b′) is the combination of bands 28, 29, 31 and 32, (c′) is the combination of bands 27, 28, 31 and 32, (d) is the error distribution of simulated data and retrieval data.
Remotesensing 15 04250 g005
Figure 6. A case study of southern North America (The United States and Mexico).
Figure 6. A case study of southern North America (The United States and Mexico).
Remotesensing 15 04250 g006
Figure 7. WVC (g/cm2) retrieved by PSP–DL method. Integrated water vapor content products: (A) MODIS-VNIR, (B) WVC retrieved from MODIS bands 27, 28, 31 and 32, (C) WVC retrieved from MODIS bands 27, 28, 31 and 32 and MODIS LST and LSE products (MOD11). The white areas indicate invalid values.
Figure 7. WVC (g/cm2) retrieved by PSP–DL method. Integrated water vapor content products: (A) MODIS-VNIR, (B) WVC retrieved from MODIS bands 27, 28, 31 and 32, (C) WVC retrieved from MODIS bands 27, 28, 31 and 32 and MODIS LST and LSE products (MOD11). The white areas indicate invalid values.
Remotesensing 15 04250 g007
Figure 8. WVC (g/cm2) retrieved by PSP–DL method. Integrated water vapor content products: (A) MODIS-TIR, (B) WVC retrieved from MODIS bands 27, 28, 31 and 32, (C) WVC retrieved from MODIS bands 27, 28, 31 and 32 and MODIS LST and LSE products (MOD11). The white areas indicate invalid values.
Figure 8. WVC (g/cm2) retrieved by PSP–DL method. Integrated water vapor content products: (A) MODIS-TIR, (B) WVC retrieved from MODIS bands 27, 28, 31 and 32, (C) WVC retrieved from MODIS bands 27, 28, 31 and 32 and MODIS LST and LSE products (MOD11). The white areas indicate invalid values.
Remotesensing 15 04250 g008
Figure 9. Differences (g/cm2) between MODIS-VNIR products and WVC retrieved using PSP–DL method. (A) the difference between “MODIS-VNIR” and “4BTs”, (B) the difference between “MODIS-VNIR” and “LST and LSE + 4BTs”.
Figure 9. Differences (g/cm2) between MODIS-VNIR products and WVC retrieved using PSP–DL method. (A) the difference between “MODIS-VNIR” and “4BTs”, (B) the difference between “MODIS-VNIR” and “LST and LSE + 4BTs”.
Remotesensing 15 04250 g009
Figure 10. Differences (g/cm2) between MODIS-TIR products and WVC retrieved using PSP–DL method. (A) the difference between “MODIS-TIR” and “4BTs”, (B) the difference between “MODIS-TIR” and “LST and LSE + 4BTs”. WVCr is the retrieval result of PSP–DL.
Figure 10. Differences (g/cm2) between MODIS-TIR products and WVC retrieved using PSP–DL method. (A) the difference between “MODIS-TIR” and “4BTs”, (B) the difference between “MODIS-TIR” and “LST and LSE + 4BTs”. WVCr is the retrieval result of PSP–DL.
Remotesensing 15 04250 g010
Figure 11. Comparison between WVC retrieval by PSP–DL method and GPS data.
Figure 11. Comparison between WVC retrieval by PSP–DL method and GPS data.
Remotesensing 15 04250 g011
Figure 12. The sensitivity analysis of parameters. (a) change of WVC error with the change of LST error, (b) change of WVC error with the change of LSE error.
Figure 12. The sensitivity analysis of parameters. (a) change of WVC error with the change of LST error, (b) change of WVC error with the change of LSE error.
Remotesensing 15 04250 g012
Table 1. Datasets used in this study.
Table 1. Datasets used in this study.
VariableDataset(s)Data Source
Brightness temperature (BT)MOD/MYD021KMhttps://ladsweb.modaps.eosdis.nasa.gov/search/ (accessed: 20 November 2021)
Land surface temperature (LST)MOD/MYD11 L2
Land surface emissivity (LSE)MOD/MYD11C1
Integrated water vapor content (WVC)MOD/MYD05https://ladsweb.modaps.eosdis.nasa.gov/search/ (accessed: 10 December 2021)
ERA5https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-single-levels?tab=form (accessed: 5 January 2022)
Suomi Nethttps://www.cosmic.ucar.edu/what-we-do/suominet-weather-precipitation-data, (accessed: 5 January 2021)
Table 2. Band combinations used for retrieval of WVC.
Table 2. Band combinations used for retrieval of WVC.
ModelSituationVariables
1Use prior knowledgeBTs in thermal infrared band 27/28/29/31/32, LST and LSE of bands 29, 31, 32
2No prior knowledge usedBTs in thermal infrared band 27/28/29/31/32
Table 3. WVC retrieval results using LST and LSE and bands 27, 29, 31, and 32.
Table 3. WVC retrieval results using LST and LSE and bands 27, 29, 31, and 32.
LST and LSE + 4BTs (from 27, 29, 31, 32) → WVC
Nodes400500600700800
Layers MAERMSER2MAERMSER2MAERMSER2MAERMSER2MAERMSER2
30.110.150.9800.180.240.9490.110.150.9790.110.140.9810.100.130.985
40.100.120.9860.120.150.9780.090.120.9870.080.110.9890.090.110.988
50.110.150.9800.110.150.9790.110.140.9820.080.100.9900.110.140.981
60.120.160.9770.120.160.9770.100.120.9860.080.100.9900.080.100.991
70.140.20.9640.120.160.9770.120.160.9780.110.140.9810.090.120.988
80.090.120.9870.100.130.9850.110.140.9820.090.120.9880.100.130.985
90.120.160.9750.130.180.9700.090.120.9870.090.120.9860.090.130.985
100.170.260.9400.100.130.9850.100.140.9830.080.110.9890.070.100.991
Table 4. WVC retrieval results using LST and LSE and bands 28, 29, 31, and 32.
Table 4. WVC retrieval results using LST and LSE and bands 28, 29, 31, and 32.
LST and LSE + 4BTs (from 28, 29, 31, 32) → WVC
Nodes400500600700800
Layers MAERMSER2MAERMSER2MAERMSER2MAERMSER2MAERMSER2
30.120.150.9780.150.200.9640.110.150.9790.120.160.9770.120.150.978
40.130.170.9740.130.170.9740.140.190.9690.100.130.9850.120.150.979
50.130.170.9720.130.170.9720.100.130.9830.140.190.9630.110.150.978
60.140.180.9660.110.140.9800.090.120.9870.110.140.9810.110.140.981
70.240.320.9060.130.170.9740.130.170.9740.140.180.9670.110.140.981
80.130.180.9700.110.140.9780.130.180.9720.130.160.9730.110.150.978
90.170.230.9510.150.200.9620.110.140.9820.110.140.9800.090.120.984
100.110.140.9800.110.140.9800.120.160.9780.100.130.9830.100.130.983
Table 5. WVC retrieval results using LST and LSE and bands 27, 28, 31, and 32.
Table 5. WVC retrieval results using LST and LSE and bands 27, 28, 31, and 32.
LST and LSE + 4BTs (from 27, 28, 31, 32) → WVC
Nodes400500600700800
Layers MAERMSER2MAERMSER2MAERMSER2MAERMSER2MAERMSER2
30.170.250.9440.120.170.9720.090.120.9860.090.130.9850.070.100.990
40.110.160.9600.140.230.9690.080.120.9860.080.110.9820.060.090.989
50.070.100.9790.160.300.9380.090.140.9850.070.120.9840.080.120.986
60.080.110.9870.120.200.9520.080.110.9860.070.200.9750.060.080.988
70.100.220.9550.110.310.9520.070.100.9890.090.160.9700.050.070.993
80.130.320.9400.100.400.9080.090.120.9870.070.100.9810.060.110.989
90.110.160.9040.120.180.9390.060.090.9880.050.070.9900.050.080.990
100.120.180.9660.070.100.9750.050.110.9860.060.090.9920.050.070.994
Table 6. WVC retrieval results using bands 27, 29, 31, and 32.
Table 6. WVC retrieval results using bands 27, 29, 31, and 32.
4BTs (from 27, 29, 31, and 32) → WVC
Nodes400500600700800
Layers MAERMSER2MAERMSER2MAERMSER2MAERMSER2MAERMSER2
30.190.250.9430.180.240.9480.170.230.9520.170.230.9520.170.230.950
40.190.250.9410.170.220.9530.170.230.9500.170.230.950.170.230.952
50.170.230.9500.200.290.9210.200.280.9260.180.250.9420.170.220.953
60.170.230.9510.170.230.9500.160.220.9530.180.250.9410.170.230.950
70.170.240.9460.170.230.9480.190.270.9340.180.240.9470.180.240.945
80.190.250.9420.180.230.9490.180.250.9390.170.230.9500.170.230.952
90.190.250.9410.160.220.9540.170.220.9530.170.240.9470.170.240.944
100.190.260.9360.170.230.9490.180.230.9490.180.240.9450.180.250.944
Table 7. WVC retrieval results using bands 28, 29, 31, and 32.
Table 7. WVC retrieval results using bands 28, 29, 31, and 32.
4BTs (from 28, 29, 31, 32) → WVC
Nodes400500600700800
Layers MAERMSER2MAERMSER2MAERMSER2MAERMSER2MAERMSER2
30.220.300.9180.210.290.9240.200.280.9270.210.280.9250.210.280.927
40.210.280.9260.220.290.9210.210.280.9260.200.280.9290.200.280.927
50.200.280.9280.220.290.9210.200.270.9300.200.280.9290.210.280.925
60.210.280.9260.200.280.9280.200.280.9270.200.280.9290.210.300.917
70.200.270.9310.210.290.9240.200.280.9280.200.270.9310.210.280.926
80.250.330.8960.200.280.9270.210.290.9230.210.280.9260.210.280.926
90.220.290.9200.200.280.9280.230.320.9070.210.280.9250.210.280.925
100.210.280.9260.210.280.9280.210.280.9280.220.290.9210.200.270.930
Table 8. WVC retrieval results using bands 27, 28, 31, and 32.
Table 8. WVC retrieval results using bands 27, 28, 31, and 32.
4BTs (from 27, 28, 31, and 32) → WVC
Nodes400500600700800
Layers MAERMSER2MAERMSER2MAERMSER2MAERMSER2MAERMSER2
30.120.170.9740.080.140.9810.080.110.9890.060.080.9940.060.130.985
40.090.140.9790.080.210.9740.090.340.9660.070.180.9730.050.220.980
50.140.310.9520.050.070.9620.060.170.9400.070.140.9790.060.140.974
60.060.090.9180.060.080.9970.080.410.9150.050.100.9860.070.220.976
70.100.530.8210.070.210.9670.060.150.9530.050.100.9890.181.330.450
80.070.180.8880.181.170.4130.080.310.9460.060.160.9840.110.600.548
90.090.240.9740.080.220.4200.060.150.9510.181.070.4720.070.150.814
100.050.070.9470.100.280.9360.070.180.9860.050.070.5130.050.180.978
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mei, R.; Mao, K.; Shi, J.; Nielson, J.; Bateni, S.M.; Meng, F.; Du, G. A Novel Physics-Statistical Coupled Paradigm for Retrieving Integrated Water Vapor Content Based on Artificial Intelligence. Remote Sens. 2023, 15, 4250. https://doi.org/10.3390/rs15174250

AMA Style

Mei R, Mao K, Shi J, Nielson J, Bateni SM, Meng F, Du G. A Novel Physics-Statistical Coupled Paradigm for Retrieving Integrated Water Vapor Content Based on Artificial Intelligence. Remote Sensing. 2023; 15(17):4250. https://doi.org/10.3390/rs15174250

Chicago/Turabian Style

Mei, Ruyu, Kebiao Mao, Jiancheng Shi, Jeffrey Nielson, Sayed M. Bateni, Fei Meng, and Guoming Du. 2023. "A Novel Physics-Statistical Coupled Paradigm for Retrieving Integrated Water Vapor Content Based on Artificial Intelligence" Remote Sensing 15, no. 17: 4250. https://doi.org/10.3390/rs15174250

APA Style

Mei, R., Mao, K., Shi, J., Nielson, J., Bateni, S. M., Meng, F., & Du, G. (2023). A Novel Physics-Statistical Coupled Paradigm for Retrieving Integrated Water Vapor Content Based on Artificial Intelligence. Remote Sensing, 15(17), 4250. https://doi.org/10.3390/rs15174250

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop