1. Introduction
By the end of the 20th century, direct assimilation of satellite radiance data into variational assimilation systems had begun to help with the problem of insufficient observational data, thus greatly improving the accuracy of numerical forecasts. In particular, the accuracy of forecasts in the southern hemisphere, where conventional observations are scarce, was rapidly improved to be consistent with that in the northern hemisphere [
1]. The contribution of satellite observations in operational data assimilation systems is increasing; for example, the proportion of satellite information in GRAPES (Global/Regional Assimilation and Prediction System)—the numerical weather prediction system of the China Meteorological Administration—is more than 70%, and that of the European Centre for Medium-Range Weather Forecasts (ECMWF) exceeds 90% [
2]. Amongst all the satellite instruments currently in operation, AMSU-A (Advanced Microwave Sounding Unit-A) is notable for its ability to retrieve information on the vertical distribution of atmospheric conditions and contributes significantly towards reducing global NWP forecast errors; additionally, it has also been demonstrated to be beneficial in many ways to regional forecasting systems [
3,
4].
Launched with the polar-orbiting satellites NOAA15-19 and Metop-A\B\C, AMSU-A is a passive microwave remote-sensing instrument that detects information on the state of the earth–atmosphere system by receiving upward radiance. Fifteen channels are available in AMSU-A, including three window channels (23.8, 31.4, and 89.0 GHz) for detecting clouds and rain and providing information on surface temperature and emissivity. There are also twelve oxygen absorption channels distributed at 50–60 GHz for detecting atmospheric temperature profiles from the troposphere to the stratosphere. Many studies have demonstrated that the assimilation of AMSU-A observations can significantly improve the accuracy of models in forecasting severe weather [
5,
6,
7]. The long-term and continuous atmospheric temperature profile observations of AMSU-A also make it important for studying atmospheric temperature under climate change [
8,
9,
10].
The brightness temperature (BT) observed by AMSU-A is not a model variable, so assimilation of AMSU-A observations requires a radiative transfer model (RTM) for variable conversion. According to the theory of radiative transfer, the surface state and hydrometeor particles have significant effects on the observed BT of AMSU-A. In the case of cloud-contaminated fields of view (FOVs), not only does the RTM need to input more complex information about the phase, size, and number of particles in the clouds, but, owing to the limitations of the model’s cloud and rain microphysical processes and nonlinear radiation processes, the simulated BT will deviate considerably from the observed BT [
11]. If cloud-contaminated observations are assimilated without any specific treatment, there will be a negative impact on the forecast [
12]. Over the past few decades, many researchers have developed various methods to detect cloud-contaminated observations over the ocean and subsequently eliminate them, thereby only assimilating clear-sky observations in what is referred to as ‘clear-sky assimilation’.
The most common cloud detection method is based on the deviation between the observed and simulated BT (O−B) of AMSU-A channels. For instance, the |O−B| value of channel 4 being greater than 0.7 K was used to detect and reject scenes that were contaminated by cloud and/or precipitation in the ECMWF model [
13]. In addition, the FOVs are considered to be cloud-contaminated when the cloud liquid water path (LWP), which is retrieved from the window channels 1 and 2 of AMSU-A, is greater than a specific threshold [
14]. Zou et al. [
15] developed a one-stream cloud detection method by merging the LWP and ice water path (IWP) retrieved from window channels 1 and 2 of the Microwave Humidity Sounder (MHS). When the LWP or IWP is larger than the threshold, the FOV is considered to be cloud-contaminated, and the IWP captures some of the ice clouds missed by the LWP [
16]. The scattering index is defined by a linear regression model of channel 15 and channels 1–3 of AMSU-A, used by the AAPP (AVHRR (Advanced Very High Resolution Radiometer) Pre-processing Package) to detect cloud [
17]. Aires et al. [
18] used the MSG-SEVIRI (Meteosat Second Generation–Spinning Enhanced Visible and Infrared Imager) cloud product as a reference to train AMSU-A/B observations with a neural network algorithm and proposed a land and ocean cloud classification method. Cloud detection methods based on channel |O−B| and retrieved cloud water variables require ancillary data such as model output data. In contrast, neural network algorithms rely on a vast number of accurately labeled datasets to train microwave information, which is prone to overfitting and increases the computational burden. In recent years, researchers have devoted themselves to the development of all-sky assimilation, which assimilates thin clouds and non-precipitating-cloud-contaminated observations over the ocean by introducing a cloud-related variable into the model and resetting the background error of cloudy observations. The method has yielded positive results and is now being used operationally [
19,
20,
21,
22,
23]. However, regardless of whether clear-sky or all-sky assimilation methods are employed, accurate identification of clear-sky or cloudy-sky observations is a prerequisite.
The spatial and temporal variability of surface parameters, such as surface temperature and surface emissivity, are much stronger over land than ocean, and soil composition, moisture, and roughness all affect surface emissivity [
24]. Surface parameters are difficult to measure accurately, which leads to more difficulties and challenges in assimilating AMSU-A observations over land than ocean, even with clear-sky assimilation [
25]. To improve the accuracy of model-simulated BT over land, various surface temperatures and surface emissivity estimation methods have been proposed, and these methods have yielded significant improvements in the clear-sky assimilation of AMSU-A terrestrial observations [
26,
27,
28]. However, in most of these methods, cloud product retrievals from other space-based instruments (e.g., MODIS (Moderate Resolution Imaging Spectroradiometer) cloud masks) are used to detect clouds, because most cloud detection methods over the ocean, such as LWP retrieval, are not applicable over land owing to the significant effect of surface emissivity. When using those cloud products, spatiotemporal interpolation is required between AMSU-A and other instruments, which is neither economical nor convenient for operational applications. While there are only a few empirical solutions available for terrestrial cloud detection based on the AMSU-A instrument itself; the accuracy of these methods is heavily dependent on the accuracy of the ancillary data. For example, in the GSI (Gridpoint Statistical Interpolation) assimilation system, terrestrial cloud detection is based on empirical scattering indices and precipitation indices [
22]; whereas, GRAPES directly excludes the observations of the low-peaking channels 1–4 and 15, and mid-peaking channels 5 and 6, over land. A large number of AMSU-A terrestrial observations are excluded or discarded, which results in wasted information. Therefore, in this work, we attempted to develop a new AMSU-A terrestrial cloud detection method and, based on it, we evaluated the bias characteristics of different channels affected by clouds and different surface types under clear-sky conditions, and prepared for assimilating the observations of AMSU-A mid- and low-peaking channels over land in GRAPES.
The paper is structured as follows: Following this introduction,
Section 2 introduces the datasets.
Section 3 describe the new cloud detection method.
Section 4 evaluates the effectiveness of the new cloud detection method and assesses the bias and standard deviation characteristics of different channels affected by clouds and different surface types under clear-sky conditions.
Section 5 provides a discussion and conclusion. Finally,
Section 6 gives a summary.
3. Methods
Figure 2 shows the spatial distribution of the O−B of AMSU-A channel 3 from NOAA-19 and MODIS cloud classification products in East Asia at 0600 UTC on 26 June 2019. The weighting function peak heights of channel 3 are located at the ground, which is the first temperature measurement channel of AMSU-A. To avoid the influence of topographic height, we chose to restrict the study to East Asia only. We used the Community Radiative Transfer Model (CRTM) developed by the Joint Center for Satellite Data Assimilation to simulate the AMSU-A BT. CRTM can provide fast, accurate satellite radiance simulations and Jacobian calculations at the top of the atmosphere. The model supports the simulation of sensor measurements covering wavelengths ranging from the visible through the microwave [
32]. We used the FNL (final analysis) data [
33] as the background field, and the NPOESS (National Polar-orbiting Operational Environmental Satellite System) dataset to determine the land-surface type of each FOV. As we did not input hydrometeor information into CRTM, we considered all-sky to be clear-sky. Comparing
Figure 2a,b, it can be seen that the O−B in the thick cloud areas showed significant negative values, such as in the convective cloud system from Lake Baikal to northeast China and the convective cloud system over the Korean Peninsula, as well as in the stratocumulus over the eastern coast of China (black dashed circle). In the clear-sky area, meanwhile, the absolute value of O−B was smaller. The observed radiance of channel 3 in the clear-sky area was primarily from the surface-emitted radiance; whereas, the radiance observed by satellites in deep cloud areas was basically the cloud-top radiance, which was significantly lower than the surface radiance. Even for clouds penetrated by ground radiation, the scattering and absorption of water and ice particles in the clouds leads to the radiance received by satellites being significantly lower than the simulated clear-sky radiance. Although there are many factors that lead to differences between observed and simulated BTs, most O−B values in cloud areas were negative, which proves that clouds have an important impact on O−B. If cloud and clear-sky data cannot be distinguished, it results in false assimilation effects.
Clouds have a significant effect on the BT observed by AMSU-A, but the response to clouds varies from channel to channel owing to frequency differences. AMSU-A window channels are sensitive to the presence of cloud and precipitation [
34]. A scatterplot of the BTs observed by AMSU-A channels 3 and 15 over East Asia is given in
Figure 3, where the circle colors indicate the matched simultaneous and the closest MODIS cloud classification results. It can be seen that the observed BT of channel 15 is higher than that of channel 3 in the clear-sky area. In the microwave region, Planck’s formula can be simplified to the Rayleigh–Jeans radiation law, given the frequency
ν and the thermodynamic temperature
T of a black body:
where
c is the speed of light and
k is the Boltzmann constant, such that the BT is proportional to the quadratic of the frequency. This approximate theory has an accuracy of better than 1% for an object at 300 K viewed at a frequency less than 125 GHz [
35]. In the clear-sky area, the AMSU-A observed radiance is mainly dependent on the radiance emitted from the surface; this can be simplified as the following Equation (2):
where
LClr(
ν,
θ) is the clear-sky upwelling radiance,
εsfc is the surface emission,
Tsfc is the surface temperature,
τs is the transmittance from the surface to the top of the atmosphere, and
B(
ν,
T) is the Planck function for a frequency
ν and temperature
T. Then, combined with Equation (1), the observed radiance ratio of the two channels in the same FOV is:
In the same FOV, the surface emissivity and atmospheric state are fixed, but the frequency of channel 15 is larger than that of channel 3, so the observed BT of channel 15 is warmer than that of channel 3, and the ratio of the BT of channel 15 to that of channel 3 is close to a constant value.
In cloudy sky, the relationship between the BTs of the two channels is more complicated. The cloud attenuates the BT of both channels, and the thicker and higher the clouds, the more pronounced their attenuation effect and the lower the observed BT of channels 3 and 15, with the lowest BT observed in deep convective clouds and cirrostratus. However, microwaves can penetrate some thin clouds, so the observed BT under cirrus and some cirrostrati is not distinguishable from the observed BT under clear sky. Many cloud-related factors will lead to a decrease in BT—for instance, the size and distribution characteristics of water and ice particles, as well as the shape of ice particles. Besides, there is a significant difference in the attenuation of cloud BT between the two channels, with channel 15 being more sensitive to clouds than channel 3, meaning the BT of channel 15 is more significantly reduced by clouds than that of channel 3. In
Figure 3, the BT of channel 15 is remarkably smaller than that of channel 3 in the deep convective cloud area.
Therefore, we can try to define a cloud index based on the different responses of these two channels to clouds. Qin and Zou [
36], based on MHS channel 2 being more sensitive to clouds than channel 1, used the standardized BT of channel 1 as the numerator and the BT of channel 2, which was adjusted to the same magnitude as the numerator, as the denominator to define a terrestrial cloud detection index. The index can detect mostly cloudy FOVs. Zhu et al. [
37] introduced this method to the Microwave Humidity Sounder II instrument onboard China’s FY-3C satellite, also achieving satisfactory cloud detection results. In this work, five low-peaking channels (channels 1–4 and 15) of AMSU-A were selected to define the cloud index:
where
Tb,i is the observed BT of the
ith channel of the five channels 1–4 and 15 of AMSU-A. The normalized brightness of channel 3 is used as the numerator, and the exponentiation-adjusted brightness of channel 15 is used as the denominator.
The spatial distribution of the numerator, denominator, and the AMSU-A cloud index at the same moment in time as in
Figure 2a is given in
Figure 4. Comparing with
Figure 2b, because channel 3 is less sensitive to clouds, the normalized BT of channel 3 therefore showed a larger positive value in the cloudy areas but a smaller value in clear sky. To further amplify the difference between cloudy and clear sky, we added the BT of the cloud-sensitive channel 15 and used the exponentiated BT of channel 15 as the denominator. The difference between the clear-sky and cloud-contaminated BT of channel 15 is amplified by the exponentiation, having been multiplied by a coefficient to adjust the magnitude to be comparable in size to the numerator. As the cloud attenuates the BT of channel 15 more significantly, the value of the denominator will thus be smaller in the cloud area, which ultimately gives the cloud index a large positive value in cloudy sky, while the value is smaller in clear sky, as shown in
Figure 4c.
Figure 5 presents scatterplots of the numerator (horizontal axis) and denominator (vertical axis) of the cloud index for two moments, 0600 UTC 26 June and 0600 UTC 28 June 2019, when NOAA-19 had more observations in East Asia, in which the colors represent the matched MODIS cloud classification products. It can be seen that the clear-sky observations are almost all clustered in the upper-left corner, while the cloudy-sky observations are scattered on the right-hand side. This corresponds to the previous analysis, where cloud-free observations have a smaller numerator and a larger denominator, and thus would be clustered in the upper-left corner, while cloudy observations are the opposite. There is a clear distinction between cloud and cloud-free observations in the scatterplots in
Figure 5a,b, and the slopes of the two parts of the data after fitting are significantly different. Based on this feature, the threshold for distinguishing between the cloud and cloud-free observations can be determined. Of course, some cloudy and clear-sky observations were incorrectly distinguished, which we improved upon below.
In order to ensure the stability of the results, one month of AMSU-A observations were used to determine the thresholds of
Aindex.
Figure 6 shows the fitted slope of the denominator and numerator of the
Aindex for different thresholds, in which the
Aindex was calculated from AMSU-A observations of NOAA-19 over land areas of East Asia from 0000 UTC 15 June to 1800 UTC 15 July 2019. As can be seen from the figure, the slope for data with the
Aindex bigger than the threshold (black curve) increased slowly with the threshold value increasing to 0.14, and then basically stayed the same. However, for data with the
Aindex less than the threshold (red curve), the slope increased rapidly before the threshold value reached 0.02, and then held steady from 0.02 to 0.1, after which it kept increasing and eventually got close to the black dotted curve. When the threshold value was less than 0.02, there were fewer clear-sky observations with the
Aindex less than the threshold, so the absolute value of the fitted slope was large and increased rapidly. When the threshold value was in the range from 0.02 to 0.1, the data included by respecting the
Aindex less than the threshold were mostly the same clear-sky observations, so the fitted slope stayed nearly the same. However, after the threshold value exceeded 0.1, the slope increased steadily and eventually got close to the slope of the observations in the cloudy area. This means that cloudy observations have been included. So,
Aindex = 0.1 can be used to distinguish between cloudy and clear-sky FOVs.
5. Discussion and Conclusions
The three-month analysis of cloud detection results reported in this paper validated the reliability of the new method, and the vast majority of cloud-contaminated FOVs could be detected. The new method only uses the observations, which helps to successfully avoid the influence of the model background field on the detection results, thus making this method promising for operational data assimilation. However, as with the old method, the FAR is slightly higher, and we will focus on solving this problem in the future.
The present work focused only on the summer season, and so we need to use more and different seasonal data to analyze the effect of atmospheric temperature on the indexes in subsequent studies. Of course, this method is still based on the principle that clouds have a significant impact on the BT of each channel of AMSU-A and MHS. In some cases, the impact of clouds on the BT is very weak. For example, the thin cirrus cloud at high altitude has little impact on microwave radiation, and the low surface temperature at high latitudes and on glacial surfaces also causes the surface radiance to be similar to that of clouds, which may influence the effectiveness of the new method. The effect of the method on detection in winter also needs more data to be fully evaluated. In future research, we need to quantitatively evaluate the characteristics of clouds that influence the BT in these cases through idealized experiments based on the RTM, so as to determine a more reasonable detection threshold and further improve the method. At present, we do not recommend applying the method in areas with low surface temperature—for example, at high latitudes north of 60° N in winter or areas covered by perennial glaciers.
In addition, due to data availability, we used MODIS cloud products to verify the method. The 3-h time difference between the cloud detection index and cloud products inevitably has an impact on the verification results. We believe that by using the better-matched AVHRR cloud product data carried by the same satellite as the verification data, evaluation of this new method will more reasonable, which is also a direction for our group’s work in the future. In addition, the effectiveness of the new cloud detection method still needs to be tested by assimilation experiments. Specifically, the effect of the method on assimilation needs to be judged in terms of actual assimilation results. Therefore, next, based on the new method, we intend to assimilate the clear-sky data of channels 5 and 6 over land areas in GRAPES to verify the improvements the new method can deliver in terms of achieving better forecast results.
6. Summary
Because of the strong surface emissivity and high spatial and temporal variability, the cloud detection of AMSU-A over land has been a challenge. In this work, based on the characteristics of AMSU-A and MHS channels, we developed a new terrestrial cloud detection method that relies only on the observations by merging the AMSU-A data and MHS data. Practical testing showed that the AMSU-A cloud index could detect most of the deep convective clouds, but missed the cirrus and some cirrostratus clouds. The addition of the matched MHS cloud index made up for the majority of clouds missed by the AMSU-A index. By comparing with the cloud classification product of MODIS, the cloud detection method after merging the information from both instruments could eliminate most of the cloudy observations.
The effectiveness and stability of the new cloud detection method were verified by collecting AMSU-A and MHS observations for three months. By referring to the MODIS cloud product, the POD, FAR, and HR of the three cloud detection methods were calculated, revealing that the new method performed the best. On average, the POD with the cloud FOVs of the new method could reach 83.85%; additionally, the new method was found to have a lower clear-sky (higher cloudy-sky) FAR than the MHS-index-only method, meaning fewer cloudy observations are missed by using data from both instruments.
After removing cloudy observations, the O−B of the low- and middle-peaking channels were found to be more in line with the normal distribution. Based on the accurate identification of the clear-sky observations, we also analyzed the O−B distribution characteristics of the AMSU-A low- and middle-peaking channel observations over land areas. Among the window channels, channels 1 and 15 had the largest bias and standard deviation in their simulated BT owing to the influence of clouds, which gradually decreased as the weighting function peak height of the channel increased. After removing the cloudy observations, the bias and standard deviation of O−B of the low- and middle-peaking channels of AMSU-A were found to reduce significantly; additionally, the bias of the O−B of channels 5 and 6 was within 1.0 K under clear-sky conditions, and standard deviation was around 1.0 K. The bias and standard deviation of the O−B for the middle- and lower-peaking channels also differ among vegetation types under clear sky. The bias of broadleaf forest was smaller than that of pine forest, but the observation error was slightly larger than that of pine forest; the bias of grassland is larger, but the error was the smallest; and the observation error on scrub is the largest. Overall, the bias and standard deviation of the O−B of channels 5 and 6 are smaller among all channels.