1. Introduction
The longest space-based observation series from a passive meteorological multispectral imager is that recorded by the Advanced Very High Resolution Radiometer (AVHRR, [
1]). The TIROS-N satellite, launched in a sun synchronous polar orbit in October 1978, was the first to carry this sensor. The sensor has subsequently been onboard many NOAA and EUMETSAT MetOp satellites, with the very last sensor onboard MetOp-C launched in November 2018. Consequently, the sensor has enabled the observation of clouds, aerosols, and the earth’s surface for more than 44 years. It is anticipated that the mission will continue for several more years, and that it may even complete a fifth decade of observations.
An observation series based on one practically unchanged sensor (with only minor exceptions) spanning almost five decades is a great asset for environmental and climate studies. This is true even though obtaining accurate climate-quality observations from the AVHRR is challenging because the sensor was not originally designed for climate studies and cannot accurately depict trends. The uniquely long period of observation and the global coverage at high spatial resolution could still be of interest for studies of climate variability and the monitoring of medium-to-large global changes as well as changes in the geographic distribution of certain environmental parameters on the earth’s surface.
This paper focuses on cloud observations from the AVHRR. The description and evolution of global cloudiness in climate models remains the largest uncertainty in climate scenarios and predictions [
2,
3,
4,
5,
6]. In particular, the description of low-level cloudiness in tropical and sub-tropical regions, which is responsible for a large part of the cooling effect from the reflection of solar radiation back into space, is critical here. It is critical because changes in the amount of low-level clouds may easily contribute significantly to either a positive or a negative feedback effect on the global radiation budget [
2].
Estimations of global cloud conditions and their changes over time derived from AVHRR data are available in several earlier studies [
7,
8,
9,
10]. These earlier studies indicate a negative trend in global cloud cover [
10]. However, it has been challenging to establish details concerning the contribution of different cloud levels to overall changes, mainly because the AVHRR mostly only observes the highest cloud layers. Additionally, the previously studied time series were too short to estimate trends adequately. In this paper, we will present results from the third edition of the Climate Monitoring Satellite Application Facility (CM SAF [
11]) cLoud, Albedo and surface RAdiation dataset from the AVHRR data, abbreviated CLARA-A3 [
12]. The data record incorporates several cloud products, surface and top-of-atmosphere radiation products, and surface albedo products (for access and a complete description, see the Data Availability Statement section). However, here we focus only on the global distribution of cloudiness, the observable contribution of vertical cloud levels, and the associated cloud-top height information. The improvement in the temporal coverage compared with the predecessor CLARA-A2 is quite significant, with an extension in time from 34 years to 42 years. CLARA-A3 now incorporates AVHRR observations from its very introduction (1979) to the year 2020, which means that both the start and the end of the time series have been extended compared with those of CLARA-A2. Additionally, the CLARA-A3 data series is extended continuously by the ongoing production of so-called Interim Climate Data Records (ICDRs), which means that the complete data record up to the present (2023) covers 44 years rather than 42 years.
This paper describes the main improvements in cloudiness and cloud-top information in CLARA-A3 compared with its predecessor, CLARA-A2. We will also repeat inter-comparisons with similar data records (as in [
10]) and take a closer look at possible regional differences and trends in absolute values. It is also of particular interest here to have a preliminary look at possible changes in low-level cloudiness at low and middle latitudes, bearing in mind its importance for the earth’s radiation balance.
In
Section 2, we describe the methods and reference datasets used to evaluate the achievements of the new CLARA edition. We also introduce a method for evaluating the distribution and the potential changes in the contribution of low-level clouds. After that, results are given in
Section 3, and this will be followed by a discussion in
Section 4 and our conclusions in
Section 5.
2. Data and Methods
2.1. Cloud Fraction and Cloud-Top Height Products in CLARA-A3
The global monthly mean cloudiness parameter belongs to the Cloud Fractional Cover (CFC) product of the CLARA-A3 data record. It is calculated from the binary cloud masks from all AVHRR overpasses in a month by averaging the results in a 0.25° regular latitude–longitude grid. The binary cloud masks are derived from thresholding cloud probabilities at 50%, provided by the cloud probability product, CMAPROB. CMAPROB is computed using naïve Bayesian theory and trained using the data from the cloud lidar instrument CALIOP (Cloud-Aerosol Lidar with Orthogonal Polarization) on the CALIPSO (Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observations) satellite recorded between 2006 and 2016 (for full details, see [
13]).
The corresponding cloud-top height product (denoted CTO) is based on monthly averaged CTO products from every AVHRR overpass. Cloud-top geometrical heights (CTH), as well as cloud-top pressures (CTP) and cloud-top temperatures (CTT), were estimated via infrared AVHRR channels by an artificial neural network (ANN), also trained using data from the CALIPSO–CALIOP sensor. The method is described in detail in [
14].
Combining the cloud mask and CTP information makes estimating the gross relative contributions of the various vertical cloud layers to the total cloud cover possible. Thus, the CFC product includes information on the observable (i.e., not obscured by higher clouds) contribution of low-level, medium-level, and high-level clouds using the classical International Satellite Cloud Climatology (ISCCP) subdivisions of these three cloud layers which refer to cloud-top pressure levels of 680 hPa and 440 hPa.
2.2. Detailed Evaluation of Global Cloudiness and Cloud Detection Sensitivity from CALIPSO
A detailed evaluation of the CLARA-A2 cloud mask was previously performed using cloud information from the CALIOP cloud lidar on the CALIPSO satellite [
15]. This was based on information derived from globally collocated AVHRR and CALIPSO–CALIOP measurements and CALIOP cloud masks and cloud optical thickness products. In addition to estimating traditional validation scores (i.e., mean error or bias, hit rate, and Hanssen–Kuipers skill (KSS) score, all of which are explained in [
16]), the study defined a cloud detection sensitivity parameter (CDS) for evaluating cloud detection efficiency as a function of the geographical position on the earth. In other words, CDS describes the smallest cloud optical thickness for which the probability of detection by the CLARA-A2 cloud mask is still equal to or better than 50%. This was estimated in an equal-area global grid, thus providing information on how cloud detection efficiency varied with the geographical position on the earth. We repeat this validation study here, but now for CLARA-A3, to see the extent to which the validation scores and CDS values have changed. The basic validation scores for CLARA-A3 were estimated using independent results from 2010, since collocation data from this year were not used to train the cloud-masking method. However, the entire matchup dataset from 2006 to 2016 was used to estimate the CDS parameter to enable results with global coverage on a coarse-resolution grid.
2.3. Detailed Evaluation of Cloud-Top Height Information from CALIPSO
Regarding the CLARA-A3 cloud mask, a detailed evaluation of the cloud top information (i.e., the CTH product) from 2010 was performed using the globally collocated AVHRR and CALIPSO–CALIOP information. These results were compared with the corresponding results from CLARA-A2. The validation scores used (which are defined in [
16]) were the mean error (bias), the mean absolute error (MAE), and the bias-corrected root mean square error (bc-RMSE). Results were also given separately for the three vertical cloud layers (low-level, medium-level, and high-level clouds).
We can expect the CTH accuracy results to depend upon cloud thickness and whether multi-layer clouds exist. To illustrate this, we complement the overall results with results obtained under certain special conditions (e.g., single-layer clouds, multi-layer clouds, opaque clouds, and semi-transparent clouds).
2.4. Comparison of CLARA-A3 Results and Other Cloud Climate Data Records
Several other satellite-based global cloud climatologies exist, and we have here compared CLARA-A3 CFC and CTP monthly mean values with results from the most up-to-date versions of the following data records:
ISCCP-HGM: The International Satellite Cloud Climatology Project (ISCCP) H-Series has provided a cloud CDR with monthly gridded results since 1983 [
17,
18]. Here, we used the ISCCP-HGM product as it provided the Level-3 monthly means. At the time of this evaluation, the ISCCP-HGM CDR (v01r00) data from 1983 through June 2017 were available, and a corresponding ICDR, including data through December 2018 (covering a total of 35 years), was also available.
PATMOS-x: This evaluation used the latest version (v06r00) of the PATMOS-x cloud CDR [
9]. The daily Level-2b data are available at 0.1 × 0.1-degree resolution for each AVHRR-carrying satellite (separate data are available for the ascending and descending nodes). Note that an official Level-3 PATMOS-x product based on this latest version is unavailable. Therefore, we computed monthly Level-3 aggregations using a 50% threshold on the Level-2b cloud probability data to facilitate a fair comparison. The PATMOS-x version 6 data used here cover the period 1982–2020 (39 years).
ESA-CCI: The AVHRR-based cloud CDR (Version 3) derived in the framework of the ESA Climate Change Initiative [
8] was also used in this evaluation. It covers the period from 1982 through 2018 (37 years) and includes only the data collected by the prime satellites carrying the AVHRR sensors. The Level-3 monthly CDR is available at 0.5 × 0.5-degree resolution.
MODIS: This study used the Level-3 monthly 1 × 1-degree cloud property products from the latest Collection 6.1 [
19] derived from MODIS, onboard the Aqua satellite (product MYD08_M3). They cover the period from 2003 through 2020 (18 years).
CALIPSO–CALIOP: This study used version 1.00 of the CALIOP lidar Level-3 cloud product (1 × 1 degrees) derived for the Global Energy and Water Cycle Experiment (GEWEX) Cloud Assessment [
20]. It reports different flavors of cloud property statistics. Two flavors, namely TopLayer and Passive, were used here. The TopLayer flavor provides cloud descriptions of the uppermost tropospheric cloud layer that the CALIOP sensor can see. These include sub-visual thin cirrus clouds that typically lie below the detection limit of passive sensors such as AVHRR. The Passive flavor, on the other hand, uses an optical depth threshold of 0.3 to provide cloud statistics to facilitate comparisons with the passive sensors. The data from this CALIPSO Level-3 GEWEX product covering 2007 through 2016 (10 years) were analyzed. Note that the dataset is defined using the same CALIOP Level 2 products used in [
15].
Consequently, we carried out inter-comparisons of different climate data records in the same manner as in [
10], but with more recent versions of the data records and several additional CDRs.
2.5. Evaluating Potential Changes in Low-Level Cloud Contributions at Low and Middle Latitudes
Deducing changes in low-level cloudiness from AVHRR data is not trivial; an accurate estimation of low-level cloudiness is impossible because of obscuring clouds at higher levels. Thus, the estimated amount of low-level clouds will always be an underestimate of the true amount. However, one has to consider that the cooling effect caused by low-level clouds is most significant when upper-layer clouds are absent. If higher clouds are present, especially if these are high and thin cirrus clouds, reflected sunlight and thermal radiation emitted from the low-level clouds might be absorbed by these higher clouds, and this can actually lead to a warming instead of a cooling of the system. Thus, to really see a significant cooling effect, we need to study cases in which low-level clouds are present and upper-level clouds are absent. During daytime, such cases lead to large amounts of solar radiation being reflected back into space, and most of this thermal radiation may radiate into space without significantly interacting with the atmosphere and clouds. This leads to a sizeable cooling effect.
Consequently, the contribution of low-level clouds to the total cloud cover is important, even if we cannot correctly depict all low-level clouds. Therefore, we will look at the geographical distribution and temporal evolution of the observable low-level cloud fraction over the CLARA-A3 period. Since the cooling effects should be most prominent at low- and mid-latitude areas (where the solar radiation input and the thermal radiation output are high), we have focused on these regions. In addition, we have separated land and ocean areas to see whether there are specific regions (e.g., oceanic marine stratocumulus areas) that have experienced more visible changes.
4. Discussion
Validation studies of cloud detection efficiency and cloud-top height information have revealed further improvements in the new CLARA-A3 CDR compared with its predecessor, CLARA-A2. This improvement is mainly due to CLARA-A3 capitalizing on the high-quality information on clouds obtained from CALIOP on the CALIPSO satellite, which has provided invaluable input for the training of the cloud detection and cloud top retrieval systems. It has facilitated a shift from the traditional binary cloud-masking method to a new probabilistic approach. Even if improvements in cloud detection efficiency are not dramatic (compared with the results presented in [
15]), other advantages are evident in this simple validation of an interpreted binary cloud mask. For example, access to cloud probabilities allows users to utilize cloud information in a stricter mode (i.e., in a clear conservative or cloud conservative mode) for applications that are very sensitive to cloud masking errors. This has enabled safer downstream processing of other CLARA-A3 products such as surface radiation budget components and surface albedo (as is explained in [
12]). This detailed evaluation of the improved cloud detection sensitivity (CDS) parameter also has positive implications for other applications. The underlying method relies on calculating the probability of detection (POD) of a cloud with a specific optical thickness. Since these statistics are geolocated, it is possible to express POD as a function of optical depth and location anywhere on earth. These POD lookup tables form the basis of the CFMIP Observation Simulator Package (COSP) simulators for the CLARA data record ([
22]), which can be used in climate model inter-comparisons. This simulator, initially developed for CLARA-A2, has now been updated to be valid for the new CLARA-A3 edition.
The motivation for defining the CDS parameter came mainly from the observation that the cloud detection limits of cloud screening methods have for a long time taken a very simple form, most often that of a fixed optical thickness threshold (e.g., cloud optical thickness 0.3, as used in the COSP ISCCP simulator [
23]). However, it is obvious that the ability to detect clouds in satellite imagery depends to a large extent on the properties of the underlying surfaces. One should therefore expect quite substantial variability in the effectiveness of cloud detection, and our ambition was to find a measure that would describe this variation. We found it useful to use the optical depth value above which a majority (50%) of clouds are detected. Consequently, we observed that the CDS parameter should not be interpreted as a definitive limit above (or below) which all clouds are detected (or not detected). This situation (i.e., when the POD values go quickly from 0% to 100% when passing the optical thickness threshold of the CDS parameter) only occurs occasionally for locations with very low CDS values. In contrast, for locations with very high CDS values, there is normally only a gradual increase (or decrease) in POD values for increasing (or decreasing) cloud optical thicknesses. Therefore, the best way of describing the overall cloud detection ability at a particular location is to use the full range (or full profile) of the POD values as a function of cloud optical thicknesses. This has been applied in the CLARA-A3 simulator [
22]. The long operation time of the CALIPSO satellite (much longer than the nominally expected lifetime) enabled the estimation of the CDS parameter globally with a fine horizontal resolution. This enabled the compilation of a large enough number of AVHRR/CALIOP collocations.
This evaluation of CFC and CTP parameters using plots of global averages and global difference distributions reveals many noteworthy features which require further investigation to be fully understood. However, for the globally averaged CFC parameter, we may conclude that the different CDRs have reached a better agreement on global cloudiness than in previous evaluations [
10,
24]. The range of the spread of CFC values has decreased, and there is now a stronger indication of relatively stable global CFC values with only a minimal negative trend over four decades. Questions remain concerning the first two satellites in the series, the Tiros-N and NOAA-6 satellites. Their CFC values are somewhat high (and reinforce previously found negative trends if included in the calculations), and their CTP values are much lower than those of the other satellites. The indication of a cold bias can be seen in the 11 µm channel of the Tiros-N AVHRR when compared with the corresponding HIRS channels ([
25] The cold bias could explain the overestimated CFC and the underestimated CTP results. The seasonal variation in the CFC values (i.e., high values in the late summer or early autumn in the northern hemisphere) can be explained by the asymmetric distribution of land cover in the two hemispheres. This leads to excessive CFC values in the northern hemisphere linked to summer monsoons over India, Indonesia, and northern Africa. However, it is still uncertain how much of this variation can also be attributed to problems in cloud detection, e.g., to the underestimation of CFC during the polar winter, which primarily affects the South Pole region. Unfortunately, the CALIPSO–CALIOP observations cannot be used to verify this in the central parts of polar areas. The seasonal variation in the CTP parameter is more marked for some CDRs, especially CLARA-A3 and MODIS. Surprisingly, the PATMOS-x CDR does not show the same seasonal variation in CTP values. We suspect that his is because the new PATMOS-x CRD merges the results from the AVHRR and HIRS. Adding HIRS data probably improves the cloud detection capability of PATMOS-x, but the coarser resolution may lead to somewhat smoother results (despite applying a method to enhance the spatial resolution of the HIRS data). More studies are needed to confirm this. Here, we can only conclude that including HIRS data does not seem to lead to significantly better results than those obtained with CLARA-A3. The new ANN-based retrieval method in CLARA-A3 appears to have found additional, previously unused information in the AVHRR data that can be used for CTP retrieval.
The improvements in CFC and CTP product quality evident in CLARA-A3 triggered an attempt to estimate potential trends in the observable CFC contribution of low-level clouds. Any changes here could lead to substantial changes in the radiation balance at the earth’s surface and at the top of the atmosphere. However, the results are not conclusive because ENSO cycles heavily influence cloudiness in large regions where low-level clouds often occur. Nevertheless, a noteworthy mean negative trend of about −0.5% per decade over ocean surfaces in sub-tropical and mid-latitude regions exists. Further studies must be undertaken to determine the significance of these results and remove additional uncertainties. Such uncertainties include whether the presence of optically thin or optically thick clouds affect the changes in the observable fraction of low-level clouds. Investigating this requires revisiting all of the original level-2 products, since these include the necessary information on cloud optical thickness. However, this also necessitates separating cloudiness data according to nighttime and daytime conditions. Finally, further studies must also involve the surface and top of atmosphere radiation products in the CLARA-A3 CDR to determine the effects and trends of these radiation products. The radiation contribution from low-level clouds is only one of many factors contributing to the radiation budget. However, one should investigate whether the possible negative trend in low-level cloudiness over ocean surfaces found here is also accompanied by increasing trends in incoming solar radiation at the surface and decreasing trends in outgoing reflected solar radiation at the top of the atmosphere.
5. Conclusions
This study investigated the quality of global cloudiness (cloud fraction) and cloud-top height information in the CLARA-A3 CDR. Compared with its predecessor, CLARA-A2, we observed clear improvements in the detection of very thin clouds over most oceanic areas, particularly those in the tropical region, and over the Arctic Ocean. For land areas, the improvements were marginal, but the switch to a probabilistic approach has facilitated the use of cloud information when processing additional radiation and surface albedo products in the CLARA-A3 CDR.
The most prominent improvement was that observed in the CLARA-A3 cloud-top height product, which has significantly reduced the previous systematic underestimation of cloud-top altitudes. For high-level clouds, the improvement is better than 2 km when compared with the CALIOP cloud lidar. In addition, the tendency to overestimate the cloud tops of low-level clouds has been reduced. These improvements in the basic cloud products have improved the overall description of the three-dimensional distribution and the temporal evolution of global cloudiness, even if descriptions of the vertical extension of clouds is limited exclusively to describing the upper cloud boundary.
For demonstration purposes, and to capitalize on these improvements, we finally looked at the changes in the observable fraction of low-level clouds over the 42-year CLARA-A3 period. These clouds are though to play an essential role in a net cooling effect on the earth’s climate. The results are inconclusive since there are still visible features linked to the influence of ENSO modes. However, outside of the core tropical region and the polar areas (where cloud detection is still a challenge), a negative trend of about −0.5% per decade in low-level cloudiness was found over oceanic areas. Further studies are required to verify these findings, and these studies should incorporate additional cloud optical thickness information and consider the results of the CLARA-A3 surface and top-of-atmosphere radiation budget products.