1. Introduction
The retrieval of the vertical profile of an atmospheric parameter requires the solution of an inverse problem [
1,
2,
3] that is often ill-posed [
4], and in order to obtain a stable solution, some a priori information has to be added in the retrieval process. A commonly used method to retrieve atmospheric parameters using remote sensing is the
optimal estimation method [
1], where the a priori information is represented by an a priori profile and by an a priori covariance matrix (CM) of the unknown parameter, and the solution is given by the profile corresponding to the maximum a posteriori probability calculated with the Bayes theorem (see e.g., [
5]).
Since, in the last few years, the number of satellite instruments that are sounding the atmosphere has increased at a high rate, it is very likely that more instruments will simultaneously measure either the same vertical profile or vertical profiles corresponding to nearby geo-locations. In this case, the different retrieved profiles can be combined into a single product that includes all of the available information, and we refer to this combination as
data fusion [
6]. Accordingly, the choice of the a priori information and of the vertical grid has to take into account the possibility that the result of the retrieval will be fused with other measurements [
7,
8,
9,
10,
11]. The data fusion approach is alternative to that of the synergistic retrieval [
12], in which all of the available observations are simultaneously used in a single retrieval; for a detailed description and comparison of the two methods see [
13] and references therein.
In light of the increased requirement of fused products, we consider the possibility of using new variables representing the retrieval products, with the purpose of simplifying the subsequent fusion processes. A change in the retrieval products is proposed in view of developing a shared formalism, which facilitates the interface between data providers and data users, while ensuring a full exploitation of the available information. The advantages of the new variables with respect to those currently used are analyzed on a theoretical basis.
When the result of the retrieval is used in subsequent data fusion operations, the vertical grid of the fusing products should be as fine as needed for the representation of the information content of the final fused product, rather than of the information content of the individual measurement, because, as shown in [
14], in the latter case, some information is lost. This can be easily done because the use of the a priori information allows for representing the profile on a vertical grid as fine as desired. Therefore, the retrieval products are no longer chosen with the objective of providing the user with a useful representation of the observed profile, but rather as the best input for the fusion process, possibly independent of a priori information. Therefore, the question arises of whether by removing the objective of the graphical representation of the profile a more functional data transfer of the retrieval products can be considered.
Generally, in order to make complete use of the products in further processing such as data fusion or data assimilation, the retrieval products are represented by means of the retrieved profile, the averaging kernel matrix (AKM), the retrieval CM and the a priori information used in the retrieval.
We propose new variables calculated starting from these standard retrieval products that are a new way to save the information provided by the measurements and have several advantages with respect to the standard quantities. In the linear approximation of the forward model, the new variables are independent of the a priori information used in the retrieval and decrease the data volume requirement. Furthermore, they can be used to represent the profile with any a priori information and are quite suitable for subsequent data fusion operations.
In
Section 2, we recall useful notations and equations, linearize the transfer function and introduce the new variables. In
Section 3, we describe the advantages of the new variables with respect to the standard retrieval products concerning representation of the profile, data fusion and reduction of the data volume. Finally, in
Section 4, we draw the conclusions.
2. The New Variables
2.1. Recall of Notations and Equations
We assume to have retrieved the vertical profile
of an atmospheric parameter from a set of observations (radiances)
with the optimal estimation method [
1], using a profile
and a CM
as a priori information. We indicate with
the forward model, which allows us to express the observations
as a function of the true profile
by the following equation:
where
is the vector including both the noise errors of the observations and the forward model errors, due to parameter errors and physical approximations of the forward model. Generally, the forward model calculates the radiative transfer through the Earth’s atmosphere and knowing the state of the atmosphere, the observation geometry (for example either limb or nadir) and the characteristics of the instrument allows us to simulate the radiances measured in the given conditions. In order to simplify the formulation, we assume that there are no forward model errors and, therefore,
includes only the noise errors of the observations and is characterized by
and
.
indicates the mean value, and
is the CM of the noise errors of the observations. A formulation that takes into account forward model errors can be obtained defining new observations corrected for the bias of the forward model errors and replacing
with the sum of
and the CM of the random part of the forward model errors.
The sensitivity of
to the true profile
is described by the AKM
, and the retrieval errors of
are described by the CM
, which is the sum of the CM of the noise errors
and the CM of the smoothing errors, which are due to the smoothing of the true profile caused by the averaging kernels,
. The AKM and the CMs are given by (see Equations (3.28)–(31) in [
1]):
where
with
being the Jacobian of the forward model
calculated at
:
. The matrix
is the Fisher information matrix [
1,
15], defined as
where
is the conditional probability distribution to obtain
given
, which, considered as a function of
, is referred to as the
likelihood function [
16]. In the case that the inverse problem can be solved without constrain (
), that is when we can find the solution of maximum likelihood, from Equations (3)–(5), we see that
is equal to the inverse matrix of the CM of the retrieval errors (
), which coincides with the CM of the noise errors (
). From this consideration, we can understand that the physical meaning of
is quantifying the information provided by the observations
about the retrieved vertical profile.
depends on the a priori information used in the retrieval through calculated at , which depends on and . Therefore, the dependence of on the a priori information is due to the second order terms in the expansion of the forward model as a function of the profile , and consequently, when the linear approximation of the forward model is valid, is independent of the a priori information.
2.2. Linearization of the Transfer Function and Variables
We can consider the whole measuring system, including both the observing system and the retrieval method, as an operation that transforms the true profile
into the retrieved profile
and, accordingly, defines the retrieved profile
as a function of the true profile
. This function is referred to as the
transfer function [
1], and besides being a function of
, it is also a function of the noise errors
of the observations
. This dependence can be seen recalling that really
depends on
through the observations
; therefore, using Equation (1) we can write
, which we indicate as
. We note that
because from Equation (1), it results that
is the identity matrix.
Expanding the transfer function at the first order around the a priori profile
and zero errors
, we obtain:
Concerning the first term of the expansion, we recall that the retrieved profile obtained with the optimal estimation method in the absence of errors is a weighted mean between the true profile and the a priori profile. Therefore, when the true profile coincides with the a priori profile, the retrieved profile in the absence of errors results in the a priori profile, that is
. This result is peculiar of the optimal estimation method, and if we wish to extend the results of this article to retrieval methods different from the optimal estimation, it is necessary to identify a linearization point for which we know the value assumed by the transfer function. This consideration also applies to the complete data fusion method [
6,
17] and to all the methods that are based on the expansion of the transfer function.
Under the approximation that the derivatives do not significantly depend on the point where they are calculated, we have
and
, where
is the gain matrix and is given by
On the basis of these considerations, Equation (8) becomes
Following the approach described in the complete data fusion method [
6,
17], we define the vector
:
which can be calculated knowing the retrieved profile, the a priori profile and the AKM. Substituting
from Equation (10) into Equation (11), we see that
is equal to
and provides a measurement of the true profile made using the rows of
as weighting functions. Equation (12), together with Equations (2) and (9), shows that
(differently from
), in the linear approximation of the forward model is independent of the a priori profile
; however, through the expressions of
and
, it maintains dependence on the a priori CM
.
2.3. The New Variables
We define the vector
as
and using Equations (2), (5), (9) and (10), we obtain
where the vector
is given by
Equation (14) provides the physical meaning of , that is the measurement of the true profile in which the weighting functions are the rows of , and is the vector that includes the errors of this measurement. Furthermore, from Equations (14) and (15) we see that , in the linear approximation of the forward model, is uniquely determined independently of both and .
Using Equation (14), we calculate the sensitivity of
to the true profile, which is the AKM of
and from Equations (6), (14) and (15), we calculate the CM of
Therefore, both the AKM and the CM of coincide with the Fisher information matrix .
From Equation (13), we see that the dimensions of are the inverse of the dimensions of : ; therefore, does not represent a profile of the parameter that we aim to retrieve. However, as we noticed in the introduction, this is not a problem, because the objective of the retrieval products is no longer the graphical representation of the profile, but to efficiently provide all of the information of the observations to subsequent data analyses.
4. Conclusions
With the increasing use of the atmospheric profiles retrieved from atmospheric satellite observations in data fusion operations, the requirement that these products provide a representation of the observed quantity is less important, and other features, such as completeness and compactness of the information, are becoming more relevant. In light of this, new retrieval variables have been proposed when the retrieval has been performed with the optimal estimation method and the first order approximation of the transfer function is appropriate. These variables, referred to as , are the measurement of the true profile obtained using the rows of the Fisher information matrix as weighting functions. This measurement does not provide a representation of the profile, but has several useful properties: in the linear approximation of the forward model, it is independent of the a priori information used in the retrieval, and both the AKM and the CM of coincide with the Fisher information matrix. Furthermore, the variables can be used to obtain the representation of the vertical profile with an a priori information selected by the user, and they can be directly used to perform the data fusion of a set of measurements performed with different instruments. For the exploitation of these products in the subsequent operations, it is sufficient to provide and the Fisher information matrix , which fully characterizes the measurement, being both its AKM and its CM. Accordingly, the use of the variables allows us to reduce the stored data to about one-third of its volume with respect to the use of the standard products. These properties of the variables make them a perfect retrieval product when further processing is performed by the users and encourage the possibility of considering finer retrieval grids, possibly concerted by the scientific community rather than determined by instrumental considerations. On the other hand, the standard products have the advantage of providing a graphical representation of the measured profiles. However, it is important to notice that the possibility of a graphical representation is obtained at the cost of a constraint on the adopted retrieval grid. The retrieval grid is usually limited in extension and density of points in order to avoid a too large bias of the a priori information, and different instruments freely use different retrieval grids that complicate comparisons. A storage procedure that does not depend on the a priori information can use a retrieval grid commonly used with the other instruments and avoid these difficulties.
The communities of data providers and data users are invited to test and validate the efficiency of this new interface.