3.1. Rough Sea Surface Emissivity Correction
Physically, the emissivity of rough sea surfaces
is related to sea surface roughness and scattering, which is mainly affected by sea surface wind field, wave, and precipitation. However, studies also find a relationship between rough sea surface radiation and SST and atmospheric parameters [
31]. Therefore, before developing the correction model, the correlation between C/X-band rough sea emissivity and several geophysical parameters (U10, SST, WV, and CLW) is studied. The correlation coefficients of sea surface emissivity
to geophysical parameters are shown in
Table 1.
Clearly, for both C- and X-band, the rough sea emissivity shows the highest correlation to 10 m wind speed than SST, WV, and CLW.
Table 1 presents some interesting results. It is not surprising that the emissivity of rough sea surfaces is related to sea surface wind speed according to the RTM. However, why the emissivity of rough sea surfaces is associated with SST and atmospheric parameters is a question worthy of discussion. The correlation between the emissivity of rough sea surfaces and SST, WV, and CLW may stem from the underlying relationship between the sea surface wind field and the above three parameters, rather than a direct correlation between the emissivity of rough sea surfaces and these three parameters. Therefore, we argue that establishing an empirical correction model for the emissivity of rough sea surfaces and wind speed can “explain” most of the correlation between the emissivity of rough sea surfaces and ocean–atmospheric parameters. If that is correct, the correlation between the emissivity residual and other parameters will be significantly reduced after wind speed correction. To verify our hypothesis, we analyze the correlations between the corrected residual emissivity
and the four air–sea parameters. The results are also presented in
Table 1. The results show that, after correction, the correlation between the emissivity residual and sea surface wind speed is significantly reduced. This verifies the effectiveness of the wind speed-related emissivity correction model established in this study. Meanwhile, the correlations between the corrected residual emissivity and SST, WV, and CLW also decrease significantly. This confirms our hypothesis: the simultaneous correlation of rough sea surface emissivity with the four air–sea parameters arises from two sources—the direct correlation between emissivity and wind speed, and the indirect correlation between wind speed and the other three parameters.
We use a polynomial fitting function to correct wind effect in TB data. To do this, the rough sea emissivity of C- and X-band
are averaged in the wind speed range of 0–16 m/s with a bin width of 1 m/s. An important issue for polynomial fitting is the selection of the maximum order. While high-order polynomials offer better data fitting, they tend to cause overfitting. Thus, we need to strike a reasonable balance between reducing fitting errors and avoiding overfitting. To verify the robustness of polynomials of different orders, we used polynomials from the 1st to the 5th order and conducted 100 numerical fitting experiments for each. In each numerical experiment, 70% of the data was randomly sampled from the entire data set of 2018 to construct the fitting polynomial, while the remaining 30% was used as the test data set to calculate the RMSE (
Figure 2a) and coefficient of determination R
2 (
Figure 2b) of the fitting polynomial. Meanwhile, we also calculated the probability that each order of polynomial becomes the optimal polynomial (with an RMSE smaller than that of the other four polynomials) during 100 numerical experiments (
Figure 2c). We also computed the mean value and standard deviation of fitting errors for each polynomial across these numerical experiments, so as to reflect the sensitivity of polynomials of different orders to the randomness of the modelling data (
Figure 2d).
In
Figure 2a,b, the 1st-order (blue line) and 2nd-order (red line) polynomials struggle to accurately characterize the relationship between the rough sea surface emissivity and the sea surface wind speed. For polynomials of the 3rd-order and above, their RMSE values and coefficients of determination are close to each other. It can be seen from
Figure 2c that in all 100 numerical experiments, the probability of the 3rd-order polynomial being the optimal fitting polynomial is only slightly lower than that of the 5th-order polynomial and higher than that of the polynomials of other orders. Meanwhile,
Figure 2d shows that the average fitting errors of the 3rd-order polynomial is slightly higher than that of the 4th-order and 5th-order polynomials. However, the stability of fitting errors (characterized by the standard deviation) is better than that of the 4th-order and 5th-order polynomials. Consequently, after comprehensively considering the accuracy and robustness of the fitting polynomials, we select the 3rd-order polynomial to fit the relationship between the rough sea surface emissivity and wind speed. The relationship between rough sea emissivity and wind speed is expressed using a 3rd-order polynomial in Equation (6). And the fitting coefficients are listed in
Appendix A,
Table A1.
The fitting functions are shown in
Figure 3 as red curves. Clearly, all data points match with the fitted curve very well. The deviation between data points and predicted values is better than 0.002. Meanwhile, data points and fitting curve also show high correlation, and the correlation coefficient is higher than 0.998 for both C- and X-bands.
As we have discussed above, after correcting for wind-induced roughness emissivity, the remaining emissivity residual does not show clear dependence on WV, SST, and CLW. The correlation coefficients of the residual with respect to SST, WV, and CLW are less than 0.1. This indicates that the emissivity residual after correcting for wind-induced effects has a weak correlation with SST, WV, and CLW. Moreover, from a physical perspective, the emissivity of a rough sea surface should be independent of SST and atmospheric parameters. Therefore, we use Equation (6) to correct for the sea surface roughness effect.
Applying Equation (6) to the test data set, the rough sea emissivity is calculated and subtracted from the total sea surface emissivity. Therefore, the flat sea surface emissivity extracted from WindSat data
is expressed as
To validate the accuracy of the rough sea emissivity correction model, the flat sea emissivity extracted from satellite data
using Equation (7) is compared with modeled
, the flat sea emissivity calculated by the dielectric constant model of sea water developed by Meissner and Wentz. The results are shown in
Figure 4. Clearly, the flat emissivity extracted from satellite data coincides with the theoretical values well, and most data points cluster around the 1:1 line. The RMS is about 0.001, and the correlation coefficient is considerably high.
3.2. λ-Based Emissivity Combination
As is discussed in previous sections, C- and X-band TB mainly focuses on SST retrieval. Therefore, if C- and X-band TB are directly used to retrieve SSS, the strong SST signal in these bands will mask the weak SSS information. In this section, the sensitivity of C/X-band emissivity to SST and SSS are discussed. We try to find a reasonable combination of C- and X-band emissivity to eliminate their sensitivity to SST.
Based on the MW dielectric constant model of sea water, the flat sea surface emissivity of C- and X-bands under different SST and SSS conditions are calculated and shown in
Figure 5. These figures reveal an obvious dependence of C/X-band to SST variations, which is expected, since these two bands are the primary channels for SST retrieval. Meanwhile, these bands also show sensitivity to SSS variations, especially under high SST conditions. The magnitude of the sensitivity of flat sea emissivity to SST is similar to that to SSS. Considering that the TB of a calm sea surface can be expressed as the product of the emissivity of the calm sea surface and the SST, we believe that the sensitivity of the C/X-band TB to SST arises more from the influence of the SST factor in the TB but not the emissivity. This also prompts us to use the emissivity of calm sea surfaces rather than TB for SSS retrieval. In a previous study, researchers use the difference between C- and X-band emissivity to retrieve SSS in the BOB [
8]. However, as
Figure 5 shows, the difference between these emissivities are still sensitive to SST changes. Simply subtracting C-band emissivity from X-band does not bring sufficient advantages to salinity retrieval. However, Equation (5) suggest that, by introducing a parameter λ, the dependence of λ-based emissivity combination on SST will be greatly weakened. Therefore, using the flat sea emissivities modelled by the MW dielectric constant model under various SST and SSS conditions, we calculate the partial derivatives of C/X-band emissivity with respect to SST, and the ratio of these two derivatives, i.e., the parameter λ. Note that we find the parameter λ is related to SSS, and it is necessary to determine a parameter λ that depends on SSS, rather than simply setting it as a fixed value. Consequently, we generate a lookup table for parameter λ and World Ocean Atlas (WOA) climatological monthly SSS field data [
32], where the SSS ranges from 25 to 40 psu with an interval of 0.5 psu. The λ-based combination of calm sea emissivities are shown in
Figure 5d. Evidently, the sensitivity of the λ-based combination of emissivities to SST has been basically eliminated, while the λ-based combination retains its sensitivity to SSS.
It is necessary to discuss the differences between the emissivity combination based on parameter λ and the linear combination method that directly subtracts the C-band emissivity from the X-band emissivity. Although the introduction of the parameter λ appears to constitute a “linear combination” of the C/X-band emissivities, we argue that this is not a true linear combination. By definition, λ equals the ratio of the partial derivatives of the C- and X-band emissivities with respect to SST. Thus, the nonlinear responses of the C/X-band emissivities to SST are inherently embedded in the definition of λ. This nonlinear relationship results in λ being not a constant but a variable related to the WOA salinity climatology data. Consequently, the emissivity combination we constructed is not a linear combination but one that inherently incorporates the nonlinear relationship between emissivity and SST.
Regarding the use of WOA monthly SSS field data that may prevent the salinity retrieval algorithm from capturing small-scale salinity signals, it should be noted that, in our method, the use of WOA SSS is relatively indirect. The WOA SSS is only used to choose the appropriate λ and is not directly involved in the calculation of SSS values. Meanwhile, the mean value, standard deviation, and dynamic range of Argo, WOA, and WindSat (our method) SSS in the BOB during 2017–2019 are displayed in
Table 2. Clearly, the mean values of all three SSS data sets are similar, which suggests our method has no systematic errors. The standard deviation and dynamic range of SSS can reflect the ability of salinity data to characterize salinity variations in the BOB. It is obvious that the standard deviation and dynamic range of WindSat SSS are larger than WOA SSS and similar to Argo SSS. Considering the Argo SSS is pointwise, these data naturally tend to reflect small-scale salinity variations. These results suggest that the introduction of the parameter λ related to WOA SSS does not exert a significant negative impact on WindSat SSS’s ability to characterize small-scale SSS variations.
3.3. Salinity Retrieval and Validation
Based on the λ-based combination of calm sea emissivity differences
, we developed an empirical model function relating SSS to SST and
through multivariate regression analysis:
where
is the fitting coefficient, and
i and
j are the orders of the combination of the V-pol calm sea emissivity
and sea surface temperature
TS. The fitting coefficients are listed in
Appendix A Table A2. Note that the parameter λ in
is a quantity related to salinity, and the appropriate selection of its value has a significant impact on the results of salinity retrieval. Clearly, using Argo or SMAP salinity data to select an appropriate λ may lead to the problem of data independence. Therefore, as a trade-off between maintaining data independence and selecting an appropriate value for λ, we use the monthly climatological salinity data from WOA to determine the value of λ, and then retrieve the SSS. Using the matched data set of SMAP and WindSat from 2017 to 2019 obtained in
Section 2.4, we randomly selected 70% of the 2018 data as the training data set, and the remaining 30% as the test data set. Meanwhile, to test the generalization ability of the SSS retrieval model, we used the data of 2017 and 2019 as the validation data set. The figures below show the salinity retrieval results of WindSat in the BOB region for the four seasons (Spring: Mar–May, Summer: Jun–Aug, Autumn: Sep–Nov, Winter: Dec–Feb). Observations reveal a nuanced seasonal and spatial pattern of SSS. Spring (
Figure 6a) is characterized by relatively homogeneous SSS across the bay, with weak horizontal gradients. This is attributed to negligible precipitation and river discharge, strong evaporation, and an anticyclonic circulation that advects high-salinity water northward, homogenizing the salinity field. In summer (
Figure 6b), a distinct low-salinity patch emerges in the northern bay (SSS < 31 psu) associated with peak Ganges–Brahmaputra discharge, while offshore regions experience elevated SSS due to enhanced evaporation. Autumn (
Figure 6c) exhibits a considerable low-salinity area, as freshwater accumulated during summer spreads southeastward, covering a vast area in the northeastern bay with moderately low SSS. In winter (
Figure 6d), a remnant freshwater plume persists in the northeast, maintaining a stable north–south salinity gradient with lower values in the north and higher values in the south. A common feature across all figures is a persistent year-round low-salinity region in the southeastern Bay of Bengal, likely sustained by local precipitation, runoff from Myanmar, or exchange with the Andaman Sea.
Regarding the validation of the salinity retrieval model, since the SMAP satellite salinity data itself has an error of 0.6–0.8 psu, a direct comparison between the WindSat salinity retrieval results and the SMAP salinity data cannot provide a good estimate of the WindSat salinity retrieval error. To address this issue, we adopt two methods to validate the WindSat salinity retrieval results. First, direct comparison is conducted between WindSat data and the in situ observation data from Argos and MRBs. Second, the Argo/WindSat/SMAP matched data set and the triple collocation method is used to simultaneously estimate the errors of both WindSat and SMAP salinity data.
Using the spatiotemporal matching window of 12 h and spatial interval of 0.125 degrees, we collocate the WindSat SSS and the in situ data from Argos and MRBs during 2017–2019. Meanwhile, we adopted the same spatiotemporal matching strategy to match the salinity values retrieved by directly subtracting WindSat C/X-band emissivities, as well as the SMAP salinity values, with the in situ observation data. For all matched data sets, we calculated the mean bias, standard deviation, RMSE, and correlation coefficient as evaluation metrics. The results are shown in
Table 3. The results show that compared with the salinity retrieval results without using the parameter λ (where the calm sea surface emissivities of the C-band and X-band are directly subtracted), the use of λ reduces the salinity retrieval RMS error by approximately 0.1–0.2 psu, and the correlation coefficient is also significantly improved. This indicates that the introduction of parameter λ effectively enhances the sensitivity of the C/X-band difference to salinity and improves the accuracy of salinity retrieval. The RMSE of SMAP salinity data is slightly higher than that of WindSat salinity data; however, SMAP salinity shows a better correlation with in situ observation data. Meanwhile, it is noteworthy that the number of matched data points between SMAP and the in situ observation data is smaller than that between WindSat and the latter, indicating that the number of valid observations from WindSat is also superior to that from SMAP. We consider it beneficial to discuss the sensitivity of salinity inversion results to the value of lambda. Therefore, we introduced a ±10% variation in the parameter λ to analyze the sensitivity of the SSS retrieval results to the value of λ. The results show that the RMSE of SSS retrieved using the modified λ ranges from 0.96 to 1.14 psu, which is an increase of 0.24–0.42 psu compared to the original SSS error. These findings indicate that the SSS retrieval results are sensitive to the choice of λ, which motivates the present study to develop a delicate and effective method for determining λ. Meanwhile, the above results demonstrate that the SSS retrieval errors are larger than those obtained with the original λ value after either increasing or decreasing λ by 10%, validating the effectiveness of the method proposed in this paper.
The scatter plots of WindSat SSS versus in situ salinity data are illustrated in
Figure 7. Note the WindSat salinity values show greater dispersion in the low salinity range below 31 psu. As shown in
Figure 6, salinity values below 31 psu mainly occur in the coastal areas of the northern BOB. These regions receive freshwater input from multiple rivers, including the Ganges and Brahmaputra, resulting in a significant vertical salinity gradient. Under such conditions, the difference between the observation depth of microwave radiometers (approximately 1 cm) and that of buoy data (1 m to several meters) leads to a vertical representativeness discrepancy between the two salinity data sources, thereby causing greater dispersion of WindSat observations relative to in situ measurements in low-salinity areas. Meanwhile, these low-salinity areas are close to the northern coast of the BOB, and the TB data observed by the radiometer are susceptible to land contamination. This leads to a decline in the quality of TB data, which in turn affects the accuracy of salinity retrieval.
We discussed the validation of WindSat SSS data using Argo and MRB in situ buoys above. In fact, the direct comparison between satellite remote sensing data and in situ measurements tends to overestimate the errors of satellite data because the direct comparison method attributes all differences between satellite and in situ data to the errors of satellite data, ignoring the inherent errors of in situ observations and the spatial representativeness discrepancy between in situ and satellite data. In contrast, the triple collocation method estimates the random errors of each of the three spatiotemporally matched data sources by utilizing the variances of each data source and the covariances between every pair of data sources. The triple collocation equations can be expressed as follows:
where
and
are the scale and offset factors of system i.
is the random error of system
i.
S is the true salinity signal that can be observed by all four systems.
is the
ith SSS observation system (Argo, WindSat, and SMAP). It is evident that for the triple collocation method to yield robust results, the salinity observations provided by the three spatiotemporally matched systems should be consistent. Based on the three systems simultaneously capturing a stable and dominant true salinity signal
S, their respective random errors
are relatively small, leading to comparable variances and covariances among the three salinity data sets. Conversely, if the data variance of one salinity observation system is significantly different from that of the other two, or if the covariance between the observations of one system and the others is notably small, it indicates a substantial discrepancy and low correlation between the observations of this system and the others. Introducing such a data source into the triple collocation method will compromise the robustness of the error assessment results. We reconstructed a triple collocation data set (Argo/WindSat/SMAP). Before applying the triple collocation method, we calculated the variances of the four salinity data sets (Argo SSS, WindSat SSS with λ, WindSat SSS without λ, SMAP SSS) and the covariances between them. The covariance matrix is presented in the
Table 4.
As is clearly shown in
Table 4, the salinity retrieval results obtained by directly subtracting C/X-band emissivities (WindSat without λ) exhibit significantly lower variance and covariances with all other data sets. Based on our previous discussion, this indicates that the statistical characteristics of the WindSat without λ salinity retrieval results are significantly different from those of the other data sets. In contrast, the variance and covariances of the salinity data incorporating the parameter λ (WindSat with λ) are comparable to those of the Argo and SMAP salinity data sets. The above results demonstrate that the introduction of the parameter λ can effectively improve the quality of salinity retrieval data. Therefore, in the triple collocation calculation, we excluded the WindSat without λ salinity data and used the remaining three data sets for computation; the results are presented in
Table 5.
It can be seen that compared with the results of the direct comparison between satellite data and Argo data in
Table 4, the random errors of WindSat and SMAP salinity in
Table 5 have both decreased by approximately 0.1 psu. Notably, the error of Argo salinity is significantly overestimated. This result is not unexpected, as it reflects the spatial representativeness discrepancy between buoy data and satellite data. Buoys provide pointwise observations that can capture small-scale spatial salinity variations, whereas satellite data represent the average value within their antenna footprints (on the order of tens of kilometers) and thus struggle to resolve sub-footprint variability (SFV). Consequently, small-scale salinity signals detected only by buoys but not by satellites are interpreted as errors of the buoy data in the triple collocation method. Clearly, this is not a true error but rather a representativeness error caused by differences in spatial representativeness across multiple data sources. However, based on a previous study by the authors [
33], under such conditions, although the error of the high-resolution data set (i.e., Argo) tends to be overestimated, the triple collocation method still yields robust error estimates for the other two data sets. Therefore, we consider the results in
Table 5 to be a reliable estimate of the errors of WindSat and SMAP salinity data.
Due to the limitation of its tens of kilometers antenna footprint size, the data quality of microwave radiometers in coastal regions is significantly lower than that in open oceans, which in turn affects the accuracy of retrieved ocean parameters. Enlightened by the reviewer’s suggestion, we used a 0.5-resolution offshore distance look-up table to calculate the offshore distance for all matched data between WindSat salinity and Argo salinity. Then, within the offshore distance range of 100 km to 500 km, we binned the data at 100 km intervals and analyzed the variation of salinity retrieval error with offshore distance. The results are shown in the figure below. As shown in the figure, the salinity retrieval error decreases sharply with increasing offshore distance. At an offshore distance of 100 km, the SSS error exceeds 1 psu, while at 500 km offshore, the error falls below 0.6 psu (
Figure 8).