A Semianalytical Algorithm for Estimating Water Transparency in Different Optical Water Types from MERIS Data

: Water transparency (or Secchi disk depth: Z SD ) is a key parameter of water quality; thus, it is very important to routinely monitor. In this study, we made four efforts to improve a state-of-the-art Z SD estimation algorithm that was developed in 2019 on the basis of a new underwater visibility theory proposed in 2015. The four efforts were: (1) classifying all water into clear (Type I), moderately turbid (Type II), highly turbid (Type III), or extremely turbid (Type IV) water types; (2) selecting different reference wavelengths and corresponding semianalytical models for each water type; (3) employing an estimation model to represent reasonable shapes for particulate backscattering coefﬁcients based on the water type classiﬁcation; and (4) constraining likely wavelength range at which the minimum diffuse attenuation coefﬁcient ( K d ( λ )) will occur for each water type. The performance of the proposed Z SD estimation algorithm was compared to that of the original state-of-the-art algorithm using a simulated dataset (N = 91,287, Z SD values 0.01 to 44.68 m) and an in situ measured dataset (N = 305, Z SD values 0.3 to 16.4 m). The results showed a signiﬁcant improvement with a reduced mean absolute percentage error (MAPE) from 116% to 65% for simulated data and from 32% to 27% for in situ data. Outliers in the previous algorithm were well addressed in the new algorithm. We further evaluated the developed Z SD estimation algorithm using medium resolution imaging spectrometer (MERIS) images acquired from Lake Kasumigaura, Japan. The results obtained from 19 matchups revealed that the estimated Z SD matched well with the in situ measured Z SD , with a MAPE of 15%. The developed Z SD estimation algorithm can probably be applied to different optical water types due to its semianalytical features. to Type III waters based on their MERIS-derived R rs ( λ ) spectra. From the ﬁgure, the MERIS-derived Z SD values were consistent with the in situ measured Z SD values, with a MAPE value of 15%, an RMSE value of 0.08 (in log 10 unit), and a bias value of − 9%. The R 2 value of 0.57 and the slope of 0.82 also indicate the good performance of the new Z SD estimation algorithm. Since the Jiang19 algorithm also selected the same reference band (band 12 of MERIS) and the corresponding QAA (QAA_T754) for all matchups, identical results were obtained when we used the Jiang19 algorithm (data not Systematic overestimation was not found in the application of MERIS data.


Introduction
Water transparency relates to the depth to which light will penetrate water and to photosynthesis changes in a specific waterbody over time; thus, it is key to thoroughly evaluating water quality [1][2][3][4][5][6][7][8][9]. Changes in water transparency can also be an indicator of a human threat to an ecosystem [10][11][12][13][14]. Water transparency is often represented using Secchi disk depth (Z SD ), which can be measured using a traditional Secchi disk or estimated using the remote sensing technique. Recently, the remote sensing technique has been widely used because it is capable of providing a synoptic view of Z SD distribution in waters and estimation algorithms have been fine tuned [15][16][17].
Generally, two types of Z SD estimation algorithms are used with the remote sensing technique: empirical and semianalytical algorithms. Empirical algorithms use a simple regression analysis between remote sensing data and in situ measured Z SD values, while semianalytical algorithms are based on an underwater visibility theory [4,[18][19][20][21]. Compared to empirical algorithms, they are more suitable for estimating Z SD values from remote sensing data because they can be applied to various waters without the requirements of in situ data for algorithm recalibration [21,22].
Two underwater visibility theories are currently in use: the classic theory proposed in 1952 [23] and the new theory proposed in 2015 [4]. Previous studies have found that Z SD estimation algorithms based on the new theory perform better than those based on the classic theory [4,[20][21][22]24]. Algorithms based on the new theory rely on the diffuse attenuation coefficient (K d (λ)) at a wavelength corresponding to the maximum light transparency [4]. In these algorithms, the accurate estimation of total absorption and backscattering coefficients (a(λ) and b b (λ), respectively) using an appropriate quasianalytical algorithm (QAA) is a key step for estimating Z SD values more accurately. Therefore, the selection of the most appropriate QAA for each water type is necessary [4,15,19,21,[25][26][27][28].
Recently, Jiang et al. [21] developed a semianalytical algorithm (hereafter referred to as the Jiang19 algorithm) to estimate Z SD values with the aim of improving a Z SD estimation algorithm developed previously by Lee et al. [4]. Both algorithms are based on the new underwater visibility theory. However, only two water types (i.e., clear and turbid waters) were considered in the Jiang19 algorithm, and thus only two reference wavelengths (560 nm for clear waters and 754 nm for turbid water) were used to estimate a(λ) and b b (λ). Although previous studies have confirmed the success of using 560 nm for clear waters [29] and 754 nm for highly turbid waters [30] as reference wavelengths in total absorption coefficient estimations, the assumption of a pure water absorption coefficient dominating the total absorption coefficient at these two wavelengths will probably be invalid in moderately turbid and extremely turbid waters. This is because the absorption coefficients of phytoplankton and nonalgal particles at 560 and 754 nm will also be high in moderately and extremely turbid waters [31]. Therefore, the applicability of the Jiang19 algorithm to different optical water types is still a challenge. It is thus necessary to revisit the Jiang19 algorithm and improve its water type classification, reference wavelength selection, and water-type-corresponding QAA application to estimate more accurate Z SD values in waters with different optical properties.
Consequently, the objectives of our study were to: (1) develop a new algorithm for estimating Z SD more accurately in waters with different optical properties by classifying more water types and by selecting a more appropriate reference wavelength and corresponding QAA for each water type; and (2) evaluate the performance of the developed Z SD estimation algorithm using synthetic, in situ, and medium resolution imaging spectrometer (MERIS) data.

In Situ Data Collection
We collected 305 in situ measured hyperspectral remote sensing reflectance (R rs (λ)) and Z SD data pairs from 20 lakes and Tokyo Bay in Japan (Figure 1, hereafter referred to as the in situ dataset). The Z SD values were measured with a 30 cm diameter white Secchi disk. The Z SD data ranged from 0.3 to 16.4 m and covered clear to highly turbid waters. The in situ measured R rs (λ) values were obtained through the above-water approach. This approach measures the radiance of the skylight (L s ), the total upwelling radiance from the water (L t ), and the radiance from a standard gray board (L g ) using a FieldSpec ® HandHeld spectroradiometer (ASD, Boulder, CO, USA) with a sensor zenith angle of 40 • and an azimuth angle of 135 • from the sun [32]. All measurements were carried out between 9:30 to 14:00 local time (three measurements were taken between 14:00 and 16:00). Then, the R rs was calculated as follows: where ρ is the water surface reflectance factor (0.028 when the wind speed is less than 5 m/s) [32], R g is the reflectance of the gray board, and ∆ is the residual reflected skylight calculated using a method proposed by Jiang et al. [33]. All R rs (λ) spectra were then converted to MERIS bands based on the MERIS spectral response functions. This dataset was used to evaluate the performance of the developed Z SD estimation algorithm.
where is the water surface reflectance factor (0.028 when the wind speed is less than 5 m/s) [32], Rg is the reflectance of the gray board, and ∆ is the residual reflected skylight calculated using a method proposed by Jiang et al. [33]. All (λ) spectra were then converted to MERIS bands based on the MERIS spectral response functions. This dataset was used to evaluate the performance of the developed estimation algorithm. To further evaluate the estimated values from MERIS images, we also acquired in situ measured values at 10 stations in Lake Kasumigaura (Figure 1b) between January 2003 and March 2012 from the Lake Kasumigaura database, which were collected by the National Institute for Environmental Studies (NIES), Japan [34].

Synthetic Data Collection and Generation
In this study, we collected one synthetic dataset from Jiang et al. [31], which is called Synthetic Dataset I (N = 91,287) in Table 1. Synthetic Dataset I contains pairs of (λ) spectra as well as total absorption coefficient (a( )) and total backscattering coefficient (bb(λ)) values. The (λ) spectra were generated from the simulated a( ) and bb(λ) values by using a bio-optical model proposed by Gordon et al. [35] and Lee et al. [29] with assumptions of a nadir viewing angle and optically deep waters. For Synthetic Dataset I, the chlorophyll-a concentration (Cchl), tripton concentration (Ctr), and CDOM absorption coefficient at 440 nm (aCDOM(440)) were varied with different intervals between 0.01-1000 mg/m 3 , 0.01-1000 g/m 3 , and 0.01-5 m −1 , respectively. More details on Synthetic Dataset I generation can be found in Jiang et al. [31].
We then generated two other synthetic datasets to help with the development and evaluation of the algorithms. Synthetic Dataset II contains the diffuse attenuation coefficient (Kd(λ)) values, which were generated from the simulated a( ) and bb(λ) values in Synthetic Dataset I by using the following semianalytical equation proposed by Lee et al. [36,37]: where θ is the solar zenith angle and is the ratio of (backscattering coefficient of pure water [38]) to . To further evaluate the estimated Z SD values from MERIS images, we also acquired in situ measured Z SD values at 10 stations in Lake Kasumigaura (Figure 1b) between January 2003 and March 2012 from the Lake Kasumigaura database, which were collected by the National Institute for Environmental Studies (NIES), Japan [34].

Synthetic Data Collection and Generation
In this study, we collected one synthetic dataset from Jiang et al. [31], which is called Synthetic Dataset I (N = 91,287) in Table 1. Synthetic Dataset I contains pairs of R rs (λ) spectra as well as total absorption coefficient (a(λ)) and total backscattering coefficient (b b (λ)) values. The R rs (λ) spectra were generated from the simulated a(λ) and b b (λ) values by using a bio-optical model proposed by Gordon et al. [35] and Lee et al. [29] with assumptions of a nadir viewing angle and optically deep waters. For Synthetic Dataset I, the chlorophylla concentration (C chl ), tripton concentration (C tr ), and CDOM absorption coefficient at 440 nm (a CDOM (440)) were varied with different intervals between 0.01-1000 mg/m 3 , 0.01-1000 g/m 3 , and 0.01-5 m −1 , respectively. More details on Synthetic Dataset I generation can be found in Jiang et al. [31]. We then generated two other synthetic datasets to help with the development and evaluation of the algorithms. Synthetic Dataset II contains the diffuse attenuation coefficient (K d (λ)) values, which were generated from the simulated a(λ) and b b (λ) values in Synthetic Dataset I by using the following semianalytical equation proposed by Lee et al. [36,37]: where θ is the solar zenith angle and η w is the ratio of b bw (backscattering coefficient of pure water [38]) to b b . Synthetic Dataset III consists of simulated Z SD values. These Z SD values were generated from the simulated K d (λ) values (Synthetic Dataset II) and the corresponding R rs (λ) values (Synthetic Dataset I) using the following semianalytical equation developed by Jiang et al. [21]: value among the visible wavelengths, R PC rs is the corresponding R rs at the wavelength with the minimum K d (λ), C r t is the contrast threshold for sighting a white disk (i.e., 0.013 sr −1 ), RI is the refractive index value of water (1.34, [39]), K T is the diffuse attenuation coefficient of upwelling radiance at the wavelength with the minimum K d (λ), θ is the solar zenith angle, and µ is defined as b b /(a+b b ). A summary of all synthetic datasets is listed in Table 1.

Satellite Data Collection and Processing
In our study, the MERIS Level-1B data between 2002 and 2012 for Lake Kasumigaura were collected from the European Space Agency (ESA, https://merisfrs-merci-ds.eo.esa. int, accessed on 4 February 2018). We selected MERIS data due to their good spatial (300 × 300 m) and spectral (15 bands in visible and near-infrared domains) characteristics. Atmospheric correction was performed using the Case-2 Regional Processor in BEAM 5.0, and clouds and cloud shadows were identified using the IdePix module in SNAP. We then masked out the pixels with clouds, cloud buffers, cloud shadows, and failed atmospheric correction. Finally, we compiled 19 matchups from Lake Kasumigaura by matching the acquisition times of in situ measured Z SD values and satellite images (acquired on the same day). These matchups were used to further evaluate the performance of the developed Z SD estimation algorithm. In addition, the averages of the Z SD values derived from 3-by-3 pixels were used as satellite-estimated values.

The Original Jiang19 Algorithm
The original Jiang19 algorithm contains three main steps. First, a QAA hybrid is developed to retrieve a(λ) and b b (λ) from R rs (λ) spectra. In the QAA hybrid, a maximum chlorophyll index (MCI) proposed by Gower et al. [40] is used to switch QAA_V5 [29] and QAA_T (i.e., QAA_Turbid in [30]). In detail, if MCI <= 0.0016 sr −1 , 560 nm is used as the reference wavelength and thus QAA_V5 is selected to estimate a(λ) and b b (λ) for clear waters; otherwise, 754 nm is used as the reference wavelength and thus QAA_T is selected to estimate a(λ) and b b (λ) for turbid waters.
Then, K d (λ) is estimated from a(λ) and b b (λ) using Equation (2) as the second step, and Z SD is estimated from the minimum K d (λ) in the visible domain (e.g., 443 nm, 490 nm, 510 nm, 560 nm, 620 nm, 665 nm) and the corresponding R rs (λ) using Equation (3) (the third step).

Development of a New Z SD Estimation Algorithm
Since previous studies have confirmed that semianalytical Equations (2) and (3) are robust in waters with different optical properties if accurate a(λ) and b b (λ) are provided (e.g., [4,21,22,36,41]), our efforts were focused on how to obtain more accurate a(λ) and b b (λ) values from R rs (λ) spectra. A flowchart of the developed Z SD estimation algorithm is shown in Figure 2. Comparing to the Jiang19 algorithm, four improvements were carried out in this study, which are detailed sequentially in the following paragraphs. robust in waters with different optical properties if accurate ( ) and ( ) are provided (e.g., [4,21,22,36,41]), our efforts were focused on how to obtain more accurate ( ) and ( ) values from (λ) spectra. A flowchart of the developed estimation algorithm is shown in Figure 2. Comparing to the Jiang19 algorithm, four improvements were carried out in this study, which are detailed sequentially in the following paragraphs. First, we adopted a recently proposed water type classification algorithm that can classify all waters into four types, clear (Type I), moderately turbid (Type II), highly turbid (Type III), and extremely turbid (Type IV), by comparing the values of at 490, 560, 620, and 754 nm [31]. This approach allowed us to more reasonably assume that the pure water absorption coefficient dominated the total absorption coefficients at the reference wavelengths (i.e., 560 nm for Types I & II, 754 nm for Type III, and 865 nm for Type IV) [29,31,42]. The water type classification algorithm is described as follows: Second, we selected a different reference wavelength and corresponding QAA for each water type to retrieve (λ 0 ) and (λ 0 ) values at the reference wavelength (λ 0 ). For Type I waters, we selected 560 nm as the reference wavelength and then used an empirical equation in QAA_V5 to estimate a(560) due to its good performance in clear water [29,[43][44][45]. The Equations are: First, we adopted a recently proposed water type classification algorithm that can classify all waters into four types, clear (Type I), moderately turbid (Type II), highly turbid (Type III), and extremely turbid (Type IV), by comparing the values of R rs at 490, 560, 620, and 754 nm [31]. This approach allowed us to more reasonably assume that the pure water absorption coefficient dominated the total absorption coefficients at the reference wavelengths (i.e., 560 nm for Types I & II, 754 nm for Type III, and 865 nm for Type IV) [29,31,42]. The water type classification algorithm is described as follows: If R rs (490) > R rs (560), then Type I waters, Else, if R rs (490) > R rs (620), then Type II waters, Else, if R rs (754) > R rs (490) and R rs (754) > 0.01 sr −1 , then Type IV waters, Else, Type III waters.
(4) Second, we selected a different reference wavelength and corresponding QAA for each water type to retrieve a(λ 0 ) and b b (λ 0 ) values at the reference wavelength (λ 0 ). For Type I waters, we selected 560 nm as the reference wavelength and then used an empirical equation in QAA_V5 to estimate a(560) due to its good performance in clear water [29,[43][44][45]. The Equations are: a(560) = a w (560) + 10 −1.146−1.366x−0.469x 2 (5) x = log 10 ( r rs (443) + r rs (490) r rs (560) + 5 r rs (665) r rs (490) r rs (665) ) (6) where r rs is the remote sensing reflectance just below the water surface, a w (560) is the absorption coefficient of pure water at 560 nm, and a(560) is the total absorption coefficient at 560 nm. For Type II waters, we selected the same wavelength (i.e., 560 nm) as the reference wavelength but used a different empirical equation from QAA_TM (developed by Curtarelli et al. [42]) to estimate a(560). We adopted QAA_TM because of its good perfor- where R rs (560), R rs (665), and R rs (709) are the remote sensing reflectance just above the water surface at 560, 665, and 709 nm, respectively. However, since Equation (7) used a longer wavelength (709 nm) than those used in Equation (6), and considering the effects of a low signal-to-noise ratio in a practical application, we followed Lee et al.'s [46] suggestion to use QAA_V5 for the waters with R rs (665) < 0.0015 sr −1 (i.e., we still used Equations (5) and (6)). For Type III waters, 754 nm was selected as the reference wavelength, and the equations from QAA_T were adopted to estimate a(754) (i.e., a(754) ≈ a w (754) [30]). Similar to the consideration for Type II waters, we used a threshold of R rs (754) < 0.0015 sr −1 and used QAA_TM for those waters with lower R rs (754). This approach allowed us to avoid the effects of a low signal-to-noise ratio in a practical application.
For Type IV waters, 865 nm was selected as the reference wavelength and the assumption of a(865) ≈ a w (865) was adopted, as suggested by Jiang et al. [31]. After estimation of a(λ 0 ) at the reference wavelength (λ 0 ), the particulate backscattering coefficient (b bp (λ 0 )) could be calculated by the following equation [21,29]: Here, b bw (λ 0 ) is the backscattering coefficient of pure water at the reference wavelength.
In the third effort, we employed two empirical models (for Types I and II, respectively) and one semianalytical model (for both Types III and IV waters), which can represent reasonable shapes for particulate backscattering coefficients (b bp (λ)), thereby allowing us to retrieve more accurate a(λ) and b b (λ) values at a given wavelength for each water type. For Type I waters, the spectral slope Y was calculated using an empirical Equation in Lee et al. [29]: For Type II waters, the spectral slope Y was calculated using a different empirical Equation in Curtarelli et al. [42]: Then a(λ) and b b (λ) at the given wavelength (λ) were calculated using the following Equations [29]: To summarize the second and third efforts, we selected QAA_V5 for Type I waters, QAA_TM for Type II waters, QAA_T with a reference wavelength of 754 nm (hereafter In the fourth effort, we carried out a statistical analysis using Synthetic Dataset II to investigate the possibility of a minimum K d (λ) for each water type. Figure 3 shows the number of synthetic water samples with the minimum K d (λ) occurring at each visible wavelength for each water type. From the figure, we can observe that the minimum K d (λ) almost always occurred at 490 and 560 nm for Type I waters, 560 nm for Type II waters, 560, 620, and 665 nm for Type III waters, and at 665 nm for Type IV waters. Accordingly, we considered the relating wavelengths to find the minimum K d (λ) for each water type. This effort avoided the selection of inappropriate wavelengths due to estimation errors in the previous steps (e.g., due to uncertainty in the estimations of Y values using Equations (9)-(11).
To summarize the second and third efforts, we selected QAA_V5 for Type I wat QAA_TM for Type II waters, QAA_T with a reference wavelength of 754 nm (herea renamed QAA_T754) for Type III waters, and QAA_T with a reference wavelength of nm (renamed QAA_T865) for Type IV waters.
In the fourth effort, we carried out a statistical analysis using Synthetic Dataset I investigate the possibility of a minimum Kd(λ) for each water type. Figure 3 shows number of synthetic water samples with the minimum Kd(λ) occurring at each vis wavelength for each water type. From the figure, we can observe that the minimum K almost always occurred at 490 and 560 nm for Type I waters, 560 nm for Type II wat 560, 620, and 665 nm for Type III waters, and at 665 nm for Type IV waters. Accordin we considered the relating wavelengths to find the minimum Kd(λ) for each water ty This effort avoided the selection of inappropriate wavelengths due to estimation error the previous steps (e.g., due to uncertainty in the estimations of Y values using Equati (9)-(11).

Accuracy Assessment
We used the root mean square error (RMSE) in a 10 unit, the mean absolute p centage error (MAPE), bias, and the Nash-Sutcliffe efficiency (NSE) to evaluate the p formance of the developed algorithm. The Equations are as follows: Figure 3. Statistical analysis using Synthetic Dataset II to identify the possibility of a minimum K d (λ) for each water type.

Accuracy Assessment
We used the root mean square error (RMSE) in a log 10 unit, the mean absolute percentage error (MAPE), bias, and the Nash-Sutcliffe efficiency (NSE) to evaluate the performance of the developed algorithm. The Equations are as follows: where X estimated is the estimated Z SD value, X measured is the corresponding in situ measured or known Z SD value, X measured is the mean of in situ measured or known Z SD values, and N is the number of data pairs. The regression results between the estimated and the known or in situ measured Z SD values (i.e., R 2 , slope, and intercept) were also used to help evaluate algorithm performance. Figure 4 compares known Z SD values (i.e., Synthetic Dataset III) and estimated Z SD values from Synthetic Dataset I using the Jiang19 algorithm and the algorithm developed in our study, respectively. From the figure, we observed many overestimations around the boundary between Type I and Type II waters as well as around the boundary between Type II and Type III waters when the Jiang19 algorithm was used (Figure 4a Figure 4a,b are also represented according to the QAA used (Figure 4c,d) and to the selected wavelength with the minimum K d (λ) (Figure 4e,f). These figures were used to discuss how the proposed estimation algorithm worked (see Section 4). Figure 5 compares the in situ measured and estimated Z SD values. The estimated Z SD values were obtained from in situ measured R rs (λ) spectra using the Jiang19 algorithm or the algorithm developed in this study. The results show that the new Z SD estimation algorithm outperformed the Jiang19 algorithm with a reduced RMSE (from 0.18 to 0.17 in log 10 unit) and MAPE (from 32% to 27%) as well as increased NSE (from 0.90 to 0.92) and R 2 values (from 0.90 to 0.93). In particular, we observed that three points with large errors due to the use of the Jiang19 algorithm (blue circles with red boxes in Figure 5a) were improved substantially by using the new algorithm ( Figure 5b). However, compared to the use of the Jiang19 algorithm, the bias due to the use of the new algorithm increased slightly, from a 13% underestimation to an 18% underestimation. In addition, unlike the results shown in Figure 4, systematic overestimation was not found in the application of in situ measured R rs (λ) spectra to estimate Z SD , regardless of the use of the Jiang 19 algorithm or the new algorithm. Type IV waters were not found from our in situ dataset. As in Figure 4, the results shown in Figure 5a,b are also represented according to the QAA used (Figure 5c,d) and to the selected wavelength with the minimum K d (λ) (Figure 5e,f). These figures were also used to discuss how the proposed Z SD estimation algorithm worked (see Section 4). units) and MAPE (90% to 68%) as well as increased values of R 2 (0.95 to 0.97) and NSE (0.85 to 0.90). However, a systematic overestimation was found in the new algorithm (bias of 60%). The results shown in Figure 4a,b are also represented according to the QAA used (Figure 4c,d) and to the selected wavelength with the minimum Kd(λ) (Figure 4e,f). These figures were used to discuss how the proposed estimation algorithm worked (see Section 4).   Figure 5 compares the in situ measured and estimated values. The estimated values were obtained from in situ measured (λ) spectra using the Jiang19 algorithm or the algorithm developed in this study. The results show that the new estimation algorithm outperformed the Jiang19 algorithm with a reduced RMSE (from 0.18 to 0.17 in 10 unit) and MAPE (from 32% to 27%) as well as increased NSE (from 0.90 to 0.92) and R 2 values (from 0.90 to 0.93). In particular, we observed that three points with large errors due to the use of the Jiang19 algorithm (blue circles with red boxes in Figure

Validation Using MERIS Data
We further compared the estimated values from MERIS images using the newly developed algorithm to the in situ measured values ( Figure 6). The in situ measured

Validation Using MERIS Data
We further compared the estimated Z SD values from MERIS images using the newly developed algorithm to the in situ measured Z SD values ( Figure 6). The in situ measured Z SD values were obtained from the Kasumigaura database, and all matchups were classified to Type III waters based on their MERIS-derived R rs (λ) spectra. From the figure, the MERISderived Z SD values were consistent with the in situ measured Z SD values, with a MAPE value of 15%, an RMSE value of 0.08 (in log 10 unit), and a bias value of −9%. The R 2 value of 0.57 and the slope of 0.82 also indicate the good performance of the new Z SD estimation algorithm. Since the Jiang19 algorithm also selected the same reference band (band 12 of MERIS) and the corresponding QAA (QAA_T754) for all matchups, identical results were obtained when we used the Jiang19 algorithm (data not shown). Systematic overestimation was not found in the application of MERIS data.
Remote Sens. 2022, 14, x FOR PEER REVIEW 11 values were obtained from the Kasumigaura database, and all matchups were cl fied to Type III waters based on their MERIS-derived (λ) spectra. From the figure MERIS-derived values were consistent with the in situ measured values, w a MAPE value of 15%, an RMSE value of 0.08 (in 10 unit), and a bias value of −9%. R 2 value of 0.57 and the slope of 0.82 also indicate the good performance of the new estimation algorithm. Since the Jiang19 algorithm also selected the same reference b (band 12 of MERIS) and the corresponding QAA (QAA_T754) for all matchups, iden results were obtained when we used the Jiang19 algorithm (data not shown). System overestimation was not found in the application of MERIS data.

Discussion
We developed a new algorithm to estimate from MERIS data. Since this a rithm is based on several existing semianalytical algorithms [21,[29][30][31]42], it can be plied to different optical water types, from clear to extremely turbid. The validation res from the synthetic dataset (with a wide range of inherent optical properties, va ranging from 0.01 to 44.68 m) and the in situ dataset (from 21 Japanese waters, va ranging from 0.3 to 16.4 m) provide strong evidence to confirm the applicability of developed estimation algorithm. However, a systematic overestimation of values was observed from the val tion results of the use of the synthetic dataset (Figure 4b). This is because the assump for a( ) at the reference wavelengths resulted in an underestimation of a( ) values thus an underestimation of bb(λ) values at the same wavelength. These underestimat propagated to the final estimated values. However, no overestimation of va was observed when the developed algorithm was applied to the in situ and satellite (Figures 5b and 6). This is probably because the measurement errors contained in th situ and satellite data hid (or offset) the systematic overestimation due to the mechan of the developed algorithm. Therefore, further correction to mitigate the systematic o estimation was not carried out in this study.
Optical water type classification is very important because it helps to both cla relationships between different properties inside a certain class and quantify the varia between classes [25,47]. Recently, interest has grown in the application of the optical w

Discussion
We developed a new algorithm to estimate Z SD from MERIS data. Since this algorithm is based on several existing semianalytical algorithms [21,[29][30][31]42], it can be applied to different optical water types, from clear to extremely turbid. The validation results from the synthetic dataset (with a wide range of inherent optical properties, Z SD values ranging from 0.01 to 44.68 m) and the in situ dataset (from 21 Japanese waters, Z SD values ranging from 0.3 to 16.4 m) provide strong evidence to confirm the applicability of the developed Z SD estimation algorithm.
However, a systematic overestimation of Z SD values was observed from the validation results of the use of the synthetic dataset (Figure 4b). This is because the assumption for a(λ) at the reference wavelengths resulted in an underestimation of a(λ) values and thus an underestimation of b b (λ) values at the same wavelength. These underestimations propagated to the final estimated Z SD values. However, no overestimation of Z SD values was observed when the developed algorithm was applied to the in situ and satellite data (Figures 5b and 6). This is probably because the measurement errors contained in the in situ and satellite data hid (or offset) the systematic overestimation due to the mechanism of the developed algorithm. Therefore, further correction to mitigate the systematic overestimation was not carried out in this study.
Optical water type classification is very important because it helps to both clarify relationships between different properties inside a certain class and quantify the variation between classes [25,47]. Recently, interest has grown in the application of the optical water type classification in the remote sensing of ocean color (e.g., [21,31,[48][49][50][51]). In this study, we found that water type classification is a guide to select the most appropriate reference wavelength and corresponding QAA for the more accurate estimation of a(λ) and b b (λ) in waters with different optical properties. Therefore, we adopted the latest classification algorithm to classify all waters into four types. In contrast, the Jiang19 algorithm classified waters into only two types (clear and turbid) and used two corresponding QAAs (QAA_V5 and QAA_T754), thus generating large errors around boundaries of different water types (Figure 4a). For example, in Figure 4c, the Jiang19 algorithm classified some data samples as the clear water type and used QAA_V5 to estimate a(λ) and b b (λ); thus, large errors occurred in the Z SD estimations (yellow points in Figure 4c). In contrast, we classified these data samples as Water Type II (moderately turbid waters), which enabled us to use QAA_TM for a(λ) and b b (λ) estimations and in turn improved the Z SD estimations (blue points in Figure 4d). Similar results can also be observed from Figure 5c. For example, the Jiang19 algorithm classified the yellow point with red box 3 as clear water and two green points with red boxes 1 and 2 as turbid water, and thus used QAA_V5 and QAA_T754 to estimate a(λ) and b b (λ), resulting finally in large errors in the Z SD estimations. These three Z SD estimations were improved by classifying them as Type II waters and thus using QAA_TM to replace QAA_V5 and QAA_T754 (Figure 5d).
In addition, Jiang et al. [31] suggested the use of QAA_V6 to estimate a(λ) and b b (λ) at the reference wavelength of 665 nm for Type II waters. However, several previous studies have reported that QAA_V6 often failed in turbid waters [6,21,30,42]. Therefore, we used QAA_TM instead of QAA_V6 in this study. Our results also show that QAA_TM performed better than QAA_V6 for Type II waters (data not shown).
We improved the accuracy of a(λ) and b b (λ) estimation by classifying waters into four types (effort 1), and then adopted different QAA to estimate a(λ) and b b (λ) for each water type (efforts 2 and 3). These three efforts addressed most of the outliers caused by using the Jiang19 algorithm. However, some outliers remained after the above three efforts. This is because 620 or 665 nm was sometimes identified as the wavelength with the minimum K d even for some Type I and II waters due to uncertainty in the Y estimation model [30]. The identification of 620 or 665 nm is obviously inappropriate as the wavelength with the minimum K d in Type I and II waters should be at 490 or 560 nm based on the statistical results in Figure 3. Therefore, in this study, we also limited the wavelengths used to estimate the minimum K d (λ) in each water type. This effort can make the wavelengths for estimating minimum K d (λ) close to the reference wavelength in each water type and thus can reduce errors due to uncertainty in the Y estimation model as seen in Figure 4e,f (not clearly observed in Figure 5e,f).

Conclusions
In this study, we developed a semianalytical algorithm for estimating Z SD values from remote sensing data. The developed Z SD estimation algorithm considers different optical properties in four water types, thus allowing the selection of the most optimal reference wavelength and QAA for more accurate estimation of total absorption and backscattering coefficients and, in turn, improved Z SD estimations compared to the previous study. In addition, constraining the wavelength range of minimum K d (λ) in each water type contributed to the improvement of Z SD estimation accuracy. The validation results using synthetic and in situ data show that the developed algorithm is applicable to different optical water types.