Evaluation of Total Ozone Column from Multiple Satellite Measurements in the Antarctic Using the Brewer Spectrophotometer

: The ground-based ozone observation instrument, Brewer spectrophotometer (Brewer), was used to evaluate the quality of the total ozone column (TOC) produced by multiple polar-orbit satellite measurements at three stations in Antarctica (King Sejong, Jang Bogo, and Zhongshan stations). While all satellite TOCs showed high correlations with Brewer TOCs (R = ~0.8 to 0.9), there are some TOC differences among satellite data in austral spring, which is mainly attributed to the bias of Atmospheric Infrared Sounder (AIRS) TOC. The quality of satellite TOCs is consistent between Level 2 and 3 data, implying that “which satellite TOC is used” can induce larger uncertainty than “which spatial resolution is used” for the investigation of the Antarctic TOC pattern. Additionally, the quality of satellite TOC is regionally different (e.g., OMI TOC is a little higher at the King Sejong station, but lower at the Zhongshan station than the Brewer TOC). Thus, it seems necessary to consider the difference of multiple satellite data for better assessing the spatiotemporal pattern of Antarctic TOC.


Introduction
Since the early 1980s, when ozone depletion in the Antarctic stratosphere was first detected [1], the Antarctic stratospheric ozone hole has been continuously monitored and has emerged as a global environmental problem. Chlorofluorocarbons released by human activities act as a catalyst for ozone destruction, forming stratospheric ozone holes [2], inducing the increase in harmful solar radiation at the surface. The formation of the ozone hole also relates to the pattern of atmospheric dynamics and regional meteorology [3]. Particularly, several studies revealed that the stratospheric ozone variation is a major factor influencing the Southern Annular Mode (SAM), a type of large-scale atmospheric circulation in the southern hemisphere [4,5]. Changes in the SAM cause the variation in the mid-latitude jet flow, which not only influences the tropospheric-stratospheric mass exchange [6,7] but also the Antarctic sea ice distribution [8] and even temperature and precipitation in Australia [9,10]. Thus, it is very important to continuously monitor the stratospheric ozone pattern as its depletion affects both the Antarctic stratosphere and the lower atmosphere in Antarctica, Australia, and the southern hemisphere [11].
The Brewer spectrophotometer (hereafter, Brewer) is a ground-based remote sensing instrument widely used for stratospheric ozone monitoring, based on the measurement of total ozone column (TOC) from direct sun and zenith sky techniques. Previous studies reported that Brewer is a high-performance stratospheric ozone observation device with an accuracy of 1% error in direct sun mode and less than 3%-5% error in zenith sky scattered mode [12]. The Brewer has also provided reliable data at various ozone monitoring stations in Antarctica [13], but the ground-based measurement still has spatial limitations. The utilization of satellite data can be a solution to overcome this limitation, which allows the coverage of large areas. In particular, the usage of satellite TOC values enables us to examine the relationship to the regional meteorology in the broad area [14].
Fortunately, it is possible to sufficiently track changes in the amount of stratospheric ozone using satellites; several satellite observations have been carried out to date. Satellite observation data must be verified through comparison with ground-based instrument data to ensure quality and reliability [15]. A number of ground-based instruments have been used for the validation of satellite TOC observations, e.g., Dobson spectrophotometer [16][17][18]; Brewer spectrophotometer [19][20][21]; SAOZ [22][23][24]; Pandora photometer [25][26][27]; and ozonesonde [28][29][30]. Inter-comparison between the data collected by ground-based instruments and satellite data to analyze the changes of ozone amount has also been performed [31][32][33][34]. As shown above, the Brewer is commonly used for inter-comparison studies [35,36]. However, the verification of satellite data has not been performed much in the polar region because of the operational difficulties in ground-based observation in the harsh cold condition. While a few related studies performed recently [37,38], further validations are much required. Thus, there is a need to secure new data for the comparative verification.
In this study, we verified the TOCs observed from multiple satellites using several Brewer TOCs. First, we used Brewer data at two Korean stations (King Sejong and Jang Bogo station), which have not been extensively considered in previous studies. Ozone monitoring record is especially rare in the Antarctic region nearby the Ross Sea where the Jang Bogo station is located. Hence, it is believed that ozone monitoring at the Jang Bogo station could provide valuable insights associated with the analysis of the Antarctic stratospheric ozone. Second, in order to cover the whole Antarctic region evenly, we also used the Brewer data at the Zhongshan station, which has been often used for the Antarctic TOC analysis [39][40][41][42]. Using these Brewer TOC data, we would evaluate the quality of TOC from the measurement of multiple satellite instruments: The Ozone Monitoring Instrument (OMI), TROPOspheric Monitoring Instrument (TROPOMI), Global Ozone Monitoring Experiment-2 (GOME-2), Ozone Mapping and Profiler Suite (OMPS), and Atmospheric Infrared Sounder (AIRS). We mainly examined the correlations of TOC values between the Brewer and satellite observations, then also assessed other statistic values. Through this research, we aimed to show the necessity of using various TOC measurements simultaneously for the better diagnosis of target situations.

Ground-Based TOC Measurement Using the Brewer
The Brewer observes the direct sun (DS mode) and zenith sky scattered (ZS mode) sunlight following a set operating schedule. This radiative observation is applied to retrieve the quantity of some trace gases in the atmosphere, e.g., ozone [43,44]. The Brewer consists of a single diffraction grating, five light emission slits, and a focal plane, and the absorption wavelengths of the slits used are 306.3, 310.1, 313.5, 316.0, and 320.0 nm, which are optimized to observe O 3 and SO 2 . Compared to the ZS mode observation, it is known that the DS mode observation yields more accurate values; numerically, the errors of DS and ZS observation are less than 1% and 3%, respectively [44,45].
In this study, we used the Brewer TOC at three Antarctic stations: King Sejong, Jang Bogo, and Zhongshan stations ( Figure 1). The Korea Polar Research Institute installed Brewers at the King Sejong station in 1996 (MK IV model) and the Jang Bogo station in 2014 (MK III model). In the Zhongshan station, built in 1989 as the second Antarctic science base of China, the MK IV Brewer was installed in 1993 and operated until 2010. After the end of MK IV operation, the TOC has been monitored using the MK III Brewer, which was newly installed in 2009 [46]. All available Brewer TOC data here can be obtained from the World Ozone and Ultraviolet Radiation Data Center (WOUDC), one of six world data centers operated by the Global Atmosphere Watch of the World Meteorological Organization. All these data are summarized in Table 1. the World Ozone and Ultraviolet Radiation Data Center (WOUDC), one of six world data centers operated by the Global Atmosphere Watch of the World Meteorological Organization. All these data are summarized in Table 1. The observation periods for TOC Brewer data at the King Sejong, Jang Bogo, and Zhongshan stations were from January 23, 1998, to February 29, 2020 (excluding 2015 and 2017 due to a mechanical failure), from August 05, 2016 to March 10, 2020, and from March 16, 2000 to November 30, 2019 (excluding 2008-2012 and 2015 due to mechanical failure), respectively. The two Korean Antarctic stations provide daily average TOC values for both DS and ZS mode observations, while the Zhongshan station provides either a TOC from the DS or ZS mode observation as the daily representative (i.e., DS and ZS mode TOC values are not separately provided. When the TOC from the DS mode was available, TOC from the ZS mode was not provided). Despite this difference, the correlation coefficients between satellite and Brewer data for this station were similar with those of previous studies [41] and were therefore considered acceptable.  the official process of the International Ozone Services, Inc. (IOS). Brewer at the Jang Bogo station performed an instrument calibration in 2014 following the official process of the Kipp and Zonen. The record of Brewer calibration at the Zhongshan station was provided in [46], addressing that main calibration was performed in Canada during 2000, and intercomparison between two different Brewer instruments (MK III and MK IV) at the same site from 2009 to 2010. Based on these records, we decided that observations of three Brewers used in this study were qualified and proper for the comparison with satellite TOCs.

Satellite TOC measurement 2.2.1. OMI/Aura
The OMI sensor onboard the Aura satellite is the next generation ozone observation sensor following the Total Ozone Mapping Spectrometer (TOMS) and was developed using the TOMS version 8 algorithm [47] to observe the TOC and the vertical ozone profile. The main components of observation are O 3 , NO 3 , SO 2 , OCIO, BrO and aerosols. As a solar synchronous orbital satellite and a polar orbital satellite, OMI passes the equator when observed with the sun's reflection angle at 13:45, and observes the entire Earth in one day by gradually shifting longitude according to the Earth's rotation. The OMI sensor uses two ultraviolet channels (UV1: 270-314 nm, UV2: 360-380 nm) and one visible light channel (350-500 nm). The average spectral resolutions for full width at half maximum in the ultraviolet channels are 0.42 nm and 0.45 nm, respectively, and that of the visible light channel is 0.63 nm [48]. The OMI Level 2 TOC product [49] used in this study has a cross-track field of view (FOV) angle of 114 • and an observation width of 2600 km, which is sufficient to observe the entire earth in one day. In addition, the OMI sensor has a spatial resolution of 13 km × 24 km by the nadir scanning method and performs an observation in a zoom mode of 13 km × 12 km. We also used the OMI Level 3 product data [50] in this study, which have a spatial resolution of 1.0 • × 1.0 • . This was produced from Level 2G data (0.25 • × 0.25 • ), obtained by collecting the daily Level 2 data and taking quality-weighted average using grid data adjacent to the observed location.
Compared to other observational spectroscopic sensors, the OMI sensor has higher accuracy in detecting the chemical composition of clouds and the troposphere with improved resolution. The quality flag value is provided with TOC values; The value of 0 corresponding to the "good sample", which was used to perform the comparison [50]. Previous studies have reported that the error of the OMI TOC is~2% or 6 Dobson units (DU) and is~25% of the observed value of tropospheric ozone [51]. In addition, as the solar zenith angle increases, the TOC in the upper troposphere may be overestimated due to several reasons: noise in the observation signal, temperature deviation between regions, decrease in sensitivity of the ozone signal, and the curvature effect of the Earth's atmosphere. For these reasons, the expected values of the root mean square error (RMSE) for the TOC profile are estimated to be~1.5%, 3%, and 5% for ≤70 • , ≤82 • , and ≤85 • , respectively [52].

TROPOMI/S-5P
The TROPOMI onboard the Sentinel-5 Precursor (S-5P) satellite was launched on October 13, 2017. The satellite maintains an altitude of 824 km and passes through the equator at 13:30 local solar time. It travels 7 km/s and uses a two-dimensional observation device to have across-track and along-track observation resolutions of 3.5-15 km and 7 km, respectively with an observation width (swath) of 2600 km. It divides the wavelength of the input light into four sections using a grating spectrophotometer (UV-VIS-NIR-SWIR, UVNS) and specifies the wavelength range for each absorption band (UV and visible: 270-500 nm, near-infrared (NIR): 675-775 nm, short-wave infrared (SWIR): 2305-2385 nm) to perform the observation [53]. Compared to the absorption wavelength range of OMI sensors, the range of the TROPOMI is wider in the NIR and SWIR regions [54].
The spatial resolution of SWIR is 7 km × 7 km and that of others is 7 km × 3.5 km. Similar to the OMI data, the observed atmospheric chemicals from the TROPOMI measurement are O 3 , NO 2 , SO 2 , CO, CH 4 , aerosols and also clouds [53]. S-5P Level 2 data were provided at a resolution of 7 km × 3.5 km until December 27, 2019, and 5 km × 3.5 km thereafter. The TROPOMI algorithm used to calculate the total tropospheric ozone in this study is based on the one-step non-time critical or offline (OFFL) dataset (S5P_TO3_GODFIT) [55,56]. The value of quality flags was also applied for the TROPOMI Level 2 data. Following the product user's guide [53], we only utilized the TOC having flag values > 50 among the flag values between 0 (no data) and 100 (full quality data). According to the results of previous studies on the observation accuracy and precision of TROPOMI for the global region, the error was 3% and 1% for the TOC, 10% and 5% for the ozone profile, and 25% and 10% for the total amount of tropospheric ozone, respectively [57].

OMPS/Suomi NPP
The OMPS sensor is onboard a polar orbit satellite, Suomi NPP (National Polar Orbiting Partnership) and performs one limb observation and two nadir observations. Among them, the nadir mapper system monitors the TOC while covering the entire sunlit orbit throughout the day; the nadir profiler and limb profiler observe the vertical ozone profiles for the outer regions of each sunlit orbit [58]. The OMPS has a cross-track FOV of 110 • and a slit width of 0.27 • along the orbit. According to cross-track observation, 35 pieces which were captured in one observation have a vertical width of 3.35 • (50 km), and the resolution of along-track observation is 50 km. Concurrently, the TOC is observed by covering the observation width of 2800 km with nadir mapping for 7.6 s. Therefore, the resolutions of the OMPS Level 2 [59] swath and Level 3 [60] grid are 50 km × 50 km and 1 • × 1 • , respectively [58]. The total daily average ozone value was calculated using the TOMS version 7 algorithm [61]. The quality flags of the OMPS Level 2 output are provided in 16-bit format. Among them, the "good sample" value corresponding to the 0 value of the 0-bit was used. For the retrieval of the ozone amount, the wavelength regions with high ozone sensitivity (308.

AIRS/Aqua
The AIRS sensor is onboard the Aqua satellite, a solar synchronous orbiting satellite that has been observing the polar region since March 4, 2002. The output of Level 2 data [62] used in this study has a pixel resolution of 50 km × 50 km and Level 3 [63] has a grid resolution of 1 • × 1 • . AIRS sensor also provides the TOC data as a daily average value. One big difference from previous satellite observations is the usage of longer wavelengths. The AIRS sensor has 2378 absorption channels in the infrared (IR) wavelength range of 3.75-15.4 µm and four absorption channels in the visible wavelength range of 0.4-1.0 µm. Additionally, the Advanced Microwave Sounding Unit-A (AMSU-A) device onboard the Aqua satellite has 15 microwave (MW) channels. The TOC value was calculated by combining the observed radiations in these IR/MW wavelength ranges based on day (ascending) and night (descending) monitoring. Level 2 output used in the study was prepared using only the values obtained during the day. The range of the quality flag values for each calculated TOC is 0-2 [64], and we only used the qualified TOC values (quality flag = 0).

GOME-2/MetOp
The GOME-2 sensor, as an optical spectrometer, is composed of four observational unit sections in the wavelength range of 240-790 nm. The optical spectrometers are equipped with scanning mirrors for omnidirectional observation in the nadir and across-track directions to cover the entire polar region [65].
The GOME-2 sensors have been on board a set of sun-synchronous and polar-orbital satellites passing through the equator at 09:30 UTC: EUMETSAT's MetOp-A (launched in 2006), MetOp-B (launched in 2012) and MetOp-C (launched in 2018), respectively. Here, we used TOC from the measurements of GOME-2 onboard MetOp-A, retrieved from the GOME Data Processor (GDP) 4.8 algorithm. The GDP 4.8 algorithm is based on the typical optimized differential optical absorption spectroscopy (DOAS) method to convert the inclined observation value into a vertical value by applying an air mass factor (AMF) [66] for the retrieval of TOC and other trace gases (e.g., NO 2 , BrO, HCHO, SO 2 , etc.). For the TOC retrieval, the wavelength absorption range of the ozone absorption band (325-335 nm) was used for the ozone absorption band [67] with the 1.5%-2.0% of biases. GOME-2 measurements are maintained at an altitude of~820 km. The front scan of the GOME-2 consists of 24 pixels with a size of 80 km (across-track) × 40 km (along-track), and the rear scan consists of eight pixels with the same size as the front. The Level 2 data used in this study are available online at ftp://atmos.caf.dlr.de (accessed on 29 May 2020) and correspond to the front scan data, and the Level 3 product is provided in a grid format as daily data with the spatial resolution of 0.25 • × 0.25 • [67], which are available online at https://wdc.dlr.de/sensors/gome2/ (accessed on 29 May 2020). In addition, the GOME-2 covers an observation width of 1920 km for a total of 6 s. With the launch of the MetOp-B satellite equipped with the GOME-2 device in 2012, the across-track spatial resolution of the MetOp-A satellite data doubled around July 2013, but the observation width and ground pixel resolution reduced to 960 km and 40 km × 40 km, respectively [68]. For the GOME-2 data in this paper, we used the data after the resolution increased. For the qualified analysis of GOME-2 data, we used Level 2 TOC values only having the quality flag of 0, which implies to the "good sample" in the whole range of quality flags from 0 to 15.

Methodology
While the Brewers observe only certain locations, satellites move periodically and observe the entire Earth, resulting in lower spatiotemporal limitation. Since there is no point source of air pollutants around the Antarctic research stations, however, high resolution may not be strongly required for the TOC monitoring. In terms of satellite measurements, Level 2 data have the advantage of being very dense (i.e., both spatially by higher resolution and temporally by multiple measurements in a day), making it easier to observe local areas. Compared to Level 3 data having a lower resolution, a Level 2 product is usually considered preferential for the analysis. Nevertheless, Level 3 product also has been used significantly because it has better data access than the Level 2 product in providing the daily mean value of the entire planet on a given grid basis.
In this study, we compared the TOC values from multiple satellites at the three Antarctic stations separately to evaluate the performance of each satellite's monitoring. For example, the difference of spatial resolution (Figure 2) influences the difference related to the ground-based Brewer measurements. We even also compared the Level 3 (grid) values and the Level 2 (pixel) TOC values of the satellite measurements. As the TROPOMI sensor only provides Level 2 data, Level 3 data were not compared to ground-based observations. All comparisons in this study were based on the daily representative values and conducted by sampling two TOCs which are measured in the closest location and time.
For the comparison between the Brewer and satellite TOCs, temporally we used daily mean values. Brewer usually repeated many TOC measurements in a day automatically based on its monitoring schedule, but the daily mean TOC of the low-orbit satellite was only obtained based on one or two measurement in a day, implying that the temporal co-location between the Brewer and the satellite is not perfectly consistent. Spatially, the satellite TOC can be selected when its overpassing position (longitude and latitude) is located within a 100 km radius from each station, meaning another inconsistence. This spatiotemporal difference can contribute to the disagreement of TOC between the Brewer and satellite data. But here, we basically assume that the polar region has a relatively small variation of TOC naturally in a day and in a range of 100 km radius from each site because there is no source to induce the ozone variation at a small scale different from the lower latitudinal region, but we will consider this difference for the detail analyses of results if necessary. Remote Sens. 2021, 13, x FOR PEER REVIEW 7 of 20 For the comparison between the Brewer and satellite TOCs, temporally we used daily mean values. Brewer usually repeated many TOC measurements in a day automatically based on its monitoring schedule, but the daily mean TOC of the low-orbit satellite was only obtained based on one or two measurement in a day, implying that the temporal colocation between the Brewer and the satellite is not perfectly consistent. Spatially, the satellite TOC can be selected when its overpassing position (longitude and latitude) is located within a 100 km radius from each station, meaning another inconsistence. This spatiotemporal difference can contribute to the disagreement of TOC between the Brewer and satellite data. But here, we basically assume that the polar region has a relatively small variation of TOC naturally in a day and in a range of 100 km radius from each site because there is no source to induce the ozone variation at a small scale different from the lower latitudinal region, but we will consider this difference for the detail analyses of results if necessary.
We examined the TOC quality by comparing satellite observations according to the methods suggested in [41] to evaluate the performance of satellite data. The relationship between ground-based measurements (GBMs) and satellite observations (SATs) was examined using several statistics. We simply calculated the Pearson correlation coefficient (R) between the different TOC values to see how TOCs measured by different instruments are related each other. We also estimated other statistical indicators showing the extent of difference, such as the mean bias error (MBE), mean absolute bias error (MABE), and RMSE, which are dependent on the absolute differences (ADs). Notably, the MBE is an indicator of systematic error in measurement, which let us know to find the positive and negative bias showing the overestimation or underestimation of the comparative measurement. MABE and RMSE represent the mean pattern of absolute bias indicating the We examined the TOC quality by comparing satellite observations according to the methods suggested in [41] to evaluate the performance of satellite data. The relationship between ground-based measurements (GBMs) and satellite observations (SATs) was examined using several statistics. We simply calculated the Pearson correlation coefficient (R) between the different TOC values to see how TOCs measured by different instruments are related each other. We also estimated other statistical indicators showing the extent of difference, such as the mean bias error (MBE), mean absolute bias error (MABE), and RMSE, which are dependent on the absolute differences (ADs). Notably, the MBE is an indicator of systematic error in measurement, which let us know to find the positive and negative bias showing the overestimation or underestimation of the comparative measurement. MABE and RMSE represent the mean pattern of absolute bias indicating the agreement between the comparative measurements. The formula to obtain these indicators are described as the following Equations (1)-(4). Here, i means each compared TOC that exists simultaneously and N means the total data number for the target period:

Results and Discussion
At first, we examined the pattern of scatterplots and the extent of linear correlations (R) among the Brewer and the satellite Level 2 and Level 3 TOC values at the King Sejong (Figure 3), Jang Bogo (Figure 4), and Zhongshan station ( Figure 5). In most cases, Brewer TOCs in both DS and ZS mode consistently revealed a higher correlation with satellite TOCs (just slightly higher correlations were found with DS mode TOC). Among all satellite observations, the TROPOMI TOC had the highest correlation coefficient at all three stations with Brewer TOCs in DS mode (R ≥ 0.98). OMI showed the highest correlation with Brewer TOCs in DS mode at the Zhongshan station, but all correlations at three stations are actually high enough (R =~0.95 to 0.98). In contrast, the AIRS showed lower correlation coefficients with other TOC values. Lower correlations were found at the Jang Bogo and Zhongshan stations compared to the King Sejong station, indicating that TOCs retrieved from the AIRS measurements may be less reliable in the eastern Antarctic region.
Correlation coefficients for Level 2 and Level 3 data were not much different (Figures 3-5), revealing that both levels had a reliable quality of TOC values. Exactly the higher spatial resolution can reflect the better regional property with a higher correlation with the ground-based measurement. Owing to the deficiency for the source of local air pollutant in the polar region, however, it seems that both level data have the analogous performance, supporting previous findings in [14]. In conclusion, the use of Level 2 TOC data is preferentially recommended, but the usage of Level 3 TOC can also be acceptable for the polar TOC monitoring.
As mentioned in Section 2.2, unlike other sensors, TOC retrieval from the AIRS measurement was based on the observation of IR radiation, which propagates when heat is released. Thus, the uncertainty for the TOC retrieval can be attributed to the bias of temperature measurements [69]. Among the three stations, the King Sejong station located at a lower latitude shows warmer conditions than the Jang Bogo and Zhongshan stations located at a higher latitude (Figure 1), due to the longer solar radiation and the increased intrusion of warm air masses from the mid-latitude [70]. Considering the moderate correlation of the AIRS TOC with the Brewer TOC at the King Sejong station but low correlation at the Jang Bogo and Zhongshan stations (Figures 3-5), it seems that the lower surface temperature at higher latitudes may be related to the larger noise of retrieved TOCs for using IR channels. We also examined the monthly difference of the relationship between TOCs from other measurements and the AIRS TOCs ( Figure 6). The quality of AIRS TOCs was comparable to other TOCs during the austral summer. However, the large bias of AIRS TOC was frequently found in austral spring (September, October, and November) at the Jang Bogo and Zhongshan stations. Since there is almost no solar radiation in austral winter, the Antarctic surface temperature in austral spring rapidly increases in association with the rapid increase of incoming solar radiation. Namely, the inaccuracy of AIRS TOCs in spring as shown in Figure 6 can also be attributed to the temperature effect. Further study will be necessary for better understanding this issue. Remote Sens. 2021, 13, x FOR PEER REVIEW 9 of 20    Correlation coefficients for Level 2 and Level 3 data were not much different ( Figure  3, 4, and 5), revealing that both levels had a reliable quality of TOC values. Exactly the higher spatial resolution can reflect the better regional property with a higher correlation with the ground-based measurement. Owing to the deficiency for the source of local air  In addition, we analyzed other statistics (MBE, MABE, and RMSE) to determine the reliability of our interpretation for multiple TOCs using the correlation coefficient. MBE is a measure of the degree of difference between two groups, showing the systematic error of overestimation or underestimation. MABE is an index that compares each difference in MBE by summing the absolute value, indicating the degree of correlation and agreement between two comparing groups. RMSE is similar to the MABE, and more frequently used to describe how consistent the results are between groups. Calculated MBE and MABE values for TOCs from each satellite observation are summarized in Table 2, and RMSE and  N values are also provided in Tables 3 and 4, respectively. These statistical values evaluate the TOC from satellite measurements.  2  466  159  375  785  629  ZS  2  1107  316  766  1690  1206  DS  3  696  373  865  486  ZS  3  1599  758  2002  909  Jang Bogo  DS  2  164  195  345  349  332  ZS  2  246  199  449  448  432  DS  3  327  327  343  276  ZS  3  428  428  442  362  Zhongshan  DS  2  1376  165  1075  1940  1159  ZS  2  150  8  71  213  70  DS  3  1678  1072  2064  893  ZS  3  184  71  259  39 For all three sites, the TROPOMI measurement showed the smallest MBE, MABE, and RMSE of TOCs, and the AIRS showed the largest values. Considering that the small number of the TROPOMI (Table 4) may contribute to the low bias, the OMI measurement generally maintained the best quality of TOCs for a long-term period among satellite measurements. Both OMPS and GOME-2 TOCs had a smaller MBE, MABE, and RMSE than AIRS TOCs, but larger than those of OMI TOCs.
For most cases at the King Sejong station, satellite TOCs showed the positive MBE values, implying that the satellite measurement tends to have higher TOC compared to the ground-based measurements. At the Jang Bogo and Zhongshan stations, the OMI and GOME-2 measurements tended to have a small MBE in general, while the OMPS and AIRS TOCs were often compared to the Brewer TOC. This difference was sometimes higher than 10 DU, which is comparable to the interannual variation of the TOC in the Antarctic region [71], implying that the diagnosis of temporal TOC change can be varied according to the kind of satellite data.
The comparison between the MABE and RMSE reached the analogous conclusion of MBE analyses just only some small differences. For example, OMPS and GOME-2 TOCs had similar MABE values in spite of a large MBE difference at the King Sejong station: small MBE for OMPS TOCs but large MBE for GOME-2 TOCs. This implies that the GOME-2 TOCs were consistently higher than the Brewer TOC, but OMPS TOCs have 1 to 1 corresponding to the Brewer TOCs in general in spite of large biases. RMSE patterns also resulted with identical conclusions for MABE patterns. As found in this comparison, interpretation focusing on the difference between MBE and MABE is useful to determine the regional performance of satellite TOCs in detail. Our analyses using R, MBE, MABE, and RMSE revealed that there is no consistent conclusion about the difference among three sites between Level 2 and 3 data. Additionally, the comparison with Brewer TOCs from DS mode observations did not reveal much difference from that of ZS mode observations.
In conclusion, "which satellite TOC is used" should be significantly considered for the investigation of polar TOC pattern. According to the kind of satellite data, TOCs have regional discrepancy in the Antarctica. Namely, the atmospheric condition of each station has own characteristics for the relationship of TOCs between the ground-based and satellite measurements. We finally examined this issue based on a simple case study for the austral spring in 2019. Figure 7 shows the monthly mean TOC from OMI and OMPS measurements, and their differences. Although the moderate quality of OMPS TOC was confirmed with the Brewer TOC (Tables 2 and 3), there was the partial overestimation or underestimation compared to the OMI TOC. While the TOC difference between OMI and OMPS is not obvious in October and November, the TOC difference in September is large enough when the Antarctic ozone hole starts to occur. The difference was~30 DU at maximum, which is larger than the typical range of TOC annual variation [71]. Since this difference was not spatially uniform, it means that a different diagnosis can be derived according to the selection of the satellite dataset. Considering that the magnitude of TOC difference was not that small, this discrepancy cannot be easily neglected. This simple case study well revealed the necessity to utilize multiple TOC data simultaneously, at least for the TOC monitoring at the specific site. enough when the Antarctic ozone hole starts to occur. The difference was ~ 30 DU at maximum, which is larger than the typical range of TOC annual variation [71]. Since this difference was not spatially uniform, it means that a different diagnosis can be derived according to the selection of the satellite dataset. Considering that the magnitude of TOC difference was not that small, this discrepancy cannot be easily neglected. This simple case study well revealed the necessity to utilize multiple TOC data simultaneously, at least for the TOC monitoring at the specific site.

Conclusion
In this study, we used TOCs from Brewer measurements at the King Sejong, Jang Bogo, and Zhongshan station to evaluate TOCs from five satellite measurements. TOCs from the TROPOMI and OMI measurements had high correlations with the Brewer TOCs, but TOCs from the AIRS measurement did not. The quality of satellite TOCs did not show any consistent difference between Level 2 and Level 3, and between the comparison to the Brewer TOCs from DS and ZS mode. In conclusion, all satellite measurements revealed the moderate performance of Antarctic TOC monitoring. Nonetheless, we need to carefully consider differences among multiple satellite TOCs for the more accurate monitoring of the Antarctic ozone hole pattern. At this present moment, most TOC studies in the Antarctic region have been conducted using OMI measurements [38]. This study confirmed the high quality of OMI TOCs, but we still recommend to use multiple satellite TOCs simultaneously when detail and quantified analyses are requested, such as in the diagnosis of the interannual variability of TOC.