Evaluation of Operational Monsoon Moisture Surveillance and Severe Weather Prediction Utilizing COSMIC-2/FORMOSAT-7 Radio Occultation Observations

: Operational monsoon moisture surveillance and severe weather prediction is essential for timely water resource management and disaster risk reduction. For these purposes, this study suggests a moisture indicator using the COSMIC-2/FORMOSAT-7 radio occultation (RO) observations and evaluates numerical model experiments with RO data assimilation. The RO data quality is validated by a comparison between sampled RO proﬁles and nearby radiosonde proﬁles around Taiwan prior to the experiments. The suggested moisture indicator accurately monitors daily moisture variations in the South China Sea and the Bay of Bengal throughout the 2020 monsoon rainy season. For the numerical model experiments, the statistics of 152 moisture and rainfall forecasts for the 2020 Meiyu season in Taiwan show a neutral to slightly positive impact brought by RO data assimilation. A forecast sample with the most signiﬁcant improvement reveals that both thermodynamic and dynamic ﬁelds are appropriately adjusted by model integration posterior to data assimilation. The statistics of 17 track forecasts for typhoon Hagupit (2020) also show the positive effect of RO data assimilation. A forecast sample reveals that the member with RO data assimilation simulates better typhoon structure and intensity than the member without, and the effect can be larger and faster via multi-cycle RO data assimilation.


Introduction
Taiwan-as an island located in the East Asian monsoon region adjacent to the northwestern Pacific Ocean-is subject to various severe weather systems, such as Meiyu fronts, typhoons, and local thunderstorms [1]. Rainfall brought by these severe weather systems is the main source of water on the island, but excessive rainfall may cause massive landslides or floods that threaten human life and property. For timely water resource management and disaster risk reduction, it is essential to enhance the surveillance of monsoon moisture and the prediction of severe weather systems. The enhancement relies greatly on utilizing more advanced and widespread remote sensing observations, especially meteorological satellite observations over the ocean where in-situ measurements hardly exist. Most active and passive meteorological satellites sense the radiation emitted and reflected by the Earth and its atmosphere in the nadir direction using multiple channels of different wavelengths, which provide data at high horizontal resolution. However, with nadir sounding, the vertical profiles of the atmospheric state are indirectly estimated by assuming weighting functions as a function of altitude for all the channels. The vertical resolution of the estimated profiles is limited, and the accuracy is vulnerable to the sensitivity of sensors and the uncertainty of algorithms. To retrieve a more accurate atmospheric state at higher vertical resolution, the global navigation satellite system (GNSS [2,3]) radio occultation (RO) observations aboard low Earth orbit (LEO [4,5]) satellites have been regarded as the best solution thus far. From these limb-sounding observations, the Doppler shift of each refracted ray is used to derive the bending angle of the ray, and then the bending angle profile is used to derive refractivity that contains implicit information about atmospheric pressure, temperature, and humidity. The vertical resolution of GNSS RO profiles can reach up to dozens of meters in the lower troposphere via the high sampling rate of LEO receivers, and the accuracy is immune to the sensitivity of sensors because the phase information of electromagnetic waves is used instead of the amplitude.
The first application of the GNSS RO technique to atmospheric sounding was made by the Microlab-1 microsatellite in the Global Positioning System/Meteorology (GPS/MET) experiment managed by the University Corporation for Atmospheric Research (UCAR) from 1995 to 1997, which confirmed the feasibility and potential of this technique [6]. Following the success of GPS/MET, more satellite missions with GNSS RO measurements have been launched, such as the Challenging Minisatellite Payload (CHAMP [7]) by Germany, the Satélite de Aplicaciones Científicas-C (SAC-C [8]) by Argentina, the Gravity Recovery and Climate Experiment (GRACE [9]) by the United States (US) and Germany, the Constellation Observing System for Meteorology, Ionosphere, and Climate/Formosa Satellite Mission 3 (COSMIC/FORMOSAT-3 [10]) and COSMIC-2/FORMOSAT-7 [11] by the US and Taiwan, the Meteorological Operational Satellite (MetOp [12]) by Europe, the Terra X-band Synthetic Aperture Radar (TerraSAR-X [13]) by Germany, the Lemur Cube-Sats [14] by the US, and the Sentinel-6 [15] by Europe and the US. Among these missions, COSMIC/FORMOSAT-3 was the first one dedicated to the needs of operational climate monitoring and weather forecasting by deploying a constellation of six LEO satellites in 2006, which was retired and succeeded by COSMIC-2/FORMOSAT-7 with another constellation of six in 2019. The major differences between the two constellations are the orbital inclination and the compatibility with GNSSs. The COSMIC/FORMOSAT-3 constellation had a 72 • inclination, which led to a more uniform global distribution of observations, and only received signals from GPS. Instead, the COSMIC-2/FORMOSAT-7 constellation has a 24 • inclination, which favors observations at low and middle latitudes, and receives signals from GPS and the Russian Globalnaya Navigatsionnaya Sputnikovaya Sistema (GLONASS [2,3]) with an average of 4000 profiles per day at present. This quantity is several times as large as that in COSMIC/FORMOSAT-3, and it can be larger when signals from the European Galileo system [2,3] are receivable as well. Therefore, evaluating the effects of COSMIC-2/FORMOSAT-7 RO observations on the essential monsoon moisture surveillance and severe weather prediction becomes the motivation of this study.
For the purpose of monsoon moisture surveillance, information about atmospheric pressure, temperature, and humidity contained by RO refractivity data can be retrieved via a one-dimensional variational (1DVar) procedure and utilized. The 1DVar analyses were verified with radiosonde measurements and global weather analyses soon after the success of GPS/MET and proved reliable [16,17]. As for severe weather prediction, assimilating RO observations into numerical models is considered a key factor to improve forecast accuracy [18,19]. Regarding the assimilated variable, the retrieved pressure, temperature, and humidity may not be the best choice because of the uncertainty embedded in the 1DVar procedure. Therefore, the upstream variables, such as refractivity and the bending angle, have been more favorable for data assimilation with the necessity of corresponding observation operators [20,21]. The refractivity operator was derived as a function of the total pressure, the partial pressure of water vapor, and the absolute temperature [20], and the bending angle operator was derived as a function of the impact parameter and the Abel transform of a refractivity profile under a local spherical symmetry assumption [21]. It is also worth noting that some studies explored more sophisticated operators to alleviate the errors resulting from the local spherical symmetry assumption, such as ray-tracing operators [22][23][24]. Another advance was the proposal of a nonlocal linear observation operator for the modeled and observed excess phase paths, which integrate the modeled and Abelderived refractivities, respectively, along the refracted ray trajectory below the top of the numerical model [25,26]. The excess phase path operator accurately accounts for horizontal along-track refractivity gradients, which are strong around severe weather systems, by canceling the linearization and discretization errors during the two integration processes, and it is faster than the ray-tracing operator and simple for practical implementation.
There have been numerous studies employing the aforementioned operators to investigate the effects of RO data assimilation on numerical model analyses and forecasts. The assimilation of refractivity from Microlab-1 was pioneered by a series of observing system simulation experiments using the fifth generation Penn State University/National Center for Atmospheric Research (NCAR) mesoscale model (MM5) three-dimensional variational (3DVar [27]) system for the scenario of an extratropical cyclone, which found that the analyses of temperature and humidity and the consequent forecast of the cyclone were improved [18]. The assimilation of the bending angle from Microlab-1 was pioneered using the Spectral Statistical Interpolation (SSI) 3DVar system of the National Centers for Environmental Prediction (NCEP), which found that the analysis increment of temperature was the largest in the middle and upper troposphere and stratosphere while that of humidity was the largest in the lower troposphere [19]. After these pioneering studies, multiple-run evaluations were performed by two studies using the NCEP SSI 3DVar system. One assimilated 837 bending angle profiles from Microlab-1 over 11 days and obtained more accurate analyses and forecasts of temperature and humidity between the 200-and 850-hPa levels, verified with 56 collocated radiosonde profiles [28]. The other assimilated 3030 bending angle profiles from CHAMP over 16 days and reduced the large cold bias of the NCEP analyses without CHAMP, verified with 264 collocated radiosonde and dropsonde profiles [29]. Both studies validated the positive effects of RO data assimilation, which were more pronounced in the Southern Hemisphere because more in-situ measurements were assimilated in the Northern Hemisphere. Hence, RO observations have been operationally assimilated worldwide at most numerical weather prediction centers, such as the European Centre for Medium-Range Weather Forecasts (ECMWF [30,31]), NCEP [32,33], Environment and Climate Change Canada (ECCC [34]), the Met Office [35], and the Deutscher Wetterdienst (DWD [36]). For further insight into severe weather prediction, several studies were devoted to optimizing RO data assimilation for tropical cyclone cases. It was found that the forecasts of tropical cyclone tracks, intensity, and induced rainfall could be improved by RO data assimilation [37][38][39][40][41][42][43], and more improvement could be made by increasing the number of assimilation cycles [38], using the nonlocal observation operator [39,41], using a flow-dependent assimilation scheme [38], or assimilating other kinds of observations in addition [37,38,40]. Similar conclusions were also drawn for rainfall forecasts in Meiyu cases [38,41,44].
Recent studies have started to assess the performance of the COSMIC-2/FORMOSAT-7 RO observations, which have the best coverage ever at low and middle latitudes as mentioned earlier. Preliminary results revealed that the observations agreed well with independent radiosondes [11,45], LEO satellites [45,46], and model analyses [11], and their high signal-to-noise ratios benefited the accuracy of humidity retrieval [45]. Utilizing such high-resolution, high-quality RO observations-this study develops and evaluates operational products for monsoon moisture surveillance and severe weather prediction [47] at the National Science and Technology Center for Disaster Reduction (NCDR), which serves as a think tank on disaster risk reduction in Taiwan. The purpose is to understand the accuracy of these operational products and scrutinize the role played by the RO observations. Section 2 describes the materials and methods, including the regional verification of the RO observations, a suggested RO moisture indicator, the design of numerical model experiments with RO data assimilation, and the statistical method of the paired t-test. Section 3 presents the corresponding results. Section 4 provides in-depth discussion on observational and modeling aspects. Section 5 is devoted to conclusions.

Source and Verification of the COSMIC-2/FORMOSAT-7 RO Observations
On the Taiwan side, the COSMIC-2/FORMOSAT-7 RO observations are operationally received, processed, archived, and released by Taiwan Analysis Center for COSMIC (TACC [48]). The first test data were released on 10 December 2019, and real-time data have been open to the public since 7 March 2020. TACC provides raw GNSS data as well as retrieved atmospheric and ionospheric data at various processing levels and formats, among which the files named atmPrf and wetPf2 are automatically downloaded by NCDR every hour. The latency is about 50 min from the observation time. The atmPrf files mainly contain the profiles of the refractivity, bending angle, dry pressure, dry temperature, azimuth, and impact parameter at the heights of the refracted rays. The wetPf2 files mainly contain the profiles of the 1DVar-retrieved pressure, temperature, and humidity interpolated to 50-and 100-m resolution at the heights of 0-20 and 20-60 km, respectively. Figure 1 shows the daily number of the downloaded RO profiles from the start date, 1 October 2019, of the first test data to a recent date, 26 April 2021. The average is 4114 per day.
the accuracy of these operational products and scrutinize the role played by the RO observations. Section 2 describes the materials and methods, including the regional verification of the RO observations, a suggested RO moisture indicator, the design of numerical model experiments with RO data assimilation, and the statistical method of the paired ttest. Section 3 presents the corresponding results. Section 4 provides in-depth discussion on observational and modeling aspects. Section 5 is devoted to conclusions.

Source and Verification of the COSMIC-2/FORMOSAT-7 RO Observations
On the Taiwan side, the COSMIC-2/FORMOSAT-7 RO observations are operationally received, processed, archived, and released by Taiwan Analysis Center for COSMIC (TACC [48]). The first test data were released on 10 December 2019, and real-time data have been open to the public since 7 March 2020. TACC provides raw GNSS data as well as retrieved atmospheric and ionospheric data at various processing levels and formats, among which the files named atmPrf and wetPf2 are automatically downloaded by NCDR every hour. The latency is about 50 min from the observation time. The atmPrf files mainly contain the profiles of the refractivity, bending angle, dry pressure, dry temperature, azimuth, and impact parameter at the heights of the refracted rays. The wetPf2 files mainly contain the profiles of the 1DVar-retrieved pressure, temperature, and humidity interpolated to 50-and 100-m resolution at the heights of 0-20 and 20-60 km, respectively. Figure  1 shows the daily number of the downloaded RO profiles from the start date, 1 October 2019, of the first test data to a recent date, 26 April 2021. The average is 4114 per day. Although the quality of the COSMIC-2/FORMOSAT-7 RO observations has been well recognized [11,45,46], this study still begins with a comparison between sampled RO profiles and nearby radiosonde profiles around Taiwan so as to understand their regional characteristics prior to further applications. The compared variables are temperature and specific humidity ( ) interpolated to 10-hPa resolution as well as precipitable water ( ) above certain pressure levels ( ), calculated as [49]: where is gravity; the values of include 500, 600, 700, 850, 925, and 1000 hPa. The water vapor above the 10-hPa level is omitted. From 1 October 2019 to 23 June 2020, there are 59 RO profiles sampled for being within 100 km and ±2 h from nearby radiosonde profiles provided by six radiosonde stations, including Banqiao with 9 profiles, Hualien with 13, Tungsha with 13, Pingtung with 9, Magong with 11, and Green Island with 4 as shown in Figure 2. The selection of the 100-km and ±2-h thresholds is a compromise between sufficient samples and sufficient representativeness in space and time to ensure statistical significance. Moreover, because the quality of RO observations in the tropical lower troposphere can be influenced by the effect of multipath propagation resulting from Although the quality of the COSMIC-2/FORMOSAT-7 RO observations has been well recognized [11,45,46], this study still begins with a comparison between sampled RO profiles and nearby radiosonde profiles around Taiwan so as to understand their regional characteristics prior to further applications. The compared variables are temperature and specific humidity (s) interpolated to 10-hPa resolution as well as precipitable water (W x ) above certain pressure levels (p x ), calculated as [49]: where g is gravity; the values of p x include 500, 600, 700, 850, 925, and 1000 hPa. The water vapor above the 10-hPa level is omitted. From 1 October 2019 to 23 June 2020, there are 59 RO profiles sampled for being within 100 km and ±2 h from nearby radiosonde profiles provided by six radiosonde stations, including Banqiao with 9 profiles, Hualien with 13, Tungsha with 13, Pingtung with 9, Magong with 11, and Green Island with 4 as shown in Figure 2. The selection of the 100-km and ±2-h thresholds is a compromise between sufficient samples and sufficient representativeness in space and time to ensure statistical significance. Moreover, because the quality of RO observations in the tropical lower troposphere can be influenced by the effect of multipath propagation resulting from the large vertical gradients of refractivity in precipitation systems [50,51], the 59 RO profiles are divided into the sunny group and the cloudy/rainy group for separate statistics. An RO profile is classified into the sunny group if the majority of its surrounding area within a 30-km distance is cloudless, according to satellite images from Himawari-8; otherwise, it is classified into the cloudy/rainy group.
the large vertical gradients of refractivity in precipitation systems [50,51], the 59 RO profiles are divided into the sunny group and the cloudy/rainy group for separate statistics. An RO profile is classified into the sunny group if the majority of its surrounding area within a 30-km distance is cloudless, according to satellite images from Himawari-8; otherwise, it is classified into the cloudy/rainy group.

Experimental Design for Monsoon Moisture Surveillance
The rainy season of the East Asian summer monsoon consists of two phases [52]. The first phase begins with the onset of the South China Sea summer monsoon, characterized by strong southwesterly flow that penetrates the South China Sea and transports abundant moisture to East Asia. This phase is also called the pre-flood period [53], during which a large-scale stationary front with heavy rain (called Meiyu) occurs over South China, Taiwan, and Okinawa. The second phase is called the grand-onset phase, characterized by the stationary front migrating to Central and East China, the main islands of Japan (called Baiu), and the Korean Peninsula (called Changma). The moisture in this phase is mainly transported from the Bay of Bengal [54]. Therefore, this study retrieves daily moisture variations in the South China Sea during 26 April-10 June 2020, and the Bay of Bengal during 1 May-30 July 2020, using the COSMIC-2/FORMOSAT-7 RO observations, an observational approach of monsoon moisture surveillance in addition to the use of reanalysis data [52][53][54]. The indicator is total precipitable water ( ), calculated as [49]:

Experimental Design for Monsoon Moisture Surveillance
The rainy season of the East Asian summer monsoon consists of two phases [52]. The first phase begins with the onset of the South China Sea summer monsoon, characterized by strong southwesterly flow that penetrates the South China Sea and transports abundant moisture to East Asia. This phase is also called the pre-flood period [53], during which a large-scale stationary front with heavy rain (called Meiyu) occurs over South China, Taiwan, and Okinawa. The second phase is called the grand-onset phase, characterized by the stationary front migrating to Central and East China, the main islands of Japan (called Baiu), and the Korean Peninsula (called Changma). The moisture in this phase is mainly transported from the Bay of Bengal [54]. Therefore, this study retrieves daily moisture variations in the South China Sea during 26 April-10 June 2020, and the Bay of Bengal during 1 May-30 July 2020, using the COSMIC-2/FORMOSAT-7 RO observations, an observational approach of monsoon moisture surveillance in addition to the use of reanalysis data [52][53][54]. The indicator is total precipitable water (W T ), calculated as [49]: where p 0 and p 20 are the pressure at the heights of 0 and 20 km, respectively. Figure 3 shows two separate surveillance domains in the South China Sea (110-120 • E, 5-15 • N) and the Bay of Bengal (80-97.5 • E, 0-22.5 • N). The RO observations in each domain are collected every 24 h from 12:00 to next 11:59 UTC (Coordinated Universal Time) and then averaged into a daily mean profile of pressure and specific humidity, from which the daily mean total precipitable water is derived via Equation (2). To evaluate this approach, the NCEP Climate Forecast System Reanalysis (CFSR [55]) data and the undermentioned regional model analyses with RO data assimilation are also used to calculate domain-averaged total precipitable water every 00:00 UTC during the surveillance periods for intercomparison.
where and are the pressure at the heights of 0 and 20 km, respectively. Figure  3 shows two separate surveillance domains in the South China Sea (110-120°E, 5-15°N) and the Bay of Bengal (80-97.5°E, 0-22.5°N). The RO observations in each domain are collected every 24 h from 12:00 to next 11:59 UTC (Coordinated Universal Time) and then averaged into a daily mean profile of pressure and specific humidity, from which the daily mean total precipitable water is derived via Equation (2). To evaluate this approach, the NCEP Climate Forecast System Reanalysis (CFSR [55]) data and the undermentioned regional model analyses with RO data assimilation are also used to calculate domain-averaged total precipitable water every 00:00 UTC during the surveillance periods for intercomparison.

Experimental Design for Severe Weather Prediction
To examine the effects of COSMIC-2/FORMOSAT-7 RO data assimilation on severe weather prediction around Taiwan, this study employs the version 3.8.1 of the Weather Research and Forecasting (WRF) model and WRF data assimilation (WRFDA) system [56] with one-way, triple-nested domains, as shown in  [58], revised MM5 surface layer [59], Noah land surface model [60], Yonsei University planetary boundary layer [61], and the Rapid Radiative Transfer Model for GCMs (RRTMG) longwave and shortwave radiation [62]. For data assimilation, the 3DVar component of WRFDA is used with the background error covariance option of five control variables (CV5), which contain the stream function, unbalanced velocity potential, unbalanced surface pressure, unbalanced temperature, and pseudo relative humidity. Three outer loops are applied to the minimization of the cost function. The background error variance and length scaling variables refer to empirical values in a previous study [43]. Implementing this configuration, two operational 84-h forecast members are initialized with the NCEP Global Forecast System (GFS) 0.5° forecast every 00:00, 06:00, 12:00, and 18:00 UTC. Member 1 assimilates the Global Telecommunication System (GTS) in-situ observations at the initial time for all the three domains. Member 2 additionally assimilates the RO refractivity profiles within ±3 h from the initial time, which are thinned to 200-m vertical resolution to approach the model resolution. For each WRF member, 152 moisture and rainfall forecasts initialized during 00:00 UTC 26 May-18:00 UTC 2 July 2020, are verified with the water vapor mixing ratio data of the NCEP GFS analyses and

Experimental Design for Severe Weather Prediction
To examine the effects of COSMIC-2/FORMOSAT-7 RO data assimilation on severe weather prediction around Taiwan, this study employs the version 3.8.1 of the Weather Research and Forecasting (WRF) model and WRF data assimilation (WRFDA) system [56] with one-way, triple-nested domains, as shown in  [58], revised MM5 surface layer [59], Noah land surface model [60], Yonsei University planetary boundary layer [61], and the Rapid Radiative Transfer Model for GCMs (RRTMG) longwave and shortwave radiation [62]. For data assimilation, the 3DVar component of WRFDA is used with the background error covariance option of five control variables (CV5), which contain the stream function, unbalanced velocity potential, unbalanced surface pressure, unbalanced temperature, and pseudo relative humidity. Three outer loops are applied to the minimization of the cost function. The background error variance and length scaling variables refer to empirical values in a previous study [43]. Implementing this configuration, two operational 84-h forecast members are initialized with the NCEP Global Forecast System (GFS) 0.5 • forecast every 00:00, 06:00, 12:00, and 18:00 UTC. Member 1 assimilates the Global Telecommunication System (GTS) in-situ observations at the initial time for all the three domains. Member 2 additionally assimilates the RO refractivity profiles within ±3 h from the initial time, which are thinned to 200-m vertical resolution to approach the model resolution. For each WRF member, 152 moisture and rainfall forecasts initialized during 00:00 UTC 26 May-18:00 UTC 2 July 2020, are verified with the water vapor mixing ratio data of the NCEP GFS analyses and the radarderived quantitative precipitation estimation (QPE [63]) data around Taiwan from the Central Weather Bureau (CWB); 17 track forecasts for Typhoon Hagupit (2020) initialized during 12:00 UTC 1 August-12:00 UTC 5 August 2020 are verified with the best track data from CWB. Moreover, the forecast samples initialized at 12:00 UTC 6 June and 12:00 UTC the radar-derived quantitative precipitation estimation (QPE [63]) data around Taiwan from the Central Weather Bureau (CWB); 17 track forecasts for Typhoon Hagupit (2020) initialized during 12:00 UTC 1 August-12:00 UTC 5 August 2020 are verified with the best track data from CWB. Moreover, the forecast samples initialized at 12:00 UTC 6 June and 12:00 UTC 1 August 2020, for the cases of Meiyu and typhoon Hagupit, respectively, are spatially scrutinized, and the latter is additionally compared with two sensitivity experiments that perform three 6-h assimilation cycles prior to the forecast.

Statistical Method of the Paired t-test
To properly assess the statistical significance for the comparisons between the sampled RO and radiosonde profiles and between the rainfall forecasts made by WRF Members 1 and 2, the paired t-test is carried out [64]. Suppose there are sampled pairs with an average pair difference of . The t value for the paired t-test is calculated as: where is the expected value of the population; is the sample standard deviation of the pair differences. In this study, is set to 0 and then the calculated t value is compared against the critical value with a 95% confidence interval. If the absolute t value is smaller than the critical value, it means there is no statistically significant difference between the two groups.

Comparison between the Sampled RO and Radiosonde Profiles
Figure 5a-f calculate the average temperature, specific humidity, and precipitable water differences of the sampled COSMIC-2/FORMOSAT-7 RO profiles minus nearby radiosonde profiles for the sunny and cloudy/rainy groups separately. The standard deviations of the temperature and specific humidity differences are also shown. All sample number curves reveal a decreasing trend below the 700-hPa level because the lowest observable altitudes of the RO profiles near Banqiao, Hualien, and Pingtung radiosonde stations, located on the mountainous main island of Taiwan, are mostly higher than the other samples. The temperature difference for the sunny group is small at most levels with a

Statistical Method of the Paired t-test
To properly assess the statistical significance for the comparisons between the sampled RO and radiosonde profiles and between the rainfall forecasts made by WRF Members 1 and 2, the paired t-test is carried out [64]. Suppose there are n sampled pairs with an average pair difference of d. The t value for the paired t-test is calculated as: where µ 0 is the expected value of the population; s d is the sample standard deviation of the pair differences. In this study, µ 0 is set to 0 and then the calculated t value is compared against the critical value with a 95% confidence interval. If the absolute t value is smaller than the critical value, it means there is no statistically significant difference between the two groups.

Comparison between the Sampled RO and Radiosonde Profiles
Figure 5a-f calculate the average temperature, specific humidity, and precipitable water differences of the sampled COSMIC-2/FORMOSAT-7 RO profiles minus nearby radiosonde profiles for the sunny and cloudy/rainy groups separately. The standard deviations of the temperature and specific humidity differences are also shown. All sample number curves reveal a decreasing trend below the 700-hPa level because the lowest observable altitudes of the RO profiles near Banqiao, Hualien, and Pingtung radiosonde stations, located on the mountainous main island of Taiwan, are mostly higher than the other samples. The temperature difference for the sunny group is small at most levels with a standard deviation of approximately 1 K (Figure 5a), whereas the one for the cloudy/rainy group is notably negative below the 850-hPa level, reaching −2.7 K (Figure 5b). The specific humidity difference for the sunny group is positive above the 450-hPa level but negative below, and the absolute value and standard deviation decrease with altitude owing to rare water vapor at high levels ( Figure 5c). The vertical distribution of the specific humidity difference for the cloudy/rainy group is analogous to the sunny group, except for a larger absolute value and a positive difference instead between the 550-and 700-hPa levels (Figure 5d). It is to be noticed that the point-to-point temperature and specific humidity comparison between the RO and radiosonde profiles within a certain spatial and temporal distance is very rigorous, especially for moisture that can have larger spatial and temporal gradients. Therefore, the precipitable water vertically integrating the specific humidity above the 500-, 600-, 700-, 850-, 925-, and 1000-hPa levels is also compared. The precipitable water difference for the sunny group is near zero above all the six levels (Figure 5e), whereas the one for the cloudy/rainy group is notably negative above the 925-and 1000-hPa levels, reaching −5.5 kg m 2 (Figure 5f). Paired t-tests are carried out with the whole 59 sampled pairs of RO and radiosonde profiles. For temperature, the absolute t values are smaller than the critical values at almost all levels (Figure 5g). For specific humidity, the absolute t values exceed the critical values below the 900-hPa level and above the 400-hPa level (Figure 5h). As a concluding remark, the sunny group exhibits greater consistency than the cloudy/rainy group, and the overall consistency is great except for specific humidity at very low levels (where sample numbers are small) and high levels (where moisture is rare). cific humidity difference for the sunny group is positive above the 450-hPa level but negative below, and the absolute value and standard deviation decrease with altitude owing to rare water vapor at high levels ( Figure 5c). The vertical distribution of the specific humidity difference for the cloudy/rainy group is analogous to the sunny group, except for a larger absolute value and a positive difference instead between the 550-and 700-hPa levels (Figure 5d). It is to be noticed that the point-to-point temperature and specific humidity comparison between the RO and radiosonde profiles within a certain spatial and temporal distance is very rigorous, especially for moisture that can have larger spatial and temporal gradients. Therefore, the precipitable water vertically integrating the specific humidity above the 500-, 600-, 700-, 850-, 925-, and 1000-hPa levels is also compared. The precipitable water difference for the sunny group is near zero above all the six levels (Figure 5e), whereas the one for the cloudy/rainy group is notably negative above the 925-and 1000-hPa levels, reaching −5.5 kg m 2 (Figure 5f). Paired t-tests are carried out with the whole 59 sampled pairs of RO and radiosonde profiles. For temperature, the absolute t values are smaller than the critical values at almost all levels (Figure 5g). For specific humidity, the absolute t values exceed the critical values below the 900-hPa level and above the 400-hPa level (Figure 5h). As a concluding remark, the sunny group exhibits greater consistency than the cloudy/rainy group, and the overall consistency is great except for specific humidity at very low levels (where sample numbers are small) and high levels (where moisture is rare).  Figure 6 calculates the daily total precipitable water in the South China Sea during 26 April-10 June 2020, retrieved with the COSMIC-2/FORMOSAT-7 RO observations, the CFSR data, and the analyses of the aforementioned WRF Member 2. The 41-year (1979-2019) climatological average with the CFSR data is also shown. It is obvious that the RO observation curve corresponds well with the CFSR and WRF analysis curves, which look smoother with larger data size though. This confirms that the direct use of the RO humidity retrieval is sufficient for an accurate moisture indicator in a monsoon region. In comparison with the climatological average, the total precipitable water in the South China Sea was significantly higher during 3-7 May and 20-24 May 2020, with southeasterly and southwesterly prevailing winds (not shown), respectively, according to the CFSR data. The value during 20-24 May 2020 that reached 60 kg m −2 led to pronounced northeastward moisture flux and, consequently, the heaviest daily rainfall (668.5 mm) of the year in Taiwan on 22 May 2020. Therefore, for a more comprehensive outlook on monsoon rainfall, it is necessary to supplement the moisture indicator with the information of wind, which is not contained in the RO observations.  Figure 6 calculates the daily total precipitable water in the South China Sea during 26 April-10 June 2020, retrieved with the COSMIC-2/FORMOSAT-7 RO observations, the CFSR data, and the analyses of the aforementioned WRF Member 2. The 41-year (1979-2019) climatological average with the CFSR data is also shown. It is obvious that the RO observation curve corresponds well with the CFSR and WRF analysis curves, which look smoother with larger data size though. This confirms that the direct use of the RO humidity retrieval is sufficient for an accurate moisture indicator in a monsoon region. In comparison with the climatological average, the total precipitable water in the South China Sea was significantly higher during 3-7 May and 20-24 May 2020, with southeasterly and southwesterly prevailing winds (not shown), respectively, according to the CFSR data. The value during 20-24 May 2020 that reached 60 kg m −2 led to pronounced northeastward moisture flux and, consequently, the heaviest daily rainfall (668.5 mm) of the year in Taiwan on 22 May 2020. Therefore, for a more comprehensive outlook on monsoon rainfall, it is necessary to supplement the moisture indicator with the information of wind, which is not contained in the RO observations. Likewise, Figure 7 calculates the daily total precipitable water in the Bay of Bengal during 1 May-30 July 2020, retrieved with the RO observations and CFSR data. The RO observation curve still corresponds well with the smoother CFSR analysis curve, reconfirming the usability of the RO humidity retrieval. The comparison with the climatological average reveals tremendous moisture in the Bay of Bengal throughout the 2020 monsoon rainy season. The first peak that exceeded 60 kg m −2 on 19 May 2020, was due to cyclone Amphan, which was the first and strongest tropical cyclone in the 2020 North Indian Ocean cyclone season. Since mid-June 2020, the total precipitable water had kept around 60 kg m −2 for a couple of months, with a southwesterly prevailing wind (not shown) according to the CFSR data. This led to extreme northeastward moisture flux toward southern China and, consequently, the most devastating floods in the Yangtze basin since 1998. Likewise, Figure 7 calculates the daily total precipitable water in the Bay of Bengal during 1 May-30 July 2020, retrieved with the RO observations and CFSR data. The RO observation curve still corresponds well with the smoother CFSR analysis curve, reconfirming the usability of the RO humidity retrieval. The comparison with the climatological average reveals tremendous moisture in the Bay of Bengal throughout the 2020 monsoon rainy season. The first peak that exceeded 60 kg m −2 on 19 May 2020, was due to cyclone Amphan, which was the first and strongest tropical cyclone in the 2020 North Indian Ocean cyclone season. Since mid-June 2020, the total precipitable water had kept around 60 kg m −2 for a couple of months, with a southwesterly prevailing wind (not shown) according to the CFSR data. This led to extreme northeastward moisture flux toward southern China and, consequently, the most devastating floods in the Yangtze basin since 1998.

Severe Weather Prediction
The 152 moisture and rainfall forecasts made by each WRF member are verified with the NCEP GFS analysis and radar-derived QPE data, respectively. For the moisture forecasts, Figure 8 calculates the root-mean-square error of the predicted 850-hPa water vapor  Likewise, Figure 7 calculates the daily total precipitable water in the Bay of Bengal during 1 May-30 July 2020, retrieved with the RO observations and CFSR data. The RO observation curve still corresponds well with the smoother CFSR analysis curve, reconfirming the usability of the RO humidity retrieval. The comparison with the climatological average reveals tremendous moisture in the Bay of Bengal throughout the 2020 monsoon rainy season. The first peak that exceeded 60 kg m −2 on 19 May 2020, was due to cyclone Amphan, which was the first and strongest tropical cyclone in the 2020 North Indian Ocean cyclone season. Since mid-June 2020, the total precipitable water had kept around 60 kg m −2 for a couple of months, with a southwesterly prevailing wind (not shown) according to the CFSR data. This led to extreme northeastward moisture flux toward southern China and, consequently, the most devastating floods in the Yangtze basin since 1998.

Severe Weather Prediction
The 152 moisture and rainfall forecasts made by each WRF member are verified with the NCEP GFS analysis and radar-derived QPE data, respectively. For the moisture forecasts, Figure 8 calculates the root-mean-square error of the predicted 850-hPa water vapor

Severe Weather Prediction
The 152 moisture and rainfall forecasts made by each WRF member are verified with the NCEP GFS analysis and radar-derived QPE data, respectively. For the moisture forecasts, Figure 8 calculates the root-mean-square error of the predicted 850-hPa water vapor mixing ratio at 24 h in Domain 2 for both members. Although the COSMIC-2/FORMOSAT-7 RO observations contain implicit information about humidity, the two similar curves indicate that the impact of RO data assimilation tends to become neutral on the moisture field after 24-h model integration posterior to data assimilation. Nevertheless, a slightly positive impact exists in the predicted 24-48-h rainfall. Figure 9 calculates the correlation coefficient, root-mean-square error, and threat score above a 40-mm threshold of the predicted 24-48-h rainfall for both members. It is found that Member 2 mostly has higher correlation coefficients, lower root-mean-square errors, and higher threat scores than Member 1 despite limited differences. The average correlation coefficient, root-mean-square error, and threat score values for Members 1 and 2 are 0.163 and 0.171, 14.5 and 14.3 mm, and 0.0562 and 0.0633, respectively. Paired t-tests are carried out with the whole 152 pairs of rainfall forecasts made by Members 1 and 2. The absolute t values are 1.67 and 1.14 for the correlation coefficient and root-mean-square error, respectively. Both values are smaller than 1.97, which is the critical value with a 95% confidence interval. This infers that the additional assimilation of the RO observations at the initial time has a neutral to slightly positive impact on the moisture and rainfall forecasts during the Meiyu season in Taiwan. The most significant improvement occurs in the forecast sample initialized at 12:00 UTC 6 June 2020, whose predicted 24-48-h rainfall maps for Members 1 and 2 are compared with the simultaneous radar-derived rainfall map in Figure 10. It is found that Member 2 has a more similar rainfall pattern and intensity to the radar-derived rainfall than Member 1, which has a serious overprediction in central Taiwan without RO data assimilation.
than Member 1 despite limited differences. The average correlation coefficient, root-meansquare error, and threat score values for Members 1 and 2 are 0.163 and 0.171, 14.5 and 14.3 mm, and 0.0562 and 0.0633, respectively. Paired t-tests are carried out with the whole 152 pairs of rainfall forecasts made by Members 1 and 2. The absolute t values are 1.67 and 1.14 for the correlation coefficient and root-mean-square error, respectively. Both values are smaller than 1.97, which is the critical value with a 95% confidence interval. This infers that the additional assimilation of the RO observations at the initial time has a neutral to slightly positive impact on the moisture and rainfall forecasts during the Meiyu season in Taiwan. The most significant improvement occurs in the forecast sample initialized at 12:00 UTC 6 June 2020, whose predicted 24-48-h rainfall maps for Members 1 and 2 are compared with the simultaneous radar-derived rainfall map in Figure 10. It is found that Member 2 has a more similar rainfall pattern and intensity to the radar-derived rainfall than Member 1, which has a serious overprediction in central Taiwan without RO data assimilation.   . The (from top to bottom) correlation coefficient, root-mean-square error, and threat score above a 40-mm threshold of the predicted 24-48-h rainfall for the 152 rainfall forecasts initialized during 00:00 UTC 26 May-18:00 UTC 2 July 2020, for WRF Members 1 (pink curves) and 2 (red curves). The green line denotes the forecast initialized at 12:00 UTC 6 June 2020. Figure 9. The (from top to bottom) correlation coefficient, root-mean-square error, and threat score above a 40-mm threshold of the predicted 24-48-h rainfall for the 152 rainfall forecasts initialized during 00:00 UTC 26 May-18:00 UTC 2 July 2020, for WRF Members 1 (pink curves) and 2 (red curves). The green line denotes the forecast initialized at 12:00 UTC 6 June 2020. Figure 9. The (from top to bottom) correlation coefficient, root-mean-square error, and threat score above a 40-mm threshold of the predicted 24-48-h rainfall for the 152 rainfall forecasts initialized during 00:00 UTC 26 May-18:00 UTC 2 July 2020, for WRF Members 1 (pink curves) and 2 (red curves). The green line denotes the forecast initialized at 12:00 UTC 6 June 2020. With respect to typhoon prediction, the track forecasts for Typhoon Hagupit, for which CWB issued a sea warning during 2-3 August 2020, are verified with the best track data at 6-h intervals. Figure 11a calculates the total track error within 72 h averaged over the 17 forecasts made by each WRF member. It is found that the positive effect of RO data assimilation is distinguishable since 48 h and more significant since 66 h. For a closer inspection of the beginning 48 h, Figure 11b calculates the average total track, along-track, and cross-track error differences of Member 1 minus Member 2, where positive values represent the superiority of Member 2. The comparison reveals that RO data assimilation has a neutral impact on the moving speed and slightly positive impact on the moving With respect to typhoon prediction, the track forecasts for Typhoon Hagupit, for which CWB issued a sea warning during 2-3 August 2020, are verified with the best track data at 6-h intervals. Figure 11a calculates the total track error within 72 h averaged over the 17 forecasts made by each WRF member. It is found that the positive effect of RO data assimilation is distinguishable since 48 h and more significant since 66 h. For a closer inspection of the beginning 48 h, Figure 11b calculates the average total track, along-track, and cross-track error differences of Member 1 minus Member 2, where positive values represent the superiority of Member 2. The comparison reveals that RO data assimilation has a neutral impact on the moving speed and slightly positive impact on the moving direction of the predicted typhoon within 48 h. Taking the forecast sample initialized at 12:00 UTC 1 August 2020, for example, the track predicted by Member 2 is closer to the best track than that by Member 1 during 06:00 UTC 3 August (42 h)-06:00 UTC 4 August (66 h) 2020 ( Figure 12). As the best track during this period approaches the coastal waters of northern Taiwan, the improvement of the track forecast can benefit the timing of the sea warning issuance. In order to enhance the positive effect, two sensitivity experiments that perform initialization at 00:00 UTC 1 August 2020 and three 6-h assimilation cycles at 00:00, 06:00, and 12:00 UTC (termed as partial cycling) to incorporate earlier observation information are carried out for comparison. One, named as Member 1PC, follows Member 1 to only assimilate the GTS in-situ observations. The other, named as Member 2PC, follows Member 2 to additionally assimilate the RO refractivity profiles. Both partial cycling members use the same configuration of WRF and WRFDA as that in Members 1 and 2. The tracks predicted by Members 1PC and 2PC are found to correct the excessive moving speed of those predicted by Members 1 and 2 ( Figure 12). Table 1 further lists the total track error differences of Member 2 minus Member 2PC and Member 1PC minus Member 2PC, where positive values represent the superiority of Member 2PC. The result reveals significant outperformance of Member 2PC over Member 2 and Member 1PC. This forecast sample infers that, based on the background provided by global model forecasts, regional model forecasts are likely to benefit from suitable partial cycling assimilation of regional observations, especially the high-resolution, high-quality COSMIC-2/FORMOSAT-7 RO observations. track error differences of Member 2 minus Member 2PC and Member 1PC minus Member 2PC, where positive values represent the superiority of Member 2PC. The result reveals significant outperformance of Member 2PC over Member 2 and Member 1PC. This forecast sample infers that, based on the background provided by global model forecasts, regional model forecasts are likely to benefit from suitable partial cycling assimilation of regional observations, especially the high-resolution, high-quality COSMIC-2/FOR-MOSAT-7 RO observations.

Discussion
The 59 sampled pairs of COSMIC-2/FORMOSAT-7 RO and radiosonde profiles around Taiwan exhibit great overall consistency, except for specific humidity at very low levels for the cloudy/rainy group ( Figure 5). The exception is similar to the results of a recent study [45] and attributable to the low-level atmospheric instability in precipitation systems. There are also statistically significant differences at high levels, but they are understandable because the true locations of the RO and radiosonde observations can horizontally deviate from the locations of the occultation point and radiosonde station, respectively, as the height increases. The horizontal deviation degrades the representativeness of the comparison, especially for moisture that has a higher spatial variability. Despite these observational uncertainties, the suggested approach of monsoon moisture surveillance using the RO observations still accurately indicates daily moisture variations in the South China Sea and the Bay of Bengal (Figures 6 and 7). An accurate moisture indicator is certainly helpful for anticipating measures against potential droughts or heavy rainfall hazards.
The statistics of the 152 moisture and rainfall forecasts for the 2020 Meiyu season in Taiwan (Figures 8 and 9) and the 17 track forecasts for typhoon Hagupit (Figure 11) have shown the neutral to slightly positive effects of COSMIC-2/FORMOSAT-7 RO data assimilation. This subsection is to spatially scrutinize the reasons via the forecast samples initialized at 12:00 UTC 6 June and 12:00 UTC 1 August 2020, respectively. Figure 13 shows the 700-hPa temperature and wind differences as well as 850-hPa specific humidity and wind differences of WRF Member 2 minus WRF Member 1 at 0, 24, and 48 h for the Meiyu forecast sample. There is one RO profile (118.31 • E, 26.58 • N) available in Domain 3 at the analysis time, the assimilation of which decreases the temperature and moisture only near the profile location. However, it varies the wind analysis throughout the domain, where a northeasterly difference can be seen over the South China Sea. This suppresses the intensity of the southwesterly flow impinging on Taiwan and, consequently, results in lower temperature and moisture over the Taiwan Strait for the 24-h forecast. For the 48-h forecast, Member 2 still has lower moisture around Taiwan than Member 1 despite higher temperature, and thus the serious overprediction of the 24-48-h rainfall in central Taiwan is successfully corrected by RO data assimilation ( Figure 10). the profile location. However, it varies the wind analysis throughout the domain, where a northeasterly difference can be seen over the South China Sea. This suppresses the intensity of the southwesterly flow impinging on Taiwan and, consequently, results in lower temperature and moisture over the Taiwan Strait for the 24-h forecast. For the 48-h forecast, Member 2 still has lower moisture around Taiwan than Member 1 despite higher temperature, and thus the serious overprediction of the 24-48-h rainfall in central Taiwan is successfully corrected by RO data assimilation ( Figure 10). With respect to the typhoon Hagupit forecast sample, Figure 14 shows the west-east cross-sections, which pass through the typhoon center denoted by the black lines, of the temperature and geopotential height differences of WRF Member 2 minus WRF Member 1 at 6 h. The temperature difference is positive at 550-850 hPa west of the typhoon center (Figure 14a), which implies that the simulated condensation heating of the western eyewall updraft is strengthened and favorable for the warm core after RO data assimilation. The geopotential height difference is negative at 850-1000 hPa within the rainband areas With respect to the typhoon Hagupit forecast sample, Figure 14 shows the west-east cross-sections, which pass through the typhoon center denoted by the black lines, of the temperature and geopotential height differences of WRF Member 2 minus WRF Member 1 at 6 h. The temperature difference is positive at 550-850 hPa west of the typhoon center (Figure 14a), which implies that the simulated condensation heating of the western eyewall updraft is strengthened and favorable for the warm core after RO data assimilation. The geopotential height difference is negative at 850-1000 hPa within the rainband areas and positive at 250-850 hPa west of the typhoon center (Figure 14b), which implies that the structure of radial inflow at lower levels and outflow at upper levels is further established. Figure 15 shows the 850-hPa geopotential height, wind speed, and wind differences of Member 2 minus Member 1 at 24 h. After 24-h simulation, a dipole pattern is visible around the typhoon center location in both maps because the tracks have become different between the two members ( Figure 11). There are extensive decrease of geopotential height and increase of wind speed in the northwest quadrant of the typhoon, which demonstrates that the intensity and symmetry of the typhoon circulation is improved at low levels by RO data assimilation from a weaker and asymmetric circulation simulated in Member 1 (not shown). The improvement of the typhoon structure explains why Member 2 makes more accurate track forecasts to a certain degree ( Figure 12). Lastly, to further explore the effect of RO data assimilation with a partial cycling assimilation strategy, Figure 16 shows the 850-hPa geopotential height, wind speed, and wind differences of Member 2PC minus Member 1PC for the analyses and 24-h forecasts. At the final analysis time, the geopotential height difference is negative around the typhoon area but positive in its periphery, which demonstrates the reinforcement of the inner low pressure and outer subsidence. The wind speed difference is positive west and east of the typhoon center, which represents higher intensity in Member 2PC. After 24-h simulation, the improvement of the typhoon structure and intensity is significantly expanded. As a concluding remark on Figure 12, Table 1, and Figure 16, the effect of one-cycle RO data assimilation is smaller and slower, whereas multi-cycle RO data assimilation can enhance the quality of typhoon prediction more efficiently.
demonstrates that the intensity and symmetry of the typhoon circulation is improved at low levels by RO data assimilation from a weaker and asymmetric circulation simulated in Member 1 (not shown). The improvement of the typhoon structure explains why Member 2 makes more accurate track forecasts to a certain degree ( Figure 12). Lastly, to further explore the effect of RO data assimilation with a partial cycling assimilation strategy, Figure 16 shows the 850-hPa geopotential height, wind speed, and wind differences of Member 2PC minus Member 1PC for the analyses and 24-h forecasts. At the final analysis time, the geopotential height difference is negative around the typhoon area but positive in its periphery, which demonstrates the reinforcement of the inner low pressure and outer subsidence. The wind speed difference is positive west and east of the typhoon center, which represents higher intensity in Member 2PC. After 24-h simulation, the improvement of the typhoon structure and intensity is significantly expanded. As a concluding remark on Figure 12, Table 1, and Figure 16, the effect of one-cycle RO data assimilation is smaller and slower, whereas multi-cycle RO data assimilation can enhance the quality of typhoon prediction more efficiently.   (a) (b) Figure 15. The 850-hPa (a) geopotential height difference and (b) wind speed difference of WRF Member 2 minus WRF Member 1 at 24 h for the forecast sample initialized at 12:00 UTC 1 August 2020. The arrows indicate the wind difference. The green dots denote the COSMIC-2/FORMOSAT-7 RO observations assimilated by WRF Member 2. The dark blue shading indicates terrain.

Conclusions
This study utilizes COSMIC-2/FORMOSAT-7 RO observations to suggest a moisture indicator and design numerical model experiments for the purposes of monsoon moisture surveillance and severe weather prediction, respectively. Prior to the experiments, a comparison between 59 sampled RO profiles and nearby radiosonde profiles around Taiwan, divided into the sunny and cloudy/rainy groups, is made to verify the RO data quality. The comparison exhibits great overall consistency, except for specific humidity at very low levels for the cloudy/rainy group. Despite the exception, the suggested moisture indicator, which calculates total precipitable water out of the 1DVar-retrieved humidity profile, accurately monitors daily moisture variations in the South China Sea and the Bay of Bengal throughout the 2020 monsoon rainy season. With respect to the numerical model experiments, the performances of two operational 84-h WRF forecast members with and without RO data assimilation are compared. The statistics of 152 moisture and rainfall forecasts for the 2020 Meiyu season in Taiwan show a neutral to slightly positive impact brought by RO data assimilation. A forecast sample with the most significant improvement reveals that not only the temperature and moisture fields, but also the wind field is appropriately adjusted by model integration posterior to data assimilation. The statistics of 17 track forecasts for typhoon Hagupit (2020) also show the positive effect of RO data assimilation. A forecast sample reveals that the member with RO data assimilation simulates better typhoon structure and intensity than the member without, and the effect can be larger and faster via multi-cycle RO data assimilation.
It is noteworthy that even though there are globally more than 4000 COSMIC-2/ FORMOSAT-7 RO profiles per day, there may sometimes be very few or no RO profiles available in a regional domain at the analysis time such as Figure 13. Therefore, the partial cycling (or even full cycling) assimilation strategy becomes a key factor in propagating more RO observation information into the regional domain and indeed improves the typhoon forecast sample in this study. For future prospects, the statistics of operational multi-cycle forecasts can be carried out as well for a more systematical evaluation. It is also to be emphasized that even though the relatively cheaper 3DVar assimilation scheme and local refractivity observation operator are employed for the consideration of operational costs, this study still explores the benefits of the cutting-edge COSMIC-2/FORMOSAT-7 RO observations in both observational and modeling aspects. The statistical results can provide a benchmark for more advanced assimilation schemes or observation operators. Funding: This research was funded by Taiwan's National Science and Technology Center for Disaster Reduction (NCDR).

Data Availability Statement:
The operational products for monsoon moisture surveillance and severe weather prediction are available online: https://watch.ncdr.nat.gov.tw/watch_fs7, accessed on 23 May 2021. The digitial data are not publicly available due to NCDR's right of privacy.