Estimation of Precipitation Area Using S-Band Dual-Polarization Radar Measurements

Estimating precipitation area is important for weather forecasting as well as real-time application. This paper aims to develop an analytical framework for efficient precipitation area estimation using S-band dual-polarization radar measurements. Several types of factors, such as types of sensors, thresholds, and models, are considered and compared to form a data set. After building the appropriate data set, this paper yields a rigorous comparison of classification methods in statistical (logistic regression and linear discriminant analysis) and machine learning (decision tree, support vector machine, and random forest). To achieve better performance, spatial classification is considered by incorporating latitude and longitude of observation location into classification, compared with non-spatial classification. The data used in this study were collected by rain detector and present weather sensor in a network of automated weather systems (AWS), and an S-band dual-polarimetric weather radar during ten different rainfall events of varying lengths. The mean squared prediction error (MSPE) from leave-one-out cross validation (LOOCV) is computed to assess the performance of the methods. Of the methods, the decision tree and random forest methods result in the lowest MSPE, and spatial classification outperforms non-spatial classification. Particularly, machine-learning-based spatial classification methods accurately estimate the precipitation area in the northern areas of the study region.


Introduction
Precipitation plays a major role in the water cycle, which has direct impacts on human life. Accurate rainfall estimation is therefore essential to reduce damage from natural disasters such as drought, flood, and typhoon. Specially, identification of precipitating regions is of interest to many fields, such as agriculture, hydrology, and atmospheric science. A variety of equipment and techniques have been used to accurately estimate precipitation variability over time and space.
Ground-based equipment gauge has been widely used as a reference due to its accurate measurement of precipitation [1]. The gauge directly measures the amount of precipitation in limited monitoring sites, and spatial interpolation is often employed to predict precipitation at unobserved locations [2][3][4][5][6][7]. It is also used to compare with other remote sensing equipment as the ground truth value [8][9][10][11][12]. However, the accuracy of precipitation amount and detection is limited by the quantitative resolution of gages such as the size of bucket. Precipitation detector and visibility weather sensor from automatic weather stations (AWS) are additionally used to overcome this limitation in detecting the Remote Sens. 2021, 13,2039 2 of 20 existence of precipitation [13,14]. These instruments can yield accurate point measurements of precipitation occurrence but are limited into spatial coverage such as point measurements at the AWS site, thus resulting no observation over oceanic and mountainous regions, and suffer from observational errors [15,16].
To overcome these limitations, a weather radar is commonly used as a remote sensing instrument. The radar is superior in space-time resolution and spatial coverage is used for understanding spatial distribution of precipitation. To accurately estimate precipitation, it is essential to remove the contamination due to ground clutter and beam blockage over complex terrain. Many techniques to classify non-meteorological echoes have been developed using statistical methods such as fuzzy logic classifier, Bayesian approach, and neural network [17][18][19][20][21][22]. The contamination from beam blockage has also been corrected using a vertical profile of radar parameter and a beam blockage fraction estimated by radar beam propagation of standard atmosphere and digital elevation model (DEM) simulation of the beam blocked fraction [23,24]. A hybrid scan method consisting of the lowest radar bins was developed using multiple elevation angles to avoid the non-metrological echoes and beam blockage [25][26][27]. Furthermore, the spatial variabilities of dual-polarization parameters are useful features to identify non-meteorological echoes [22,28]. Ryzhkov and Zrnic [29] also showed that dual-polarimetric variables such as the correlation coefficient and the different phase are good indicators of non-meteorological echoes. Kwon et al. [27] developed Hybrid Surface Rainfall (HSR), which can improve rainfall estimation by using dual-polarization parameters and the hybrid scan method by reflecting current atmospheric conditions. However, HSR still shows high uncertainty at a long distance due to beam broadening and subsequent contamination by bright band. In addition, the uncertainty of precipitation estimation is increased due to the effects of beam height and wind drift [30,31]. According to Yan and Bárdossy [32], the correction of displacement between beam height and surface precipitation by using combined data of radar and ground measurement improves the precipitation estimation. The surface precipitation area identified in this study can thus further improve the QPE by correctly eliminating no precipitation area while retaining precipitation area. The complete description of precipitation area can be widely applied in various fields, such as weather forecast [33], hydrological and land surface modeling [32], development of precipitation sensor, and plant water requirement [34].
The objective of this paper is to develop an optimal framework for mapping precipitation areas using dual-polarization radars. The main contribution of this paper is twofold. First, this paper presents a novel data framework for improving the precipitation area estimation. Two types of sensors, rain detector (AWS_RE) and present weather sensor (VIS_WW), are considered. To generate the binary variable by accumulating one-minute data at ten-minute average data, two thresholds are employed. Lastly, two models, nonspatial and spatial models, are considered in the classification. The spatial model includes the longitude and latitude of station location in the set of covariates. Second, this paper provides a rigorous comparison of the classification methods for precipitation area estimation. There are numerous statistical and machine learning methods for classification problems. Among them, we consider five classification methods to estimate precipitation areas based on the accumulated data and spatial characteristics. Two statistical methods (logistic regression and linear discriminant analysis) and three machine learning methods (support vector machine, decision tree, and random forest) are used in this paper. For each method, as addressed earlier, two classification models are employed, non-spatial classification and spatial classification. The former considers only radar measurements as predictors, while the latter incorporates spatial information into classification, called spatial classification. It is expected that we can improve the accuracy of precipitation area estimation using spatial information. Additionally, we study and compare the events classified into two types of storms (stratiform and convective).

Methodology
Let x = x 1 , x 2 , . . . , x p T be a set of p covariates for each binary response y i , i = 1, · · · , n.
In this paper, we consider two statistical and three machine learning classification methods for precipitation area estimation. To assess the performance of the classifiers, the meansquared prediction error (MSPE) for each method is calculated using leave-one-out cross validation (LOOCV).

Logistic Regression
In linear regression, the response Y can be modeled directly with covariates due to its continuity. However, when the response is binary, it is more appropriate to model the probability of a response being a particular outcome, p = P(Y = 1|x) . A common approach to modeling p is the logistic regression (LR) model, where β j is the regression coefficient of x j . The left-hand side of the equation above is known as the log-odds or the logit of probability (p). Note that p is not linear in terms of x, while the logit is. The model can be rewritten as which is an S-shaped curve with predictions constrained between 0 and 1. To estimate the model parameters, β 1 , · · · , β p , the maximum likelihood method is commonly used.
Once we obtain the estimates, making predictionsp is simply achieved by substituting the values of the estimates for the regression parameters.

Linear Discriminant Analysis
As its name implies, linear discriminant analysis (LDA) involves modeling a response variable as a linear combination of the covariates. Welch [35] and Fisher [36] were both credited for developing LDA, and the latter is used in this paper.
Fisher's approach to the problem of finding an optimal classification rule is to find the linear combination of the covariates, which maximizes the between-group variance while also minimizing the within-group variance. Let B be the between-group covariance matrix and W be the within-group covariance matrix. Then, we seek to find b such that is maximized. The solution, or linear discriminant, is the eigenvector corresponding to the largest eigenvalue of W −1 B. This ratio of the between-group variance to the within-group variance can be thought of as a signal-to-noise ratio. In other words, Fisher's approach involves maximizing the signal-to-noise ratio. When the number of covariates is large, the calculations can become complicated. Unlike Welch's approach, the method does not make any assumptions about the underlying distributions of the data. However, it is necessary to assume that the covariance matrices are identical for all groups.

Support Vector Machine
Support vector machine (SVM) is a powerful tool for solving classification and regression problems in machine learning. If data are completely separable into two classes, there are infinitely many linear boundaries accurately separating into the two classes, and it is challenging to choose the optimal boundary based solely on accuracy. Vapnik [37] defined an alternate metric, the margin, which is the smallest distance from the observations to a Remote Sens. 2021, 13, 2039 4 of 20 hyperplane separating them. The maximal margin hyperplane is farthest from the training observations. The decision function can be written as where y i is the sign of the class (y i = 1 for Group 1 and y i = −1 for Group 2), α i is the model parameter, and x i u is the dot product between the observed data and the data points that fall on the margin boundary. In the completely separable case mentioned previously, the set of nonzero α parameter corresponds to the data points that fall on the boundary of the margin, known as support vectors. The classifier is supported solely by these data points. If D(u) > 0, the new sample is classified into Group 1, whereas, if D(u) < 0, the new sample is classified into Group 2.

Decision Tree
Decision tree (DT) methods involve segmenting the covariate space into a number of simple and distinct regions. To split the covariate space into the regions, recursive binary splitting is commonly used, starting at the top of the tree and successively splitting the covariate space. Each split is indicated via two new branches further down the tree.
In order to perform this splitting, all covariates are considered and one is chosen with the lowest impurity measure. There are several impurity measures available for DT. For instance, the Gini index for the two-class case is defined by wherep mk is the probability of the k th class in the m th region. This index measures how often a randomly selected element would be incorrectly identified. To predict a new observation, the region to which it belongs is identified, and it is assigned to the class with the largest class probability,p mk , in that region.

Random Forest
A single decision tree can produce highly variable estimates. To decrease this variance, an ensemble approach is a reasonable choice, combining the outcomes of several decision trees to a single classification. Random forests are widely used ensemble learning methods for classification and regression, combining predictions from multiple models to improve predictive performance [38]. These classifications can be thought of as votes, and the new observation is classified into the class with the most votes. An alternative approach is to use predicted probabilities. For a given class, the number of trees which classified the new observation into that class is divided by the total number of trees, which is considered the predicted probability for that class. The new observation is classified into the class with the highest probability. The variance reduction achieved by this method can be seen visually in the form of a smoother Receiver Operating Characteristic (ROC) curve, and numerically in the form of a lower Area Under the Curve (AUC).
In this method, a number of decision trees are built on bootstrapped training samples. The trees must be built in such a way that they are uncorrelated. Thus, at each split in a tree, a random sample of m covariates out of the full set of p covariates is chosen as split candidates. Then, one of the m covariates is chosen to split on. This prevents a very strong covariate from always being used for the first split in a tree, which leads to a unique set of trees. This leads to a lower variance than would be achieved using a single tree or bagged trees.

Spatial Classification
This paper focuses on the prediction of a binary response at unsampled locations. This prediction problem can be thought of a spatial classification problem. Utilizing spatial information in classification problems can enhance classification accuracy. To Remote Sens. 2021, 13, 2039 5 of 20 this end, we include spatial information, longitude and latitude in this study, in the set of covariates, which is referred to as Model 2. Different spatial information could be incorporated if available. To evaluate the effectiveness of the spatial information, the non-spatial classification model (Model 1) with only non-spatial predictors is compared with the spatial classification model (Model 2). In this study, both models include the radar variables as non-spatial predictors, but longitude and latitude are used as spatial predictors in only Model 2 (spatial classification model).

Leave-One-Out Cross Validation
Leave-one-out cross validation (LOOCV) is conducted to assess the performance of the classification methods mentioned above. Cross validation methods involve splitting the set of observations into two parts, a training set and a test set. The model is trained on the observations in the training set, and predictions are made for the observations in the test set. Then, the mean squared prediction error (MSPE) is calculated by comparing the actual and predicted values for the observations in the test set. In LOOCV, the test set consists of only a single observation and the model is trained on the remaining n − 1 observations. This is repeated n times, where n is the number of observations, and each observation is left out one by one. The LOOCV estimate of the MSPE is given by where y i is the actual value of the i th observation in the test set, andŷ i is the predicted value of the i th observation in the test set. Note that the observations in the test set are classified into one of two groups. Thus, y i andŷ i can both only take on one of two values-either 0 or 1. This constrains the MSPE to values between 0 and 1. Moreover, the MSPE can be thought of as the percent of incorrectly classified observations in the test set. Therefore, lower MSPE values are preferable.

Data Description
Data from the Mountain Myeonbong S-band dual polarization radar and the Automatic Weather Station (AWS) are used in this study. This radar has a beam width of 0.92 • and scans plan position indicators (PPI) at nine elevation angles (0 • , 0.39 • , 0.83 • , 2 • , 2.88 • , 4.06 • , 5.67 • , 7.88 • , 10.94 • ) for every 10 min (Table 1). The observed radar variables are reflectivity (Z H ), differential reflectivity (Z DR ), specific differential phase shift (K DP ), and cross-correlation coefficient (ρ HV ). The biases in Z H and Z DR were calibrated by post-processing. The averaged calibration bias of Z H is −3.9 dBZ calculated by using the self-consistency constraint between Z H and K DP [11,39]. The averaged calibration bias of Z DR is 0.03 dB obtained by comparison of observed data with the average Z H -Z DR relationship derived from data from disdrometer [11]. The HSR radar data are generated by using the lowest radar elevation angles which are not affected by ground echoes, beam blockage, and non-meteorological echoes [27]. The radar parameters in polar coordinates are converted to Cartesian coordinates with a horizontal resolution of 1 km × 1 km over a 240 km range. The pixel values with no echoes were replaced as Z H = −32 dBZ, Z DR = −8 dB, The AWS data are collected from two devices, rain (precipitation) detector (AWS_RE) and present weather sensor, using optical sensors (VIS_WW). These data are quality controlled by the physical limit check, the climate range check, the step check, the persistence check, the internal consistency check, and the median filter [42]. The total of 596 AWSs are used over the area covered by radar, and time resolution is 1 min. These data are recorded as 1 if rain is detected, and 0 otherwise. The present weather sensor computed the visibility and present weather using measured optical extinction/scattering, temperature, humidity, and precipitation rate with observed precipitation particles. The outputs of present weather sensor are WMO (World Meteorological Organization) 4680 code (drizzle, rain, showery precipitation, etc.) from 0 to 99, which is converted to binary data. The binary variable is set to 1 if the present weather code is greater than 40, indicating precipitation, and 0 otherwise. These binary data (AWS_RE and VIS_WW) are used as response variables in this study. The rain detector is installed in every AWS, while the present weather sensor is available in some stations. The map of the stations is shown in Figure 1. The AWS data are collected from two devices, rain (precipitation) detector (AWS_RE) and present weather sensor, using optical sensors (VIS_WW). These data are quality controlled by the physical limit check, the climate range check, the step check, the persistence check, the internal consistency check, and the median filter [42]. The total of 596 AWSs are used over the area covered by radar, and time resolution is 1 min. These data are recorded as 1 if rain is detected, and 0 otherwise. The present weather sensor computed the visibility and present weather using measured optical extinction/scattering, temperature, humidity, and precipitation rate with observed precipitation particles. The outputs of present weather sensor are WMO (World Meteorological Organization) 4680 code (drizzle, rain, showery precipitation, etc.) from 0 to 99, which is converted to binary data. The binary variable is set to 1 if the present weather code is greater than 40, indicating precipitation, and 0 otherwise. These binary data (AWS_RE and VIS_WW) are used as response variables in this study. The rain detector is installed in every AWS, while the present weather sensor is available in some stations. The map of the stations is shown in Figure 1.  To match the time resolution between the radar data (every 10 min) and the AWS data (every minute), we consider two different ways to accumulate the AWS data using two thresholds. For the first threshold (Type 1 threshold), the station is classified as "wet" (that is, avg (AWS_RE) ≥ 0.5 or avg (VIS_WW) ≥ 0.5) if rain was observed for at least five out of ten minutes. The second threshold (Type 2 threshold) defines a station as "wet" if rain was observed for at least one out of ten minutes (that is, avg (AWS_RE) ≥ 0.1 or avg (VIS_WW) ≥ 0.1). The resulting pair of the radar data and the AWS data were produced by extracting the values of the nearest radar pixel to each AWS. The observations with missing values are eliminated. Table 2 presents the eight scenarios by considering two sensors (AWS_RE and VIS_WW), two thresholds (Type 1 and Type 2), and two models (Model 1: non-spatial and Model 2: spatial). Eleven rain events are selected from June to September in 2017 (Table 3). A total of 318 10-min data are used to estimate the precipitation area. The PPI images of Z H and ρ HV for two precipitation types (stratiform and convective) are shown in Figure 2. Z H at the low elevation angle (0.0 • ) in the stratiform rain shows widespread precipitation area, and the rainfall intensity is quite weak (Figure 2a). Another typical characteristic of stratiform rainfall is the distinct melting layer. Low ρ HV at the high elevation angle (4.06 • ) appears in a ring shape, indicating a melting layer (Figure 2b). In the convective case, Z H at the low elevation angle is stronger (>35 dBZ) than that of the stratiform case. The echoes are aligned northeast to southwest (Figure 2c) and the melting layer is not shown in ρ HV at the high elevation angle (Figure 2d).  The precipitation type of each event is classified as stratiform and convective cases using the spatial autocorrelation of precipitation field. The spatial autocorrelation of precipitation is known to depend on precipitation types [43][44][45][46]. Thus, the spatial autocorrelation of rainfall rate can be used to quantitatively determine between convective and stratiform precipitation systems. Rainfall rate (R) estimated by ( = 200R 1.6 ) from HSR data is analyzed to calculate spatial autocorrelation with the maximum horizontal length of 100 km. An ellipse fit of decorrelation length is given by e-folding distances which measure the spatial extent for the variations of precipitation field. In this study, the time steps in which with the length of the major axis of the ellipse is more than 130 km are considered the stratiform type, while others are classified as the convective type. An example of spatial autocorrelation for stratiform and convective type is shown in Figure 3. The length of the major axis of the ellipse is 143.50 km (107.96 km) in the stratiform (convective) case. The length of the minor axis (43.52 km) as well as the axis ratio (0.40) is much smaller in this convective case. The length of the major axis is almost 100 km. These are the characteristics of banded convective precipitation. The precipitation type of each event is classified as stratiform and convective cases using the spatial autocorrelation of precipitation field. The spatial autocorrelation of precipitation is known to depend on precipitation types [43][44][45][46]. Thus, the spatial autocorrelation of rainfall rate can be used to quantitatively determine between convective and stratiform precipitation systems. Rainfall rate (R) estimated by Z H (Z H = 200R 1.6 ) from HSR data is analyzed to calculate spatial autocorrelation with the maximum horizontal length of 100 km. An ellipse fit of decorrelation length is given by e-folding distances which measure the spatial extent for the variations of precipitation field. In this study, the time steps in which with the length of the major axis of the ellipse is more than 130 km are considered the stratiform type, while others are classified as the convective type. An example of spatial autocorrelation for stratiform and convective type is shown in Figure 3. The length of the major axis of the ellipse is 143.50 km (107.96 km) in the stratiform (convective) case. The length of the minor axis (43.52 km) as well as the axis ratio (0.40) is much smaller in this convective case. The length of the major axis is almost 100 km. These are the characteristics of banded convective precipitation.

Results
First of all, we compare the eight scenarios described in Table 2 with statistical and machine learning classification methods in order to find an optimal data framework for precipitation area estimation, followed by an extensive comparison of classifiers. In this comparison, three factors are considered, sensor types (AWS and VIS), threshold types (0.1 and 0.5), and model choice (spatial and non-spatial). The medians of the MSPE values obtained from each of the eight scenarios are reported in Table 4. For the sensor types, it is found that the MSPE values of AWS are mostly smaller than those of VIS, which indicates that the rain detectors are preferable. It is interesting to note that the Type 1 threshold (0.5) offers the smaller median for AWS and VIS. To investigate the effectiveness of spatial information in precipitation area estimation, non-spatial (Model 1) and spatial (Model 2) models are compared under the same sensor and threshold. Notice that spatial model significantly outperforms non-spatial model for AWS and VIS. In addition, tree-based methods (DT and RF) in S3 provide the most precise estimation of precipitation area in terms of the MSPE. Figure 4 plots the distribution of the MSPE values for each scenario over all methods and events, showing the smallest variability in S3. Thus, in the remainder of this section, S3 is used to compare classification methods over different types of rain events.

Results
First of all, we compare the eight scenarios described in Table 2 with statistical and machine learning classification methods in order to find an optimal data framework for precipitation area estimation, followed by an extensive comparison of classifiers. In this comparison, three factors are considered, sensor types (AWS and VIS), threshold types (0.1 and 0.5), and model choice (spatial and non-spatial). The medians of the MSPE values obtained from each of the eight scenarios are reported in Table 4. For the sensor types, it is found that the MSPE values of AWS are mostly smaller than those of VIS, which indicates that the rain detectors are preferable. It is interesting to note that the Type 1 threshold (0.5) offers the smaller median for AWS and VIS. To investigate the effectiveness of spatial information in precipitation area estimation, non-spatial (Model 1) and spatial (Model 2) models are compared under the same sensor and threshold. Notice that spatial model significantly outperforms non-spatial model for AWS and VIS. In addition, tree-based methods (DT and RF) in S3 provide the most precise estimation of precipitation area in terms of the MSPE. Figure 4 plots the distribution of the MSPE values for each scenario over all methods and events, showing the smallest variability in S3. Thus, in the remainder of this section, S3 is used to compare classification methods over different types of rain events.  To illustrate precipitation area estimation for the entire area of interest, the case at 09:40 KST on 31 July 2017 is chosen, and the observed radar measurements are shown in Figure 5. The statistical and machine learning methods addressed in Section 2 are used. The estimated precipitation areas (no precipitation: blue, precipitation: red) are given in Figure 6. The true values from AWS are shown by triangles (no precipitation) and circles (precipitation). The maps yielded by the statistical methods present similar spatial patterns (two upper figures for LR and LDA). Substantial misclassification, estimating wet areas when the stations are observed as dry, has been found in the northern areas. However, the machine learning methods correctly estimate the areas as dry. RF produces a smoother classification boundary than DT. To illustrate precipitation area estimation for the entire area of interest, the case at 09:40 KST on 31 July 2017 is chosen, and the observed radar measurements are shown in Figure 5. The statistical and machine learning methods addressed in Section 2 are used. The estimated precipitation areas (no precipitation: blue, precipitation: red) are given in Figure 6. The true values from AWS are shown by triangles (no precipitation) and circles (precipitation). The maps yielded by the statistical methods present similar spatial patterns (two upper figures for LR and LDA). Substantial misclassification, estimating wet areas when the stations are observed as dry, has been found in the northern areas. However, the machine learning methods correctly estimate the areas as dry. RF produces a smoother classification boundary than DT. Remote Sens. 2021, 13, 2039 11 of 21 Figure 5. Radar measurements, reflectivity (ZH), differential reflectivity (ZDR), specific differential phase (KDP), and crosscorrelation coefficient ( ), at 09:40 KST on 31 July 2017.  boxplots. Note that the machine learning methods apparently achieve lower MSPE, and DT and RF methods have the lowest median of the MSPE values. The spatial patterns of the MSPE values are visualized in Figure 8. The spatial patterns of the statistical methods, LR and LDA, are similar, indicating large values in central and western regions. The errors show systematic structure in space. The machine learning methods also show similar spatial patterns, and RF has lower MSPE values on some stations in southern and northern regions. The errors are nearly random in space.  Figure 9 presents the boxplots of the MSPE values for each of these two types o events and of all events. As expected, the stratiform category has smaller MSPE tha convective category. However, there are some outliers in stratiform. The convective gory showed the largest error.  Figure 9 presents the boxplots of the MSPE values for each of these two types of rain events and of all events. As expected, the stratiform category has smaller MSPE than the convective category. However, there are some outliers in stratiform. The convective category showed the largest error. Figure 10 displays the boxplots of the MSPE for the five classification methods by precipitation types. It indicates that the machine learning methods, SVM, DT, and RF, mostly outperform the statistical methods, LR and LDA, in terms of the MSPE. Note that there are outliers on the high end for all classification methods. For the convective category, the median of the MSPE for DT is considerably lower than that of the other methods. Similar to the stratiform category, the machine learning methods have smaller medians than the statistical methods. Interestingly, there are no outliers in the boxplots for the convective category. The combined category shows slightly larger medians than the stratiform categories. Likewise, the machine learning methods perform better than the statistical methods except SVM. A few outliers of MSPE are detected in DT and RF.  Figure 10 displays the boxplots of the MSPE for the five classification methods by precipitation types. It indicates that the machine learning methods, SVM, DT, and RF, mostly outperform the statistical methods, LR and LDA, in terms of the MSPE. Note that there are outliers on the high end for all classification methods. For the convective category, the median of the MSPE for DT is considerably lower than that of the other methods. Similar to the stratiform category, the machine learning methods have smaller medians than the statistical methods. Interestingly, there are no outliers in the boxplots for the convective category. The combined category shows slightly larger medians than the stratiform categories. Likewise, the machine learning methods perform better than the statistical methods except SVM. A few outliers of MSPE are detected in DT and RF. In Figure 11, the estimated precipitation areas for the two types, convective and stratiform, are compared with the two selected classification methods, LDA and RF, that yield overall the worst and best results, respectively. For the convective case, RF correctly estimates wet areas in the western region, while LDA misclassifies them as dry. Similarly, RF provides more precise estimates in the northern and south-western regions for the stratiform case. It is worth pointing out that the two dry stations in the south-western region are accurately estimated by RF even though they are surrounded by wet areas. In Figure 11, the estimated precipitation areas for the two types, convective and stratiform, are compared with the two selected classification methods, LDA and RF, that yield overall the worst and best results, respectively. For the convective case, RF correctly estimates wet areas in the western region, while LDA misclassifies them as dry. Similarly, RF provides more precise estimates in the northern and south-western regions for the stratiform case. It is worth pointing out that the two dry stations in the south-western region are accurately estimated by RF even though they are surrounded by wet areas.
For validation purposes, we perform the precipitation area estimations with data from different years, 2018 and 2019, and a different validation method. In the previous analysis, the models are trained by 10 min data and evaluated by LOOCV, which is applicable in real time. In this analysis, we consider temporally aggregated data sets acquired by combining all 10 min data sets in each event and hold-out validation in order to examine the sensitivity of model performance to temporal resolution and validation method. Table 5 presents the MSPE performance of the five models for the six events and shows similar results as compared to the previous analysis. The machine learning methods mostly outperform the statistical methods, and the tree-based methods are more appealing than the other methods. For validation purposes, we perform the precipitation area estimations with data from different years, 2018 and 2019, and a different validation method. In the previous analysis, the models are trained by 10 min data and evaluated by LOOCV, which is applicable in real time. In this analysis, we consider temporally aggregated data sets acquired by combining all 10 min data sets in each event and hold-out validation in order to examine the sensitivity of model performance to temporal resolution and validation method. Table 5 presents the MSPE performance of the five models for the six events and shows similar results as compared to the previous analysis. The machine learning methods mostly outperform the statistical methods, and the tree-based methods are more appealing than the other methods. Figure 11. Estimated precipitation areas of LDA and RF for two types (stratiform and a convective) of cases. Estimated dry (blue), estimated wet (red), observed dry (triangle), and observed wet (circle).

Discussion
Our results indicate that the best model is the tree-based methods (DT and RF), wh does not require retraining once it is trained. The tree-based model is computationa efficient and easy to deploy at any machine. Wolfensberger et al. [47] also mentioned t the RF-based polarimetric radar QPE algorithm can easily be used operationally. Shin al. [48] also showed significant improvement in QPE with the RF. Given that statem and the fact that any additional scaling of input variables is not needed in our study, i obvious that our model is suitable for operational application.
We have conducted the precipitation area estimation using MYN S-band radar d and AWS data. Microphysics is dependent on different climate and regions because of variability of the drop size distribution [49]. Therefore, in terms of QPE, when the locat Figure 11. Estimated precipitation areas of LDA and RF for two types (stratiform and a convective) of cases. Estimated dry (blue), estimated wet (red), observed dry (triangle), and observed wet (circle).

Discussion
Our results indicate that the best model is the tree-based methods (DT and RF), which does not require retraining once it is trained. The tree-based model is computationally efficient and easy to deploy at any machine. Wolfensberger et al. [47] also mentioned that the RF-based polarimetric radar QPE algorithm can easily be used operationally. Shin et al. [48] also showed significant improvement in QPE with the RF. Given that statement and the fact that any additional scaling of input variables is not needed in our study, it is obvious that our model is suitable for operational application.
We have conducted the precipitation area estimation using MYN S-band radar data and AWS data. Microphysics is dependent on different climate and regions because of the variability of the drop size distribution [49]. Therefore, in terms of QPE, when the location has changed, the values of dual-polarization variables related with microphysics information would be changed. However, estimation of precipitation area is less affected by locations due to the range of polarimetric variables, which indicates that precipitation signal is less dependent on the regions.

Conclusions
We have studied precipitation area estimation using ground-based measurements and remote sensing products. Several rain events classified as either stratiform, or convective are used for analysis. We considered three factors, rain sensors, thresholds, and models, to present a novel data framework for improving the estimation. As a result, Scenario 3 with AWS, Type 1 threshold, spatial model, yields better performance than other scenarios. With the data framework, a rigorous comparison of the classification methods for precipitation area estimation was conducted. It is found that the machine learning methods yield better predictive performance than the statistical methods. Specifically, the tree-based methods, such as decision trees and random forests, are attractive methods for both stratiform and convective storms.
This study classified rain events as one of two categories, stratiform or convective, according to their precipitation patterns. Typically, this classification is originated from spatial extension and temporal duration of precipitation. Due to various physical conditions and characteristics, it seems to be more effective to estimate precipitation areas separately for different categories. To this end, it is an imperative task to accurately classify an event in data preprocessing.
The results presented in this paper indicate that no single method exhibits clear superiority over others in different events. In those situations, multi-model inference is an alternative by combining multiple statistical and machine learning methods in order to improve overall predictive performance. In statistics, model averaging is often used to improve predictive ability and account for uncertainty due to model choice. The most common method is Bayesian model averaging (BMA) that places prior probability on each model and determines the posterior distribution as the weighted average of the individual models.
The complex physical processes governing the precipitation process have substantial variability over space and time. In this paper, we consider incorporating spatial information into the precipitation area estimation. The methods addressed in the paper can be extended to incorporate spatial-temporal information by adding a dynamic component to the estimation.
In this study, we have found that spatial classification outperforms non-spatial classification. This finding can be related to censoring radar data. Radar measurements are often subject to censoring due to detection limits. In this case, there are several methods to deal with this problem. Firstly, analyzing a complete data set by deleting censored data is a common approach, which is inefficient because the information by discarding the censored data is lost. The other common approach is to replace censored values with a fixed value such as detection limit, as we have done in this paper. It is well known that these naïve methods result in bias estimation. The censored radar measurements in the data sets have the same values replaced by the practical detection limits, which is quite noninformative in the classification. However, spatial classification can provide additional information for classifying precipitation area by borrowing information from near observations. The rain-detecting devices installed on the ground monitoring stations would be subject to measurement error. Since the variable of interest in this study is a binary variable, the measurement error problem is known as misclassification in statistics. We expect that future studies accounting for misclassification in the estimation will produce better predictive abilities than standard estimation.