End of the Day (EOD) Judgment for Daily Rain-Gauge Data

: Several versions of Asian Precipitation-Highly-Resolved Observational Data Integration Towards Evaluation of Extreme Events (APHRODITE-2) have been released for analyzing rain-gauge-based daily precipitation. APHRODITE-2 constitutes an improvement compared with the previous versions for evaluating extreme precipitation. One advantage of APHRODITE-2 products (versions 1801R1 and 1901) over APHRODITE-1 products is the ability to ensure uniformity in the daily accumulation period (end of the day, EOD) used in a speciﬁc domain. To create these EOD segregated or EOD adjusted products, we applied an EOD judgment scheme using multi-satellite merged precipitation products CMORPH V1.0 and ERA-Interim reanalysis. The novelty of the current methodology was tested against rain-gauge datasets with known EOD information. Despite the di ﬀ erence in horizontal resolution, ERA-Interim shows similar EOD detection performance to CMORPH. However, CMORPH showed better performance over India than ERA-Interim. The current method has potential to judge EOD of rain-gauge station data with an unknown observation time. Having prior EOD information as metadata is very important for gridded datasets as well as station data. Here, we also present deterministic / estimated EOD that we used for V1801R1 over Monsoon Asia.


Introduction
Long-term meteorological datasets have a significant role in advancing our scientific understanding of the nature of climate change as well as projecting future changes. Analyses of climate data for various meteorological parameters can be used to infer the possible connections between extreme meteorological events and climate change. However, despite significant improvements in global/regional models, precipitation is still a challenging element to quantify and parameterize accurately through models and observations [1][2][3]. Although several platforms are considered as sources for precipitation datasets, rain-gauge measurements are treated as accurate measurements over land. Hence, rain-gauge-based datasets are considered as a reference for evaluating other precipitation estimates obtained by satellites and modeling. Rain-gauges deployed over different spatial domains (regional/continental) with high/low density are often converted to uniform grids of specific spatial/temporal resolutions with robust statistical interpolation/extrapolation methods to overcome rain-gauge point specific issue, i.e., to obtain areal precipitation, etc. [4,5]. Efforts are also made to try to improve the interpolation scheme, for instance, the representation of local geographical/physical factors such as topography, the climatology of a specific variable, and the spatial variability of parameters in interpolation methods.
In addition, we used global surface summary of the day (GSOD), which is a widely used set of GTS-based data. It is available at the website of the National Centers for Environmental Information (NCEI) of the National Oceanic and Atmospheric Administration (NOAA) (https://data. nodc.noaa.gov/cgi-bin/iso?id=gov.noaa.ncdc:C00516). GSOD provides information about 18 essential meteorological elements on a daily scale. Nearly 9000 weather stations collect information on 18 surface meteorological variables around the globe. GSOD is assembled from sub-daily or daily reports to GTS, and each set of precipitation data has a flag showing the quality information including processes to get "daily" precipitation value of the day which was applied at NCEI/NOAA relevant persons. Each "daily" precipitation sometimes involves the summation of three 6-hourly precipitation reports or one 12-hourly precipitation report (Table 1). Therefore, we have only considered those reports of which the accumulation represents daily precipitation. The three kinds of report are thus four 6-hourly reports (D), two 12-hourly reports (F), and one daily report (G). These different types of data source reports are indicated with different flags (e.g., D, F, and G), as mentioned in GSOD readme file (ftp://ftp.ncdc.noaa.gov/pub/data/gsod/readme.txt). Table 1. Flags of Global Summary of the Day (GSOD) precipitation data. Station reported '0' as the amount for the day but also reported at least one occurrence of precipitation I Station did not report any precipitation data for the day After the EOD check described in this study, we selected a station precipitation dataset to use country by country and year by year. A list of the countries for both offline and GSOD over Monsoon Asia is available in the Appendix A, with information on EOD and on whether or not we used it for APHRO_V1801R1.

CMORPH
The Climate Prediction Center (CPC) MORPHing technique, CMORPH [18,19], is a multi-satellite-based precipitation product derived by a morphing technique that exclusively makes use of low-level orbit passive microwave (PMW) observations to estimate precipitation. Precipitation features obtained from microwave observations are propagated in the spatial/temporal domain by a forward/backward morphing technique. Cloud system advection vectors derived from infrared measurements are used to propagate precipitation features over grids, where PMW observations are not available. The dataset is available with multiple spatial and temporal resolutions from 1998 to the present over the latitude range of 60 • N to 60 • S.
A reprocessed and bias-corrected version (V1.0) of CMORPH daily precipitation data has been released into the public domain. The latest version of CMORPH data has several advantages, such as fixed interpolation algorithms and microwave observations being considered from identical versions of passive microwave sensors. The updated version of CMORPH precipitation retrievals showed potential improvement over the earlier version in reproducing the spatial and temporal variability of precipitation. Moreover, the latest CMORPH precipitation estimates have been proven to be better than widely used TRMM-3B42 datasets at sub-daily/daily scales [19]. Although CMORPH Atmosphere 2020, 11, 772 4 of 15 precipitation estimates are available with various spatial/temporal resolutions, we used a special version of hourly precipitation estimates with 0.25 • latitude/longitude spatial resolution provided by Dr. Xie for EOD estimates.

ERA-Interim
EOD detection with satellite-based estimates of precipitation was impracticable in the pre-satellite era given the limited number of observations. Moreover, satellite-based estimates of precipitation do not work well over high altitudes. Thus, we used 3-hourly/0.75 • precipitation forecasts of ERA-Interim to estimate the EOD over various sub-continental regions. We compared the performance of ERA-Interim and CMORPH datasets to evaluate their strengths and weaknesses. ERA-Interim is a long-term global atmospheric reanalysis dataset produced by the European Centre for Medium-range Weather Forecasts (ECMWF). It contains multiple meteorological variables available from 1979 to August 2019. The ERA-Interim dataset was developed with a four-dimensional data assimilation scheme by incorporating analysis/forecast cycles [20]. In each cycle, the available observations, which do not include precipitation, are combined with prior information from a forecast model to estimate the evolving state of the global atmosphere and the surface underlying it. We used 3-h, 6-h, 9-h, 12-h precipitation forecasts starting with 00:00 h UTC (hereafter 00 UTC) and 12:00 h UTC (12 UTC) ( Figure 1). Each successive 3-h precipitation value used in this study is obtained by subtracting accumulated precipitation at consecutive time intervals. For example, 3-h precipitation between 03-06 UTC is obtained by subtracting 3-h forecast initialized at 00 UTC from 6-h forecast initialized at 00 UTC. The computational steps to estimate the 3-hourly precipitation from ERA-Interim measurements are shown in Figure 1. EOD detection with satellite-based estimates of precipitation was impracticable in the presatellite era given the limited number of observations. Moreover, satellite-based estimates of precipitation do not work well over high altitudes. Thus, we used 3-hourly/0.75° precipitation forecasts of ERA-Interim to estimate the EOD over various sub-continental regions. We compared the performance of ERA-Interim and CMORPH datasets to evaluate their strengths and weaknesses. ERA-Interim is a long-term global atmospheric reanalysis dataset produced by the European Centre for Medium-range Weather Forecasts (ECMWF). It contains multiple meteorological variables available from 1979 to August 2019. The ERA-Interim dataset was developed with a four-dimensional data assimilation scheme by incorporating analysis/forecast cycles [20]. In each cycle, the available observations, which do not include precipitation, are combined with prior information from a forecast model to estimate the evolving state of the global atmosphere and the surface underlying it. We used 3-h, 6-h, 9-h, 12-h precipitation forecasts starting with 00:00 h UTC (hereafter 00 UTC) and 12:00 h UTC (12 UTC) ( Figure 1). Each successive 3-h precipitation value used in this study is obtained by subtracting accumulated precipitation at consecutive time intervals. For example, 3-h precipitation between 03-06 UTC is obtained by subtracting 3-h forecast initialized at 00 UTC from 6-h forecast initialized at 00 UTC. The computational steps to estimate the 3-hourly precipitation from ERA-Interim measurements are shown in Figure 1.

Definition of EOD
EOD is one of the significant aspects through which APHRODITE-2 products constitute improvements over earlier versions. The estimated or deterministic EOD information was used for the time-adjusted product (APHRO_V1901) as well as to choose the station data for APHRO_V1801R1.
Rain-gauge measurements are available with different temporal intervals. Daily data may often be collected from different data holders within the same country/region using various 24-h accumulation periods. Each dataset of an individual country is generated according to the local organization's convenience following WMO guidance. For example, in the case of India, the Indian Meteorological Department measures the daily rain-gauge data in the morning at 08:30 h (IST, which corresponds to 03 UTC) and stamps on that specific day. Precipitation (in mm/day) stamped on 11th July is the actual precipitation accumulated from the previous day (10 July) 08:30 h to 11 July 08:30 h IST. Therefore, EOD is 3 UTC for IMD rain-gauge measurements (see Figure 2a). EOD may differ for each dataset collected from various data sources.
GSOD daily precipitation data are available with different rain flags D, F, and G. Precipitation accumulated with D, F, and G flags represents the summation of four reports of 6-h precipitation (D), two reports of 12-h precipitation (F), and one report of 24-h precipitation (G); however, they are not

Definition of EOD
EOD is one of the significant aspects through which APHRODITE-2 products constitute improvements over earlier versions. The estimated or deterministic EOD information was used for the time-adjusted product (APHRO_V1901) as well as to choose the station data for APHRO_V1801R1.
Rain-gauge measurements are available with different temporal intervals. Daily data may often be collected from different data holders within the same country/region using various 24-h accumulation periods. Each dataset of an individual country is generated according to the local organization's convenience following WMO guidance. For example, in the case of India, the Indian Meteorological Department measures the daily rain-gauge data in the morning at 08:30 h (IST, which corresponds to 03 UTC) and stamps on that specific day. Precipitation (in mm/day) stamped on 11th July is the actual precipitation accumulated from the previous day (10 July) 08:30 h to 11 July 08:30 h IST. Therefore, EOD is 3 UTC for IMD rain-gauge measurements (see Figure 2a). EOD may differ for each dataset collected from various data sources.
Atmosphere 2020, 11, x FOR PEER REVIEW 5 of 15 associated with EOD information. Mixing such kinds of datasets, especially GSOD data with different flags with offline data, may reduce the extreme values. In a previous version of APHRODITE products, namely APHRO_V1101, we mixed all available data including GSOD with flags other than D, F, and G as well as various offline data. Mixing precipitation data with varying EOD may change the nature of the event. To avoid such data mixing ambiguity, the APHRODITE-2 team carefully selected data with unmixed EOD to be used for generating APHRO_V1801R1 gridded products (see Appendix A).
(a) (b)  GSOD daily precipitation data are available with different rain flags D, F, and G. Precipitation accumulated with D, F, and G flags represents the summation of four reports of 6-h precipitation (D), two reports of 12-h precipitation (F), and one report of 24-h precipitation (G); however, they are not associated with EOD information. Mixing such kinds of datasets, especially GSOD data with different flags with offline data, may reduce the extreme values. In a previous version of APHRODITE products, namely APHRO_V1101, we mixed all available data including GSOD with flags other than D, F, and G as well as various offline data. Mixing precipitation data with varying EOD may change the nature of the event. To avoid such data mixing ambiguity, the APHRODITE-2 team carefully selected data with unmixed EOD to be used for generating APHRO_V1801R1 gridded products (see Appendix A).

Methodology for EOD Estimation
EOD is estimated for each rain-gauge as a specific data source using statistical metrics such as correlation coefficient (r) and root mean square error (RMSE). We used the updated version (V1.0) of the CMORPH and ERA-interim dataset to estimate EOD information for each rain-gauge location.
The r and RMSE between referenced rain-gauge daily precipitation data and the nearest CMORPH 0.25 • grid pixel (ERA-interim 0.75 • grid pixel) daily accumulated precipitation are computed. Namely, we obtain 72 correlation coefficients (r), between a daily rain-gauge station precipitation and CMORPH "daily" precipitation with different SOD (−24 to 47). Figure 3 shows an example of r and RMSE between the daily station precipitation time series and 72 different CMORPH "daily" time series of the Philippines for 1998. After computing r and RMSE for each station at each CMORPH time series, we averaged r and RMSE of the 52 stations in the Philippines. The averaged highest r and lowest RMSE were obtained at SOD = 0 (EOD = 24) (Figure 3), which is the correct 24-h accumulation period for the Philippines.
Atmosphere 2020, 11, x FOR PEER REVIEW 6 of 15 EOD is estimated for each rain-gauge as a specific data source using statistical metrics such as correlation coefficient (r) and root mean square error (RMSE). We used the updated version (V1.0) of the CMORPH and ERA-interim dataset to estimate EOD information for each rain-gauge location.
The r and RMSE between referenced rain-gauge daily precipitation data and the nearest CMORPH 0.25° grid pixel (ERA-interim 0.75° grid pixel) daily accumulated precipitation are computed. Namely, we obtain 72 correlation coefficients (r), between a daily rain-gauge station precipitation and CMORPH "daily" precipitation with different SOD (−24 to 47). Figure 3 shows an example of r and RMSE between the daily station precipitation time series and 72 different CMORPH "daily" time series of the Philippines for 1998. After computing r and RMSE for each station at each CMORPH time series, we averaged r and RMSE of the 52 stations in the Philippines. The averaged highest r and lowest RMSE were obtained at SOD = 0 (EOD = 24) (Figure 3), which is the correct 24-h accumulation period for the Philippines. For satellite precipitation data and for reanalysis precipitation data, we often use SOD to show the time of observation/forecasts. However, rain-gauge observations are often recorded when the hydro-meteorological service measures precipitation, which is EOD. Hence, we mainly use the word EOD rather than SOD for our analysis. Several earlier studies reported that satellite-based remote sensing products underperform in winter and over cold regions [19,[21][22][23][24][25]. Thus, we estimated EOD for summer (JJAS) as well as the whole year.
Following the computation of EOD for each rain-gauge, we selected rain-gauges with a correlation coefficient equal to or greater than 0.4 (r ≥ 0.4) and the number of valid days (data availability) as 270 days to obtain stable EOD information (Figures 4 and 5). The thresholds for the summer are similar to the annual ones, except that the number of valid days is relaxed to 90 days ( Figure 4). The r and number of valid days are relaxed to 0.1 and 50 days for the GSOD data ( Figure 6). For satellite precipitation data and for reanalysis precipitation data, we often use SOD to show the time of observation/forecasts. However, rain-gauge observations are often recorded when the hydro-meteorological service measures precipitation, which is EOD. Hence, we mainly use the word EOD rather than SOD for our analysis. Several earlier studies reported that satellite-based remote sensing products underperform in winter and over cold regions [19,[21][22][23][24][25]. Thus, we estimated EOD for summer (JJAS) as well as the whole year. Following the computation of EOD for each rain-gauge, we selected rain-gauges with a correlation coefficient equal to or greater than 0.4 (r ≥ 0.4) and the number of valid days (data availability) as 270 days to obtain stable EOD information (Figures 4 and 5). The thresholds for the summer are similar to the annual ones, except that the number of valid days is relaxed to 90 days (Figure 4). The r and number of valid days are relaxed to 0.1 and 50 days for the GSOD data ( Figure 6).
The proposed methodology is also applied to estimate EOD information using ERA-Interim data, such that the suitability of ERA-Interim data could be evaluated and hence would be used in the pre-satellite era (before 1979) to determine EOD information. Moreover, it would also be very useful to compare the detection ability of EOD estimates obtained from two kinds of reference datasets, such as CMORPH and ERA-Interim.

Offline Data
We selected datasets with the same EOD information, at least at the national domain, for generating gridded product APHRO_V1801R1. Figure 5 shows the spatial distribution of rain-gauge station data showing EOD information at offline stations for the year 2008. The original station datasets are not always the same through the target period for APHRO_V1801R1 (1998-2015, see Appendix A). The color represents the estimated EOD at each rain-gauge location. EOD information shared by rain-gauges appears to be similar at least nationally. However, rain-gauge locations over China and Nepal show different EOD, which highlights that the EOD of datasets does not need to be similar despite being generated for the same country. For instance, countries such as China show the spatial distribution of rain-gauge station with different EOD, namely, most of them are in black (EOD = 24) but there are also many red (EOD = 12) stations. Mixing such data with EOD information differing by more than several hours may degrade the quality of the gridded product, especially for the three flagged data, and number of each flagged input data is different from year to year. It offers two sets of daily precipitation datasets with an EOD difference of 12 h. Segregation of such kinds of flagged data (D, F, and G) according to the nature of the daily accumulation period would help data users to select and combine GSOD data with local data for correct utilization. The EOD information of each dataset (both offline and GSOD) considered for V1801R1 for a specific country is tabulated in the Appendix. Figure 5. Spatial distribution of offline rain-gauge data collection for APHRODITE-2 in 2008. The color represents the EOD estimated by CMORPH at each rain-gauge location. The estimated EOD is indexed with different colors. Each color represents a temporal window of EOD estimates. For instance, black represents rain-gauge stations with EOD ≥21 and <27 UTC. Similarly, dark blue represents EOD ≥27 and <33 UTC. Figure 5. Spatial distribution of offline rain-gauge data collection for APHRODITE-2 in 2008. The color represents the EOD estimated by CMORPH at each rain-gauge location. The estimated EOD is indexed with different colors. Each color represents a temporal window of EOD estimates. For instance, black represents rain-gauge stations with EOD ≥ 21 and <27 UTC. Similarly, dark blue represents EOD ≥ 27 and <33 UTC. Comparing Figure 5 and Figure 6, GSOD ( Figure 6) has more early EOD data (EOD ≤ 15), which is attributed from the nature of GTS data. It is considered four 6-hourly reports (D) and two 12-hourly reports (F) are summed up according to the day (24-h interval) that NCEI/NOAA stamped at EST/EDT of the United States.

Evaluation of EOD Estimation Methodology
The EOD detection skills of CMORPH and ERA-Interim with deterministic EOD precipitation during the whole year and the summer, 2006, are shown in Figure 4 for three different countries. We considered offline rain-gauge datasets for three countries, India, China, and Japan, with deterministic (known) EOD 3 UTC, 24 UTC, and 15 UTC for evaluation, respectively. The number of counts showing exact EOD is larger for CMORPH than for ERA-Interim in all 6 panels. Both reference datasets estimate the exact EOD by the maximum number of stations, except for ERA-Interim estimates of India. It is noted that estimated EOD distribution is narrower in Japan than that of the other two countries. The widespread distribution of stations over India and China may have been contributed to by the quality of rain-gauge data or quality issues of satellite-based estimates of precipitation. We will see the distribution of estimated EOD in the next section. The dominant occurrence of convective precipitation in India may have contributed to higher counts by the higher resolution of CMORPH than that of ERA-Interim. Interestingly, CMORPH tends to show more counts in earlier time before the exact (correct) time than that of ERA-Interim, and vice versa. As both datasets perform well over China, the current method can be assumed to better estimate the EOD information over that country. Despite the lower resolution, ERA-Interim notably performs well over Japan. However, overall, CMORPH appears to perform better than ERA-Interim.
Overall, there was no notable difference in the EOD distribution and spatial distribution of stations with deterministic EOD when considering just the summer (JJAS). Over Japan, some stations located in the north of the country are affected by snow, however, EOD estimates of the whole year and that of only JJAS are not different from each other. Impact of snow condition on EOD estimate by CMORPH seems to be small in Japan. In the case of EOD estimates by ERA-Interim, stations with deterministic EOD information are reduced when restricting the time length to JJAS in India. Moreover, the number of stations with abnormal earlier EOD information is decreased in the summer in India. With the current evaluation results, the EOD methodology is applied for offline and GSOD data. The offline and GSOD results by using CMORPH are shown in Sections 3.2 and 3.3, respectively.

Offline Data
We selected datasets with the same EOD information, at least at the national domain, for generating gridded product APHRO_V1801R1. Figure 5 shows the spatial distribution of rain-gauge station data showing EOD information at offline stations for the year 2008. The original station datasets are not always the same through the target period for APHRO_V1801R1 (1998-2015, see Appendix A). The color represents the estimated EOD at each rain-gauge location. EOD information shared by rain-gauges appears to be similar at least nationally. However, rain-gauge locations over China and Nepal show different EOD, which highlights that the EOD of datasets does not need to be similar despite being generated for the same country. For instance, countries such as China show the spatial distribution of rain-gauge station with different EOD, namely, most of them are in black (EOD = 24) but there are also many red (EOD = 12) stations. Mixing such data with EOD information differing by more than several hours may degrade the quality of the gridded product, especially for representing extreme events. The daily accumulation timestamps, namely EOD, of most of the Asian countries' datasets appear to be ≥3 and <27 UTC, but some stations show later value (blue dots, showing EOD ≥ 33 and <39). The spatial consistency in EOD information shared by station data appears to be more consistent over East Asian countries than in West Asian countries.
The EOD estimates obtained with the ERA-Interim dataset were compared with the EOD output derived from CMORPH data (figure not shown). Overall, there was good agreement in terms of spatial homogeneity of EOD information estimated by the two kinds of reference datasets (CMORPH, ERA-Interim) with different temporal/spatial resolutions. Variability of sub-daily precipitation might also be attributed to variation in EOD information.

GSOD Data
The timestamp information of GSOD daily precipitation data (D, F, and G) generated from sub-daily/daily reports should be crucial for data users when using GSOD data for different research applications. Users may either consider or convert native rain-gauge station data to merge with GSOD data, such that conflicts of data accumulation time can be resolved. The EOD estimation method is applied to GSOD data to reveal the unknown timestamp of daily precipitation data with different flags (D, F, and G) over the Monsoon Asia (MA) region, since EOD information was necessary to make APHRO_MA_V1801R1 as well as APHRO_MA_V1901. Figure 6 shows the EOD distribution of GSOD daily precipitation data accumulated from a combination of different flags for the year 2015. The reason to choose a different year with that of offline data ( Figure 5) was 1) 2015 is the latest period that we created the above two products and 2) relatively large number of offline data were assembled for 2008.
For some of the Asian countries, two kinds of timestamp of GSOD data are depicted with EOD information differing by~12 to 24 h. The difference in daily timestamp for rain-gauge locations is accounted for by the original temporal resolutions (D, F, and G). For instance, GSOD China contains the three flagged data, and number of each flagged input data is different from year to year. It offers two sets of daily precipitation datasets with an EOD difference of 12 h. Segregation of such kinds of flagged data (D, F, and G) according to the nature of the daily accumulation period would help data users to select and combine GSOD data with local data for correct utilization. The EOD information of each dataset (both offline and GSOD) considered for V1801R1 for a specific country is tabulated in the Appendix A.
Comparing Figures 5 and 6, GSOD ( Figure 6) has more early EOD data (EOD ≤ 15), which is attributed from the nature of GTS data. It is considered four 6-hourly reports (D) and two 12-hourly reports (F) are summed up according to the day (24-h interval) that NCEI/NOAA stamped at EST/EDT of the United States.

Impact of EOD on the Evaluation of Extreme Events
Previous sections reveal that EODs are different over China within each dataset (offline and GSOD) and among the datasets as shown in Figures 5 and 6. Thanks to some international collaborations, APHRODITE-2 obtained 12-hourly precipitation data over China. Then we could sum up that precious offline data for both EOD = 12 and EOD = 24. Actually, APHRODITE-1 products (APHRO_V0902, APHRO_V1003, and APHRO_V1101) are generated with such mixed EOD datasets. However, we segregated such input data that differ from other EOD for APHRO_V1801R1. Hence, we evaluated the impact of EOD on the APHRO_V1801R1 gridded precipitation product over certain selected countries in Asia. Qualitative/quantitative assessment of the APHRO_V1801R1 product was performed to check the impact of EOD on extreme events. The precipitation gridded products were generated with unmixed (rain-gauge datasets with EOD = 12) and mixed EOD datasets (EOD with 12, 24, and GSOD data) over China for 2007. Figure 7a,b shows the difference in the spatial distribution of precipitation events and precipitation fractions computed for events more significant than 50 mm over China. The experimental design is clearly explained in a previous paper [16]. The east of China, which is generally a wet part of the country, shows a significant impact of EOD on the gridded product. The EOD-implemented version of APHRO_V1801R1 gridded product shows larger (red) and smaller (blue) values of precipitation events with intensity greater than 50 mm. The resultant precipitation fraction from events greater than 50 mm is unevenly distributed over China (Figure 7b). As discussed in an earlier section, mixing rain-gauge datasets with a different EOD stamp may underestimate (overestimate) the number of extreme precipitation events in the regions shown in red (blue). Quantitative assessment of EOD impact on APHRO_V1801R1 is evaluated with a total number of events greater than 50 mm recorded over a country. Figure 7c,d illustrates the total number of precipitation events greater than 50 mm over the Philippines, China, and Japan for 2007. Total number of precipitation events is computed with unmixed EOD (YES) datasets and mixed EOD (NO) datasets. Extreme precipitation events estimated from gridded products generated with mixed EOD data are less than the product generated with unmixed (segregated) EOD. Gridded product with mixed EOD datasets such as local data and GSOD data certainly alters the magnitude of actual precipitation events due to the mixing of datasets with differentiable EOD. The massive difference in the number of events greater than 50 mm may be attributed to contamination of the actual precipitation by the previous day's precipitation while averaging and/or interpolating across all precipitation stations available over a grid pixel.
in an earlier section, mixing rain-gauge datasets with a different EOD stamp may underestimate (overestimate) the number of extreme precipitation events in the regions shown in red (blue). Quantitative assessment of EOD impact on APHRO_V1801R1 is evaluated with a total number of events greater than 50 mm recorded over a country. Figure 7c,d illustrates the total number of precipitation events greater than 50 mm over the Philippines, China, and Japan for 2007. Total number of precipitation events is computed with unmixed EOD (YES) datasets and mixed EOD (NO) datasets. Extreme precipitation events estimated from gridded products generated with mixed EOD data are less than the product generated with unmixed (segregated) EOD. Gridded product with mixed EOD datasets such as local data and GSOD data certainly alters the magnitude of actual precipitation events due to the mixing of datasets with differentiable EOD. The massive difference in the number of events greater than 50 mm may be attributed to contamination of the actual precipitation by the previous day's precipitation while averaging and/or interpolating across all precipitation stations available over a grid pixel.

Summary
This paper proposes a novel approach to judge the end of the day (EOD) timestamp for rain-gauge datasets across the globe with the help of satellite-based precipitation estimates and reanalysis of datasets. Bias-corrected CMORPH V1.0 precipitation data were used for the current approach. The scope of ERA-Interim data was validated to judge EOD as estimated by CMORPH, such that it would be useful to determine the EOD of rain-gauge data obtained during the pre-satellite era. The current methodology was applied to rain-gauge station datasets (1998-2015) collected from various countries by APHRODITE projects. The present method was evaluated for certain selected countries with known EOD (deterministic). Based on the evaluation results, it was concluded that CMORPH works better to judge the approximate EOD of rain-gauge stations over India. Although ERA-Interim has coarser temporal/spatial resolution than that of CMORPH, it estimates almost similar EOD distribution with that of CMORPH. To retain the nature of GSOD flagged daily precipitation data, the proposed method was also implemented for GSOD data. The methodology adopted in this study should be very helpful to judge EOD information of rain-gauge data with unknown EOD information. Having EOD information would be beneficial for users of rain-gauge data to avoid any ambiguity regarding timestamps.
Concerning the EOD judgment results, the estimated EOD for offline datasets over most of Asia appeared to be around ≥3 and <27 (e.g., India (3 UTC), China (12,24 UTC), and Japan (15 UTC)). EOD information of GSOD data reflected the effect of the combination of rain-gauge data with different accumulation periods, but generally, EOD of GSOD is the same or earlier than that of offline. The impact of EOD on APHRODITE gridded product was evaluated qualitatively/quantitatively over three selected countries. The spatial distribution of extreme events and precipitation fraction revealed a significant impact of EOD on the gridded product. Based on a test for China, the number of extreme events obtained with unmixed EOD was shown to be much higher than the extreme events captured with mixed EOD.