Feasibility of Calculating Standardized Precipitation Index with Short-Term Precipitation Data in China

: At present, high-resolution drought indices are scarce, and this problem has restricted the development of reﬁned drought analysis to some extent. This study explored the possibility of calculating the standardized precipitation index (SPI) with short-term precipitation sequences in China, based on data from 2416 precipitation observation stations covering the time period from 1961 to 2019. The result shows that it is feasible for short-sequence stations to calculate SPI index, based on the spatial interpolation of the precipitation distribution parameters of the long-sequence station. Error analysis denoted that the SPI error was small in east China and large in west China, and the SPI was more accurate when the observation stations were denser. The SPI error of short-sequence sites was mostly less than 0.2 in most areas of eastern China and the consistency rate for the drought categories was larger than 80%, which was lower than the error using the 30-year precipitation samples. Further analysis showed that the estimation error of the distribution parameters β and q was the most important cause of SPI error. Two drought monitoring examples show that the SPI of more than 50,000 short-sequence sites can correctly express the spatial distribution of dry and wet and have reﬁned spatial structure characteristics.


Introduction
Drought is a natural disaster caused by a shortage of water, which is characterized by slow development, a long duration, and a wide range of impacts, with particularly severe effects on agriculture, society, the economy, and ecosystems [1][2][3]. In the context of global warming, some regions in the northern hemisphere have shown a significant drying trend [4,5]. Drought-related issues have been investigated widely [6][7][8][9]. Every year, China suffers huge economic losses due to drought because of its high frequency and the large area affected by drought disasters [10,11]. Therefore, it is important to improve drought monitoring and early warning systems and enhance drought disaster risk assessment methods. However, most of the tools used for drought monitoring only provide information over a very large spatial range, such as climate zoning [12][13][14]. In practical applications, it is more desirable to obtain high-resolution drought risk information. Dow et al. [15,16] stated that stakeholders are eager to obtain drought information in local areas corresponding to their jurisdictions. In addition, due to the impact of local precipitation, low spatial resolution data are not suitable for reflecting the changes in drought severity. Therefore, drought analysis at a higher spatial resolution would be better for capturing the dynamic details of drought occurrence, development, and the recovery process, as well as promoting the development of refined drought risk assessment procedures and helping to optimize drought prevention and mitigation programs.

Data
The data employed in this study comprised monthly precipitation data (the daily cumulant in a natural month) acquired from 2416 meteorological observatories covering the period from 1961 to 2019 (the length of the sequence without missing data was greater than 30 years) and 51,062 regional automatic precipitation observation stations from 2015 to 2019, provided by the National Climate Center of China Meteorological Administration. The spatial distribution of the stations is shown in Figure 1.
it is necessary to verify the feasibility of calculating SPI with short-term precipitation data in China.
The aim of the present study was to investigate the possibility of calculating the monthly scale SPI index for stations with short-term precipitation sequences in China and the spatial distribution characteristics of the SPI errors. The remainder of this paper is organized as follows. Section 2 introduces the data used in the study, the method employed for calculating the SPI index, and the three methods used to obtain the distribution parameters for stations with short-term precipitation sequences. Section 3 presents analyses of the main findings, including the characteristic analysis of SPI error, the reason analysis of the SPI error, and the drought monitoring examples based on the high-resolution SPI. The study is summarized and discussed in Section 4.

Data
The data employed in this study comprised monthly precipitation data (the daily cumulant in a natural month) acquired from 2416 meteorological observatories covering the period from 1961 to 2019 (the length of the sequence without missing data was greater than 30 years) and 51,062 regional automatic precipitation observation stations from 2015 to 2019, provided by the National Climate Center of China Meteorological Administration. The spatial distribution of the stations is shown in Figure 1.   Figure  1b). The data we use here are the data after quality control. Based on the methods of the climatological limit value check, regional limit value check, time consistency check and spatial consistency check method, Ren Zhihua et al. [30,31] studied the quality control plan of hourly precipitation data from regional automatic stations nationwide. The results show that the accuracy rate of hourly precipitation data after quality control can reach 98%. Table 1 shows the altitude statistics of short-sequence stations. From the data in Table 1, we can see that most of the sites are below 500 m above sea level, accounting for 62.2%.   Figure 1 shows that the density of sites in the east and west of China is quite different. Statistics show that 91% of the long-sequence stations are located east of 100 • E (Figure 1a), and 94% of the short-sequence precipitation observation stations are east of 100 • E ( Figure 1b). The data we use here are the data after quality control. Based on the methods of the climatological limit value check, regional limit value check, time consistency check and spatial consistency check method, Ren Zhihua et al. [30,31] studied the quality control plan of hourly precipitation data from regional automatic stations nationwide. The results show that the accuracy rate of hourly precipitation data after quality control can reach 98%. Table 1 shows the altitude statistics of short-sequence stations. From the data in Table 1, we can see that most of the sites are below 500 m above sea level, accounting for 62.2%.

SPI
The SPI is an indicator based on probability. The process employed to calculate this indicator first uses a certain distribution function to fit the precipitation over the same period, and based on the cumulative probability of a certain amount of precipitation, the quantile point of the normal distribution is the size of the SPI index. The gamma distribution is the preferred fitting function. The density function expression for this distribution is as follows: where α is the shape parameter, β is the scale parameter, x is the amount of precipitation, and Γ(α) = ∞ 0 y α−1 e −y dy is gamma function. The maximum likelihood estimates of the parameters α and β are:α where A = ln(x) − ∑ ln(x) n and n is the sample size. By integrating Formula (1), which is the distribution function of precipitation G(x), we have the following expression: where G(x) is the probability that the precipitation is less than or equal to x. A precipitation value of 0 may be present in the actual precipitation samples, so the distribution function for precipitation needs to be modified. The corrected distribution function is as follows: where q represents the probability of the precipitation being 0. According to the standard normal distribution, the probability distribution function is as follows: where Φ(t) is the probability that the random variable is less than or equal to t. The corresponding cumulative probability H(x) is calculated according to the actual amount of precipitation. The size of t calculated using the equation: H(x) = Φ(t), is the size of the SPI and it is obtained with the following expression.
Hence, H(x) is transformed to the standard normal random variable t with zero mean and variance one. That is, SPI follows the standard normal distribution. The expression above shows that calculating SPI requires a certain number of precipitation samples, and generally more than 30 years of data [22] to allow the calculation of a relatively stable q value and the two parameters α and β. The SPI is often divided into the categories shown in Table 2 in actual analyses [32].
A positive value in Table 2 indicates that the rainfall is higher than the average in the same period and a negative value indicates that the rainfall is lower than the average in the same period. In addition, the SPI indicator is a standard normalized index, so it has a corresponding relationship with the probability. The third column in Table 2 shows the probability for each category.

Method for Calculating the Precipitation Distribution Parameters for Stations with Short Sequences
In practice, not all stations have long enough observation data. Therefore, it is not possible to directly calculate the SPI based on historical observation data for sites with short sequences of observation records. According to research [16,29], the precipitation distribution parameters of short-sequence stations can be obtained based on the precipitation distribution parameters of the surrounding long-sequence stations. Three calculation methods are given below.

Nearest Neighbor Substitution Method
The spatial distance between the stations was calculated based on the latitude and longitude data for the stations. Based on the distance information, each short-sequence station can find the nearest long-sequence station. The precipitation distribution parameters for the stations with short sequences were regarded as equivalent to the parameters of the nearest neighbor station with a long sequence.

Regional Average Method
Based on the given value N and the spatial distance information for the stations, the N long-sequence stations could be found around the short-sequence station. The average values for the precipitation distribution parameters at the N long-sequence stations were used to calculate the SPI for the short-sequence station. Figure 2 shows a schematic diagram of solving the distribution parameters of the short-sequence station (marked by a fivepointed star) from the distribution parameters of the surrounding 4 long-sequence stations.
Atmosphere 2021, 12, x FOR PEER REVIEW 6 of 15 Figure 2. Schematic diagram illustrating the regional average method.

Kriging Interpolation Method
The kriging interpolation method, also known as the spatial autocovariance optimal interpolation method, is widely used in spatial interpolation problems in the field of geosciences [33]. The ordinary kriging interpolation method (it is assumed that the semivari-  The kriging interpolation method, also known as the spatial autocovariance optimal interpolation method, is widely used in spatial interpolation problems in the field of geo-sciences [33]. The ordinary kriging interpolation method (it is assumed that the semivariance function has a linear relationship with the distance) was applied in a similar manner to the regional averaging method described above for determining the precipitation distribution parameters with the same data.
Based on the above three methods, the precipitation distribution parameters of shortsequence stations can be obtained, and then the SPI of short-sequence stations can be realized. It is worth noting that this method does achieve the calculation of SPI, but the error size of SPI and which method can be used to obtain the most accurate SPI is unknown. In order to investigate the reliability of calculating the SPI with short-term precipitation sequences in China, data from 2416 observation stations with long precipitation sequences were used for verification. First, the precipitation distribution parameters and SPI values for each station were obtained based on the real historical data, which were treated as true values. Second, assuming that each long-sequence station is also a short precipitation sequence, the three indirect methods (Sections 2.3.1-2.3.3) were used to obtain the precipitation distribution parameters and SPI values for each station. Then, the SPI obtained by the two ways can be compared, so as to realize the reliability evaluation of the three indirect methods. Finally, based on the best of the three indirect calculation methods, the SPI of more than 50,000 regional automatic stations was obtained and applied to drought monitoring in China, which will be compared with the drought monitoring results obtained from long-sequence stations to verify the actual monitoring effect of high-resolution SPI. A conceptual model illustrating the verification process is shown in Figure 3.

The Characteristic Analysis of SPI Error
The monthly precipitation data were fitted using the gamma distribution. Table 3 shows the distribution fitting test based on Kolmogorov-Smirnov (KS) tests. In SPI calculation, the precipitation distribution function is obtained from the precipitation data of the same month, so it is necessary to check the precipitation distribution of

The Characteristic Analysis of SPI Error
The monthly precipitation data were fitted using the gamma distribution. Table 3 shows the distribution fitting test based on Kolmogorov-Smirnov (KS) tests. In SPI calculation, the precipitation distribution function is obtained from the precipitation data of the same month, so it is necessary to check the precipitation distribution of each month. The values given in Table 3 represent the percentage of 2416 sites that passed the KS test. Table 3 shows that except for a few stations, the monthly precipitation at almost all stations was subject to gamma distribution. Taking September as an example, Figure 4 shows the spatial distribution of the KS distribution test at each station.  The results of the KS distribution test for all months showed that most of the sites that failed the KS distribution test were located in North China. When calculating the precipitation distribution parameters of the short-sequence stations, the surrounding longsequence stations must have the same type of distribution function; otherwise, the spatial interpolation operation cannot be performed. Therefore, consistent precipitation distribution functions and excellent precipitation distribution fitting ability are necessary conditions for subsequent calculations. Table 3 shows that the gamma distribution meets this condition, so this paper selects the gamma distribution as the distribution function of precipitation.
The regional averaging method (Section 2.3.2) and kriging interpolation method (Section 2.3.3) require a predetermined value N to determine the number of surrounding longsequence stations. Affected by topography and other factors, stations with similar precipitation distribution functions are not necessarily distributed in a circle (for example, band structure). Therefore, the search for long-sequence stations cannot simply follow the circular pattern shown in Figure 2. Due to this, data from 2416 observation stations with long precipitation sequences were used to explore the best calculation strategy for short-sequence stations. The following method is used to search for the long-sequence stations around the short-sequence stations: (1) Use the KS distribution test to determine whether the monthly precipitation of the neighboring stations comes from the same distribution; that is, to determine whether the precipitation distributions of the two stations are similar. The top 50 stations that are closest to the precipitation distribution function of the central station are screened out. (2) The spatial distance between the stations is calculated from the longitude and latitude information of the stations, and the stations with too large a spatial distance in step 1 are eliminated based on the spatial distance information. The maximum distance is set at 400 KM in this paper. Figure 5 shows the stations with similar The results of the KS distribution test for all months showed that most of the sites that failed the KS distribution test were located in North China. When calculating the precipitation distribution parameters of the short-sequence stations, the surrounding long-sequence stations must have the same type of distribution function; otherwise, the spatial interpolation operation cannot be performed. Therefore, consistent precipitation distribution functions and excellent precipitation distribution fitting ability are necessary conditions for subsequent calculations. Table 3 shows that the gamma distribution meets this condition, so this paper selects the gamma distribution as the distribution function of precipitation.
The regional averaging method (Section 2.3.2) and kriging interpolation method (Section 2.3.3) require a predetermined value N to determine the number of surrounding long-sequence stations. Affected by topography and other factors, stations with similar precipitation distribution functions are not necessarily distributed in a circle (for example, band structure). Therefore, the search for long-sequence stations cannot simply follow the circular pattern shown in Figure 2. Due to this, data from 2416 observation stations with long precipitation sequences were used to explore the best calculation strategy for shortsequence stations. The following method is used to search for the long-sequence stations around the short-sequence stations: (1) Use the KS distribution test to determine whether the monthly precipitation of the neighboring stations comes from the same distribution; that is, to determine whether the precipitation distributions of the two stations are similar. The top 50 stations that are closest to the precipitation distribution function of the central station are screened out. (2) The spatial distance between the stations is calculated from the longitude and latitude information of the stations, and the stations with too large a Atmosphere 2021, 12, 603 8 of 14 spatial distance in step 1 are eliminated based on the spatial distance information. The maximum distance is set at 400 KM in this paper. Figure 5 shows the stations with similar precipitation distributions as the representative stations (in Figure 1a) in September. From the results in Figure 5, the stations with similar precipitation distributions are mostly located near the representative stations, and they are not evenly distributed in the circle. Based on these stations, the precipitation distribution parameters of each representative station can be obtained using the method introduced in Section 2.3, and then the SPI sequence can be obtained. We take station 54,906 in Figure 1a as an example, and Figure 6 shows the SPI sequence based on three methods. From the results in Figure 5, the stations with similar precipitation distributions are mostly located near the representative stations, and they are not evenly distributed in the circle. Based on these stations, the precipitation distribution parameters of each representative station can be obtained using the method introduced in Section 2.3, and then the SPI sequence can be obtained. We take station 54,906 in Figure 1a as an example, and Figure 6 shows the SPI sequence based on three methods.
According to the comparison of SPI sequence in Figure 6a, SPI calculated based on different indirect methods had very similar sizes. Large errors are likely to occur at the peak value of the sequence. To further verify this result, a comparative analysis of SPI is presented in Figure 6b. The Y-coordinate value in Figure 6b represents the true value of SPI, and the X-coordinate value represents the SPI calculated based on different indirect methods. Therefore, the more accurate the value of SPI calculated based on the indirect method is, the closer the position of scatter points is to the diagonal line in Figure 6b. According to the scatter distribution in Figure 6b, the SPI value of station 54,906 obtained based on the three indirect methods is almost equal to the true value of SPI. In the whole country, whether the SPI based on the indirect method can be approximately equal to the true value is of great significance. Similar to the calculation in Figures 5 and 6, we calculated the SPI sequence of all stations across the country based on three indirect methods. Compared with the true value of SPI, the error sequence of SPI is obtained, and the standard deviation (STD) of the SPI error sequence is obtained. The larger the standard deviation, the less accurate the SPI calculated by the indirect method, so this value can be used as an analysis of the SPI calculation error. Figure 7 shows the spatial distribution of the STD of SPI error across the country.
From the results in Figure 5, the stations with similar precipitation distributions are mostly located near the representative stations, and they are not evenly distributed in the circle. Based on these stations, the precipitation distribution parameters of each representative station can be obtained using the method introduced in Section 2.3, and then the SPI sequence can be obtained. We take station 54,906 in Figure 1a as an example, and Figure 6 shows the SPI sequence based on three methods. According to the comparison of SPI sequence in Figure 6a, SPI calculated based on different indirect methods had very similar sizes. Large errors are likely to occur at the peak value of the sequence. To further verify this result, a comparative analysis of SPI is presented in Figure 6b. The Y-coordinate value in Figure 6b represents the true value of SPI, and the X-coordinate value represents the SPI calculated based on different indirect methods. Therefore, the more accurate the value of SPI calculated based on the indirect method is, the closer the position of scatter points is to the diagonal line in Figure 6b. According to the scatter distribution in Figure 6b, the SPI value of station 54,906 obtained based on the three indirect methods is almost equal to the true value of SPI. In the whole country, whether the SPI based on the indirect method can be approximately equal to the true value is of great significance. Similar to the calculation in Figures 5 and 6, we calculated the SPI sequence of all stations across the country based on three indirect methods. Compared with the true value of SPI, the error sequence of SPI is obtained, and the standard deviation (STD) of the SPI error sequence is obtained. The larger the standard deviation, the less accurate the SPI calculated by the indirect method, so this value can be used as an analysis of the SPI calculation error. Figure 7 shows the spatial distribution of the STD of SPI error across the country. As shown in Figure 7, the spatial distribution characteristics of SPI errors obtained based on different indirect methods were very consistent, which all showed that the SPI error in eastern China was small and the SPI error in western China was large. In most areas of eastern China, the STD of the SPI error was less than 0.2. Among the three indirect calculation methods, the nearest neighbor substitution method performed better than the other two methods in some areas of western China, but its performance in most areas was inferior to those of the other two methods. The difference in the performance of the regional average method and kriging method was not obvious in eastern China. However, in the western region, the nearest neighbor substitution method performed better than the other two methods in some regions. Since the calculation of the distribution parameters of short-sequence stations completely depends on the surrounding long-sequence stations, the closer to the long-sequence stations, the more accurate the indirect calculation of SPI. Due to the dense distribution of long-sequence stations in the east (Figure 1a), the average distance from short-sequence stations to long-sequence stations in the east is much lower than the value in the west. It is for this reason that no matter which indirect calculation method is adopted in the eastern region, the error of SPI east of 100°E is significantly lower than the error of SPI west of 100°E. The actual applications of the SPI index include two ways: analysis based directly on the size of the SPI index and analysis based on drought categories (Table 2). Therefore, Figure 8 showed the spatial distribution of the accuracy of SPI classification. As shown in Figure 7, the spatial distribution characteristics of SPI errors obtained based on different indirect methods were very consistent, which all showed that the SPI error in eastern China was small and the SPI error in western China was large. In most areas of eastern China, the STD of the SPI error was less than 0.2. Among the three indirect calculation methods, the nearest neighbor substitution method performed better than the other two methods in some areas of western China, but its performance in most areas was inferior to those of the other two methods. The difference in the performance of the regional average method and kriging method was not obvious in eastern China. However, in the western region, the nearest neighbor substitution method performed better than the other two methods in some regions. Since the calculation of the distribution parameters of short-sequence stations completely depends on the surrounding long-sequence stations, the closer to the long-sequence stations, the more accurate the indirect calculation of SPI. Due to the dense distribution of long-sequence stations in the east (Figure 1a), the average distance from short-sequence stations to long-sequence stations in the east is much lower than the value in the west. It is for this reason that no matter which indirect calculation method is adopted in the eastern region, the error of SPI east of 100 • E is significantly lower than the error of SPI west of 100 • E. The actual applications of the SPI index include two ways: analysis based directly on the size of the SPI index and analysis based on drought categories (Table 2). Therefore, Figure 8 showed the spatial distribution of the accuracy of SPI classification. As shown in Figure 8, the spatial distribution structure was consistent with the spatial distribution structure for the SPI error (Figures 7). The accuracy of SPI classification in the eastern region was higher than the accuracy in the western region. The accuracy of SPI classification in most of eastern China was higher than 80%, but in parts of western China, it was lower than 60%. From the perspective of the size of the SPI error ( Figure 7) and the accuracy of the SPI classification (Figure 8), the calculation of high-resolution SPI in the eastern region would be more accurate.
Although the error of SPI in eastern China was mostly less than 0.2 and the accuracy of SPI classification could reach 80%, did this error fall within an acceptable range? In order to further analyze the credibility of the indirect methods, the resampling method was used to discuss how much error is acceptable. As we all know, it is recommended to use a longer precipitation sample period to calculate the SPI value [34,35] but sufficient observation data might not be available for a long period, so the calculation is generally based on a 30-year sample. However, due to randomness, the SPI index calculated by different 30-year precipitation samples is not consistent. Therefore, the SPI error due to the 30-year sample should be regarded as a tolerable error limit. Based on the precipitation data of each station, the resampled data of each station with 30 groups of precipitation data containing 30 years was obtained by using the resampling method. Based on the resampled data, the STD of the SPI error and the accuracy of SPI classification can be calculated, which is regarded as the error tolerance limit. The result showed that the average value of STD was almost 0.27 nationwide and the accuracy of SPI classification was almost 76%. Therefore, it is feasible and acceptable to calculate the SPI based on indirect methods in eastern China and the western region where the stations are concentrated.

The Reason Analysis of the SPI Error and the Drought Monitoring Examples
According to the SPI calculation principle introduced in Section 2.2, the error of SPI based on indirect methods was entirely due to the error of distribution parameters. Since the precipitation distribution function was calculated separately for each month, the errors of precipitation distribution parameters in different months were not consistent. Taking January as an example, Figure 9 shows the spatial distribution of parameter errors. As shown in Figure 8, the spatial distribution structure was consistent with the spatial distribution structure for the SPI error ( Figure 7). The accuracy of SPI classification in the eastern region was higher than the accuracy in the western region. The accuracy of SPI classification in most of eastern China was higher than 80%, but in parts of western China, it was lower than 60%. From the perspective of the size of the SPI error ( Figure 7) and the accuracy of the SPI classification (Figure 8), the calculation of high-resolution SPI in the eastern region would be more accurate.
Although the error of SPI in eastern China was mostly less than 0.2 and the accuracy of SPI classification could reach 80%, did this error fall within an acceptable range? In order to further analyze the credibility of the indirect methods, the resampling method was used to discuss how much error is acceptable. As we all know, it is recommended to use a longer precipitation sample period to calculate the SPI value [34,35] but sufficient observation data might not be available for a long period, so the calculation is generally based on a 30-year sample. However, due to randomness, the SPI index calculated by different 30-year precipitation samples is not consistent. Therefore, the SPI error due to the 30-year sample should be regarded as a tolerable error limit. Based on the precipitation data of each station, the resampled data of each station with 30 groups of precipitation data containing 30 years was obtained by using the resampling method. Based on the resampled data, the STD of the SPI error and the accuracy of SPI classification can be calculated, which is regarded as the error tolerance limit. The result showed that the average value of STD was almost 0.27 nationwide and the accuracy of SPI classification was almost 76%. Therefore, it is feasible and acceptable to calculate the SPI based on indirect methods in eastern China and the western region where the stations are concentrated.

The Reason Analysis of the SPI Error and the Drought Monitoring Examples
According to the SPI calculation principle introduced in Section 2.2, the error of SPI based on indirect methods was entirely due to the error of distribution parameters. Since the precipitation distribution function was calculated separately for each month, the errors of precipitation distribution parameters in different months were not consistent. Taking January as an example, Figure 9 shows the spatial distribution of parameter errors. Atmosphere 2021, 12, x FOR PEER REVIEW 12 of 15 From the analysis in Section 3.1, it could be seen that the kriging method was better than the other two indirect methods, so Figure 9 only showed the results based on the kriging method. The value in Figure 8 is the rate of change compared to the real value (take parameter as an example: the value in Figure 9 a is calculated based on 100%). As shown in Figure 9, the distribution parameters and had much greater changes than , with increases of 30% in western China. The parameter error analysis of other months shows that the error of the distribution parameters has obvious seasonal characteristics, and the results show that the error is the smallest in summer and the largest in winter. In addition, the error of the distribution parameter has the smallest change with the season, and parameter has the largest change. Therefore, the estimation error of the distribution parameters and is the most important cause of SPI error.
The previous SPI error analysis based on 2416 long-sequence stations shows that it is feasible to develop high-resolution SPI indicators in eastern China. For the SPI calculation of 51,062 short-sequence stations, firstly, the long-sequence station nearest to the shortsequence station is searched based on the longitude and latitude information. Secondly, based on the best indirect method of calculating the distribution parameters of the longsequence station (based on the results in Figure 7), the precipitation distribution parameters of the 51,062 short-sequence station are solved. Figures 10 and 11 shows the comparison of drought monitoring results in China.  From the analysis in Section 3.1, it could be seen that the kriging method was better than the other two indirect methods, so Figure 9 only showed the results based on the kriging method. The value in Figure 8 is the rate of change compared to the real value (take parameter α as an example: the value in Figure 9 a is calculated based on . As shown in Figure 9, the distribution parameters β and q had much greater changes than α, with increases of 30% in western China. The parameter error analysis of other months shows that the error of the distribution parameters has obvious seasonal characteristics, and the results show that the error is the smallest in summer and the largest in winter. In addition, the error of the distribution parameter α has the smallest change with the season, and parameter q has the largest change. Therefore, the estimation error of the distribution parameters β and q is the most important cause of SPI error. The previous SPI error analysis based on 2416 long-sequence stations shows that it is feasible to develop high-resolution SPI indicators in eastern China. For the SPI calculation of 51,062 short-sequence stations, firstly, the long-sequence station nearest to the shortsequence station is searched based on the longitude and latitude information. Secondly, based on the best indirect method of calculating the distribution parameters of the longsequence station (based on the results in Figure 7), the precipitation distribution parameters of the 51,062 short-sequence station are solved. Figures 10 and 11 shows the comparison of drought monitoring results in China. From the analysis in Section 3.1, it could be seen that the kriging method was better than the other two indirect methods, so Figure 9 only showed the results based on the kriging method. The value in Figure 8 is the rate of change compared to the real value (take parameter as an example: the value in Figure 9 a is calculated based on 100%). As shown in Figure 9, the distribution parameters and had much greater changes than , with increases of 30% in western China. The parameter error analysis of other months shows that the error of the distribution parameters has obvious seasonal characteristics, and the results show that the error is the smallest in summer and the largest in winter. In addition, the error of the distribution parameter has the smallest change with the season, and parameter has the largest change. Therefore, the estimation error of the distribution parameters and is the most important cause of SPI error.
The previous SPI error analysis based on 2416 long-sequence stations shows that it is feasible to develop high-resolution SPI indicators in eastern China. For the SPI calculation of 51,062 short-sequence stations, firstly, the long-sequence station nearest to the shortsequence station is searched based on the longitude and latitude information. Secondly, based on the best indirect method of calculating the distribution parameters of the longsequence station (based on the results in Figure 7), the precipitation distribution parameters of the 51,062 short-sequence station are solved. Figures 10 and 11 shows the comparison of drought monitoring results in China.   Figures 10 and 11 show that the monitoring results obtained based on different data are very consistent, and the results obtained based on 51,062 stations are more detailed than those obtained based on 2416 stations. The above results once again verify that it is feasible to carry out the calculation of the high-resolution SPI index in eastern China. It needs to be pointed out that it may be due to the error of indirect calculation of distribution parameters in winter and spring. The monitoring results during this period have certain errors, but the results in southern China and other seasons are consistent. The comparative analysis of drought monitoring over many months shows that it is reliable to carry out SPI calculations for short-sequence stations in various seasons in southern China and in the summer (autumn) season in most parts of China.

Summary and Discussion
In this study, in order to facilitate SPI calculations with short-term precipitation data, long-term precipitation data from 2416 stations in China were used to conduct a feasibility analysis. The results showed that it is feasible to calculate the SPI indirectly in eastern China. The SPI error was small in east China and large in west China, and the SPI was more accurate when the observation stations were denser. The standard deviation of the SPI error in eastern China was generally less than 0.2 and the category agreement lager than 80%, which was lower than the error using the 30-year precipitation samples. Error analysis showed that the kriging method and regional average method performed better than the nearest neighbor substitution method. The reason for the SPI error is the error generated by the indirect method to solve the distribution parameters, especially the distribution parameters and have the largest errors and seasonal differences. The analysis of two examples in China shows that the SPI distribution of 51,062 stations obtained by the indirect method is in good agreement with the results based on 2416 stations, which verifies that the calculation of SPI based on a short series of precipitation data in eastern China is reliable. However, it should be noted that during the winter and spring seasons, short-sequence stations in northern China have certain errors in the calculation of SPI.
As the precipitation distribution parameters are calculated from decades of precipitation data, they are susceptible to the impact of climate change, especially the stations on the boundary of the climate zone. Besides, the precipitation distribution parameters are obtained based on the precipitation data of the same month, so the spatial distribution of the precipitation distribution parameters in different months is not consistent. However, in the same season, the spatial distribution of the distribution parameters is relatively consistent. Therefore, the SPI is prone to large errors in the months of seasonal transition. On the other hand, due to the obvious seasonal rainfall in China, the parameter is almost  Figures 10 and 11 show that the monitoring results obtained based on different data are very consistent, and the results obtained based on 51,062 stations are more detailed than those obtained based on 2416 stations. The above results once again verify that it is feasible to carry out the calculation of the high-resolution SPI index in eastern China. It needs to be pointed out that it may be due to the error of indirect calculation of distribution parameters in winter and spring. The monitoring results during this period have certain errors, but the results in southern China and other seasons are consistent. The comparative analysis of drought monitoring over many months shows that it is reliable to carry out SPI calculations for short-sequence stations in various seasons in southern China and in the summer (autumn) season in most parts of China.

Summary and Discussion
In this study, in order to facilitate SPI calculations with short-term precipitation data, long-term precipitation data from 2416 stations in China were used to conduct a feasibility analysis. The results showed that it is feasible to calculate the SPI indirectly in eastern China. The SPI error was small in east China and large in west China, and the SPI was more accurate when the observation stations were denser. The standard deviation of the SPI error in eastern China was generally less than 0.2 and the category agreement lager than 80%, which was lower than the error using the 30-year precipitation samples. Error analysis showed that the kriging method and regional average method performed better than the nearest neighbor substitution method. The reason for the SPI error is the error generated by the indirect method to solve the distribution parameters, especially the distribution parameters β and q have the largest errors and seasonal differences. The analysis of two examples in China shows that the SPI distribution of 51,062 stations obtained by the indirect method is in good agreement with the results based on 2416 stations, which verifies that the calculation of SPI based on a short series of precipitation data in eastern China is reliable. However, it should be noted that during the winter and spring seasons, short-sequence stations in northern China have certain errors in the calculation of SPI.
As the precipitation distribution parameters are calculated from decades of precipitation data, they are susceptible to the impact of climate change, especially the stations on the boundary of the climate zone. Besides, the precipitation distribution parameters are obtained based on the precipitation data of the same month, so the spatial distribution of the precipitation distribution parameters in different months is not consistent. However, in the same season, the spatial distribution of the distribution parameters is relatively consistent. Therefore, the SPI is prone to large errors in the months of seasonal transition. On the other hand, due to the obvious seasonal rainfall in China, the parameter q is almost always 0 in the rainy season, so there is almost no interpolation error in this season. However, in the non-rainy season, local precipitation is often affected by topography or other factors, so the spatial distribution of parameter q is not smooth. For this reason, the interpolation error of q in the non-rainy season is relatively large, which further affects the calculation accuracy of SPI.
The SPI is a drought index with multiple time scales, but the present study only investigated the SPI on a monthly scale, and it did not verify the accuracy of the SPI with time scales of 3, 6, 12, and 24 months. However, it is considered that when the time scale is larger, the error will be smaller when using an indirect method to obtain the SPI because the spatial difference in rainfall will be lower over a longer time scale. All of the analyses conducted in this study were based on the gamma distribution function, but this function is not the best function for fitting the precipitation at all stations, and thus the SPI error could be affected by the distribution function selected. In addition, the same distribution function must be assumed when indirect methods are employed to determine the precipitation distribution parameters; otherwise, the SPI index cannot be calculated.