Evaluation of Methods for Estimating Long-Term Flow Fluctuations Using Frequency Characteristics from Wavelet Analysis

: This study was aimed at exploring di ﬀ erent indices to quantify ﬂ ow ﬂ uctuations and calculate long-term ﬂ ow indicators (L-FFI). Three approaches were considered to calculate the indicators: Method (1)—calculate the annual index and then average it; Method (2)—average the annual ﬂ ow characteristics and then calculate the index; and Method (3)—calculate the index considering all available data. Wavelet analysis was performed to evaluate the derived L-FFI. The evaluation index was based on the period corresponding to the highest spectral power from the wavelet transformation of seasonally di ﬀ erenced data. Strong and negative positive correlations were observed between the L-FFI and the high-and low-ﬂ ow variations, respectively. The correlation coe ﬃ cient ( 𝑅 ) between L-FFIs and the frequency with maximum global wavelet power showed that Method (2) consistently yielded the most reliable results across various facets, having a determination coef-ﬁ cient of 0.73 ( 𝑅 (cid:2870) ) on average. In the regionalization analysis using the Ward method, it was consistently observed that the two largest dams (the Chungju Dam and the Uiam Dam) were signi ﬁ - cantly di ﬀ erentiated from the other dams. Furthermore, Method (2) showed the most similar characteristics to the clustering of the wavelet features. The outcomes are expected to facilitate long-term water resource management.


Introduction
A notable characteristic of climate in Korea is that the rainfall is concentrated in the summer season, which leads to significant variations in the flow discharge in a short period. These flow fluctuations are closely related to the hydrological and ecological stability of rivers. In addition, flow fluctuations considerably influence the efficiency of water resource utilization and pose potential risks associated with flooding. Therefore, it is necessary to examine the long-term flow variability for a specific basin (or watershed), and considerable research has been focused on flow fluctuations [1][2][3][4][5].
Various indices are available for quantifying the variability in flow fluctuations. A representative flow fluctuation index (FFI) is the river regime coefficient (RRC), which is the ratio of the maximum flow discharge to the minimum flow discharge in a certain period. Several researchers have focused on the RRC in the Korean environment. Won [6] estimated the RRC based on flow discharge data of the five major river basins in Korea in the 1920s. Later, Park [7] estimated the RRC using flow discharge data of the Geum river in Korea. Lee and Woo [8] and Lee et al. [9] estimated the RRC using flow discharge data of the major five rivers in Korea, similar to [6].
In the 2000s, several researchers examined the indices for various flow fluctuation characteristics. Lee [10] analyzed the flow variability of the Han and Nakdong rivers from 1999 to 2001. Unlike the previous studies, Lee [10] considered not only the maximum and minimum flow but also indices based on ordinary discharge, low discharge, and drought discharge. Subsequently, Handayani [11] estimated the annual RRCs from 2006 to 2015 based on discharge data from the Rokan river in Sumatra, Indonesia, where flood damage frequently occurs, and analyzed the temporal variations in the RRCs. Unlike [10], who focused on a multi-year period, Handayani [11] set the analysis period as one year. Overall, different researchers have used different methods to calculate long-term indices, leading to a certain subjectivity in the findings. Although the indices vary depending on the target period, to the best of our knowledge, none of the existing studies have investigated the variation in the long-term FFI (L-FFI) indices according to the target period or estimation method.
The wavelet analysis technique is commonly used for examining periodic properties and has proven to be an effective tool for time series analyses across various academic fields [12]. Numerous instances of its application can be found in hydrology and meteorology [13][14][15][16][17]. Moreover, recent research has focused on integrating the frequency characteristics obtained through wavelet analysis with machine learning for prediction or forecasting, and promising results have been obtained [18][19][20][21]. Some studies have confirmed the validity of using wavelet analysis to analyze the patterns of dam inflow and river outflow [22][23][24]. Thus, the frequency characteristics derived from wavelet analysis, which are closely related to the flow fluctuations, could serve as an evaluation criterion for assessing L-FFI.
On the other hand, there have been studies related to regionalization, which clusters basins by identifying and interpreting patterns inherent in hydrological features such as these flow characteristics. Most studies have utilized this for regional flood frequency analysis. Cluster analysis methods are predominantly used for regionalization. These can be broadly divided into hierarchical algorithms and partitioning algorithms. A representative method for hierarchical algorithms is Ward's method [25], and for partitioning algorithms K-means [26] can be mentioned. Hybrid algorithms, which combine these algorithms, are also frequently employed. Various attempts continue to be made to advance hydrological regionalization up to recent times [27][28][29][30][31][32][33]. If such regionalization from L-FFI and wavelet analysis is further conducted, then its applicability in long-term water resource management could be significantly enhanced.
In this study, we evaluated several methods for estimating L-FFI based on the characteristics exhibited by long-term data in the time-frequency domain. While there have been studies that aimed to obtain flow characteristics through flow-related indices or wavelet analysis, research that seeks to integrate both is a novel endeavor. To this end, we first investigated the indices for quantifying the flow fluctuations, considering three methods for estimating the L-FFI. The frequency characteristics were defined using the frequency with the highest spectral power from the result of the wavelet transformation of seasonally differenced data. The methods for L-FFI were evaluated based on these frequency characteristics. Furthermore, based on the results of the L-FFI and frequency characteristics, we conducted regionalization of the dam basin using hierarchical clustering with the Ward algorithm and discuss the implications.

Study Area and Data
The study objects were eight domestic dam basins, for which long-term inflow data were extracted. We selected basins that are not significantly affected by upstream hydraulic structures. The analysis period encompassed 36 years, from 1986 to 2021. Daily discharge data for the dam basins were obtained through the daily inflow data provided by the Water Resources Management Information System (http://www.wamis.go.kr, accessed on 1 April 2023) in Korea. Figure 1 shows the considered basins, river networks, and dam locations. Table 1 summarizes the basic information of the dam and dam basin.

FFI
The RRC is a representative index for quantifying flow fluctuations. Additionally, specific flow characteristics based on discharge that occurs for more than a specific number of days in a year (i.e., daily discharge occurring in a specific order throughout the year) can be used for evaluation. This parameter, known as the flow regime, is commonly used in hydrological research [34]. In this study, periods of 10, 95, 185, 275, and 355 d were selected, corresponding to flood discharge, abundant discharge, ordinary discharge, low discharge, and drought discharge, respectively. Such discharges are typically expressed as exceedance probabilities. Thus, the discharges considered in this study corresponded to exceedance probabilities of 2.7%, 26.0%, 50.7%, 75.3%, and 97.3%, defined as . , . , . , . , and . , respectively. The RRC can be formulated as follows: where and denote the maximum and minimum discharge at a specific point in the stream in a specific period, respectively. In estimating the RRC, the target period has been set as the specific period by various researchers. For example, Lee and Woo [8] estimated the RRC for several decades before and after the construction of five major dams in Korea to evaluate the change in the RRC owing to the dam construction. The average value of the RRCs within the long-term period was considered for calculating the longterm RRC. Lee [10] compared the changes in the annual RRC in Korea using daily discharge data from 1999 to 2001, for which a substantial amount of discharge data were available compared with those in the other periods. In contrast, certain researchers considered a period spanning several decades, depending on the objective for estimating the RRC.
Because the RRC is sensitive to extreme discharge values observed during drought or flood periods, it cannot effectively reflect the variations in the average flow fluctuations. A promising alternative is the flow regime coefficient ( ) [9], expressed as Although the meaning of this coefficient is similar to that of the RRC, it can overcome the abovementioned limitation of the RRC.
In addition, Park [35] proposed the flood coefficient ( ) and abundance coefficient ( ) to quantify the flow fluctuations above the ordinary discharge ( 50.7 ): where and exceed 1. Smaller values of the coefficients indicate a higher similarity of and . to . , i.e., reduced flow fluctuations above the ordinary discharge. Considering that there may be a difference in the flow fluctuations above or below the ordinary discharge, Park [35] proposed a low coefficient ( ) and drought coefficient ( ), expressed as where and are smaller than 1, and as their value increases, the values of . and . approach that of . . In other words, the flow fluctuations of the river decrease. Furthermore, Park [36] investigated the flow fluctuations based on abundant discharge. The ratio of to and that of to were defined as the variance of the drought coefficient ( ) and the variance low coefficient ( ), respectively.
Additionally, we considered the coefficient of variance ( ), which is widely used in various fields.
, also known as the relative standard deviation, can be used for comparing variables with different absolute scales. In this study, in addition to the commonly used mean values, we considered the index based on median values. The two indices are distinguished by the subscripts: where and denote the average discharge (m 3 /s) and median discharge (m 3 /s), respectively.
is the standard deviation (m 3 /s). and are the coefficients of variation based on and , respectively. Table 2 summarizes the FFIs mentioned earlier and provides a brief description of each.

Wavelet Transform
There are various methods to analyze signals in the time domain; for example, ARIMA (auto regressive integrated moving average) [37], exponential smoothing [38], SSM (state space models) [39,40], VAR (vector autoregressions) [41], decomposition [42,43], approximation of differential equations [44,45], etc. Among them, conventionally, Fourier transformations have been widely used to analyze signals in the time domain by transforming them to the frequency domain [46,47]. As the Fourier transform uses sine and cosine functions as the basis functions, it is particularly useful in analyzing the overall periodicity rather than local changes. However, if the time series data are non-stationary, the periodic characteristics may not be consistent throughout the data period. This problem is commonly encountered with most time series data observed in nature. To overcome this limitation, the concept of time-frequency analysis has been introduced [48,49]. Time-frequency analysis based on Fourier transform shows the variation in the relative importance of the sine or cosine function at each time point.
Although time-frequency analyses based on Fourier transformations represent a valuable tool, they involve several limitations. The most fundamental limitation is that the basis functions repeat indefinitely, which means that their influence extends across the entire time series. In examining the frequency characteristics of long-term time series data, as in this study, one could potentially apply Fourier analysis. However, in such instances, it becomes challenging to clarify temporal changes in the frequency variations.
The wavelet analysis technique was established to overcome this drawback [50,51]. A wavelet refers to a small wave that has a finite length. Thus, wavelet transformations use basis functions of finite length to transform the existing time series data into the timefrequency domain. Compared with Fourier transformations, wavelet transformations can more clearly represent local changes, which is beneficial for identifying discontinuous points in signals.
The basis function of the wavelet transform is known as the mother wavelet. By the scaling and transition of the mother wavelet, signals in the time domain can be transformed to the frequency domain. For a time series ), the continuous wavelet transform (CWT) is calculated as shown in (11).
where ( , ) represents the CWT with parameters and , where is the scale parameter that adjusts the size of the mother wavelet, and is the transition parameter that shifts the position of the mother wavelet along the time axis. * denotes the mother wavelet function, and the superscript * represents the complex conjugate. The ( , ) computed from (10) typically takes complex values, and thus, to perform comparisons based on magnitudes, the wavelet power (WP) computed as | ( , )| is commonly used. The average WP over the entire period is referred to as the global WP (GWP). The mother wavelet must be carefully selected considering the research objective, as it considerably affects the results of wavelet analysis even when using the same data [52]. Furthermore, if an inappropriate mother wavelet is chosen, a problem with convergence could be encountered. In univariate mother wavelet analysis, the Morlet mother wavelet yields consistent results in both the time and frequency domains compared with other mother wavelets [53][54][55][56]. Therefore, the analysis in this study was based on the Morlet mother wavelet, which can be expressed as in (2).
where represents the non-dimensional time parameter transformed by the scale and transition parameters. is the wave number, which is also referred to as the non-dimensional frequency, set as 6 in this study. In these settings, the Morlet mother wavelet generates six oscillations. Furthermore, setting as 6 has been known to provide the optimal balance in terms of localization in the time-frequency domain during the wavelet transformation [53][54][55][56].
After determining the mother wavelet, the frequency to be analyzed must be set. Typically, this setting is defined using the scale, which is the reciprocal of the frequency. The scale is determined using (13), and J, which denotes the index of the largest scale, is obtained through (14) [57].
where represents the scale size at , and denotes the index of the scale, ranging from 0 to . Therefore, the total number of scales to be analyzed is + 1 .
is the smallest value among the scales to be analyzed, and determines the density of the scale. is a constant and setting it to a smaller value enables a more precise representation of the wavelet spectrum. Typically, is set as 0.4875 [51]. However, its value is set as 0.25 in this study to obtain finer details.
is the time unit for wavelet analysis. The small scale ( ) is generally set as 2 .
Compared with Fourier transformations, the wavelet transformation, which converts data from the time domain into the time-frequency domain, yields less straightforward interpretations. The frequency displayed in the time-frequency domain by the wavelet spectrum may not necessarily represent the frequency of the original data. The same problem is encountered with the scale, which is inversely proportional to the frequency. This discrepancy arises because unlike Fourier transformation, which uses sine or cosine functions of a certain frequency as the basis functions, the center frequency of the mother wavelet used as the basis function in wavelet analysis differs for each function. Nevertheless, when the Morlet mother wavelet ( = 6) is used, the frequency observed in the wavelet analysis closely approximates the frequency of the original data ( = 1.03 ), which enhances the interpretability of the results.
Additionally, the results of wavelet analysis typically include an area known as the cone of influence (COI) in which edge effects cannot be ignored. Caution must be exercised when interpreting results within this area due to its high uncertainty. When using the Morlet mother wavelet, the COI at scale s is calculated as √2 [53]. We used the Python package PyCWT, and the code was developed with reference to the work of [57].

Regionalization of Basins
Regionalization of basins is primarily a method utilized for regional flood frequency analysis (RFFA). When the flood records of a specific basin are very short, it involves grouping basins that have hydrological homogeneity in the vicinity, and then conducting FFA based on this. Cluster analysis methods are predominantly used for regionalization. These can be broadly divided into hierarchical algorithms and partitioning algorithms. A representative method of hierarchical algorithms is Ward's method [25], and for partitioning algorithms, K-means [26] can be cited.
In this study, the Ward algorithm, the most basic form of hierarchical algorithms, was employed for regionalization based on the final L-FFI and wavelet analysis results. The Ward algorithm has been successfully applied in previous related studies [58][59][60][61]. An explanation for this is provided as follows: Firstly, Ward's algorithm aims to minimize the total variance within the cluster. To implement this method, one must find the pair of clusters at each stage that, when merged, results in the smallest increase in the total intra-cluster variance. Essentially, this measures the similarity of clusters based on the increase in the error sum of squares when two clusters are combined. Thus, in data with variables, if there are clusters at the current stage, and each cluster contains data, the objective function is expressed as follows: where represents the value of the observation for the value in cluster , and ̅ denotes the average value of the value in cluster . Thus, the objective function of the Ward algorithm minimizes the sum of squared deviations of feature vectors from each cluster's center [62]. The Ward algorithm begins with singleton clusters. At this point, the cluster center is identical to the feature vector, and the objective function is computed as zero. At each step of the analysis, the union of all possible pairs of clusters is considered, and the two clusters resulting in the smallest increase in when merged are combined. Due to merging, the change in the objective function value, , relies only on the relationship between the two merged clusters and is not influenced by other clusters. Ward's algorithm is adept at restoring cluster structures and tends to form spherical clusters of equal sizes. These characteristics of the Ward algorithm prove beneficial for identifying similar regions for regionalization [63].

Pre-Analysis and Estimation of FFI and L-FFI
The existing studies used different periods to estimate the FFIs, such as a single year or multiple years. A standard method for estimating the L-FFIs remains to be established. Therefore, in this study, we evaluated three methods for estimating the L-FFIs for the Chungju Dam basin: Method (1)-average the annual FFIs; Method (2)-calculate the L-FFIs after averaging the annual flow characteristics; and Method (3)-calculate the L-FFIs using the flow characteristics for the entire period. Figure 2 shows the time series of daily inflow data and the mass curve of the Chungju Dam. Given the large difference in the scale of each year, the horizontal axis is set as a log scale. Although certain differences can be observed among the years, the inflow is generally concentrated in the summer, and the fluctuations vary across the years. In regions such as South Korea, where there is a concentrated rainfall and inflow during specific periods of the year, understanding the variability of dam inflow through metrics such as FFI is crucial from a water resource management perspective. For instance, in areas with high inflow fluctuations, there might be a need to release a significant amount of water for flood prevention, which could subsequently lead to an inability to adequately supply water during drought periods. Conversely, in regions with fewer inflow fluctuations, there is a higher likelihood of efficiently addressing water demands during droughts. Before diving into the in-depth analysis, we first examined the precipitation patterns. For easier comparison, it is displayed in a log-scale, consistent with previous analyses as shown in Figure 3. Upon examining the correlation coefficient between precipitation and inflow time series, as well as the cross-correlation coefficient with a given lag, it is observed that the correlation coefficient is merely about 0.473 when there is no lag. However, it peaks at 0.716 when a lag of 1 day is given, and then exponentially decreases thereafter. This indicates that the precipitation and inflow time series exhibit similar patterns, with the highest correlation observed at a 1-day lag for the Chungju Dam. Other dams also showed similar characteristics. Before examining the results for all the dam basins, the results of the Chungju Dam were analyzed. First, the annual flow characteristics were calculated from the daily inflow. Figure 4 shows the flow characteristics in terms of the minimum and maximum daily discharge and major discharge by year for a period of 36 years from 1986 to 2021. Additionally, Figure 5 displays the box-plot of the annual inflow for each year, with the y-axis set to a log scale.
The minimum discharge value significantly decreases over time (Figure 4a). From this, it can be inferred that, over time, the Chungju Dam shows a general trend of becoming drier. Similarly, . in Figure 4b shows a decreasing trend over time. However, upon examining the precipitation time series in Figure 3, it is highly probable that this trend has been influenced by factors other than precipitation, such as changes in land use or water consumption. Discharges exceeding . in Figure 4e-g exhibit small fluctuations. Figure 4i,j show that the distribution of the flow data is positively skewed. What this implies is that in the case of the Chungju Dam, a significant flow occurs in a short time span. In other words, due to the concentration of precipitation during the peak season, there is a characteristic of inflow rapidly increasing and decreasing in a short period. This characteristic can also be confirmed through the box-plot in Figure 5, and it has been observed to be similarly manifested in other dams.  Delving into the specifics of the index calculation, as previously explained, Method (1) involves computing the FFI for each year and then taking the average, while Method (2) first averages the flow characteristics of each year to derive a long-term flow characteristic, and then calculates a single L-FFI for the entire period at once. As for Method (3), it computes a single L-FFI based on the flow characteristics of the entire dataset.
For example, to calculate the RRC for each method, the process is as follows. Table 3 displays the yearly RRC calculated using the minimum and maximum flows. Method (1) determines the L-FFI by averaging these 36 RRCs. The resulting RRC computed in this manner is 8723.5. For Method (2), one first takes the average of flow characteristics, specifically the minimum and maximum flows. These averages are 4.9 m 3 /s for the minimum and 4922.2 m 3 /s for the maximum. Hence, using these values, an RRC of 1059.8 is derived. Lastly, for Method (3)  As demonstrated, when determining long-term flow indicators, the results can vary significantly depending on at which stage the averages or calculation are taken. As observed in the case of the Chungju Dam, a difference of up to approximately 15 times can be confirmed. Ultimately, Method (3) is sensitive to extreme values, potentially compromising its reliability in a long-term context. Figure 6 shows each index calculated for each year based on the flow characteristics. Ten indices were estimated for different years. The RRC exhibited large variations, and thus, the horizontal axis is expressed in the log scale. Additionally, Table 4 presents the L-FFIs calculated using Methods (1), (2), and (3).
As shown in Figure 6a, the RRC varies significantly across the years, with the minimum and maximum values being 188 (in 1991) and 75,632 (in 2006), respectively. The large difference is attributable to the minimum discharge rather than the maximum discharge, as shown in Figure 4. and (Figure 6b,g, respectively) exhibit similar trends, affected by . . in Figure 6c exhibits considerable variability due to the influence of the maximum fluctuations.
in Figure 6h demonstrates a similar pattern. and (Figure 6d,e, respectively) exhibit nearly constant values. However, in Figure 6f exhibits large variations, which are somewhat different from the patterns of RRC and . and (Figure 6i,j, respectively) reflect not only the large-scale discharge characteristics but also the small-scale discharge characteristics. In other words, the value increases as the low-scale flow reduces.  Figure 7 shows the calculated L-FFIs for all the dams. The RRC, , , and calculated by the different methods vary significantly. However, such significant disparities are not observed in the remaining indices.   Figure 8 shows the results of wavelet analyses on the raw data. The annual cycle appears prominently. Although this trend is noted to be slightly weakened during the 2010s, the dams exhibit a more distinct annual cycle within one year except for the Paldang Dam. As the dam size decreases, a higher power is observed at a smaller scale. Except for the Paldang Dam and the Uiam Dam, the other dams exhibit very similar spectrum patterns. This appears to be influenced by the presence of hydraulic structures upstream. In other words, if there are structures such as dams upstream for flood management, the spectrum of shorter periods tends to be somewhat mitigated.

Wavelet Analysis
Notably, although the data exhibit significant annual periodicity, our focus is the variation in flow excluding seasonal fluctuations. Therefore, we performed a wavelet analysis after removing seasonality. Figure 9 presents the results of the same analysis conducted using data subjected to seasonal differencing. Unlike Figure 8, dam inflow in each figure in Figure 9 presents the seasonally differenced data. Based on seasonal differencing, data excluding the first year were used in the analysis. Figure 9 shows that the annual periodic variations are eliminated, leaving only variations within the year. The variations within one year are not significant, and a smaller catchment corresponds to shorter period exhibiting the largest GWP. These trends are repeated almost every year. Therefore, similar results are likely to be obtained by the Fourier analysis. Considering these aspects, we extracted the period and corresponding frequency with the largest GWP.    Figure 10 shows the results of analyzing the correlation among the L-FFIs. The trend is slightly different for the two groups ( and vs. the others). In other words, and are negatively correlated with the other indicators and positively correlated with each other. This observation suggests that in cases of strong high flow, low flow tends to weaken. The RRC exhibits the most notable differences depending on the method used. Only the RRC obtained using Method (2) exhibits a strong absolute correlation with the other indices. The M-GWP frequency represents the frequency of non-seasonal flow repetitions. Thus, as this value increases, the likelihood of intensification of high and low flows compared with the average increases. Therefore, there exists a positive relationship with the high-flow index and negative relationship with the low-flow index, as shown in Figure 10. Figure 11 displays the scatter plot of the M-GWP frequency and L-FFIs, and Figure  12 presents the corresponding correlation coefficient ( ) and determination coefficient ( 2 ). The M-GWP frequency is negatively correlated with and and positively correlated with the remaining indicators. Except for , the highest correlation is observed when the indicators are calculated using Method (2). When compared quantitatively using the coefficient of determination, for Method (1)

Discussion
In this study, we identified the strongest periodic components from the differenced data and data excluding seasonal variations, and selected the method that showed the most pronounced correlation with L-FFI. Overall, Method (2), which calculates the index by averaging the flow indices, emerges as the most reasonable approach. Method (2) can have the effect of smoothing outliers that may occur in each year, preserving information about long-term flow characteristics. This method can be used to calculate the L-FFI for any point on a dam or along a river, which can facilitate the establishment of appropriate strategies for water management.
Major related prior studies [6][7][8][9][10][11] vary in the periods they use for indicator estimation, and there are even instances within the same study where results from different periods are compared with one another. In contrast, the approach proposed in this study has a clear estimation method for each technique, and notably for Method (2)-which was evaluated as the most optimal-indicators sensitive to maximum values such as RRC appear within a much more acceptable range. Overall, when considering the representativeness of the basin, it is deemed more reasonable compared to previous studies.
We further analyzed the regionalization of watersheds. Ward's method was utilized, one of the representative hierarchical clustering methods. In this study, we performed clustering using L-FFIs derived from Methods (1), (2), and (3) and also carried out clustering by utilizing GWP (global wavelet power) for comparison. Figure 13 displays the results. The standardization was conducted before the analysis.
Method (1) and Method (2) show almost similar analysis results, while Method (3) exhibits a somewhat different regionalization pattern. Observing the clustering results from the wavelet analysis, Method (3) closely aligns with the results of Method (2). This seems to support the findings of our earlier analysis. Moreover, in all analyses, we confirmed that the largest basins, Paldang and Uiam, are clustered differently. As mentioned in the wavelet analysis, this appears to be influenced by the presence of structures such as dams upstream for flood management. The wavelet analysis employed in this study allows for the simultaneous capture of high-frequency and low-frequency components of time series, offering the advantage of investigating both short-term and long-term fluctuations. Moreover, it can be applied to non-stationary time series data, making it adept at capturing changes in extreme conditions such as droughts or floods. However, the interpretation of non-recurring single events can be somewhat challenging, and due to the inherent characteristics of wavelet analysis, the beginning and end of the time series may yield slightly distorted results. Therefore, caution is needed when interpreting data with shorter lengths. While this study conducted analyses on a daily scale, it is essential to note that for actual flood prevention, an even shorter time-scale analysis could be more critical.
Through the methodology of this study, if further analyses for multiple basins are conducted in the future, it will be possible to establish a range for long-term indicators that demonstrate hydrologically stable patterns. In such cases, any new basin can be evaluated through these long-term indicators, enabling the determination of whether improvements in hydrological cycles, through human interventions such as the installation of additional dams or river maintenance, are necessary. These series of processes are expected to facilitate long-term water resource management.
On the other hand, such variations in flow are derived from changes in precipitation patterns. Of course, the degree to which rainfall patterns manifest as changes in flow patterns will vary by basin. However, this study solely focused on determining the most rational method to quantify changes in flow. In future research endeavors, it is deemed essential to investigate the factors influencing long-term flow indicators, such as land cover, soil characteristics, and river network morphology. Furthermore, it is also necessary to examine how these factors respond to rainfall patterns. Additionally, a thorough examination of the implications of these indicators from the perspective of water resource management, specifically pertaining to water supply and flood prevention, is also warranted.

Conclusions
This study explored indices for quantifying flow fluctuations and calculated longterm flow indicators using different methods. For calculating L-FFIs, three approaches were considered. Wavelet analysis was performed to evaluate the derived L-FFI. The evaluation index focused on the period with the highest spectral power from the wavelet transformation of seasonally differenced daily dam inflow data. The key conclusions can be summarized as follows.
(1) The results varied greatly depending on the methodology used to determine the L-FFIs. The RRC, calculated as the ratio of the maximum and minimum flow values, was identified as the most sensitive index to the method used.
(2) Wavelet analysis of the data with seasonality removed revealed periodic components within a year exhibiting strong peaks. The period and frequency having maximum global wavelet powerderived from these data decreased and increased, respectively, with reduced dam size.
(3) The M-GWP frequency was positively and negatively correlated with the highflow and low-flow indices, respectively. The strength of the correlations was the highest when using Method (2) with the highest average 2 of 0.728.
(4) In the regionalization analysis using the Ward method, it was consistently observed that the Chungju Dam and Uiam Dam were significantly differentiated from the other dams. Furthermore, Method (2) showed the most similar characteristics to the clustering of the GWPs from the wavelet analysis.
Overall, Method (2), which involves averaging the flow indices to determine the L-FFI, was noted to be the most reasonable approach. In general, to facilitate efficient water management at the basin level and formulate effective policies, the flow must be analyzed considering various aspects. Indices representing high flow, such as the RRC, can be typically used to formulate flood control strategies, and indices representing low flow may be used to ensure effective water use. The strong correlation between the maximum frequency period and long-term FFIs is a key finding that can provide guidelines for basin management. If further analyses for multiple basins are conducted in the future, it will be possible to establish a range for long-term indicators that demonstrate hydrologically stable patterns. In such cases, the ungauged basin can be evaluated through these long-term indicators, helping in decision-making about human interventions in hydrological cycles. These processes are expected to facilitate long-term water resource management.

Data Availability Statement:
The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest:
The authors declare no conflict of interest.