The TAM-xLSTM Model for Hourly River Flow Forecasting: A Case Study of Qiandongnan, Guizhou Province, China

Liu, Renfeng; Wang, Dingdong; Wang, Liangyi; Cheng, Chi; Xia, Xiaoling; Yang, Ziheng

doi:10.3390/w17172644


by Renfeng Liu ¹,†, Dingdong Wang ¹,†, Liangyi Wang ¹, Chi Cheng ²,*, Xiaoling Xia ³ and Ziheng Yang

¹ School of Mathematics and Computer Science, Wuhan Polytechnic University, Wuhan 430023, China
² Hubei Meteorological Service Center, Wuhan 430074, China
³ Guizhou Meteorological Service Center, Guiyang 550002, China
* Author to whom correspondence should be addressed.
† These authors contributed equally to this work.

Water 2025, 17(17), 2644; https://doi.org/10.3390/w17172644

Submission received: 13 August 2025 / Revised: 3 September 2025 / Accepted: 4 September 2025 / Published: 7 September 2025

(This article belongs to the Topic Advances in Earth Observation Technologies to Support Water-Related Sustainable Development Goals (SDGs))


Abstract

Accurate river flow forecasting is vital for flood warning and water resource management, yet hourly-scale prediction in small catchments remains underexplored despite its importance for rapid response flood control. To address this gap, this study proposes an enhanced temporal attention module xLSTM (TAM-xLSTM) model that combines temporal feature extraction with timestep-level attention to better capture dynamic variations and dependencies. Case studies in the Qiandongnan region demonstrate that TAM-xLSTM substantially outperforms baseline models during wet season forecasting at Panghai Station, reducing RMSE by 9.6%, MAE by 24.1%, and Theil’s U by 6.6%, while increasing NSE by 4.8%. These results highlight the model’s ability to improve short-term river flow prediction in complex mountainous terrain and its potential to support effective flood warning and water resource management.

Keywords: river flow forecasting; hourly-scale prediction; small catchments; TAM-xLSTM; Qiandongnan region

1. Introduction

Accurate river flow forecasting is particularly important in small catchments, where hydrological responses occur rapidly and with high variability. In such contexts, hourly-scale prediction is especially valuable, as it enables timely flood warnings, supports emergency decision-making, and enhances the efficiency of water resource allocation [1]. Compared with daily forecasts, hourly predictions provide finer temporal resolution, which is crucial for addressing the challenges of flash floods, dam regulation, and real-time watershed management [2].

Despite its importance, hourly-scale river flow forecasting remains underexplored. Traditional hydrological models, such as SWAT and HEC-HMS, provide physically interpretable frameworks but face limitations in parameter calibration, structural complexity, and computational cost [3,4]. With the rise of data-driven approaches, machine learning methods such as artificial neural networks (ANNs), support vector machines (SVMs), and random forests (RFs) have been applied to flow forecasting [5,6,7,8]. However, these models are often criticized as “black boxes” that are prone to overfitting and lack interpretability [9]. To overcome these shortcomings, deep learning models, particularly recurrent neural networks (RNNs) and their variants, have been widely introduced into hydrology. The long short-term memory (LSTM) network alleviates the vanishing gradient problem and captures long-term dependencies, while the gated recurrent unit (GRU) offers a simplified structure with comparable performance [10,11,12,13]. Nevertheless, their accuracy often degrades when applied to highly nonlinear or long-duration flood events. More recently, Transformer-based models have demonstrated strong capabilities in long-sequence forecasting through self-attention mechanisms [14,15]. Variants such as Informer and Autoformer have improved efficiency and long-term prediction accuracy [16,17], yet they still struggle to capture local details, exhibit high computational complexity, and often require large datasets, which makes them less suitable for small catchments [18]. To further enhance sequence modeling, the xLSTM framework was recently introduced as an extension of LSTM designed to improve memory capacity and temporal feature extraction [19]. While xLSTM has shown promising results, it remains limited in capturing multi-scale temporal dependencies and in emphasizing key timestep features, both of which are critical in hourly-scale hydrological forecasting.

In this study, an enhanced temporal attention module xLSTM (TAM-xLSTM) model is proposed to address these challenges. To evaluate model performance, a multi-source hydrometeorological dataset was constructed by combining meteorological, hydrological, and surface energy variables. Case studies were carried out at three stations in Guizhou Province, China: Panghai, Xinghua, and Zhijin. Panghai Station is affected by dam operations during the dry season, Xinghua Station represents near-natural hydrological conditions, and Zhijin Station provides nearly 50 years of streamflow records. These diverse settings enable a comprehensive assessment of forecasting accuracy and generalization capability.

The objective of this study is to develop a robust deep learning framework capable of capturing multi-scale temporal dependencies and critical hydrological features for fine-scale flow prediction. Specifically, TAM-xLSTM integrates dilated temporal convolutions and channel–temporal attention mechanisms to enhance both accuracy and stability. By testing the model across different hydrological regimes and long-term historical records, this work aims to advance methodological development in hourly-scale hydrological forecasting and to provide practical insights for flood risk management and water resource planning.

2. Related Work

2.1. xLSTM Model

The xLSTM framework [19] is a sequence modeling architecture that extends the traditional LSTM network with two optimized variants, scalar LSTM (sLSTM) and matrix LSTM (mLSTM). Each variant targets a different limitation of the original design, which enables the framework to address a range of complex sequence modeling tasks.

The sLSTM extends the conventional LSTM by incorporating a scalar update mechanism, which refines the gating process and improves stability in long-sequence modeling. Its core update can be written as:

$$
c_t = f_t \cdot c_{t-1} + i_t \cdot z_t, \qquad h_t = o_t \cdot \tilde{h}_t, \qquad \tilde{h}_t = \frac{c_t}{n_t} \tag{1}
$$

where $c_t$ is the cell state, $h_t$ is the hidden state, $z_t$ is the candidate cell input generated through a nonlinear transformation, $n_t$ is the normalization factor used to stabilize the output of exponential gating, $\tilde{h}_t$ is the normalized cell state, and $f_t$, $i_t$, and $o_t$ denote the forget, input, and output gates, respectively.
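As an illustration of Equation (1), the following minimal sketch performs one sLSTM update on scalar states. The function name `slstm_step` is hypothetical, the gate values are passed in directly rather than computed from inputs, and the normalizer recursion $n_t = f_t n_{t-1} + i_t$ is taken from the xLSTM formulation in [19] rather than from the text above, so it should be read as an assumption.

```python
def slstm_step(c_prev, n_prev, z_t, i_t, f_t, o_t):
    """One scalar sLSTM update following Equation (1).

    Assumption: the normalizer follows n_t = f_t * n_prev + i_t, which is
    not spelled out in the surrounding text.
    """
    c_t = f_t * c_prev + i_t * z_t   # cell state
    n_t = f_t * n_prev + i_t         # normalizer state (assumed recursion)
    h_tilde = c_t / n_t              # normalized cell state
    h_t = o_t * h_tilde              # hidden state
    return c_t, n_t, h_t


# Hand-picked gate values, for illustration only.
c, n, h = slstm_step(c_prev=1.0, n_prev=1.0, z_t=0.5, i_t=2.0, f_t=0.5, o_t=1.0)
print(c, n, h)
```

Dividing by the normalizer keeps the hidden state bounded even when the input gate takes large values, which is the stabilizing role described for $n_t$.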

The mLSTM significantly enhances the memory capacity and parallel processing capability of the traditional LSTM by extending states from vectors to matrices. Its update mechanism can be expressed as:

$$
C_t = f_t C_{t-1} + i_t v_t k_t^{\top}, \qquad h_t = o_t \odot \tilde{h}_t, \qquad \tilde{h}_t = \frac{C_t q_t}{\max\{|n_t^{\top} q_t|, 1\}} \tag{2}
$$

where $C_t$ is the memory matrix, $f_t$ and $i_t$ are the forget and input gates, $v_t$ is the value vector, $k_t$ is the key vector, $q_t$ is the query vector, $n_t$ is the accumulated key representation, and $o_t$ is the output gate.
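The matrix memory in Equation (2) behaves like a key-value store: a value written with key $k_t$ can be read back with a matching query $q_t$. The sketch below shows this with plain Python lists. The name `mlstm_step` is hypothetical, and the recursion $n_t = f_t n_{t-1} + i_t k_t$ for the accumulated key is an assumption based on the xLSTM formulation in [19].

```python
def mlstm_step(C_prev, n_prev, q, k, v, i_t, f_t, o):
    """One mLSTM update following Equation (2), using nested lists."""
    rows, cols = len(v), len(k)
    # C_t = f_t * C_{t-1} + i_t * v k^T (outer product of value and key)
    C = [[f_t * C_prev[r][c] + i_t * v[r] * k[c] for c in range(cols)]
         for r in range(rows)]
    # Accumulated key (assumed recursion): n_t = f_t * n_{t-1} + i_t * k_t
    n = [f_t * n_prev[c] + i_t * k[c] for c in range(cols)]
    # Read-out: C_t q_t divided by max(|n_t^T q_t|, 1)
    numerator = [sum(C[r][c] * q[c] for c in range(cols)) for r in range(rows)]
    denominator = max(abs(sum(n[c] * q[c] for c in range(cols))), 1.0)
    h = [o[r] * numerator[r] / denominator for r in range(rows)]
    return C, n, h


# Store the value [2, 3] under the key [1, 0], then query with the same key.
C0 = [[0.0, 0.0], [0.0, 0.0]]
n0 = [0.0, 0.0]
C1, n1, h1 = mlstm_step(C0, n0, q=[1.0, 0.0], k=[1.0, 0.0], v=[2.0, 3.0],
                        i_t=1.0, f_t=1.0, o=[1.0, 1.0])
print(h1)
```

In this toy case the query matches the stored key exactly, so the read-out recovers the stored value.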

In summary, as an innovative extension of the LSTM framework, xLSTM enhances the modeling of complex sequential data by introducing two optimized variants: sLSTM and mLSTM. The sLSTM variant improves the gating mechanism, making it particularly effective for sequences with subtle temporal variations, while the mLSTM leverages a matrix-based design to expand memory capacity and enable efficient parallel computation. This modular architecture allows xLSTM to flexibly adapt to specific task requirements by selecting the most appropriate variant, thereby achieving a balance between model performance and computational efficiency.

2.2. WaveNet

WaveNet is a generative neural network originally proposed for high-quality speech synthesis [20]. By directly modeling raw time-domain data, it effectively captures long-term dependencies and produces high-fidelity signals, overcoming the limitations of traditional statistical models and recurrent neural networks. Owing to its strong sequential modeling capability, WaveNet has since been applied to speech synthesis, music generation, natural language processing, and time series forecasting in areas such as meteorology and hydrology.

The key to WaveNet lies in its use of causal and dilated convolutions. Causal convolution ensures that each prediction depends only on past information. Dilated convolution, when the dilation factor doubles from layer to layer, lets the receptive field grow exponentially with depth while the number of parameters grows only linearly. Together with residual and skip connections, these mechanisms alleviate vanishing gradients and facilitate the training of very deep networks.
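The effect of dilation on the receptive field can be checked with a few lines of code. The following sketch is illustrative only: the helper names, the kernel size of 2, and the dilation schedule 1, 2, 4, 8 are assumptions chosen for the example, not settings reported in this study.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field of a stack of dilated causal convolutions."""
    return 1 + (kernel_size - 1) * sum(dilations)


def causal_dilated_conv(x, weights, dilation):
    """y[t] = sum_j weights[j] * x[t - j * dilation], with zero padding
    on the left so that y[t] never depends on future samples."""
    y = []
    for t in range(len(x)):
        acc = 0.0
        for j, w in enumerate(weights):
            idx = t - j * dilation
            if idx >= 0:
                acc += w * x[idx]
        y.append(acc)
    return y


rf = receptive_field(kernel_size=2, dilations=[1, 2, 4, 8])
print(rf)  # four layers cover 16 timesteps

# A unit impulse at t = 3 only influences outputs at t >= 3 (causality).
impulse = [0.0] * 8
impulse[3] = 1.0
y = causal_dilated_conv(impulse, weights=[1.0, 1.0], dilation=2)
print(y)
```

With a 72-hour input window, a stack of this kind would need only a handful of layers to see the whole window, which is the efficiency argument made above.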

In summary, WaveNet combines causal convolution, dilated convolution, and residual learning into an efficient and scalable architecture that advanced the state of the art in sequence generation and inspired many subsequent convolution-based models for temporal data.

3. Methods

3.1. Research Strategy

The methodology of this study follows a structured workflow designed to ensure both predictive performance and physical interpretability. The overall strategy consists of the following steps:

(1) Data acquisition and preprocessing: collection of hourly hydrological and meteorological observations from six stations in the Qiandongnan region, supplemented with satellite-derived radiation data. Records were temporally aligned, missing values interpolated, and variables integrated into a unified dataset.
(2) Feature construction: inclusion of precipitation, solar radiation, and basin morphometric indices to represent both external hydrometeorological drivers and intrinsic catchment response mechanisms.
(3) Model development: design of the TAM-xLSTM, integrating a T-WaveNet module and a CBAM-1D attention mechanism to enhance multi-scale temporal feature extraction and timestep-level sensitivity.
(4) Model training and validation: generation of training samples using a sliding window approach, with explicit train/validation/test splits and a three-day buffer period to prevent temporal leakage. Supervised learning with early stopping was employed for parameter optimization.
(5) Evaluation and uncertainty analysis: assessment of predictive performance using RMSE, MAE, Theil’s U, and NSE, complemented by bootstrap-based 95% confidence intervals to quantify uncertainty and robustness.

This workflow is summarized in the schematic diagram (Figure 1), which illustrates the overall research strategy from data preparation to model evaluation.
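The bootstrap-based confidence intervals mentioned in the evaluation step can be sketched as follows. This is a minimal percentile bootstrap under stated assumptions: the resampling scheme, the number of resamples (`n_boot=1000`), and the function names are hypothetical, since the text does not specify them. Plain resampling of individual timesteps also ignores autocorrelation, so a block bootstrap may be more appropriate for hourly series.

```python
import math
import random


def rmse(obs, pred):
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))


def bootstrap_ci(obs, pred, metric, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for a metric of paired series."""
    rng = random.Random(seed)
    n = len(obs)
    stats = []
    for _ in range(n_boot):
        # Resample (observation, prediction) pairs with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        stats.append(metric([obs[i] for i in idx], [pred[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Synthetic data for illustration only (not from the study).
obs = [10.0 + i for i in range(50)]
pred = [o + ((i % 5) - 2) * 0.5 for i, o in enumerate(obs)]
low, high = bootstrap_ci(obs, pred, rmse)
print(rmse(obs, pred), low, high)
```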

3.2. Study Area

This study focuses on the Qiandongnan region in Southeastern Guizhou Province, China, located between latitudes 24°19′ N–26°49′ N and longitudes 107°17′ E–109°35′ E. Situated in the transition zone between the Yungui Plateau and the Western Hunan hills, the region exhibits complex topography and hydroclimatic conditions, representative of mountainous plateau areas in Southwest China. Elevation ranges from below 300 m to over 2000 m, forming distinct vertical climate zones. Extensive karst landforms, including depressions, caves, and subterranean rivers, strongly influence local hydrological processes [21].

Qiandongnan belongs to the Pearl River Basin and features a dense river network. The Duliu River, a major tributary of the Qianjiang system, spans about 310 km with a drainage area of 15,600 km², while the Qingshui River, part of the Longjiang system, extends 280 km with a basin area of 13,000 km². These rivers provide water for domestic use, irrigation, ecological functions, and hydropower development. The region has a subtropical humid monsoon climate, with an annual mean temperature of about 15 °C and precipitation of 1100–1400 mm, most of which falls between May and September. Influenced by terrain, rainfall distribution is highly uneven, and localized extreme events often trigger floods and geological hazards.

Overall, the study area’s combination of rugged topography, diverse river systems, and pronounced seasonal variability makes it suitable for investigating hydrological responses in mountainous catchments and for developing river flow prediction models. The geographical distribution is shown in Figure 2.

3.3. Dataset

This study utilizes hourly river flow, precipitation, and solar radiation data to investigate the influence of these climatic variables on river discharge. The dataset comprises hydrological and meteorological observations collected from six monitoring stations in the Qiandongnan region of Guizhou Province, China, covering the period from 2022 to 2023, as summarized in Table 1. Historical data for these variables were obtained from the Guizhou Meteorological Bureau and the FY-4A satellite, ensuring high accuracy and consistency with the model design. To address missing meteorological values caused by occasional monitoring equipment failures, linear interpolation was applied to maintain data completeness. Although meteorological forecast variables such as precipitation forecasts, temperature, pressure, and wind speed are practically relevant, the available forecast data in the Qiandongnan region suffer from large amounts of missing values and low measurement accuracy. Using these data directly may introduce noise and reduce prediction reliability. Therefore, this study employs high-resolution gridded precipitation datasets and satellite-based remote sensing products, which provide more complete temporal coverage and higher spatial accuracy. Preliminary experiments further confirmed that these datasets yield better model performance than the available meteorological forecasts.

To ensure data consistency, the raw data in Table 1 were carefully extracted and preprocessed, with particular attention paid to temporal alignment across sources. For example, the surface solar radiation data from the FY-4A satellite were timestamped using Coordinated Universal Time (UTC), whereas the ground-based precipitation records were logged in Beijing Time (UTC+8). Therefore, to synchronize the two datasets, the timestamps of the satellite data were uniformly shifted forward by 8 hours to match the local time. After preprocessing, the hydrometeorological variables were integrated into a unified dataset. The features of the final dataset are summarized in Table 2.
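The two preprocessing steps described above, the UTC to Beijing Time shift and the linear interpolation of missing values, can be sketched with the standard library alone. The function names and the convention of marking gaps with `None` are assumptions made for this example.

```python
from datetime import datetime, timedelta, timezone

BEIJING = timezone(timedelta(hours=8))


def utc_to_beijing(ts_utc):
    """Convert a naive UTC timestamp to naive Beijing Time (UTC+8)."""
    aware = ts_utc.replace(tzinfo=timezone.utc)
    return aware.astimezone(BEIJING).replace(tzinfo=None)


def interpolate_linear(values):
    """Fill interior None gaps by linear interpolation between the nearest
    valid neighbours. Leading and trailing gaps are left untouched."""
    filled = list(values)
    known = [i for i, v in enumerate(values) if v is not None]
    for a, b in zip(known, known[1:]):
        for i in range(a + 1, b):
            weight = (i - a) / (b - a)
            filled[i] = values[a] + weight * (values[b] - values[a])
    return filled


# A satellite record stamped 20:00 UTC belongs to 04:00 the next day locally.
local = utc_to_beijing(datetime(2022, 6, 1, 20, 0))
print(local)

# One missing hourly reading between two valid ones.
series = interpolate_linear([2.0, None, 4.0, None])
print(series)
```

Note that a shift of this kind can move a record across a day boundary, which matters when hourly records are later joined by date.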

Beyond their statistical role as model inputs, these variables also reflect underlying physical processes that govern river flow, particularly at the hourly scale. Precipitation is the most direct driver of discharge, with rainfall events rapidly generating surface runoff and sharp flow rises within hours. Solar radiation influences flow indirectly through evapotranspiration and snowmelt: strong radiation may enhance evapotranspiration and reduce discharge, while in snow-affected basins it can accelerate meltwater contribution. Basin area determines the spatial extent of water collection, with larger basins sustaining higher long-term discharge but exhibiting slower hydrological responses, whereas smaller basins respond more quickly to rainfall. Average slope reflects topographic steepness: steeper slopes promote faster overland flow and reduced infiltration, leading to more pronounced short-term peaks, while gentler slopes delay and attenuate hydrographs. Similarly, stream length and density shape runoff concentration: longer channels and lower densities delay and dampen flood peaks, whereas shorter channels and higher densities accelerate flow convergence.

Together, these meteorological and morphometric variables capture both external hydrometeorological drivers and intrinsic basin response mechanisms, thereby providing a physically meaningful foundation for improving the accuracy and interpretability of hourly flow forecasting.

To evaluate the interrelationships among the selected features, a correlation matrix analysis was conducted on the hydrometeorological variables, and a corresponding heatmap was generated. As shown in Figure 3, among the selected features, Water_Level exhibits a strong positive correlation with river flow, indicating that it directly reflects hydrodynamic conditions. In contrast, meteorological variables such as precipitation-related metrics show relatively low linear correlation coefficients. Although their direct linear relationship with river flow is weak, these variables may still contribute valuable information in a nonlinear modeling framework, particularly in capturing short-term fluctuations and lagged hydrological responses.
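The point that a low linear correlation does not rule out a useful nonlinear relationship can be illustrated with a small Pearson correlation example. The data below are synthetic and the function name is hypothetical; they are not the variables of Figure 3.

```python
import math


def pearson_r(x, y):
    """Pearson linear correlation coefficient of two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


x = [-2.0, -1.0, 0.0, 1.0, 2.0]
linear = [3.0 * v + 1.0 for v in x]   # perfectly linear in x
quadratic = [v ** 2 for v in x]       # fully determined by x, but not linear

r_linear = pearson_r(x, linear)
r_quadratic = pearson_r(x, quadratic)
print(r_linear, r_quadratic)
```

The quadratic series is completely determined by `x` yet has zero linear correlation with it, which is why weakly correlated inputs can still be informative to a nonlinear model.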

In addition to the main experiment using hourly data, this paper introduces a long-term, daily-scale hydrometeorological dataset from the Zhijin Hydrological Station in Bijie City, Guizhou Province, as supplementary validation to further evaluate the robustness and generalization of the proposed model over long time spans. This dataset covers the period from 1975 to 2023 and contains basic hydrometeorological characteristics such as daily rainfall, temperature, wind speed, air pressure, and water level. Although this dataset differs from the hourly data used in the main experiment in terms of temporal resolution and feature composition, its long-term nature helps evaluate the model’s performance under different hydrological conditions, particularly in extreme climate years, further validating the model’s temporal transfer capabilities and generalization.

3.4. T-WaveNet Module

Graph WaveNet, an extension of the original WaveNet architecture, incorporates both graph convolution layers and temporal convolution layers to model spatiotemporal data structures [22]. While effective in capturing spatial dependencies, Graph WaveNet suffers from increased computational complexity and training difficulty due to the integration of graph convolution operations. To address these limitations, this study proposes temporal-WaveNet (T-WaveNet), a streamlined variant that removes the graph convolution component while preserving the model’s temporal modeling capabilities. Unlike Graph WaveNet, T-WaveNet processes temporal sequences exclusively through gated temporal convolutional layers (Gated TCN) and avoids graph-based operations. This design results in a more lightweight architecture with reduced computational overhead, while retaining the strengths of WaveNet in time series forecasting. The structure of the T-WaveNet module is illustrated in Figure 4.

T-WaveNet is a lightweight variant of the WaveNet architecture, specifically designed for temporal data modeling. By removing the complexity of graph convolutional layers, T-WaveNet simplifies the overall model structure while preserving the powerful temporal modeling capabilities of the original WaveNet. This design significantly improves computational efficiency and offers enhanced flexibility and scalability, making it well suited for real-time prediction tasks involving large-scale time series data.
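A gated temporal convolution can be sketched for a single channel as follows. The gating form used here, a tanh filter branch multiplied by a sigmoid gate branch, follows the Gated TCN of Graph WaveNet [22]; that T-WaveNet uses exactly this form, as well as the kernel weights and dilation below, are assumptions made for illustration.

```python
import math


def causal_dilated_conv(x, weights, dilation):
    """Single-channel causal dilated convolution with left zero padding."""
    return [
        sum(w * x[t - j * dilation]
            for j, w in enumerate(weights) if t - j * dilation >= 0)
        for t in range(len(x))
    ]


def gated_tcn(x, w_filter, w_gate, dilation):
    """Gated activation: tanh(filter branch) * sigmoid(gate branch)."""
    filt = causal_dilated_conv(x, w_filter, dilation)
    gate = causal_dilated_conv(x, w_gate, dilation)
    return [math.tanh(f) / (1.0 + math.exp(-g)) for f, g in zip(filt, gate)]


# Illustrative input: a short flow pulse in an otherwise flat series.
flow = [0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0]
out = gated_tcn(flow, w_filter=[0.5, 0.5], w_gate=[1.0, -1.0], dilation=2)
print(out)
```

The sigmoid branch acts as a soft switch that controls how much of the tanh-transformed signal passes to the next layer.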

3.5. CBAM-1D Module

The convolutional block attention module (CBAM) is an attention mechanism designed to enhance the representational power of convolutional neural networks [23]. To improve the model’s ability to capture key feature channels and critical temporal steps, this study adopts the 1D convolutional block attention module (CBAM-1D) to reweight the input time series data. Unlike conventional spatial attention mechanisms, CBAM-1D incorporates a temporal attention mechanism to enhance the model’s focus on important moments in sequential data. The CBAM-1D module first applies channel attention followed by temporal attention, and its architecture is illustrated in Figure 5.

The input features are processed through the channel attention mechanism, which dynamically learns the importance weights of each feature channel. By computing these weights, the model effectively captures the most critical information from the input features and generates optimized intermediate representations. The structure of the channel attention module is shown in Figure 6.

The specific process is as follows: given the input features $X \in \mathbb{R}^{B \times C \times T}$, global average pooling and global max pooling are first performed along the temporal dimension $T$ to obtain two channel descriptors. These descriptors are then passed through a shared two-layer multilayer perceptron (MLP) for feature transformation. Finally, a sigmoid activation function is applied to produce the channel attention weights $M_c$. The computation of channel attention is formulated as follows:

$$
M_c(X) = \sigma\big(\mathrm{MLP}(\mathrm{AvgPool}(X)) + \mathrm{MLP}(\mathrm{MaxPool}(X))\big) \tag{3}
$$

where $\sigma$ is the sigmoid activation function.

Subsequently, the input features are element-wise multiplied with the channel attention weights to obtain the enhanced feature map $X'$:

$$
X' = M_c(X) \odot X \tag{4}
$$
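Equations (3) and (4) can be traced by hand on a tiny example. The sketch below handles a single sample of shape $C \times T$ with nested lists. The ReLU between the two MLP layers, the omission of biases, and the weight values are assumptions for illustration, since the text only specifies a shared two-layer MLP.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def shared_mlp(descriptor, W1, W2):
    """Two-layer MLP without biases; ReLU in between is an assumption."""
    hidden = [max(0.0, sum(w * d for w, d in zip(row, descriptor))) for row in W1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in W2]


def channel_attention(X, W1, W2):
    """Equation (3): one weight per channel from pooled descriptors."""
    avg_desc = [sum(channel) / len(channel) for channel in X]  # pool over T
    max_desc = [max(channel) for channel in X]                 # pool over T
    a = shared_mlp(avg_desc, W1, W2)
    m = shared_mlp(max_desc, W1, W2)
    return [sigmoid(p + q) for p, q in zip(a, m)]


# Two channels, three timesteps; the second channel carries no signal.
X = [[1.0, 2.0, 3.0],
     [0.0, 0.0, 0.0]]
W1 = [[1.0, 0.0]]     # C=2 -> hidden=1
W2 = [[1.0], [0.0]]   # hidden=1 -> C=2

Mc = channel_attention(X, W1, W2)
X_prime = [[w * v for v in channel] for w, channel in zip(Mc, X)]  # Equation (4)
print(Mc)
```

Here the informative first channel receives a weight close to 1, while the empty second channel stays at the neutral value 0.5.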

The timestep attention module is designed to dynamically adjust the importance of different timesteps in the input sequence, allowing the model to automatically focus on critical moments based on task requirements. In time series data, some timesteps may contain more crucial information and have a greater impact on the prediction results, while others may include noise or less informative content. By assigning a weighting coefficient to each timestep, the module ensures that the model can adaptively modulate its attention across the sequence. The structure of the timestep attention module is illustrated in Figure 7.

The specific process is as follows: first, global max pooling and average pooling are performed along the channel dimension $C$ to obtain two one-dimensional temporal feature sequences, which are then concatenated. The concatenated features are fed into a 1D convolutional network (Conv1D) to extract local temporal patterns. Finally, a sigmoid activation function is applied to generate the temporal attention weights $M_t$. The calculation process of the temporal attention is as follows:

$$
M_t(X') = \sigma\Big(f_{\mathrm{conv}}^{1D}\big([\mathrm{AvgPool}_C(X'); \mathrm{MaxPool}_C(X')]\big)\Big) \tag{5}
$$

where $f_{\mathrm{conv}}^{1D}$ is the 1D convolution operation and $[\,;\,]$ represents feature concatenation.

Finally, the feature map is multiplied element-wise with the temporal attention weights to obtain the final output $X''$:

$$
X'' = M_t(X') \odot X' \tag{6}
$$
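The timestep attention of Equations (5) and (6) can be sketched in the same style. The kernel size of 3, the zero ("same") padding, and the kernel weights are assumptions chosen for the example; the text does not state the Conv1D settings.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def temporal_attention(X, kernel, bias=0.0):
    """Equation (5) for one sample X of shape C x T.

    kernel has two rows of equal odd length: one for the channel-average
    sequence and one for the channel-max sequence (assumed layout).
    """
    C, T = len(X), len(X[0])
    avg_seq = [sum(X[c][t] for c in range(C)) / C for t in range(T)]
    max_seq = [max(X[c][t] for c in range(C)) for t in range(T)]
    K = len(kernel[0])
    pad = K // 2
    weights = []
    for t in range(T):
        acc = bias
        for j in range(K):
            idx = t + j - pad
            if 0 <= idx < T:  # zero padding outside the sequence
                acc += kernel[0][j] * avg_seq[idx] + kernel[1][j] * max_seq[idx]
        weights.append(sigmoid(acc))
    return weights


# Two channels, three timesteps, with all activity at the middle timestep.
X_prime = [[0.0, 4.0, 0.0],
           [0.0, 2.0, 0.0]]
kernel = [[0.0, 1.0, 0.0],
          [0.0, 1.0, 0.0]]

Mt = temporal_attention(X_prime, kernel)
X_out = [[Mt[t] * row[t] for t in range(len(row))] for row in X_prime]  # Equation (6)
print(Mt)
```

The middle timestep, where both pooled sequences peak, receives a weight near 1, while the quiet timesteps remain at 0.5.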

3.6. TAM-xLSTM Network Architecture

To further improve the accuracy of hydrological flow prediction and the capability of temporal feature modeling, this study proposes the temporal attention module xLSTM (TAM-xLSTM) model, which integrates the T-WaveNet module and the CBAM-1D module. TAM-xLSTM is designed to address the limitations of traditional LSTM models in capturing long-term dependencies in complex time series data. By introducing attention mechanisms, the model dynamically adjusts the importance of different timesteps, thereby enhancing overall performance. The architecture of the TAM-xLSTM network is shown in Figure 8.

The TAM-xLSTM model consists of three main modules: the T-WaveNet module, the CBAM-1D attention module, and the xLSTM encoding module. These modules work in close coordination to effectively enhance the model’s ability to perceive multi-scale temporal features and respond to key time points. The T-WaveNet module is composed of multiple layers of dilated causal convolutions, enabling the model to capture both short-term fluctuations and long-term trends in the input sequence across different receptive fields. The core idea of the CBAM-1D module is to dynamically assess the importance of feature dimensions using the channel attention mechanism, followed by a modified temporal attention mechanism to identify key time segments. This module outputs a weighted temporal feature sequence, providing the xLSTM model with more significant and focused temporal input. The xLSTM is built using the msm structure, incorporating both sLSTM and mLSTM units. It includes a multi-head mechanism and projection capability, enabling it to encode the dynamic changes of time series from multiple perspectives. Upon receiving the weighted temporal input, the xLSTM processes the data step-by-step and learns deep temporal dependencies.

The entire TAM-xLSTM model takes hourly river flow and meteorological data as input and sequentially extracts temporal features through the aforementioned modules. Finally, a linear layer is used to output the predicted values. This structure combines the efficiency of local convolutional modeling with the temporal modeling advantages of the xLSTM architecture, significantly enhancing the model’s adaptability and prediction accuracy for non-stationary time series data.

3.7. Validation and Evaluation Metrics

This study adopts a consistent validation protocol in which all models are trained and tested on the same dataset. To comprehensively evaluate prediction performance, four evaluation metrics with clear hydrological significance are selected: the root mean square error (RMSE), the Nash–Sutcliffe efficiency (NSE), the mean absolute error (MAE), and Theil’s U.

The RMSE is the square root of the mean squared difference between the predicted and observed values. Because errors are squared before averaging, RMSE is particularly sensitive to large deviations such as those at extreme flows, which matters in applications such as flood forecasting where peak discharge must be predicted accurately. This metric has been widely used in hydrological model evaluation. The calculation formula is:

$$
\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2} \tag{7}
$$

where $y_i$ is the $i$-th observed value, $\hat{y}_i$ is the $i$-th predicted value, and $n$ is the number of data points.

The NSE coefficient is a widely used performance evaluation metric in hydrological modeling, meteorological forecasting, and environmental simulations [24]. It measures the degree of fit between the model’s predicted values and the actual observed values and is particularly suitable for assessing the prediction accuracy of hydrological time series such as river flow and precipitation. The calculation formula is:

$$
\mathrm{NSE} = 1 - \frac{\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}{\sum_{i=1}^{n} (y_i - \bar{y})^2} \tag{8}
$$

where $y_i$ is the $i$-th observed value, $\hat{y}_i$ is the $i$-th predicted value, $\bar{y}$ is the mean of all observed values, and $n$ is the number of data points.

The MAE is the average of the absolute errors between the predicted and observed values. It intuitively reflects the average magnitude of prediction errors and is less sensitive to outliers than RMSE, making it suitable for assessing the stability of river flow predictions. This metric is often used as a benchmark for comparing hydrological models. The calculation formula is:

$$
\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} |y_i - \hat{y}_i| \tag{9}
$$

where $y_i$ is the $i$-th observed value, $\hat{y}_i$ is the $i$-th predicted value, and $n$ is the number of data points.

Theil’s U statistic measures the deviation between model predictions and observed data. By normalizing the RMSE, it allows prediction performance to be compared across different basins or time scales. A value of $U < 1$ indicates that the model performs better than a naive forecast; in the form of Equation (10), $U = 0$ corresponds to a perfect fit, and the trivial all-zero forecast $\hat{y}_i = 0$ yields $U = 1$. This makes the metric suitable for evaluating the systematic bias of river flow prediction models. The calculation formula is:

$$
\text{Theil's } U = \sqrt{\frac{\sum_{i=1}^{n} (\hat{y}_i - y_i)^2}{\sum_{i=1}^{n} y_i^2}} \tag{10}
$$

where $y_i$ is the $i$-th observed value, $\hat{y}_i$ is the $i$-th predicted value, and $n$ is the number of data points.

The combined use of these evaluation metrics enables a comprehensive assessment of the model’s performance across multiple dimensions of river flow prediction, offering valuable insights for targeted model optimization. In particular, under varying application scenarios, such as extreme event forecasting and routine daily flow simulation, each metric captures distinct aspects of model behavior, thereby providing a multifaceted understanding of its strengths and limitations.
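For reference, the four metrics in Equations (7) to (10) can be written compactly as below. The sample values are made up for the illustration and are not results from this study.

```python
import math


def rmse(obs, pred):
    return math.sqrt(sum((y - p) ** 2 for y, p in zip(obs, pred)) / len(obs))


def mae(obs, pred):
    return sum(abs(y - p) for y, p in zip(obs, pred)) / len(obs)


def nse(obs, pred):
    mean_obs = sum(obs) / len(obs)
    residual = sum((y - p) ** 2 for y, p in zip(obs, pred))
    variance = sum((y - mean_obs) ** 2 for y in obs)
    return 1.0 - residual / variance


def theil_u(obs, pred):
    residual = sum((p - y) ** 2 for y, p in zip(obs, pred))
    return math.sqrt(residual / sum(y ** 2 for y in obs))


obs = [1.0, 2.0, 3.0, 4.0]
pred = [1.0, 2.0, 3.0, 5.0]   # a single error of 1 at the peak

scores = {
    "RMSE": rmse(obs, pred),
    "MAE": mae(obs, pred),
    "NSE": nse(obs, pred),
    "Theil's U": theil_u(obs, pred),
}
print(scores)

# Reference points: predicting the observed mean gives NSE = 0, and an
# all-zero forecast gives U = 1 under Equation (10).
nse_mean = nse(obs, [2.5] * 4)
u_zero = theil_u(obs, [0.0] * 4)
```

The single peak error raises RMSE (0.5) twice as much as MAE (0.25), which shows why the two metrics are reported together.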

4. Results

The experiments were run on an Intel i5-12400F processor (Intel, Santa Clara, CA, USA) with a base frequency of 2.5 GHz, an NVIDIA GeForce RTX 4060 graphics card (NVIDIA Corporation, Santa Clara, CA, USA) with 8 GB of video memory, and 16 GB of system memory, under Windows 10. The network models were built with the PyTorch 2.0.1 deep learning framework and CUDA 11.0. All network models share the same training settings: 400 training epochs, an input sliding window of 72, a batch size of 1024, and the Adam optimizer with an initial learning rate of 0.001. The experimental environment and parameter settings are shown in Table 3.

This study conducts a comprehensive training and evaluation of six models: LSTM, GRU, Transformer, Informer, xLSTM, and the improved TAM-xLSTM. The models are trained using hourly data on river flow and precipitation collected from six hydrological stations in the Qiandongnan region of Guizhou Province, together with solar radiation data obtained from the FY-4A geostationary meteorological satellite. To construct the training dataset, a sliding window technique with a window length of 72 hours is applied, enabling the models to capture short-term to medium-term hydrological variation patterns.
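The sliding window construction can be sketched as follows. The window length of 72 comes from the text; the one-step forecast horizon and the function name are assumptions, since the lead time is not stated in this section.

```python
def make_windows(features, target, window=72, horizon=1):
    """Pair each `window`-step input slice with the target value
    `horizon` steps after the end of that slice."""
    inputs, labels = [], []
    last_start = len(features) - window - horizon
    for start in range(last_start + 1):
        inputs.append(features[start:start + window])
        labels.append(target[start + window + horizon - 1])
    return inputs, labels


# 100 hourly records, using the hour index itself as a stand-in for the data.
hours = list(range(100))
X_win, y_win = make_windows(hours, hours)
print(len(X_win), X_win[0][0], X_win[0][-1], y_win[0])
```

Each sample covers three days of history, and consecutive samples overlap by 71 hours, which is why a buffer between evaluation periods is needed to keep windows from straddling two splits.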

For data partitioning, the training set comprises all remaining records from the six stations covering January 2022 to December 2023, excluding the validation and test periods of the target station. At each target station, the two months immediately preceding the test month are reserved as the validation set, while the entire target month serves as the test set. To avoid information leakage from temporal overlap, a three-day buffer period is inserted between the validation and test intervals. This split ensures strict temporal separation among training, validation, and testing, allowing robust evaluation of model generalization. The specific validation and test periods for Panghai and Xinghua stations are shown in Table 4.
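One way to express this partitioning for a target station is sketched below. The exact boundary handling is an assumption: here the validation window starts two calendar months before the test month and ends three days before it, records in the buffer are dropped, and everything else is treated as training data. The function names are hypothetical, and the actual periods are those listed in Table 4.

```python
from datetime import datetime, timedelta


def month_start(year, month, offset):
    """First day of the month `offset` months away from (year, month)."""
    index = year * 12 + (month - 1) + offset
    return datetime(index // 12, index % 12 + 1, 1)


def make_split(year, month, val_months=2, buffer_days=3):
    test_start = month_start(year, month, 0)
    test_end = month_start(year, month, 1)               # exclusive
    val_start = month_start(year, month, -val_months)
    val_end = test_start - timedelta(days=buffer_days)   # exclusive
    return {"val": (val_start, val_end), "test": (test_start, test_end)}


def assign(timestamp, split):
    """Label a target-station record as train, validation, buffer, or test."""
    val_start, val_end = split["val"]
    test_start, test_end = split["test"]
    if test_start <= timestamp < test_end:
        return "test"
    if val_start <= timestamp < val_end:
        return "validation"
    if val_end <= timestamp < test_start:
        return "buffer"   # dropped to avoid overlapping input windows
    return "train"


split = make_split(2023, 7)   # hypothetical test month: July 2023
print(split)
print(assign(datetime(2023, 6, 29, 12), split))
```

A three-day buffer matches the 72-hour input window, so no test window can contain hours that were also seen during validation.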

The training process follows a conventional supervised learning framework. In each iteration, the input sequence is fed forward through the network to generate predicted outputs. The difference between the predicted and observed values is computed, and the MSE is used as the loss function to evaluate prediction accuracy. The backpropagation algorithm is employed to propagate the error gradient from the output layer to the input layer. Through the chain rule, the partial derivatives of the loss function with respect to each model parameter are calculated. The Adam optimizer is then used to update the model weights based on the calculated gradients and a predefined learning rate. Training continues until the model’s performance on the validation set ceases to improve or the maximum number of training epochs is reached.
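The stopping rule at the end of this procedure can be sketched independently of any deep learning framework. In the sketch below, `run_epoch` is a hypothetical callback that trains for one epoch and returns the validation loss, and the `patience` of 10 epochs is an assumed value; only the cap of 400 epochs comes from the experimental settings.

```python
def train_with_early_stopping(run_epoch, max_epochs=400, patience=10):
    """Stop when the validation loss has not improved for `patience`
    consecutive epochs, or when `max_epochs` is reached."""
    best_loss = float("inf")
    best_epoch = None
    stale = 0
    epochs_run = 0
    for epoch in range(max_epochs):
        val_loss = run_epoch(epoch)
        epochs_run += 1
        if val_loss < best_loss:
            best_loss, best_epoch, stale = val_loss, epoch, 0
        else:
            stale += 1
            if stale >= patience:
                break
    return best_epoch, best_loss, epochs_run


def fake_epoch(epoch):
    """Stand-in for one training epoch: loss falls until epoch 5, then rises."""
    return (epoch - 5) ** 2 + 1.0


best_epoch, best_loss, epochs_run = train_with_early_stopping(fake_epoch)
print(best_epoch, best_loss, epochs_run)
```

In practice the model weights from `best_epoch` would be saved and restored before testing, so that the evaluated model is the one with the lowest validation loss.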

To evaluate the predictive performance of TAM-xLSTM in capturing fine-scale hydrological processes, such as short-term fluctuations and rapid changes in flow, this study selects two representative hydrological stations as key testing sites: Panghai Station in the Qingshui River Basin and Xinghua Station in the Duliujiang River Basin. Experiments are conducted under both wet season and dry season conditions, and the results are compared with those of several mainstream baseline models. The statistical properties of the hydrometeorological input variables of river flow, rainfall, and solar radiation at Panghai and Xinghua stations are summarized in Table 5. Detailed experimental results and performance comparisons are presented in Table 6.

Table 5 reports river flow (m³/s), hourly rainfall (mm), and solar radiation (W/m²) for Panghai on the Qingshui River and Xinghua on the Duliu River. The mean flow at Xinghua is 34.5 m³/s, about half of Panghai’s 76.1 m³/s, yet the maximum flow at Xinghua reaches 2192.6 m³/s, well above Panghai’s 1222.9 m³/s. The standard deviations are similar, at 90.9 and 97.9 m³/s for Xinghua and Panghai, respectively. The flow distribution at Xinghua exhibits markedly heavier tails, with skewness 11.6 and kurtosis 189.1, compared with 4.4 and 28.3 at Panghai, indicating a greater propensity for extreme floods. For rainfall, the mean hourly precipitation is 1.7 mm at Panghai and 1.4 mm at Xinghua, and the maximum is 101.2 mm at Panghai and 91.5 mm at Xinghua. Rainfall variability is high at both stations, but it is slightly more intermittent and bursty at Xinghua, with skewness 6.8 and kurtosis 60.1, compared with 5.9 and 45.7 at Panghai. Solar radiation is far less prone to extremes than flow and rainfall. Panghai shows a higher mean of 1334.8 W/m² and a higher maximum of 6548.4 W/m², while Xinghua records 903.4 and 4484.7 W/m². The radiation distributions at both stations are only moderately right-skewed and light-tailed, with skewness close to 0.9 and excess kurtosis of −0.5 to −0.6.
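As a rough guide to reading the shape statistics in Table 5, the sketch below computes skewness and excess kurtosis from central moments. It assumes population (biased) estimators and assumes that the reported kurtosis is excess kurtosis (a normal distribution scores 0), which the negative radiation values imply; the function name `shape_stats` and both sample series are hypothetical.

```python
def shape_stats(values):
    """Return (skewness, excess_kurtosis) from population central moments."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n   # variance
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    skewness = m3 / m2 ** 1.5
    excess_kurtosis = m4 / m2 ** 2 - 3.0
    return skewness, excess_kurtosis

# Hypothetical series: a steady base flow with a single flood-like spike
spiky = [5.0] * 99 + [500.0]
# Hypothetical series: evenly spread values with no extremes
flat = [1.0, 2.0, 3.0, 4.0, 5.0]

spiky_skew, spiky_kurt = shape_stats(spiky)
flat_skew, flat_kurt = shape_stats(flat)
```

A single rare spike is enough to push both statistics far above zero, which is the pattern seen in the flow and rainfall columns, whereas an evenly spread series yields zero skewness and negative excess kurtosis.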

The experimental results in Table 6 indicate that the Transformer and Informer models generally underperform the LSTM and GRU models across most evaluation metrics. The difference is particularly evident in the wet season RMSE and MAE values, and may be attributed to the stronger local temporal dependencies in the data and to the larger datasets that Transformers require to capture long-term dependencies effectively [25]. This suggests that recurrent architectures retain certain advantages in river flow forecasting. LSTM and GRU are in turn surpassed by TAM-xLSTM in every case and by xLSTM in most cases; the main exception is the wet season at Xinghua, where xLSTM has a higher RMSE (5.2450 m³/s) than LSTM (4.7250 m³/s) and GRU (4.6070 m³/s). In the prediction tasks for both wet and dry seasons at the Panghai and Xinghua stations, TAM-xLSTM achieves the best performance, with the lowest RMSE and MAE of all models, reflecting better generalization and higher prediction accuracy. TAM-xLSTM also attains the lowest Theil’s U statistic, indicating smaller deviations between predicted and observed values, and the highest NSE, indicating the closest fit to the observed data among all models tested. In summary, TAM-xLSTM captures the highly volatile river flow dynamics of the wet season with the greatest precision while maintaining stable performance under complex hydrological conditions. Even during the dry season, when flow fluctuations are weaker, TAM-xLSTM retains the best fit. These results indicate that the model improves the accuracy of hourly river flow prediction and captures dynamic hydrological variations better than traditional and mainstream deep learning models.
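For readers who wish to reproduce the kind of comparison reported in Table 6, the four metrics can be sketched as follows. RMSE, MAE, and NSE follow their standard definitions. Theil’s U exists in several variants; the bounded U1 form used here is an assumption and may differ from the formula applied in this study. The sample values are hypothetical.

```python
import math

def rmse(obs, pred):
    """Root mean squared error."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def mae(obs, pred):
    """Mean absolute error."""
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)

def nse(obs, pred):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(obs) / len(obs)
    num = sum((o - p) ** 2 for o, p in zip(obs, pred))
    den = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - num / den

def theil_u(obs, pred):
    """Theil's U in the U1 form (assumed variant), bounded between 0 and 1."""
    n = len(obs)
    num = math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / n)
    den = (math.sqrt(sum(p * p for p in pred) / n)
           + math.sqrt(sum(o * o for o in obs) / n))
    return num / den

# Hypothetical hourly flows (m³/s) and forecasts
observed = [12.0, 15.0, 40.0, 85.0, 60.0, 30.0]
forecast = [11.0, 16.0, 35.0, 80.0, 63.0, 31.0]
scores = {
    "RMSE": rmse(observed, forecast),
    "MAE": mae(observed, forecast),
    "Theil U": theil_u(observed, forecast),
    "NSE": nse(observed, forecast),
}
```

Lower values are better for RMSE, MAE, and Theil’s U, whereas higher values are better for NSE.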

During the wet season, river flow typically exhibits rapid changes and large discharge volumes. A comparison of the prediction results in Figures 9 and 10 reveals notable differences among the models. The LSTM model effectively captures the long-term dependencies within the time series. However, it is weaker at capturing abrupt variations in flow and at filtering high-frequency noise [26]. The Informer model surpasses both LSTM and Transformer in long-sequence prediction tasks but remains inferior to TAM-xLSTM in prediction accuracy and trend fitting. Among all models, TAM-xLSTM is the most sensitive to dynamic variations in the river flow time series and is particularly accurate in predicting abrupt variations in flow. This performance highlights the model’s robustness and reliability in forecasting during flood seasons.

In Guizhou Province, the presence of large dams imparts a distinct seasonal pattern to water resource regulation. Beginning in November each year, at the start of the dry season, dams typically store water for a period before releasing it in a concentrated manner to satisfy hydropower generation demands. Consequently, river flow during the dry season is generally stable and fluctuates within a narrower range. A comparison of the prediction results in Figures 11 and 12 reveals clear differences in model performance for dry season river flow forecasting. The Transformer model is more effective at capturing the stable flow trends but lacks sensitivity to low-amplitude variations, leading to considerable prediction errors for minor fluctuations [27]. The Informer model improves upon LSTM in representing the stable characteristics of the dry season flow sequence but responds with a delay to sudden rainfall events [28]. The TAM-xLSTM model not only captures the long-term attenuation patterns of river flow but also detects subtle fluctuations during the dry season. These results indicate its robustness and its advantages in dry season forecasting.

In addition to the deterministic metrics RMSE, MAE, Theil’s U, and NSE, this study carried out an uncertainty analysis using a bootstrap procedure to estimate 95% confidence intervals for each model and each season–station combination [29], as presented in Table 7 and Figure 13. The confidence intervals make it possible to evaluate not only the average prediction error but also the robustness of the models under repeated sampling. The results indicate that the proposed TAM-xLSTM model consistently achieves the lowest RMSE and MAE values at both the Panghai and Xinghua stations in the wet and dry seasons. Its MAE confidence intervals are the narrowest in all four season–station combinations, and its RMSE intervals are the narrowest at Xinghua, which indicates that the forecasts there are more stable as well as more accurate. For instance, at Panghai Station in the wet season, the RMSE of TAM-xLSTM is 16.95 m³/s, substantially lower than the 24.36 m³/s of LSTM and the 25.13 m³/s of Transformer. The corresponding RMSE interval (11.74 to 23.57 m³/s) is somewhat wider than those of LSTM (20.43 to 29.18 m³/s) and Transformer (21.30 to 29.78 m³/s) but lies at a lower error level, while the MAE interval (6.72 to 8.94 m³/s) is the tightest among all models. At Xinghua Station, TAM-xLSTM again delivers the smallest errors, with an RMSE of 2.91 m³/s in the wet season and 1.85 m³/s in the dry season, both with the narrowest confidence intervals. By contrast, the Transformer and Informer baselines produce larger RMSE and MAE values than TAM-xLSTM in every case. The error bar plots in Figure 13 further highlight these differences: TAM-xLSTM consistently appears in the lowest error region and, in most cases, also exhibits the smallest variability across resamples. Overall, the confidence intervals provide a more comprehensive and reliable comparison of model performance, and the analysis supports the accuracy and robustness of the proposed TAM-xLSTM framework relative to the baseline models.
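A percentile bootstrap of the kind summarized in Table 7 can be sketched as follows. The number of resamples, the percentile method, and the independent resampling of individual timesteps are all assumptions, since the text does not specify them; for strongly autocorrelated hourly series a block bootstrap may be more appropriate. The function names and sample values are hypothetical.

```python
import math
import random

def rmse(obs, pred):
    """Root mean squared error."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def bootstrap_ci(obs, pred, metric, n_boot=1000, alpha=0.05, seed=0):
    """Return (point_estimate, lower, upper) using a percentile bootstrap.

    Observation-prediction pairs are resampled with replacement, the metric
    is recomputed on each resample, and the alpha/2 and 1 - alpha/2
    percentiles of the resulting values form the interval.
    """
    rng = random.Random(seed)
    n = len(obs)
    stats = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        stats.append(metric([obs[i] for i in idx], [pred[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return metric(obs, pred), lower, upper

# Hypothetical observed and predicted flows (m³/s)
observed = [12.0, 15.0, 40.0, 85.0, 60.0, 30.0, 22.0, 18.0]
forecast = [11.0, 16.0, 35.0, 80.0, 63.0, 31.0, 20.0, 19.0]
point, low, high = bootstrap_ci(observed, forecast, rmse)
```

Because RMSE is dominated by the largest errors, its interval widens whenever a few flood-peak timesteps are drawn more or less often across resamples, whereas an MAE interval computed with the same function is less sensitive to those few timesteps.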

To further assess the robustness and generalization ability of the proposed TAM-xLSTM model under a broader range of hydrological conditions, an additional experiment was conducted using long-term daily data from the Zhijin hydrological station, located in Bijie City, Guizhou Province, China. This dataset spans 1975 to 2023 and includes daily precipitation, temperature, wind speed, air pressure, and water level measurements. Compared with the main hourly-scale dataset from the six stations in the Qiandongnan region, the Zhijin dataset has a much longer temporal coverage but a coarser temporal resolution. For this experiment, records from 1975 to 2010 were used for model training, records from 2011 to 2015 for validation, and records from 2016 to 2023 for testing. The same set of evaluation metrics (RMSE, MAE, Theil’s U, and NSE) was employed to ensure comparability. This supplementary analysis aims to verify whether the model maintains its predictive performance when applied to datasets with a different temporal resolution and longer-term hydrological variability.

Table 8 presents the comparative results of the different models for the Zhijin Station dataset. On the daily test data from 2016 to 2023, the TAM-xLSTM model achieved the best performance across all evaluation metrics. Its RMSE of 2.3380 m³/s is a 15.6% improvement over the next-best model, Informer (2.7705 m³/s). Its MAE of 1.0128 m³/s is the lowest among all models, demonstrating its ability to capture daily flow variations with less bias. Its Theil’s U of 0.2574 is the lowest among all models and well below the values for Transformer (0.3238) and GRU (0.3003), indicating a further reduction in the relative error between predictions and observations. Its NSE of 0.6393 is 29.5% higher than that of the next-best model, Informer (0.4935), and 56.7% higher than that of xLSTM (0.4080), demonstrating its robustness and generalization capability under long-term, multi-year climate conditions. In summary, despite differences in temporal resolution and feature composition between the Zhijin Station data and the main experiment, TAM-xLSTM maintained the best performance, supporting the model’s adaptability across different time scales and hydrological conditions. This result provides evidence for the model’s applicability to hydrological forecasting tasks at multiple spatiotemporal resolutions.
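The percentage improvements quoted for Zhijin Station can be checked directly against the values in Table 8. The helper below is a minimal sketch; the function name `relative_improvement` is hypothetical, and the direction flag distinguishes error metrics (lower is better) from NSE (higher is better).

```python
def relative_improvement(baseline, model, higher_is_better=False):
    """Percentage improvement of `model` over `baseline`."""
    if higher_is_better:
        return (model - baseline) / abs(baseline) * 100.0
    return (baseline - model) / abs(baseline) * 100.0

# Values taken from Table 8 (Zhijin Station, 2016-2023)
rmse_vs_informer = relative_improvement(2.7705, 2.3380)
nse_vs_informer = relative_improvement(0.4935, 0.6393, higher_is_better=True)
nse_vs_xlstm = relative_improvement(0.4080, 0.6393, higher_is_better=True)

print(f"RMSE vs Informer: {rmse_vs_informer:.1f}%")  # prints 15.6%
print(f"NSE vs Informer:  {nse_vs_informer:.1f}%")   # prints 29.5%
print(f"NSE vs xLSTM:     {nse_vs_xlstm:.1f}%")      # prints 56.7%
```

The size of a relative NSE gain depends strongly on which baseline is chosen, so it is worth stating the reference model alongside each percentage.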

As shown in Figure 14, all deep learning models capture the overall flow trend at Zhijin Station, but their performance differs during flood peaks. The values predicted by TAM-xLSTM are closer to the measured values during most flood peak periods, demonstrating a stronger peak response. The LSTM, GRU, and Transformer models underestimate or overestimate some flood peaks to a certain degree. The Informer model tracks fluctuations well during the dry season but lags slightly during the flood season. The enlarged view in Figure 14b shows the behavior of the models during a typical flood peak in more detail. The amplitude and phase of TAM-xLSTM and LSTM at multiple peak points are highly consistent with the measured flow. Although GRU and xLSTM track the trend, they show varying degrees of amplitude deviation at the peaks. The Transformer and Informer models reproduce the shape of the flood peaks, but their predicted peak heights are slightly lower than the measured values. Overall, TAM-xLSTM achieves the best balance between flood peak accuracy and dry season fitting stability, demonstrating strong generalization ability.

5. Discussion

This study demonstrates the effectiveness of the proposed TAM-xLSTM framework for hourly river flow forecasting in small- and medium-sized catchments. Across different stations and hydrological regimes, TAM-xLSTM consistently outperformed traditional LSTM, GRU, Transformer, and Informer baselines. The improvement was particularly evident during periods of rapid hydrological change, such as flood peaks and sharp recessions, underscoring the model’s ability to capture short-term fluctuations and complex nonlinear dependencies that conventional approaches often fail to represent. Several structural innovations contribute to the enhanced performance of TAM-xLSTM. The T-WaveNet module expands the temporal receptive field through dilated convolutions, enabling effective extraction of multi-scale dependencies without introducing spatial noise. In parallel, the CBAM-1D mechanism adaptively allocates attention across both channels and timesteps, strengthening the model’s sensitivity to critical temporal features such as abrupt rises or drops in flow. These enhancements allow TAM-xLSTM to achieve fine-scale hydrological responsiveness while maintaining stability in long sequences, thereby addressing the limitations of both recurrent and Transformer-based architectures in small catchments.
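The effect of dilation on the temporal receptive field can be made concrete with a short calculation. For a stack of one-dimensional convolutions with stride 1, the receptive field is 1 + (k − 1) · Σd, where k is the kernel size and d the dilation of each layer. The kernel size of 2 and the doubling dilation schedule below follow the original WaveNet design and are assumptions here, not the configuration of the T-WaveNet module; the 72-step window is taken from Table 3.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (in timesteps) of stacked stride-1 dilated 1D convolutions."""
    return 1 + (kernel_size - 1) * sum(dilations)

def dilations_to_cover(window, kernel_size=2):
    """Doubling dilation schedule (1, 2, 4, ...) extended until the window is covered."""
    dilations = []
    dilation = 1
    while receptive_field(kernel_size, dilations) < window:
        dilations.append(dilation)
        dilation *= 2
    return dilations

schedule = dilations_to_cover(72)              # assumed kernel size of 2
covered = receptive_field(2, schedule)
undilated = receptive_field(2, [1] * len(schedule))
```

With these assumed settings, seven dilated layers span 128 timesteps and therefore cover the full 72-hour window, whereas seven undilated layers of the same kernel size would span only 8 timesteps.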

The findings of this study are consistent with, and extend, previous research on data-driven streamflow forecasting. Earlier works using LSTM and GRU demonstrated the potential of recurrent neural networks to capture temporal dependencies, but their performance often deteriorated during highly nonlinear flood events [30]. Transformer-based methods such as Informer and Autoformer have improved long-sequence forecasting efficiency, yet their reliance on large datasets and their high computational demand limit their suitability for small catchments. More recently, the modular xLSTM architecture was introduced to enhance memory capacity and temporal modeling. The present study advances this line of research by incorporating dilated convolution and attention modules, showing that such hybrid designs are particularly effective in hydrological contexts characterized by rapid and localized variability. Beyond accuracy gains, TAM-xLSTM also exhibits narrower bootstrap confidence intervals than the baseline models in most cases. This reflects not only reduced prediction errors but also greater robustness, a feature of critical importance in operational forecasting, where uncertainty can be as consequential as the mean estimate. The combination of accuracy and robustness makes TAM-xLSTM especially valuable for real-time flood forecasting, where both false alarms and missed warnings carry substantial risks.

Although this study focuses on the Qiandongnan region, TAM-xLSTM is essentially a data-driven framework that can be applied to watersheds with different climatic and topographic conditions. When transferred to other basins, the model can be retrained or fine-tuned using local meteorological and hydrological data to capture region-specific flow dynamics. The modular design of TAM-xLSTM further supports its adaptability to diverse temporal patterns and hydrological responses, reinforcing its potential for broader transferability.

For practical deployment, TAM-xLSTM can be integrated into real-time flood warning systems by linking with automated meteorological and hydrological monitoring platforms. Its relatively low inference cost allows hourly forecasts to be generated on standard computational infrastructure or cloud servers. When predicted flows exceed predefined thresholds, the system can automatically issue flood alerts to local water resource authorities. Conversely, forecasts below critical levels may trigger reservoir filling operations for hydropower generation. Beyond technical deployment, the model has direct policy relevance: more reliable and timely forecasts can strengthen disaster risk reduction strategies, improve emergency preparedness, and guide reservoir operation policies on water allocation, hydropower scheduling, and ecological flow regulation.
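The threshold logic described above can be sketched in a few lines. This is an illustration only: the function name, the action labels, and the threshold values are hypothetical, and operational thresholds would be set by the responsible water resource authority for each station.

```python
def forecast_actions(forecast_flows, flood_threshold, low_flow_threshold):
    """Map each hourly forecast (m³/s) to a suggested action label.

    Returns a list of (lead_hour, action) tuples, with lead hours starting at 1.
    """
    actions = []
    for hour, flow in enumerate(forecast_flows, start=1):
        if flow >= flood_threshold:
            actions.append((hour, "flood_alert"))
        elif flow <= low_flow_threshold:
            actions.append((hour, "consider_reservoir_filling"))
        else:
            actions.append((hour, "no_action"))
    return actions

# Hypothetical six-hour forecast and thresholds for one station
forecast = [18.0, 45.0, 310.0, 820.0, 640.0, 95.0]
actions = forecast_actions(forecast, flood_threshold=800.0, low_flow_threshold=20.0)
alert_hours = [hour for hour, action in actions if action == "flood_alert"]
```

In practice, such a rule would more likely be applied to a forecast interval than to a single point forecast, so that alerts reflect forecast uncertainty.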

Nevertheless, several limitations should be acknowledged. First, this study focused on short-term, hourly forecasting using high-frequency meteorological and hydrological data and did not include comparisons with physically based hydrological models such as SWAT or HEC-HMS. These models are designed for long-term simulations and require extensive calibration, which was beyond the scope of this work. Future research could explore hybrid approaches that integrate physical and data-driven models to leverage their respective strengths. Second, although TAM-xLSTM performed well at multiple basins in Guizhou Province, the validation is geographically limited and does not cover diverse climatic zones or extreme real-time flood events. Broader evaluation across different regions and climates is needed to confirm generalizability. Third, flows at some stations are strongly influenced by reservoir operations. Due to the lack of detailed dam operation records, such effects were not explicitly incorporated, which limits applicability in regulated basins. Future work should attempt to integrate dam operation data to enhance predictive accuracy and reliability in such contexts.

6. Conclusions

This study proposed and evaluated an enhanced temporal attention module xLSTM (TAM-xLSTM) model for hourly river flow forecasting. The main conclusions are as follows:

(1) Model development: TAM-xLSTM integrates a T-WaveNet module with a CBAM-1D attention mechanism, enhancing the ability of xLSTM to capture multi-scale temporal dependencies and emphasize critical timestep features.
(2) Performance: Across multiple hydrological stations and both wet and dry seasons, TAM-xLSTM consistently outperformed baseline models (LSTM, GRU, Transformer, Informer, and xLSTM) in terms of accuracy, robustness, and stability.
(3) Generalization: An extended case study at Zhijin Station with nearly 50 years of data demonstrated that TAM-xLSTM maintains strong predictive skill for long-term hydrological variability and flood events, highlighting its transferability to different contexts.
(4) Practical implications: With relatively low computational cost, TAM-xLSTM is suitable for integration into real-time flood forecasting and water resource management systems, offering direct relevance for disaster risk reduction, reservoir operation, and policy support.
(5) Limitations and future work: Current experiments are geographically limited to Guizhou Province and do not explicitly incorporate dam operation data. Future research should expand testing to diverse climatic zones, include regulated basins, and explore hybrid approaches combining data-driven and physically based models.

In conclusion, TAM-xLSTM provides an effective and practical deep learning solution for fine-scale river flow forecasting, with strong potential for both scientific advancement and operational application.

Author Contributions

Conceptualization, R.L. and X.X.; methodology, D.W. and L.W.; software, Z.Y.; validation, R.L., L.W. and Z.Y.; formal analysis, C.C.; investigation, Z.Y.; resources, X.X.; data curation, L.W.; writing—original draft preparation, R.L. and D.W.; writing—review and editing, R.L. and D.W.; visualization, D.W.; supervision, C.C.; project administration, C.C.; funding acquisition, C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Projects of the Joint Meteorological Fund of the Natural Science Foundation of Hubei Province (No. 2022CFD017) and the Guizhou Meteorological Service Center (No. 24SWQXZ034).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Chi Cheng was employed by the Hubei Meteorological Service Center. Author Xiaoling Xia was employed by the Guizhou Meteorological Service Center. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1. Tsegaw, A.T.; Alfredsen, K.; Skaugen, T.; Muthanna, T.M. Predicting hourly flows at ungauged small rural catchments using a parsimonious hydrological model. J. Hydrol. 2019, 573, 855–871.
2. Wang, Y.; Liu, R.; Guo, L.; Tian, J.; Zhang, X.; Ding, L.; Wang, C.; Shang, Y. Forecasting and providing warnings of flash floods for ungauged mountainous areas based on a distributed hydrological model. Water 2017, 9, 776.
3. Chathuranika, I.M.; Gunathilake, M.B.; Baddewela, P.K.; Sachinthanie, E.; Babel, M.S.; Shrestha, S.; Jha, M.K.; Rathnayake, U.S. Comparison of two hydrological models, HEC-HMS and SWAT in runoff estimation: Application to Huai Bang Sai Tropical Watershed, Thailand. Fluids 2022, 7, 267.
4. Kannan, N.; Santhi, C.; White, M.J.; Mehan, S.; Arnold, J.G.; Gassman, P.W. Some challenges in hydrologic model calibration for large-scale studies: A case study of SWAT model application to Mississippi-Atchafalaya River Basin. Hydrology 2019, 6, 17.
5. Liu, C.; Xu, J.; Li, X.A.; Yu, Z.; Wu, J. Water resource forecasting with machine learning and deep learning: A scientometric analysis. Artif. Intell. Geosci. 2024, 5, 100084.
6. Zanial, W.N.C.W.; Malek, M.B.A.; Reba, M.N.M.; Zaini, N.; Ahmed, A.N.; Sherif, M.; Elshafie, A. River flow prediction based on improved machine learning method: Cuckoo Search-Artificial Neural Network. Appl. Water Sci. 2023, 13, 28.
7. Zhang, X.; Wang, R.; Wang, W.; Zheng, Q.; Ma, R.; Tang, R.; Wang, Y. Runoff prediction using combined machine learning models and signal decomposition. J. Water Clim. Chang. 2025, 16, 230–247.
8. Kumar, A.; Kumar, P.; Singh, V.K. Evaluating different machine learning models for runoff and suspended sediment simulation. Water Resour. Manag. 2019, 33, 1217–1231.
9. Räuker, T.; Ho, A.; Casper, S.; Hadfield-Menell, D. Toward transparent AI: A survey on interpreting the inner structures of deep neural networks. In Proceedings of the 2023 IEEE Conference on Secure and Trustworthy Machine Learning (SATML), Raleigh, NC, USA, 8–10 February 2023; pp. 464–483.
10. Waqas, M.; Humphries, U.W. A critical review of RNN and LSTM variants in hydrological time series predictions. MethodsX 2024, 13, 102946.
11. Lima, M.; Deck, K.; Dunbar, O.R.; Schneider, T. Toward Routing River Water in Land Surface Models with Recurrent Neural Networks. arXiv 2024, arXiv:2404.14212.
12. Hu, C.; Wu, Q.; Li, H.; Jian, S.; Li, N.; Lou, Z. Deep learning with a long short-term memory networks approach for rainfall-runoff simulation. Water 2018, 10, 1543.
13. Weißenborn, M.; Breuer, L.; Houska, T. Neural networks in catchment hydrology: A comparative study of different algorithms in an ensemble of ungauged basins in Germany. Hydrol. Earth Syst. Sci. Discuss. 2024, 2024, 1–50.
14. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017.
15. Su, L.; Zuo, X.; Li, R.; Wang, X.; Zhao, H.; Huang, B. A systematic review for transformer-based long-term series forecasting. Artif. Intell. Rev. 2025, 58, 80.
16. Zhou, H.; Zhang, S.; Peng, J.; Zhang, S.; Li, J.; Xiong, H.; Zhang, W. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2–9 February 2021; Volume 35, pp. 11106–11115.
17. Wu, H.; Xu, J.; Wang, J.; Long, M. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Adv. Neural Inf. Process. Syst. 2021, 34, 22419–22430.
18. Tay, Y.; Dehghani, M.; Bahri, D.; Metzler, D. Efficient transformers: A survey. ACM Comput. Surv. 2022, 55, 109.
19. Beck, M.; Pöppel, K.; Spanring, M.; Auer, A.; Prudnikova, O.; Kopp, M.; Klambauer, G.; Brandstetter, J.; Hochreiter, S. xlstm: Extended long short-term memory. arXiv 2024, arXiv:2405.04517.
20. Van Den Oord, A.; Dieleman, S.; Zen, H.; Simonyan, K.; Vinyals, O.; Graves, A.; Kalchbrenner, N.; Senior, A.; Kavukcuoglu, K. Wavenet: A generative model for raw audio. arXiv 2016, arXiv:1609.03499.
21. Yang, T.; Chen, X.; Xu, C.Y.; Zhang, Z.C. Spatio-temporal changes of hydrological processes and underlying driving forces in Guizhou region, Southwest China. Stoch. Environ. Res. Risk Assess. 2009, 23, 1071–1087.
22. Wu, Z.; Pan, S.; Long, G.; Jiang, J.; Zhang, C. Graph WaveNet for deep spatial-temporal graph modeling. arXiv 2019, arXiv:1906.00121.
23. Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19.
24. Gupta, H.V.; Kling, H.; Yilmaz, K.K.; Martinez, G.F. Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling. J. Hydrol. 2009, 377, 80–91.
25. Zhou, H.; Li, J.; Zhang, S.; Zhang, S.; Yan, M.; Xiong, H. Expanding the prediction capacity in long sequence time-series forecasting. Artif. Intell. 2023, 318, 103886.
26. Zou, Y.; Wang, J.; Lei, P.; Li, Y. A novel multi-step ahead forecasting model for flood based on time residual LSTM. J. Hydrol. 2023, 620, 129521.
27. Piao, X.; Chen, Z.; Murayama, T.; Matsubara, Y.; Sakurai, Y. Fredformer: Frequency debiased transformer for time series forecasting. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Barcelona, Spain, 25–29 August 2024; pp. 2400–2410.
28. Shang, J.; Zhao, B.; Hua, H.; Wei, J.; Qin, G.; Chen, G. Application of informer model based on SPEI for drought forecasting. Atmosphere 2023, 14, 951.
29. Zhang, A.; Shi, H.; Li, T.; Fu, X. Analysis of the Influence of Rainfall Spatial Uncertainty on Hydrological Simulations Using the Bootstrap Method. Atmosphere 2018, 9, 71.
30. Xiang, Z.; Yan, J.; Demir, I. A Rainfall–Runoff Model with LSTM-Based Sequence-to-Sequence Learning. Water Resour. Res. 2020, 56, e2019WR025326.

Figure 1. Method roadmap.

Figure 2. Geographical distribution of Southeastern Guizhou Province and locations of hydrological stations.

Figure 3. Correlation matrix heatmap.

Figure 4. Architecture of the T-WaveNet module.

Figure 5. The channel attention mechanism and temporal attention mechanism in the CBAM-1D module.

Figure 6. Channel attention module.

Figure 7. Temporal attention module.

Figure 8. TAM-xLSTM network architecture.

Figure 9. Flow prediction results of different models for Panghai Station during the wet season. (a) Overall prediction results, with the grey area indicating a critical period corresponding to heavy rainfall; (b) Enlarged view of the grey area in (a), highlighting model performance during this key period.

Figure 10. Flow prediction results of different models for Xinghua Station during the wet season. (a) Overall prediction results, with the grey area indicating a critical period corresponding to heavy rainfall; (b) Enlarged view of the grey area in (a), highlighting model performance during this key period.

Figure 11. Flow prediction results of different models for Panghai Station during the dry season. (a) Overall prediction results, with the grey area indicating a critical period corresponding to heavy rainfall; (b) Enlarged view of the grey area in (a), highlighting model performance during this key period.

Figure 12. Flow prediction results of different models for Xinghua Station during the dry season. (a) Overall prediction results, with the grey area indicating a critical period corresponding to heavy rainfall; (b) Enlarged view of the grey area in (a), highlighting model performance during this key period.

Figure 13. Prediction performance of models at Panghai and Xinghua stations.

Figure 14. Flow prediction results of different models for Zhijin Station. (a) Overall prediction results, with the grey area indicating a critical period corresponding to heavy rainfall; (b) Enlarged view of the grey area in (a), highlighting model performance during this key period.

Table 1. Data sources.

Data Type	Source	Resolution
Flow Observation Data	Guizhou Meteorological Service Center	Hourly
Digital Elevation Model (DEM)	Geospatial Data Cloud	12 m
Precipitation Data	Guizhou Meteorological Service Center	3 km
Surface Solar Radiation Data	FY-4A Satellite	4 km

Table 2. Dataset feature description with unit and frequency.

Feature	Unit	Frequency
Max_PRE	mm	Hourly
SSI	W/m²	Hourly
Total_PRE	mm	Daily
Rainy_Area	km²	Hourly
Average_PRE	mm	Hourly
Average_Rainy_PRE	mm	Hourly
Flow	m³/s	Hourly
Water_Level	m	Hourly
Area	km²	Static
Avg_slope	degree	Static
Stream_length	km	Static
Stream_density	km/km²	Static

Table 3. Experimental environment and parameter settings.

Name	Parameter Value
Processor	i5-12400F
GPU	NVIDIA GeForce RTX 4060
CUDA Version	11.0
Deep Learning Framework	PyTorch 2.0.1
Epoch	400
Batch Size	1024
Window Size	72
Optimizer	Adam
Learning Rate	0.001

Table 4. Validation and test set splits for Panghai and Xinghua stations.

Station	Time	Validation Set	Test Set
Panghai	Wet season	1 May 2023–27 June 2023	July 2023
Panghai	Dry season	1 September 2023–28 October 2023	November 2023
Xinghua	Wet season	1 May 2023–27 June 2023	July 2023
Xinghua	Dry season	1 September 2023–28 October 2023	November 2023

Table 5. Statistical indicators of hydrometeorological variables at test stations.

Statistic	Qingshui River (Panghai)			Duliu River (Xinghua)
Statistic	Flow (m³/s)	Hourly Rainfall (mm)	Radiation (W/m²)	Flow (m³/s)	Hourly Rainfall (mm)	Radiation (W/m²)
Mean	76.1	1.7	1334.8	34.5	1.4	903.4
Std	97.9	6.2	1850.7	90.9	5.3	1254.8
Minimum	5.0	0.0	0.0	1.3	0.0	0.0
Maximum	1222.9	101.2	6548.4	2192.6	91.5	4484.7
Skewness	4.4	5.9	0.9	11.6	6.8	0.9
Kurtosis	28.3	45.7	−0.5	189.1	60.1	−0.6

Table 6. Experimental results of Panghai Station and Xinghua Station.

Station	Time	Model	RMSE (m³/s)	MAE (m³/s)	Theil’s U	NSE
Panghai	Wet season	LSTM	20.4164	10.8265	0.1501	0.7522
		GRU	19.0571	10.7945	0.1386	0.7841
		Transformer	22.2692	14.9595	0.1691	0.7052
		Informer	22.7087	12.3377	0.1646	0.6935
		xLSTM	18.7458	10.1789	0.1357	0.7911
		TAM-xLSTM	16.9448	7.7212	0.1267	0.8293
Panghai	Dry season	LSTM	15.8507	11.0454	0.2923	0.3323
		GRU	13.4720	8.8611	0.2185	0.5198
		Transformer	16.6102	13.9560	0.3394	0.2647
		Informer	14.7152	10.1196	0.2746	0.4271
		xLSTM	11.8527	7.5940	0.2098	0.6283
		TAM-xLSTM	10.2665	5.5373	0.1907	0.7211
Xinghua	Wet season	LSTM	4.7250	3.3485	0.1231	0.7732
		GRU	4.6070	3.0474	0.1174	0.7844
		Transformer	6.5935	4.7392	0.1798	0.5583
		Informer	7.2346	5.1425	0.1967	0.4683
		xLSTM	5.2450	3.0300	0.1378	0.7205
		TAM-xLSTM	3.6826	2.8286	0.0958	0.8622
Xinghua	Dry season	LSTM	4.3653	1.9254	0.2669	0.3822
		GRU	4.3237	1.9188	0.2615	0.3965
		Transformer	4.9178	2.5217	0.3009	0.3584
		Informer	2.9705	1.6050	0.2157	0.4033
		xLSTM	3.3514	1.7825	0.2351	0.3912
		TAM-xLSTM	1.8516	1.3886	0.1371	0.4309

Table 7. Model prediction performance in different seasons at Panghai and Xinghua Station.

Station	Time	Model	RMSE (m³/s)	RMSE Lower (m³/s)	RMSE Upper (m³/s)	MAE (m³/s)	MAE Lower (m³/s)	MAE Upper (m³/s)
Panghai	Wet season	LSTM	24.361	20.430	29.184	14.458	13.004	15.986
		GRU	23.055	17.439	28.694	12.814	11.473	14.381
		Transformer	25.130	21.295	29.783	18.293	17.045	19.658
		Informer	22.703	18.334	28.343	12.331	10.942	13.889
		xLSTM	18.746	14.069	24.884	10.179	9.106	11.475
		TAM-xLSTM	16.945	11.735	23.567	7.721	6.718	8.940
Panghai	Dry season	LSTM	15.851	14.760	16.931	11.045	10.155	11.913
		GRU	15.266	13.937	16.629	10.628	9.808	11.480
		Transformer	16.610	15.921	17.333	13.956	13.276	14.655
		Informer	15.752	14.775	16.696	11.141	10.295	11.977
		xLSTM	11.210	10.059	12.278	6.971	6.274	7.676
		TAM-xLSTM	10.251	8.988	11.498	5.518	4.876	6.198
Xinghua	Wet season	LSTM	4.725	4.376	5.052	3.349	3.092	3.613
		GRU	4.607	4.201	5.014	3.047	2.776	3.301
		Transformer	6.593	6.165	7.006	4.739	4.390	5.079
		Informer	7.235	6.668	7.866	5.143	4.777	5.544
		xLSTM	5.245	4.079	6.610	2.829	2.523	3.183
		TAM-xLSTM	2.914	2.666	3.169	2.095	1.945	2.246
Xinghua	Dry season	LSTM	4.365	3.563	5.090	1.925	1.629	2.247
		GRU	3.467	2.896	4.045	1.609	1.390	1.866
		Transformer	2.541	2.315	2.772	1.506	1.358	1.660
		Informer	2.520	2.256	2.778	1.411	1.253	1.572
		xLSTM	3.465	3.184	3.768	2.881	2.743	3.030
		TAM-xLSTM	1.852	1.713	1.990	1.389	1.298	1.482

Table 8. Experimental results for Zhijin Station.

Model	Time	RMSE (m³/s)	MAE (m³/s)	Theil’s U	NSE
LSTM	2016–2023	2.8508	1.2334	0.2964	0.4637
GRU	2016–2023	2.7802	1.0307	0.3003	0.4899
Transformer	2016–2023	2.8065	1.1623	0.3238	0.4802
Informer	2016–2023	2.7705	1.0503	0.2983	0.4935
xLSTM	2016–2023	2.9953	1.8765	0.2973	0.4080
TAM-xLSTM	2016–2023	2.3380	1.0128	0.2574	0.6393
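
For readers who want to compute the same four columns on their own forecasts, the following is a minimal sketch using the standard textbook definitions of RMSE, MAE, and NSE. The Theil's U variant shown is the bounded U1 form, which is an assumption: this section does not state which variant the authors used. All function names and the toy series are hypothetical and are not taken from the paper.

```python
import math
from typing import Sequence


def rmse(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Root mean square error between observed and simulated flow."""
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(obs, sim)) / len(obs))


def mae(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Mean absolute error between observed and simulated flow."""
    return sum(abs(o - s) for o, s in zip(obs, sim)) / len(obs)


def nse(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(obs) / len(obs)
    residual = sum((o - s) ** 2 for o, s in zip(obs, sim))
    variance = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - residual / variance


def theil_u1(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Theil's U1 (assumed variant): RMSE scaled by the root mean squares of both series."""
    n = len(obs)
    scale = math.sqrt(sum(o * o for o in obs) / n) + math.sqrt(sum(s * s for s in sim) / n)
    return rmse(obs, sim) / scale


# Toy hourly flow values in m³/s, invented purely for illustration.
observed = [10.0, 12.0, 15.0, 11.0]
simulated = [11.0, 12.0, 14.0, 10.0]

metrics = {
    "RMSE": rmse(observed, simulated),
    "MAE": mae(observed, simulated),
    "Theil's U": theil_u1(observed, simulated),
    "NSE": nse(observed, simulated),
}
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Lower values are better for RMSE, MAE, and Theil's U, while NSE improves as it approaches 1, which is the reading direction used in the tables above.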

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
