Spatiotemporal Forecasting of Seismic Activity Trends Using Wiener Filtering and Artificial Neural Networks

Ren, Pengfei; Li, Peijia; Chen, Xiaoyang; Gu, Tingkai; Song, Xiaoyu; Wang, Cong; Yan, Kai

doi:10.3390/math14101756

Open AccessArticle

Spatiotemporal Forecasting of Seismic Activity Trends Using Wiener Filtering and Artificial Neural Networks

by

Pengfei Ren

,

Peijia Li

,

Xiaoyang Chen

,

Tingkai Gu

,

Xiaoyu Song

,

Cong Wang

^* and

Kai Yan

Key Lab of Building Structural Retrofitting and Underground Space Engineering of the Ministry of Education, Shandong Jianzhu University, Jinan 250101, China

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(10), 1756; https://doi.org/10.3390/math14101756

Submission received: 15 April 2026 / Revised: 13 May 2026 / Accepted: 15 May 2026 / Published: 20 May 2026

(This article belongs to the Special Issue Advances of Mathematics and Artificial Intelligence in Engineering Applications)

Download

Browse Figures

Versions Notes

Abstract

Reliable forecasting of seismic activity trends is essential for regional seismic hazard analysis. Based on earthquake catalogs from 1500 to 2026, this study investigates the spatiotemporal evolution of seismic activity in the North-South Seismic Belt using a hybrid framework that integrates Wiener filtering and artificial neural networks. Seismic activity is modeled as a discrete-time stochastic process, and a time series of earthquakes with magnitudes ≥ 6.0 is constructed. Wiener filtering is applied to establish an optimal linear relationship between input and output under the minimum mean square error criterion, and multi-origin extrapolation is employed to predict earthquakes with magnitudes ≥ 7.0 over the next century. The results reveal several stable peaks or peak clusters that agree well with historical strong earthquakes, with prediction errors generally within approximately three years. Sensitivity analyses indicate that longer time series (∼500 years) and higher threshold magnitudes (≥6.0) enhance prediction stability, although the method shows limitations in spatial prediction. To address this issue, a 16–8–4 artificial neural network model is developed, and seismic sequence features are extracted using a sliding time window approach to perform both temporal and spatial forecasting. The artificial neural network achieves high accuracy in temporal prediction (maximum error ≈ 0.5) and outperforms Wiener filtering in spatial prediction, capturing the migration characteristics of seismic activity. The results further suggest that earthquakes with magnitudes ≥ 7.0 are more likely to occur within the latitude range of 30.5–33.0° N in the near future.

Keywords:

Wiener filtering; artificial neural networks; seismic activity trend forecasting; spatiotemporal analysis; North-South Seismic Belt

MSC:

86A15; 68T07

1. Introduction

Earthquakes, as one of the most destructive natural hazards, pose persistent threats to human life, critical infrastructure, and socio-economic development due to their sudden occurrence and high uncertainty. Despite significant advances in seismic monitoring and observational technologies, accurately predicting the timing, location, and magnitude of earthquakes remains a major unresolved challenge in Earth sciences [1]. In this context, earthquake forecasting has become an important research focus and plays a critical role in seismic ground motion zonation and the development of seismic design standards.

To mitigate earthquake risks, seismic zonation has been widely implemented at national, regional, and urban scales, providing a scientific basis for engineering design by classifying seismic hazard levels across different regions. In China, the seismic zoning map released in 1977 introduced the concept of basic seismic intensity, defined as the maximum intensity expected within a 100-year period, which has since guided the seismic design of industrial and civil structures. The development of seismic zonation maps relies fundamentally on long-term earthquake forecasting, in which seismic regions and belts are delineated to evaluate and predict future seismic activity trends.

Seismic hazard assessment constitutes the core methodology of seismic zonation, among which Probabilistic Seismic Hazard Analysis has become the dominant framework worldwide. However, persistent debates remain regarding its reliability [2]. In recent years, several destructive earthquakes have occurred in regions previously classified as low hazard, resulting in severe casualties and economic losses [3]. Representative examples include the 2011 M_w 9.0 Tōhoku earthquake in Japan and the 2008 M_w 8.0 Wenchuan earthquake in China [4,5]. These events reveal that traditional hazard assessment methods may underestimate extreme events, highlighting the need for complementary approaches capable of capturing the evolution of seismic activity trends.

Traditional earthquake forecasting methods are primarily founded on statistical seismology models and empirical relationships, such as Omori’s law and the Epidemic-Type Aftershock Sequence (ETAS) model. These approaches are capable of extracting spatiotemporal characteristics from earthquake catalogs and constructing probabilistic seismicity forecasting frameworks [6,7]. Ebrahimian et al. [8,9] proposed a probabilistic forecasting framework based on a Bayesian spatio-temporal ETAS model. By incorporating Bayesian inference and Markov Chain Monte Carlo simulation techniques, their framework enables adaptive updating of seismicity forecasts as new earthquake events become available. Compared with conventional maximum likelihood estimation approaches, the proposed method simultaneously accounts for both the uncertainty in ETAS model parameters and the stochasticity of future seismic sequences, thereby improving the robustness and reliability of short-term seismicity forecasting. In addition, smoothed seismicity models have been extensively employed in long-term earthquake forecasting and have demonstrated promising performance in regional forecasting experiments [10,11].

Despite their strong physical interpretability and rigorous theoretical foundations, statistical seismicity models generally rely on linear or weakly nonlinear assumptions. Consequently, their capability in characterizing the highly nonlinear evolutionary behavior and multi-scale spatiotemporal coupling characteristics of seismic activity remains limited. In contrast, data-driven approaches exhibit stronger capability in extracting nonlinear relationships and hidden spatiotemporal patterns from large-scale seismic datasets, thereby providing new opportunities for earthquake forecasting research.

With the rapid advancement of artificial intelligence techniques, recurrent neural networks (RNN) and Long Short-Term Memory (LSTM)-based frameworks have attracted increasing attention in seismic forecasting owing to their superior capability in modeling long-term temporal dependencies in time-series data. Berhich et al. [12] proposed an attention-based LSTM framework for predicting the timing, magnitude, and location of large earthquakes using the Japan earthquake catalog spanning 1900–2021. Their results demonstrated that the proposed framework achieved significantly improved prediction performance compared with conventional empirical scenarios, highlighting the considerable potential of deep learning approaches for large-earthquake forecasting.

Asim et al. [13] employed multiple machine learning approaches, including Pattern Recognition Neural Networks (PRNN), RNN, Random Forest (RF), and Linear Programming Boost (LPBoost), to predict earthquakes with magnitudes ≥ 5.5 in the Hindukush region using eight seismicity indicators derived from historical earthquake catalogs. Their results demonstrated that machine learning models can effectively capture nonlinear relationships between seismicity indicators and earthquake occurrence, thereby providing encouraging performance in earthquake forecasting applications. These findings further suggest that machine learning methods possess strong capability in identifying hidden patterns embedded in complex seismic sequences.

Nevertheless, the majority of previous studies have mainly concentrated on temporal forecasting while paying relatively limited attention to the spatial migration characteristics of seismic activity. To address this issue, Yousefzadeh et al. [14] proposed a spatiotemporally explicit earthquake forecasting framework based on Deep Neural Networks (DNNs), Support Vector Machines (SVMs), Decision Trees (DTs), and Shallow Neural Networks (SNNs). Their study introduced a spatial parameter termed “Fault Density,” derived from Kernel Density Estimation and Bivariate Moran’s I statistics, to characterize the spatial correlations between active faults and seismic activity. The results demonstrated that incorporating spatial parameters can significantly improve forecasting performance, particularly for high-magnitude earthquakes.

Although deep learning frameworks exhibit enhanced nonlinear fitting capability and strong feature extraction performance, their prediction accuracy remains highly dependent on data quality, catalog completeness, feature engineering strategies, and network architecture design. Wu et al. [10] applied an adaptively smoothed seismicity model to the eastern Tibetan Plateau for long-term probabilistic forecasting of earthquakes with M ≥ 5.0, and found that models based on longer input catalogs yield improved performance due to the increased number of earthquake samples. Similarly, Mousavi et al. [15] emphasized that more complete earthquake catalogs can significantly enhance the performance of both statistical and physics-based forecasting methods, as well as artificial intelligence-based approaches. Furthermore, Zhang et al. [16] proposed an end-to-end machine learning-based workflow for high-precision earthquake location, enabling the efficient and automated construction of high-quality earthquake catalogs from continuous seismic data. These findings collectively highlight the importance of high-quality and long-duration earthquake catalogs for improving forecasting reliability, and further motivate the adoption of advanced data-driven methods capable of capturing the nonlinear characteristics of seismic activity.

Recent studies have highlighted the critical roles of data completeness and methodological advancement in improving earthquake forecasting performance. With the rapid development of dense seismic networks and automated data processing techniques, earthquake catalogs have become increasingly comprehensive and accurate, providing a robust foundation for advanced analyses [15,17,18,19]. In this context, machine learning approaches have been widely introduced into earthquake forecasting due to their strong capabilities in nonlinear modeling and pattern recognition [15,18,19,20,21,22,23,24]. In particular, Zhang et al. [20] proposed a fully convolutional network model that uses spatial maps of the logarithm of past released seismic energy as input to forecast future earthquakes. Applied to California, the model achieved predictive performance comparable to an enhanced ETAS model, while offering improved computational efficiency. Similar conclusions were reported by Stockman et al. [21], further demonstrating the potential of neural networks in earthquake forecasting. Moreover, Liu et al. [22] attempted to incorporate more comprehensive spatiotemporal prior knowledge into DNNs, with the aim of improving the prediction accuracy of earthquake magnitude and epicenter location. Despite these advances, most existing studies focus on individual methods, with limited attention to the complementarity between different approaches. In particular, the joint characterization of temporal evolution and spatial migration of seismic activity remains insufficiently explored, highlighting the need for integrated frameworks that combine multiple modeling paradigms.

China is located in one of the most seismically active regions in the world, characterized by a complex tectonic setting and frequent earthquake hazards. After decades of development, a multi-level earthquake forecasting framework has been established, encompassing long-term, intermediate-term, and short-term predictions [25,26]. Among the major seismic zones in China, the North-South Seismic Belt is one of the most active regions for strong earthquakes, exhibiting pronounced spatiotemporal patterns and staged seismic activity. This makes it an ideal natural laboratory for investigating seismic activity trend forecasting.

Motivated by the above considerations, this study investigates seismic activity trends in the North-South Seismic Belt using historical earthquake catalog data. A hybrid framework integrating Wiener filtering and artificial neural networks is proposed to jointly analyze temporal evolution and spatial migration. Specifically, Wiener filtering is employed to characterize the linear temporal evolution of seismic activity, while artificial neural networks are used to capture nonlinear spatiotemporal patterns. Based on this framework, a coupled spatiotemporal forecasting approach is developed. Compared with existing studies, the main contributions of this work are threefold: (1) seismic activity trends are systematically characterized from both temporal and spatial perspectives, overcoming the limitations of single-dimensional analyses; (2) the complementary advantages of linear filtering and nonlinear learning are integrated to improve forecasting reliability; and (3) a sliding time window-based feature extraction strategy is proposed to enhance model stability and robustness. The proposed framework provides new insights for seismic hazard assessment and offers a useful reference for seismic zonation and disaster risk mitigation.

2. Fundamental Principles of Seismic Activity Trend Forecasting

Earthquake occurrence is influenced by multiple stochastic factors and exhibits pronounced randomness in both temporal and spatial distributions. Nevertheless, seismic activity within the same seismic belt often shows a certain degree of correlation. By analyzing the correlation characteristics of earthquake time series, the intrinsic relationships among seismic events can be revealed, providing a basis for seismic activity trend forecasting. In this study, the Wiener filtering method and artificial neural networks are employed to identify the evolutionary patterns of seismic activity and to conduct forecasting of seismic activity trends.

2.1. Principles of Wiener Filtering-Based Seismic Activity Trend Forecasting

Filtering techniques constitute a fundamental component of signal analysis and processing and play a key role in extracting reliable information from noisy observations. With the advancement of automated systems and control theory, time series analysis has evolved beyond traditional signal transmission applications toward broader domains, including forecasting and information extraction. Wiener [27] systematically investigated the correlation structure and spectral characteristics of stationary stochastic processes and established the theoretical foundation of Wiener filtering and prediction by proposing methods to extract useful signals from noisy sequences.

Earthquake occurrences exhibit distinct characteristics across different magnitude levels, and early earthquake catalogs often suffer from incomplete records of moderate and small events. Previous studies have shown that large earthquakes (M ≥ 7.0) are typically preceded by several events with magnitudes ≥ 5.6 [28]. Based on this observation, a time series of earthquakes with magnitudes ≥ 6.0 is selected in this study to characterize the correlation with large earthquakes (M ≥ 7.0). During the forecasting process, the time series of earthquakes with magnitudes ≥ 7.0 is extrapolated iteratively, enabling simultaneous estimation of the potential occurrence time and intensity of future events. Therefore, the problem can be formulated within a unified framework that combines filtering and prediction.

The earthquake sequence

f (t_{i})

can be represented using parameters related to either the earthquake magnitude

M

or the seismic energy release

E

. In this study, the cube root of seismic energy,

E^{1 / 3}

, is adopted. Compared with

\log E

, the quantity

E^{1 / 3}

increases more rapidly with increasing magnitude, thereby enhancing the contrast between large and small earthquakes. As a result, using

E^{1 / 3}

as the intensity measure improves the signal-to-noise ratio by emphasizing large events while suppressing smaller ones. Accordingly, the earthquake sequence is defined as:

f (t_{i}) = E^{1 / 3} i = 1, 2, 3, \dots, n

(1)

\log E = 1.5 M + 11.8

(2)

where

n

is the total length of the earthquake time series, and

M

denotes the maximum magnitude occurring within each year.

The sequence is transformed into a zero-mean stochastic process

φ (t_{i})

as follows:

φ (t_{i}) = f (t_{i}) - \bar{f} (t) i = 1, 2, 3, \dots, n

(3)

where

\bar{f} (t)

is the mean value of the sequence

f (t_{i})

.

The process

φ (t)

can be decomposed into two components:

φ (t) = s (t) + n (t)

(4)

where

s (t)

represents the sequence of large earthquakes (signal), and

n (t)

corresponds to the sequence of moderate and small earthquakes (noise).

The objective is to determine a filtering prediction operator

P

such that, when

φ (t)

is input into the system, the output

y (t)

approximates the future value of the signal

s (t + t_{0})

, where

t_{0}

denotes the prediction horizon. The Wiener filtering-based prediction framework is illustrated in Figure 1. For the earthquake sequence, this can be expressed as:

P [φ (t)] = s (t + t_{0})

(5)

Thus, seismic activity trend forecasting can be formulated as the problem of determining an optimal operator

P

that minimizes the mean square error between the predicted output and the future signal. The input–output relationship of the system is given by:

s (t + t_{0}) = \int_{0}^{\infty} φ (t - τ) k_{t_{0}} (τ) d τ

(6)

where

k_{t_{0}} (τ)

is the impulse response function of the Wiener filtering system, also referred to as the kernel of the operator

P

.

The problem therefore reduces to solving for the kernel function

k_{t_{0}} (τ)

. Under the minimum mean square error criterion,

k_{t_{0}} (τ)

satisfies the following integral equation:

R_{φ s} (τ + t_{0}) = \int_{0}^{\infty} R_{φ φ} (τ - α) k_{t_{0}} (α) d α

(7)

where

R_{φ s} (τ + t_{0})

is the cross-correlation function between

φ (t)

and

s (t + t_{0})

, and

R_{φ φ} (τ)

is the autocorrelation function of

φ (t)

.

These correlation functions are defined as:

R_{φ s} (τ + t_{0}) = \lim_{T \to \infty} \frac{1}{2 T} \int_{- T}^{T} s (t + t_{0} + τ) φ (t) d t

(8)

R_{φ φ} (τ) = \lim_{T \to \infty} \frac{1}{2 T} \int_{- T}^{T} φ (t + τ) φ (t) d t

(9)

Since

φ (t)

and

s (t)

are known, the correlation functions

R_{φ s} (τ + t_{0})

and

R_{φ φ} (τ)

can be computed using Equations (8) and (9). Substituting the solution of Equation (7) into Equation (6) yields the predicted values of future seismic activity.

In this study, a multi-origin extrapolation strategy is employed to evaluate the effectiveness of the Wiener filtering method for seismic activity trend forecasting. During the extrapolation process, a portion of the earthquake time series is withheld as a validation dataset, and predictions are generated from multiple starting points. The predicted results are then compared with the corresponding observed data for validation. In addition, cross-comparison among predictions from different starting points is conducted to assess consistency and enhance the reliability of the results.

2.2. Principles of Artificial Neural Network-Based Seismic Activity Trend Forecasting

Driven by the dynamic motion of the mantle asthenosphere, tectonic plates undergo continuous movement and interaction. Tectonic earthquakes resulting from these plate dynamics account for more than 90% of global seismic activity. The inherent complexity of this geodynamic process introduces pronounced nonlinearity and significant uncertainty into earthquake occurrence. Seismic activity trend forecasting can be viewed as a process of identifying underlying patterns in earthquake sequences and extrapolating them into the future, which is conceptually analogous to the pattern recognition capability of artificial neural networks. As highly adaptive nonlinear dynamical systems, artificial neural networks can learn from large volumes of data to extract latent relationships and evolutionary patterns, enabling effective modeling and prediction of complex nonlinear processes. This aligns well with the objective of analyzing earthquake sequences to uncover intrinsic correlations and forecast future seismic activity trends. Therefore, artificial neural network-based methods provide a promising approach for seismic activity trend forecasting.

A multilayer feedforward neural network consists of an input layer, one or more hidden layers, and an output layer. The input layer receives external data, while the hidden layers, composed of interconnected neurons, perform nonlinear transformations through activation functions. The connections between neurons are characterized by weights, which are iteratively adjusted during training. Although multiple hidden layers can be employed, a single hidden layer is commonly adopted in practical applications due to its balance between model complexity and computational efficiency. The output layer produces the final prediction results, which are determined by the network architecture, activation functions, and connection weights.

The backpropagation neural network, one of the most widely used feedforward architectures, is trained through forward and backward propagation. In the forward propagation stage, input signals pass through the network to generate predictions. In the backward propagation stage, the prediction error is propagated backward to update the network weights, progressively reducing the error and improving model accuracy. In this study, a two-layer backpropagation neural network is adopted, and its architecture is illustrated in Figure 2.

The output of each neuron can be expressed as:

Z_{p s}^{r} = \frac{1}{1 + \exp (- \sum_{q = 0}^{N_{r - 1}} U_{p q}^{r} Z_{q s}^{r - 1})}

(10)

where

Z_{p s}^{r}

denotes the output of the p-th neuron in the r-th layer for sample

s

;

Z_{q s}^{r - 1}

represents the output of the q-th neuron in the (r − 1)-th layer (for

r - 1 = 0

, it corresponds to the input of sample

s

);

U_{p q}^{r}

is the connection weight between the q-th neuron in the (r − 1)-th layer and the p-th neuron in the r-th layer; and

N_{r - 1}

is the number of neurons in the (r − 1)-th layer.

The error

E

between the network output and the target output is defined as:

E = \frac{1}{2} \sum_{p = 1}^{N_{t}} {(d_{p s}^{t} - Z_{p s}^{t})}^{2}

(11)

where

d_{p s}^{t}

is the desired output of the p-th neuron in the output layer for sample

s

,

Z_{p s}^{t}

is the corresponding network output, and

N_{t}

is the number of neurons in the output layer.

The weights

U_{p q}^{r}

are updated using the gradient descent method to minimize the error function. The update rule is given by:

Δ U_{p q}^{r} = - α \frac{\partial E}{\partial U_{p q}^{r}}

(12)

where

α

is the learning rate, typically chosen within the range (0, 1).

By substituting Equation (10) into Equation (11), the weight update can be derived as:

Δ U_{p q}^{r} = α e_{p s}^{r} Z_{q s}^{r - 1}

(13)

where the propagated error term

e_{p s}^{r}

is defined as:

e_{p s}^{r} = \{\begin{matrix} (d_{p s}^{r} - Z_{p s}^{r}) Z_{p s}^{r} (1 - Z_{p s}^{r}) & for output layer \\ Z_{q s}^{r} (1 - Z_{q s}^{r}) (\sum e_{p s}^{r + 1} U_{p q}^{r + 1}) & for hidden layer \end{matrix}

(14)

To accelerate convergence, a momentum term is introduced into the weight update at the k-th iteration:

Δ U_{p q}^{r} (k) = α e_{p s}^{r} Z_{q s}^{r - 1} + η Δ U_{p q}^{r} (k - 1)

(15)

where

η

is the momentum coefficient, typically selected within the range 0.2–0.4.

During training, the network weights are initialized with nonzero random values within the range of ±1.0. The training samples, consisting of input–output pairs, are sequentially fed into the network, and the weights are updated iteratively after each sample. A full pass through all samples constitutes one training epoch. The training process continues until the loss function converges to a predefined tolerance or the maximum number of epochs is reached.

From a methodological perspective, the Wiener filtering and artificial neural network approaches can be interpreted within a unified predictive framework for seismic activity trend forecasting. The Wiener filtering method formulates the problem as an optimal linear estimation of stochastic processes under the minimum mean square error criterion, yielding a filtering operator that maps observed sequences to future signals. This framework is effective in capturing the global temporal evolution of seismic activity. In contrast, the artificial neural network approach treats earthquake forecasting as a nonlinear function approximation problem. Through backpropagation-based learning, the network establishes a mapping between input and output sequences, enabling the representation of nonlinear relationships and spatiotemporal dependencies in seismic activity. Accordingly, Wiener filtering can be regarded as a linear operator derived analytically, whereas the artificial neural network represents a nonlinear operator learned from data. This formulation provides a unified basis for integrating linear and nonlinear modeling approaches in seismic activity trend forecasting.

3. Seismic Activity Trend Forecasting for the North-South Seismic Belt

The North-South Seismic Belt is a major seismically active zone extending across mainland China, generally oriented in a north-south direction. It spans approximately 24–37° N latitude and 102–107° E longitude, extending from Yunnan Province through western Sichuan and eastern Gansu to Ningxia. This belt has hosted the majority of historical earthquakes with magnitudes ≥ 8.0 in mainland China, including the 1920 Haiyuan earthquake (M_w 8.5) and the 2008 Wenchuan earthquake (M_w 8.0). Its seismic activity is closely associated with major fault systems, including the Longmenshan, Haiyuan, Liupanshan, and Anninghe faults, among others. Characterized by frequent seismic activity and large-magnitude events, this region represents one of the most seismically hazardous tectonic belts in China. It has also been designated as a target region for earthquake forecast testing under the Collaboratory for the Study of Earthquake Predictability (CSEP) project [29].

In the early years, earthquake catalogs were manually compiled and documented in specialized volumes. The first modern seismological monitoring station in China was established in 1930. Since then, China’s seismic observation system has undergone rapid and substantial development. At present, the China Earthquake Administration has established a dense nationwide seismic monitoring network comprising 18,296 stations. Five key earthquake early-warning regions have been developed, including North China, the North–South Seismic Belt, the southeastern coastal area, the central Tianshan region of Xinjiang, and Lhasa in Tibet. These efforts have enabled intensified deployment in critical areas (with an average station spacing of approximately 13 km) and effective coverage in general regions (with an average spacing of approximately 47 km), thereby significantly enhancing the national capacity for earthquake monitoring and early warning. The large volume of observational data is systematically archived and shared through the National Earthquake Data Center.

The earthquake catalog covering approximately 530 years (1500–2026) shows that earthquakes with magnitudes ≥ 7.0 occurred in 22 years within the North-South Seismic Belt, including 5 years with events ≥ 8.0. In several cases, multiple large earthquakes occurred within the same year, indicating clustered seismic activity. Over the past century, seismic activity has exhibited a pattern of staged intensification, with 8 years recording earthquakes ≥ 7.0, including 2 years with events ≥ 8.0. In this study, the spatiotemporal evolution of annual maximum earthquake magnitudes in the North-South Seismic Belt from 1500 to 2026 is statistically analyzed. The spatial distribution of earthquakes with magnitudes ≥ 5.0 is presented in Figure 3, the temporal characteristics of earthquakes ≥ 6.0 are shown in Figure 4, and the catalog of earthquakes ≥ 7.0 is summarized in Table 1.

3.1. Wiener Filtering-Based Seismic Activity Trend Forecasting for the North-South Seismic Belt

In this study, seismic activity is modeled as a discrete-time stochastic process based on historical earthquake records, with an annual time step. The cube root of seismic energy, derived from earthquake magnitude, is adopted as an amplitude indicator to characterize earthquake intensity, thereby constructing the seismic activity time series. The sequence of earthquakes with magnitudes ≥ 6.0 is defined as the input signal, while the sequence of earthquakes with magnitudes ≥ 7.0 is treated as the output response. Wiener filtering is then employed to establish the optimal linear relationship between input and output under the minimum mean square error criterion. In this study, earthquake catalogs comprising events with magnitudes ≥ 6.0 are employed to forecast the timing and spatial distribution of earthquakes with magnitudes ≥ 7.0. This approach inherently targets the prediction of events of magnitude ≥ 7.0, which pose significant threats to human life and property. Early identification of seismic hazards, coupled with appropriate mitigation measures, can effectively reduce earthquake-induced risks and associated losses.

To capture the time-varying characteristics of seismic activity and perform forecasting, a multi-origin extrapolation strategy is adopted. Multiple starting points are selected to construct prediction models based on historical data, enabling the analysis of seismic activity trends in the North-South Seismic Belt. During the extrapolation process, earthquake records following each starting point are truncated and used as independent validation samples. Model performance is evaluated by comparing predicted results with corresponding observations. In addition, consistency and discrepancies among predictions from different starting points are analyzed to assess model robustness across multiple temporal scales, thereby improving the reliability of seismic activity trend forecasting.

3.1.1. Results of Wiener Filtering-Based Seismic Activity Trend Forecasting

The years 1920, 1933, 1955, 1970, 1980, and 1990 are selected as extrapolation starting points. The Wiener filtering method is then applied to predict the time series of potential seismic activity in the North-South Seismic Belt over the next century. The results are presented in Figure 5, where the peaks represent the predicted occurrence times of earthquakes with magnitudes ≥ 7.0, and the dashed lines denote the corresponding observed events.

A comparative analysis of the results obtained from different starting points shows that the predicted time series consistently exhibit pronounced peaks or peak clusters within similar time intervals, indicating strong robustness of the model under varying historical data conditions. By grouping temporally overlapping or adjacent peaks, six potential centennial-scale active periods of large earthquakes are identified. Specifically:

A peak cluster is observed during 1961–1966, followed by a sequence of strong earthquakes, including the 1970 M_w 7.8 Tonghai earthquake, the 1974 M_w 7.1 Daguan earthquake, and the 1976 M_w 7.2 Songpan–Pingwu earthquake. This suggests that the model effectively captures subsequent phases of intensified seismic activity.
A distinct peak appears around 2007, corresponding closely to the 2008 M_w 8.0 Wenchuan earthquake, indicating a high degree of temporal agreement.
A peak cluster occurs during 2011–2013, consistent with the 2013 M_w 7.0 Ya’an earthquake.
A peak around 2020 corresponds to the 2017 M_w 7.0 Jiuzhaigou earthquake. In addition, the 2022 M_w 6.9 Menyuan earthquake (37.8° N, 101.3° E) suggests that this period remains seismically active.
Two future peak clusters are identified during 2054–2057 and 2066–2067, representing potential periods of intensified seismic activity that warrant further attention.

Recent destructive earthquakes, including the 2008 Wenchuan, 2013 Ya’an, and 2017 Jiuzhaigou events, are all located near the predicted peaks or peak clusters. The deviation between predicted and observed occurrence times of earthquakes with magnitudes ≥ 7.0 is generally within approximately three years, with a root mean square error (RMSE) of 2.3 years and a coefficient of determination (R²) of 0.987, demonstrating that the proposed method possesses reasonable capability in the temporal prediction of strong earthquakes. Furthermore, by incorporating newly available earthquake data and performing multi-origin cross-validation, the model exhibits good stability and consistency in identifying major active periods. Nevertheless, due to the inherently nonlinear and stochastic nature of seismic processes, the results are more suitable for medium- to long-term trend identification rather than precise time prediction.

For spatial analysis, the distribution of seismic activity in the North-South Seismic Belt is simplified into a latitude–time series, and Wiener filtering combined with multi-origin extrapolation is applied to predict potential earthquake locations. The results show that the predicted latitudes are generally inconsistent with observed earthquake locations, with significant deviations in certain cases and overall unstable performance. Even with the incorporation of additional data, no substantial improvement is achieved, indicating that the Wiener filtering method has inherent limitations in spatial prediction. Therefore, alternative approaches are required to effectively capture the spatial evolution of seismic activity in seismic zonation studies.

3.1.2. Sensitivity Analysis of Wiener Filtering-Based Predictions

This section investigates the sensitivity of seismic activity time prediction to the length of the time series and the selection of threshold magnitude, with the aim of systematically evaluating the applicability and reliability of the Wiener filtering method.

In general, longer earthquake catalogs provide more reliable and stable estimates of the autocorrelation characteristics of seismic activity. However, as the time span increases, historical records become increasingly incomplete due to limitations in documentation, which may introduce uncertainty into the analysis. Therefore, a robust forecasting method should exhibit limited sensitivity to the length of the input time series. To evaluate this effect, earthquake sequences with magnitudes ≥ 6.0 over three time spans (1500–2016, 1622–2016, and 1713–2016) are considered. Extrapolations are performed from the years 1920, 1933, 1955, 1970, 1980, and 1990 to predict the occurrence times of earthquakes with magnitudes ≥ 7.0 over the next century. Taking the results from the 1990 starting point as an example, a comparison of predictions based on different time series lengths is shown in Figure 6.

The results indicate that extrapolations based on time series of approximately 500, 400, and 300 years all exhibit several prominent peaks, primarily concentrated in the periods 1961–1966, 2007, 2011–2013, 2020, 2054–2057, and 2066–2067. However, the number and distribution of these peaks show a clear dependence on the length of the time series. Specifically, the number of peaks obtained from extrapolation based on the ~500-year time series is smaller than that derived from the ~400-year and ~300-year time series. This suggests that shorter time series tend to produce more dispersed and fluctuating peaks, whereas longer time series can effectively smooth random disturbances and better capture the overall evolutionary characteristics of seismic activity. As the length of the time series increases, the prediction results gradually converge and exhibit improved consistency, indicating reduced sensitivity to uncertainty. Therefore, longer time series (e.g., ~500 years) provide greater reliability for long-term seismic activity trend forecasting.

In addition to time series length, the choice of threshold magnitude also influences prediction results. Early earthquake catalogs often contain incomplete records of moderate and small events; therefore, different threshold magnitudes may introduce varying levels of uncertainty. To assess this effect, earthquake data since 1500 are analyzed using three magnitude thresholds (M ≥ 6.0, ≥5.5, and ≥5.0), and extrapolations are performed from the same set of starting years (i.e., 1920, 1933, 1955, 1970, 1980, and 1990). A comparison of results from the 1990 starting point is presented in Figure 7.

The results show that the predicted occurrence times of earthquakes with magnitudes ≥ 7.0 are generally consistent across different threshold magnitudes, particularly for the periods 1961–1966, 2007, 2011–2013, and 2020, which correspond well with the occurrence of observed strong earthquakes in 1970, 1974, 1976, 2008, 2013, and 2017. However, as the threshold magnitude increases, the predictions become more stable, characterized by fewer peaks and more concentrated temporal distributions. This can be attributed to the higher completeness and reliability of large-magnitude earthquake records, whereas smaller events are more prone to omissions. Consequently, higher threshold magnitudes (e.g., M ≥ 6.0) yield more robust and reliable predictions.

In summary, both the length of the time series and the selection of threshold magnitude have a significant impact on prediction stability. Longer time series and higher threshold magnitudes effectively reduce the influence of random disturbances and improve model robustness. Under these conditions, the Wiener filtering method demonstrates good applicability for seismic activity trend forecasting and provides a reliable basis for regional strong-earthquake hazard analysis.

3.2. Artificial Neural Network-Based Seismic Activity Trend Forecasting for the North-South Seismic Belt

Reliable forecasting of seismic activity trends requires effective management of the inherent complexity of earthquake prediction, for which artificial intelligence provides powerful tools for data-driven analysis and decision support. In recent years, rapid advances in artificial neural network techniques have helped overcome the limitations of traditional knowledge-based expert systems, particularly in terms of knowledge acquisition and representation. In this context, artificial neural networks offer a promising approach for learning latent patterns embedded in historical earthquake catalogs and capturing the nonlinear evolution of seismic activity. Accordingly, this study employs artificial neural network-based methods to investigate the applicability and reliability of seismic activity trend forecasting in the North-South Seismic Belt.

In artificial neural networks, multiple hidden layers can be employed to approximate complex nonlinear functions. Although increasing the network depth theoretically enhances its representation capability, excessively deep architectures may lead to overfitting, increase training difficulty, and hinder convergence, particularly when the available dataset is limited. Therefore, a two-layer artificial neural network is adopted to model and characterize the fluctuation patterns of seismic activity. The number of neurons in the input layer also plays a critical role in model performance. An insufficient number of input neurons may result in information loss, preventing the network from fully capturing the underlying features of the input data. Conversely, an excessive number of input neurons may introduce redundant information and increase computational complexity. Hence, an appropriate selection of input layer size is essential for improving training efficiency and prediction accuracy. In this study, multiple network architectures, including 16–8–4 and 12–8–4, were systematically evaluated for their performance in seismic activity trend forecasting. When the number of input neurons is set to 16, the network incorporates information from nearly 200 years of earthquake catalog data, thereby effectively capturing the underlying patterns of seismic activity. Accordingly, the network employs a 16–8–4 architecture, with 16, 8, and 4 neurons in the input, hidden, and output layers, respectively. A schematic illustration of the network architecture is presented in Figure 8.

3.2.1. Training and Validation of the Artificial Neural Network Model

To provide an initial assessment of the learning and extrapolation capabilities of the artificial neural network, a sinusoidal function y = sin(πt/6) with a period of T = 12 s is adopted as a benchmark signal. Since the inputs to the artificial neural network are required to lie within the range of 0–1, the function values are normalized using the transformation (y + 1)/2, and a corresponding time series is constructed with a time step of 1 s. It should be noted that this test is not intended to replicate the characteristics of real seismic time series, which are inherently non-stationary, non-periodic, and sparse. Instead, the sinusoidal function serves as a simplified and controlled example to verify that the network can effectively capture general temporal patterns and achieve stable convergence under idealized conditions. More importantly, the effectiveness of the proposed method for seismic activity trend forecasting is primarily evaluated using real earthquake catalog data from the North–South Seismic Belt. Through retrospective validation and multi-origin extrapolation, the model performance is assessed under realistic conditions, which better reflect the non-stationary and complex nature of seismic activity.

The training samples are generated using a sliding time window approach. Specifically, a window length of 4 s is adopted, with four consecutive windows (16 s) used as the input and the subsequent window (4 s) as the output, forming one training sample. The first sample is constructed starting from t = 0, and subsequent samples are generated by sliding the window forward with a step size of 1 s. Based on this dataset, a 16–8–4 artificial neural network is employed to learn the temporal characteristics of the sinusoidal sequence. Extrapolation is then performed under two scenarios: (i) preserving the original sample order and (ii) randomly shuffling the sample order. The validation procedure is illustrated in Figure 9, and a comparison between predicted and true values is shown in Figure 10.

The results show that, under both sample processing strategies, the extrapolation error is ≤0.03, demonstrating the high predictive accuracy of the artificial neural network model. With sufficient training, the network effectively captures the intrinsic temporal features of the time series, enabling stable extrapolation and accurate prediction. These findings provide a reliable methodological basis for subsequent seismic activity trend forecasting.

To further evaluate the general applicability of the proposed framework, earthquake data from other tectonically active regions were additionally employed for training and validation of the artificial neural network model. The Japan subduction zone, located at the convergent boundary of the Pacific Plate, the Philippine Sea Plate, and the Eurasian Plate, represents one of the most tectonically complex and seismically active regions worldwide. Owing to its intense plate interactions, frequent occurrence of large-magnitude earthquakes, and relatively complete historical earthquake catalog, the region provides an important natural laboratory for investigating seismic activity trend forecasting. In this study, the historical earthquake catalog covering the period from 1498 to 1987 was used to forecast the seismic activity trend of the Japan subduction zone. The prediction results indicate that the probability of earthquakes with magnitudes greater than 7.0 occurring during the period 1988–2017 reached approximately 0.975, implying a persistently elevated level of seismic hazard in the region during this interval. Subsequent seismic events exhibit a certain degree of consistency with the forecast results. The 2011 Tōhoku earthquake (M_w 9.0) triggered the Fukushima Daiichi nuclear disaster and caused catastrophic social and economic consequences. According to official statistics, the event resulted in 19,533 fatalities and direct economic losses estimated at 16–25 trillion Japanese yen. In addition, the 1995 Kobe earthquake (M_w 7.3) caused 6434 deaths, destroyed approximately 108,000 buildings, and generated direct economic losses approaching 100 billion USD. Furthermore, earthquakes with magnitudes exceeding 8.0 also occurred in 1994 and 2003, further reflecting the persistently high seismic activity of the region.

3.2.2. Results of Artificial Neural Network-Based Forecasting

In this study, a 16–8–4 artificial neural network is developed to learn from historical earthquake data in the North-South Seismic Belt since 1500 and to extract the underlying evolutionary patterns of seismic activity. Based on this model, spatiotemporal forecasting of seismic activity trends is conducted.

To construct input features for the artificial neural network, a sliding time window approach is adopted to transform the raw earthquake catalog into structured time-series data. Considering the sparsity and temporal variability of seismic events, a window length of 30 years is selected to ensure sufficient statistical representation within each interval. The window is advanced with a step size of 10 years, thereby generating overlapping segments that preserve temporal continuity while increasing the number of training samples. Within each time window, the maximum earthquake magnitude is extracted as the representative feature to characterize the intensity of seismic activity. This choice is motivated by the fact that large-magnitude events dominate seismic hazard and are more relevant for trend forecasting. In cases where no earthquake occurs within a given window, a constant value (e.g., M = 4.0) is assigned to maintain the continuity of the time series and avoid missing data.

To ensure numerical stability and facilitate network training, all input features are normalized into the range of [0, 1] using a Min–Max scaling scheme. For magnitude-based features, the normalization is defined as (M − 4)/10. The adoption of Min–Max normalization is motivated by its ability to preserve the relative distribution of the data while ensuring compatibility with the activation functions used in the neural network. Compared with Z-score standardization, this approach avoids amplifying the influence of extreme values, which is particularly important given the non-stationary and sparse nature of seismic data.

Based on the constructed time series, artificial neural network training samples are generated using a sliding window of length 4, where four consecutive windows (16 data points) are used as input and the subsequent window (4 data points) is used as output. Samples are generated sequentially with a step size of 1. This configuration allows the model to incorporate information spanning nearly 200 years of seismic activity, thereby capturing long-term temporal dependencies and underlying evolutionary patterns.

To evaluate the reliability and robustness of the proposed method, a retrospective validation strategy is adopted. The earthquake catalog is truncated at multiple years (1990, 1980, 1970, 1960, and 1950), and only data prior to each truncation point are used for training. Predictions are then generated for the truncated periods and compared with observed data. Model performance across different temporal scales is assessed through cross-comparison of these results.

The learning rate α controls the magnitude of parameter updates during the training process. An appropriate learning rate can improve training efficiency while ensuring model convergence. In contrast, excessively large or small learning rates may lead to degraded model performance or unstable training. Typically, α is selected within the range of 0 to 1. In this study, a sensitivity analysis of the learning rate was conducted, and, in combination with commonly adopted values, α = 0.2 and α = 0.3 were ultimately selected to comparatively analyze the seismic activity trend forecasting results for the North–South Seismic Belt. The comparison between predicted and observed time series is presented in Table 2, and the prediction results with 2000 as the starting point are shown in Table 3.

The results indicate that, for learning rates of 0.2 and 0.3, the maximum deviation between predicted and observed values is approximately 0.5, demonstrating a reasonable level of accuracy in temporal prediction. To reduce the influence of randomness arising from weight initialization and iterative optimization, multiple independent prediction runs are averaged to improve result stability. For a forecasting starting point of 2000, the predicted values are 7.7 and 7.9 for learning rates of 0.2 and 0.3, respectively. Although a M_w 7.0 earthquake occurred in Jiuzhaigou in 2017, combined analysis with Wiener filtering results suggests that earthquakes with magnitudes ≥ 7.0 may still occur in the region in the coming years. It should be noted, however, that these results are derived from statistical modeling and should be interpreted as indicative of long-term trends rather than precise predictions.

For spatial prediction, the latitudes of earthquakes with magnitudes ≥ 6.0 are organized into a time series, and artificial neural network-based forecasting is performed using the same sliding window strategy. A 30-year window is adopted, with the latitude corresponding to the maximum magnitude selected as the feature value. If multiple candidates exist, the value is determined based on the north-south migration pattern; if no event occurs, the value is set to 0. Latitude values are normalized by dividing by 100. The time series is constructed using a 5-year step size starting from 1500. Training samples are generated in the same manner as for temporal prediction, and retrospective validation is conducted using multiple truncation periods. The comparison between predicted and observed latitude sequences is presented in Table 4, and the prediction results for the 2000 starting point are shown in Table 5.

The results presented in Table 4 and Table 5 indicate that certain differences exist between the predicted and observed values under different forecasting starting points. Nevertheless, the prediction errors remain within an acceptable range. The mean error between the predicted and observed latitudes is 0.79°, with a coefficient of determination of 0.913. For the 1990 starting point, the predicted latitudes are 29.4° and 29.5° for learning rates of 0.2 and 0.3, respectively, which show reasonable agreement with the 2008 Wenchuan earthquake (31.0° N, 103.4° E) and the 2013 Ya’an earthquake (30.3° N, 103.0° E). Although the extrapolation in this study is based on the latitude sequence corresponding to the maximum magnitude within each sliding time window, the above events exhibit a certain degree of consistency with the predicted results in terms of spatial distribution. This suggests that the proposed method has a certain capability in capturing the spatial migration characteristics of seismic activity.

Overall, the artificial neural network model effectively characterizes seismic activity trends in the North-South Seismic Belt and improves the stability of spatial prediction, partially compensating for the limitations of the Wiener filtering method. The results further indicate that earthquakes with magnitudes ≥ 7.0 are more likely to occur within the latitude range of 30.5–33.0° N in the near future, with an estimated probability of 0.957. It should be emphasized that the available earthquake catalog spans only slightly more than 500 years, resulting in limited training samples. In addition, the validation period remains relatively short, constraining the assessment of model robustness. Therefore, this study constitutes a preliminary exploration of artificial neural network-based spatial prediction of seismic activity trends, and the results are still subject to inherent uncertainties.

4. Conclusions

Based on the earthquake catalog of the North–South Seismic Belt spanning 1500–2026, this study systematically investigated the temporal and spatial characteristics of seismic activity trends using Wiener filtering and artificial neural network approaches. The results indicate that seismic activity in the North–South Seismic Belt exhibits pronounced stage-wise evolution, clustering behavior, and multi-scale temporal variability, reflecting the complex tectonic background and active fault systems of the region. Historical earthquake records demonstrate that strong earthquakes are not randomly distributed in time, but instead tend to occur during concentrated active periods, highlighting the intrinsic temporal clustering characteristics of regional seismicity. Multi-origin extrapolation using Wiener filtering consistently identified several major active periods of strong earthquakes, including 1961–1966, 2007, 2011–2013, and 2020, as well as potential future active periods during 2054–2057 and 2066–2067. Retrospective validation showed that the deviation between predicted and observed occurrence times of earthquakes with magnitudes ≥ 7.0 is generally within approximately three years, with a root mean square error of 2.3 years and a coefficient of determination (R²) of 0.987, indicating reasonable capability in temporal trend prediction. Sensitivity analyses further demonstrated that longer time series (~500 years) and higher threshold magnitudes (≥6.0) contribute to improved model stability and consistency. These findings provide important insights into the long-term evolution patterns and clustering characteristics of seismicity in the North–South Seismic Belt. Nevertheless, owing to the inherently nonlinear, stochastic, and non-stationary nature of seismic processes, the proposed framework is more suitable for medium- to long-term trend identification rather than deterministic prediction of earthquake occurrence time and location. Moreover, the Wiener filtering approach exhibits limited capability in characterizing spatial variability, highlighting the necessity of incorporating complementary nonlinear approaches.

The artificial neural network model demonstrated good capability in capturing nonlinear temporal and spatial patterns of seismic activity. The prediction results remained within an acceptable range, with a mean latitude prediction error of 0.79° and an R² value of 0.913, indicating reasonable agreement between predicted and observed spatial distributions. Compared with Wiener filtering, the artificial neural network model exhibited superior performance in spatial prediction and showed enhanced capability in characterizing the migration behavior of seismic activity. The results further suggest that strong earthquakes (M ≥ 7.0) are more likely to occur within the latitude range of 30.5–33.0° N in the near future, with an estimated probability of 0.957. This finding is generally consistent with the historically active seismic segments of the North–South Seismic Belt and further reflects the non-uniform spatial distribution and migration characteristics of regional seismicity. Consequently, the proposed framework may provide useful insights into the seismotectonic characteristics and potential seismic hazard of the studied region. However, due to the limited length of the available earthquake catalog and the inherent sparsity of large-magnitude seismic events, uncertainties remain in the spatial prediction results. Future studies should therefore incorporate longer earthquake catalogs, additional geophysical information, and more advanced benchmark models to further improve forecasting robustness and reliability.

The comparative analyses indicate that Wiener filtering performs well in characterizing the global temporal evolution of seismic activity and achieves relatively high accuracy in predicting the occurrence time of strong earthquakes. However, its capability in spatial prediction remains limited because it cannot effectively characterize the nonlinear migration characteristics of seismic activity. In contrast, the artificial neural network model exhibits stronger capability in capturing nonlinear relationships and spatial distribution patterns, particularly with respect to the migration behavior of seismic activity, although its temporal prediction performance is relatively less stable than that of Wiener filtering. By integrating the complementary strengths of the two approaches, the proposed hybrid framework demonstrates improved performance in both temporal and spatial prediction tasks. These findings highlight the potential advantages of combining linear and nonlinear modeling strategies and suggest that hybrid forecasting frameworks may provide a more robust and reliable approach for seismic activity trend forecasting. Nevertheless, more systematic quantitative comparisons, particularly through formal ablation studies and benchmarking analyses against representative statistical and deep learning models, are still required and will be considered in future work.

Author Contributions

Conceptualization, P.R. and C.W.; methodology, P.R.; software, P.L.; formal analysis, P.L., X.C., T.G. and C.W.; investigation, P.R., X.C. and C.W.; writing—original draft preparation, P.R.; writing—review and editing, P.R., P.L., X.C., T.G., X.S., C.W. and K.Y.; supervision, C.W.; funding acquisition, P.R., C.W. and K.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant Nos. 52408547 and 52308510), the Shandong Provincial Natural Science Foundation (Grant Nos. ZR2024QE361 and ZR2024QE157), and the Shandong Provincial Higher Education Institution Youth Innovation Team Program (Grant No. 2025KJH147).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors gratefully acknowledge the financial support from the National Natural Science Foundation of China, the Shandong Provincial Natural Science Foundation, and the Shandong Provincial Higher Education Institution Youth Innovation Team Program. The authors also sincerely thank the reviewers for their constructive comments and insightful suggestions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Hall, S. What Turkey’s earthquake tells us about the science of seismic forecasting. Nature 2023, 615, 388–389. [Google Scholar] [CrossRef] [PubMed]
Stirling, M.W. Earthquake hazard maps and objective testing: The hazard mapper’s point of view. Seismol. Res. Lett. 2012, 83, 231–232. [Google Scholar] [CrossRef]
Wyss, M.; Nekrasova, A.; Kossobokov, V. Errors in expected human losses due to incorrect seismic hazard estimates. Nat. Hazards 2012, 62, 927–935. [Google Scholar] [CrossRef]
Stein, S.; Geller, R.J.; Liu, M. Why earthquake hazard maps often fail and what to do about it. Tectonophysics 2012, 562–563, 1–25. [Google Scholar] [CrossRef]
Zheng, T.Y.; Li, Y.; Hou, J.S.; Mi, H.L. Review on earthquake disaster loss in Chinese mainland in 2008. J. Catastrophol. 2010, 25, 112–118. [Google Scholar]
Beroza, G.C.; Segou, M.; Mousavi, S.M. Machine learning and earthquake forecasting—Next steps. Nat. Commun. 2021, 12, 4761. [Google Scholar] [CrossRef] [PubMed]
Shcherbakov, R.; Zhuang, J.C.; Zöller, G.; Ogata, Y. Forecasting the magnitude of the largest expected earthquake. Nat. Commun. 2019, 10, 4051. [Google Scholar] [CrossRef] [PubMed]
Ebrahimian, H.; Jalayer, F.; Asayesh, B.M.; Hainzl, S.; Zafarani, H. Improvements to seismicity forecasting based on a Bayesian spatio-temporal ETAS model. Sci. Rep. 2022, 12, 20970. [Google Scholar] [CrossRef]
Ebrahimian, H.; Jalayer, F. Robust seismicity forecasting based on Bayesian parameter estimation for epidemiological spatiotemporal aftershock clustering models. Sci. Rep. 2017, 7, 9803. [Google Scholar] [CrossRef]
Wu, G.; Zhou, Q.; Ran, H.L.; Yuan, R.M. Long-term probabilistic forecast for M ≥ 5.0 earthquakes in the eastern Tibetan plateau from adaptively smoothed seismicity. Bull. Seismol. Soc. Am. 2019, 109, 1110–1124. [Google Scholar] [CrossRef]
Khoshnevis, N.; Taborda, R.; Azizzadeh-Roodpish, S.; Cramer, C.H. Seismic hazard estimation of northern Iran using smoothed seismicity. J. Seismol. 2017, 21, 941–964. [Google Scholar] [CrossRef]
Berhich, A.; Belouadha, F.Z.; Kabbaj, M.I. An attention-based LSTM network for large earthquake prediction. Soil Dyn. Earthq. Eng. 2023, 165, 107663. [Google Scholar] [CrossRef]
Asim, K.M.; Martınez-Alvarez, F.; Basit, A.; Iqbal, T. Earthquake magnitude prediction in Hindukush region using machine learning techniques. Nat. Hazards 2017, 85, 471–486. [Google Scholar] [CrossRef]
Yousefzadeh, M.; Hosseini, S.A.; Farnaghi, M. Spatiotemporally explicit earthquake prediction using deep neural network. Soil Dyn. Earthq. Eng. 2021, 144, 106663. [Google Scholar] [CrossRef]
Mousavi, S.M.; Beroza, G.C. Machine learning in earthquake seismology. Annu. Rev. Earth Planet. Sci. 2023, 51, 105–129. [Google Scholar] [CrossRef]
Zhang, M.; Liu, M.; Feng, T.; Wang, R.J.; Zhu, W.Q. LOC-FLOW: An end-to-end machine learning-based high-precision earthquake location workflow. Seismol. Res. Lett. 2022, 93, 2426–2438. [Google Scholar] [CrossRef]
Mousavi, S.M.; Ellsworth, W.L.; Zhu, W.Q.; Chuang, L.Y.; Beroza, G.C. Earthquake transformer—An attentive deep-learning model for simultaneous earthquake detection and phase picking. Nat. Commun. 2020, 11, 3952. [Google Scholar] [CrossRef]
Mousavi, S.M.; Beroza, G.C. Deep-learning seismology. Science 2022, 377, eabm4470. [Google Scholar] [CrossRef]
Dascher-Cousineau, K.; Shchur, O.; Brodsky, E.E.; Günnemann, S. Using deep learning for flexible and scalable earthquake forecasting. Geophys. Res. Lett. 2023, 50, e2023GL103909. [Google Scholar] [CrossRef]
Zhang, Y.; Zhan, C.X.; Huang, Q.H.; Sornette, D. Forecasting future earthquakes with deep neural networks: Application to California. Geophys. J. Int. 2025, 240, 81–95. [Google Scholar] [CrossRef]
Stockman, S.; Lawson, D.J.; Werner, M.J. Forecasting the 2016–2017 Central Apennines earthquake sequence with a neural point process. Earth Future 2023, 11, e2023EF003777. [Google Scholar] [CrossRef]
Liu, J.; Zhang, T.; Gao, C.L.; Wang, P.X. Forecasting earthquake magnitude and epicenter by incorporating spatiotemporal priors into deep neural networks. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5911413. [Google Scholar] [CrossRef]
Asaly, S.; Gottlieb, L.A.; Inbar, N.; Reuveni, Y. Using support vector machine (SVM) with GPS ionospheric TEC estimations to potentially predict earthquake events. Remote Sens. 2022, 14, 2822. [Google Scholar] [CrossRef]
Essam, Y.; Kumar, P.; Ahmed, A.N.; Murti, M.A.; El-Shafie, A. Exploring the reliability of different artificial intelligence techniques in predicting earthquake for Malaysia. Soil Dyn. Earthq. Eng. 2021, 147, 106826. [Google Scholar] [CrossRef]
Shao, Z.G.; Yan, R.; Wang, W.X.; Liu, Q.; Meng, L.Y.; Pan, Z.Y.; Wang, Z.Y.; Yan, W.; Yue, C. Strategic plan for earthquake forecasting in China (2025−2035): A brief summary. Earthq. Sci. 2026, 39, 214–219. [Google Scholar] [CrossRef]
Yu, H.Z.; Yan, R.; Deng, S.G.; Liu, J.; Xue, Y.; Li, G.; Xie, M.Y.; Ma, Y.C.; Zhang, X.T.; Ma, Y.W.; et al. Improved medium-to-short-term earthquake predictions in China in 2022. Geomat. Nat. Hazards Risk 2024, 15, 2350482. [Google Scholar] [CrossRef]
Wiener, N. Extrapolation, Interpolation, and Smoothing of Stationary Time Series, with Engineering Applications; The M.I.T. Press: Cambridge, MA, USA, 1949. [Google Scholar] [CrossRef]
Fu, Y.F.; Zhang, Z.W.; Yang, X.Y.; Yuan, S. Prediction of large-scale earthquakes using precursor earthquakes: A regression study. Int. J. Adv. Appl. Sci. 2026, 13, 21–32. [Google Scholar] [CrossRef]
Mignan, A.; Jiang, C.; Zechar, J.D.; Wiemer, S.; Wu, Z.; Huang, Z. Completeness of the mainland China earthquake catalog and implications for the setup of the China earthquake forecast testing center. Bull. Seismol. Soc. Am. 2013, 103, 845–859. [Google Scholar] [CrossRef]

Figure 1. Schematic of the Wiener filtering-based forecasting framework.

Figure 2. Architecture of the two-layer backpropagation neural network.

Figure 3. Spatial distribution of earthquakes with magnitudes ≥ 5.0 in the North-South Seismic Belt (1500–2026).

Figure 4. Time series of earthquakes with magnitudes ≥ 6.0 in the North-South Seismic Belt (1500–2026).

Figure 5. Predicted occurrence times of earthquakes with magnitudes ≥ 7.0 in the North-South Seismic Belt.

Figure 6. Effect of time series length on prediction results: (a) 1500–2016; (b) 1622–2016; (c) 1713–2016.

Figure 7. Effect of threshold magnitude on prediction results: (a) M ≥ 6.0; (b) M ≥ 5.5; (c) M ≥ 5.0.

Figure 8. Schematic of the artificial neural network architecture used in this study.

Figure 9. Schematic of artificial neural network validation using a sinusoidal function.

Figure 10. Comparison of predicted and true values: (a) ordered samples; (b) shuffled samples.

Table 1. Catalog of earthquakes with magnitudes ≥ 7.0 in the North-South Seismic Belt (1500–2026).

No.	Event Date	Latitude/°	Longitude/°	Magnitude	Epicenter Location
1	13 January 1500	24.9	103.1	7.0	Yiliang, Yunnan
2	29 March 1536	28.1	102.2	7.5	North of Xichang, Sichuan
3	09 August 1588	24.0	102.8	7.0	Jianshui, Yunnan
4	25 October 1622	36.5	106.3	7.0	North of Guyuan, Ningxia
5	21 July 1654	34.3	105.5	8.0	South of Tianshui, Gansu
6	04 September 1713	32.0	103.7	7.0	Maoxian, Sichuan
7	19 June 1718	35.0	105.2	7.5	South of Tongwei, Gansu
8	02 August 1733	26.3	103.1	7.75	Dongchuan, Yunnan
9	01 June 1786	29.9	102.0	7.75	South of Kangding, Sichuan
10	07 June 1789	31.0	102.9	7.0	Huaning, Yunnan
11	06 September 1833	25.0	103.0	8.0	Songming, Yunnan
12	12 September 1850	27.7	102.4	7.5	Xichang, Sichuan
13	01 July 1879	33.2	104.7	8.0	South of Wudu, Gansu
14	21 December 1913	24.2	102.5	7.0	Eshan, Yunnan
15	16 December 1920	36.7	104.9	8.5	Haiyuan, Ningxia
16	25 August 1933	31.9	103.4	7.5	Maoxian, Sichuan
17	05 January 1970	24.2	102.7	7.8	Tonghai, Yunnan
18	11 May 1974	28.2	104.1	7.1	North of Daguan, Yunnan
19	23 August 1976	32.5	104.3	7.2	Between Songpan and Pingwu, Sichuan
20	12 May 2008	31.0	103.4	8.0	Wenchuan, Sichuan
21	20 April 2013	30.3	103.0	7.0	Ya’an, Sichuan
22	08 August 2017	33.2	103.8	7.0	Jiuzhaigou, Sichuan

Table 2. Comparison between predicted time series and observed earthquake occurrences.

Learning Rate	Forecast Starting Point	Predicted Value (Run 1)	Predicted Value (Run 2)	Predicted Value (Run 3)	Predicted Value (Run 4)	Predicted Value (Run 5)	Mean Prediction	Observed Value	Prediction Error
α = 0.2	1990	8.5	8.3	8.4	8.4	8.3	8.4	8.0	0.4
	1980	7.8	8.0	7.5	7.5	7.6	7.7	8.0	−0.3
	1970	7.4	7.1	7.3	7.9	7.7	7.5	7.8	−0.3
	1960	7.4	7.5	7.5	7.3	7.6	7.5	7.8	−0.3
	1950	8.0	8.6	8.1	8.5	8.1	8.3	7.8	0.5
α = 0.3	1990	7.8	8.6	8.4	8.7	8.1	8.3	8.0	0.3
	1980	8.2	8.4	8.0	8.3	7.9	8.2	8.0	0.2
	1970	8.1	8.4	8.9	8.1	8.1	8.3	7.8	0.5
	1960	8.6	8.1	8.3	7.8	8.3	8.2	7.8	0.4
	1950	8.0	8.2	8.2	8.7	8.0	8.2	7.8	0.4

Table 3. Time series prediction results.

Learning Rate	Forecast Starting Point	Predicted Value (Run 1)	Predicted Value (Run 2)	Predicted Value (Run 3)	Predicted Value (Run 4)	Predicted Value (Run 5)	Mean Prediction
α = 0.2	2000	7.7	7.7	7.7	7.7	7.7	7.7
α = 0.3	2000	7.9	8.0	7.7	8.0	8.0	7.9

Table 4. Comparison between predicted latitude series and observed earthquake locations.

Learning Rate	Forecast Starting Point	Predicted Value (Run 1)	Predicted Value (Run 2)	Predicted Value (Run 3)	Predicted Value (Run 4)	Predicted Value (Run 5)	Mean Prediction	Observed Value	Prediction Error
α = 0.2	1990	29.3	29.6	30.3	29.3	28.3	29.4	31.0	−1.6
	1980	29.9	30.3	30.0	29.4	29.5	29.8	31.0	−1.2
	1970	24.5	24.0	24.8	23.9	23.6	24.2	24.2	0.0
	1960	23.8	25.4	24.7	25.4	24.5	24.8	24.2	0.6
	1950	24.1	24.0	24.0	25.0	24.3	24.3	24.2	0.1
α = 0.3	1990	29.6	29.5	29.6	29.1	29.6	29.5	31.0	−1.5
	1980	31.8	31.4	31.0	31.1	31.6	31.4	31.0	0.4
	1970	24.8	24.2	24.4	24.6	24.5	24.5	24.2	0.3
	1960	25.3	25.6	25.8	25.8	26.0	25.7	24.2	1.5
	1950	24.6	24.3	25.4	25.5	24.8	24.9	24.2	0.7

Table 5. Latitude series prediction results.

Learning Rate	Forecast Starting Point	Predicted Value (Run 1)	Predicted Value (Run 2)	Predicted Value (Run 3)	Predicted Value (Run 4)	Predicted Value (Run 5)	Mean Prediction
α = 0.2	2000	31.1	31.1	31.0	30.8	30.9	31.0
α = 0.3	2000	32.1	31.5	32.0	32.6	31.9	32.0

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ren, P.; Li, P.; Chen, X.; Gu, T.; Song, X.; Wang, C.; Yan, K. Spatiotemporal Forecasting of Seismic Activity Trends Using Wiener Filtering and Artificial Neural Networks. Mathematics 2026, 14, 1756. https://doi.org/10.3390/math14101756

AMA Style

Ren P, Li P, Chen X, Gu T, Song X, Wang C, Yan K. Spatiotemporal Forecasting of Seismic Activity Trends Using Wiener Filtering and Artificial Neural Networks. Mathematics. 2026; 14(10):1756. https://doi.org/10.3390/math14101756

Chicago/Turabian Style

Ren, Pengfei, Peijia Li, Xiaoyang Chen, Tingkai Gu, Xiaoyu Song, Cong Wang, and Kai Yan. 2026. "Spatiotemporal Forecasting of Seismic Activity Trends Using Wiener Filtering and Artificial Neural Networks" Mathematics 14, no. 10: 1756. https://doi.org/10.3390/math14101756

APA Style

Ren, P., Li, P., Chen, X., Gu, T., Song, X., Wang, C., & Yan, K. (2026). Spatiotemporal Forecasting of Seismic Activity Trends Using Wiener Filtering and Artificial Neural Networks. Mathematics, 14(10), 1756. https://doi.org/10.3390/math14101756

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Spatiotemporal Forecasting of Seismic Activity Trends Using Wiener Filtering and Artificial Neural Networks

Abstract

1. Introduction

2. Fundamental Principles of Seismic Activity Trend Forecasting

2.1. Principles of Wiener Filtering-Based Seismic Activity Trend Forecasting

2.2. Principles of Artificial Neural Network-Based Seismic Activity Trend Forecasting

3. Seismic Activity Trend Forecasting for the North-South Seismic Belt

3.1. Wiener Filtering-Based Seismic Activity Trend Forecasting for the North-South Seismic Belt

3.1.1. Results of Wiener Filtering-Based Seismic Activity Trend Forecasting

3.1.2. Sensitivity Analysis of Wiener Filtering-Based Predictions

3.2. Artificial Neural Network-Based Seismic Activity Trend Forecasting for the North-South Seismic Belt

3.2.1. Training and Validation of the Artificial Neural Network Model

3.2.2. Results of Artificial Neural Network-Based Forecasting

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI