1. Introduction
Solar activity is a range of phenomena deriving from the Sun, including solar flares, sunspot cycles, and coronal mass ejections, which significantly impact space weather and geomagnetic conditions on Earth. Researchers have studied the effects of solar activity on Earth’s atmosphere and magnetic field in depth, contributing to our understanding of the impact of space weather on technological and natural processes [
1,
2,
3,
4,
5].
The complex interaction between terrestrial and extraterrestrial forces has historically been of great interest to the scientific community. This area of research examines the potential relationship between solar activity and seismic activity. Seismologists traditionally associate earthquakes with tectonic forces, but recent research has shown that solar activity can also influence seismic events. Exploring the links between space weather and geophysical phenomena is not timely but critically important because of the growing need for sophisticated disaster forecasting and mitigation strategies.
The notion that solar activity may trigger seismicity is based on several studies. As an example, Huzaimy and Yumoto examined the relationship between the high-speed solar wind (HSSW) and massive earthquakes, and found a significant relationship, especially during sunspot cycles [
6]. It is consistent with the findings of Tavares et al. A long-term study of global earthquakes and solar activity cycles from 1600 to 2010 was conducted and found that earthquakes often occur after solar maxima, especially in tectonically active regions such as the Pacific and South American plates [
7]. The methodological basis for this study was to exclude data, obtained from the National Geophysical Data Centre (NGDC), prior to 1600 because of the scant and inconsistent records of both solar activity and seismic events prior to that date. Significant fluctuations in solar activity such as the Maunder minimum (1645–1720) and Dalton minimum (1790–1820) were also observed during this period, providing insight into the relationship between low solar activity and earthquakes. Nikouravan et al. provided additional supporting evidence by showing an apparent strong association between solar proton events (SPE) and local earthquakes in Iran as seismic activity appears to peak during solar maxima [
8].
Marchitelli et al. analyzed proton density and velocity data from the SOHO satellite over a 20-year period, as well as earthquake data from the ISC-GEM catalog. Since proton density showed the strongest correlation with seismic activity, it was chosen as the most important parameter. In the conclusion of the paper, the researchers found that there is a statistically significant correlation between proton density induced by solar activity and the occurrence of large earthquakes (magnitude greater than 5.6). A high correlation was found between proton density and earthquakes, with a time lag of one day. This indicates that an increase in proton density may precede earthquakes. The optimal proton density threshold used in the study was 15.5 protons/cm
3, chosen based on an analysis that demonstrated maximum correlation with earthquakes. However, the reasoning behind this threshold selection requires more rigorous backing, as mentioned in previous studies [
9].
However, the association between solar and seismic activity is not necessarily unambiguous. For example, while some studies associate high solar activity with increased seismicity, others indicate that high-magnitude earthquakes often occur during periods of low solar activity. This seeming contradiction may be due to different mechanisms affected by various solar cycle phases. For example, during periods of solar minimum, changes in solar wind and geomagnetic activity may cause seismic responses differently than during solar maximum periods, when increased solar radiation may directly affect ionospheric and terrestrial processes [
8,
9]. Thus, these complexities of the correlation may involve different mechanisms under different solar conditions, indicating a need for the further exploration of these dynamics.
In addition to statistical relations, some researchers have considered possible physical mechanisms that might relate solar activity and seismic activity [
10]. For example, Ruzhin, Sorokin, and Yaschenko suggested that geomagnetic winds caused by solar flares could create telluric currents in the Earth’s crust that would trigger earthquakes [
11]. Then, they found that, after the large (X9.3-class) solar flare, there was not only a global change in seismicity but that it is also quite possibly regional in nature. Both authors indicate that the geomagnetic pulses from solar flares might result in the dissipation of electric currents concentrated within the fault zone creating sufficient stress on the fault to cause an earthquake [
12].
In contrast, the purported influence of solar activity on seismicity is somewhat controversial. For example, one study in which scientists examined solar winds and earthquakes in Latin America showed a weak correlation, especially for earthquakes less than magnitude 6 [
13]. Cataldi, Bose, and Strather countered that there may be overlapping time signatures between increased solar activity and increased seismicity; however, they also argue that solar activity is not the primary driver of crustal collapse [
14]. The study of solar activity in earthquakes goes beyond simple correlations and causal relationships. Uzunov and Khachikian extended the study of geomagnetic storms by considering seismic activity associated with geomagnetic storms that excite new radiation belts around the Earth. Large earthquakes are likely to occur near geomagnetic traces of active radiation belts of relativistic electrons. This suggests that there is some behavior of moving solar particles associated with geomagnetic storms that are relevant to seismic events [
12]. Solar flares are linked to activity that increases the Earth’s angular velocity and also induces telluric currents that may contribute to earthquakes [
15].
Solar activity has the potential to predict earthquakes. Saldana and Hirata used extended advanced statistical techniques to show that there is a unidirectional causal relationship between solar activity and seismicity. The accuracy of earthquake prediction improved when sunspot data was included [
16]. Lee et al. used deep learning to evaluate the relationship between solar activity and global earthquakes and found a possible relationship, especially with earthquakes of magnitudes 4.0 to 4.9 [
17]. Overall, the results suggest that although solar activity cannot predict earthquakes on its own when included in an overall prediction model together with other correlations, it can work.
Deep learning techniques [
18] have also been used to study seismicity and solar activity. For example, different datasets were combined using machine learning techniques to study earthquake phenomena [
19,
20]. The results indicate that LSTM provides the best model for earthquake prediction, primarily because it captures the complex nonlinear aspects of the observed data and accounts for time dependence [
20]. A study by Saqib et al. noted that LSTM was significantly better than the ARIMA model at detecting anomalous indicators and seismic events in the ionospheric anomaly data they studied [
21].
As mentioned above, studies on earthquake prediction have considered situations where solar activity was associated with seismicity a few days after solar-related activity. Abri and Artuner discussed this topic in their paper, where the LSTM method outperformed all other models considered, including common ones such as SVM and Random Forest. They suggest that LSTM is the best option for tasks involving temporal processes, such as predicting earthquakes from time–series data [
22].
Perhaps most interestingly, Saqib et al. suggested that LSTM is better than ARIMA at predicting earthquake-associated ionospheric anomalies [
21,
23]. This lends credence to the use of LSTM over traditional types of models in complex problems such as proton density prediction and subsequent prediction of associated earthquakes, as LSTM does have clear advantages. Studies of seismic activity prediction, such as earthquake occurrence, along with observations of proton density, have confirmed that increasing proton density is a major factor in predicting seismic activity. It has been demonstrated that LSTM models that account for proton density and its changes over time can identify and discriminate patterns associated with increasing earthquake occurrence better than traditional models [
24].
A critical characteristic of the application of LSTMs in particular, and deep learning models with multiple layers of temporal dependencies in general, is the ability of the model to learn from complex temporal dependencies [
18,
25]; this is very important for understanding the variability of solar activity and its relationship to earthquakes. Time–series data, such as those on solar activity, show variable and complex patterns associated with seasonal effects; so, many traditional approaches, such as ARIMA or SARIMA models [
26,
27], do not capture multi-level dependencies. Since LSTM has methods of forgetting and establishing reference lines to the results, it is often effective and provides better forecasting ability on complex time–series variables. LSTM predicts solar activity’s effect on earthquakes significantly better than classical approaches.
To summarize, the best approach is to use a model such as LSTM or to apply the LSTM model to study the complexity of time–series of solar variables and the effect of solar activity on seismicity. This is the best way within the current academic interpretation and knowledge [
28]. Comparison with other models also confirms that the high accuracy and precision of predictions will be higher with the LSTM model. This is a key area for future work.
This study focuses on two important tasks related to solar activity and its connection with seismic processes on Earth. The first task, related to proton density prediction using LSTM, will improve the accuracy and reliability of space weather monitoring in the long run. Proton density is one of the key parameters of solar activity, so attempting to predict proton density will provide insight into understanding solar phenomena and their potential impact on geophysical processes on Earth.
The second objective of the research is to predict global-scale earthquakes based solely on proton density. This approach allows us to assess how well the model can perform this task, despite controversial opinions about the correlation between solar activity and earthquakes. If successful, this task will lead to the development of disaster prediction and management methods. Efforts to monitor solar activity can improve preparedness and reduce the impact of earthquakes on vulnerable communities.
  2. Materials and Methods
The correlation between earthquakes and solar activity—in particular, changes in proton density—is increasingly becoming the subject of scientific research due to the unique interactions protons have with Earth’s geomagnetic and ionospheric environments. Proton density, unlike transient solar phenomena such as solar flares or geomagnetic storms, provides a stable and reliable measure within the solar wind. Given these qualities, recent studies have shown that there may be a correlation between large-scale solar events and large-scale earthquakes. Therefore, accurate prediction of solar wind parameters, especially proton density, is a key challenge in understanding and predicting geophysical phenomena affecting natural and human systems. To solve this challenge, we designed a prediction model based on the LSTM network [
29], a recurrent neural network designed to process networks, a type of recurrent neural network made especially to handle sequential input.
Because of their inherent internal storage structure, long-range relationships can be uniquely modeled by LSTM networks, which makes them attractive for use in time–series forecasting. Unlike traditional RNNs, LSTM networks have a mechanism that controls information flow through input, forget, and output gates. For forecasting space weather events, this is especially significant for the solar wind and proton density, where temporal patterns can demonstrate both short-term fluctuations and long-term trends [
30,
31].
  2.1. Data Collection
The dataset used in this study was obtained from the Solar and Heliospheric Observatory (SOHO) mission, specifically from the Charge, Element, and Isotope Analysis System (CELIAS) instrument, which measures proton density in the solar wind. The original dataset consists of proton density measurements recorded every 5 min from 1996 to July 2023.
The data were resampled to an hourly time scale at 5 min intervals to aid in analysis and reduce data volume. The highest proton density value for each hour was chosen for the resampling process. This method makes sure that the dataset contains the most extreme variations in proton density, which is important for examining any possible links with seismic activity. Thus, the obtained dataset represents a complete hourly record of proton density for almost three decades, which is the basis for subsequent analysis and modeling.
The United States Geological Survey (USGS) earthquake catalog covering the period from 1996 to July 2023 was used in this study. This dataset contains detailed records of seismic events around the world, which is very important for studying potential correlations between solar activity and seismic events.
To ensure the integrity of the analysis, the earthquake dataset was carefully declustered to isolate primary seismic events by removing dependent events such as aftershocks and foreshocks, which can misinterpret the correlation analysis. Spatial and temporal filtering criteria were applied, as outlined in methodology by Marchitelli et al. [
9]. The dataset was filtered to remove earthquakes that occurred within 150 km of a significant event and during 6 months. The purpose of this declustering procedure was to reduce potential errors caused by dependent events and concentrate on primary seismic events.
Filtering earthquakes by depth and magnitude, taking into account only big seismic events (magnitude larger than 6 and depth less than 60 km), was a crucial step in the data production. This allowed us to focus on earthquakes that might be connected to variations in solar activity. The average value for each column was used to fill in the gaps in the proton density data to guarantee data completeness.
The next step was to split the data and create a new column based on the proton density data, indicating whether an earthquake occurred in the next 48 h. This step prepared the data for the binary classification task. Label analysis showed that the classes were highly unbalanced, so a method of calculating class weights had to be used to further train the model.
A 48 h data sequence was prepared to train the LSTM model. This time window enabled the model to incorporate temporal dependencies over two days, increasing prediction accuracy. To enable the model to learn patterns in the data effectively, the MinMaxScaler function was used to normalize the feature values to a range between 0 and 1. This reduces the impact of differences in feature values and allows the model to learn more efficiently by standardizing the input data. In contrast to a standard scaler (converting the data to zero mean and unit variance), MinMaxScaler preserves the relative differences between minimum and maximum values, which is useful for capturing anomalies and abrupt changes in proton density that may correlate with seismic events. LSTM models are sensitive to the scale of the input data because the training uses gradients to adjust the weights. MinMaxScaler helps to avoid explosive growth or disappearance of gradients, which can improve stability and convergence rate. After that, the data was divided into test and training sets at a ratio of 80% to 20% so that the model’s performance could be assessed on data that had not been used for training.
  2.2. LSTM for Earthquake Classification
In the case of earthquake prediction based on proton density data, most of the time intervals do not contain earthquakes, which leads to a strong class imbalance: most examples are in the “no earthquakes” class; a smaller fraction are in the “earthquake” class.
This is common for real data, particularly when forecasting rare events like earthquakes is required. Without considering the imbalance, a model trained on such data may be biased toward forecasting a more frequent class (in this case, the lack of an earthquake). Despite high overall accuracy, the model may perform poorly in predicting a rare but important class, i.e., earthquakes. For example, the model may predict that there will be no earthquakes and thus achieve high accuracy, completely ignoring the need to predict earthquakes correctly, and reducing its practical relevance.
To deal with this problem, the ‘class_weight’ parameter is used to weigh the contribution of each class to the loss function. In doing so, rarer classes such as earthquakes are assigned higher weights, making their correct classification more meaningful to the model. In this way, the model becomes more adept at identifying unusual occurrences and decreases the possibility of overlooking crucial forecasts.
For this study, the model prioritizes accurately forecasting the presence of earthquakes due to the greater weight assigned to the type of earthquakes. This in turn pushes the model to understand the patterns leading up to an earthquake and helps prevent the scenario where it produces simplistic forecasts in favor of a more frequent class.
Metrics like recall and F1-score—that pertain to rare class prediction—have been improved when “class_weight” is used. This is especially important for problems such as earthquakes where missing critical events can have serious consequences.
Next, the model was built using a three-layer LSTM (
Figure 1) (each layer contains 64 neurons), one dense layer, and a ‘tanh’ activation function as shown in 
Table 1. The three layers of the LSTM work together to capture increasingly subtle temporal patterns in the proton density data. The first layer identifies broad patterns, the second layer refines these patterns, and the third layer condenses the information into high-level representations. This layered approach allows the model to explore complex temporal relationships for accurate predictions while minimizing the risks of overshooting. The Adam optimizer [
32] was used in the study to optimize the model, with a gradient rate of 1.0 and an initial learning rate of 0.0001. These settings were selected through a trial-and-error process, where various configurations were tested to determine their impact on model performance.A binary cross-entropy function was used to calculate the model loss, and accuracy was computed as the ratio of correct classifications. Using the EarlyStopping callback, training was carried out under supervision and stopped after five epochs if the validation loss did not improve. The best model’s weights were saved after training so that they could be used to assess the test patterns.
  2.3. Proton Density Forecasting
Proton density is an important parameter for understanding solar activity, which is also relevant to space weather [
33,
34,
35] and can influence geophysical processes, including earthquakes, on Earth. Accurate proton density predictions are important for reliable space weather monitoring and forecasting to better respond to situations that affect technology and natural phenomena. Thus, another important goal of the project, in addition to earthquake prediction, is proton density forecasting using LSTM models.
In this case, proton density is doubly important because it improves the understanding of solar dynamics and also provides a broader basis for predicting related terrestrial phenomena such as earthquakes. During solar spikes, a reliable proton density prediction model can serve as an early warning tool and help establish periods of increased solar activity that may be associated with increased seismic activity on Earth. To this end, the use of advanced deep learning models, such as LSTM, for proton prediction is both an area of scientific research and of great applied importance.
To achieve this, a sophisticated LSTM model was designed, incorporating bidirectional layers to capture patterns in both past and future sequences of the data. The model’s architecture consists of multiple layers as shown in 
Table 2: a bidirectional LSTM layer with 128 neurons and a ‘tanh’ activation function, followed by batch normalization and dropout [
36] to prevent over-fitting. The architecture was designed to balance model complexity and prediction accuracy while avoiding over-fitting. The choice of two LSTM layers (one bidirectional and one standard) allows the model to train on both forward and backward data sequences in the first layer and refine these patterns in the second layer. This layered approach allows the model to capture complex temporal dependencies in the proton density data without excessive depth, which can lead to over-fitting. This is followed by a second LSTM layer with 64 neurons, another round of batch normalization and dropout, and a dense layer with 32 neurons using the ‘relu’ activation function. LSTM layer reduces dimensionality and tunes the learned patterns, creating a compact representation. The dense layer serves as a transition between the LSTM layers and the final output layer. The final output layer with a single neuron and a ‘linear’ activation function, is suitable for regression tasks like predicting proton density (
Figure 2). Each architectural choice was carefully considered to maximize accuracy while maintaining computational efficiency.
The model is trained using the Adam optimizer and Mean Squared Error loss is applied to the regression problem. To ensure training robustness, we implement several callbacks: ‘EarlyStopping’ stops training when the performance of the models on the validation set stops improving; ‘ReduceLROnPlateau’ reduces the plateau when the learning rate is reached; ‘ModelCheckpoint’ keeps the model with the best performance. ‘ModelCheckpoint’ keeps the model that performed the best.
The result of the model development and implementation is a reliable proton density prediction tool, which is essential for understanding and predicting the impact of solar activity on the Earth. The performance of the model is very important because it also contributes to the higher goal of linking solar activity to earthquakes on Earth and improving the prediction and mitigation of natural disasters.
  3. Results
In this study, an LSTM-based prediction model was developed and applied to study the relationship between solar activity (especially proton density variations) and earthquake occurrence. The performance of the model in predicting earthquakes and its accuracy in predicting proton density are described in detail below. The results show that the LSTM model has a remarkable ability to recognize patterns in the data despite the challenges of class imbalance and the complexity of the underlying phenomena.
  3.1. Earthquake Classification
The LSTM model was trained on 100 epochs, and the training time was 4–10 min per epoch. In the early stages of training, the accuracy of the model is unstable, starting from 0.5394 and loss of 0.6918 in the first epoch. The validation accuracy at this stage is 0.5009 with a loss of 0.6893. These initial results show that it is difficult to distinguish seismic from non-seismic events based on proton density data alone.
As the model was trained, it became possible to capture temporal correlations in the data. In the final epoch (the 93rd epoch), the model had a training accuracy of 0.8313 with a validation loss of 0.3600 and a validation accuracy of 0.8384 with a loss of 0.3572. This result indicates the learning process and generalization of the model. The model was not over-fitted and completed due to no improvement in validation loss for several consecutive epochs.
Table 3 summarizes the overall classification performance, including precision, recall, F1-score, and overall accuracy for both seismic and non-seismic classes. After training the model, the test accuracy reached 0.8447, with a corresponding test loss of 0.3425. For example, in a study using total electron content (TEC) data for earthquake detection, it was reported that LSTM-based prediction models achieved an accuracy of about 82% in classifying earthquake events based on TEC values over several days [
22]. This indicates that the LSTM model for earthquake prediction using proton density is not only comparable but also exhibits slightly higher accuracy, indicating its reliability in predicting seismic activity. Furthermore, a study of the effect of solar activity on seismicity showed that the integration of solar parameters led to a marked improvement in the accuracy of seismic event prediction, increasing the likelihood ratio from 2.6% to 17.9% for the largest magnitude predictions [
16]. Although the focus is on solar activity rather than proton density, these data highlight the potential of integrating various solar parameters to improve prediction models.
 For the seismic class, the precision was 0.6807, the recall was 0.8368, and the F1-score was 0.7507. These metrics indicate a relatively lower precision for earthquake prediction (0.68) compared to non-earthquake events (0.93), as shown in 
Table 3. This discrepancy raises concerns about potential model bias due to the inherent class imbalance, which is not fully mitigated despite efforts to adjust the class weights during training.
To address this, we varied class weights during the training process to incrementally reduce the impact of class imbalance while improving prediction accuracy for the underrepresented seismic class. Although this strategy improved recall for earthquakes (0.84), the precision remained comparatively lower, indicating that the model is prone to false positives in predicting earthquake events. These false positives may primarily occur for events that are close to the decision boundary or could be influenced by specific confounding factors, such as non-seismic indicators being misclassified as seismic events.
The F1-score for the seismic class reflects a reasonable balance between precision and recall, but further adjustments—such as advanced resampling techniques or introducing additional seismic predictors—may be necessary to achieve a more balanced performance, while the adjustment of class weights contributed to improved recall, the increased sensitivity in the model could lead to a higher number of false positives, thereby impacting overall precision.
Figure 3 illustrates the spatial distribution of earthquake predictions from 1996 to July 2023, highlighting the accuracy of the model across different locations. Correctly predicted events are marked in green, while incorrect predictions are in red. These results underscore the complexity of modeling seismic activity, where even minor shifts in solar activity indicators, such as proton density, can lead to misclassifications.
   3.2. Proton Density Forecasting
In the task of forecasting proton density, the LSTM model demonstrated remarkable accuracy and consistency. The model’s performance in forecasting from 2018 to 2023 is illustrated in 
Figure 4. The plot demonstrates the model’s ability to closely track the actual values over this period, highlighting the effectiveness in capturing the temporal variations in proton density.
The model was trained for up to 1000 epochs, and halted by the 401st epoch due to the EarlyStopping criterion, as the validation loss stabilized. During the initial epochs, the model showed rapid improvement. By Epoch 1, the model achieved a training loss of 0.0725, which quickly dropped to 0.0013 in the second epoch, with the validation loss improving from  to .
As the training progressed, the model refined its ability to capture temporal patterns in the proton density data. By Epoch 401, the training loss had decreased to 0.00043347, with a corresponding validation loss of 0.00046692. The final performance on the test set was evaluated using the Mean Squared Error (MSE) metric, which yielded a value of 1.7158. This low MSE indicates the model’s strong predictive capability and effectiveness in capturing the underlying dynamics of proton density variations.
However, while the MSE is low, it is essential to assess its practical significance against established industry and scientific standards for predictive accuracy. Typically, acceptable MSE values depend on the specific application context, such as the required precision for scientific studies or operational forecasting in industry. For instance, in some scientific domains, an MSE below 2.0 may be considered satisfactory, while in critical applications like space weather forecasting, stricter thresholds might apply. A high MSE could lead to substantial forecasting errors, resulting in misinterpretations of solar activity and its effects on seismic events, which can have serious consequences for disaster management.
Thus, while the current MSE suggests that the model performs well, it would benefit from further benchmarking against these standards to determine its adequacy for practical applications.
The consistent reduction in training and validation loss throughout the training process indicates the model’s robustness in forecasting proton density. The combination of bidirectional LSTM layers, batch normalization, and dropout layers prevents over-fitting and ensures the model’s generalizability to unseen data.
  4. Discussion
This study investigated the complex relationship between solar activity and seismic events, highlighting the role of proton density as a significant predictor of earthquake occurrence. The results contribute to the ongoing discourse on the predictive capabilities of solar activity and are consistent with the work of Saldana and Hirata, who found a unidirectional causal relationship between solar activity and seismicity; while the LSTM results support the notion that solar activity can improve the accuracy of earthquake prediction, we also show limitations that suggest that solar activity alone cannot serve as a reliable predictor without the integration of additional variables.
This LSTM approach is based on the existing literature, particularly the studies of Lee et al. and Saqib et al., who explain the success of deep learning techniques in capturing complex patterns associated with seismicity. The LSTM models excel in identifying nonlinear relationships in time-dependent data, confirming the advantages highlighted in previous studies. For topics such as solar activity, which are modeled in a broad framework, this study stands out in its uniqueness by focusing exclusively on proton density. This concentrated attention brings out the subtle patterns in seismic activity associated with variations in proton density that have not been extensively explored with respect to earthquake prediction.
In addition, this study addressed the issue of class imbalance, which is not always fully addressed. By implementing advanced techniques to mitigate bias towards the majority class, the reliability of the predictions was improved. However, it is important to recognize that proton density is only one of many factors that influence seismic activity. Future research should investigate additional variables such as solar wind speed and geological factors such as tectonic plate movements to further improve the predictive capabilities of the model and provide a more comprehensive understanding of the interplay between solar activity and seismic events.
By improving the accuracy of proton density prediction, we can make a valuable contribution to space weather monitoring and ultimately support the development of effective disaster prediction and response methods. This is particularly important for vulnerable communities exposed to the risks of seismic events.
  5. Conclusions
These findings provide valuable insights into the relationship between solar activity and seismic events, suggesting potential applications for disaster management and hazard assessments. However, it is crucial to approach the utility of the LSTM model for predicting earthquakes with caution, given the inherent complexity of earthquake prediction and the multitude of factors that contribute to seismic activity.
While the model demonstrates effectiveness in capturing meaningful patterns based solely on proton density variations, it is important to recognize that proton density is only one of many variables influencing seismic occurrences. Other potential variables that could be investigated in future research include solar wind speed, magnetic field fluctuations, and geological factors such as tectonic plate movements. Exploring these additional factors may enhance the model’s predictive capabilities and provide a more comprehensive understanding of the relationship between solar activity and seismic events.
The precision, recall, and F1-score for earthquake predictions—0.6807, 0.8368, and 0.7507, respectively—indicate that the model can label a significant proportion of earthquakes with reasonable accuracy. Furthermore, the overall accuracy of 84.47% on the test set reflects the model’s robustness in generating predictions for rare seismic events.
The high recall rates are particularly noteworthy, as they highlight the model’s ability to identify a substantial number of seismic occurrences. This capability is critical in contexts where the failure to predict an earthquake could have serious implications. Nonetheless, it is essential to address the challenges of class imbalance and model bias, as these issues can significantly affect the reliability of predictions. Class imbalance refers to the unequal distribution of seismic and non-seismic events in the dataset, which can lead to a model that is biased toward the majority class. This bias can result in a higher number of false negatives, where earthquakes are missed. Therefore, further research is warranted to improve the model’s predictive capabilities by incorporating additional variables and exploring more advanced techniques to address these challenges.