1. Introduction
The North Atlantic Oscillation (NAO) is the predominant way in which atmospheric variability occurs in the North Atlantic region [
1,
2,
3,
4,
5]. It is characterised as the primary mode of daily to decadal oscillations of the wave. It regulates the primary meteorological patterns across the North Atlantic, North America, Europe, and to some extent, Asia. A negative phase (
Figure 1a) of the NAO is linked to a substantial arching configuration of the jet stream in the North Atlantic and a southward-shifted storm track in Europe. Consequently, this results in colder and drier than average weather conditions in Northern Europe, while Southern Europe experiences wetter weather. An anomalous strength of the semi-stationary Icelandic low and Azores high-pressure regions is observed during a positive phase (
Figure 1b) of the NAO. This phenomenon leads to a more zonal jet stream over the North Atlantic and a northward shift of the storm track over Europe. Consequently, Northern Europe experiences wetter and warmer weather, while Southern Europe encounters dry weather. Studying the variability of the North Atlantic Ocean and atmosphere will be a crucial area of focus for the climate and research in the coming decade [
3,
4,
5,
6,
7].
The winter NAO has significant variability, exerting a considerable impact on weather patterns and climate dynamics throughout interannual to decadal time periods across North America, Europe, and Asia. The NAO variability is a result of internal atmospheric fluctuation, as well as intricate interactions between the atmosphere and the ocean. The variability of the NAO has significant implications for coastal populations, manifesting in various ways such as storm surges, high tide, atmospheric blocking resulting in persistent weather patterns, flooding and drought, sea ice formation, and coastal erosion [
2,
3,
4,
5,
6,
7,
8,
9].
Skilful forecasts are advantageous to the community. The relevance of winter windstorms to the food systems and insurance sector is significant. The influence of winter temperatures on energy pricing and the potential disruption of transport networks have been seen [
8,
9]. The enhancement of the NAO forecast also results in financial benefits by reducing electricity consumption during the winter season in Scandinavian nations through the implementation of hydropower generation and the energy market [
10,
11]. The NAO variability and prediction is an ideal test subject for developing innovative transdisciplinary research frameworks because of the strong connections between the atmosphere, the ocean, and society.
The NAO’s adept forecasts [
8,
9,
10,
11,
12,
13] yield significant advantages for society, encompassing river dynamics, transportation, sea ice extent, energy production, water resource administration, insurance, and food systems. Notable instances include the effects on hydropower production and energy markets in Scandinavia, as well as the management of wind power in the UK [
14,
15,
16]. There are a limited number of studies that rigorously quantify the impacts of the NAO. However, a significant challenge arises from the inherent difficulty in quantifying the socio-economic response to this phenomenon. This difficulty arises from the fact that a substantial portion of the pertinent information is disseminated through news portals and other communication media. Consequently, extracting and analysing this information for the scientific community is a complex task. The subsequent obstacle pertains to the formulation of the analysis–prediction framework [
17,
18,
19,
20]. Skilled NAO prediction is crucial for enhancing numerical models of atmospheric dynamics, which serve as the foundation for operational weather forecast services. However, the progress in these models is hindered by the intricate nature of the climate system and the models themselves. According to recent research [
13], it has been proposed that the accuracy of the NAO winter forecast can be enhanced by integrating information regarding the external factors influencing the NAO tropical variability, such as the El Nino Southern Oscillation (ENSO) and the Madden–Julian Oscillation (MJO), Arctic Sea ice, and solar variability. The intricate interactions between the ocean and atmosphere are difficult to unravel using numerical models or standard analysis methods [
13]. Therefore, the third task is to create a proficient model.
Novel data analysis approaches are emerging as viable alternatives to dynamical models. One illustration of Big Data analysis approaches encompasses the utilization of extensive text mining tools and the application of deep learning in time-series analysis. These techniques can be employed to examine the extensive textual resources (such as newspaper communications like the New York Times Annotated Corpus, social media, and public reports) with the aim of measuring the effects on the power generation sector and the decision-making process at the federal level concerning public services in the countries impacted by the NAO. But, it should be noted that much less useful information is available on the future NAO in the meteorological and oceanographic data used by previously published studies on predicting the wintertime NAO. Past weather information has been extracted based on whatever information was found. If information was found, then we coded this as 1, if not, then it was coded as 0. The techniques [
17,
20,
21,
22] have the potential to be utilised in the identification of optimal communication routes for prediction and forecast services, specifically in terms of facilitating the dissemination of information from the scientific community to society. In addition, there exist novel methodologies for the analysis and prediction of NAO variability. Various research has focused on analysing the portrayal of the NAO in climate models. An autoencoder (AE) is a neural network structure specifically created for the purpose of unsupervised learning. It operates in a comparable manner to Empirical Orthogonal Function (EOF) in terms of diminishing the dimensionality of data and extracting the most significant patterns. AE is distinct from EOF and able to extract intricate and nonlinear patterns. The study by Ibebuchi [
23] utilised AE to condense the SLP anomaly data by employing the encoder. During the training process, the AE acquires the ability to reconstruct the input single-layer perceptron data by utilizing the decoder and the encoded representations. The reconstructed patterns are compared to the original input in order to calculate the loss, such as the mean squared error. This loss is then minimised during the training process. This technique facilitates the model in acquiring the fundamental patterns of the data.
This method incorporates a combination of meteorological and oceanic predictors. By incorporating pertinent predictors such as regional sea surface temperature/heat flow indices for oceanic predictors, ENSO, and indices designed to capture stratospheric variability, it is plausible that comparable methodologies might potentially improve the prediction of the NAO. The suggested effort is based on the use of these innovative methods to examine the elements of the atmosphere–ocean–society system (see to
Figure 2 for further details).
This paper aims to achieve the following important objective:
The objective is to create a novel generalised additive forecast model, referred to as “DeepNAO,” which integrates deep learning through text mining sources from textual resources. This model is used for cross-validating the NAO index.
3. Results
In the current era, characterised by advanced computing capabilities, extensive data generation, and a vast influx of global information, the scientific community is confronted with the task of extracting and condensing crucial insights regarding the variability of the Earth’s climate system from the continuously flowing data derived from observations and models. Additionally, there is a challenge in gathering and interpreting the indicators of the socio-economic response, which is necessary for the advancement of analysis tools and prediction models. These efforts aim to bolster societal resilience and prosperity. DeepNAO tackles three challenges in its reinterpretation of the traditional ‘push–pull’ problem [
1].
A local weighted regression has been employed to evaluate time-series data and the correlation was 0.54, and then the GAM has been adjusted to the increments, utilising the velocities between consecutive places as its reaction. The velocities are derived by the process of data partitioning and subsequently computing the mean velocities (measured in degrees of latitude–longitude per day) across consecutive locations. Each component (x,y) of velocity is fitted with a distinct generalised additive model (GAM) in order to anticipate the velocity field. In this rudimentary model, we model each velocity as a continuous function of its corresponding position. After including the spline transformation, the correlation was 0.68. The estimated density is used as a basis for drawing samples. The estimate is treated as a Gaussian mixture, which represents the probability distributions of observations in the entire population. Initially, a component of the mixture is selected, and then a sample deviation is derived from that selection. The other data collected from reports are subjected to analysis using text mining techniques. Then, the correlation increased to 0.74.
The annual NAO value was compared using leave-one-out cross-validation with the GAM and Deep-NAO. It was noted that the GAM performed well in reproducing the NAO value, and the inclusion of the deep learning method with the GAM resulted in improved performance (
Figure 4 and
Figure 5). In the Deep-NAO model, the text information will act as a layer that can be a variable into the model.
4. Discussions
The authors [
23] highlighted the promise of using data-driven techniques as a reliable tool for predicting the seasonal NAO and modelling air–sea interactions. While this work has significant advances, it is crucial to recognise the existence of some constraints. The existing causal discovery algorithms, which depend on data, have a constraint that restricts them to performing mutual causal analysis exclusively on data from a particular time period, often on a monthly basis. As a result, these systems lack the ability to analyse NAO occurrences that happen at specific times, such as during a particular season. However, due to the cyclical nature of NAO events, it is necessary to employ separate seasonal analysis and forecasting. Furthermore, the research undertaken by the authors in [
23] suggests that causal discovery algorithms may encounter challenges in finding factors that have enduring impacts on NAO or those that exhibit teleconnections. Moreover, the use of data-driven methodologies can aid in predicting future events by utilising both past and current data. However, the accuracy of these predictions is greatly influenced by the quality and quantity of the accessible data. This study specifically examines monthly forecasts and utilises sea level pressure data dating back to 1899. The dataset consists of little more than 1300 monthly data points. Limited availability of data can lead to overfitting issues during model training. In addition, deep learning, which is an opaque method, lacks the ability to provide logical explanations for the gained attributes and their observable outcomes.
To overcome these limitations, future research can explore alternative areas of study. One possible topic to explore is the analysis of different algorithms for identifying causes. These algorithms should be able to assess occurrences of NAO (North Atlantic Oscillation) at certain time intervals and efficiently include seasonal fluctuations [
26]. Further research should examine the use of additional remotely connected but important factors, such as El Niño Southern Oscillation (ENSO) events in the equatorial Middle and East Pacific, anomalies in sea surface temperatures (SST) in tropical regions, and the amount of snowfall. This research aims to improve the accuracy of forecasts. To address the issue of inadequate data, efforts were made to incorporate data from a wider range of sources. For example, using data from carefully integrated model simulations to gain an understanding of the fundamental physics of the model. Furthermore, it is imperative to develop methodologies that can effectively handle missing data, as this has the potential to improve the robustness of the models. Further study could explore methods for interpreting and visualising the fundamental representations of deep learning models to acquire a more profound comprehension of their tangible implications. These efforts have the potential to generate substantial insights into the underlying physical mechanisms and contribute to the validation of the models’ predictions. Generally, data-driven approaches have limitations. However, there are many areas that could be explored in the future to overcome these limitations and improve our understanding of NAO dynamics and their effects [
27].
In a distinct investigation, the authors in [
28] analysed the influence of the El Niño Southern Oscillation (ENSO) on the atmospheric circulation during the initial winter period in the Euro-Atlantic region from 1979 to 2022. The analysis was performed utilising multiple reanalysis datasets. The study documented the El Niño Southern Oscillation (ENSO) atmospheric circulation trends from 1979 to 2022. The analysis found that the ENSO footprint had a better fit with the EAP than the commonly mentioned NAO pattern. The influence was associated with dipolar convection anomalies resulting from the El Niño Southern Oscillation (ENSO) in the Gulf of Mexico and Central America (GMCA). These irregularities can produce a sequence of Rossby waves that extend towards the north into the North Atlantic. This leads to the formation of a region of low atmospheric pressure south of Iceland and west of Ireland during El Niño occurrences. As a result, this causes an increase in El Niño Atlantic Precipitation (EAP).
Subsequently, the researchers [
28] performed an investigation on the possible long-term variation in the early-winter ENSO teleconnection. An increase in the connection between the El Niño Southern Oscillation (ENSO) and the East Asian summer monsoon (EAP) was noted throughout the late 1990s. An evident EAP reaction was noted in the initial winter of the El Niño Southern Oscillation (ENSO) in the late 1990s. Before the late 1990s, the ENSO regression pattern had resemblances to a North Atlantic Oscillation (NAO) pattern. This observation suggests a possible shift in the early-winter ENSO teleconnection towards the Euro-Atlantic region in the late 1990s. Since the late 1990s, there has been a notable improvement in the GMCA precipitation response to the El Niño Southern Oscillation (ENSO). Although the TWEIO precipitation forcing is still in effect, the GMCA anomaly has had a notable impact on the TWEIO by causing an NAO anomaly. However, the GMCA precipitation has become significantly more susceptible to ENSO since the late 1990s. Although TWEIO precipitation forcing is still being used, the dominance of GMCA precipitation becomes prominent, resulting in the formation of an EAP by the stimulation of a north-propagating Rossby wave train.
The results of this study are consistent with earlier research that has demonstrated a discernible but restricted ability to predict the seasonal behaviour of the El Niño Southern Oscillation (ENSO)-related El Niño Atlantic precipitation (EAP) and the surface climate in the Euro-Atlantic region during the beginning of winter. The reduced strength of this signal is believed to be caused by the less accurate model simulation of tropical–extratropical teleconnections, which seems to be linked to the underestimated convection response to the El Niño Southern Oscillation (ENSO) phenomenon in the Gulf of Mexico and Central America (GMCA) region [
28]. The results of our study highlight the importance of GMCA precipitation in the ENSO–EAP teleconnection, therefore confirming these claims. Moreover, the recent intensification of the ENSO–EAP teleconnection indicates a shift in the way the ENSO phenomenon affects the atmospheric circulation in the Euro-Atlantic region during the beginning of winter. The results of this study are important for understanding the ENSO teleconnection in early winter and improving the accuracy of seasonal climate forecasts in the Euro-Atlantic region.
The authors note that their findings are primarily based on observational data. They highlight the importance of incorporating information from modelling and CMIP6 simulations in future investigations. The study focused on analysing the effects of low-frequency Rossby waves resulting from the ENSO. The North Atlantic region is distinguished by a notable prevalence of atmospheric eddy–low-frequency flow feedback. Therefore, it may be essential to assess their potential consequences as well. In addition, the research mostly concentrated on the linear impacts of the ENSO. It is imperative to ascertain whether this link is influenced by the particular kind or variant of the ENSO. It is crucial to study how the ENSO teleconnections will be affected by changes in the future climate, particularly due to greenhouse warming. This is an intriguing topic for future research.