Next Article in Journal
Exploring the Role of Energy Consumption Structure and Digital Transformation in Urban Logistics Carbon Emission Efficiency
Previous Article in Journal
Quantifying Ozone-Driven Forest Losses in Southwestern China (2019–2023)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Time Series Prediction and Modeling of Visibility Range with Artificial Neural Network and Hybrid Adaptive Neuro-Fuzzy Inference System

by
Okikiade Adewale Layioye
1,*,
Pius Adewale Owolawi
1,2 and
Joseph Sunday Ojo
3
1
Department of Computer Systems Engineering, Tshwane University of Technology, Pretoria 0152, South Africa
2
Faculty of Information and Communications Technology, Tshwane University of Technology, Pretoria 0152, South Africa
3
Department of Physics, The Federal University of Technology, Akure 340110, Nigeria
*
Author to whom correspondence should be addressed.
Atmosphere 2025, 16(8), 928; https://doi.org/10.3390/atmos16080928 (registering DOI)
Submission received: 22 May 2025 / Revised: 4 July 2025 / Accepted: 17 July 2025 / Published: 31 July 2025
(This article belongs to the Special Issue Atmospheric Modeling with Artificial Intelligence Technologies)

Abstract

The time series prediction of visibility in terms of various meteorological variables, such as relative humidity, temperature, atmospheric pressure, and wind speed, is presented in this paper using Single-Variable Regression Analysis (SVRA), Artificial Neural Network (ANN), and Hybrid Adaptive Neuro-fuzzy Inference System (ANFIS) techniques for several sub-tropical locations. The initial method used for the prediction of visibility in this study was the SVRA, and the results were enhanced using the ANN and ANFIS techniques. Throughout the study, neural networks with various algorithms and functions were trained with different atmospheric parameters to establish a relationship function between inputs and visibility for all locations. The trained neural models were tested and validated by comparing actual and predicted data to enhance visibility prediction accuracy. Results were compared to assess the efficiency of the proposed systems, measuring the root mean square error (RMSE), coefficient of determination (R2), and mean bias error (MBE) to validate the models. The standard statistical technique, particularly SVRA, revealed that the strongest functional relationship was between visibility and RH, followed by WS, T, and P, in that order. However, to improve accuracy, this study utilized back propagation and hybrid learning algorithms for visibility prediction. Error analysis from the ANN technique showed increased prediction accuracy when all the atmospheric variables were considered together. After testing various neural network models, it was found that the ANFIS model provided the most accurate predicted results, with improvements of 31.59%, 32.70%, 30.53%, 28.95%, 31.82%, and 22.34% over the ANN for Durban, Cape Town, Mthatha, Bloemfontein, Johannesburg, and Mahikeng, respectively. The neuro-fuzzy model demonstrated better accuracy and efficiency by yielding the finest results with the lowest RMSE and highest R2 for all cities involved compared to the ANN model and standard statistical techniques. However, the statistical performance analysis between measured and estimated visibility indicated that the ANN produced satisfactory results. The results will find applications in Optical Wireless Communication (OWC), flight operations, and climate change analysis.

1. Introduction

Weather prediction is an interesting focus due to its significant influence and applications in various fields, such as flight operations, wireless communications (such as Optical Wireless Communication (OWC) and Radio Frequency (RF) transmission systems), and climate change. Visibility is an atmospheric parameter that is related to some of the aforementioned fields. Numerous atmospheric variables (including relative humidity (RH), temperature (T), pressure (P), wind speed (WS), etc.) can greatly impact its state and seasonal pattern. One of the most important scientific goals of predicting visibility is to be able to provide continuous sufficient data to scholars or officers responsible for wireless transmission systems and airport services. According to the information provided in [1,2], connections between high RH and low visibility were observed, indicating that high RH directly affects the extinction coefficient of atmospheric particles, which impacts visibility. Usman et al. [3] observed a linear relationship between visibilities and both relative humidity and temperature. Boudala and Isaac [4] used data on liquid water equivalent snowfall rate (S), cloud ceiling, RH, T, and WS from various sources to study the relationships between visibility and these parameters during the winter seasons of 2005–2007. They found a strong link between S and visibility, while correlations between visibility and the other parameters were weaker.
Several scholars have suggested various techniques to predict time series, such as Simple Moving Average (SMA), Simple Regression Analysis (SRA), Artificial Neural Network (ANN), Multi-variant Regression Analysis (MRA), Auto-Regressive Integrated Moving Average (ARIMA), and Adaptive Neuro-fuzzy Inference System (ANFIS), in order to address the challenges of achieving accurate and efficient time series prediction [5,6,7]. Based on past studies on time series prediction of weather parameters using ANN, it has been determined that this AI technique is effective for handling complex weather predictions. Colabone et al. (2015) [7] also used ANN to predict fog occurrences in Pirassununga, Sao Paulo State, Brazil. Their work showed that the Multilayer Perceptron Neural Network (MLPNN) had about 95% reliability compared to the collected data after training the MLPNN with 19 years of weather variables. For predicting atmospheric variables like rainfall, ANN was found to be the most efficient and competent model [8,9]. However, in certain areas of data analysis, such as short-term stream flow forecasting, studies have noted limitations in the performance of ANN. Kumarasiri and Sonnadara conducted a study in 2006 that revealed that ANN had the capability to forecast short-term stream flow but faced challenges with processing certain input variables and the dimensions of hidden layers [10]. Comparing the ANFIS model to ANN and ARMA models for time series prediction of stream flow, ANFIS was found to forecast results more accurately for 2 days ahead of time [11,12]. Additionally, in other research, it was found that ANFIS performed competitively for monthly data analysis of stream flow, outperforming the ARMA model [13]. This indicates that both ANN and ANFIS models have advantages over other systems, as they are more reliable and competitive.
However, there have not been sufficient studies on the use of AI techniques to predict atmospheric visibility. The aim of this work is to forecast visibility and provide information to other scientific fields, such as wireless communication system or aerospace activities. This work also aims to demonstrate the relationships between various weather-related data (such as RH, T, WS, and P) and visibility using Artificial Intelligence (AI) techniques, specifically ANN and ANFIS, across six different locations in South Africa. The focus is not on determining the best ANN and ANFIS method or architecture but rather on revealing how all atmospheric variables are related to the measured visibility values. Therefore, among the various available Neural Network (NN) types, the Feedforward Backpropagation Neural Network (FF-BPNN) was selected for its simplicity and flexibility, with the Levenberg–Marquardt (LM) training algorithm. Additionally, a hybrid machine learning technique combining ANN and Fuzzy Logic System was utilized in this study. The dependent parameter in this study is visibility, while the independent factors are the meteorological variables mentioned earlier. Meteorological data from six distinct locations in South Africa were used for this investigation. The initial method used for determining the relationship between the dependent variable and the independent variables is Single-Variable Regression Analysis (SVRA). Performance metrics such as Root Mean Square Error (RMSE), Mean Bias Error (MBE), and coefficient of determination ( R 2 ) were evaluated for the ANN and ANFIS approaches in this study.

2. Prediction of Visibility Using ANN and ANFIS

Atmospheric particles such as water drops and fog cause light wave extinction leading to reduced visibility. These particles have states that can be described by values of related weather variables [1,4,14]. This suggests a complex association exists between these weather variables and visibility. The lack of proper analysis when using SVRA or Standard Statistical Techniques (SSTs) to address the complex link has prompted the need to apply ANN and ANFIS techniques in this work.
While the connection between atmospheric variables and airfield visibility has been extensively studied, research on predicting visibility and Free Space Optics (FSOs) losses for Optical Wireless Communication (OWC) applications in terms of atmospheric variables is limited. This study aims to fill this gap by improving FSO link designs through more accurate visibility predictions. Real-time visibility data are essential for maintaining optimal optical link performance, but collecting such data is often challenging due to technical and logistical constraints. Additionally, our modeling approach could help address data deficiencies in other sectors.
This paper introduces AI-driven models that utilize ANN and ANFIS trained on various meteorological data using a back-propagation-based Multilayer Perceptron Neural Network (MLPNN) and neuro-fuzzy architecture, respectively, to forecast visibility across South African provinces. In essence, this study involves predicting visibility using ANN and optimizing the result using the ANFIS model.

2.1. ANN Architecture Description and Its Applications

Obtaining reliable parametric connections among several meteorological factors poses significant challenges. Conventional statistical methods such as standard statistical techniques (SST) often lead to complications and inaccuracies when forecasting meteorological parameters like visibility, primarily due to the complex interplay between independent and dependent variables. For instance, modeling the relationship between visibility and critical elements such as RH, T, P, and WS is notoriously difficult when using standard techniques. These SST methods regularly do not account for the complex and non-linear characteristics in atmospheric data, leaving several issues unresolved. These challenges can be handled by a data analysis approach that relies on AI techniques such as ANN. This technique yields high accuracy and is capable of dealing with intricate data relations, thereby serving as an alternative to SST.
The ANN is a modeling method uniquely designed to predict several erratic and non-linear activities, like atmospheric events, or handle cumbersome data that simple statistical or numerical methods cannot easily handle [7,15]. This technique can carry out special tasks like recognizing, connecting, and generalizing trends based on its ability and enhancement processes, which involve its data learning functions to technically imitate the dimensions of the human brain [7,16]. A characteristic ANN network groups and controls the network inputs to attain a proper response for a predictable output within a reliability limit. The general model of the ANN technique is illustrated in Figure 1, which is in the form of a nodal network. This comprises certain nodes that are trained through several pieces of information arising from internal or external conditions of the system. Firstly, the inputs are allotted random weights, which are always altered until the output is attained at a suitable error level. Therefore, this training process of the network, which involves the conversion of the obtained weighted sum, ends when an expected error margin is achieved. The weighted sum is the addition of the multiplication of each of the entries and the allocated weights. In cases where a suitable error has not been attained, the training process continues with new estimations through an error correction of data entries referred to as the back propagation scheme [7,15,16].
Once the neural network training eventually yields the expected output, the derived design is tested and validated using another set of input data. This architecture tends to reduce the loss function with regard to weights and biases [7,16,17]. Typically, about 70% of the input is utilized for training the model, while the other 30% is split between network testing and validation.

2.2. Adaptive Neuro-Fuzzy Inference System

The ANFIS is a sophisticated and efficient machine learning network that is sometimes used in place of the ANN due to its focus on simplifying the process of obtaining the required result. This hybrid system combines neural network architecture and fuzzy logic networks to process a set of input data. The neurons, which operate as nodes, process the input data based on rules provided by the fuzzy logic model. The inputs and outputs suggest the rules to be used by the neuro-fuzzy system for each action that occurs.

2.2.1. ANFIS Architecture

A typical ANFIS network displaying two inputs with five layers is shown in Figure 2. The rules generated and implemented by this model are utilized to produce accurate results and are important for achieving optimal efficiency in future data [18].
The fuzzy section in Figure 2 comprises the first-order Sugeno inference system. The linear regression output variables f 1 and f 2 are obtained by executing several fuzzy rules to fuzzy sets of input variables [19,20].
R u l e   1 : I f   x   i s   A 1   a n d   y   i s   B 1 t h e n   f 1 = p 1 x + q 1 y + r 1 ,
R u l e   2 : I f   x   i s   A 2   a n d   y   i s   B 2 t h e n   f 2 = p 2 x + q 2 y + r 2 ,
where A 1 , A 2 , B 1 , and B 2 are the non-linear parameters, and p 1 , p 2 , q 1 , q 2 , r 1 , and r 2 are the linear parameters.
The layers of the ANFIS model are explained as follows:
1.
Layer 1: Membership Function or Fuzzy layer
The first layer consists of a certain number of nodes i . The outputs, which are reliant on the number of inputs for a set of linguistic labels A i and B i , are given as follows:
O 1 , i =   μ A i x ,                                   f o r   i = 1 ,   2 ,
O 1 , i 2 = μ B i 2 y ,                   f o r   i = 3 ,   4 ,
where x and y are the inputs and μ A i x and μ B i 2 y are the membership functions of the linguistic labels, given as A 1 , A 2 , B 1 , and B 2 . For instance, the membership function for A is given as follows:
μ A i x =   0 ;                           x a i   o r   x c i x a i b i a i ;                     a i x b i b i x c i b i ;                         b i x c i .
2.
Layer 2: Rules Layer
The second layer of the ANFIS architecture consists of fixed nodes, and its output is computed using:
O 2 , i = w i =   μ A i x   × μ B i y ,                                   w h e r e   i = 1 ,   2 .
3.
Layer 3: Normalized Firing Strength (NFS)
Each node in the third layer is also fixed. The normalized firing strength is the ratio of the output i from layer 2 to the total output from layer 2, and it is given as follows:
O 3 , i =   w i ¯ =   w i w 1 + w 2                               w h e r e   i = 1 ,   2 .
4.
Layer 4: Defuzzification
The weight obtained from the previous layer and the linear regression function of order one gives the output of this layer, which is given as follows:
O 4 , i =   w i ¯ f i =   w i ¯ p i x + q i y + r i .
5.
Layer 5: Addition Layer
The output of layer 5 is given as:
O 5 , i = i = 1 N w i ¯ f i =   i = 1 N w i f i i = 1 N w i .

2.2.2. Hybrid Learning Algorithm

In 1993, Jang introduced the hybrid learning algorithm used to train the ANFIS model by effectively calibrating its parameters. This is achieved by integrating both least square estimation and gradient descent, making it suitable for situations with both linear and non-linear properties. This allows for the management of non-linear properties in the first layer and the generation of a linear relationship in the fourth layer through the dual method [21]. As a result, each rule produces a linear function of the input variables, as shown in Equations (1) and (2). By applying gradient descent, the error estimate is given as:
E p = m = 1 L T m , p O m , p L 2 ,
where T m , p represents the m t h component of the p t h target and O m , p L is the m t h component of the actual output vector. The total error estimate obtained by the sum of squared errors is:
E =   p = 1 P E p .

3. Site, Data, and Methods

3.1. Site Study and Meteorological Data

The study sites cover six selected areas, across six different provinces in South Africa, which are Durban (DBN), Cape Town (CPT), Mthatha (MTA), Bloemfontein (BFT), Johannesburg (JHB), and Mahikeng (MHK). These areas comprise two coastline locations, one near-coastline location, and three inland locations. The map of South Africa showing the study sites is displayed in Figure 3. South Africa, which has many climatic classifications, is categorized as having a sub-tropical climate. In this study, one-year data were collected from both the National Centre for Environmental Information (NCEI) and the South African Weather Services (SAWS). These atmospheric data span from 1 January 2013 to 31 December 2013 and include hourly data of relative humidity (RH), temperature (T), wind speed (WS), pressure (P), and visibility (V) for the regions under consideration.

3.2. ANN Approach to Predict Visibility Range

3.2.1. Feed-Forward Neural Network

The Feed-Forward Neural Network (FFNN) can be described as a kind of multi-layer ANN that can be implemented to learn the association that occurs between the available inputs and the output. Just as its name implies, this type of network does not have any loop existing within its system [17,23,24,25,26]. Considering the input ( X ) and output ( y ) , this network estimates its output in terms of a function approximation, such that y = f ( X ; μ ) , and to obtain the perfect function approximation, it learns the values of μ [27].
In this type of multi-layer neural network (shown in Figure 4), there are three main layers (i.e., input, intermediate or hidden, and output layers), which are the nodes connected by a certain amount of network neurons. These neurons perform the processing and then calculate the output based on the established connection existing between the three layers and the weights w , as well as the biases ( b ) . As shown in Figure 4, the input variables are RH, T, P, and WS, while the output variable is visibility ( V ) .
The ANN architecture for visibility parameterization will be achieved using MATLAB version R2023a, whereby 70% of the entire data will be used for training, 15% for testing, and the remaining 15% for validation of the network. Considering the entire one-year dataset of 8760 samples for each of the atmospheric variables, this division corresponds, respectively, to 6132, 1314, and 1314 samples of each atmospheric variable.
The Back Propagation algorithm was utilized in this work to give the coordination between the input layer data and the intermediate layer using necessary weight adjustments, as well as to reduce the estimation error [17,25]. The output (i.e., visibility) of the FFNN is given as follows [1,16]:
V =   j = 1 N h w j o t a n h   i = 1 N i + 1 w i j h x i +   w N h + 1 o ,
where w i j h and w j o are the weights associating the input unit to the intermediate unit and intermediate unit to the output unit, respectively, i and j represent input and iteration parameters, respectively, and h , N h , and o symbolize the hidden layer, number of hidden neurons, and the output layer, respectively.
The ANN structure presented in Figure 4 provides the proposed FF-BPNN system used in this work to determine the visibility based on several independent variables with different weights attached to them.

3.2.2. Selection of ANN Model Parameters

The number of neurons in the input and output is dependent on the number of input and output variables. The choice of the number of hidden neurons ( N h ) is an important factor that is solely based on achieving acceptable performance during dataset validation. The desired N h produces the necessary iterations to determine the relationship that exists between the various input variables and visibility. One of the major aims of this work is to compute the dependence of V on the input variables rather than to differentiate between different ANN models or select the best ANN training function among many. Therefore, due to its fast convergence features, the Gauss–Newton-based Levenberg–Marquardt training method was used in this work. The other model parameters chosen for this study are presented in Table 1.

3.3. Accuracy Measurement Methods

To determine the accuracy level of the output of the SST, ANN, and ANFIS models, some metrics were utilized, including R 2 , RMSE, and MBE. These metrics can be estimated using the following expressions [28,29,30,31,32]:
R 2 = 1 i = 1 N V m , i V f , i 2 i = 1 N V m , i V m , i ¯ 2 ,
R M S E = 1 N i = 1 N V f , i V m , i 2 ,
M B E = 1 N i = 1 N V f , i V m , i ,
where V f , i and V m , i are, respectively, the i t h visibility prediction and observation, V m , i ¯ is the mean of the measured visibility data, and N represents the number of samples. The acceptable range for R 2 is between 0 and 1 , where unity indicates a perfect fit or strong functional relationship [28,29]. The RMSE and MBE determine the errors, such that the RMSE deals with the squared deviations and the MBE deals with over-estimation or under-estimation [32].

4. Results and Discussion

In this section, we first analyzed the functional dependence of visibility on several meteorological variables that were taken as independent observables using standard statistical techniques. This analysis was then followed by the ANN and ANFIS approaches. Under each subsection, the analysis for each of the six South African cities (Durban, Cape Town, Mthatha, Bloemfontein, Johannesburg, and Mahikeng) will be discussed in that order.

4.1. Standard Statistical Techniques

The SST—specifically, the SVRA—was used to determine any apparent relationship between V and all considered meteorological variables. This was accompanied by regressing the visibility phenomenon against the various meteorological variables. Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10 show the non-linear relationships between various atmospheric parameters and visibility using the SST throughout the year 2013 for all cities considered in this study. The error results of these non-linear regression fits, in terms of R 2 , are presented in Table 2.
Figure 5 presents the functional relationship between visibility and the four meteorological parameters, RH, T, P, and WS, throughout the year 2013 for the city of Durban. A visual inspection of these plots for Durban from the SST results proposes that visibility is linearly independent from all meteorological variables, indicating a lack of linear correlation between them after comparing various regression analyses. Therefore, the various non-linear fits provided in each category under Figure 5 show the relationship between visibility and all the atmospheric variables considered. Based on the R 2 estimated from the various non-linear regression fits, the results showed that the visibility variability explained by RH, T, P, and WS are approximately 30.2 % ,   11.7 % ,   3.6 % , and 10.9 % respectively. Figure 5a shows the highest R 2 value, indicating a relatively stronger correlation between V and RH compared to the others, while Figure 5b,d shows weaker dependence of V on T and V on WS, respectively. However, these values are not significant enough to be considered. This implies a lack of reasonable and substantial correlation between V and all these parameters, especially between V and P, which has the lowest R 2 value. The SST results of the non-linear regression fits for Cape Town during the year 2013, presented in Figure 6, show that the R 2 values of the visibility variability due to the meteorological variables are approximately 36.2 % ,   15.2 % ,   2.3 % , and 15.1 % for RH, T, P, and WS respectively. Similarly, these percentages reveal that the correlation between V and RH (as shown in Figure 6a) has the highest R 2 value, while the dependence of V on both T and WS (Figure 6b,d), respectively, have weak correlations, with very close R 2 values. The correlation between V and P has the lowest R 2 value, and it is quite insignificant amongst all the error values. The non-linear regression fit results of the SST for Mthatha, presented in Figure 6, show that the R 2 values of the visibility variability due to the meteorological variables are approximately 34.8 % ,   12.1 % ,   5.7 % , and 9.4 % for RH, T, P, and WS, respectively. Likewise, it can be observed in terms of these percentages that the non-linear relationship between V and RH (Figure 7a) is the strongest, followed by the visibility variability due to T and WS (as shown in Figure 7b,d), respectively. The R 2 values obtained from the reliance of V on both T and WS individually showed weak interactions, while the lowest correlation was between V and P. The results of the non-linear regression fits for Bloemfontein for the same year, presented in Figure 8, show that the R 2 values of the visibility variability explained by the various meteorological variables are approximately 25.1 % ,   13.1 % ,   3.5 % , and 17.9 % for RH, T, P, and WS, respectively. According to these percentages, Figure 8a,d show relatively higher R 2 values, which indicates relatively strong correlations between V and RH as well as V and WS, respectively. However, these values are not significant enough to be considered. On the other hand, the relationship between V and T shows weaker correlations, and that of V and P exhibits a very weak or insignificant correlation when compared to the other percentage R 2 values. Figure 9a–d present the functional relationship between visibility and all the atmospheric variables for the city of Johannesburg during the year 2013. The R 2 estimated from the various non-linear regression fits in Figure 9 are approximately 30.7 % ,   12.2 % ,   4.9 % , and 11.2 % , which correspond to the visibility variability explained by RH, T, P, and WS, respectively. The R 2 estimates provided in Table 2 show that the strongest non-linear correlation is between V and RH, while T and WS showed relatively weaker and close relationships with V . The results of the non-linear regression fits for Mahikeng, given in Figure 10, show that the R 2 values of the visibility variability described by the various meteorological variables are approximately 25.7 % ,   13.1 % ,   6.6 % , and 13.9 % for RH, T, P, and WS respectively. Based on these percentages, Figure 10a shows the highest R 2 value, suggesting a relatively strong correlation between V and RH. While the dependence of V on WS is slightly stronger than that of V on T, both functions show weak correlations.
According to the statistical results of the performances of the SVRA presented in Table 2 for diverse meteorological parameters, it was observed across all the cities that there is a relatively strong parametric relationship between visibility and relative humidity and a fairly significant relationship between visibility and both temperature and wind speed individually. The variability in visibility in terms of R 2 explained by relative humidity ranges from 25 % 36 % , and that of wind speed, temperature, and pressure ranges from 9 % 18 % , 12 % 15 % , and 2 % 7 % , respectively, over all the sites considered. In summary, the average visibility variability explained by RH, WS, T, and P for all the cities is 0.3043, 0.1306, 0.1283, and 0.0443, respectively. These results suggest that among the four atmospheric variables considered in this study, the functional dependence of visibility in South Africa is strongest on relative humidity and weakest on atmospheric pressure. The mean visibility variability described by both temperature and wind speed suggests that they contribute partially and almost equally to changes in visibility in these locations. The strong relationship between relative humidity (RH) and visibility is physically grounded in atmospheric processes. As RH increases, the air approaches saturation, leading to condensation of water vapor into tiny droplets that form haze, mist, or fog, key contributors to reduced visibility. Additionally, high RH enhances the hygroscopic growth of aerosol particles, increasing their size and scattering efficiency, which further diminishes horizontal visibility. This explains why RH consistently emerges as the dominant variable influencing visibility across all locations in this study. Even though atmospheric pressure displays a weak direct relationship, this does not mean pressure has no important role in visibility variation. Pressure frequently regulates certain systems with low- and high-pressure trends that in turn regulate humidity, wind power, and wind direction. Small changes in pressure can indirectly impact the formation of fog by changing wind fields and stability profiles in areas where local topography intensifies pressure gradients. Thus, in this case, pressure may act as an unseen contributor to the other directly pertinent elements. Nevertheless, its local variability demands further high-resolution studies. Therefore, in general, this technique has established the functional dependence of visibility on atmospheric variables. However, the outcomes showed a lack of strong correlation between them using the SST approach.

4.2. Artificial Neural Network Simulation and Results

In this section, the ANN approach was implemented for visibility prediction using various meteorological parameters. Based on the universal performance of the ANN model, it can be concluded that the proposed model is capable of providing accurate forecasts for random variables. Therefore, it is likely to achieve reasonable accuracy in estimating visibility and other weather parameters using the ANN.
In this study, four meteorological variables (i.e., the input variables) were grouped into five input cases for data training and visibility prediction, as shown in Table 3. These input cases include one or more input parameters. Input Case-A (with RH as the sole input parameter) is considered the base case, while Input Case-E, a combination of all input variables, is considered as the saturated case. Relative humidity was selected as the base case for the ANN input configurations based on the findings of the SST analysis, which identified RH as the most significant meteorological parameter for predicting visibility. Wind speed (WS) and temperature (T) were considered the next significant parameters, with very close overall mean error values, while pressure (P) was deemed the least significant parameter in visibility prediction. Therefore, WS and T were labeled as secondary input parameters.
For each location, independent meteorological time series datasets were loaded as input time series according to the input cases, with visibility time series loaded as the target time series. The ANN model’s activation was based on a uniformly random distribution method, where a portion of the input dataset was assigned to training, validation, and testing processes. The data sharing method used for dividing input data from various locations was as follows: 70% for training, 15% for validation, and 15% for testing. This division was applied to each meteorological dataset within each input case.

4.2.1. ANN Simulation Procedure

The ANN model simulations were achieved through use of the Neural Network (NN) non-linear time series application tool provided in MATLAB by MathWorks, Natick, MA, USA. To perform a non-linear test on the visibility time series, the non-linear autoregressive network with external input, which uses dynamic neural networks, was selected, among other options. In other words, the Nonlinear Autoregressive Exogenous network (NARX), a non-linear neural network technique, was employed to accurately predict the visibility range.
The initial step during the simulation process and case evaluation was accomplished by analyzing the performance of the ANN model when N h is gradually increased from 2 to 40 and when the input case containing all four input variables are considered for each of the cities. These analyses revealed that at N h 12 for Cape Town and at N h 15 for the other stations, each of the ANN ensembles for the corresponding cities performed quite close to one another, with similar values of R 2 , RMSE, and MAE, respectively. For instance, the ANN ensembles at N h 15 for Durban showed that each ANN simulation for different values of N h had R 2 , RMSE, and MBE to be approximately 67 % , 32 % , and 0.22 , respectively, and for Cape Town at N h 12 , the values of R 2 , RMSE, and MBE are approximately 71 % , 29 % , and 0.15 , respectively. In general, as the value of N h is decreasing, a slight reduction was gradually noticed in the values of R 2 and an increase was observed in the RMSE values. Thus, according to the results obtained from these analyses, it is imperative to set the optimum value of N h to 15, which indicates the common converging point where the performances of all the ANN models for the respective cities become similar. The number of hidden neurons ( N h ) is crucial for balancing model complexity and generalization: too few neurons limit the network’s ability to learn non-linear patterns (underfitting), while too many can capture noise (overfitting). We optimized N h via cross-validation, selecting the value that maximized R2 on the validation set without inflating RMSE, thereby ensuring the ANN achieves its best predictive accuracy. Also, in order to accomplish the simulation and computation of each ANN architecture, the functions and parameters presented in Table 1 were used.
The last task in the ANN investigation is to choose the best input case based on accuracy and error estimations and analyze the importance of atmospheric variables to the predicted target values. Forward adding, which involves finding the functional dependence of visibility on the base case (RH) before adding other parameters, was used to build different input cases. These input cases were loaded into the Neural Network time series application one after another for each location considered, and the results of each ANN architecture were recorded.
Ten trainings were performed for each ANN architecture with different numbers of inputs i and a fixed number of hidden layers ( h = 2 ) and hidden neurons ( N h = 15 ) to avoid local minima problems resulting from random weight initializations. By averaging the statistical indicators before comparisons, the accuracy of predicted results is increased, and predicted values are smoothed out. The statistical indicators presented in this analysis are the mean of all 10 accuracy or error statistics observed by R 2 , RMSE, and MBE, along with the SD of the averaged RMSE and MBE values.
The analysis of the ANN architecture is considered complete when hidden units are fixed to an optimal value and input cases are simulated, analyzed, and compared for each location. The best input configuration for predicting V can be chosen after analyzing all input configurations to determine the relationship between V and atmospheric variables.
The Student’s t-interval for a mean was used to determine the t-based CI for a mean of several ANN trainings. In this case, there are 10 R2 values (one per training); therefore, it is important to compute the 95% CI on the mean R2  ( R 2 ¯ ) using [28,30]:
95 %   C I = R 2 ¯   ±   t 0.975 , d f = 9   × s n ,
where s is the sample standard deviation across 10 trainings, n is the number of trainings (such that n = 10 ), the standard error (SE) of the mean of n independent estimates is given as S E =   s n , and t 0.975 , d f = 9 denotes the t-critical statistic at a degree of freedom ( d f = n 1 = 9 ) .
The statistical significance in terms of p-value for each mean R2 value was obtained by testing each R 2 ¯ against zero through the t-test, given as [28,31]:
t = R 2 ¯   N 2 1 R 2 ¯ 2 ,
where N is the number of samples. Therefore, with N 2 degrees of freedom, the two-tailed p-value can be computed using [28,31]:
p = 2 1 F t N 2 t ,
where F t N 2 represents the Cumulative Distribution Function (CDF) of the t-distribution with N 2 degrees of freedom.

4.2.2. ANN Prediction Results

The minimum and maximum R2 values for each of the input configurations for each of the locations are presented in Table 3. The mean R2 values, Confidence Intervals (CI), and the statistical significance (p-value) of the 10 different trainings for each input case are also recorded in the table.
Table 3. Confidence intervals and statistical significance of the performances of the ANN models for different input configurations.
Table 3. Confidence intervals and statistical significance of the performances of the ANN models for different input configurations.
CityInput CasesInput Variables Minimum   R 2 Maximum   R 2 Mean   R 2 Approx. 95% CI for Mean R2p-Value
DBNCase—ARH0.47710.52910.5156[0.5063, 0.5249]<0.001
Case—BRH, WS0.57940.61220.6065[0.6006, 0.6124]<0.001
Case—CRH, T, P0.57970.63010.6219[0.6129, 0.6309]<0.001
Case—DRH, WS, P0.59880.66440.6450[0.6333, 0.6567]<0.001
Case—ERH, T, P, WS (All Variables)0.63880.67250.6701[0.6641, 0.6761]<0.001
CPTCase—ARH0.49110.54010.5368[0.5280, 0.5456]<0.001
Case—BRH, WS0.60420.64980.6335[0.6254, 0.6416]<0.001
Case—CRH, T, P0.61550.67340.6573[0.6469, 0.6677]<0.001
Case—DRH, WS, P0.62130.68110.6760[0.6653, 0.6867]<0.001
Case—ERH, T, P, WS (All Variables)0.67020.71850.7002[0.6916, 0.7089]<0.001
MTACase—ARH0.47720.52970.5191[0.5097, 0.5285]<0.001
Case—BRH, WS0.55990.60330.5964[0.5886, 0.6042]<0.001
Case—CRH, T, P0.57190.62880.6178[0.6076, 0.6280]<0.001
Case—DRH, WS, P0.60770.65120.6384[0.6306, 0.6462]<0.001
Case—ERH, T, P, WS (All Variables)0.62790.68530.6612[0.6509, 0.6715]<0.001
BFTCase—ARH0.45880.50960.4955[0.4864, 0.5046]<0.001
Case—BRH, WS0.55890.59740.5792[0.5723, 0.5861]<0.001
Case—CRH, T, P0.56120.60600.5912[0.5832, 0.5992]<0.001
Case—DRH, WS, P0.56770.61950.6001[0.5908, 0.6094]<0.001
Case—ERH, T, P, WS (All Variables)0.60230.65410.6322[0.6229, 0.6415]<0.001
JHBCase—ARH0.47110.51890.5073[0.4988, 0.5158]<0.001
Case—BRH, WS0.55870.60010.5899[0.5825, 0.5973]<0.001
Case—CRH, T, P0.57010.62160.6132[0.6040, 0.6224]<0.001
Case—DRH, WS, P0.61650.64120.6221[0.6177, 0.6265]<0.001
Case—ERH, T, P, WS (All Variables)0.62750.68130.6658[0.6562, 0.6754]<0.001
MHKCase—ARH0.44850.49110.4798[0.4722, 0.4874]<0.001
Case—BRH, WS0.55560.59450.5714[0.5645, 0.5783]<0.001
Case—CRH, T, P0.56830.61010.5955[0.5881, 0.6029]<0.001
Case—DRH, WS, P0.57030.62350.6011[0.5916, 0.6106]<0.001
Case—ERH, T, P, WS (All Variables)0.61020.65830.6235[0.6149, 0.6321]<0.001
For each site, the mean R2 across 10 independent ANN trainings for each input case were computed alongside the sample standard deviation. For every input case, the 95% confidence intervals on R 2 ¯   using the t-distribution ( d f = 9 , t 0.975 , d f = 9 = 2.262 ) were derived and the statistical significance was confirmed via two-tailed tests with N 2 degrees of freedom, yielding an all p < 0.001 . A two-tailed t-test against zero with N 2 degrees of freedom confirmed all correlations are highly significant ( p < 0.001 ) . The narrow intervals and uniformly low p-values reinforce the robustness of our correlation findings, from single-variable (RH-only) to full-variable (RH + T + P + WS) cases, across all six South African cities.
Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16 display the regression plots for neural network training, validation, and testing, applying 70%, 15%, and 15% of the atmospheric datasets for the entire year of 2013 across all sites in the study. However, Figure 11d, Figure 12d, Figure 13d, Figure 14d, Figure 15d, Figure 16d show regression results for visibility prediction at each station using all atmospheric data. The R values and model equations are provided for each station in these plots, illustrating the relationship between predicted and measured visibility. Each regression plot includes scatter and linear plots of predicted visibility versus measured visibility, with data points distributed around the average line given by the linear fit and centered on the 45-degree line of perfect fit. Colored lines in the plots represent results from network training, validation, testing, and overall forecasting, with equations indicating the determined results, while dotted lines show expected visibility outcomes. To streamline the presentation, only regression plots for Case-E, where all input variables were used to determine visibility dependence, are shown. These plots highlight the periods where the highest R 2 values were achieved among the 10 different trainings conducted for each Case-E input.
After the network training and estimation of visibility, the outcomes of the proposed ANN models (illustrated in Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16) attained satisfactory reliabilities of about 83.60 % , 86.83 % , 84.38 % , 80.81 % , 84.33 % , and 80.30 % with 70 % of the atmospheric data for DBN, CPT, MTA, BFT, JHB, and Mahikeng, respectively. When using 15 % of the atmospheric data for the neural network validation, the proposed neural network achieved a satisfactory reliability of about 81.03 % , 82.45 % , 80.13 % , 80.88 % , 79.83 % , and 81.65 % for DBN, CPT, MTA, BFT, JHB, and MHK, respectively. Although a perfect or better ANN model architecture should have reliability higher than the values recorded in this work, the reliability results obtained are above 50 % and better than that obtained using SST. Therefore, it is assumed that the outcomes are quite satisfactory.
The results presented in Table 4 are the mean of all the error estimations after 10 different trainings for each input case. It should be noted that these results were achieved when all the data for the entire year under study were used to determine visibility. The mean R 2 showed that the accuracy increases from the base case to when all the variables were integrated. The percentage Mean RMSE was provided along with standard deviation values. It can be observed that the Mean RMSE was quite lower for all the input Case-E for all the locations. However, their standard deviations were observed to be the highest among all the input cases. The Mean Bias Error (MBE) was calculated to compare measured and estimated visibility over a one-year period of meteorological data. The MBE, which assesses the accuracy of predicted visibility, determines if data from the ANN model is over- or under-estimated. While not typically used as a model error measure, it can help evaluate bias in the ANN model and identify necessary corrections. The focus is on how much the model over- or under-estimates visibility rather than the MBE magnitude. MBE values from the ANN model for all input cases are shown in Table 4. For Case-E, bias values (in %) were 0.221 ± 0.121 , 0.152 ± 0.103 , 0.254 ± 0.251 , 0.304 ± 0.351 , 0.350 ± 0.253 , and 0.323 ± 0.262 for DBN, CPT, MTA, BFT, JHB, and MHK, respectively. Results in Table 4 indicate that the generated ANN models did not over-estimate visibility, as MBE values were below zero. Note that the systematic negative MBEs (presented in Table 4) reflect a mild under-prediction bias in our ANN. This underestimation likely arises from the network’s regularization and loss-minimization strategy, which penalizes large errors and thus pulls extreme forecasts toward the mean. While conservative estimates can be preferable in safety-critical forecasting, future work could apply a simple bias-correction layer or adjust the loss function (e.g., using asymmetric penalties) to further reduce this residual under-estimation.
Comparing SST and ANN methods for predicting visibility based on RH data alone, the ANN model outperformed SST. Mean coefficients of determination when using only RH datasets to predict visibility improved from 0.3017 to 0.5156 , 0.3618 to 0.5368 , 0.3475 to 0.5191 , 0.2513 to 0.4955 , 0.3072 to 0.5073 , and 0.2565 to 0.4798 for DBN, CPT, MTA, BFT, JHB, and MHK, respectively. Despite using only RH for forecasting visibility, the developed ANN model showed improvement over the SST approach. This demonstrates that the ANN approach establishes a stronger relationship between meteorological variables and visibility for the cities analyzed. The ANN’s superior performance over single-variable regression analysis (SVRA/SST) stems from its capacity to model complex, non-linear interactions and to integrate multiple atmospheric inputs simultaneously. Unlike SST, which fits only a straight-line relationship between visibility and one predictor, the ANN’s multi-layer architecture captures synergistic effects among RH, WS, T, and P, transforming subtle joint dependencies into improved predictive skill.
Using the meteorological data that consist of RH, T, WS, and P for each of the locations considered, the generated ANN model equation for the visibility range in km is given as follows:
V p = α V m + β ,
where V p denotes predicted visibility, V m represents measured visibility, and α and β are parametric constants, which are a function of the four meteorological variables (i.e., RH, T, WS, and P) considered in this study. The model parameters for each of the locations, obtained from the regression plots in Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16 are shown in Table 5.
Figure 17a–f show time series plots that illustrate daily evolution of both predicted and measured visibilities throughout the year 2013 for DBN, CPT, MTA, BFT, JHB, and MHK, respectively. The results indicate that the proposed Artificial Neural Network (ANN) models were able to accurately replicate the daily visibility fluctuations for each city. These time series profiles were generated using the ANN model equation and the specific α and β values (both in kilometers) for each city. It is important to note that the estimated visibility in these plots is based on all meteorological variables.

4.3. ANFIS Simulation and Results

4.3.1. ANFIS Simulation Procedure and Analysis

In this section, the input and output variables are trained using the ANFIS network for each of the sites independently. This network is designed to predict the visibility range by employing the datasets with inputs and output from previous records. As earlier mentioned in this study, visibility prediction is a non-linear system. Therefore, the ANFIS configuration is appropriate for making this prediction through training, testing, and checking the algorithm with various functions. The input data are manually inserted into the ANFIS algorithm, allotting about 70% and 30% of the input data for training and testing, respectively. Afterwards, the ANFIS model is monitored with various FIS algorithms, as well as with certain error tolerance and epochs in order to arrive at the best prediction. This ANFIS model is aimed at reducing the errors between the actual and the forecasted output. Figure 18 illustrates the block diagram of the ANFIS input–output configuration using the Sugeno machine, which serves as the type of FIS that incorporates the input parameters to produce the output. This network depicts a four-input and one-output ANFIS system where visibility is predicted from T, RH, P, and WS.
Figure 19 shows the proposed ANFIS model architecture used in this study. This ANFIS design incorporates four input criteria for each of the four input variables, which is referred to as the ‘4444’ FIS configuration. Consequently, there are 256 rules governing the operation of this ANFIS algorithm, as illustrated in Figure 19. Throughout this research, it was determined that an error tolerance of 0.001 and 100 epochs were suitable for training the preferred ANFIS model. The ANFIS architecture employs gaussmf as the input mf type and a linear function as the output mf type, detailed in Table 6.

4.3.2. ANFIS Prediction Results

The ANFIS network, as shown in Figure 18, was applied to each of the selected locations using the hybrid architecture in Figure 19. Daily average atmospheric data from the year 2013 were used in this section of the work to predict corresponding visibility data. Time series charts were created by plotting both the training data and FIS output for each city studied in this work against the time index, as shown in Figure 20a–f. This figure displays only 70% of the input data allocated for training, equivalent to the first 256 days within the whole period considered. It was observed that after training, there was a high relationship between the two datasets suggesting very low RMSE value. The estimated training RMSE value for the ANFIS network is shown in Table 6.
The training RMSE of the proposed ANFIS model for DBN, CPT, MTA, BFT, JHB, and MHK with four inputs and one output were obtained as 0.0262, 0.0132, 0.0139, 0.0097, 0.0182, and 0.0068, respectively. These values were obtained after completing the training, testing, and checking processes involved in designing the proposed ANFIS model. The low RMSE values suggest that the ANFIS model is ideal for predicting visibility.
Approximately 30% of the total study period was dedicated to testing the proposed ANFIS model to determine if it was suitable for predicting visibility in this sub-tropical region. Figure 21a–f displays the regression plots of ANFIS-predicted visibility compared to measured visibility data using the allocated testing data. The estimated R 2 values for the model testing from the figures are 0.8958, 0.9282, 0.9134, 0.9293, 0.8993, and 0.9301 for DBN, CPT, MTA, BFT, JHB, and MHK, respectively. Therefore, based on the results of the testing data, it was observed that there is a high correlation between the measured and predicted results.
Table 7 illustrates the R 2 values obtained for each location using ANN and ANFIS models after training, testing, and validation. The table shows that the ANFIS model demonstrates the most effective training results, with improvement values of 31.54%, 26.76%, 30.33%, 35.43%, 30.57%, and 36.67% for DBN, CPT, MTA, BFT, JHB, and MHK, respectively. Additionally, the testing results showed that the ANFIS model performed better in forecasting visibility. However, both models still exhibited some inadequacies in accurately predicting visibility in this region. This study suggests that there are additional atmospheric parameters that need to be considered in order to achieve a perfect relationship or estimation.
Figure 22a–f illustrates time series prediction results comparing measured and predicted data by both the ANN and ANFIS models for all locations. The actual and predicted data in this case are the testing data, which represent 30% of the entire study year. Thus, the time index on the x-axis spans from the 257th day to the 365th day of the study year. These results demonstrate differences between the measured and predicted data utilizing the two AI models. The outcomes show better estimation using the hybrid algorithm over the ANN algorithm for all the locations using the testing data. The ANFIS model showed more accuracy over ANN in predicting visibility. Table 8 shows the RMSE of the results predicted by both the ANN and ANFIS algorithms. The testing RMSE for the ANFIS algorithm was quite lower than that of the ANN algorithm for all cities considered. For example, the RMSE of Durban using the hybrid algorithm was as low as 0.1172 compared to a higher RMSE of about 1.4051 by the ANN model during testing. Therefore, a similar improvement was consistently observed for all other cities considered.
Using the SVRA, similar trends were observed across cities from coastal to inland, with relative humidity consistently showing the strongest correlation with visibility and assigning only a minor role to atmospheric pressure. Despite the geographic and climatic differences among our six South African sites, the two modeling approaches (ANN and hybrid ANFIS) yield remarkably consistent patterns. Finally, the application of ANFIS boosted the combined R2 further. Crucially, this progression from SVRA through ANN to ANFIS, and the gains seen when moving from single- to multi-variable inputs, were uniformly observed across all six South African sites (coastal Durban and Cape Town, near-coastal Mthatha, and inland Bloemfontein, Johannesburg, and Mahikeng), demonstrating a smooth, humidity-anchored mechanism that each method and location share. Coastal sites experience stronger sea-breeze cycles and aerosol–humidity interactions, whereas inland sites are more meteorologically homogeneous. This concordance across locations and techniques highlights the robustness of our findings and justifies combining insights for regional visibility forecasting.

5. Conclusions

Visibility is one of the most significant natural phenomena that is crucial for flight operations and wireless communications. Due to fluctuating climatic conditions, it is important to monitor visibility conditions in order to make the most of the outcomes. To monitor visibility, a standard and precise model needs to be generated for prediction. Predicting visibility is essential to provide information to personnel for safe flight operations and optimal wireless link performance. To address issues related to low visibility, a system to forecast visibility is important, especially utilizing AI such as neural and neural-fuzzy networks. This study employed standard statistical techniques, back propagation (ANN), and hybrid (ANFIS) algorithms to predict visibility from specific atmospheric variables.
The results from the SST were not accurate in predicting visibility, indicating the need for more advanced techniques like an AI model. Among the four inputs, only RH showed a fairly strong relationship with visibility according to the SST, which is why it was chosen as the base case (Case-A) for the ANN analysis. The statistical error analysis from the ANN model revealed increased prediction accuracy when WS, T, and P were added to the base case. A comparison of results in Table 2 and Table 3 showed that the ANN technique outperformed the SST method. The proposed ANN models achieved an average reliability of about 68.44%, 73.23%, 69.66%, 64.56%, 69.41%, and 63.32% for Durban, Cape Town, Mthatha, Bloemfontein, Johannesburg, and Mahikeng, respectively. The hybrid ANFIS model demonstrated even higher reliability, with improvements ranging from 22.34% to 32.70% over the ANN models.
The aim of this study was not to determine the best ANN or ANFIS architecture but to identify the best combination and level of correlation between meteorological variables and visibility. The analysis revealed that the relationship between visibility and RH was the strongest, followed by WS and T, with P showing the lowest correlation. Despite the improved performance of the ANFIS model, it showed a slight shortfall in the prediction, indicating further room for improvement by including other atmospheric components like solar radiation and gas-related variables.
In conclusion, visibility prediction using atmospheric variables was analyzed, considering time series analysis and visibility prediction for different cities. The ANFIS model was chosen for its effective prediction investigation capabilities, leading to accurate visibility predictions with minimal error compared to ANN and SST models. These results show that the neuro-fuzzy technique is an effective algorithm for prediction investigation. Looking ahead, integrating our ANN/ANFIS framework with real-time observational streams (e.g., automated weather stations or satellite-derived humidity products) could enable continuous visibility nowcasting. Deploying such a system in an operational setting would test its robustness under live data feeds and support timely decision-making for wireless communication systems, aviation, and road-traffic safety.

Author Contributions

Conceptualization, O.A.L. and P.A.O.; methodology, O.A.L.; software, O.A.L. and P.A.O.; validation, O.A.L. and P.A.O.; formal analysis, O.A.L. and J.S.O.; investigation, O.A.L.; resources, P.A.O.; data curation, O.A.L.; writing—original draft preparation, O.A.L.; writing—review and editing, O.A.L., P.A.O., and J.S.O.; visualization, O.A.L. and J.S.O.; supervision, P.A.O. and J.S.O.; project administration, P.A.O.; funding acquisition, O.A.L. and P.A.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The two different sets of data used for this study are openly available in National Centers for Environmental Information (NCEI) at https://www.ncei.noaa.gov/access/search/data-search/global-hourly?dataTypes=VIS&stations=68592099999&pageNum=1 (accessed on 11 March 2018) and South African Weather Service (SAWS) at https://www.weathersa.co.za/ (accessed on 15 March 2018).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Lopez, G.; Bosch, J.L.; Pulido-Calco, I.; Gueymard, C.A. Visibility Estimates from Atmospheric and Radiometric Variables using Artificial Neural Networks. WIT Trans. Ecol. Environ. 2017, 211, 129–136. [Google Scholar]
  2. Lundgren, D.A.; Cooper, D.W. Effect of Humidity on light-scattering methods of measuring particle concentration. J. Air Pollut. Control Assoc. 1969, 19, 243–247. [Google Scholar] [CrossRef]
  3. Usman, A.; Olaore, K.O.; Ismaila, G.S. Estimating Visibility using some Meteorological Data at Sokoto, Nigeria. Int. J. Basic Appl. Sci. 2013, 1, 810–815. [Google Scholar]
  4. Boudala, F.S.; Isaac, G.A. Parametrization of Visibility in Snow: Application in Numerical Weather Prediction Models. J. Geophys. Res. 2009, 114, D19202. [Google Scholar]
  5. Zhao, J.; Mo, H.; Deng, Y. An Efficient Network Method for Time Series Forecasting Based on the DC Algorithm and Visibility Relation. IEEE Access 2020, 8, 7598–7608. [Google Scholar] [CrossRef]
  6. Salman, A.G.; Kanigoro, B. Visibility Forecasting Using Autoregressive Integrated Moving Average (ARIMA) Models. Procedia Comput. Sci. 2021, 179, 252–259. [Google Scholar] [CrossRef]
  7. Colabone, R.O.; Ferrari, A.L.; Vecchia, F.A.S.; Tech, A.R.B. Application of Artificial Neural Network for Fog Forecast. J. Aerosp. Technol. Manag. 2015, 7, 240–246. [Google Scholar] [CrossRef]
  8. Alhashimi, S.A.M. Prediction of Monthly Rainfall in Kirkuk using Artificial Neural Network and Time Series Model. J. Eng. Dev. 2014, 18, 129–143. [Google Scholar]
  9. Shaikh, L.; Sawlani, K. A Rainfall Prediction Model using Artificial Neural Network. Int. J. Tech. Res. Appl. 2017, 5, 45–48. [Google Scholar]
  10. Kumarasiri, A.; Sonnadara, D. Rainfall Forecasting: An Artificial Neural Network Approach. In Proceedings of the Technical Sessions, Institute of Physics, Colombo, Sri Lanka, 24 March 2006. [Google Scholar]
  11. Mittal, P.; Chowdhury, S.; Roy, S.; Bhatia, N.; Strivastav, R. Dual Artificial Neural Network for Rainfall-Runoff Forecasting. J. Water Resour. Prot. 2012, 4, 1024–1028. [Google Scholar] [CrossRef]
  12. Nayak, D.R.; Mahapatra, A.; Mishra, P. A Survey on Rainfall Prediction using Artificial Neural Network. Int. J. Comput. Appl. 2013, 72, 32–40. [Google Scholar] [CrossRef]
  13. Karimi, S.; Shiri, J.; Kisi, O.; Shiri, O.O. Short-term and Long-term Streamflow Prediction by using ‘Wavelet-gene Expression’ Programming Approach. ISH J. Hydraul. Eng. 2016, 22, 148–162. [Google Scholar] [CrossRef]
  14. Chen, J.; Zhao, C.S.; Ma, N.; Liu, P.F.; Gobel, T.; Hallbauer, E.; Deng, Z.Z.; Ran, L.; Xu, W.Y.; Liang, Z.; et al. A Parameterization of Low Visibilities for Hazy Days in the North China Plain. Atmos. Chem. Phys. 2012, 12, 4935–4950. [Google Scholar] [CrossRef]
  15. Tafner, M.A.; Xerez, M.; Rodrigues, F.W. Redes Neurais Artificiais: Introdução e Princípios de Neurocomputação [Artificial Neural Networks: Introduction and Principles of Neurocomputing]; FURB: Blumenau, Brazil, 1996; ISBN 8571140502. [Google Scholar]
  16. Russel, S.J.; Norvig, P. Artificial Intelligence: A Modern Approach, 3rd ed.; Prentice Hall: Saddle River, NJ, USA, 2009; pp. 727–737. [Google Scholar]
  17. Dixon, M.F.; Halperin, I.; Bilokon, P. Feedforward Neural Networks. In Machine Learning in Finance; Springer: Cham, Switzerland, 2020; pp. 111–166. [Google Scholar]
  18. Wahyuni, I.; Mahmudy, W.F.; Iriany, A. Rainfall Prediction using Hybrid Adaptive Nuero Fuzzy Inference System (ANFIS) and Genetic Algorithm. J. Telecommun. Electron. Comput. Eng. 2017, 9, 51–56. [Google Scholar]
  19. Atmaca, H.; Cetisli, B.; Yavuz, A. The Comparison of Fuzzy Inference Systems and Neural Network Approaches with ANFIS for Fuel Consumption Data. In Proceedings of the Second International Conference on Electrical and Electronics Engineering Papers ELECO 2001, Bursa, Turkey, 7–11 November 2001. [Google Scholar]
  20. Cakmakci, M.; Kinaci, C.; Bayramoglu, M.; Yildirim, Y. A Modeling Approach for Iron Concentration in Sand Filtration Effluent using Adaptive Neuro-Fuzzy model. Expert Syst. Appl. 2010, 37, 1369–1373. [Google Scholar] [CrossRef]
  21. Faulina, R.; Suhartono, P. Hybrid ARIMA-ANFIS for Rainfall Prediction in Indonesia. Int. J. Sci. Res. 2013, 2, 159–162. [Google Scholar]
  22. Conradie, D.C.U. Designing for South African Climate and Weather. In The Green Building Handbook of South Africa, the Essential Guide; Green Building Media (Pty) Ltd.: Johannesburg, South Africa, 2012; Volume 4, pp. 181–195. [Google Scholar]
  23. Urso, A.; Fiannaca, A.; La Rosa, M.; Ravi, V.; Rizzo, R. Data Mining: Classification and Prediction. In Encyclopedia of Bioinformatics and Computational Biology; Academic Press: Cambridge, MA, USA; Elsevier: Amsterdam, The Netherlands, 2019; Volume 1, pp. 384–401. ISBN 9780128114322. [Google Scholar]
  24. Zeil, A. Simulation Neuronaler Netze [Simulation of Neural Networks], 1st ed.; Addison-Wesley: Boston, MA, USA, 1994; p. 73. ISBN 3-89319-554-8. (In Germany) [Google Scholar]
  25. Lavine, B.K.; Blank, T.R. Feed-Forward Neural Network. In Comprehensive Chemometrics; Elsevier: Amsterdam, The Netherlands, 2009; Volume 3, pp. 571–586. ISBN 9780444527011. [Google Scholar]
  26. Schmidhuber, J. Deep learning in neural networks: An Overview. Neural Netw. 2005, 61, 85–117. [Google Scholar] [CrossRef] [PubMed]
  27. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; pp. 164–172. [Google Scholar]
  28. Moore, D.S.; Notz, W.I.; Flinger, M.A. The Basic Practice of Statistics, 6th ed.; W. H. Freeman and Company: New York, NY, USA, 2013; p. 138. [Google Scholar]
  29. Henseler, J.; Ringle, C.; Sinkovics, R. The use of partial least squares path modelling in international marketing. Adv. Int. Mark. (AIM) 2009, 20, 277–320. [Google Scholar]
  30. Moore, D.S.; McCabe, G.P.; Craig, B.A. Introduction to the Practice of Statistics, 9th ed.; W. H. Freeman and Company: New York, NY, USA, 2017. [Google Scholar]
  31. Anderson, T.W. An Introduction to Multivariate Statistical Analysis, 2nd ed.; Wiley: New York, NY, USA, 1958; pp. 156–190. [Google Scholar]
  32. Pal, R. Predictive Modeling of Drug Sensitivity; Academic Press: Cambridge, MA, USA; Elsevier: Amsterdam, The Netherlands, 2017; pp. 83–107. ISBN 9780128052747. [Google Scholar]
Figure 1. Schematic diagram of a typical ANN model.
Figure 1. Schematic diagram of a typical ANN model.
Atmosphere 16 00928 g001
Figure 2. The basic two-input five-layer ANFIS architecture.
Figure 2. The basic two-input five-layer ANFIS architecture.
Atmosphere 16 00928 g002
Figure 3. Map of the climatic zones of South Africa, showing the study sites ([22]).
Figure 3. Map of the climatic zones of South Africa, showing the study sites ([22]).
Atmosphere 16 00928 g003
Figure 4. A typical multi-layer perceptron based on FFNN.
Figure 4. A typical multi-layer perceptron based on FFNN.
Atmosphere 16 00928 g004
Figure 5. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Durban.
Figure 5. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Durban.
Atmosphere 16 00928 g005
Figure 6. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Cape Town.
Figure 6. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Cape Town.
Atmosphere 16 00928 g006
Figure 7. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Mthatha.
Figure 7. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Mthatha.
Atmosphere 16 00928 g007
Figure 8. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Bloemfontein.
Figure 8. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Bloemfontein.
Atmosphere 16 00928 g008
Figure 9. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Johannesburg.
Figure 9. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Johannesburg.
Atmosphere 16 00928 g009
Figure 10. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Mahikeng.
Figure 10. Relationship between visibility and different atmospheric parameters: (a) RH; (b) T; (c) P; and (d) WS, using SST throughout the year 2013 for Mahikeng.
Atmosphere 16 00928 g010
Figure 11. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Durban.
Figure 11. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Durban.
Atmosphere 16 00928 g011
Figure 12. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Cape Town.
Figure 12. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Cape Town.
Atmosphere 16 00928 g012
Figure 13. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Mthatha.
Figure 13. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Mthatha.
Atmosphere 16 00928 g013
Figure 14. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Bloemfontein.
Figure 14. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Bloemfontein.
Atmosphere 16 00928 g014
Figure 15. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Johannesburg.
Figure 15. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Johannesburg.
Atmosphere 16 00928 g015
Figure 16. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Mahikeng.
Figure 16. ANN model regression plots for (a) training; (b) validation; (c) testing; and (d) all data throughout the year 2013 for Mahikeng.
Atmosphere 16 00928 g016
Figure 17. Time series analysis of the measured and estimated visibilities (in terms of daily mean) using the ANN model with all the atmospheric parameters throughout the year 2013 for (a) DBN; (b) CPT; (c) MTA; (d) BFT; (e) JHB; and (f) MHK.
Figure 17. Time series analysis of the measured and estimated visibilities (in terms of daily mean) using the ANN model with all the atmospheric parameters throughout the year 2013 for (a) DBN; (b) CPT; (c) MTA; (d) BFT; (e) JHB; and (f) MHK.
Atmosphere 16 00928 g017aAtmosphere 16 00928 g017b
Figure 18. Block diagram illustrating the ANFIS model.
Figure 18. Block diagram illustrating the ANFIS model.
Atmosphere 16 00928 g018
Figure 19. The proposed ANFIS model architecture with input membership function of a 4444 configuration.
Figure 19. The proposed ANFIS model architecture with input membership function of a 4444 configuration.
Atmosphere 16 00928 g019
Figure 20. Time series plot of the training data ( ° ) and ANFIS output (*) over the daily time index for visibility range for (a) Durban; (b) Cape Town; (c) Mthatha; (d) Bloemfontein; (e) Johannesburg; and (f) Mahikeng.
Figure 20. Time series plot of the training data ( ° ) and ANFIS output (*) over the daily time index for visibility range for (a) Durban; (b) Cape Town; (c) Mthatha; (d) Bloemfontein; (e) Johannesburg; and (f) Mahikeng.
Atmosphere 16 00928 g020
Figure 21. Regression plots of the proposed ANFIS model using the testing data throughout the year 2013 for (a) Durban; (b) Cape Town; (c) Mthatha; (d) Bloemfontein; (e) Johannesburg; and (f) Mahikeng.
Figure 21. Regression plots of the proposed ANFIS model using the testing data throughout the year 2013 for (a) Durban; (b) Cape Town; (c) Mthatha; (d) Bloemfontein; (e) Johannesburg; and (f) Mahikeng.
Atmosphere 16 00928 g021
Figure 22. Time series analysis profile of the measured, ANFIS-predicted, and ANN-predicted visibilities (in terms of daily mean) using the testing data throughout the year 2013 for (a) Durban; (b) Cape Town; (c) Mthatha; (d) Bloemfontein; (e) Johannesburg; and (f) Mahikeng.
Figure 22. Time series analysis profile of the measured, ANFIS-predicted, and ANN-predicted visibilities (in terms of daily mean) using the testing data throughout the year 2013 for (a) Durban; (b) Cape Town; (c) Mthatha; (d) Bloemfontein; (e) Johannesburg; and (f) Mahikeng.
Atmosphere 16 00928 g022aAtmosphere 16 00928 g022b
Table 1. Selected ANN model properties.
Table 1. Selected ANN model properties.
ANN Model PropertiesFunctions and Values
Neural Network (NN) TypeFeed-forward Back Propagation Neural Network
Network Training MethodGauss–Newton-based Levenberg–Marquardt algorithm
Adaptive Learning FunctionLEARNGDM
Network Training FunctionTRAINLM
Network Transfer FunctionTANSIG
Number of Neurons15
Number of hidden Layers2
Table 2. Error estimation results of the SST performances based on the appropriate curve fitting type.
Table 2. Error estimation results of the SST performances based on the appropriate curve fitting type.
Input VariableDurbanCape TownMthathaBloemfonteinJohannesburgMahikeng
R 2 R 2 R 2 R 2 R 2 R 2
RH0.30180.36190.34760.25140.30730.2566
T0.11690.15160.12140.13060.12190.1309
P0.03570.02350.05750.03490.04890.0659
WS0.10920.15090.09380.17870.11230.1392
Table 4. Statistical results of the performances of the ANN models using different input configurations.
Table 4. Statistical results of the performances of the ANN models using different input configurations.
DBN
Input CasesInput Variables Mean   R 2 Mean RMSE (%)Mean MBE (%)
Case—ARH0.515638.560 ± 0.251−0.320 ± 0.111
Case—BRH, WS0.606535.481 ± 0.243−0.352 ± 0.100
Case—CRH, T, P0.621933.822 ± 0.451−0.501 ± 0.352
Case—DRH, WS, P0.645033.011 ± 0.683−0.203 ± 0.300
Case—ERH, T, P, WS (All Variables)0.670131.972 ± 0.882−0.221 ± 0.121
CPT
Input CasesInput VariablesMean R 2 Mean RMSE (%)Mean MBE (%)
Case—ARH0.536836.623 ± 0.154−0.252 ± 0.120
Case—BRH, WS0.633533.592 ± 0.222−0.101 ± 0.252
Case—CRH, T, P0.657332.761 ± 0.550−0.342 ± 0.300
Case—DRH, WS, P0.676031.453 ± 0.761−0.200 ± 0.131
Case—ERH, T, P, WS (All Variables)0.700229.421 ± 0.923−0.152 ± 0.103
MTA
Input CasesInput VariablesMean R 2 Mean RMSE (%)Mean MBE (%)
Case—ARH0.519139.713 ± 0.172−0.344 ± 0.222
Case—BRH, WS0.596437.020 ± 0.281−0.202 ± 0.100
Case—CRH, T, P0.617835.771 ± 0.430−0.412 ± 0.324
Case—DRH, WS, P0.638434.324 ± 0.654−0.551 ± 0.252
Case—ERH, T, P, WS (All Variables)0.661232.553 ± 0.753−0.254 ± 0.251
BFT
Input CasesInput VariablesMean R 2 Mean RMSE (%)Mean MBE (%)
Case—ARH0.495540.050 ± 0.273−0.201 ± 0.330
Case—BRH, WS0.579237.884 ± 0.340−0.440 ± 0.294
Case—CRH, T, P0.591236.853 ± 0.392−0.561 ± 0.302
Case—DRH, WS, P0.600136.010 ± 0.583−0.252 ± 0.310
Case—ERH, T, P, WS (All Variables)0.632234.132 ± 0.761−0.304 ± 0.351
JHB
Input CasesInput VariablesMean R 2 Mean RMSE (%)Mean MBE (%)
Case—ARH0.507339.111 ± 0.142−0.321 ± 0.264
Case—BRH, WS0.589936.893 ± 0.330−0.484 ± 0.300
Case—CRH, T, P0.613235.662 ± 0.421−0.503 ± 0.323
Case—DRH, WS, P0.622135.132 ± 0.564−0.241 ± 0.100
Case—ERH, T, P, WS (All Variables)0.665832.924 ± 0.693−0.350 ± 0.253
MHK
Input CasesInput VariablesMean R 2 Mean RMSE (%)Mean MBE (%)
Case—ARH0.479841.091 ± 0.292−0.461 ± 0.200
Case—BRH, WS0.571439.682 ± 0.323−0.334 ± 0.234
Case—CRH, T, P0.595537.033 ± 0.481−0.402 ± 0.152
Case—DRH, WS, P0.601136.230 ± 0.640−0.340 ± 0.300
Case—ERH, T, P, WS (All Variables)0.623535.672 ± 0.834−0.323 ± 0.262
Table 5. Generated model equation and parameters for visibility using meteorological parameters based on input Case-E.
Table 5. Generated model equation and parameters for visibility using meteorological parameters based on input Case-E.
Generated   Model   Equation   for   Visibility :   V p   =   α V m + β
Location Mean   α   ( k m ) Mean   β   ( k m )
DBN 0.69 5.4
CPT 0.72 5.5
MTA 0.67 4.7
BFT 0.68 8.9
JHB 0.75 4.6
MHK 0.62 7.4
Table 6. Estimated training RMSE after ANFIS Training.
Table 6. Estimated training RMSE after ANFIS Training.
PlaceInput MF TypeEpochInputsOutputFunctionObtained Error
DurbanLinear MF30Temperature, Relative Humidity, Pressure and Wind SpeedVisibilitygaussmf0.0262
Cape TownLinear MF30gaussmf0.0132
MthathaLinear MF30gaussmf0.0139
BloemfonteinLinear MF30gaussmf0.0097
JohannesburgLinear MF30gaussmf0.0182
MahikengLinear MF30gaussmf0.0068
Table 7. Coefficient of determination for the training, testing and validation data by FF-BPNN and ANFIS models for the study locations.
Table 7. Coefficient of determination for the training, testing and validation data by FF-BPNN and ANFIS models for the study locations.
ANNANFISEnhancement (%)
PlaceInputsOutput Mean   R 2 R 2 R 2   ( % )
TrainingTestingValidationTrainingTestingTrainingTesting
DurbanTemperature, Relative Humidity, Pressure and Wind SpeedVisibility0.68440.57990.64110.99980.895831.5431.59
Cape Town0.73230.60120.66890.99990.928226.7632.70
Mthatha0.69660.60810.62870.99990.913430.3330.53
Bloemfontein0.64560.63980.64990.99990.929335.4328.95
Johannesburg0.69410.58110.61850.99980.899330.5731.82
Mahikeng0.63320.70670.66010.99990.930136.6722.34
Table 8. Estimated RMSE values after testing for ANN and ANFIS models.
Table 8. Estimated RMSE values after testing for ANN and ANFIS models.
PlaceANNANFIS
Durban1.40510.1172
Cape Town2.40120.2185
Mthatha1.28850.1133
Bloemfontein0.86590.0681
Johannesburg1.10750.1007
Mahikeng1.15470.0730
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Layioye, O.A.; Owolawi, P.A.; Ojo, J.S. Time Series Prediction and Modeling of Visibility Range with Artificial Neural Network and Hybrid Adaptive Neuro-Fuzzy Inference System. Atmosphere 2025, 16, 928. https://doi.org/10.3390/atmos16080928

AMA Style

Layioye OA, Owolawi PA, Ojo JS. Time Series Prediction and Modeling of Visibility Range with Artificial Neural Network and Hybrid Adaptive Neuro-Fuzzy Inference System. Atmosphere. 2025; 16(8):928. https://doi.org/10.3390/atmos16080928

Chicago/Turabian Style

Layioye, Okikiade Adewale, Pius Adewale Owolawi, and Joseph Sunday Ojo. 2025. "Time Series Prediction and Modeling of Visibility Range with Artificial Neural Network and Hybrid Adaptive Neuro-Fuzzy Inference System" Atmosphere 16, no. 8: 928. https://doi.org/10.3390/atmos16080928

APA Style

Layioye, O. A., Owolawi, P. A., & Ojo, J. S. (2025). Time Series Prediction and Modeling of Visibility Range with Artificial Neural Network and Hybrid Adaptive Neuro-Fuzzy Inference System. Atmosphere, 16(8), 928. https://doi.org/10.3390/atmos16080928

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop