# A New Multi-Objective Genetic Programming Model for Meteorological Drought Forecasting

## Abstract


## 1. Introduction

## 2. Study Area and Data Collection

## 3. Methods

#### 3.1. The Standardized Precipitation Index

#### 3.2. Overview of GP and MGGP

$x_{1}$, $x_{2}$, $x_{3}$). Inner nodes are randomly filled by a user-defined function (here, Log, sin, cos, addition, and subtraction). Terminal nodes can hold only a variable or a constant value (here, a random constant equal to 0.213). Overall, the gene expresses Equation (1) in tree form.
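The tree representation described here can be sketched in a few lines of Python. This is only an illustration: the nested-tuple encoding, the `evaluate` helper, the protected logarithm, and the example gene are all assumptions made for this sketch, and the example gene is not Equation (1).

```python
import math

# Inner nodes: (function_name, child, ...). Terminals: a variable name or a constant.
FUNCTIONS = {
    "add": lambda a, b: a + b,
    "sub": lambda a, b: a - b,
    "sin": math.sin,
    "cos": math.cos,
    # Protected log (an assumption of this sketch) so evaluation never fails.
    "log": lambda a: math.log(abs(a)) if a != 0 else 0.0,
}


def evaluate(node, variables):
    """Recursively evaluate a gene for one set of input values."""
    if isinstance(node, tuple):  # inner node: apply the function to its children
        name, *children = node
        args = [evaluate(child, variables) for child in children]
        return FUNCTIONS[name](*args)
    if isinstance(node, str):  # terminal node holding a variable
        return variables[node]
    return float(node)  # terminal node holding a constant


# Hypothetical gene: sin(x1) + (x2 - 0.213)
gene = ("add", ("sin", "x1"), ("sub", "x2", 0.213))
value = evaluate(gene, {"x1": 0.0, "x2": 1.0, "x3": 2.0})
```

Evolution then amounts to swapping or mutating subtrees of such structures while respecting a maximum depth.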

_{0}) (Danandeh Mehr and Safari 2021).

$a_{1}$, $a_{2}$, …, $a_{i}$ are the regression coefficients, commonly calculated using the least-squares optimization technique. Figure 4 illustrates a multi-gene solution with three genes and a maximum depth of three, evolved using three input variables (i.e., $x_{1}$, $x_{2}$, $x_{3}$). Overall, the multi-gene model can be mathematically expressed by Equation (3). Eray et al. [31] demonstrated that the multi-gene GP algorithm can produce accurate and relatively low-depth solutions for hydrological simulations. In this study, GPTIPS [32], the MGGP toolbox, was used to develop multi-gene SPI estimation models. For more details on the monotonic GP and MGGP algorithms, and a review of their applications to engineering problems, the reader is referred to [30].
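The least-squares step that weights the genes can be illustrated with NumPy. The sketch below assumes the usual multi-gene form, in which the prediction is an intercept plus a weighted sum of gene outputs; the function names and the synthetic data are hypothetical and are not taken from GPTIPS.

```python
import numpy as np


def fit_gene_weights(gene_outputs, target):
    """Return [a0, a1, ..., an] minimizing the squared error of a0 + sum(ai * gene_i)."""
    genes = np.asarray(gene_outputs, dtype=float)  # shape (n_samples, n_genes)
    y = np.asarray(target, dtype=float)
    # Prepend a column of ones so the first coefficient acts as the intercept.
    design = np.column_stack([np.ones(len(genes)), genes])
    coeffs, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coeffs


def predict(gene_outputs, coeffs):
    """Combine gene outputs linearly with the fitted coefficients."""
    genes = np.asarray(gene_outputs, dtype=float)
    return coeffs[0] + genes @ coeffs[1:]


# Synthetic check: three gene outputs combined with known weights.
rng = np.random.default_rng(0)
genes = rng.normal(size=(50, 3))
true_coeffs = np.array([0.5, 1.0, -2.0, 0.25])
y = true_coeffs[0] + genes @ true_coeffs[1:]
weights = fit_gene_weights(genes, y)
```

Because the weights are obtained in closed form, the evolutionary search only has to discover the gene structures, not their scaling.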

#### 3.3. State-of-the-Art MOMGGP Algorithm

#### 3.4. Performance Evaluation

## 4. Results and Discussion

#### The Best GP, MGGP, and MOMGGP Solutions

- RMSE was applied as the objective function in both tools. The smaller the RMSE, the better the forecasting accuracy;
- arithmetic operations (+, −, ×, and /), the exponential function (Exp), three-argument addition and multiplication, square, and trigonometric functions (sin and cos), all with the same selection probability, were used as the function set;
- SPI lags (from lag 1 to lag 12) together with a set of random numbers in the range of −10 to 10 were used in the terminal set;
- the maximum tree depth for the GP and MGGP solutions was set to nine and four, respectively;
- the maximum number of genes for the MGGP solution was set to five;
- ramped half-and-half initialization of individuals with a population size of 300 at each run was used;
- each run was configured to proceed for 500 generations or to terminate when a fitness (RMSE) of 0.002 was achieved.
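The settings listed above can be collected in a small configuration object, together with the stopping rule. This is a minimal sketch: the `RunConfig` class, its field names, and `should_stop` are hypothetical and do not correspond to the options of either tool used in the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunConfig:
    """Run settings quoted in the list above (field names are illustrative)."""
    population_size: int = 300
    max_generations: int = 500
    target_rmse: float = 0.002
    max_depth_gp: int = 9
    max_depth_mggp: int = 4
    max_genes_mggp: int = 5
    n_lags: int = 12
    constant_range: tuple = (-10.0, 10.0)


def should_stop(generation, best_rmse, config):
    """Stop when the generation budget is spent or the target RMSE is reached."""
    return generation >= config.max_generations or best_rmse <= config.target_rmse


config = RunConfig()
```

In practice the RMSE target of 0.002 is far below the errors reported for the fitted models, so under these settings a run would be expected to end on the generation limit.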

$\mathrm{SPI}_{t-12}$ to $\mathrm{SPI}_{t-1}$, respectively.
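A lagged input matrix of this kind can be built from an SPI series as sketched below. The column order (oldest lag first, so that the first column is the 12-month lag and the last column is the 1-month lag) is an assumption based on the ordering quoted above, and `build_lag_matrix` is a hypothetical helper.

```python
import numpy as np


def build_lag_matrix(spi, n_lags=12):
    """Return (X, y) where each row of X holds SPI(t-n_lags) ... SPI(t-1) and y is SPI(t)."""
    spi = np.asarray(spi, dtype=float)
    if len(spi) <= n_lags:
        raise ValueError("series must be longer than the number of lags")
    n_rows = len(spi) - n_lags
    # Column j holds the lag (n_lags - j): column 0 is the oldest value.
    X = np.column_stack([spi[j:j + n_rows] for j in range(n_lags)])
    y = spi[n_lags:]
    return X, y


# Toy series 0, 1, ..., 14 makes the alignment easy to inspect.
series = np.arange(15.0)
X, y = build_lag_matrix(series)
```

Each row pairs twelve consecutive past values with the value that follows them, which is the month-ahead forecasting setup.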

## 5. Conclusions

## Author Contributions

## Funding

## Data Availability Statement

## Acknowledgments

## Conflicts of Interest

**Figure 2.** The SPI-6 time series calculated using historical precipitation records at the observatory station in Burdur, Turkey.

**Figure 7.** (**a**) Summary of the GP runs and (**b**) the best GP model developed for month-ahead SPI-6 forecasting at the Burdur meteorological station.

**Figure 8.** Multi-gene expression and gene weights of the best MGGP model developed for month-ahead SPI forecasting at Burdur. The $x_{i}$ ($i$ = 1, 2, …, 12) are the normalized lagged SPI vectors used to forecast the normalized SPI.

**Figure 9.** The Pareto-front plot of the 300 generated MGGP models in terms of their forecasting error and expressional complexity.
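As a rough illustration of what such a Pareto front contains, the sketch below keeps only the models that no other model beats on both error and complexity. The `pareto_front` helper and the candidate values are hypothetical and are not the models plotted in Figure 9.

```python
def pareto_front(models):
    """Return indices of non-dominated (error, complexity) pairs; lower is better for both."""
    front = []
    for i, (err_i, cx_i) in enumerate(models):
        dominated = any(
            err_j <= err_i and cx_j <= cx_i and (err_j < err_i or cx_j < cx_i)
            for j, (err_j, cx_j) in enumerate(models)
            if j != i
        )
        if not dominated:
            front.append(i)
    return front


# Hypothetical (RMSE, complexity) pairs for five candidate models.
candidates = [(0.50, 200), (0.52, 130), (0.55, 190), (0.60, 60), (0.52, 150)]
front = pareto_front(candidates)
```

Here the third and fifth candidates are dropped because the second one is at least as accurate and simpler than both.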

**Figure 10.** Multi-gene expression and gene weights of the proposed MOMGGP model developed for month-ahead SPI forecasting in Burdur. The $x_{i}$ ($i$ = 1, 2, 3, …, 12) are the normalized lagged SPI vectors used to forecast the normalized SPI.

**Figure 11.** Observed and predicted monthly SPI-6 time series and their scatter plots in the training (upper panels) and testing (lower panels) datasets.

**Table 1.** Statistical characteristics of the SPI-6 series attained at Burdur station during the 1971–2021 period.

Station | Dataset | Mean | Min | Max | SD * |
---|---|---|---|---|---|
Burdur | Entire | 0.00 | −2.91 | 2.65 | 1.005 |
Burdur | Training | 0.02 | −2.91 | 2.65 | 0.987 |
Burdur | Testing | −0.04 | −2.58 | 2.20 | 1.046 |

**Table 2.** Classifications of drought states with the aid of the SPI [26].

State | Threshold |
---|---|
No drought | 0.0 ≤ SPI |
Mild drought | −1.0 ≤ SPI < 0.0 |
Moderate drought | −1.5 ≤ SPI < −1.0 |
Severe drought | −2.0 ≤ SPI < −1.5 |
Extreme drought | SPI < −2.0 |
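The thresholds in Table 2 translate directly into a lookup function. The sketch below assumes that an SPI of exactly 0.0 counts as no drought; `classify_spi` is a hypothetical helper name.

```python
def classify_spi(spi):
    """Map an SPI value to the drought state defined by the Table 2 thresholds."""
    if spi >= 0.0:
        return "No drought"
    if spi >= -1.0:
        return "Mild drought"
    if spi >= -1.5:
        return "Moderate drought"
    if spi >= -2.0:
        return "Severe drought"
    return "Extreme drought"


# The minimum SPI-6 in Table 1 (-2.91) falls in the extreme class.
state = classify_spi(-2.91)
```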

Models | Complexity | Training RMSE | Training NSE | Testing RMSE | Testing NSE |
---|---|---|---|---|---|
GP | 190 | 0.550 | 0.689 | 0.548 | 0.726 |
MGGP | 195 | 0.504 | 0.740 | 0.555 | 0.717 |
MOMGGP | 128 | 0.522 | 0.721 | 0.542 | 0.731 |
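For reference, the two accuracy measures reported in this table can be computed as sketched below, using the standard definitions of the root mean square error (RMSE) and the Nash–Sutcliffe efficiency (NSE). The function names and the toy values are illustrative only and are not the study's data.

```python
import numpy as np


def rmse(observed, predicted):
    """Root mean square error; 0 indicates a perfect fit."""
    obs = np.asarray(observed, dtype=float)
    pred = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((obs - pred) ** 2)))


def nse(observed, predicted):
    """Nash-Sutcliffe efficiency; 1 is perfect, 0 matches predicting the observed mean."""
    obs = np.asarray(observed, dtype=float)
    pred = np.asarray(predicted, dtype=float)
    residual = np.sum((obs - pred) ** 2)
    variance = np.sum((obs - obs.mean()) ** 2)
    return float(1.0 - residual / variance)


# Toy example: one of four predictions is off by 2.
obs = [0.0, 1.0, 2.0, 3.0]
pred = [0.0, 1.0, 2.0, 5.0]
error = rmse(obs, pred)
efficiency = nse(obs, pred)
```

A lower RMSE and a higher NSE both indicate a better forecast, which is how the testing columns of the table are read.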

