1. Introduction
The global load demand has seen a rapid increase as a result of global population and domestic appliance increase during the last decades. The continuous utilization of fossil fuels to satisfy these needs causes climate change worldwide. The introduction of polluting emissions including carbon oxides (CO
x), nitrogen oxides (NO
x), sulfur oxides (SO
x), gaseous hydrocarbons (C
xH
y), soot, ash, droplets of tars, and other organic compounds, affects the ozone layer integrity, the greenhouse intensification, and human health [
1]. A layer with lower ozone concentration is unable to absorb most of the sun’s ultraviolet (UV) radiation, allowing a large portion to reach the Earth. Due to the innumerable building installations that the planet lists today, large amounts of solar energy are absorbed and stored in the form of heat, whereas the residual fraction that could be otherwise reflected back to the clear sky is now constantly restricted by the greenhouse effect. As a result, both global warming and energy crisis based on fuel-price uncertainty and reserve availability, constitute increasing concerns for modern societies.
In order to reduce the dependency on imported fuels, renewable energy sources (RES) offer a pathway to mitigate the environmental footprint towards sustainability. Between such resources, wind is more accessible and fairly cheaper [
2] while solar energy is distributed over a wide geographical region providing a long-term stable cost with minimum maintenance requirements [
3]. However, the increasing employment of RES to replace conventional generation brings a lot of stability challenges. The current transformation of passive networks to bi-directional smart grids promises novel mechanisms to guarantee self-healing from disturbances, active consumer participation in real-time operation, the accommodation of all generation and storage options, and assets optimization, without deteriorating the power quality provision. On the other hand, the intermittency and uncertainty of RES impose power quality violations translated in voltage and frequency deviations from nominal values, non-unity power factor, and sinusoidal waveforms with higher harmonics and transients. Hence, it is crucial for the independent transmission system operators (TSOs) to plan ahead the optimal generation scheduling of conventional plants considering the residual, net-load demand.
To account for the mismatches between actual and forecasted values, each TSO must also plan ahead adequate operating reserves, which are generally classified into four categories. In normal conditions, regulating and following reserves can take place to automatically or manually adjust the production to consumption according to the optimal economic dispatch. In case of an event, the ramping reserves offer secondary and tertiary control to return the frequency back to its nominal levels or replace primary and secondary reserves, respectively. Similar control is provided by the contingency reserves, which can also serve in primary services to stabilize the frequency [
4]. According to the required reliability and accepted risk, these reserves add considerable costs to the total generation and/or transmission and distribution of power. It should be noted that in increasing the share of RES in the generation mix, the ability to provide the needed flexibility is reduced, since RES cannot contribute in reserve provision but inherently impose its increase. It is therefore important for efficient tools to be employed and to accurately forecast the residual load in upcoming smart grids.
1.1. Literature Survey
The vast majority of the literature on load forecasting concentrates on residential consumption and small-area RES generation [
5,
6,
7]. Relating to wind power, most studies focus on wind speed and direction forecast, stating that they are the only parameters that affect the output power. For the prognostication of wind speed and produced power, the study in [
8] proposed a radial basis function neural network using the data from a wind farm near the central Taiwan area. In [
9], a generalized regression neural network is proposed to predict the wind speed for the western region of India. The accuracy in wind-speed estimation of the outlier correction algorithm, wavelet, and extreme learning machine was compared in [
10], whereas another study [
11] presented a novel method based on Weibull and Gaussian probability distribution. Utilizing the past values of measured wind speed and various input parameters including longitude, latitude, daily horizontal solar irradiance, air temperature, relative humidity, elevation, and atmospheric pressure, a few assessments have been made on time-series predictions [
2]. Among them, a support vector machine was proposed by [
12] to predict the mean 10-min time series data of wind speed. Targeting smaller regions, several studies assess the performance of different models for short-term PV generation forecasting. A neuro-fuzzy method was proposed by [
13] to investigate solar radiation taking into account the mean sea level, dry-bulb temperature, wet-bulb temperature, and relative humidity. The dry-bulb temperature and relative humidity were found to be the most dominant factors for radiation prediction. The study in [
3] concluded that to effectively predict the hourly solar radiation using machine learning algorithms, the minimum parameter inputs needed are the daily sunshine duration, precipitation, relative humidity, air pressure, and daily minimum/mean/maximum temperatures. Considering the solar energy availability and the absence of measuring stations in some regions, a mechanism based on a group method of data handling was developed by [
14]. The proposed formulation made use of particle swarm optimization, genetic algorithm, and ant colony to optimize the artificial intelligence models towards accurate PV forecasts. Finally, the authors in [
15] demonstrated a hybrid model named Ensemble Long Short-Term Memory-Feed Forward Neural Network, indicating that it can increase the PV forecasting accuracy in both 15 min- and 1 h-ahead horizons.
Apart from a few studies dealing with load forecasting in small-scale buildings and micro-grids in the short term, most studies investigating the load demand utilize temperature as the only input that affects the overall profile [
16]. In this way, long short-term memory networks have to be combined with convolutional neural networks in order to estimate the correlation and minimize the forecast error against the actual load [
17]. This could be extremely computational-intensive in the case of residual load forecasting where several inputs with vast historical data have to take place. In addition, aiming at the minimization of the expected error over probability distributions, these frameworks appear versatile and difficult to retrieve temporal and spatial correlations within the entangled functions and dependent variables.
1.2. Objective and Contribution
In this study, a multi-input/multi-output model is utilised to effectively forecast the day-ahead residual load, taking into account a large set of different weather, activity, and cost factors. Utilizing these factors as predictors, the mapping of their relationship with the targets, namely load demand, PV, and wind power production, is performed by making use of a deep feed-forward neural network. In order to identify both the spatial and seasonal correlations between different consumers, several predictors are used to estimate the load repetition, seasonal, weather, energy price, and overall activity impacts. Moreover, the forecast model accounts for a wide variety of predictors for the simultaneous determination of the power output generated by PV parks and wind farms. Based on actual data, the proposed model is assessed over three case studies varying the input parameters, and its performance is compared under a wide variety of fitness algorithms used to update the weights and bias towards nonlinear regression. The aim is to develop a comprehensive forecast model able to observe day-ahead residual load with high accuracy based on realistic and publicly available input parameters. The model could form a benchmark framework for comparisons under different conditions, regions, scales, and training algorithms. Although deep reinforcement learning [
18] and hierarchical reinforcement learning [
19] are both capable of solving complex decision-making tasks, the burden of increased dimensionality and time-intensive training becomes superior in forecasting tasks. The main reasons rely on deteriorating uncertainty due to increasing renewable sources and weather-dependent electric devices. Consequently, the stochastic behavior related to people’s daily habits and fluctuating generation is proposed to be conducted with the aid of Gaussian process regression.
In modern power systems, each independent producer provides the day-ahead generation to the associate system operator. Consequently, the aggregators of PV and small wind parks are accountable to distribution system operators, whereas bigger wind farms must inform the transmission system operators about their actual hourly power output. Between the transmission and distribution system operators, a bi-directional communication exists in order to identify the residual load for the optimum unit commitment and economic power dispatch of the conventional, thermal generating units, which may be connected to either distribution or transmission networks. In addition, all information with respect to the load demand derives from individual load forecasts by the distribution system operator. In this regard, it is highly important to predict the load with high accuracy, especially in the case of prosumer (producer and consumer), demand response, and demand-side management customers. As a result, the day-ahead residual load forecasting constitutes a critical tool for transmission system operators, since the overall production and consumption is predicted consolidating the transmission and distribution losses in relation to exogenous factors, such as temperature, wind speed, and time. Furthermore, the privacy and personal information pertaining different stakeholders (e.g., consumers, prosumers, RES producers, conventional producers, distribution system owners, transmission system owners, aggregators, distribution system operators, transmission system operators, etc.) is retained. To this end, historical data possess a vital role and thus, their right choice and overall management becomes a key requirement for the forecasting accuracy.
The rest of the paper is organised as follows. In
Section 2, the mathematical framework relating to the load demand and wind and PV power generation is presented. The multi-input/multi-output (MIMO) model for residual load forecasting is explained in
Section 3, while the extensive experimental evaluation and overall comparisons are included in
Section 4. The conclusions are drawn in
Section 5.
2. Mathematical Framework
To define the least required predictor inputs, the formulation of load, wind power, and PV power is taken into account. Both targets depend on uncertain weather conditions, while load relates also some other parameters which are discussed in the following sections.
2.1. Load Profile
Load demand varies based on a periodic hourly, daily, weekly, monthly, seasonal, and annual basis. This expresses the load fraction according to the human activity and energy requirements to satisfy its overall needs. Consequently, the load curve demonstrates a decreasing trend when most people sleep and constantly increases when they wake up and start working. Peak demand coincides with weather (especially temperature) extremities and when people return from their work and residential buildings are occupied again [
20]. Apart from the electricity needed for heating, cooling, and cooking, a further share, which is important, refers to the lighting load. The last also depends on weather conditions (mainly on sunlight) and the time. As a result, cloud-coverage may form a different input to the load-forecasting task.
With the fast expansion of rooftop PV systems engaged with net-metering or net-billing schemes, residential load forecasting became a serious challenge. The relationship between domestic load and PV generation for self-producers acquired a complex relation that differs considerably by hour and season. To capture their correlation, humidity and wind speed are needed to appropriately define how the actual temperature feels and, together with the previous day, previous week and previous year load, to estimate the extent of load increases or decreases. Historical load serves in adding the respective knowledge for grid extensions, load additions and upgrades, increase in tourism, and so on. Human habits can be distinguished by date into weekdays, weekends, and holidays.
All the aforementioned parameters regard the satisfaction of humans’ electricity needs without their participation. Considering the demand response, fuel and electricity price can contribute to the reshaping of the load profile [
21]. Indeed, during periods with increased cost of oil per barrel, demand has seen drastic reductions. Similarly, the electricity price can affect the time-of-use in some households, which either participate in demand response programs or not. For a random facility
k, the electricity requirements (
Pk) can be calculated via Equation (1), where
ηi and
Pi represent the overall efficiency and input needs of the
ith domestic appliance to convert electricity into useful work at a specific time
t, such as sound, light, heat, rotation, etc.
The electricity-to-work efficiency varies from device to device. Apart from the time of use, weather-dependent devices offer a varying efficiency, difficult to estimate in most cases. Towards this realization, the total electricity demand (
Pload) is planned based on Equation (2) for each time-interval (usually hourly). The nominal frequency (50 or 60 Hz) is retained considering the following (Equation (3)) power balance condition:
Uj indicates the status of conventional generator
j, while
Pj represents its actual power output. Hence, the residual load (
PRL) must always be equal to the total generation from all (
N) thermal units and accounts for the total load demand excluding the contribution of RES, namely wind power
Pwind and solar PV power
Ppv, at each time interval. Typical examples are depicted in
Figure 1, where the contribution of RES transforms the conventional load demand in the summer of consecutive years to the “duck curve”.
As can be seen, the midday solar saturation and evening ramp, also known as the “belly” and “neck” of the duck, constitute challenging tasks that require extremely fast ramp rates from the conventional generating units. In addition, increasing the contribution of RES, the spinning reserve capability is decreased, deteriorating the system stability and reliability [
22]. To ameliorate for the security-constraint problem of optimal unit commitment and allow for a smoother transition towards carbon-free and environmentally friendly energy resources, electricity storage solutions are continuously assessed in a wide variety of applications. In this way, the current thermal resources can be replaced with future low-carbon alternatives which will be quite ‘dispatchable’. However, this solution adds considerable uncertainty and cost, while the residual load for the conventional generators may fluctuate even more stochastically.
Figure 2 illustrates three typical paradigms without storage, in the presence of storage, and with prosumer participation.
2.2. Wind Generation
Wind turbines can convert the kinetic energy from wind to electrical energy with the aid of rotor blades based on the following equation.
Pw constitutes the wind power as a function of the air density (
ρair), the spanned area by the rotor blades (
A), and the wind velocity (
vw). However, a turbine cannot extract 100% of the kinetic energy from wind according to the Betz limit (
cp ~ 59.26%). In this way, the maximum power output from a wind turbine can be obtained via Equation (5).
According to the type and scale of the wind turbine, most manufacturers define the kinetic energy converted into electricity as a function of the rated power
Pr and wind speed
vr with respect to the cut-in
vi and cut-out speed
vo speed. In this regard, the following formulation can be used:
The wind velocity
vw,ref is measured at weather stations with a pre-defined height
hw,ref (e.g., 10 m), which may be significantly lower than the actual height
hw of a wind turbine (~100 m). At the desired height, these measurements can be corrected to represent the actual velocity
vw based on Equation (7), combining the roughness of the terrain
ho.
Together with wind direction, these parameters are the most important to estimate the actual power output from wind turbines, while pitch regulation, gear selection, and yaw angle control are performed in order to keep
cp near its maximum value considering the tip speed ratio (
TSR or
λ) as follows:
where
r is the radius of the circular disc formed by the blades and
ω defines the rotational speed.
Figure 3 demonstrates a simulation of a wind turbine and a simple diagram for the determination of wind direction in degrees. Identifying the parameters
A,
hw,
r,
cp and the variables
ρair,
λ,
vw, required to estimate the wind power generation, the necessary data collection can take place as will be explained in the model description section.
Wind power forecasting plays a pivotal role in the efficient integration of renewable energy into the grid. However, several uncertain parameters pose significant challenges to accurate predictions. One of the foremost factors contributing to uncertainty is the variability in wind speed, which is inherently unpredictable and subject to rapid changes. Additionally, the performance and characteristics of wind turbines introduce uncertainties, encompassing factors such as degradation over time and variations in equipment specifications. Meteorological conditions, including temperature, air density, and atmospheric pressure, further compound the intricacies of wind power forecasting. Accurate prediction of wind direction is paramount, as changes in this parameter can markedly impact the performance of wind turbines. Furthermore, the local topography, including hills, valleys, and other geographical features, plays a crucial role in influencing wind patterns, adding another layer of complexity to forecasting models.
Addressing the uncertainties associated with seasonal and diurnal variations is equally imperative for reliable wind power forecasting. Wind patterns exhibit cyclic variations over the course of a day and are subject to seasonal shifts, necessitating robust modeling approaches. Moreover, the quality and availability of data, both historical and real-time, introduce uncertainties into forecasting models. Data gaps, inaccuracies, and limitations in measurement technologies must be carefully considered. Climate change effects on wind resources represent an emerging concern, with potential long-term impacts on forecasting accuracy. The integration of wind power into the grid is also contingent on understanding and mitigating uncertainties related to transmission and grid constraints. Finally, the parameters within the forecasting models themselves, such as algorithm selection, time horizons, and updating frequencies, contribute to the overall uncertainty in wind power forecasting.
2.3. PV Generation
Electric power can be provided directly from solar PV panels considering the photovoltaic conversion efficiency (
ηpv), the global solar irradiation (
GA), and the ambient temperature (
Ta). According to the type of the PV arrays employed, some technical parameters are also needed to estimate the actual power output. These refer to the solar radiation under standard conditions (
GSC), the temperature under standard conditions (
TSC), and the PV temperature coefficient (
CT). Hence, a nominal output (
PSC) under 25 °C and 1 kW/m
2, which are usually taken into account by manufacturers during testing and validation, is given and the actual PV generation (
Ppv) is estimated as:
Tilted PV modules show an angular loss due to the fact that the irradiation is not always perpendicular to the PV surface during the day and the year. Therefore, two tilt- angle corrections can be made via two-axis tracking systems, namely the module tilt angle from vertical axis
a and the tilt angle from horizontal axis
β. During daylight (from sunrise to sunset),
β can vary from 0° to 180°, tracking the sun’s direction from east to west. During the year, and according to the season,
a ranges between 0° and 90° to increase the solar absorption towards maximum conversion efficiency [
23].
Figure 4 illustrates a paradigm of a PV module with a two-axes tracking system.
GA can vary depending on the cloud cover prevailing at a given time. Thus, a clearness index indicating the fraction of solar radiation transmitted through the atmosphere to strike the earth’s surface can be used. Utilizing a dimensionless number between 0 and 1, it can range between a minimum of 0.25 and the maximum of 0.75, to define a mostly cloudy and a mostly sunny day, respectively. A similar effect can be observed with dust and other atmospheric particulate matter (PM) which is able to reduce the solar energy provision by ~17–25% with roughly equal contributions from ambient PM and PM deposited on photovoltaic surfaces [
24].
Finally, the impact of temperature on a PV panel must be considered. The controversial effect of decreased generation during higher solar irradiance can be explained by engaging the ambient temperature information. According to Equation (9), every 1 °C above the standard temperature of 25 °C can decrease the photovoltaic efficiency by 0.3–0.5% (assuming that the temperature coefficient sits between −0.3 and −0.5%/°C).
Accurate forecasting of solar photovoltaic power output is essential for the effective integration of renewable energy into the grid. However, numerous uncertain parameters contribute to the complexity of predicting solar PV generation. Central among these is the variability in solar irradiance, influenced by dynamic atmospheric conditions, cloud cover, and diurnal cycles [
25,
26]. Addressing this uncertainty is paramount for reliable predictions, as fluctuations in irradiance directly impact the energy yield of solar panels. Additionally, the effects of temperature on PV panel performance introduce a layer of unpredictability, requiring sophisticated modelling to capture temperature-related efficiency changes accurately. Beyond these meteorological factors, considerations of panel soiling and degradation over time become integral, affecting the long-term reliability of PV systems. The challenge extends further to encompass shadowing and obstructions, as well as the optimal orientation and tilt angles of solar panels, which collectively influence the spatial and temporal distribution of sunlight exposure.
Inverter efficiency represents another critical parameter introducing uncertainty into solar PV power predictions. Accurate conversion of direct current (DC) to alternating current (AC) is pivotal for maximizing energy output, and fluctuations in inverter efficiency must be meticulously considered. Moreover, the impact of weather-related factors, such as atmospheric pressure, humidity, and wind speed, on PV performance necessitates a comprehensive approach to forecasting. The quality and availability of data, spanning historical solar irradiance records to real-time measurements, constitute additional sources of uncertainty that researchers must contend with in the development of robust forecasting models. As we strive for greater reliance on solar energy, understanding and mitigating these uncertain parameters are fundamental to advancing the precision and reliability of solar PV power predictions.
3. Model Description
In this section, the proposed model for day-ahead residual load forecasting is described. The input parameters used as predictors are explained in detail along with the mapping process for the relationship between the input predictors and output targets. The generic input factors are classified in five major categories. Constant values such as the Boltzmann constant, Betz limit, water density, and various test conditions are excluded from the studied predictor list. The input parameters including wind turbine heights, fixed tilt angles of PV surface, device efficiency, and so on are also excluded. Although they possess diverse values for each independent system, in total evaluation they occur as constants or target (output) independent variables. The third class regards the controllable variables like tilted PV arrays with tracking systems, pitch and yaw angle, gear ratio, etc. These variables vary towards optimum power output, and thus they are also excluded from the predictor matrix. Following are the non-controllable variables, which are strictly correlated with output target values. These relate the ambient temperature, humidity, cloud coverage, wind speed, energy prices, and past knowledge with respect to the considered targets. Together with the last category, which accommodates the time indices (hour, day, month, season, and year), all uncontrollable variables are used as predictors for the proposed forecasting model.
Aiming at the effective forecast of residual load, the impact of weather is taken into account for the determination of power generation from RES (wind and PV). Wind output is strictly correlated with wind speed, wind direction, and air density. To guarantee that the wind blows perpendicular to the turbine blades, the gear selection process takes place and is not required to be included as a predictor input. On the other hand, the air density formed by controllable and uncontrollable weather variables is indirectly included together with wind speed. Analysing the origin of density, one can simply conclude that it depends on altitude, which forms a parameter, and temperature and humidity which are uncontrollable variables and must be taken into account. On the contrary, PV offers periodically equivalent outputs due to the sun’s position during identical daily, monthly, and seasonal intervals.
With respect to the load demand, the human activity in terms of day type in combination with past load observations, the weather impact, seasonality, and energy price are considered as important predictors for a proper day-ahead prognostication. Consequently, the required predictors, for which historical data must be obtained, are day type, temperature, humidity, wind speed, cloud coverage index, fuel price, electricity price, previous-day load, previous-week load, and previous-year load. These are related to the output targets f load demand, wind, and solar PV output and are grouped by hour of day, day of the year, the month, the season, and the year. It is worth noting that, apart from the knowledge of the previous load, the rest of the inputs constitute forecasted values for the day-ahead representation.
Table 1 lists the selected predictors and targets by category.
Hours and days form important predictors in predicting the PV output. In normal cases, the cloud coverage index may help the model to increase the accuracy; more enhanced configurations must involve the temperature, humidity, and years. Apart from the weather impact, the input indicating the year under assessment provides the information of PV installations added from year to year (translated as capacity increments). To forecast the fluctuating load in enhanced systems, all time indices are needed (hours, days, day types, months, seasons, and years) along with the weather impact predictors of temperature, humidity, and cloud coverage. In addition, the load trend can be explained via the previous load knowledge, and overall customer behavior is affected by the varying energy prices. The last target, wind power, is time-independent and is affected only by wind speed and air density, which can be defined equivalently by the ambient temperature and relative humidity.
Neural networks are often preferred for load forecasting due to their ability to model complex, nonlinear relationships between input variables and the target output, which is particularly useful in capturing the intricate dynamics of energy demand. They can effectively learn from historical data and identify patterns that traditional statistical methods might miss. Additionally, neural networks are highly flexible and can be adapted to different forecasting horizons (short-, medium-, or long-term) and can handle large amounts of data, improving the accuracy and robustness of predictions. Their capability to generalize well across varying conditions makes them ideal for dealing with the uncertainties and fluctuations in load forecasting. The developed model exploits a feed-forward neural network with a varying number of hidden layers. The layers vary from one to five according to the number of input predictors under assessment. In more detail, a number (
K) of predictors
x are imported in the feed-forward network along with the target
y to form the dataset
. The model is trained using the largest share (70%) of the historical data for training, while the rest is equally distributed for validation (15%) and testing (15%). The multi-input/multi-output model is employed as follows [
27]. Considering six neurons for the first hidden layer (
h11–
h16), Equations (10) and (11) can be employed to extract the first output
h21.
This is repeated through the neural configuration [6, 8, 9, 8, 6] for the respective five hidden layers, estimating the outputs of
h21–
h28,
h31–
h39,
h41–
h48,
h51–
h56 and the final target of
y (Equations (12)–(16)).
The sigmoid activation function is expressed via σ(.), hi is the output of each hidden layer, wk constitutes the weights, and bi represents the bias operator. A temporal optimal observation y at each iteration k is assumed as target data, and the scope is to effectively reduce the error between this target outcome and the progressively obtained pattern solution. To this end, different algorithms can be used to update the weight matrix and bias between consecutive iterations (e.g., k and k + 1). For example, performing simulations in MATLAB R2022-MathWorks, the available training algorithms (and their respective function) for nonlinear regression are: Levenberg Marquardt (trainlm), BFGS Quasi-Newton (trainbfg), resilient backpropagation (trainrp), scaled conjugate Gradient (trainscg), conjugate gradient with Powell/Beale restarts (traincgb), Fletcher-Power conjugate gradient (traincgf), Polak-Ribiere conjugate gradient (traincgp), one-step secant (trainoss), Bayesian regularization (trainbr), gradient descent (traingd), gradient descent with momentum backpropagation (traingdm), gradient descent adaptive learning rate backpropagation (traingda), gradient descent with momentum, and adaptive learning backpropagation (traingdx).
In this way, the neural is separately used to make use of an algorithm at a time and to observe the targets of load, PV, and wind power, while the residual load is estimated by the model based on Equation (3).
Figure 5 depicts the proposed multi-input/multi-output model. Time series data with hourly resolution are used as input predictors. To distinguish the targets, all data are separated by hours, days, day types, months, seasons, and years, each one forming an individual input. Hence, the inputs of temperature, humidity, wind speed, cloud coverage, energy price, and previous load constitute
Mx1 vectors, where
M equals to 26,280 (assuming 3 years, 365 days/year, and 24 h/day).
Utilizing one of the aforementioned training algorithms each time, a neural network with different inputs is employed for every target. Based on this model, the prediction is performed using 24 h forecasted values recording the output. Once the day-ahead forecast of load demand, PV, and wind output is completed, the residual load is estimated and the performance metrics are recorded. The procedure is repeated for the 13 training algorithms. To account for the imposed uncertainty between the actual and predicted residual load, nonparametric models can take place. Gaussian process regression (GPR) is expected to be favored for load forecasting because of its ability to provide not only accurate predictions but also a measure of uncertainty in those predictions, which is crucial for managing energy systems. It also performs well with small- to medium-sized datasets, as it does not require large amounts of data to achieve high accuracy. GPR models are nonparametric, probabilistic models that rely on kernel functions. Considering a training set
where
and
are drawn from an unknown distribution, the goal of a GPR model is to predict the response variable
ynew, given the new input vector
xnew, and the information from the training data. A linear regression model is expressed as:
where
. The error variance
and the coefficients
β are typically estimated from the data. A GPR model explains the response by introducing latent variables,
, derived from a Gaussian process, along with explicit basis functions,
h. The covariance function of these latent variables reflects the smoothness of the response, while the basis functions map the inputs
x to a
p-dimensional feature space. Gaussian process is a collection of random variables, such that any finite subset has a joint Gaussian distribution. Therefore, if we define
as a Gaussian process, then for given observations
, the random variables
follow a joint Gaussian distribution. To comprehensively specify it, the mean value is defined via the function
m(x), whereas the covariance is demonstrated by
. Specifically, if
forms a GP, then
and
.
Consider the model
where
. This represents a GPR framework, where the response variable can be modelled as:
For each observation
xi, the joint distribution of latent variables
is presented in vector form as
, where
is written as follows:
The covariance function is typically parameterized by a set of kernel parameters or hyperparameters, denoted by θ. Sometimes, is used to explicitly show this dependence on θ. Exploiting a fitness function available in MATLAB R2022, known as ‘fitrgp’, the kernel values of β, σ2 and θ can be estimated during the training of the GPR model. Hence, the covariance (or kernel) function expresses how the response (or target) yi at one point xi is affected by responses (yj) at other points xj which possess similar predictor values. The characteristic length, σl, is utilized to define a magnitude apart from which the input values are treated as uncorrelated to the response output.
During the evaluation of the demonstrated MIMO using GPR, two categories of kernel functions have been tested. The first category pertains to the exponential, the squared exponential, the Matern 3/2, the Matern 5/2, and the rational quadratic kernel functions which concern same length-scale for each predictor. The other category accommodates the built-in functions which consider a separate length scale for each predictor. These kernels are referred to as automatic relevance determination (ARD) alternatives. The mathematical expressions of the kernel functions used are listed in
Table 2. The Euclidian distance between
xi and
xj is noted by
r, while
θ expresses the vector of
and
, which forces both
σl and
σ to be always a positive number.