This section exhibits numerous highly cited ANN research applications related to BEMS control and optimization in order to discriminate them into the aforementioned ANN types: FNNs; RNNs; hybrid control applications—which concern the integration of ANNs with each other or with other methodologies; and other ANN applications that do not concern any of the aforementioned types. To this end, the current section explores the integrated research applications, in which their underlying motivations and the conceptual methodologies of ANNs used are thoroughly detailed. It elaborates on the structure of each potential ANN, encompassing a comprehensive characterization of inputs, outputs, hidden layers, and the overall architecture of the models. Additionally, the outcomes of these applications are thoroughly analyzed in terms of statistical measures.
The abbreviation “N/A” or “-” represents the “not identified” elements in the Tables and Figures.
4.1. Review of the FeedForward Neural Network Applications for BEMS Control
In a 2015 study, Afram et al. [
59] focused on developing and comparing models for various HVAC subsystems, such as the energy recovery ventilator (ERV model structure—4:10:2), air handling unit (AHU model structure—1:10:1), buffer tank (BT model structure 8:10:1), radiant floor heating (RFH model structure—1:10:2), and ground-source heat pump (GSHP model structure 2:10:1). The hidden layers for each ANN model structure was defined at 10 nodes, while sigmoid was elected as the activation function for each node of the models. Except from the FNNs, the different model types were thoroughly examined—including the transfer function (TF), process, state-space (SS), and autoregressive exogenous (ARX) types—and this was achieved using system identification techniques in MATLAB. The study also contrasted these newly created black box models with previously established gray box models. After evaluating the models in the visual and analytical mode, FNNs emerged as the top performer, followed by the ARX, TF, SS, process, and gray box models in descending order of performance.
The same year, Zhang et al. [
60] examined the efficiency of the data-driven models in predicting HVAC hot-water energy consumption in office buildings. Four models—change-point regression, Gaussian process regression, Gaussian mixture regression, and ANN—were evaluated using pre-retrofit building data as the baseline for retrofit projects. Each model’s performance was gauged using metrics such as
, RMSE, and
. The model structure accounted for the dry bulb temperature, solar radiation, humidity, as well as other variables as inputs to a FNN architecture with two hidden layers, where each layer is composed of 20 neurons and features a single-output neuron (which was aligned with the target value). According to the evaluation, the Gaussian mixture regression model slightly outperformed the others, while the FNN model required more training data. Despite their differences, all models, barring FNN, aligned with the ASHRAE Guideline 14 criteria for hourly predictions.
Also in 2015, Ardabili et al. [
61], aimed to enhance the control accuracy of an HVAC system by employing both a fuzzy control system and a radial basis function (RBF) model—a specific type of FNN—for predictive management. The model was developed to utilize temperature and humidity as inputs to predict the following four output variables: the coil valve, circulation air damper, fresh air damper, and moisture pump valve. The single hidden layer nodes varied from 4 to 24, where 20 was determined as the optimal value. According to the evaluation, the RBF network consistently outperformed the fuzzy system across all metrics. More specifically, the RBF network exhibited lower values for MAE (0.045908 for temperature and 0.054455 for relative humidity), MAPE (0.002181 for temperature and 0.000605 for relative humidity), and RMSE (0.0699 for temperature and 0.0903 for relative humidity). In addition, the RBF network showcased a high correlation coefficient (0.9243 for temperature and 0.8522 for relative humidity), thus indicating a strong linear relationship between its predicted and actual values, as well as highlighting its superior learning capability.
In 2016, a novel research conducted by Idowu et al. [
62] presented a comprehensive examination and forecast of heat load in buildings, which included aspects of both the building space and DHW. This was achieved through the following various machine learning (ML) methodologies: support vector machine (SVM), FNN, multiple linear regression (MLR), and regression tree. The information for constructing these models was derived from ten buildings, which were split evenly between residential and commercial types, located in Skellefteå, Sweden. The prediction models utilized inputs such as external temperature, historical heat load data, time-based variables, and the details of the district heating substations. The FNN algorithm was employed with N hidden layers, with the ideal N being chosen for each building’s dataset. The models’ performances were evaluated over forecast intervals that spanned from 1 to 48 h. The results revealed that SVM, FNN, and MLR were more effective than the regression tree method, and that they demonstrated comparable prediction accuracy while incorporating fewer errors in their forecasts.
The same year, Renno et al. [
63] developed and evaluated two FNN models to accurately predict solar radiation metrics: (a) daily global radiation (GR) and (b) hourly direct normal irradiance (DNI). By exploiting a mix of climatic, astronomic, and radiometric data, the models’ performances were evaluated under different neural network configurations, and the best ones were further assessed on new datasets. In both FNN models, the hidden neuron number started at 8 and increased until performance declined at 12 neurons, thus indicating 10 as the optimal number. Moreover, different combinations of activation functions were evaluated, indicating the tanh–tanh–lin configuration as the most appropriate one for both models. The results indicated correlations with the MLP models, which were then used to estimate the electrical energy output of the two different photovoltaic systems for a residential building. According to the evaluation, the GR model achieved a MAPE of 4.57%, an RMSE of 160.3 Wh/m
2, and an
of 0.9918, whereas the DNI model obtained a MAPE of
, an RMSE of 17.7 W/m
2, and an
of 0.994.
In 2017, Ahmad et al. [
64] conducted a study that assessed the effectiveness of a standard FNN trained with backpropagation in predicting the hourly HVAC energy use of a hotel in Madrid in comparison to a random forest (RF) model—another method that is gaining popularity in forecasting. Incorporating factors like the guest count slightly improved the predictions for both methods. When evaluating based on criteria such as the root–mean–square error (RMSE), the mean absolute percentage error (MAPE), the mean absolute deviation (MAD), the coefficient of variation (CV), and the
metrics, the FNN surpassed the RF model in all measurements. Though it should be underlined that both methods showed nearly identical accuracy, thereby indicating that they were both suitable for building energy predictions. The structure of the MPL involved a single hidden layer with 10–15 neurons featuring a single output neuron, which was aligned with the target value. The quantity of the input nodes varied, including a range chosen from ten factors like the outside air temperature, dew point temperature, relative humidity, wind velocity, hour of the day, day of the week, month of the year, daily guest count, and the total number of rooms reserved.
Also in 2017, Park et al. [
65] investigated the performance of a ground-source heat pump system (GSHP) that supplied heating and cooling to a hospital (
Figure 7). The GSHP system’s seasonal heating efficiency and operational characteristics were analyzed using real-time data. The researchers then developed two prediction models for the system’s performance: one based on multiple linear regression (MLR) and the other on an FNN. After an exploratory data analysis (EDA) on the raw data, the final FNN model featured 13 input variables and 3 hidden layers with 10, 5, and 2 neurons each. The MLR model was further refined to study the impact of specific variables, such as the temperatures of the source and load water inputs. When comparing the accuracy of the two models, the FNN model proved to be more precise than the MLR model. According to the evaluation (which was based on the coefficient of variation of the RMSE with no significant bias), when comparing the prediction accuracy, the MLR method exhibited a deviation of 3.56%, while FNN showed a tighter accuracy with a deviation of 1.75%.
In an interesting study in 2018, Kandasamy et al. [
66] introduced an innovative lighting control solution for net-zero energy buildings (NZEBs). The system was modeled using an FNN architecture integrated with the internal model control (IMC) principle for controller creation. Using FNN for modeling the lighting system simplified the task, thereby removing the need to handle vast and intricate systems and extensive data analysis. The suggested ANN-IMC controller relied on sensor feedback to maintain the desired light levels, and it is both easy to adjust and robust against variability. The training data for both modes, derived from testbed experiments (
Figure 8), comprised illuminance levels (lux) in tables and light power settings ranging from 0 to 100% in 5% increments. The single hidden layer for both models consisted of 10 neurons, thus reducing the overall data required for modeling. The outcome indicated energy savings of 54% and 40% regarding the desired light intensities of 300 lux and 500 lux, respectively, when compared to the baseline control approach.
The same year, Markovic et al. [
67] focused on the importance of accounting for occupant behavior, specifically window openings, in building performance simulations, which were used to estimate indoor climate and energy usage for HVAC systems more accurately. Traditional models often integrate biases and inefficiencies to handle large numbers of occupants. To address this, a Deep FNN model has been introduced in the current research for the purpose of predicting window openings in commercial buildings. The model was trained using data from a German office, and it was then tested on three distinct buildings. The network integrated an input layer of 25 neurons: 22 for the current time step features, and 3 for the indoor temperature, humidity, and CO
2 from 10 min earlier (while the output layer contained one neuron for window state prediction). The model’s practicality was evaluated by integrating it into a Modelica-based building thermal simulation, which presented accuracy rates between 86–89%, with the
(F-score statistical measure) ranging from 0.53 to 0.65 across the office buildings. Notably, while the model’s performance saw a decline of around 15% with limited input data, the
score remained relatively high.
Also in 2018, Gonzales et al. [
68] presented a new multi-agent system (MAS) in a cloud setting. This system, combined with a wireless sensor network (WSN), aimed to improve HVAC energy efficiency. The agents in the MAS learned from data and used a neural network (ANN) to understand the patterns. The system used sensor data to adjust to building conditions and the number of people present. It also considered the weather predictions and times when the building (
Figure 9) was not in use to fine tune the HVAC system’s operation. The FNN employed a sigmoidal activation function to prevent extreme values during training with the hidden layer containing
neurons, where n equals the number of input neurons. This method allowed for steadier temperature changes, thus avoiding sudden jumps that increased energy use. Their tests showed that this approach saved an average of 41% of energy in office environments. Interestingly, the energy saved was not always directly tied to the difference in the indoor and outdoor temperatures.
Deb et al. [
69] in 2018, established two predictive tools to save energy in HVAC systems for commercial buildings located in Singapore. By exploring both multiple linear regression (MLR) and ANN models such as MLPs, the primary aim of the work was to efficiently compare the aforementioned approaches and identify the best potential prediction model. To this end, 1 to 14 input variables from pre-retrofit energy audit reports were tested to identify the optimal MLP structure for predicting changes in the energy use intensity (EUI). The number of neurons in the hidden layer was fixed at 4, and this was determined through a sensitivity analysis involving 2–8 neurons. The outcome showed that the MLP illustrated an improved prediction performance of about 14.8% in the EUI in comparison with the MLR methodology.
In a 2019 study, Peng et al. [
70] introduced a learning-based control strategy aimed at allowing HVAC systems to adapt to individual thermal preferences as conditions change. The preference models were built using four key factors: time, indoor and outdoor weather conditions, and user behavior. A FNN, holding the best potential hyperparameters, was trained to predict the room temperature setpoint (Tsp) as adjusted by occupants. The structure of the model involved a two-layered MLP, in which the number of neurons in the hidden layer varied between 3 and 10. Over five months, this learning-based thermal preference control (LTPC) was tested on an HVAC system in both single-user and multi-user office settings. The results showcased energy savings between 4% and 25% compared to fixed temperature settings. Additionally, the need for manual temperature adjustment was significantly reduced from 4–9 days/month to just 1 day/month.
Also in 2019, Al-Waeli et al. [
71] aimed to compare various photovoltaic thermal (PVT) integrated energy systems—including conventional PVT, water-based PVT, water-nanofluid PVT, and nanofluid/nano-PCM—under identical conditions. A single MLP was used for this evaluation. The study sought to understand the efficiency variations in these systems, both thermally and electrically, using a singular MLP simulation system. The input parameters of the model concerned two input nodes in addition to solar irradiation data, ambient temperature data, a single hidden layer, and a single output, where the voltage, current, electrical, and thermal efficiencies for each respective energy system were varied. The model exhibited an MSE of 0.0229 during training and 0.0282 during cross-validation. The results from the FNN model indicated that the nanofluid/nano-PCM system increased electrical efficiency from 8.07% to 13.32%, while its thermal efficiency reached 72%.
In the same year, Ren et al. [
72] refined a sophisticated control model for HVAC systems to manage both indoor air quality (IAQ) and indoor thermal comfort (ITC) more effectively. The model utilizes low-dimensional linear models (LLVM for ventilation and LLTM for temperature) in conjunction with MLP neural networks and a contribution ratio index (CRI). The control system was underpinned by a database informed by computational fluid dynamics (CFD) and experimental data. The single-output ANN was specifically employed for IAQ prediction, where the air change per hour (ACH) and indoor pollutant sources were the inputs, and the indoor CO
2 level was the output, thus aiding in expanding the CFD database for a more accurate control of the HVAC system. This integrated approach enabled rapid and accurate predictions of environmental conditions like the CO
2 level and temperature. According to the evaluation, the application of this model in HVAC control can lead to significant energy savings, thereby reducing ventilation and air conditioning energy consumption by up to 50% and 32%, respectively.
In a 2020 study by Deng et al. [
73], a new approach was introduced to improve HVAC systems in offices with multiple users by exploiting data from wearable wristbands to gather physiological information. Using an MLP, the model was adequate for predicting thermal feelings based on the following indoor conditions: air temperature, relative humidity (RH), clothing level, thermal sensation, wrist skin temperature, and wrist skin. The optimal structure of the model consisted of six neurons on the input layer feeding a single hidden layer, as well as six neurons featuring a single output neuron, which was aligned with the thermal sensation vote (TSV). The MLP model was trained for a year using data from seven different offices, and it demonstrated high prediction accuracy. Using this data, Deng et al. developed a control system for HVAC systems to adjust the thermostat in real-time, thus improving comfort. Testing through experiments and simulations showcased that over 50% of the users felt neutral in terms of the temperature, while only a minor percentage felt discomfort. The energy use was similar to standard systems, but when combined with controls based on occupancy—achieved using light sensors or Bluetooth from the wristbands—the heating and cooling needs dropped significantly by 90% and 30%, respectively, in certain office areas.
In another important study in 2020 [
74], Chen et al., focused on improving the predictive control of HVAC systems in smart buildings by utilizing a high-fidelity deep neural network (DNN) model. This model was designed to accurately forecast the building’s thermal responses, incorporating the dynamics of natural ventilation. The study verified numerous deep-learning architectures that exploited environmental data concerning outdoor air temperature, dew point temperature, indoor air temperature, and relative humidity, as well as the operational status of space heating, cooling, and natural ventilation. The elected model integrated six hidden layers with varying node configurations, while its purpose concerned the prediction of indoor air temperature and relative humidity at future time steps. The key innovation of the research was the application of transfer learning, where the pre-trained DNN with extensive data from one building was adapted for use in a different building by retraining only a small subset of its parameters. This method allowed accurate predictions of the indoor temperature and humidity with significantly less data from the new building, thus demonstrating that transfer learning can expedite the deployment of smart building technologies by reducing the time and cost associated with model training. According to the evaluation, the study’s transfer learning model achieved the lowest mean squared error (MSE) of 0.16 for temperature and 2.52 for humidity predictions, thus outperforming other models and demonstrating superior prediction accuracy in HVAC system control.
In 2021, Luo et al. [
75] explored the integration of lighting control and a building-integrated photovoltaic (BIPV) system to optimize energy consumption in buildings. By introducing three machine learning frameworks—FNNs, support vector regression (SVM), and long-short-term-memory neural networks (LSTM)—the study aimed to simultaneously predict multiple building energy loads and BIPV power production. The primary goal was to manage energy demands efficiently given the shared influencing factors. The structure of the particular FNN concerned multiple input nodes receiving data from the weather station, the building operation schedules, and the recorder energy data in order to determine the heating, cooling, and lighting load, along with the BIPV power production in the output layer. The single hidden layer varied between 2 and 50 to balance the model effectiveness and computational time. According to the final evaluation of the tested models, the FNN offered the highest accuracy, while the SVM boasted the quickest computation time.
Also in the same year, Kabilan et al. [
76] introduced an energy prediction for a building-integrated photovoltaic system by considering different building orientations via the utilization of ML techniques. The prediction approach included stages for data quality, ML algorithms, weather pattern grouping, and accuracy evaluation. The FNN approach utilized therein consisted of a DNN using four input neurons forwarding solar radiation, wind speed, relative humidity, and temperature data to a couple of hidden layers that consisted of 10 neurons each. The PV generation was determined at the output layer and consisted of one node. The findings indicate that, by applying linear regression coefficients to the neural network predictions of PV energy generation, the forecast’s precision was enhanced. The concluding model displayed accurate predictions with a root mean square error of 4.42% using the FNN, 16.86% with quadratic support vector machine (QSVM), and 8.76% with decision tree (TREE).
In a 2022 study by Elnour et al. [
77], a control strategy using FNNs was introduced to optimize the HVAC system in Qatar University’s sports hall. This method considered predictions of future system behavior, blending both forecasting and optimization components. The FNN model, responsible for predicting the HVAC system’s dynamic behavior, was tested against other machine learning (ML) techniques, including support vector regression (SVR), k-nearest neighbor (k-NN), and decision tree (DT). According to the evaluation, the FNN model surpassed these ML techniques, achieving an average root mean squared error (RMSE) of approximately 0.06 and a correlation coefficient of 0.99, thus indicating its reliability and precision. Two variations of the FNN strategy were tested for the sports hall’s HVAC system. The results showed significant energy savings of up to 46%, while also ensuring optimal thermal comfort and air quality indoors.
In a 2022 research, the study of [
78] proposed a novel approach to occupancy prediction in various building spaces, which was undertaken using sensorial data and advanced deep learning techniques. The study harnessed a comprehensive set of sensor data, including indoor and outdoor environmental parameters, Wi-Fi device connections, energy usage, HVAC operations, and time-related information. A new feature selection algorithm was developed to sift through this data, in which key factors critical for accurate occupancy predictions were identified. The study implemented several deep learning models, such as deep FNNs, LSTM networks, and gated recurrent units (GRUs), in different settings for commercial buildings. The structure of the FNN specifically considered 3 hidden layer units consisting of 32 neurons each, while the single-note output layer predicted the occupant count. The findings revealed that different models excelled in different environments, and they found that the indoor CO
2 concentration and the number of Wi-Fi-connected equipment were the highest influential attributes for accurate occupancy forecasting.
4.2. Review of Recurrent Neural Network Applications for BEMS Control
In the same year, Ferlito et al. [
79] introduced a comprehensive procedure for creating effective ANN frameworks in terms of estimating a building’s energy needs with respect to HVAC and lighting operation. The efficacy of this procedure was validated through a case study where a straightforward nonlinear autoregressive (NAR) model was constructed and its precision was assessed for prediction spans of 3, 6, and 12 months. The NAR network received historical data from the monthly electric consumption of a public building, while the single hidden layer integrated six neurons that delivered the forecasted energy demand (NAR output). The simulated results exhibited strong regression values across all forecast periods, where the deviations quantified as the RMSPE (root mean square percentage error) equated to 15.7%, 17.97%, and 14.59% at the 3-, 6-, and 12-month prediction intervals, respectively. The results suggested that the NAR acted sufficiently toward the building energy requirements only when energy consumption time series data was accessible.
In a 2017 study, Chen et al. [
80] introduced a data-driven strategy that forms a cycle for precise predictive modeling and the instantaneous management of building thermal dynamics. This method relies on a deep RNN that utilizes large volumes of sensor data. The refined RNN was then integrated into a finite horizon-constrained optimization problem. To convert this constrained optimization into an unconstrained one, the researchers implemented an iterative momentum-based gradient descent method with momentum to determine the best control inputs. The simulation results demonstrated that this approach surpassed the model-based strategy in terms of both building system modeling and management. According to the simulations, this method enabled a set of control decisions that reduced energy consumption by 30.74%. In contrast, the solution derived from the RC model led to only a 4.07% decrease in energy usage.
Also in 2017, Sun et al. [
81] proposed an advanced control method for residential solar photovoltaic (PV) systems using RNN to optimize the power output and ensure efficient grid integration. The RNN was trained to manage a single-phase inverter with an LCL filter, aiming to improve system performance, safety, and reliability. The structure of the RNN considered four nodes in the input layer to take in the error and integral of the error terms related to the grid-connected current, two hidden layers with six nodes each, and a single output representing the control voltage for the inverter in the d-q frame. This control voltage was then used to adjust the operation of the solar inverter, thus ensuring efficient power extraction and grid integration. Through simulations and experimental setups, the RNN strategy was benchmarked against conventional control methods. The results showed that the RNN provided superior performance, and it maintained stability and maximizing power extraction under various conditions, including during disturbances and non-ideal scenarios. The research highlighted the potential of ANN-based controls in enhancing the effectiveness of residential solar PV systems.
In 2019, Kazem et al. [
82] assessed the performance of a building in an integrated photovoltaic (PV) system located in Sohar University, Oman. The PV system’s power, energy outputs, yield, capacity factor, energy cost, and payback period were monitored and analyzed over a year. To efficiently predict the PV system’s energy output more accurately, specified models using a deep learning approach, including time-lag recurrent networks (TLRN) and various configurations of fuzzy RNNs (FRNNs), were developed (based on temperature and solar irradiance) to forecast the PV system’s current (I) output. Both of the types of models utilized one or two hidden layers with varying numbers of neurons and utilized a momentum learning method. The article revealed that the highest energy production and yield ratios were achieved with a capacity factor of 21.7%, a cost of energy at 0.045 USD/kWh, and a payback period of over 11 years. Among the predictive models, the FRNN-2 and FRNN-3 cases outperformed the others and displayed lower mean square errors, thereby indicating a more accurate fit to the experimental data.
The research by Sendra et al. in 2020 [
83] proposed an LSTM model for predicting a building’s HVAC energy consumption for the next day. The system was based in Madrid’s MagicBox, a house powered entirely by solar energy and fitted with monitoring equipment (
Figure 10). The particular study explored various LSTM neural network configurations and employed techniques to enhance the initial data set. The LSTM model received input data related to both indoor and outdoor conditions. These inputs included the outdoor temperature, relative humidity, irradiance, indoor CO
2 level, indoor temperature, and the reference temperature set by the user. The hidden layers consisted of two LSTM layers that preserved an equal number of nodes, and this was determined through hyper-parameter optimization. The output of the LSTM layers feeds into a fully connected (dense) layer, which is responsible for generating the predicted power consumption for the HVAC system. According to the final evaluation, the LSTM configurations showcased a commendable performance with a test error rate (NRMSE) of 0.13 and a 0.797 correlation between the predicted and actual data points. When contrasted with a straightforward one-hour-ahead forecasting model, the results were nearly on par, thus highlighting the viability of real-time energy estimations for building structures.
In the same year, Correa et al. [
84] employed deep learning techniques to predict the performance of a solar hot water (SHW) system under varying weather conditions in Chile. Using TRNSYS, a physical simulation model was created to generate a vast amount of simulated data. The different NN architectures received the ambient temperature, the solar field’s inlet temperature, the control signal of pumps (which is indicative of the operational status of the system’s pumps), the inlet temperature at the heat exchangers, as well as the previous values of the solar collector’s outlet temperature data. The primary aim was to predict the future values of the solar collector’s outlet temperature, and it portrayed a key performance indicator of the SHW system, thereby reflecting its efficiency and effectiveness in utilizing solar energy for heating water. Among the models tested, which included FNN, RNN, and LSTM architectures, the LSTM showcased superior prediction accuracy. When compared to traditional regression models, all three architectures, especially the LSTM models, delivered more reliable results, thus indicating their potential for predicting SHW system performance. More specifically, the LSTM model excelled in its predictions, achieving a low mean absolute error of 0.55 °C, the smallest root mean square error of 1.27 °C, as well as minimal variance and relative prediction errors.
In an interesting study in 2020, Heidari et al. [
85] proposed advanced machine learning techniques for predicting the energy use in solar-assisted water heating systems by comparing multiple model architectures. The ANNs received input data as historical energy data; temporal variables like the hour and day; as well as environmental and operational Variables such as indoor and outdoor temperatures, solar radiation, relative humidity, and wind speed. The number of nodes in the input layer corresponded to the number of features used while the output layer of the models predicted the next time step of the energy use, thus making it a regression problem. To this end, multiple model architectures (LSTM, ALSTM, ALSTM-D, and FNN) were experimented on with different topologies, where the best performance was observed in a configuration with two LSTM layers containing 150 neurons in each hidden LSTM layer. The study compared the performance of the enhanced LSTM models—both with and without the attention mechanism and data decomposition—against a traditional FNN. The enhanced LSTM models demonstrated significantly lower mean absolute errors, and they outperformed the baseline FNN model by 25% to 41%, thus indicating a more accurate prediction of the energy use in solar heating systems.
In 2021, Tagliabue et al. [
86] presented a technique that combined indoor air quality data, which were gathered from Internet of Things (IoT) sensors, to inform indoor environment changes based on how many people were present in a building at the University of Brescia’s Smart Campus (
Figure 11). The method involved using a RNN that incorporates LSTM units trained on real-time data, which then guided the ventilation changes through an IoT communication system. The structure of the networks that utilized the CO
2 in the air, the indoor temperature readings, and relative humidity (RH) data as inputs involved four hidden layers (recurrent LSTM, additional LSTM, sequence layer, and a fully connected layer). As its purpose was to predict the CO
2 concentration as the output, the main goal of this research effort was to adjust the HVAC system and determine the optimal behavior for window operation in order to improve the indoor air quality. This, in turn, aimed to boost the cognitive performance of the building’s occupants, even as conditions changed. The research paper utilized Pearson’s correlation coefficient (
) with values of 0.93, 0.88, and 0.92 for the training, test, and whole datasets, respectively, as well as the mean square error (MSE) for the test period (which was approximately 10.6% of the average CO
2 concentration).
Also in 2021, Fang et al. [
87] proposed a novel approach using a sequence-to-sequence model grounded in LSTM neural networks, which was employed for the advanced prediction of indoor temperatures and aimed to optimize the energy efficiency of HVAC systems. The model intricately processes a blend of historical indoor temperatures, as well as external factors such as forecasted outdoor temperatures and time-related data serving as the inputs. These inputs are skillfully encoded through LSTM layers for predicting the forecast horizon, which are adept at retaining important temporal information across sequences, thus ensuring a robust feature extraction. The model architecture consisted of a dynamic duo: an LSTM encoder that digests the input data, and an LSTM decoder that is fine tuned for generating accurate multi-step future temperature predictions. This architecture was fine tuned with hyperparameters such as the learning rate and number of hidden nodes, which were further reinforced with dropout techniques to curb overfitting. The models optimal prediction was determined for a single hidden layer with 128 nodes. Moreover, the efficiency of the approach prowess was benchmarked against established methods like Prophet and seasonal naive models, whereby it delivered superior performance in very short-term forecasting scenarios. What is noticeable is that their effort integrated a real-life application of this model, and it was tested in a real-world building environment (
Figure 12) in order to showcase its potential in enhancing energy savings without compromising occupant comfort, thus marking a leap forward in intelligent building management systems.
4.3. Review of Hybrid Neural Networks Applications for BEMS Control
In an interesting hybrid study in 2015, Huang et al. [
88] introduced a novel control approach for HVAC systems in commercial buildings, where the aim was to reduce energy use and costs. The approach combined a traditional MPC with an MLP and RNN (NARX) feedback method. The control model was based on a simplified representation, while the complexities in the HVAC process were addressed using an MLP structure that included three hidden layers and a single output node. More specifically, the MPL modeled the nonlinear input–output relationship of the building’s HVAC system, in which the system states (e.g., temperature) or control actions (e.g., valve positions) were predicted based on inputs like environmental conditions and desired output states. The MLPs received data that included the past and present values of various environmental and system parameters like the chilled water temperature, return air temperature, outdoor temperature, air mass flow rate, and the desired output while also predicting system states like temperature. The secondary employed RNN was a network able to capture the temporal dynamics of the building’s HVAC system, thus making it essential for accurately predicting system responses over time and aiding in the control process. The effectiveness of the methodology was evaluated at Adelaide Airport’s building using simulations and real-world settings, and this was achieved by incorporating such advanced air-conditioning strategies to optimize the energy efficiency. The results highlighted significant energy and cost savings of 13% without compromising comfort compared to the baseline control methods.
In a novel research effort in 2015, Papantoniou et al. [
89] focused on enhancing energy efficiency in a hospital building in Chania, Greece by integrating a building optimization and control (BOC) algorithm into its existing BEMSs (
Figure 13). The hybrid control approach integrates Elman RNN models for predictive modeling/temperature predictions, GAs for multi-step optimization, and fuzzy techniques for real-time control, all of which are aimed at enhancing energy efficiency in the building. With respect to the deployed RNN, it was aimed to predict indoor air temperatures, and it was trained with five real data inputs such as the indoor air temperature, time, convective transfer of windows, HVAC coil operation, and the HVAC fan consumption. The three hidden layers of the ANN consisted of 354 nodes each. Implemented within a web-based energy management and control system (Web-EMCS), the BOC algorithm effectively determines optimal temperature set points and monitors real-time data, such as energy savings. According to the real-life evaluation, the system demonstrated a potential energy saving of 36%.
In the same year, Garnier et al. [
90] proposed a novel predictive control approach targeting the enhancement of HVAC system operations in non-residential buildings. A building in Perpignan, France was modeled using EnergyPlus to gauge comfort using the predicted mean vote index. MLP served as the core of the predictive models that emulate HVAC system behaviors. The study used six self-growing ANNs, which were trained by the cascade-corellation algorithm to model variables like air and radiant temperatures, as well as the electrical power consumed by HVAC subsystems. As inputs, the models receive various parameters like the outdoor temperature, solar radiation, room occupancy, current air temperatures, radiant temperatures, and HVAC temperature set-points in adjacent rooms to predict the values of the air temperature, radiant temperature, and electrical power consumed by the building’s HVAC subsystems for each time step. The MLPs were developed for different operational modes (heating and cooling) and specific building areas. Such models were employed within an optimization framework powered by genetic algorithms (GAs), which efficiently dictate the optimal activation and deactivation times for HVAC components in both heating and cooling modes.
In 2015, Wei et al. [
91] implemented an ensemble of multi-layer perceptron FNNs to develop an extensive energy model for a building. The model features a variable hidden layer, where the neuron nodes in the layer are randomly set between 10 to 40 during the training phase. It includes three indoor air quality models: the temperature model, the relative humidity model, and the CO
2 concentration model. To balance the power consumption with the indoor air quality, a four-objective optimization problem was formulated. This challenge was tackled using an enhanced particle swarm optimization (PSO) algorithm, which determines control parameters for the supply air temperature and static pressure in the air management unit. By assigning different weights to the objectives within the model, the optimized control parameters effectively balanced the HVAC system’s power usage with the establishment’s thermal comfort. Simulation tests showed that the MLP ensemble method outperformed seven other techniques. It was thus selected for creating both the comprehensive energy model and the trio of indoor air quality (IAQ) models. The overall energy savings for the dataset in this study amounted to 17.4% without IAQ constraints and 12.4% under IAQ limitations for one of the eight user preference scenarios.
In 2016, Attaran et al. [
92] proposed an innovative method for optimizing the energy efficiency of HVAC systems by combining a radial basis function neural network (RBFNN) with an epsilon constraint (EC) PID approach. This hybrid method employs RBFNN within the HVAC system to predict residual variances, which enhance the control signal and minimize errors. The primary objective was to design and evaluate the EC-RBFNN for a self-adjusting PID controller, and it is specifically tailored for a particular bilinear HVAC system with a focus on temperature and humidity control. Through comparative simulation case studies, the EC-RBFNN approach was found to be more accurate than both standard PID optimization and the combined PID-RBFNN method. The results showed that the hybrid EC-RBFNN methodology reduced the integral absolute error (IAE) by 18% for temperature and 20% for humidity measurements.
In the same year, Kim et al. [
93] focused on improving the energy efficiency of integrated daylighting, heating, ventilating, and air conditioning (IDHVAC) systems in buildings. The researchers developed a meta-model to predict the building’s energy performance, which integrates artificial lighting regression and ANN models. This model was trained on a database generated by the EnergyPlus simulation tool, where the design of experiment (DOE) method is used to ensure robust training without overfitting in order to predict the room temperature, total energy consumption, and indoor daylight illuminance. The number of input variables for the ANN models varied depending on the specific model; meanwhile, the number of hidden layers was fixed at three for the ANN models and the number of neurons in each hidden layer was optimized using a genetic algorithm to minimize the total energy consumption while maintaining thermal and visual comfort for occupants. The GA optimization was applied to controllable variables within the IDHVAC system. According to the findings, which were based on three winter months of data, the GA-optimized IDHVAC system achieved an average energy savings of 13.7% compared to a conventional setup. When the optimization was applied separately to the HVAC system and Venetian blinds, the GA-optimized HVAC alone achieved 11.7% energy savings, while the combined IDHVAC optimization provided a higher savings rate.
In 2017, Yuce et al [
94] proposed a novel framework for optimizing household energy management by scheduling appliances to operate at times that reduce peak energy demand while also maximizing the use of renewable energy. By integrating an ANN model—which was used to predict the energy demand and supply from RES along with a genetic algorithm (GA)—it was found to be adequate in receiving the predictions from the ANN, and it utilized them to create an optimized schedule for operating home appliances. With respect to the structure of the FNN, the inputs included environmental variables (outdoor temperature, wind speed, diffuse solar radiation, etc.), occupancy, appliance states, and time information. The remaining duration time for each appliance was an additional input. The outputs, on the other hand, included the total energy consumption, PV energy generation, and wind power generation—where the individual energy consumption for each appliance was also used as an output. The FNN used a single hidden layer, where the number of neurons in the hidden layer was determined experimentally. The best performance was found with 25 neurons. This smart scheduling led to a significant reduction in the grid energy consumption and shifted energy usage with respect to the periods with available RES energy, thus achieving more sustainable and cost-effective home energy use. The methodology was tested in a home environment, where it demonstrated reductions in grid energy dependence of 10%, 25%, and 40% on different occasions.
In 2019, Reynolds et al. [
95] presented two different strategies for improving district energy management. The first focuses on enhancing the production of district heat from a multifaceted energy center, while the second combines this with direct control over building heating demand. The FNN models utilized the input data, such as the predicted outdoor temperature, solar irradiance, hour, day type, occupancy, the temperature set point, and the indoor temperature, of the previous hour. Each FNN integrated 2 hidden layers with 15 neurons per layer. One FNN predicted the hourly energy consumption, while the other predicted the hourly average indoor temperature. Such predictions were utilized with a genetic algorithm to determine the most efficient operation schedules for the heating equipment, thermal storage, and heating set points. The results showed a significant profit increase when optimizing heat production, as well as an even greater profit, as the system was adequate enough to adjust the building energy demand directly. According to the evaluation measurements, by focusing on optimizing the district heat production, there was a notable 44.88% profit boost compared to a standard rule-based strategy. When the system was also given the capability to directly influence the building energy demand, an extra 8.04% profit enhancement was observed.
Also in 2019, Satrio et al. [
96] proposed a novel hybrid framework for enhancing the energy efficiency and thermal comfort of an educational building’s HVAC system (
Figure 14). The researchers established a typical MLP to accurately predict the building’s energy consumption and the thermal comfort level of its occupants, as measured by the percentage of people dissatisfied (PPD) index. The structure of the FNN involved 10 input nodes—which received data such as the cooling set point, RH set point, starting delay, stopping delay, supply air flow rate (VAV system), window area, wall thickness, supply air temperature, supply radiant temperature, and supply radiant flow rate. The single hidden layer integrated three neurons, while the output layer included two output nodes for annual energy consumption and PPD predictions. By utilizing a multi-objective genetic algorithm (MOGA) framework, it was found to be adequate enough to generate the optimal operation settings for the building’s two-chiller HVAC system. The optimized system showed significant improvements in maintaining thermal comfort while also reducing annual energy consumption when compared to the baseline control operational settings.
In 2020, Bourhnane et al. [
97] studied the prediction and management of energy consumption in smart buildings using machine learning techniques by employing FNNs in conjunction with GAs to model energy usage. The system was applied to a real-world setting, and it was used to process the data from a PV solar panel installation and various electrical appliances within the building. The networks were structured with 2 input neurons, a single hidden layer of 10 neurons, and a single output neuron. To this end, the model received two types of input: a timestamp of the energy consumption measurement and a unique identifier for each AC, fridge, furnace, and microwave appliance. The sole output of this model is the prediction of the energy consumption for each appliance. The evaluation illustrated that the hybrid FNN-GA approach showed the highest accuracy among all of the evaluated prediction methods that utilized the regression and the support vector machine models.
In a 2021 study, Elmaz et al. [
98] introduced an advanced hybrid (CNN-LSTM) model. This model combines effective feature extraction with sequential learning to predict room temperature. The data from a room at Antwerp University was used to develop this control system. The hybrid CNN-LSTM network architecture comprised an input layer that received sensorial data from the motion detector (binary), as well as targeted set-point temperature, variable air volume (VAV) flow rate, window (binary), outside temperature, and room temperature data. The input layer is followed by two CNN layers for feature extraction. It then transitions to two LSTM layers to capture temporal dynamics, as well as uses a flatten layer to consolidate features. It then concludes with an output layer consisting of a single node dedicated to predicting temperature. The new approach was then benchmarked against the traditional MLP and a basic LSTM for short-term predictions spanning from 1 to 120 min. While all of the neural network models performed well for 1 min predictions, the CNN-LSTM model stood out in longer forecasts by consistently achieving an
value greater than 0.9 for predictions up to 120 min.
In the same year, Somu et al. [
99] proposed a novel transfer learning-based approach for the purpose of enhancing thermal comfort modeling in buildings, which is crucial for occupant well-being and productivity. To this end, a hybrid CNN-LSTM model was developed to capture the spatio-temporal relationships in thermal comfort data. The sophisticated model utilizes the data from thermal comfort parameters (TCPs) as the input, which includes the following: the indoor air temperature, indoor air temperature, indoor relative humidity, air velocity, mean radiant temperature, outdoor air temperature, as well as occupant age, gender, clothing insulation, and metabolic rate. The first hidden layer of the model is a 1D convolutional layer equipped with 128 filters and a kernel size of 5, which is followed by two LSTM layers, each with 256 nodes. Following the LSTM layers, the two fully connected dense layers respectively include 64 and 16 nodes. To this end, the five-node output layer provides model predictions of the different thermal comfort levels, which are categorized into a five-point thermal sensation scale (target classes: very cold/cold, slightly cool, neutral, slightly warm, and hot/very hot). The key modeling parameters for the TL CNN-LSTM model were determined using the Chi-squared test. At the same time, the issue of insufficient data samples across all of the thermal conditions was tackled using the synthetic minority oversampling technique (SMOTE). The model’s efficacy was tested on two source datasets (ASHRAE RP-884 and Scales Project) and a target dataset (Medium US office), and it demonstrated an accuracy of over 55% in the target buildings with limited data.
In their research work in 2021, Huang et al. [
100] proposed an integrated ant colony optimization (ACO)-enhanced wavelet neural network (I-ACO-WNN) for the precise prediction of building heating and cooling loads. The model was assessed using key performance metrics, and it showcased its superior accuracy over traditional methods. Its WNN portrayed a special form of FNN using a wavelet basis function (WBF)—specifically the Mexican Hat wavelet—as its transfer function. The input layer received data that included eight building parameters: the relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, and glazing area distribution. The prediction of the network involved two output nodes, which represented the heating load (HL) and cooling load (CL) of the buildings. According to the evaluation, the regression coefficients for the heating load HL and CL predictions were 0.9714 and 0.9783, respectively. Compared to the standard wavelet neural network, the novel I-ACO-WNN model significantly reduced the prediction errors: the RMSE for HL and CL decreased by 66.01% and 73.28%, respectively; the MAE decreased by 82.44% and 84.82%, respectively; the MAPE decreased by 81.21% and 85.31%, respectively; and the MSE decreased by 88.44% and 92.86%, respectively.
In 2022, Afroz et al. [
101] introduced a novel technique to curtail the energy use in HVAC systems, yet they also upheld the ambient internal standards as this necessitates extra power expenditure. The approach involved a complicated problem with several variables, which was resolved through a particle swarm optimization (PSO) algorithm. For regulating the internal indoor comfort and HVAC energy utilization, contemporary predictive formulations were established utilizing a nonlinear auto-regressive exogenous (NARX) neural network. The models utilized sophisticated information regarding multiple HVAC system and environmental parameters for the purpose of predicting the energy consumption, as well as indoor temperature, humidity, CO
2, and VOC. NARX formulations underwent refinement to achieve the ideal forecast precision, and an ease of integration was conducted in the actual systems. To this end, the formulations were enhanced to determine the most efficient HVAC adjustments, which was achieved by considering seasonal alterations. The best performance was found with 12 hidden neurons and 2 time delays for the energy consumption prediction NARX, while the optimal NARX size for the indoor temperature, humidity, CO
2, and VOC predictions were determined to be 10 hidden neurons and varying numbers of time delays (e.g., two or three). The outcomes revealed the feasibility of decreasing the overall power consumption by 7.8% without sacrificing internal conditions like the air warmth (19.60–28.20 °C) and moisture levels (30–65%).