Predicting Heating and Cooling Loads in Energy-E ﬃ cient Buildings Using Two Hybrid Intelligent Models

: Today, energy conservation is more and more stressed as great amounts of energy are being consumed for varying applications. This study aimed to evaluate the application of two robust evolutionary algorithms, namely genetic algorithm (GA) and imperialist competition algorithm (ICA) for optimizing the weights and biases of the artiﬁcial neural network (ANN) in the estimation of heating load (HL) and cooling load (CL) of the energy-e ﬃ cient residential buildings. To this end, a proper dataset was provided composed of relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, glazing area distribution, as the HL and CL inﬂuential factors. The optimal structure of each model was achieved through a trial and error process and to evaluate the accuracy of the designed networks, we used three well-known accuracy criterions. As the result of applying GA and ICA, the performance error of ANN decreased respectively by 17.92% and 23.22% for the HL, and 21.13% and 24.53% for CL in the training phase, and 20.84% and 23.74% for HL, and 27.57% and 29.10% for CL in the testing phase. The mentioned results demonstrate the superiority of the ICA-ANN model compared to GA-ANN and ANN.


Introduction
A large portion of energy (more than 30%) is used in the building sector globally [1]. In a building, thermal energy involves two measures of cooling load (CL), and heating load (HL) [2] and these measures are regulated by heating ventilation and air conditioning (HVAC) system. The HVAC system is designed to compute the HL and CL of the space and thereby, provide a desirable indoor air condition. In this sense, some studies have focused on evaluating comfortable, yet energy-saving spaces [3]. Required cooling and heating capacities are estimated mainly according to the basic factors, including building properties, its utilization, and climate conditions. Sant'Anna et al. [4] conduct research for conventional and green buildings in terms of indoor environmental quality perception. In another research, Caniato et al. [5] studied the expectations of the thermal and acoustic performance of wooden buildings against a heavyweight structure.
Therefore, more sustainable consumption of energy can be ensured by a proper examination of the energy performance of buildings (EPB) and optimal designing of the HVAC system. Although many countries take such measures, there is still a high level of energy consumption, and it is projected to increase globally. A great number of scientists believe that such an increase is because of rising living standards. For example, building construction in Europe must meet special conditions related to energy efficiency. It should be noted that 40% of energy in Europe is consumed in buildings [6]. According to reports, this value approximately comprises 39% of the total primary energy requirement (PER) and 27.5% of total national energy use in the United States [7] and China [8], respectively.
According to the items mentioned above, many engineers have tried to develop different predictive and evaluative tools the primary aim in order to produce an optimal approximation of building energy consumption. So far, building energy has been modelled in a number of attempts [9,10]. Generally, there are two common methods for EPB evaluation: Forward modelling and inverse modelling [11]. In former, the tools are used to simulate the energy performance by taking into account the actual attributes of the building (e.g., geometry). Since the application of such simulators involves control over certain parameters that are difficult to be set in practice, there are some related disadvantages. However, the chief problem in forwarding modelling is that it is not suited to appraising the EPB of the occupied buildings [12]. In addition, this method is time-consuming and requires a lot of precision as it involves a large number of parameters. Moreover, employing various simulation programs can yield estimations with varying accuracies [13]. Nonetheless, these programs could be reliable in evaluating the effects of a single parameter on EPB when other parameters remain unchanged. Designer's Simulation Toolkit (DeST) [14], Energy Plus [15], DOE-2 [16] are among the well-known computer software tools used in this subject.
To assess EPB, many researchers have turned to the inverse (data-driven) modelling [17] in order to avoid the drawbacks of the simulation packages. In this sense, if a sufficient number of instances (consisting of some inputs and targets) are available, it is viable to evaluate the impact of influential factors (e.g., relative compactness, roof area, and orientation) on EPB [18,19]. The chief advantage in using artificial intelligence (AI) is its high-performance speed, as well as the ease of implementation. Moreover, the optimal structure of machine learning methods can easily handle the changes in variables (changing in the building design parameters, in this case) that is a formidable challenge for designers and architectures alike. In what follows, an extensive report is presented in the previous attempts that investigated the feasibility of various AI techniques in EPB. Notably, although AI techniques have been effectively used in estimating and optimization of the energy performance of buildings, few researchers have focused on enhancing the effectiveness of the typical predictive models. Hence, the focal effort of the current research is to design two metaheuristic algorithms to improve the quality of the estimation of the artificial neural networks in the field of HL and CL of the residential buildings. To do this, after providing the suitable dataset, an extensive trial and error process was executed to achieve the optimal structure of the proposed models namely, multi-layer perceptron (MLP) neural network, the combination of artificial neural network (ANN) and genetic algorithm (GA-ANN), and the combination of ANN and imperialists competition algorithm (ICA-ANN).

Literature Review
Similar to many other fields of research, AI techniques have attracted great attention in building energy consumption evaluation. Among various types of AI techniques, artificial neural network (ANN) is a well-known technique that is widely employed for many simulation problems [20,21]. Moreover, ANN has emerged as an effective tool in building energy management [22,23]. Mejías et al. [24] used linear regression models and artificial neural networks to estimate three concepts of heating and cooling energy demands, energy consumptions and CO 2 emissions in office buildings. They discovered that the ANN system could perform better than other models as it yields results with greater accuracy. Deb et al. [25] investigated ANN capability in forecasting the diurnal cooling energy load and found that ANN can successfully predict daily energy use by taking into account the records of five previous days. In addition, Li et al. [26] conducted a comparative study that focused on the estimation of an hourly cooling load of the building. Their study compared support vector machine (SVM) against three common types of neural networks: Traditional backpropagation neural network (BPNN), general regression neural network (GRNN), and radial basis function neural network (RBFNN). They found that SVM and GRNN could offer better prediction than the RBNN and RBFNN predictive tools, based on the root mean square error (RMSE) and mean relative error (MRE). In addition to ANN, many other methods have been used in dealing with energy-efficient residential buildings problem. For instance, Yun et al. [27] effectively used an indexed ARX regression model in order to forecast the thermal load of building on an hourly basis. In a case study, by Yao et al. [28] examined the ability of a combined forecasting model based on the analytic hierarchy process (AHP) in appraising the hourly cooling load. They showed that combined models could outperform individual ones. In addition, some studies have investigated the efficiency of SVMs regarding the same objectives. Dong et al. [29] evaluated the SVM regression method in approximating building energy consumption in case of a tropical region. Their results show that this model is reliable enough based on the calculated coefficients of variance (CV) (less than 3%) and a small percentage error (4%). Jing and Xiaojuan [30] modelled the dynamic cooling and heating load of an office building by developing a hybrid method based on wavelet transform, SVM, and partial least squares regression (PLS) to. Accordingly, they chose a building located in Tianjin, China. A multiple-horizon prediction was performed after they carried out a sensitivity analysis for selecting the best inputs combination. According to their results, the developed model can be used to identify an accurate perception of the problem. Li et al. [31] synthesized an adaptive neuro-fuzzy inference system (ANFIS) with a genetic algorithm in order to improve its accuracy in estimating building energy consumption. They found that this synthesis not only does provide a reliable model, but also gives better performance than does the ANN model. In addition, Random forest (RF) model has been applied for the quantitative prediction of energy performance [32]. Similarly, multiple linear regression can yield a correlation coefficient of 0.987 for evaluation of the heating energy demand [33].
Hereafter, four general sections form the body of this research. Explanations about the considered models are available in Section 3. Section 4 is devoted to data preparation, and statistically analysis of them. Following this, the outcomes are presented and discussed in Section 5. Eventually, Section 6 is the conclusion giving a brief report of the current research.

Multi-Layer Perceptron (MLP)
The idea of an artificial neural network (ANN) was first suggested in 1943 by McCulloch and Pitts [34]. Originally, ANNs are inspired by the human neural network and has been satisfactorily used for several fields of research [35,36]. The main advantage of this model lies in the ability of mapping the non-linear relationship between sets of input-output pairs. Among various kinds of ANNs, multi-layer perceptron (MLP) is a reliable tool which is designed for the case of this study to predict the HL and CL of the residential buildings. As is shown in Figure 1, an MLP is composed of three layers that each layer contains a number of computational units (i.e., neurons). During the MLP performance, it has been aimed to find the optimal values of weights (W) and biases (b) to obtain the best accommodation between the targets and outputs of the system. More specifically, let T be the input vectors; then, the output of the k-th neuron is calculated as follows: where T denotes the input value, W and b are the assigned weight and bias, respectively. Also, the term F symbolizes the activation function.

Genetic Algorithm (GA)
Inspired by the theory of natural selection, Genetic algorithm (GA) was first presented by Holland [37]. GA is one of the most effective evolutionary computing which one of its aims is to improve the applicability of various networks, ANNs for example. Dealing with highly non-linear and intricate problems can be mentioned as the excellence of GA. Accordingly, for most modelling aims, there are various conditions that can be well handled by GA. Geotechnical simulations are a good instance of such cases, associated with changes in loading condition (dynamic and static loading patterns), continuous or discontinuous soil media, and linear or nonlinear soil behaviors. Due to different parameters that must be optimized for achieving a proper GA, it can be categorized as a problematic optimization algorithm (OA).
As stated supra, GA is a heuristic search technique that follows the natural selection rules to provide an effective solution. GA is a well-known component of a larger group of evolutionary algorithms (EA) that perform based on natural evolution techniques suchlike crossover, mutation, selection, and inheritance [38]. In the commencing of GA performance, some so-called solutions "population" are involved. In this procedure, new populations (offspring) are generated by existing relations. The fitness of each chromosome is evaluated to select two parent chromosomes from the population. It is notable that this determines the chance of relations to reproduce. In other words, the more fitness they have, the more likely to reproduce. This process continues until one of the stopping criteria (improvement of the best solution or number of populations) is met. Figure 2 illustrates a graphical view of the mentioned process.

Genetic Algorithm (GA)
Inspired by the theory of natural selection, Genetic algorithm (GA) was first presented by Holland [37]. GA is one of the most effective evolutionary computing which one of its aims is to improve the applicability of various networks, ANNs for example. Dealing with highly non-linear and intricate problems can be mentioned as the excellence of GA. Accordingly, for most modelling aims, there are various conditions that can be well handled by GA. Geotechnical simulations are a good instance of such cases, associated with changes in loading condition (dynamic and static loading patterns), continuous or discontinuous soil media, and linear or nonlinear soil behaviors. Due to different parameters that must be optimized for achieving a proper GA, it can be categorized as a problematic optimization algorithm (OA).
As stated supra, GA is a heuristic search technique that follows the natural selection rules to provide an effective solution. GA is a well-known component of a larger group of evolutionary algorithms (EA) that perform based on natural evolution techniques suchlike crossover, mutation, selection, and inheritance [38]. In the commencing of GA performance, some so-called solutions "population" are involved. In this procedure, new populations (offspring) are generated by existing relations. The fitness of each chromosome is evaluated to select two parent chromosomes from the population. It is notable that this determines the chance of relations to reproduce. In other words, the more fitness they have, the more likely to reproduce. This process continues until one of the stopping criteria (improvement of the best solution or number of populations) is met. Figure 2 illustrates a graphical view of the mentioned process.

Imperialist Competitive Algorithm (ICA)
Imperialist Competitive Algorithm (ICA) is a newly-developed metaheuristic algorithm. Inspired by the socio-political process of imperialism and imperialistic competition, this model was first suggested by Atashpaz-Gargari and Lucas in 2007 [40]. ICA capability in dealing with various optimization issues has been evaluated thoroughly in the literature [41]. Similar to other evolutionary techniques (e.g., particle swarm optimization and genetic algorithm) ICA is a population-based system which gets started with a user-defined population (i.e., the number of countries). As Figure 3 displays, two major groups of countries form the so-called units "empire." The first group is known as Imperialists comprising a number of the elite countries, and the remaining countries are defined as the Colonies. During the ICA implementation, imperialists aim to extend their territories to the governing colonies. Since this procedure is an uncontrolled action, we may witness a revolution in each country. Note that, every country can define its self in a new empire initial other than the initial one. The tendency of the countries for the higher chance promotion can be mentioned as the main reason for this. The result of the mentioned steps is that the possession of the weaker colonies is taken by, the stronger one. This process continues until a predetermined stopping criterion is met. More details about the ICA mechanism is well-explained in other researches [42].

Imperialist Competitive Algorithm (ICA)
Imperialist Competitive Algorithm (ICA) is a newly-developed metaheuristic algorithm. Inspired by the socio-political process of imperialism and imperialistic competition, this model was first suggested by Atashpaz-Gargari and Lucas in 2007 [40]. ICA capability in dealing with various optimization issues has been evaluated thoroughly in the literature [41]. Similar to other evolutionary techniques (e.g., particle swarm optimization and genetic algorithm) ICA is a population-based system which gets started with a user-defined population (i.e., the number of countries). As Figure 3 displays, two major groups of countries form the so-called units "empire." The first group is known as Imperialists comprising a number of the elite countries, and the remaining countries are defined as the Colonies. During the ICA implementation, imperialists aim to extend their territories to the governing colonies. Since this procedure is an uncontrolled action, we may witness a revolution in each country. Note that, every country can define its self in a new empire initial other than the initial one. The tendency of the countries for the higher chance promotion can be mentioned as the main reason for this. The result of the mentioned steps is that the possession of the weaker colonies is taken by, the stronger one. This process continues until a predetermined stopping criterion is met. More details about the ICA mechanism is well-explained in other researches [42].

Data Preparation and Statistics
In this study, we used the dataset taken from https://cml.ics.uci.edu/, based on research by Tsanas and Xifara [32]. Eight input parameters, including relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, and glazing area distribution of a residential building, (named as X1, X2, …, X8, respectively) and two response parameters, including heating load (HL) and cooling load (CL) (named as Y1 and Y2, respectively) of the proposed building construct the mentioned dataset. Figure 4 shows a graphical description from data preparation procedure. Considering twelve different buildings, four glazing areas, five distribution scenarios, and four orientations, 768 buildings were modelled and analyzed by using Ecotect computer software [43]. The basic assumptions and the characteristics of the considered building are well described in Reference [32]. Additionally, Table 1 denotes the results of the statistical analysis of both independent and dependent variables.
To better understand the issue, the relationship between the inputs and output variables is demonstrated in this section. To do so, the 3D distribution of each independent parameter (i.e., input variables) against both response variables is depicted in Figure 5. In this regard, each diagram shows the X1, X2, …, X8 on the x-axis, versus Y1 and Y2 on the y and z axes.

Data Preparation and Statistics
In this study, we used the dataset taken from https://cml.ics.uci.edu/, based on research by Tsanas and Xifara [32]. Eight input parameters, including relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, and glazing area distribution of a residential building, (named as X1, X2, . . . , X8, respectively) and two response parameters, including heating load (HL) and cooling load (CL) (named as Y1 and Y2, respectively) of the proposed building construct the mentioned dataset. Figure 4 shows a graphical description from data preparation procedure. Considering twelve different buildings, four glazing areas, five distribution scenarios, and four orientations, 768 buildings were modelled and analyzed by using Ecotect computer software [43]. The basic assumptions and the characteristics of the considered building are well described in Reference [32]. Additionally, Table 1 denotes the results of the statistical analysis of both independent and dependent variables.
To better understand the issue, the relationship between the inputs and output variables is demonstrated in this section. To do so, the 3D distribution of each independent parameter (i.e., input variables) against both response variables is depicted in Figure 5. In this regard, each diagram shows the X1, X2, . . . , X8 on the x-axis, versus Y1 and Y2 on the y and z axes.     Moreover, the frequency of output data has been calculated and depicted in the form of histogram charts by using IBM SPSS software. These diagrams are available in Figure 6a,b. Furthermore, according to the Figure 6c, the high correlation between the values of Y1 and Y2 (R 2 = 0.9523) portends a meaningful accommodation between these two variables. It shows that HL is positively correlated with CL. In other words, the input samples that produce low/high values of HL, give low/high values of CL, too.  Moreover, the frequency of output data has been calculated and depicted in the form of histogram charts by using IBM SPSS software. These diagrams are available in Figure 6a,b. Furthermore, according to the Figure 6c, the high correlation between the values of Y1 and Y2 (R 2 = 0.9523) portends a meaningful accommodation between these two variables. It shows that HL is positively correlated with CL. In other words, the input samples that produce low/high values of HL, give low/high values of CL, too. Moreover, the frequency of output data has been calculated and depicted in the form of histogram charts by using IBM SPSS software. These diagrams are available in Figure 6a,b. Furthermore, according to the Figure 6c, the high correlation between the values of Y1 and Y2 (R 2 = 0.9523) portends a meaningful accommodation between these two variables. It shows that HL is positively correlated with CL. In other words, the input samples that produce low/high values of HL, give low/high values of CL, too.

Results and Discussion
This study addresses the application of two optimization algorithms of genetic algorithm (GA) and imperialist competition algorithm (ICA) for improving the performance of the artificial neural network (ANN) for modeling the heating load (HL) and cooling load (CL) of a residential building. Note that, the mentioned algorithms are employed to find the best computational parameters (i.e., the weights and biases) of the ANN. Eight influential factors, including relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, and glazing area distribution of a residential building were selected to form the input dataset. Among 768 samples, 80% (614 records) were randomly selected and devoted to the training phase, and the remaining 20% (154 records) were used to evaluate the accuracy of ANN, GA-ANN, and ICA-ANN models. In this sense, to evaluate and compare the efficiency of the applied models, three accuracy criterions namely, R 2 , MAE, RMSE were defined, which have been extensively used in earlier studies [44,45]. Equations (2)-(4) formulate the mentioned statistical indices.
where Yi observed, and Yi predicted denote the actual and predicted values of HL (or CL) of the efficient building, respectively. The term N represents the number of instances, and observed stands for the average of the actual values of HL (or CL).

Results and Discussion
This study addresses the application of two optimization algorithms of genetic algorithm (GA) and imperialist competition algorithm (ICA) for improving the performance of the artificial neural network (ANN) for modeling the heating load (HL) and cooling load (CL) of a residential building. Note that, the mentioned algorithms are employed to find the best computational parameters (i.e., the weights and biases) of the ANN. Eight influential factors, including relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, and glazing area distribution of a residential building were selected to form the input dataset. Among 768 samples, 80% (614 records) were randomly selected and devoted to the training phase, and the remaining 20% (154 records) were used to evaluate the accuracy of ANN, GA-ANN, and ICA-ANN models. In this sense, to evaluate and compare the efficiency of the applied models, three accuracy criterions namely, R 2 , MAE, RMSE were defined, which have been extensively used in earlier studies [44,45]. Equations (2)-(4) formulate the mentioned statistical indices.
where Y i observed , and Y i predicted denote the actual and predicted values of HL (or CL) of the efficient building, respectively. The term N represents the number of instances, and Y observed stands for the average of the actual values of HL (or CL).

Finding the Optimal Structure of the Intelligent Models
As is known, determining an optimal structure of the networks is a substantial step in the field of artificial intelligence utilization. In this study, we executed an extensive trial and error process to achieve the appropriate architecture of ANN, GA-ANN, and ICA-ANN models. To do this, mean square error (MSE) was defined as the objective function to measure the accuracy of various structures of the networks. MSE is expressed as follows: where similar to the aforementioned criteria, N, Y i observed , and Y i predicted respectively symbolize the number of samples, actual and predicted values of HL (or CL). For ANN, a feed-forward back-propagation MLP containing one hidden layer was tasted based on different numbers of hidden neurons. Also, the Levenberg-Marquardt (LM) training algorithm is applied, due to the better performance compared to conventional gradient descent (GD) technique [46,47]. Besides, the tangent-sigmoid activation function (simply called Tansig) was used in this study, which is formulated as follows: Figure 7 depicts the results of the sensitivity analysis carried out to select the best ANN structure, as is shown. However, the MLP with 3, 4, and 5 nodes in its hidden layer yields a satisfying approximation, and the least MSE is obtained by the network with four hidden neurons. So, the used ANN model forms as 8 × 4 × 2 representing eight input neurons, four hidden neurons, and two output neurons, respectively. This is noteworthy that this structure was used in the following and for the GA-ANN and ICA-ANN predictive models.

Finding the Optimal Structure of the Intelligent Models
As is known, determining an optimal structure of the networks is a substantial step in the field of artificial intelligence utilization. In this study, we executed an extensive trial and error process to achieve the appropriate architecture of ANN, GA-ANN, and ICA-ANN models. To do this, mean square error (MSE) was defined as the objective function to measure the accuracy of various structures of the networks. MSE is expressed as follows: where similar to the aforementioned criteria, N, Yi observed, and Yi predicted respectively symbolize the number of samples, actual and predicted values of HL (or CL). For ANN, a feed-forward back-propagation MLP containing one hidden layer was tasted based on different numbers of hidden neurons. Also, the Levenberg-Marquardt (LM) training algorithm is applied, due to the better performance compared to conventional gradient descent (GD) technique [46,47]. Besides, the tangent-sigmoid activation function (simply called Tansig) was used in this study, which is formulated as follows: Figure 7 depicts the results of the sensitivity analysis carried out to select the best ANN structure, as is shown. However, the MLP with 3, 4, and 5 nodes in its hidden layer yields a satisfying approximation, and the least MSE is obtained by the network with four hidden neurons. So, the used ANN model forms as 8 × 4 × 2 representing eight input neurons, four hidden neurons, and two output neurons, respectively. This is noteworthy that this structure was used in the following and for the GA-ANN and ICA-ANN predictive models. As explained supra, to enhance the capability of the typical ANN, we combined it with genetic algorithm and imperialist competition algorithm (Figure 8). Similar to ANN, the best parameters were selected via a trial and error process. There are several parameters that affect the learning process of evolutionary algorithms. In the case of this study, different population sizes for GA, and a different number of decades for ICA were tested within 1000 iterations. Moreover, similar to As explained supra, to enhance the capability of the typical ANN, we combined it with genetic algorithm and imperialist competition algorithm (Figure 8). Similar to ANN, the best parameters were selected via a trial and error process. There are several parameters that affect the learning process of evolutionary algorithms. In the case of this study, different population sizes for GA, and a different number of decades for ICA were tested within 1000 iterations. Moreover, similar to Koopialipoor et al. [39], the most appropriate values for other parameters have been determined by a trial and error process. Table 2 denotes the results of the mentioned process.
Appl. Sci. 2019, 9, x FOR PEER REVIEW 13 of 26 Koopialipoor et al. [39], the most appropriate values for other parameters have been determined by a trial and error process. Table 2 denotes the results of the mentioned process.   Next, both parameters of "number of countries" and "population size" were considered to change between 50 to 500 with 50 intervals. The objective function was calculated for each iteration, and the results of this process are shown in Figures 9 and 10. Based on these figures, no significant change was observed after the 600th iteration for either GA and ICA algorithms. Accordingly, the lowest MSE obtained for the GA and ICA models with the number of countries and population size equal to 450 and 400, respectively. Next, both parameters of "number of countries" and "population size" were considered to change between 50 to 500 with 50 intervals. The objective function was calculated for each iteration, and the results of this process are shown in Figures 9 and 10. Based on these figures, no significant change was observed after the 600th iteration for either GA and ICA algorithms. Accordingly, the lowest MSE obtained for the GA and ICA models with the number of countries and population size equal to 450 and 400, respectively.

Model Assessment
Using the programming software of MATLAB version 14.0 all three models were coded and implemented with their optimal architecture and the results are presented through various ways. A graphical comparison between the actual and predicted values of HL and CL is presented in Figures  11 and 12, as well as a chart showing the error of each sample (i.e., the difference between the actual and predicted values) and a histogram of the errors depicting the frequency of each error value.

Model Assessment
Using the programming software of MATLAB version 14.0 all three models were coded and implemented with their optimal architecture and the results are presented through various ways. A graphical comparison between the actual and predicted values of HL and CL is presented in Figures  11 and 12, as well as a chart showing the error of each sample (i.e., the difference between the actual and predicted values) and a histogram of the errors depicting the frequency of each error value.

Model Assessment
Using the programming software of MATLAB version 14.0 all three models were coded and implemented with their optimal architecture and the results are presented through various ways. A graphical comparison between the actual and predicted values of HL and CL is presented in Figures 11 and 12, as well as a chart showing the error of each sample (i.e., the difference between the actual and predicted values) and a histogram of the errors depicting the frequency of each error value.   Moreover, the accommodation between the targets and the outputs of ANN, GA-ANN, and ICA-ANN is depicted in the form of regression charts in Figures 13 and 14, respectively, for HL and CL modelling. The coefficient of determination (R 2 ) is also calculated, which indicates the correlation between the mentioned parameters. This is noteworthy that R 2 is a statistical index which varies between 0 and 1 so that 1 represents an ideal prediction and vice versa. Moreover, the accommodation between the targets and the outputs of ANN, GA-ANN, and ICA-ANN is depicted in the form of regression charts in Figures 13 and 14, respectively, for HL and CL modelling. The coefficient of determination (R 2 ) is also calculated, which indicates the correlation between the mentioned parameters. This is noteworthy that R 2 is a statistical index which varies between 0 and 1 so that 1 represents an ideal prediction and vice versa.   In the following, Table 3 summarizes the values of R2, RMSE, and MAE acquired for the training and testing phases of HL and CL prediction. It shall be noted that, to have a coherent presentation of the results, calculated values of R 2 , RMSE, and MAE are reported in the respective order for ANN, GA-ANN, and ICA-ANN in all explanations below. According to this table, by comparing the obtained R 2 for the training HL (0.8710, 0.8711, and 0.8816) and testing HL (0.9017, 0.9076, and 0.9115), it can be concluded that all three models have shown better performance for the testing dataset. In this regard, the outputs produced by ICA-ANN has the highest correlation with the actual HLs for  In the following, Table 3 summarizes the values of R2, RMSE, and MAE acquired for the training and testing phases of HL and CL prediction. It shall be noted that, to have a coherent presentation of the results, calculated values of R 2 , RMSE, and MAE are reported in the respective order for ANN, GA-ANN, and ICA-ANN in all explanations below. According to this table, by comparing the obtained R 2 for the training HL (0.8710, 0.8711, and 0.8816) and testing HL (0.9017, 0.9076, and 0.9115), it can be concluded that all three models have shown better performance for the testing dataset. In this regard, the outputs produced by ICA-ANN has the highest correlation with the actual HLs for both training and testing data. Also, the training RMSE (3.6535, 2.9986, and 2.8050) and MAE (3.0014, 2.1797, and 2.0068) show that applying the GA and ICA evolutionary algorithms have a significant effect in decreasing the performance error. Similarly, the testing RMSE (3.6481, 2.8878, and 2.7819) and MAE (2.9382, 2.0622, and 2.0089) demonstrate that ICA-ANN has shown a higher accuracy compared to the GA-ANN and typical ANN in forecasting the HL of residential buildings. As for CL prediction, the calculated R 2 for the training (0.9119, 0.9102, and 0.9216) and testing samples (0.9179, 0.9232, and 0.9287) prove that ICA has performed slightly better than GA in improving the ANN performance. This is while ANN has achieved a higher value of R 2 compared to GA-ANN model (0.9119 > 0.9102). The computed RMSE for the training (4.3588, 3.4376, and 3.2892) and testing phases (3.9489, 2.8598, and 2.7995) revealed that combining ANN with GA and ICA has led to decreasing the root mean square error as 0.9212 and 1.0696, and 1.0891 and 1.1494, respectively for the training and testing instances. Referring to Table 3, however, GA-ANN outperformed ICA-ANN in the case of testing MAE (3.2835, 2.0982, and 2.1054), but the results of the training MAE (3.4320, 2.4330, and 2.3565) support the previous claims (i.e., the superiority of ICA-ANN method).

Conclusions and Remarks
Recent years have witnessed the development of new models to give an accurate approximation from the energy consumption of the buildings. In this paper, we employed two metaheuristic algorithms to overcome the drawbacks of the backpropagation algorithms for this aim. In this sense, a typical multilayer-perceptron neural network (ANN) was synthesized with genetic algorithm (GA-ANN) and imperialist competition algorithm (ICA-ANN) to estimate the heating load (HL) and cooling load (CL) of a building with residential use. To this end, eight influential energy parameters, including relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area, glazing area distribution were considered as the inputs of the designed networks. Considering twelve different buildings, four glazing areas, five distribution scenarios, and four orientations, 768 buildings were modelled and analyzed in Ecotect environment. Among those, 80% (614 records) were devoted to training the ANN, GA-ANN, and ICA-ANN models, and the efficiency of them was evaluated by the remaining 20% (154 records). The proposed models were implemented in their optimal conditions and three famous statistical indices of the coefficient of determination (R 2 ), mean absolute error (MAE), root mean square error (RMSE) was used to evaluate the accuracy of each method. We achieved the following results in this study:

•
Artificial intelligence techniques can act as an efficient approach for analyzing the heating and cooling load of the buildings.

•
Optimizing the computational parameters (i.e., the weights and biases) of the ANN using the GA and ICA has a significant effect on its performance in terms of both pattern learning and prediction.