A Neural Network for Monitoring and Characterization of Buildings with Environmental Quality Management, Part 1: Verification under Steady State Conditions

Introducing integrated, automatic control to buildings operating with the environmental quality management (EQM) system, we found that existing energy models are not suitable for use in integrated control systems as they poorly represent the real time, interacting, and transient effects that occur under field conditions. We needed another high-precision estimator for energy efficiency and indoor environment and to this end we examined artificial neural networks (ANNs). This paper presents a road map for design and evaluation of ANN-based estimators of the given performance aspect in a complex interacting environment. It demonstrates that in creating a precise representation of a mathematical relationship one must evaluate the stability and fitness under randomly changing initial conditions. It also shows that ANN systems designed in this manner may have a high precision in characterizing the response of the building exposed to the variable outdoor climatic conditions. The absolute value of the relative errors ( M a x A R E ) being less than 1.4% for each stage of the ANN development proves that our objective of monitoring and EQM characterization can be reached.


Introduction
A published review [1] and papers [2][3][4] introduced the concept of environmental quality management (EQM) including a feasibility of application of the selected statistical methods [5] or artificial neural networks [6] to control heating systems. This paper is a first attempt to address the full system of monitoring and management of environmental quality in buildings from the control point of view.
During the review of current technology [7][8][9] the shortcomings of the current energy models became clear. Amongst many reasons one can highlight the uncertainty in measurement of the global temperature that is key to operating temperature [10], the effect of interactions that is not reflected in the energy models and costs [11,12], difficulties with applying multi-criterial analysis [13], and interaction of building with a city [14].
Nevertheless, the problem is two-fold: (1) most of the currently used energy models solve a system of simultaneous heat, air, and water transfer equations that are partial and hysteretic, second-order In this paper we analyze a simplified case with constant room temperature as such a case was previously considered (see [9]). This paper seeks to create a highly-efficient, smart building control system that can be applied to any type of facility regardless of its functionality. This approach is one of the novelties in the subject matter described, but it also has certain consequences. One of them, as already mentioned, relates to including the weather predictive capabilities in the model, another is to calculate the transient operational room temperature that must vary within a prescribed time function and stay within the adaptable climate range. Another novelty is integrating energy with the indoor environment in the management system.
It is worth highlighting that using artificial intelligence to solve the control issue with a new energy evaluation paradigm leads to extraordinary convergence of the model results with the measured results. However, to achieve this convergence, it is necessary to perform the required analytical work. The demonstrated procedure is another novelty in the area of environmental quality management.
The solution presented in the article is a continuation of many years of research, which was the basis for the creation of many articles. In previous studies [25][26][27][28][29][30], the experimental base was discussed in detail along with the possibilities it creates for conducting "in situ" research in the field of automatic process control. The issues concerning the work of individual technical systems (HVAC, lighting, blinds, etc.) in terms of reducing energy consumption through the implementation of specially developed control algorithms are presented elsewhere (see [6,26,[31][32][33]), as are issues related to providing comfort in an energy-efficient building (see [10,29,30,34,35]). Following the development of technology, conducting an analysis of scientific research results, we designed a new solution that is the subject of this article. To develop this solution, we used measurements published in different sources [10,[26][27][28][29][30][31][32][33][34].

Requirements for Building Automatics Control System (BACS)
With time, as the number of energy sources and storage capabilities grew (e.g., solar thermal, solar photovoltaic, water-based heat pump, water buffer or hot water tank, air-earth heat exchanger, air-water preheat coils), the role of heating ventilation and air conditioning (HVAC) interactions became apparent. Furthermore, post-construction requirements of a good indoor climate that were parallel to energy efficiency, imposed the optimization of HVAC under occupancy stage and thereby created a new set of problems and opportunities [36][37][38]. For instance, some inaccessible valves that previously were set during construction needed accessibility to permit adjustments, thus, new In this paper we analyze a simplified case with constant room temperature as such a case was previously considered (see [9]). This paper seeks to create a highly-efficient, smart building control system that can be applied to any type of facility regardless of its functionality. This approach is one of the novelties in the subject matter described, but it also has certain consequences. One of them, as already mentioned, relates to including the weather predictive capabilities in the model, another is to calculate the transient operational room temperature that must vary within a prescribed time function and stay within the adaptable climate range. Another novelty is integrating energy with the indoor environment in the management system.
It is worth highlighting that using artificial intelligence to solve the control issue with a new energy evaluation paradigm leads to extraordinary convergence of the model results with the measured results. However, to achieve this convergence, it is necessary to perform the required analytical work. The demonstrated procedure is another novelty in the area of environmental quality management.
The solution presented in the article is a continuation of many years of research, which was the basis for the creation of many articles. In previous studies [25][26][27][28][29][30], the experimental base was discussed in detail along with the possibilities it creates for conducting "in situ" research in the field of automatic process control. The issues concerning the work of individual technical systems (HVAC, lighting, blinds, etc.) in terms of reducing energy consumption through the implementation of specially developed control algorithms are presented elsewhere (see [6,26,[31][32][33]), as are issues related to providing comfort in an energy-efficient building (see [10,29,30,34,35]). Following the development of technology, conducting an analysis of scientific research results, we designed a new solution that is the subject of this article. To develop this solution, we used measurements published in different sources [10,[26][27][28][29][30][31][32][33][34].

Requirements for Building Automatics Control System (BACS)
With time, as the number of energy sources and storage capabilities grew (e.g., solar thermal, solar photovoltaic, water-based heat pump, water buffer or hot water tank, air-earth heat exchanger, air-water preheat coils), the role of heating ventilation and air conditioning (HVAC) interactions became apparent. Furthermore, post-construction requirements of a good indoor climate that were parallel to energy efficiency, imposed the optimization of HVAC under occupancy stage and thereby created a new set of problems and opportunities [36][37][38]. For instance, some inaccessible valves that previously were set during construction needed accessibility to permit adjustments, thus, new controls Energies 2020, 13, 3469 4 of 24 to expand the adjustment ranges had to be added. Yet the biggest impact was the requirement for additional monitoring information of temperatures or intensity of the flowing media as the control system needed to develop predicting capabilities [29].
Field monitoring is necessary for two different reasons: (a) safety and security, and (b) information to predict settings of these operational controls that are affected by thermal impedance of the dynamically operated building. Incidentally, we need both the history of these settings and the weather as well as the forecast of weather. Moreover, we need a model of the time response of the building. As we discussed elsewhere [1][2][3][4][5][6][7][8][9], the EQM technology does not rely on the currently existing energy models but develops a neural network-based model (see later text). Since having a well-designed BACS is the key to optimizing the energy performance of the building, one may ask the question-what are the requirements for building an automatics control system?
In the European Union (EU) the main driver for the upgrading energy efficiency is the Energy Performance in Buildings Directive (EPBD) from 2019 which highlights the importance of building automation and monitoring for non-residential buildings. Furthermore, the EU proposed a systematic auditing methodology and established energy performance certificates (EPCs). Such a certificate is issued by an energy expert, and provides information about details of the energy systems, energy consumption, and potential retrofitting measures. The mandatory certificate is implemented throughout all of the EU and helps to increase the energy efficiency of buildings. A study of voluntary European certification conducted in 2014 found that emerging, comprehensive energy certification was the eu.bac audit. In principle, the features and functionalities of building automation and control systems rely on guidelines set by the European standard, EN15232 "Energy performance of buildings-Impact of Building Automation, Controls and Building Management".
While the eu.bac tool is useful for acquiring insight as part of the building automation system's commissioning process, the tool is very detailed and needs a large amount of data to define the system. In most cases such data are missing. Time and resources to acquire the data limit the potential users of the tool, especially for retrofit projects. Thus, one may consider this standard as an excellent checklist to be supported with climate consideration (not addressed in the EN 15232 standard). Amongst many papers dealing with auditing and evaluation of BAC systems [39][40][41] one finds that large project savings on improvements to BAC systems are comparable to those of retrofitting building enclosures [39]. Yet, this not an issue of choice between different approaches; both are complementary elements of the same process and both are subjects to the economic analysis of cost-benefit relations.
In the previous series of papers, we addressed the issues related to the next generation of construction technology from a building science point of view [1][2][3][4][6][7][8]. This paper, as well as two more papers to follow, will address the issues related to the building automation, controls, and building management systems that as we know [25,39,40] may have an impact on both energy efficiency and building science. This paper employs two types of verification: (1) traditional verification of ANNs that includes a validation stage after training and before testing, and (2) comparison of the measured and ANN calculated results.

Finding the Best Number of Neurons in the Hidden Layer
We used a process discussed previously [42], as "y" represents the room air operational temperature, and X is a vector that includes 20 parameters described in the later text of the paper that affect the value y in one or another manner ( Figure 2). Two two-layered ANNs, P1 and P2 after training and validation have neurons in the second (hidden) layer, s {1} , varying from 1 to 50. Weight and bias were randomly assigned, and calculation was repeated five times (called approach and indexed as one to five). The results are shown below using so-called boxplots [43]. The criterion for the choice of the ANN was selected as the minimum value of the maximum absolute value of the relative error [44,45] calculated for a given approach. An additional condition about resilience when changing initial weight and bias for neurons was needed to ensure repeatability of results in the tested problem [46][47][48][49].
where is main criterion for choosing the best neural network structure; { } is the number of neurons in the hidden layer; is maximum absolute relative error obtained for the testing stage: The criterion for the choice of the ANN was selected as the minimum value of the maximum absolute value of the relative error [44,45] calculated for a given approach. An additional condition about resilience when changing initial weight and bias for neurons was needed to ensure repeatability of results in the tested problem [46][47][48][49]. where MainCrit is main criterion for choosing the best neural network structure; s {1} is the number of neurons in the hidden layer; MaxARE TEST is maximum absolute relative error obtained for the testing stage: where y iTest is a target for the network in testing stage; y ANNiTest is an output for the network in the testing stage. The selection of minimum value of the maximum absolute value ( y ANNi ) ensures that each relative error will be between zero and the maximum absolute relative error, as long as we do not use the ANN outside of the training range [50].
While the information about the maximum absolute value of the error is in the mathematical model [51] under testing, the expectation is that the calculated ANN results for conditions not used in training are not farther apart from the target values.
For the sake of clarity, we distinguish between input preparation and other elements of ANN's architecture.

Data Preparation
Measurements were performed for 20 min each starting on 17 July and ending on 6 August for a total of 21 days in the summer (the operational temperature, see definitions, is season dependent but in this project, we are not concerned with seasonal differences). The 20 physical parameters that may affect the value of operative temperature in the tested room are as follows: 1. Measured time in decimal notation every 20 min 2.
Degree of opening valve for interior earth-air heat exchanger (%) 3.
Degree of opening valve for exterior earth-air heat exchanger (%) 4.
Exterior air temperature (mean of both earth-air heat exchangers) 5.
Relative humidity of exterior air measured on the inlet to earth-air heat exchangers (EAHXs) 6.
Exterior air temperature (measured on the roof of the building) 10. Temperature of the cooling water in the tank 11. Temperature of the cooling water on return from the tested room 12. Efficiency of the cooling exchanger (%) 13. Steering of the floor cooling valve (%) 14. Temperature in the adjacent room on side 1 (it is used instead of wall surface temperature and we are dealing with a steady state evaluation) 15. Temperature in the adjacent room on side 2 (it is used instead of wall surface temperature) 16. Temperature in the room below (it is used instead of floor surface temperature) 17. Temperature in the room above (it is used instead of ceiling surface temperature) 18. Steering of the cooling valve for the floor system 19. Efficiency of ventilator in climate-convector (%) 20. Angle of setting in the solar shutters Target : y i = temperature in the tested room, • C The above measures constituted the vector X i and together with the room temperature (targets) y i were used for teaching, validation, and testing of the ANN. There were 1409 sets and 60% were used for Energies 2020, 13, 3469 7 of 24 training (X iTr , y iTr ) and 20% each for validation (X iVal , y iVal ) and testing (X iTest , y iTest ). An algorithm for ascribing data sets was written as a loop so that they moved to the next iteration after coming to the starting point. The first three sets were ascribed to the training, number four was ascribed to validation, and number 5 to testing. In this manner, for each approach (see Figure 2) we reduced random effects of the comparison and network validation. The data for validation were taken from measurements from which the estimator y = f (X) was determined [40]. Since the data included changes in temperature, humidity, weather, and other factors, and the validation was performed on independent data, uncertainty of the estimator in the validation stage was compared with training and testing stages. The MSE (6) was chosen as the uncertainty indicator. The process follows the description from previous studies [52,53].

Pre-and Post-Processing, Learning Parameters, and General Equation of the ANN's Architecture for a Given Approach
We decided that all input data would be normalized during pre-processing and denormalized during post-processing using the "mapminmax" function (i.e., a linear transformation into the interval of given boundaries) [52,53].
where Val org is the original value; Val is the transformed value; Val max and Val min are the original interval boundaries; Val max , Val min are the desired range boundaries, from −1 to 1. The research uses the Levenberg-Marquardt teaching algorithm [53,54]. This algorithm has shown satisfactory performance in preliminary studies [9]. As a performance function the mean squared error (MSE) was chosen: where n is the number of experimental sets for each learning stage (training (X iTr , y iTr ), validation (X iVal , y iVal ), tests (X iTest , y iTest )); y i is a target for the network; y ANNi is an output of the network respective to the i-th target. Implicitly, the error was also defined (5): Other parameters of the learning process are shown in Table 1. To identify the best possible relationship y = f (X) we selected the general network structure as feedforward [53,54] with one hidden layer (denoted with {1}) and one output layer (denoted with {2}) (Figure 3).

Hidden layer {1}
Pre-processing function mapminmax To restrict the scope of analysis the same activation function was selected. The chosen function was tansig, a hyperbolic tangent sigmoid [53,55], mathematically equivalent to tanh (8). Similarly, for the output layer a linear function of activation was used-purelin (12)-as recommended [50] for cases of non-linear functions.
The choices of the general network architecture and activation functions [9] were confirmed in the preliminary analysis.
where { } -argument of the transfer function was: where is the input column vector; { } is the column vector of biases for the hidden layer; and { } is the matrix of weights of input arguments for the hidden layer: Looking at Equation (8), it should be noted that it is simultaneously the column vector of the hidden layer {1} outputs { } . This vector takes the following form: To restrict the scope of analysis the same activation function was selected. The chosen function was tansig, a hyperbolic tangent sigmoid [53,55], mathematically equivalent to tanh (8). Similarly, for the output layer a linear function of activation was used-purelin (12)-as recommended [50] for cases of non-linear functions.
The choices of the general network architecture and activation functions [9] were confirmed in the preliminary analysis.
where arg {1} -argument of the transfer function was: where X is the input column vector; B {1} is the column vector of biases for the hidden layer; and W {1} is the matrix of weights of input arguments for the hidden layer: Energies 2020, 13, 3469 9 of 24 Looking at Equation (8), it should be noted that it is simultaneously the column vector of the hidden layer {1} outputs Y {1} . This vector takes the following form: . . .
In Figure 3 we notice that the output vector Y {1} of the hidden layer {1} enters the output layer {2} as the input. Taking this into consideration as well as the fact that in the output layer the transfer function was purelin (12) [53,55] and that the number of neurons was constant and equal to s {2} = 1 {2} (in accordance with the single variable output [49]), the following equations for the layer {2} can be written: where a = 1 is the directional coefficient; arg {2} is the purelin transfer function argument: where W {2} (14) isthe row vector of weights of input arguments Y {1} for the output layer; b 1 {2} is the bias (scalar) for the output layer.
The above analysis of components of the general structure of the ANN (Figure 3) permits formulating the equation y = f(X) as agglomeration of Equations (3) though (14) in the form of Equation (15): where norm mapminmax is the data preprocessing operation; denorm mapminmax is the data postprocessing operation.

Robustness Study of the Examined Neural Network Structures
As mentioned before, we checked 50 cases of ANNs with five created approaches, where weight and bias were randomly ascribed to permit assessment of the ANN's robustness. If the ANN structure, despite changes in the initial weight and bias, maintains consistency in the basic qualifiers such as MARE (mean absolute relative error) and R (Pearson's coefficient), one can assume that the structure is not sensitive to the initial conditions [55]. Additionally, one verifies if the initial data affect the stability of the system [55]. Therefore, if the ANN's qualifiers such as MARE, R, MSE or others are satisfactory, the repeatability of the ANN is confirmed [24].  (16)) shown against the number of neutrons in the hidden layer: Energies 2020, 13, x FOR PEER REVIEW 10 of 24      All three figures show that the ANNs with more than 32 neurons in the hidden layer are sensitive to the initial conditions. They also prove that structures with less than five neurons in the hidden layer show insufficient accuracy fit to the functional relationship. Figure 4

Overfitting and Underfitting Study of the Examined Neural Network Structures
The next item in the analysis in the stage of testing was the degree of ANN fitting (i.e., the relation between ANN model prediction and the target values) [49]. In this case, as all the y values had a magnitude of 10 2 , a summed square of residuals (SSE) was selected to measure the discrepancy between the data and an estimation model (see Equation (17)): This qualifier is used for > 1 as the power increases the difference between target and model prediction. Figure 7 shows boxplot versus the number of neurons in the hidden layer.
The results presented in Figure 7 show that for most structures, underfitting or overfitting phenomenon occurs. It can also be noticed that the overfitting has significant impact for almost every structure starting from { } = 15.

Overfitting and Underfitting Study of the Examined Neural Network Structures
The next item in the analysis in the stage of testing was the degree of ANN fitting (i.e., the relation between ANN model prediction and the target values) [49]. In this case, as all the y values had a magnitude of 10 2 , a summed square of residuals (SSE) was selected to measure the discrepancy between the data and an estimation model (see Equation (17)): This qualifier is used for e i > 1 as the power increases the difference between target and model prediction. Figure 7 shows boxplot versus the number of neurons in the hidden layer.
The results presented in Figure 7 show that for most structures, underfitting or overfitting phenomenon occurs. It can also be noticed that the overfitting has significant impact for almost every structure starting from s {1} = 15.
Finally, after analyzing the results from Figure 7, it is possible to indicate only a few structures for which the influence of the phenomenon of underfitting or overfitting is acceptable or has negligible significance. Such structures correspond to s {1} = 8, 13, 14, 20. Energies 2020, 13, x FOR PEER REVIEW 12 of 24

Identification of the Best Possible Mathematical Relationship of y = f (X)
So far, results in Section 3.1 (robustness) and Section 3.2 (over and underfitting) indicate that structures with s {1} = 8, 13, 14 and their approaches are suitable to represent the phenomenon described by the equation y = f (X). Figure 8 presents a boxplot illustrating the results of the maximum absolute relative error calculated for the network testing stage (2), constituting the basis for the main criterion for choosing the best possible relationship y = f (X) (1).
This figure presents the results obtained for all analyzed structures to prove that the structure indicated as the best in the following part of the publication (Section 3.       Results shown in Figure 8 and Table 2 for the main criterion for the selection of the best identification of the equation y = f (X), as was defined in Equation (1), were fulfilled in approach 1, by the structure with s {1} = 13. It is marked with bold numbers. Furthermore, looking at Figure 7, the smallest change in the SSE value was also seen for the structure s {1} = 13. This means that overfitting or underfitting had the smallest significance for this structure.
Effectively, we can state that the structure s {1} = 13, using approach 1 is the best approximation of the relation y = f (X) for input described in Section 2.2.

The Best y = f (X) Relation Obtained
The learning process for the selected, best neural network is shown in Figures 9 and 10. The figures present the calculated value of the network performance (Equation (6)) and other learning parameters (gradient, momentum, validation checks). Both of these graphs were depicted according to learning epochs.
In Figure 9, it can be seen that the ANN obtained the best results for the 8th epoch, therefore this epoch represents the final results of the learning process of the network. This figure also shows that the ANN learning process for the training stage had an uninterrupted downward trend until its completion. In addition, we notice that for the validation stage from the beginning of the ANN learning process up to the 8th epoch, there was a continuous improvement in the value of the performance (Equation (6)). This fact also confirms the course of validation checks shown in Figure 10.  Results shown in Figure 8 and Table 2 for the main criterion for the selection of the best identification of the equation = ( ), as was defined in Equation (1), were fulfilled in approach 1, by the structure with { } = 13. It is marked with bold numbers. Furthermore, looking at Figure 7, the smallest change in the SSE value was also seen for the structure { } = 13. This means that overfitting or underfitting had the smallest significance for this structure.
Effectively, we can state that the structure { } = 13 , using approach 1 is the best approximation of the relation = ( ) for input described in Section 2.2.

The Best = ( ) Relation Obtained
The learning process for the selected, best neural network is shown in Figures 9 and 10. The figures present the calculated value of the network performance (Equation (6)) and other learning parameters (gradient, momentum, validation checks). Both of these graphs were depicted according to learning epochs.
In Figure 9, it can be seen that the ANN obtained the best results for the 8th epoch, therefore this epoch represents the final results of the learning process of the network. This figure also shows that the ANN learning process for the training stage had an uninterrupted downward trend until its completion. In addition, we notice that for the validation stage from the beginning of the ANN learning process up to the 8th epoch, there was a continuous improvement in the value of the performance (Equation (6)). This fact also confirms the course of validation checks shown in Figure  10.   Figure 11 shows a histogram of residuals of the network, , (Equation (7)), in each of the ANN learning stages (training, validation, test). The x-axis presents the values of the residuals ascribed for the given bin. The y-axis presents the number of occurrences of , covering the range of a given bin. In the figure discussed, it can be seen that the histogram is characterized by a Gaussian distribution and that the data assigned to a specific learning stage have been distributed in an even manner. The discussed figure from the point of view of the quality of the modeled relation = ( ) is purely illustrative, because the values of (errors) on it are not related to the reference output values of which this relationship should seek. Therefore, Figure 12 shows a histogram of relative errors calculated with Equation (18), in the form of 50 bins. In turn, Figure 13 presents a graph showing for which of the assigned measurement samples the relative error occurred. In addition to the analysis of residuals and relative errors in the input data, representation of Pearson's correlation coefficient, R, was calculated [48] to establish the degree of the ANN model fit. To demonstrate this we used, R 2 , as the qualifier [48]. Figure 14 shows four graphs: training, validation, testing, and all data in relation between (output) and (target). The first three relate to specific stages of the ANN learning process, the last shows them all. Figure 10. Gradient, momentum, validation checks values obtained during the learning process for the best analyzed neural network. Figure 11 shows a histogram of residuals of the network, e i , (Equation (7)), in each of the ANN learning stages (training, validation, test). The x-axis presents the values of the residuals ascribed for the given bin. The y-axis presents the number of occurrences of e i , covering the range of a given bin. In the figure discussed, it can be seen that the histogram is characterized by a Gaussian distribution and that the data assigned to a specific learning stage have been distributed in an even manner. The discussed figure from the point of view of the quality of the modeled relation y = f (X) is purely illustrative, because the values of e i (errors) on it are not related to the reference output values of y i which this relationship should seek. Therefore, Figure 12 shows a histogram of relative errors calculated with Equation (18), in the form of 50 bins. In turn, Figure 13 presents a graph showing for which of the assigned measurement samples the relative error occurred. In addition to the analysis of residuals and relative errors in the input data, representation of Pearson's correlation coefficient, R, was calculated [48] to establish the degree of the ANN model fit. To demonstrate this we used, R 2 , as the qualifier [48]. Figure 14 shows four graphs: training, validation, testing, and all data in relation between y ANNi (output) and y i (target). The first three relate to specific stages of the ANN learning process, the last shows them all.

of 24
Energies 2020, 13, x FOR PEER REVIEW 15 of 24 Figure 11. Error histograms obtained during the learning process for the best analyzed neural network.

Figure 12.
Relative error histograms with 50 bins obtained during the learning process for the best analyzed neural network. Figure 11. Error histograms obtained during the learning process for the best analyzed neural network.
Energies 2020, 13, x FOR PEER REVIEW 15 of 24 Figure 11. Error histograms obtained during the learning process for the best analyzed neural network.     Results shown in Figures 12 and 13 indicate that the relative error for the representation = ( ) is less than 1.4% (i.e., a very good result). Table 3 shows the coefficient of determination, R 2 , indicating a very strong correlation [54] between the results of ANN calculation and the values to which the relationship = ( ) should converge. As the difference between the ideal fit and the studied case is less than 0.3%, one may conclude that the input data, architecture of the ANN, and methodology of its teaching were correct.  Figures 12 and 13 indicate that the relative error for the representation y = f (X) is less than 1.4% (i.e., a very good result). Table 3 shows the coefficient of determination, R 2 , indicating a very strong correlation [54] between the results of ANN calculation y ANNi and the values y i to which the relationship y = f (X) should converge. As the difference between the ideal fit and the studied case is less than 0.3%, one may conclude that the input data, architecture of the ANN, and methodology of its teaching were correct. To present the results of the described mapping quality, Figure 15 presents an example of the value of the function y = f (X) drawn for arguments x 1 and x 2 (3). This function was drawn on the background of y i (target), absolute errors (e i ), and relative errors (RE) made by the functional mapping y = f (X). In this figure, it can be seen that the function maps all reference output values y i with proper accuracy (RE < 1.4%). It can also be seen that the abrupt change in the value of x 2 does not cause discrepancies in the values of e i or RE, which indicates the compliance of the results described in this chapter. the background of (target), absolute errors ( ), and relative errors (RE) made by the functional mapping = ( ). In this figure, it can be seen that the function maps all reference output values with proper accuracy (RE < 1.4%). It can also be seen that the abrupt change in the value of does not cause discrepancies in the values of or RE, which indicates the compliance of the results described in this chapter. The equation = ( ) for the best analyzed neural network is presented here as Equation (1) with two decimal places (low precision) to show the significance of a given parameter for the resulting value. Yet, Appendix A (available online) shows the actual values in the neural network as obtained from the training process. Equation (19) and those in Appendix A are a specific form of the ANN described by Equation (15) and elements of matrix are defined and identified by numbers. The number of input arguments (x u gdzie u = 1, 2, . . . , 20) and the length of the results, implied that instead of presenting the functional mapping relative to the other arguments x u and x p (where u p = 1, 2, . . . , 20), the equation y = f (X) will be presented.

Results shown in
The equation y = f (X) for the best analyzed neural network is presented here as Equation (1) with two decimal places (low precision) to show the significance of a given parameter for the resulting value. Yet, Appendix A (available online) shows the actual values in the neural network as obtained from the training process. Equation (19) and those in Appendix A are a specific form of the ANN described by Equation (15) and elements of matrix are defined and identified by numbers.
More details of the content of the elements of Equation (19) are given in Appendix A in Equation (A1).

Discussion
This paper is a part of tools developed for environmental quality management (EQM) system [1][2][3][4][5][6][7][8][9] by two groups: (a) ANN group with US team (MAE Dept., Clarkson University, Potsdam, NY, USA, RD services Cookeville, TN, USA and DFI Enterprise in Morrisville, NY, USA) and Poland team (Lbooking Inc and Cracow University of Technology, Cracow, Poland) and (b) energy modeling group with Canada (ETS, Montreal, QC, Canada) and Saudi Arabia (JUC, Jubal, Saudi Arabia). A cornerstone of this technology is integration of mechanical and structural elements of buildings and use of the integrated control systems for steering of these mechanical devices to satisfy requirements of energy efficiency and indoor environment. The EQM building operates in transient conditions and while all current energy and hygrothermal models are parametric in that they allow comparisons of results related to a given change in the input data, they are not suitable to address transient and cyclic short-term events as they do not account for energy related to air and moisture movements and moisture hysteresis in porous materials.
Therefore, we intend to change the paradigm for steering and control of the indoor environment and energy in the EQM-based residential buildings either new or retrofitted to monitoring to develop an EQM solution for a given building. In the feasibility study [9] we used an ANN to a selected case study. This paper follows the case study and examines the same data set as a component of a potential system of monitoring and EQM solution. This paper presents a full range of design and testing of a ANN that aims at a high precision of estimation of the air temperature in the room located in the experimental building equipped with an earth-air heat exchanger (EAHX) for preconditioning of air during the summer period [2,31]. The case study deals with a model of temperature control and uses 20 input parameters that describe all possible effects, namely temperatures of all adjacent rooms, outdoor climatic conditions including solar radiation, performance of EAHX and water chiller.
The anticipated estimator was termed as y = f (X) as it was supposed to represent a mathematical model. The input data to the vector X were selected in a three-week summer period with variable outdoor conditions (more than 1400 data sets), but similar enough to expect a pattern characteristic for the given building. Indeed, the results exceeded the expectations. The absolute value of the relative errors (MaxARE) was for this estimator less than 1.4% for each stage of the ANN development.
The estimator (Equation (19)) was established after evaluation of 250 variants for the basic equation y = f (X). This evaluation process included a search for the best structure of ANNs (see Figure 2) and satisfying diverse performance criteria that also included ruggedness (Section 3.1) as well as overfitting and underfitting (Section 3.2). These two elements of analysis showed large effect of initial conditions leaving only three cases out of 50 initially considered cases (or 15 out of 250). This comparison implies that 94% of cases did not have the required stability (Section 3.1) or fitness to represent the equation of y = f (X).
It must also be highlighted that the above presented estimator was verified in this paper as a self-standing item. Yet the whole project involves an integrated control system and the energy estimator will be expanded to include exchange of air and there also will be another estimator addressing the indoor environment characterization for a given climatic season.
It is worth mentioning that the above presented model was successfully validated for the case of its individual performance. Yet the whole project involves an integrated control system and the energy estimator will be expanded in a few different aspects, particularly with respect to missing data. One should remember that this estimator will be validated twice: first, during the work in the "Measurement data reconstruction and temperature estimation module" (Figure 1), and secondly, for the whole control system (Figure 1). To achieve this goal, measurements will be performed that will correlate with the period of time (climatic season), for which the estimator was designed. Afterwards, the appropriate indicators will be selected. All these validations will be conducted in accordance with previously described procedures [49,[56][57][58].
Nevertheless, while recording measurements for X i samples often have part of input data missing or distorted. Such cases are commonly encountered when studying buildings with working control systems. Therefore, in the future, the estimator for the real objects will use a control system element called "Data analysis/reconstruction" (Figure 1). A project of such an element will be presented in the course of further work. This element will function on the basis of an autoencoder or Hopfield's net [59].
The above presented estimator allows for an initial handling of data so that they will not disturb measured results or negatively affect the process of analysis. Using Hopfield's net reduces the negative effects of such, as loss of measurements leads to empty fields in the data or failure in control of some parameters or even non-optimal settings used for some devices or sub-systems. Thanks to these measures the estimator will provide stability and efficiency to the operation of the BAC system. Moreover, the estimator ( Figure 2) is built as a modular and elastic tool that employs several different statistical methods. One can continue to modify and expand the scope of the operation of this tool without any collision with the control processes. The statistical analysis complements the limitations in the inherent scope of ANNs because it gives us the capability to determine which factors have a critical influence on the process, to identify statistical outliers or even find a malfunctional device. In turn, one can make a decision on what will be considered as input to the ANN that is being created.
Perhaps, one should explain that our team participates in three interdisciplinary, partly sponsored industrial projects in which buildings with very different thermal mass (a historic building functioning as a hotel, a modern office building, and a houseboat) were retrofitted. With such different thermal impedance of each building, we were looking for a modular and universal solution to control heating/cooling systems. The control system must work with a high precision under adaptable indoor climate conditions (linear transition of air temperature) for 12 cm thick walls on the boat and for more than a meter of masonry walls in the historic building.
The model described in the paper has the following features: (a) statistical analysis of data; (b) increased performance as higher precision means larger savings of energy; (c) the best model of the analyzed room.
The search for a high precision system is justified by the volume of buildings being retrofitted. The difference between 5% and 15% errors in the energy balance means the difference between a country being within international agreements on carbon gas emission or failing it on energy, carbon emissions, and socio-economic progress.
This paper is presented as a road map for the design and evaluation of ANN-based estimators of the given performance aspect in a complex interacting environment. It demonstrates that to create a model for a precise representation of a mathematical relationship one must evaluate stability and fitness under randomly changing initial conditions. It also shows that a properly designed ANN system may have a very high precision in characterizing the underlying dependence as in the analyzed case where the response factor of the building was exposed to the variable outdoor climatic conditions. The absolute value of the relative errors (MaxARE) was for this estimator less than 1.4% for each stage of the ANN development which proves that our objective of monitoring and EQM characterization can be reached. A similar procedure was used in another case study [57].

Future Research Program
In this paper, we used a term "operational temperature" to describe the indoor climate effect because we needed to add convective and conductive effects to the radiative heat exchange (particularly as we will primarily use the radiative heating systems) and this is done under the term of operational air temperature. To this end we need to complete procedures of handling data prior to application to the ANN and development of indexes to be used in the ANN characterization of indoor environment.
As this paper dealt with verification under steady state conditions, we could use air temperatures for adjacent rooms, but for the transient conditions one needs to use the wall surface temperatures as well. Thus, in addition to characterizing the effect of air exchange, the next estimator must deal with an operational temperature calculated from the balance of energy in the room surroundings. Furthermore, as our focus is on the interaction between different factors affecting the transient response of the building, we will use a modular statistical software [5] to evaluate the consistency of the input data set and remove statistical outliers.