Data-Based RC Dynamic Modelling to Assessing the In-Situ Thermal Performance of Buildings. Analysis of Several Key Aspects in a Simpliﬁed Reference Case toward the Application at On-Board Monitoring Level

: This paper reports the application of RC dynamic models for assessing thermal performance of buildings from in-situ tests (obtaining the U value for the walls, and the UA value and gA value for the whole buildings). The following aspects which are relevant to this approach have been systematically analyzed: The e ﬀ ect of the solar radiation on the heat ﬂux through the opaque walls versus the performance of the models including this e ﬀ ect, the optimum number of nodes required to represent the thermal systems, the assignment of inputs and outputs and the length of the test period. Additionally, several options modelling relevant e ﬀ ects using unmeasured variables were studied to evaluate the feasibility to reduce the cost and intrusiveness of the measurement devices required to obtain accurate results. Data series recorded under di ﬀ erent experimental conditions were considered to analyze the robustness and validity of the results. The performance of the models for each of these di ﬀ erent test conditions is discussed. The uncertainties estimated using the described method for the U values of the opaque walls, and the UA and gA values of the whole building, are, respectively, 2.8%, 4.2% and 2.3%. The feasibility to model relevant e ﬀ ects using unmeasured variables has been demonstrated. A simpliﬁed and well-known building has been used as a case study, reinforcing and complementing the validation criteria.


Introduction
The increasing consciousness on energy and environmental issues is leading to an intensification of research that can contribute to guide the elaboration of regulations, aiming to save building energy use and emissions of pollutants, frequently based on theoretical calculations. However, the theoretical and actual building energy performance can show significant differences. This problem has been evidenced and discussed by some researchers [1,2], showing that more scientific work is necessary to enhance the reliability of the building energy simulation design tools and the procedures for assessing the as-built energy performance. This research is also required to produce scientific background to support the elaboration of regulations and standards for energy savings in buildings [3]. The Strategic Plan [4] of the Energy in Buildings and Communities (EBC) Technology Collaboration Programme (TCP) of the International Energy Agency (IEA), considers among its research themes the need for The analysis reported in this paper added to other published works [31] constitutes a collection of methods systematically applied and validated by means of their application to the same test set up and high quality benchmark data. This material contributes as a strong basis and background to the development of procedures for the in-situ performance assessment of the building fabric, using data recorded by on-board monitoring systems, from test campaigns conducted under a large variety of weather and test conditions. The next sections are organized as follows: Section 2 briefly describes the simplified building, the boundary conditions and test set up used as a case study in this research work; Section 2 also introduces the methodology that has been applied as well as the steps followed for the analysis, first of the opaque walls and afterwards of the whole building; Section 3 reports and discusses the results; and, finally, Section 4 synthesizes the main findings of the work.

Benchmark Set up and Data
A test box performing as a simplified building was used in this research work. It was built within the framework of IEA EBC Annex 58. The same building was used as benchmark and support for previous research works in this context. This building and the experiment set up are briefly presented hereafter. Further information is reported in reference [15].

The Building (Test Box)
The simplified building is a test box that consists of a cubic structure ( Figure 1). Its exterior dimensions are 120 × 120 × 120 cm 3 , and the ceiling, floor and walls are all 12 cm thick. The south wall has a wooden-framed window 71 × 71 cm 2 and a glazed area of 52 × 52 cm 2 . It is suspended over a structure that avoids contact with the ground. Due to its airtight construction, air leakage can be assumed as negligible. The thermal transmittance (U in Wm −2 K −1 ) and the solar-energy transmittance (g, dimensionless) have been considered to characterize the opaque elements (ceiling, floor and walls) from one-dimensional analysis. The overall thermal transmission coefficient (UA in WK −1 ) and the total solar heat gain factor (gA in m 2 ), have been considered to characterize the whole building envelope from three-dimensional analysis. The values of the characteristic parameters of the building theoretically calculated, together with the results obtained applying other method for the analysis were used as complement to the validation criteria ( Table 1). The maximum deviations for the theoretical values were calculated as the standard deviation among all the average values for winter and summer data series due to variations of surface temperature and wind speed, which are minor deviations, as shown in Table 1. The g value is very low. According to [15], it is 0.0060 in average. Taking into account this low value it could lead to an undetectable effect of the solar radiation on the interior surfaces of the opaque walls. Table 1. Theoretical values of the characteristic parameters and estimated ranges of deviations from their constant values, and values obtained applying other analysis method [31].

Parameters
Theoretical

Boundary Conditions
The test campaign was carried out outdoors at the Plataforma Solar de Almeria (PSA), in the south east of Spain (37.1 • N, 2.4 • W). The area is rural, and the climate is typically hot and dry in summer and cold in winter; the air temperature presents large oscillations from day to night, with high levels solar radiation and clear sky.

Conducted Experiments and Data Series
Several experiments have been conducted under different test conditions to obtain different data series which are suitable to analyze different aspects of the modelling methods. The heating power was supplied by a 100 W incandescence lamp and the indoor air temperatures have been controlled in order to set the different test conditions. The experiment set up includes a comprehensive set of measurement devices [15]. The following subset of variables has been used for the analysis described in the next sections: The experiments have been designed aiming to produce several data series containing the information required to apply system identification techniques. The phenomena to be characterized must be evident and strong enough for the application of these techniques. In this context, a phenomenon is strong enough when the amplitude of its driving variable is significantly higher than the uncertainty in its measurement. If a driving variable does not fit this requisite, signal to noise is poor, leading to inaccurate parameter estimates. Accordingly:

•
To identify the heat transfer coefficient, the test set-up must guarantee strong enough heat loss through the building envelope maximizing the indoor to outdoor air temperature difference, which is the driving variable for this effect.

•
To identify the overall gA-value, the solar gains must be strong enough. This is achieved when the test period includes enough sunny days with high levels of solar radiation, which is the corresponding driving variable.
• To identify dynamic models, the system must be excited by dynamic input signals containing a wide range of frequencies and including the characteristic time constants of the system. The variables involved in the energy balance equations of outdoor tests have a dynamic character, but identification is facilitated by the application of certain power sequences such as the ROLBS [33] etc.
The following issues regarding the experiment set up are also relevant to the identifiability and accuracy of the parameter estimates:

•
Heating Power: heating the indoor air is necessary to maximize the indoor to outdoor air temperature difference. Free running tests usually lead to poor signal to noise ratios in the measurements of temperature difference and cause problems with identifiability. Additionally, the heating power must be strong enough because it is a relevant variable to the energy balance equation that must be used for the analysis.

•
Homogeneity of the indoor air temperature: the solar radiation and the heating systems are heating sources that can produce some inhomogeneity to the indoor air temperature which contribute to the uncertainty of the parameter estimates. A mini fan has been used to improve the air temperature homogeneity. A device with very small ventilation power has been chosen to avoid perturbations in the interior convection coefficients.

•
Sampling frequency: the sampling theorem must be fulfilled in the raw data as well as in any resampling applied in the pre-processing phase. The sampling frequency must be at least twice the frequency of the variable which is being measured.
According to all these issues three data series corresponding to different periods have been considered. One-minute sampling interval has been set. This sampling interval guarantees that the raw data fulfil the sampling theorem. The main differences among the three periods concerns to the heating power and indoor temperature settings: •  (9 days). Aiming to reproduce as possible a coheating test, but setting the indoor air temperature set point to 35 • C. This test sequence was designed in order to have a reference analysis according as much as possible to the traditional coheating test, and also to explore the application of steady-state approaches and also to analyze the capability to apply the system identification techniques to this type of tests. This series is also useful to analyze causality issues and different variables as the input signal. It must be observed that this test is not identical to the coheating test described in the literature [17]. It is a bit modified taking into account the given boundary conditions: in winter the indoor air temperature cannot be maintained at a constant 25 • C because the low position of the sun produces a strong global solar radiation incident on the window (Figure 2e). Consequently, the indoor air temperature shows some dynamic behavior with overheating intervals (Figure 2d) even raising the set point to 35 • C. • Series 18: 27 December 2013 to 7 January 2014 (12 days): also aiming to reproduce as possible a Coheating test, but in this case setting the indoor air temperature set point to 21 • C, intending to emulate the comfort zone in an in-use building. These more realistic conditions are set in order to explore the possibilities to apply these identification techniques to in-use buildings using data recorded by its on-board monitoring systems. These test conditions are also interesting to analyze causality issues and different variables as output signals. In this case, the low level of the indoor to outdoor air temperature difference increases the difficulty to identify the UA value. Additionally, the contribution of the solar radiation to the space heating makes it impossible to maintain a constant indoor air temperature. This effect, clearly seen from the observation of Figure 2e,f, is stressing the need to apply dynamic techniques for the analysis. Finally, the consecutive records available in these data series have been merged in one data series and then this full data series has been split in three series with identical length. The analysis has been done using the resulting four data series, named as: •

Data Analysis and Modelling Methodology
The subsequent subsections describe the following issues of the methodology that has been applied: First, the dynamic RC modelling approach is briefly presented (in Section 2.2.1). The criteria to assess performance of the models and the validity of the results are presented (in Section 2.2.2). Then the application of this methodology to the ceiling, floor, opaque walls (in Section 2.2.3) and the whole building (in Section 2.2.4) is described. Afterwards, some models including modelling of some effects driven by unmeasured variables are considered (in Section 2.2.5).
Several candidate models have been considered. All these candidate models have been constructed translating the physical features of the considered thermal systems into RC model structures. Simplifications, in which validity was demonstrated in previous work [31], were taken into account to construct these RC models. Accordingly, the starting points of this work are the results and conclusions reported in [31]. This work explores the capabilities of RC models to improve the results of that work analyzing the following aspects: • Further analysis of the influence of the solar radiation that is transmitted through the opaque walls versus the performance of the models including this effect. The work reported in [31] concluded that including the solar radiation in the models of the opaque walls does not improve the accuracy of the U value estimates. However, a model including a y-intercept was identified as the best model among the considered candidate models. The y-intercept was interpreted as a representation of the sum of the effects that were not included explicitly in the model. This issue induced us to carry out further analysis reconsidering the effect of the solar radiation applying the RC approach described in this paper.

•
Other effects such as dependencies on wind speed, surface and fabric temperatures have been disregarded taking into account the conclusions of the work reported in [31]. • Several options modelling relevant effects using unmeasured variables in order to evaluate the feasibility to reduce the cost and intrusiveness of the measurement devices required us to obtain accurate results. These options were not considered in [31]. However other works have demonstrated the usefulness of modelling several effects which are driven by unmeasured variables [34,35].
Additionally, the following issues of the RC modelling approach have been investigated along the application of this method to the particular case study considered in this work:

•
Reduction of the test period regarding the test period required us to obtain accurate results applying the dynamic integrated method applied in [31].

•
The optimum number of nodes required to represent the thermal systems.

•
The assignment of inputs and outputs.

•
The performance of the models using data recorded under different test conditions.

RC Models
RC models are used for obtaining the U value of the opaque walls and the UA value and gA value of the whole building. These models use an electrical analogy representing the thermal system to obtain the required parameters. The software tool LORD [30] has been used to identify the parameters of these models. Physical criteria have been combined with this modelling approach to obtain these coefficients and also to validate the results. This applied approach is based on energy balances, constructed from the main heat transfer contributions that are identified and modelled.
First, the U values of the opaque walls, ceiling and floor were obtained by a one-dimensional analysis based on direct measurement of heat flux through the interior surfaces. The main focus of this analysis was determining the optimum number of nodes to represent the opaque walls. Afterwards, a three-dimensional analysis based on the energy balance on the air volume confined by the building envelope, was applied to estimate the overall UA and gA values of the whole building envelope. The data series a, b, c and d recorded under different test conditions as well as the results obtained in previous work, applying a different method [31] and also the target design values, were used to analyze the robustness of the results.

Assessment of the Validity of the Results
The performance of the candidate modes to give accurate estimates of the required characteristic parameters was evaluated considering these aspects: • The physical coherence of the results obtained using different data series recorded under very different test conditions. The uncertainty related to the spread in each parameter was quantified through the following parameter [36] and its minimization was used as a criterion for consistency: where X represents the parameter estimate that can be U, g, UA or gA. X max and X min are, respectively, the maximum and minimum among the X estimates obtained from all the available data series; and σ x is the error obtained as the squared root of the variance estimate assuming a rectangular distribution [36] for the parameter X.

•
The average of the residuals: Considering the minimization of the difference between measured output variable and model output, as indicator of good model performance. The value of this indicator is considered good enough when it is in the range of the uncertainty of the measurement of the model output.

•
The agreement with the theoretical target values that are reported in Section 2.1.1. These values are considered reliable references and are relevant complements to the validation criteria, taking into account the simplicity and the detailed and accurate knowledge of the construction.

•
The agreement with the results obtained by the application of other methods [31].
The systematic analysis and discussion of the quality of the results obtained from the different candidate models has been assisted by plots of the proposed indicators versus the different modelling options.

Modelling of the Ceiling, Opaque Walls and Floor
The analysis to obtain the U value of the opaque walls, ceiling and floor was based on the one-dimensional energy balance equation on the corresponding interior surface. This energy balance was referred to the net heat flux density measured on this surface. Several candidate models were constructed based on different approximations for these effects and considering these contributions: • Heat losses through the wall driven by the indoor to outdoor air temperature difference. A constant U value is assumed to characterize this effect. As explained before, variations of this parameter due to other effects have been disregarded. • Solar gains driven by the solar radiation. Different candidate models that exclude and include this effect were constructed, considering that this contribution could be a bit relevant or negligible to the net heat flux through the opaque walls, ceiling and floor. The global solar radiation measured on the corresponding exterior surface, multiplied by an unknown constant g value, was considered in those candidate models including this contribution.
Taking into account these contributions, introducing c as the effective heat capacity of the wall and T w as its temperature, the one-dimensional energy balance equation on the interior surface of each opaque wall, with all the terms in W/m 2 , is written as: The RC candidate models representing this energy balance in each opaque wall were constructed linking indoor and outdoor air temperatures by one branch with several capacitors and resistances in series ( Figure 3a) as explained hereafter: • Models with n nodes and being n from 3 to 7 were considered, in order to analyze different distributions of thermal mass along the thickness of the walls. To avoid over-parameterization, symmetry was assumed.
The heat flux multiplied by a parameter (A n ) which is fixed to 1, is linked to node n.

•
The outdoor air temperature (T e ) is linked to node 1.

•
The indoor air temperature (T i ) is linked to node n. • Models including and non-including the solar radiation were evaluated as discussed in previous paragraphs. For models including the solar radiation, it is multiplied by an unknown parameter (A 2 ) and linked to node 2.

•
The indoor air temperature (T i ) and the heat flux through the internal surface of the wall (ϕ) were considered as alternative outputs to the models. In principle the selection of the model output must be based on causality, but both options were analyzed when these variables do not evidence clearly which one should be chosen as output according to this criterion. First, a systematic analysis of the ceiling considering all these options and their combinations was carried out. Once the best model for the ceiling was identified among all the candidate models considered, the U value of floor and left, right, back walls was obtained using the best model found for the ceiling taking into account that all these opaque components were almost identical. The following measurements of solar radiation were used: the horizontal global solar radiation for the ceiling, the north vertical global solar radiation for the back wall, and the south vertical global solar radiation for the floor, right and left walls.
Once we identified the best model representing the opaque walls, we analyzed the effect of the length of the test period on the accuracy of the results. This model has been used to obtain the U value of the ceiling, floor and opaque walls for different test lengths from 1 to 10 days.

Modelling of the Whole Building
The analysis to obtain the UA and gA of the whole box envelope was based on the three-dimensional energy balance equation on the air volume confined by the building envelope. This energy balance refers to the energy supplied and removed from this volume. Several candidate models were constructed based on different approximations for these effects and considering these contributions: • Heat supplied to the room by the heating system, P heating . • Heat losses through the building envelope driven by the indoor to outdoor air temperature difference. A constant UA value is assumed. Variations of this parameter with wind speed and outdoor surface temperature have been disregarded taking into account the conclusions from a previous work [31]. • Solar gains driven by the solar radiation. Modelled as an unknown constant gA value multiplied by the measured global vertical solar radiation. This effect is modelled assuming that the main contribution of the solar radiation to the energy balance in the room is caused by the global vertical solar radiation incident on the window. • Air leakage. All the candidate models have been constructed under the hypothesis that this effect is negligible, taking into account that the construction is known to be very tight. The results reported in [31] corroborate this assumption.
Taking into account these contributions and introducing C as the effective heat capacity of the room, the three-dimensional energy balance equation on the air volume confined by the building envelope, with all the terms in W, is written as: The RC candidate models representing this energy balance in the air volume confined by the building envelope were constructed linking the indoor and outdoor air temperatures by the element constituted by several capacitors and resistances (Figure 3b) as explained hereafter: • A branch representing the opaque walls. Candidate models with 3 to 7 nodes in this branch were considered, in order to analyze the suitability of modelling different distributions of thermal mass along the thickness of these walls. To avoid over-parameterization, symmetry has been assumed.
A parallel branch, consisting in just a thermal resistance, representing a faster heat transmission through the building envelope, mainly due to the window.

•
The outdoor air temperature (T e ) is linked to node 1.

•
The indoor air temperature (T i ) is linked to node n.

•
All the candidate models include the south vertical solar radiation (G v ) taking into account that the facing south windows provoke a relevant contribution to the energy supplied to the room. It is multiplied by an unknown parameter (A n ) and linked to node n. • Node n+1 is linked to node n by a conductance that is fixed to a large value (such that (1/H n+1−n → 0)). The heating power (P heating ) multiplied by a parameter (A n+1 ) fixed to 1 is linked to node n+1. This makes node n+1 equivalent to node n, but it is necessary because the tool used to identify the considered models (LORD, [30,37]) allows just one link of heat flux per node.

•
The indoor air temperature (T i ) and the heating power (P heating ) were considered as alternative outputs to the models. The issues that justify the consideration of these two options for the output are the analogous to those mentioned in Section 2.2.3 for the opaque walls.
A systematic analysis of the whole box considering all these options and their combinations was carried out.
Once identified the best model representing the whole building, the effect of the length of the test period on the accuracy of the results has been analyzed. This model has been applied to obtain the UA and gA value of the whole building for different test lengths from 1 to 10 days.

Models Including Effects Driven by Unmeasured Variables
Once the best model was identified among all the candidate models considered, several options modelling the relevant effects using unmeasured variables were proposed and evaluated in order to study the feasibility to reduce the cost and intrusiveness of the measurement devices required to obtain accurate results. The following candidate models were considered:

•
Model 1: using a synthetic variable, instead of the measured G v , calculated according the following expression trying to emulate the shape of the actual vertical global solar radiation: This strategy improves cost effectiveness as far as it can avoid the use of a pyranometer, specifically installed for this analysis, measuring the global solar radiation on the orientation where the building has relevant glassing surfaces. It can be applied in any the following cases: 1.
When the horizontal global solar radiation is available, that is more standardized than having it in any vertical surface.

2.
When no measurement of solar radiation is available, but the time of sunrise and sunset are known, which is relatively easy for a given location.
• Model 2: using a synthetic outdoor air temperature, instead of the measured T e , calculated according to the following expression: where 8.48 • C is the average outdoor air temperature according to the Typical Meteorological Year (TMY) that was available for the location as reported in [6,38]. ∆t is a two-hour delay observed between the solar radiation and the outdoor air temperature. This strategy offers the possibility for a cost reduction when the T e corresponding to the TMY of the place is available. • Model 3: using a binary variable, instead of the measured P heating . This binary variable is one when the heating is on and zero when the heating is off. The coefficient A 4 was fixed to the average of the heating power when it is on. In practice this would be equivalent to having a record of the status of the heating system (on or off) along the experiment and the measurement of the heating power supplied without being necessary any devices for its continuous measurement. This option represents a moderate cost reduction as far as continuous measurement of heating power is avoided, but it requires the continuous measurement of the status of heating system which is simpler and cheaper than a continuous accurate measurement of the heating power. • Model 4: the only difference between this model and Model 3 is that, in Model 4, the parameter A 4 is an identifiable parameter instead of the fixed value used in Model 3.
• Model 5: using the set point indoors air temperature, instead of the measured T i . This option allows us to avoid the measurement of the indoor air temperature. This strategy is useful to reduce the cost of the experiment as well as its intrusiveness.

Results and Discussion
This section presents the results obtained using all the candidate models and discusses their validity.

Analysis of the Opaque Walls
The results given by the different candidate models considered for the opaque walls, and the validity of these results are presented and discussed in Sections 3.1.1 and 3.1.2.

The Ceiling
The results obtained using the different candidate models are presented in Figure 4 and Tables 2-5. These tables and figure indicate that the three nodes models present a very bad performance taking into account the large spread observed between the results obtained from different data series, the large average value of the residual and the U value estimates which are far from the target theoretical values and also far from the values obtained applying other methods. Attending to the same criteria, a significant improvement is observed in the four nodes models. However, scarce variations are observed when the number of nodes is increased. In some cases, the tendencies of these minor variations regarding the four nodes models are towards worse performance.
Focusing on the four nodes models and attending to the spread of the results using different data series, the model with T i as output shows slightly smaller spread in the results than using ϕ ceiling as output for the same model. Models including solar radiation do not show improvement regarding those not including it. The others validity criteria do not show significant differences. Figure 5c and Tables 4 and 5 show that the g values identified from the models including the solar radiation are very low with relatively large uncertainties which indicates a poor identification of this parameter due to its low value according to Section 2.1.1, corroborating its negligibility. This issue corroborates why the models including the solar radiation are not improved.
According to these issues, the U value estimate obtained from the four nodes model, not including solar radiation and using the indoor air temperature as output has been chosen ( Figure 4b and Table 2).

Floor, Back, Right and Left Walls
The results obtained using the four nodes model for the floor, back, right and left walls are summarized in Figure 5 and Tables 6-8. The U value estimates present better performance when the solar radiation is not included (Figure 5a,b, and Tables 6 and 7). The g value estimates when models include the solar radiation, are very low with large uncertainties (Figure 5c and Table 8). This behavior was also observed for the ceiling, which makes sense taking into account that all these opaque elements are identical, concluding that including the solar radiation also has a negative effect on the performance of the models for the floor and opaque walls. The results obtained with the model that do not include the solar radiation present very good agreement with the theoretical values and also with the results obtained using other different method reported in [31] (Table 1).

Model Performance for Different Length of Test Period
Considering the results for the ceiling and back wall, very little variation in the accuracy of the U value estimates for the different lengths of test period is observed. Attending to the spread in the U value estimates for different data series, a slight improvement is observed for a six-day test period which is maintained similar for larger test periods (Figure 6a and Table 9). The average of the residuals does not show relevant differences for the different lengths of test period. These residuals are low and in the range of the uncertainty of the measurement of the output variable.  Ceiling. U value. Summary of the results using models that do not include solar radiation. Output: T i . Where e U is the error and R is the average of the residuals given by LORD for each data series.   Table 3. Ceiling. U value. Summary of the results using models that include solar radiation. Output: T i . Where e U is the error and R is the average of the residuals given by LORD for each data series.   Table 4. Ceiling. g value. Summary of the results using models that include solar radiation. Output: T i . Where e g is the error and R is the average of the residuals given by LORD for each data series.   Table 5. Ceiling. g value. Summary of the results using models that include solar radiation. Output: ϕ. Where e g is the error and R is the average of the residuals given by LORD for each data series.   The average of the residuals for these opaque elements shows the best behavior for the series b (Figure 6b,d,f), which is the series with the largest indoor to outdoor air temperature difference. This issue corroborates the usefulness of maximizing this driving variable introduced in Section 2.1.3. Table 6. Opaque walls. U value. Summary of the results using models that do not include solar radiation. Output: T i . Where e g is the error and R is the average of the residuals given by LORD for each data series.   Table 7. Opaque walls. U value. Summary of the results using models that include solar radiation. Output: T i . Where e g is the error and R is the average of the residuals given by LORD for each data series.   Considering the floor, the spread of the results for the different data series is relatively large for test periods shorter than seven days and increases slightly for larger test periods (Figure 6e). The average of the residuals shows different behavior for the different data series, being relatively high for series a and lower for series b and c (Figure 6f). The results for series a and c are far from the target theoretical values. However, the results obtained using series b present better agreement with these theoretical values. The observed behavior could be explained by the effect of the perturbation caused by the solar radiation through the windows incident on the floor. The relative weight of this perturbation is smaller for series b that presents the larger amplitude of indoor to outdoor air temperature difference which is the main driven variable used to obtain the U value. This issue leads us to obtain better results from series b even in the presence of the perturbation caused by the solar radiation incident on the interior surface of the floor. Table 9. Ceiling. Comparison of the results for different test lengths. U using the 4 nodes model without solar radiation and T i as output.

The Whole Building
The results obtained using the different candidate models considered to obtain UA and gA values and also the validity of these results is presented and discussed in Sections 3.2.1-3.2.3. The number of nodes in the opaque branch (3 to 7) is used to identify the models hereafter.

Models Considering Different Number of Nodes for the Branch Representing the Opaque Walls
The results obtained using different number of nodes for the branch representing the opaque walls and setting the outdoor air temperature and the heating power as alternative outputs are summarized in Figure 7, and Tables 10 and 11. These tables and figure show that: • Attending to the spread in the results obtained using the different data series, the three nodes model with T i as output shows the best performance. • Attending to the residual obtained using the different data series, scarce differences are observed for the different candidate models.     Table 11. Whole building. Summary of results. Output: T i . gA values, and average of the residuals. Where e gA is the error and R is the average of the residuals given by LORD for each data series.  Taking into account the observed performance of the considered models, the three nodes model, with T i as output, including solar radiation has been chosen to give the final results for the UA and gA values. This model is named as Model 0 hereafter.

Series a Series b Series c Series d
Focusing on the results obtained from the different data series, the results obtained from series a is the one that shows the best agreement with the theoretical values and with the results obtained applying a different analysis approach (see Tables 1,10 and 11). This is logical taking into account that the test conditions from this data series was optimized for the application of system identifications techniques, as explained in the Section 2.1.3.

Model Performance for Different Length of the Test Period
The results of the analysis carried out are summarized in Figure 8, and Tables 12 and 13. These tables and figure show that the spreads in the UA and gA obtained for the different data series is relatively high for the shortest lengths of test periods. These spreads oscillate for the different lengths of test period with a decreasing tendency for increasing lengths of the test period and is significantly small for a test period of four days in the case of UA. It is also relatively low for lengths of the test periods of five and eight days (Figure 8a). However, the gA value shows large spread for test periods shorter than six days and is relatively low for tests periods of six and eight days (Figure 8b). The average of the residuals does not show relevant differences for the different lengths of test period. These residuals are low and in a range that is similar to the range of the uncertainty of the measurement of the output variable.

Models Including Non-Measured Variables
The results obtained for the considered models including non-measured variables are summarized in Tables 14 and 15, and discussed in the following: • Model 1 using a synthetic variable, instead of the measured G v , has given good UA estimates only if the data series is long enough (series d). This result indicates that this strategy is useful for a cost reduction of the test set up when the length of the test campaign is not critical. • Model 2 using a synthetic outdoor air temperature, instead of the measured T e , has shown a better behavior when the indoor air temperature is higher (series a and b). In this case the errors due to the difference between the true T e and T e_synthetic have a relative lower influence than having a lower indoor to outdoor temperature difference. This result indicates that when the test conditions allow to achieve a high enough indoor to outdoor temperature difference and the TMY of the place is available, a cost reduction of the test set up by eliminating the measurement of the outdoor air temperature is possible. • Model 3 using a binary variable, instead of the measured P heating , has given good results for all the data series. These results support the validity of a model implementing a moderate cost reduction regarding the reference model 0 as discussed in Section 3.2.

•
Model 4 which is model 3 modified by the parameter A 4 set as an identifiable parameter instead of a fixed value, has given good UA estimates only if the data series is long enough (series d).
This result indicates that this strategy is useful for a cost reduction of the test set up when the length of the test campaign is not critical. • Model 5 using the set point indoors air temperature, instead of the measured T i , has shown good results only when this set point is remarkably high (series b) which is useful for testing empty buildings allowing setting up a high enough T i .
The results obtained from these alternative models which are in the range of the expected values are highlighted in green in Tables 14 and 15.

Conclusions
The reported systematic analysis has demonstrated the capability of the dynamic data-based RC models to assess the U value of opaque walls as well as the UA and gA values of whole buildings. The results and conclusions are supported by the richness of the data series recorded under very different conditions of the tests carried out on a benchmark set-up. Additionally, the well-known theoretical target values and the results obtained using other method give strong support to the validity of the obtained results.
The following conclusions were extracted from the one-dimensional analysis carried out to obtain the U value of the opaque walls, ceiling and floor: • The best performance among all the candidate models has been observed in the four nodes model, with T i as output, non-including solar radiation. This model has been chosen to give the final results for the U value of the opaque walls, ceiling and floor.

•
The results obtained using models assigning the heat flux as output instead of the indoor air temperature did not show significant differences regarding the results obtained using the chosen model.

•
The results obtained using models including the solar radiation do not show significant differences regarding the chosen model that do not include this variable. This result can be explained in two different ways, both leading to conclude that including the solar radiation on the opaque wall models is not convenient, (1) either because its effect is in practice negligible on the heat flux measured through the internal surfaces of these walls, or (2) because its effect is in the range of the uncertainty of the measurement of heat flux. In this case, modelling the effect of the solar radiation from the recorded data is not possible.
The following conclusions were extracted from the three-dimensional energy balance and analysis carried out to determine the UA and gA values of the whole building:

•
Taking into account the observed performance of the considered models, the best performance among all the candidate models has been observed in the three nodes model, with T i as output, including the solar radiation. This model has been chosen to give the final results for the UA and gA values because it shows the best performance attending to the spread in the results obtained using different data series.

•
The data series that was optimized for the application of system identifications techniques (series a including a ROLBS power sequence), gave the most accurate results, which makes sense taking into account that a system identification tool has been applied to obtain these parameters.
The analysis of the influence of the length of the test periods on the performance of the chosen models has shown that this length is relevant to the accuracy of the results. The worse performance was observed for the lowest length with improving tendencies and certain oscillation when the length is increased. This tendency is slightly seen for the U value of opaque walls but is more evident for the whole building that shows very bad performance for the lowest test periods. In both cases (opaque walls and whole building) six-day and eight-day test lengths have shown the best performances. These issues evidence the usefulness of analyzing the optimum length for each case study.
The following conclusions were extracted from the three-dimensional energy balance and analysis carried out to determine the UA and gA values of the whole building using non-measured variables reducing the cost of experiment set up and simplifying it: • If the heating power has an on/off pattern, models can give accurate results including this contribution to the energy balance as a binary variable that is one when the heating is on and zero when the heating is off, together with a constant coefficient fixed to the average of the heating power when it is on. The same model avoiding the average of the heating power when it is on can give also acceptable results for long enough testing periods.
• Using a synthetic variable, instead of the measured G v , can give good enough results when the tests campaign is long enough. • Using a synthetic outdoor air temperature, instead of the measured T e , can give acceptable results when the test conditions allow to achieve a high enough indoor to outdoor temperature difference and the TMY of the place is available.

•
Using the set point indoors air temperature, instead of the measured T i , can give good results only when this set point is remarkably high, which is useful for testing empty buildings allowing setting up a high enough indoor air temperature.
It must be highlighted that the models chosen to obtain the final result are highly dependent on the characteristics of the building envelope. The best models identified for this case study cannot be used arbitrarily to other buildings. The application of this modelling approach to any other building typology requires one to follow the same process presented in this article. First, constructing several candidate models based on physical criteria and feasible approximations. Afterwards, selecting the best model following the steps and the validation criteria presented in this paper. The identification of the most suitable model output, the optimum number of nodes to represent the opaque elements, determining if the transmission of the solar radiation through the opaque walls is or is not relevant, and the assessment of the robustness of the results, are key aspects that can show different behavior in different buildings and consequently must be analyzed in each case study.
The analysis reported in this paper added to other published works constitutes a collection of methods systematically applied and validated by means of their application to high quality benchmark data. This material provides strong background and support to the development of procedures for in-situ performance assessment of the building fabric, from data recorded by the on-board monitoring systems, from experimental campaigns conducted under a wide variety of dynamic conditions. The presented methodology for building energy performance assessment would be very useful to carry out diagnosis, resulting in a guidance to assist energy rehabilitations, and also to assess the effectiveness of such refurbishments once implemented. A large proportion of the buildings that need an energy rehabilitation are in-use. The capability to carry out such analysis using on-board monitoring system on in-use buildings, would contribute to reduce the intrusiveness of these assessment techniques, enhancing the readiness for deployment of this methodology.
Further research work to be undertaken in this area should expand the generality of the presented approach and its validity. In order to achieve these objectives, case studies increasing gradually the level of reality should be considered. In this context, the simplified building considered in this paper, and an extended test campaign that is available, are useful to study the following relevant aspects: extend the analysis to other months (including summer, spring and autumn), and also to study the feasibility to characterize the building in the presence of air infiltrations, different building orientations and incorporating phase change materials. Additionally, other buildings should be considered as case studies, to continue expanding the generality of these methods and extending the study to different building typologies under different weather conditions. Funding: This research was funded by the Spanish Ministry of Science and Technology through the In-Situ-BEPA-MAS project, reference PID2019-105046RB-I00. The authors wish to express their gratitude for these funds.