Development of Building Design Optimization Methodology: Residential Building Applications

: Building design optimization is a highly complex problem, requiring long computational running processes because of the many options that exist when a building is being designed. This paper introduces an integrated approach through which to perform this optimization within an acceptable time frame. The approach includes the methods of variable selection, model simplification, and a sequential optimization process. Using singular value decomposition, a large number of design variables is reduced to a smaller subset that can be solved more quickly through the optimization algorithm. To expedite the variable selection process, a modeling approach that quickly simulates annual energy consumption was developed to replace full annual energy simulations. The developed methodology was applied to two residential buildings in the US, and the results are discussed herein. To assess the accuracy of the integrated optimization methodology, the optimized life cycle costs are compaa variables demonstrating the strongest contributions in the optimization study were identified. The proposed methodology significantly shortened the time requirements for the optimization processes of the two case studies by 74% and 84%; the optimized life cycle costs were within 0.05% and 0.06%, respectively, of the optimum point.


Introduction
According to energy usage data from the US Energy Information Administration, residential and commercial buildings contribute approximately 40% of energy consumption and 16% of energy-related carbon emissions in the US [1,2].Over the past 25 years, residential energy use has increased by approximately 7.0%, and that of commercial buildings has increased by about 10.4% [3].As energy consumption in the US continues to increase and as energy-related carbon emissions need to be reduced, energy simulation tools have been incorporated more regularly into the building design process for high-performance buildings.Energy-efficient design optimization techniques for buildings are undoubtedly important to reducing building energy consumption and its associated costs, which have been active research topics [4].
Ideally, to find an optimal building design point that minimizes an objective function, such as energy consumption or building life cycle cost (LCC), building energy simulation software, such as EnergyPlus [5] and TRNSYS [6], can be coupled with an optimization algorithm for accurate energy consumption calculations.However, such optimization studies are usually prohibitive because of their very long calculation times, owing to large numbers of design variables, and their relatively long simulation times related to the software used to predict annual energy consumption at each iteration of the optimization process.
A lot of research has been carried out on reducing the long computational times associated with the optimization process.The primary focus of previous research has been to develop simplified energy consumption models that can replace detailed energy simulation software using, for example, neural networks and regression models.Magnier and Haghighat [7], as well as Gossard et al. [8], used a genetic algorithm with an artificial neural network model to optimize thermal comfort and energy consumption in a residential house.Ghiaus [9] applied robust regression (an alternative of the least squares regression method that is effective when outliers exist) to predict the heating load as a function of outdoor temperature.Catalina et al. [10] used a quadratic polynomial regression model to predict heating demand with multiple independent variables, such as shape factor, building time constant, and more.To generate the database for creating the model, 18,144 simulations were performed.Hygh et al. [11] used EnergyPlus to build a multivariate regression model with 27 parameters; 20,000 full annual simulation datasets were generated to predict heating and cooling loads.Assuming each simulation takes 30 s to 1 min, the process of generating the dataset alone requires 166 to 332 h.The cooling regression model has a high value for the coefficient of determination (R 2 ) that indicates how well the data fit the model; however, the heating regression model has a low value for R 2 from 0.498 to 0.816.Ghiaus [9] used the concept of balance point temperature to construct a regression model.Hygh et al. [11] used a fixed 18 • C balance temperature to predict energy consumption, and Krarti [12] estimated the exact balance temperature.Eisenhower [13] developed an analytical metamodel that fit the building simulation data and then performed the optimization.Although a simplified modeling approach is capable of carrying out the building design optimization process with little computational effort, it essentially requires a tremendous number of pre-energy simulation results to obtain a reliable model, which is the major disadvantage of this approach.Consequently, regenerating the database is necessary when changing the design variables to create a simplified model.
Another approach through which to accelerate the optimization process involves employing variable selection methods to choose a subset of variables from the entire set of variables.Bettonvil [14] applied a group screening method to detect important factors by applying sequential bifurcation to a building energy model.Rahni [15] partitioned building parameters into multiple groups and tested which groups make more significant contributions to building energy performance.Corrado and Mechri [16] used the Morris method to reduce the parameter dimensions from 129 to 10. Brohus et al. [17] used an analysis of variance-based analysis to identify significant variables of residential building energy consumption, and identified the 10 most important parameters out of a total of 57.
An alternative approach is using sensitivity analysis.In previous studies, sensitivity analyses were employed in either the pre-or postoptimization phases or to efficiently select a subset of variables significantly affecting the objective function, thus streamlining the optimal design process with reduced effort compared with full optimization.Gunay et al. [18] used sensitivity analysis to identify the key operational parameters of office buildings, and Cheng et al. [19] used the Morris one-at-a-time method to design a sustainable housing community.Østergård et al. [20] used sensitivity analysis to inform the decision-making process in the early building design process.Sensitivity analysis is easy to use and is a valuable tool for assessing model behavior and identifying critical input parameters; therefore, it is frequently used in building simulation research.However, sensitivity analysis becomes more challenging as the dimensionality of the input space increases, and the interactions between variables can become complex, making it difficult to isolate the effect of an individual variable [21].
To speed up building design optimization, the two approaches of simplified modeling and variable reduction can naturally be combined.However, despite many previous studies, it is hard to find studies testing the integrated optimization approach.Additionally, the computational time needed to generate a reduced-order model or a variable selection process is often neglected.Therefore, it is not clear how much the integrated approach is able to accelerate building design optimization.This paper presents an integrated optimization methodology to tackle the building design optimization problem.The developed methodology can be used for any residential building design optimization process, including for the early design and engineering design stages.Also, the developed methodology is not limited to any specific design variables for the optimization process.A novel variable selection method is proposed and adopted for the optimization problem to reduce the high dimensionality of the design variables.To overcome the long computational time required to generate sufficient data that can be used during the variable selection process, a new, simplified modeling approach is developed to replace the full annual energy simulation for residential buildings.A strategy through which to approach the true optimal point is also introduced.The integrated methodology is applied to two case studies of typical residential buildings in the US, and comparisons of the results with those of the full optimization processes for the entire design spaces are presented.

Overall Methodology
To derive an accurate but computationally efficient methodology for building design optimizations, two main problems need to be solved: (1) reduce the number of optimization variables and (2) reduce the amount of computational time consumed by the energy simulation software.To overcome these problems, the following process shown in Figure 1 is proposed.
selection process is often neglected.Therefore, it is not clear how much the integrated approach is able to accelerate building design optimization.
This paper presents an integrated optimization methodology to tackle the building design optimization problem.The developed methodology can be used for any residential building design optimization process, including for the early design and engineering design stages.Also, the developed methodology is not limited to any specific design variables for the optimization process.A novel variable selection method is proposed and adopted for the optimization problem to reduce the high dimensionality of the design variables.To overcome the long computational time required to generate sufficient data that can be used during the variable selection process, a new, simplified modeling approach is developed to replace the full annual energy simulation for residential buildings.A strategy through which to approach the true optimal point is also introduced.The integrated methodology is applied to two case studies of typical residential buildings in the US, and comparisons of the results with those of the full optimization processes for the entire design spaces are presented.

Overall Methodology
To derive an accurate but computationally efficient methodology for building design optimizations, two main problems need to be solved: (1) reduce the number of optimization variables and (2) reduce the amount of computational time consumed by the energy simulation software.To overcome these problems, the following process shown in Figure 1 is proposed.The first step, shown in the green box, is to define a building design optimization problem by specifying appropriate design variables and their corresponding constraints, along with the objective function (e.g., the minimization of LCC for residential buildings in this paper).To evaluate energy costs accurately during the optimization process, detailed building energy simulation software is incorporated rather than the simplified models that are typically used in other research approaches.The second step is to reduce the number of design variables using a variable selection method, which identifies subvariables that have a significant influence on the objective function.To accelerate this process, which requires a large number of annual building energy simulations, a simplified modeling strategy is adopted.Then, an optimization algorithm is coupled to the building energy software, which searches for the optimum point only in a reduced dimension of the design space as defined by the variable selection method.In this approach, the simplified energy consumption models are only used during the variable selection process (the second block of Figure 1), and high-fidelity models are used in the actual optimization The first step, shown in the green box, is to define a building design optimization problem by specifying appropriate design variables and their corresponding constraints, along with the objective function (e.g., the minimization of LCC for residential buildings in this paper).To evaluate energy costs accurately during the optimization process, detailed building energy simulation software is incorporated rather than the simplified models that are typically used in other research approaches.The second step is to reduce the number of design variables using a variable selection method, which identifies subvariables that have a significant influence on the objective function.To accelerate this process, which requires a large number of annual building energy simulations, a simplified modeling strategy is adopted.Then, an optimization algorithm is coupled to the building energy software, which searches for the optimum point only in a reduced dimension of the design space as defined by the variable selection method.In this approach, the simplified energy consumption models are only used during the variable selection process (the second block of Figure 1), and high-fidelity models are used in the actual optimization process (the third block of Figure 1).The last step is to reduce the optimization error that is inherited from the approximation approach (i.e., the variable selection algorithm) by sequentially adjusting the insignificant variables.

Variable Selection Algorithm and Optimization Process
In this section, each step shown in Section 2 is described in detail.Section 3.1 provides the details of the variable selection algorithm, and Section 3.2 shows how this variable selection algorithm is applied to an optimization problem.Section 3.3 shows the simplified energy consumption model used for residential building applications that will be used only in the variable selection process, and Section 3.4 shows the optimization process, with the variables selected from the variable selection process and postoptimization process.

Variable Selection Algorithm Using Singular Value Decomposition
Consider a differentiable function, denoted as f : The task here is to select the important subvariables out of (x 1 , x 2 , • • • , x n ), which influence y most significantly.Selecting a metric that quantifies the magnitude of importance or significance of a variable is the key step.We define the following operator L: where ), e i is the natural basis corresponding to x i , < •, • > is the inner product, and v(∈ R n ) is a unit vector.Operator L measures the spatial average of the directional derivate of f over D in the direction of v.When L(v) = 0, y does not change in the direction of v.When L(v 1 ) >> L(v 2 ), y varies more significantly in the direction of v 1 than that of v 2 in the spatial average sense.Therefore, employing L as a measure to gauge the significance or importance of a variable is a rational choice.
The following example is provided for further clarity.Let y = x 1 2 + 0.01x 2 2 , and assume that x 1 and x 2 are well-scaled such that −1 ≤ x 1 , x 2 ≤ 1. Variable x 1 clearly influences the output variable y more significantly than it does x 2 .The L values for e 1 = 1 0 and e 2 = 0 1 , which correspond to x 1 and x 2 , respectively, are For these L values, L(e 1 ) >> L(e 2 ).This example shows that the operator L provides a measure of the contribution of a variable to an output variable.
For practical applications, Equation ( 1) can be approximated as where N is a total number of samples of x k over a feasible domain With this metric, the most important variable is represented by The maximizer can be obtained by performing singular value decomposition (SVD) of the matrix J J T , where We denote the resulting form of the singular value decomposition as where r) are submatrices of V and Σ, and The diagonal matrix Σ contains magnitudes of L in descending order corresponding to each column vector of V.The first column vector of V maximizes the metric L (i.e., the maximizer of Equation ( 3)), and the second column vector of V maximizes the metric L over a space perpendicular to the first column vector of V. Consequently, the last column vector has a minimum value of L. Therefore, primary coordinates can be selected out of V in which y varies significantly by looking at the values of Σ; these coordinates can denote V 1 as the primary component.The criteria can be set to select V 1 so that the cumulative sum of the percentage of Σ 1 is at least 99.5%, which means that 99.5% of the magnitudes of L can be explained by primary component V 1 , and Σ 2 is negligible [15].

Application to an Optimization Problem
Consider an optimization problem, where f is a differentiable function for ∀x ∈ R n .We define new variables of z such that z = V T x.The ith element of z is the projection of x onto the ith column vector of V. Note that x can be expressed as , the optimization problem can be searched over a subspace spanned by the column vectors of V 1 rather than over the entire R n space.In other words, neglecting the influence of z 2 in the optimization problem becomes possible because the magnitudes of the directional derivatives along the column vector of V 2 are small.The new optimization problem in a lower dimension then becomes For some cases, it is worthwhile to retrieve the important variables in the original design space from z 1 because z 1 is a linear combination of design variables; thus, z 1 may not have a direct physical meaning.The significant design variables in the original space can be retrieved by examining the column vectors of V 1 .For example, when the jth component of a column vector of V 1 is more predominant than the others, it means that the jth component of x contributes to the output significantly.The retrieving process may increase the dimension of the design space from r to m(≥ r).
The final optimization problem with the selected m variables in the original design space becomes min where E ⊂ R m .The proposed method is very similar to that of principle component analysis (PCA).PCA is also a dimensionality reduction technique commonly used in statistic and machine learning.The method aims to reduce the data's dimensionality while retaining as much relevant information as possible [22].However, it differs significantly because (1) the matrix to be composed by SVD contains information of the output variable, and PCA does not, and (2) the proposed method is developed from deterministic points of view, and PCA is based on random variables.It also differs from the partial least squares method (PLS).PLS is a multivariate statistical technique used for modeling the relationship between a set of independent variables and a set of dependent variables [23].PLS is commonly used in regression analysis and classification tasks to establish predictive models, but it assumes a linear relationship between input and output variables.In contrast, SVD does not inherently assume linearity.Additionally and more importantly, our proposed method can be applied to an optimization process as described here, and PCA and PLS cannot [24,25].

Simplified Energy Consumption Modeling Approach
The introduced variable selection algorithm requires calculating the Jacobian matrix of the objective function at multiple points in a design space (see Equation ( 4)).This method would take a tremendous time with approximately (n variables) × (N design points) × (T annual simulation times per a design point).To reduce the time required for generating the series of Jacobian matrices, the modified degree day method is proposed.The degree day method is easy to implement and requires only basic weather data, but it has limitations because it assumes a linear relationship between temperature and energy use, and other factors such as insulation level, building design, and occupant behavior can also significantly affect energy use.Given the robust correlation between degree day data and energy consumption in residential buildings, and coupled with the relatively limited influence of occupant behavior, the degree day method continues to be a valuable and straightforward tool for energy modeling in the residential sector.The proposed method is based on the following assumptions and modifications to the existing degree day method, and the detailed process is shown in Figure 2: • The actual degree day varies with the chosen design variables.Consequently, the balance temperature must be calculated for each design combination; • The modified ANAGRAM method is used to shift the diagonal linear regression models to the origin based on daily average degree days to separately predict heating, cooling, and fan energy consumption; • To find an appropriate number of data samples to yield an accurate regression model, the recursive least square (RLS) algorithm is used; • A simplified model is constructed specifically to replace energy simulation software during the variable selection phase.This model is employed for the sole purpose of identifying which variables significantly influence the output.Therefore, the model must be sufficiently accurate to discern these influential trends.Suppose we want to evaluate annual energy consumption  for a residential building at a design point , which consists of building design parameters.The role of building an energy simulation model, denoted as  , that maps  to  , is  = (, ) , where  represents any other inputs that influence , such as weather data.The goal is to find a function that approximates  .We adopt the degree day method for its simplicity and Suppose we want to evaluate annual energy consumption y for a residential building at a design point x, which consists of building design parameters.The role of building an energy simulation model, denoted as f , that maps x to y, is y = f (x, w), where w represents any other inputs that influence y, such as weather data.The goal is to find a function that approximates f .We adopt the degree day method for its simplicity and widespread use in predicting energy consumption for a long time period for residential buildings.More precisely, it can be represented as the following relationship over a heating season, for example: where y P is the heating energy consumption for the period of [t 0 , t 0 + P], and α is a buildingspecific coefficient.The variable DD P is the number of heating degree days for the period, and (T a , T b ) are the ambient and balance temperatures of the building, respectively.Note that once α and T b are identified, the annual energy consumption for heating can be easily predicted by only considering the weather data.In other words, when α and T b are known, annual heating energy consumption can be predicted without simulating a detailed building energy model.The time-invariant parameters of α and T b depend on x.
To estimate them, a regression approach is used with samples of y P obtained from building energy simulations for selected days.The final form of our simplified model is where α and Tb are the estimated values of α and T b at a design point of x.After the simulation studies for residential buildings, a day is selected as P (i.e., P = 1 day).A natural question regards the selection of an appropriate sample size to accurately estimate the parameters; that is, the question of how many daily simulations are needed to accurately estimate α and T b .This problem can be handled by employing the RLS method because it calculates confidence intervals for estimated parameters as each sample is updated.Because the original model, Equation (8), is a nonlinear function with respect to the parameters, it was reformulated as follows: where θ 1 = α, and θ 2 = αP(T b − 18).The equation DD P,18 = t 0 +P t 0 (18 − T a (τ))dτ repre- sents the number of degree days with the assumption of an 18 • C balance temperature [9].
The model structure, shown in Equation (10), can be readily used with the RLS method because of its linearity.Once the parameters of θ 1 and θ 2 are estimated, the balance temperature can be retrieved by . As an initial dataset, 6 days from the weather file are selected and simulated; these include the heating design day as well as 5 other days, which are uniformly distributed and are found by preprocessing a typical meteorological year file at the location.Next, θ 1 and θ 2 are calculated from the six samples.Then, they are inserted into the RLS method as the initial guess.Another day simulation is implemented, and those parameters are updated.This process is repeated until the parameter covariance goes below a threshold.For a description of the RLS method, refer to Appendix A. Because heating energy, cooling energy, and fan energy consumption need to be considered to predict total annual energy consumption, separate models for heating, cooling, and fan energies are developed using the same methodology as used here.For brevity, in this paper, the equations for cooling and fan energy are omitted.
To calculate ∂ f /∂x i (x), two simplified models at two design points of (x, x + ∆x i ) are constructed.The process is repeated for each variable and each data point to construct a series of Jacobian matrices (shown in Equation ( 4)).

Optimization with the Significant Variables and Sequential Search Method
After identifying significant variables using the variable selection method with the simplified modeling approach, an optimization routine can be implemented with those variables while fixing the remaining variables, which are insignificant, to reference values.In the current case study, a discrete binary version of the particle swarm optimization (PSO) methodology [26] is adopted.This algorithm is selected based on its ability to efficiently explore the design space and arrive at an optimal solution to overcome the problems of the continuous version that cannot handle discrete variables.To prevent fast convergence to a local optimum, the inertia PSO version [27] is used.The results that are discussed in Section 4 demonstrate that it is possible to get close to the optimum point when using only those significant variables that were identified using the variable selection method.Once an optimal point is found with the significant variables, there may still be a chance to get closer to the true optimum point by perturbing the insignificant variables, although their contributions are expected to be small.One point of clarification is that the variables that are identified as insignificant are not necessarily unimportant; they just do not demonstrate a significant contribution within the scope of the optimization study, which is why they are excluded in that phase.However, to further improve the final result, a sequential onedimensional optimization approach is used.This approach optimizes the first variable in the group of insignificant variables while fixing the others.Once the process is terminated, the second variable in that group is optimized while fixing those that remain.This process is repeated until the last variable is optimized.The order of the sequential search approach may be important because variables are sorted according to their contributions to the cost function.Therefore, the order of variables in the sequential search approach follows that determined in the variable selection process.

Case Studies
To validate the developed methodology, two cases studies for residential buildings were selected.Two representative residential building types in the US were chosen, and design variables were selected from typical variables that are often considered in the engineering design stage.

Descriptions of Case Study Buildings
To develop detailed building models, the energy simulation software EnergyPlus was used.The buildings are a typical slab-on-grade residential house and a house with a heated basement.They were referenced from the Pacific Northwest National Laboratory residential prototype [28] and were assumed to be located in Indianapolis, Indiana. Figure 3 shows the model dimensions of the houses.

Descriptions of Case Study Buildings
To develop detailed building models, the energy simulation software EnergyPlus was used.The buildings are a typical slab-on-grade residential house and a house with a heated basement.They were referenced from the Pacific Northwest National Laboratory residential prototype [28] and were assumed to be located in Indianapolis, Indiana. Figure 3 shows the model dimensions of the houses.The first house has a gross floor area of 334.6 m 2 with two different spaces, which include a living space and attic.The living space is the only conditioned zone, and the net conditioned area is 223.1 m 2 .The entry doors are located on the south and north sides of the building, and two windows are placed on each side of the building.The house is assumed to have three bedrooms and three bathrooms.The second house has a gross floor area of 446.1 m 2 .The house has a living space, attic, and heated basement.The prototype complies with the 2012 International Energy Conservation Code with modified construction layers for optimization purposes.The living space is the only conditioned zone, and the net conditioned area is 334.6 m 2 .The house has four bedrooms and four bathrooms, and the basement is placed completely below the ground level.

Defining Design Variables for Optimization
To compare the results of the proposed method with those of the full optimization, we chose a relatively small number of design variables in this paper.The following considerations were made in defining the design variables for the case studies: select elements that may have (1) a high impact on building energy consumption, (2) a strong influence in the construction cost, and (3) energy-saving or cost-saving benefits, but whose influence is not strictly known beforehand.Based on these considerations, 12 different construction design variables were selected to investigate their effect on the LCC of the residential building.The first eight variables were related to the building envelope, and the remaining variables were associated with the heating, ventilation, and air-conditioning (HVAC) system.
Commercially available products associated with those design variables are listed in Table 1.For example, for roofing material, one can take either asphalt shingles, a metal surface, or a concrete tile roof.Design variables of the wall core can be a given stud dimension and filled with any type of insulation shown in the table.Structural insulated panels or insulated concrete forms with various thicknesses were also considered to be different wall core materials for this study.The under-floor insulation was used only for case study 1, and the foundation wall insulation was used only for case study 2. The total number of possible combinations, ∏ n i=1 N i , is around 60 million for each case, where N i is the number of options for the ith variable.Notably, it can be reduced significantly to ∏ r i=1 N i through variable selection, because the reduced dimension r is generally much smaller than the original dimension n.For the case studies, n = 12, r = 5 (see the later discussion), and, thus, the total number of combinations was reduced to around 700. Natural Gas Furnace Efficiency (4) 80%, 85%, 90%, 95% 1 IP stands for imperial units; 2 SEER represents Seasonal Energy Efficiency Ratio.

Optimization Objective Function
We considered an LCC over a 20-year time horizon as the objective function, as shown in Equation (11).To estimate realistic construction costs, the material cost, labor, overhead, and profit were all considered.The primary tool used for estimating construction costs is RSMeans [29].RSMeans is a construction cost database that is widely used and respected in the United States.It provides comprehensive, up-to-date, and reliable information related to construction costs, materials, labor, and equipment cost.To account for regional variations in construction costs, a location influence factor (I f ) is used, and the I f of 92.5% (applicable for Indianapolis) is multiplied by the national average cost.HVAC equipment costs were modeled using multiple linear regression to fit cost data taken from online equipment suppliers relative to both system capacity and efficiency.The discount rate was assumed to be 3%, and the modified uniform present value (UPV * ) factors were taken from the annual supplement to the NIST Handbook [30].The UPV * factors are energy price projections for different fuel types based on the US Department of Energy/US Energy Information Administration regional projections.
Here, the following pertains: LCC-incremental LCC of given building system; I f -influence factor of the location; C Con -construction cost; C HVAC -HVAC equipment cost; C Elec -electricity cost; C NG -natural gas cost; UPV * Elec -UPV* factor for electricity cost; UPV * NG -UPV* factor for natural gas cost.

Validation Methodology
To validate and compare the accuracy and efficiency of the proposed methodology, a full optimization with all design variables was first performed.The full optimization was intended to minimize the LCC using all 12 variables.Following the full optimization study, the proposed methodology was implemented.The required data size, N in Equation ( 4), may be automatically selected by tracking the convergence behavior of singular values as N increases.However, for the feasibility study of the proposed method, a fixed size of data (i.e., 100 data points) was generated to identify significant variables.
After finding the significant variables using the variable selection method, the remaining variables were fixed to the cheapest material, and the optimization process was carried out on the significant variables.For more detailed information about the initial setting of variables, refer to Appendix B. Finally, the best values of the remaining insignificant variables were determined using a sequential search method.
The inertia weight version of the binary PSO algorithm was used for both full and proposed optimizations.Based on the suggestions of Parsopoulos and Vrahatis [31], the swarm size for the PSO algorithm and the neighborhood size of a von Neumann topology were set as 64 and 8, respectively, for the full optimization, and were 36 and 6, respectively, for the proposed method.

Variable Selection Results
Figure 4 shows the singular values of the Gramian matrix (see Equation ( 4)) in descending order.The singular values are scaled with respect to the sum of all singular values.For both case studies, the first four coordinates are selected based on the fact that their cumulative sum of singular values is at least 99.49% and 99.46% with respect to the sum of all singular values.In other words, using only the first four coordinates, 99.49% and 99.46% of data can be captured for case studies 1 and 2. Because the variables are discrete, it is convenient to find the significant parameters in the original design space using Equation (7).Table 2 displays the singular vectors of the first four coordinates with the higher values shown in bold, indicating the variables that significantly contribute to the corresponding coordinates.For example, external wall siding material and wall core type make the strongest contributions to the first and second singular vectors, and roofing materials, air-conditioning (AC) speed, and heat recovery type are the primary contributors to the third singular vector.Through this process, 5 out of the 12 design variables, including roofing material, external wall siding material, wall core, AC speed, and heat recovery, were identified as significant variables in both case studies.Although the same five design variables are selected for both case studies, this choice does not necessarily imply that these five design variables are significant variables for all cases.The results will vary based on the specified design parameter values and regional factors, including construction cost and equipment cost, as well as weather conditions.

Simplified Energy Consumption Model
To ensure that the simplified modeling approach is sufficiently accurate for the purpose of the variable selection, the variable selection algorithm is also implemented using the full annual energy model for comparison.Then, the first four singular vectors obtained from the simplified model, denoted as V s = [v s,1 , v s,2 , v s,3 , v s,4 ], and from the full model, denoted as ], are compared.Table 3 shows the matrix of V T s V f for the case 1 and 2 buildings.If the simplified modeling approach is accurate, v s,i T v f,j ≈ ±δ i,j where δ i,j is the Kronecker delta.That is, the diagonal elements of V T s V f must be close to either 1 or −1, and the off-diagonal elements must be close to 0. Notably, the 0 property arises because of the orthonormal characteristic of singular vectors.The results satisfy the criteria, which demonstrates that the resulting significant variables from the simplified modeling approach are very similar to those of the full model for both case studies.

Optimization Result Comparison
To compare the accuracy of the integrated optimization methodology, the optimized LCCs were compared with the full optimizations, and they are shown in Table 4.Because it is of interest to see the improvement resulting from incorporating the sequential search method, the optimization results with only the selected variables (i.e., without the sequential method) were also included.The results show that the minimal LCC errors arising from only using the subset of design variables were around 4% for both case studies.By further employing the sequential search method, the errors were brought to within 0.05% (USD 27) and 0.06% (USD 35) for a 20-year horizon.
Table 5 shows a comparison of the optimized designs using full optimization and the proposed methodology.In case study 1, the difference was caused only by the roof eave overhang depth and attic insulation material.For case study 2, roof eave overhang depth and air conditioner seasonal energy efficiency ratio (SEER) rating caused the difference.As expected from the variable selection algorithm, the variables are insignificant in both cases.
All numerical experiments were run on a desktop computer with a 6 core 3.10 GHz CPU, 6 GB RAM, and Windows (64-bit) operating systems.For both cases, the variable selection process took a majority of the total optimization time.However, the variable selection process without introducing the simplified modeling approach took 13.5 for case study building 1 and 9.98 h for case study building 2. This time demonstrates the efficiency of the proposed modeling methodology.The full optimization process required approximately 33 and 29 h, and the proposed methodology required 8.7 and 5.7 h for each case.In other words, the proposed optimization approach achieved 74% and 84% computational time savings.These significant reductions were mainly because of the reduced design space and the number of evaluations requested by the optimization algorithm.

Conclusions
To make building design optimization feasible, a novel integrated approach is presented in this paper.This work includes a method to reduce the dimensionality of a design space, which aims to identify only a few significant variables that need to be considered as optimization variables.The method formally defines a measure of significance and uses the SVD to order variables based on the metric using sample data.A simplified modeling methodology to reduce the time needed to generate samples for the variable selection process is introduced.The developed methodology is applied to case studies for residential buildings, and the results are compared with those of the full optimization process over the entire design space.
In summary: • Using the variable selection process, significant variables (5 out of 12) that demonstrate the strongest contribution to the optimization study are identified; • The proposed methodology significantly shortens the time requirement for the optimization process in the two case studies of 74% and 84%, and the optimized LCC is within 0.05% and 0.06%, respectively, of the optimum point.
These findings not only validate the effectiveness of the integrated approach, but also highlight its potential to streamline and expedite the design optimization process while maintaining a high degree of accuracy in the results.

Buildings 2023 , 18 Figure 2 .
Figure 2. Process of the simplified energy consumption model's development.

Figure 2 .
Figure 2. Process of the simplified energy consumption model's development.

Figure 3 .
Figure 3. Dimensions of the case study houses.The first house has a gross floor area of 334.6 m 2 with two different spaces, which include a living space and attic.The living space is the only conditioned zone, and the net conditioned area is 223.1 m 2 .The entry doors are located on the south and north sides of the building, and two windows are placed on each side of the building.The house is assumed to have three bedrooms and three bathrooms.The second house has a gross floor area of 446.1 m 2 .The house has a living space, attic, and heated basement.The prototype complies with the 2012 International Energy Conservation Code with modified construction layers for optimization purposes.The living space is the only conditioned zone, and

Figure 3 .
Figure 3. Dimensions of the case study houses.

Figure 4 .
(a) Result of singular values in case study 1.(b) Result of singular values in case study 2.

Figure 4 .
Figure 4. (a) Result of singular values in case study 1.(b) Result of singular values in case study 2.

Author
Contributions: Y.B.: Model development and simulation, data analysis, writing-original draft, writing-review and editing.D.K.: model development, writing-original draft.W.T.H.:

Table 1 .
Design variables and values.

Table 2 .
Singular vectors.3 3Significant variables in both case studies are highlighted in blue in the first column.

Table 3 .
Correlation matrices between singular vectors obtained from detailed and simplified models 4 .

Study Building 1 Optimal Design with Full Optimization Optimal Design with the Proposed Method
140 mm (2 × 6 in.) studs at 600 mm (24 in.) on center filled with loose fill cellulose insulation 38 × 140 mm (2 × 6 in.) studs at 600 mm (24 in.) on center filled with loose fill cellulose insulation Under-Floor Insulation Expanded polystyrene 25.4 mm (1 in.)Expanded polystyrene 25.4 mm (1 in.) 6The differences are shown in blue rows in the table.