Next Article in Journal
Business Model as a Concept of Sustainability in the Banking Sector
Next Article in Special Issue
AI-Based Campus Energy Use Prediction for Assessing the Effects of Climate Change
Previous Article in Journal
Exploring the Potential of Climate-Adaptive Container Building Design under Future Climates Scenarios in Three Different Climate Zones
Previous Article in Special Issue
Effect of the Built Environment on Natural Ventilation in a Historical Environment: Case of the Walled City of Famagusta
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Artificial Neural Network-Based Residential Energy Consumption Prediction Models Considering Residential Building Information and User Features in South Korea

1
Department of Architecture, Sejong University, Seoul 05006, Korea
2
Department of Architecture, Yeungnam University, Gyeongsan 38541, Korea
*
Author to whom correspondence should be addressed.
Sustainability 2020, 12(1), 109; https://doi.org/10.3390/su12010109
Submission received: 10 December 2019 / Revised: 16 December 2019 / Accepted: 17 December 2019 / Published: 22 December 2019

Abstract

:
When researching the energy consumption of residential buildings, it is becoming increasingly important to consider how residents use energy. With the advancement of computing power and data analysis techniques, it is now possible to analyze user information using big data techniques. Here, we endeavored to integrate user information with the physical characteristics of residential buildings to analyze how these elements impact energy consumption. Regression analysis was conducted to accurately identify the impact of each element on energy consumption. It was found that six elements were influential in all seasons: the number of exterior walls, housing direction, housing area, number of years occupied, number of household members, and the occupation of the household head. The elements that had an impact in each period were then derived. Based on the results of the regression analysis, input variables for the training of an artificial neural network (ANN) model were selected for each period, and residential energy consumption prediction models were implemented based on actual consumption. The elements identified as those affecting energy consumption, through regression analysis, can be used for implementing prediction models with advanced forms. This study is significant in that we derived influential elements from an integrative perspective.

1. Introduction

In 2019, the World Green Building Council reported that the energy used by buildings accounted for 30% of the world’s total energy consumption, with residential buildings representing the highest proportion (22%) [1]. The figures are similar for South Korea; the residential sector represents 17.1% of total energy consumption, with electric energy being responsible for 38.8%. This demonstrates the necessity of preparing energy-saving measures for residential buildings [2].
To reduce energy consumption in buildings, various systems for managing energy have been introduced (e.g., the building energy management system (BEMS)), and energy-saving measures have been prepared (e.g., improving physical performances related to building energy). In South Korea, such energy-saving measures and their application scope have been gradually expanded to residential buildings. In residential buildings, energy consumption is significantly affected by the attributes or sociology of the inhabitants, as well as the performance of the building. Therefore, the importance of researching energy saving using user information is increasing [3]. Moreover, it is necessary to effectively conduct sustainable energy utilization by preparing an energy reduction plan that considers user features.
Thus far, research on how the physical environment affects energy consumption has been intensively conducted; however, it is difficult to collect a large amount of user information owing to its personal nature. It is also difficult to quantitatively reduce and analyze the information on individuals. Nevertheless, the analysis of qualitative data has become possible through national surveys and new methodologies that are capable of nonlinear analysis, such as artificial neural networks (ANNs). In this study, the energy consumption in residential buildings in South Korea was investigated, by integrating the physical characteristics of residential buildings with information on occupant behavior and characteristics, to identify elements that affect residential energy consumption. The elements that were found to be influential were subsequently used to construct energy consumption prediction models.

2. Literature Review

Most studies on the energy consumption of residential buildings have focused on physical characteristics, and those that considered user behavior only considered one or two user elements. However, user information cannot be defined with such few elements; therefore, studies that reflect user information from various perspectives are required. In this study, elements that affect energy consumption were derived considering both the physical characteristics of buildings and user information.

2.1. Effects of Physical Building Characteristics on Energy Consumption

Kim [4] reported that annual power consumption varied depending on the housing type, construction year, number of floors, building structure, and building location. It was found that detached houses with relatively less energy-related facilities consumed more energy than apartments, as did old buildings and those with fewer floors. Kim et al. [5] discovered that the residential area, heating method, number of floors, and building direction all affected heating energy consumption for 181 apartment complexes in Seoul. Similarly, Eum et al. [6] reported that the construction year, household area, building direction, and heating method affected energy consumption in 21 apartment complexes in Daegu, which is located in central South Korea. In another region in central South Korea—Gyeongbuk—Lee and Chae [7] proposed that the main elements affecting energy consumption were the heating method and exterior walls and windows.

2.2. Effects of User Sociology on Energy Consumption

Van den Brom et al. [8] analyzed the actual energy consumption data of 14,000 households to examine how user features affect energy consumption. Schipper et al. [9] and Noh [10] mentioned that the relationship between energy consumption and users varies because the method and pattern of using energy are different depending on a user’s sociology [9,11].
Kim and An [12] considered the types of Korean users and reported that energy consumption was higher as income increased, because users with higher income were more sensitive to changes in their surroundings. Conversely, the energy consumption of users with relatively lower income was more affected by the physical attributes of their buildings, such as building age, than by changes in their surroundings [9].
In residential spaces, energy consumption varies depending on the residence time and lifestyle of household members. Kim et al. [5] proposed that the occupation, type of household members, and number of household members were influential elements [8]. In the case of occupation, office workers consumed more energy than the self-employed. In addition, members with longer residence times consumed more power [13].
Seo et al. [14] reported that power consumption tendency differed depending on the occupants’ income level and the type of residents. Single-person households and families without children consumed less energy, because they were socially and economically more active and thus spent less time in their houses. However, energy consumption increased upon childbirth, when their residence time and economic activity patterns changed [15].

3. Materials and Methods

Figure 1 shows the research process of this study. First, data from the Household Energy Standing Survey were integrated with the annual regional temperature data of the Korea Meteorological Administration. Second, multiple regression analysis was conducted using the integrated data to obtain the elements affecting energy consumption in each season. Elements found to be influential were then used as input data to construct an ANN model and implement energy consumption prediction models. To accurately derive the elements and construct the prediction model, regression analysis and prediction model construction were conducted for five periods: spring, summer, fall, and winter, as well as annually.

3.1. Household Energy Standing Survey Data

The user data used in this study were collected from the 2016 and 2017 Household Energy Standing Survey, which is conducted annually by the Korea Energy Economics Institute. The survey respondents were from 2520 households in 16 cities and provinces and 3 metropolitan cities, with data on 19 physical housing elements, 14 heating, cooling, and cooking elements, and 14 household and household member elements. The survey also provided the monthly consumption data from 18 energy sources, including general electricity, midnight electricity, and total electricity. It was found that the earlier data were not suitable for integrating and analyzing multi-year data because of limitations in way the composition of items and code disclosure scope were recorded. Therefore, 5040 items of data from 2016 and 2017 were used in this study.

3.2. Seasonal Characteristics in South Korea and their Effects on Energy Consumption

South Korea has four distinct seasons; consequently, different home appliances and energy sources are used in each season. The seasons also affect the length of time residents spend indoors. In spring and fall, residential energy consumption is low, because there are many clear and dry days under the influence of migratory anticyclones. In summer, cooling-related energy consumption increases because the weather is hot and humid under the influence of the North Pacific anticyclone. In winter, heating-related energy consumption increases because the weather is generally cold and dry under the influence of the continental anticyclone [16].
Figure 2 shows the monthly average temperature data of 16 regions in South Korea in 2016 and 2017. In Seoul and Kyunggi-do, the cities with the highest population densities in South Korea, the average annual temperatures were approximately 30.8 and 29.5 °C in 2016, and 31.2 and 30.5 °C in 2017, respectively. The lowest temperature was observed in Kyunggi-do in January 2016, and the highest in Gwangju in July 2017.
Figure 3 shows the seasonal energy consumption data of 4943 households, excluding outliers. It was found that energy consumption was highest in winter. Moreover, energy consumption was high in spring and winter when relatively lower temperatures were observed compared to other seasons. This indicates that heating-related energy consumption significantly affects the total energy consumption of buildings.
Analysis was conducted for each of the four seasons to reflect the influence of energy sources and energy-using devices in each season. Annual energy consumption was examined to identify overall influential elements.

3.3. Derivation of Significant Elements through Multiple Regression Analysis

Machine learning techniques, such as ANNs, can estimate prediction results when trained to predict nonlinear elements. However, it is difficult for them to determine the influence of these elements [17]. In this study, regression analysis was conducted to examine the influence of individual elements and to derive influential elements. The elements found to affect energy consumption were then used as input data to implement the prediction model.
The multiple regression analysis was conducted using SPSS180.0 statistical software. The coefficient ( b n ) of each variable and the constant term ( a n ) of the model were estimated by applying the seasonal energy consumption to the dependent variable ( Y k ) and substituting the physical characteristics of the building, household characteristics, and seasonal local temperature into independent variables ( x n ), as shown in Equation (1):
Y k   =   a n + b 1 x 1 + b 2 x 2 + + b n x n .

3.4. Prediction of Energy Consumption Using ANNs

According to Neto and Fiorelli [13], simulations and ANNs are both efficient for predicting energy consumption; however, each has its own benefits and drawbacks. In the case of simulation, it is possible to input the operating hours of home appliances by the user, however there are limitations in inputting other detailed information. In the case of ANNs, energy consumption can be predicted by quantifying nonlinear user information [13]. In this study, the ANN method was considered more appropriate to use because a large amount of user information could be integrated and considered.
ANNs are machine-learning algorithms proposed by McCulloch and Pitts [18]. When they were first proposed, implementing the learning models was complicated and there was lack of clear connection between the input and output data. Nevertheless, these issues were solved with the development of deep neural networks (DNNs) that combined the backpropagation algorithm with multi-layers with multiple nodes [19]. Owing to the proposal of new functions and algorithms, as well as improved computer hardware, it is now possible to implement DNN models with one or more hidden layers. DNN models can analyze vast amounts of elements that were previously overlooked for analysis, and it is now possible to implement prediction models to derive outputs through the weight of each node [20].
Figure 4 shows the basic structure of the ANN model described by Equation (2). R is the number of input variables and S is the number of hidden neurons. p represents each input variable, b each hidden layer, and w its weight. The weight of each calculated element is used as the input of the activation function. The output is derived through the sum of the weighted values [15]. The activation function utilized the most commonly used sigmoid function:
n k h   =   j = 1 R w k j h p j + b b k k   =   1   t o   S .
When an ANN prediction model is created, hidden nodes and layers must be constructed, as shown in Figure 5. As there is no clear standard or methodology for this, it is necessary to find the optimal model with the lowest mean squared error (MSE) value after making as many attempts as possible. For successful learning, the minimum numbers of hidden nodes and layers must be larger than the number of input variables n, and the maximum number must not exceed 2n + 1 [21]. In this study, the ANN model was examined while the number of layers was increased from 1 to 9. In addition, for each number of layers, five cases were created with the minimum number of hidden nodes, the maximum number of hidden nodes, and three intermediate numbers. The model with the lowest MSE, i.e., the highest model performance, was examined.
To verify the performance of the model, three methods were used to construct the household member information input data, and the performance of the three cases were compared. As previously mentioned, influential elements were found through regression analysis. The performance of models using the influential elements as input data were compared to a model that used all the original variables as input data, as well as to a model that used the four household member data derived by a number of previous studies as input data.
The ANN prediction model for residential buildings in South Korea by Whaid and Kim [22] was implemented using 14,260-hour data of 20 apartment complexes in Seoul. Their model used the physical elements of the buildings that had optimal MSE values as input data. Research on a prediction model that integrates user information with the model based on such physical attributes is required.
Lee et al. [15] derived the energy consumption from the same buildings according to the behavioral patterns of the users, and implemented a DNN model for predicting energy consumption through six elements, i.e., gender, age, occupation, income, education level, and length of residency. Their prediction model exhibited 64% accuracy, indicating that the six elements had an impact on energy consumption. As the energy consumption was derived through simulation rather than records of actual energy consumption, the influence of the six elements must be verified through a comparison with the actual energy consumption. In addition, as the research was conducted based on annual energy consumption, it is difficult to determine detailed seasonal consumption and influential elements. It is also necessary to conduct research on different types of residential buildings, such as apartments and detached houses, as the research was based only on multi-family houses.
It is difficult to accurately measure data such as energy consumption in residential buildings owing to its spatiotemporal variability [23]. Data-based models are widely used because they can calculate results through repeated learning, even for cases with limited input variables. ANN techniques are actively used in various areas, as it is easy to identify relationships between different variables, and nonlinear correlations can be analyzed without analyzing the physical phenomena [24].

4. Results and Discussion

4.1. Derivation of Influential Elements

4.1.1. Analysis Process (Multi-Collinearity, Outliers, and Dependent and Independent Variables)

Prior to the analysis, among the 2520 households in 2016 and 2520 households in 2017, the following were removed as outliers: 92 households with unidentifiable data in each item, 2 households located on the 100th or higher floors, and 3 households where there were 4 or more household members aged 65 or older.
The data from all 16 regions comprised 12 physical building elements and 12 user information elements, as shown in Table 1. Regional variables were applied to the analysis to use them as proxy variables that represent the geographic characteristics, social atmosphere, economic characteristics, and annual weather of each region. Among the user information data, the gender, age, education level, and occupation were analyzed using the information about the household head due to the nature of the data.
Nominal variables were determined by regression analysis through dummy coding. The reference variables of each dummy variable are as follows. The regional variables were based on Seoul, which exhibited the largest temperature changes in 2016 and 2017. In South Korea, as energy efficiency is lowest when a building faces north, coding was performed based on northward facing buildings. Other variables were analyzed based on items for which it was difficult to derive meaningful conclusions when analysis and interpretation were conducted. Variables that were relatively few in number were not applied.
Prior to the analysis, multi-collinearity verification was performed, which could potentially reveal independent variables with high correlations. If multi-collinearity occurs, correlations among independent variables may affect the analysis and lead to wrong results. Consequently, independent variables that have a significant impact on the dependent variable may appear meaningless, or the sign of the regression coefficient may change [25]. The multi-collinearity verification showed that there was no collinearity between the variables, because the variance inflation factor (VIF) value was less than 10 for all items.
Table 2 shows that the regression analysis models for residential energy consumption in the annual period, spring, summer, fall, and winter were appropriate. The Durbin–Watson results were between 1.661 and 1.749; Durbin–Watson numbers close to 2 indicate that there is no autocorrelation. The explanatory power for residential energy consumption, which was the dependent variable of each independent variable, was found to be 12% (annual), 12.5% (spring), 14.2% (summer), 11% (fall), and 12.3% (winter). These values are low compared to previous studies on energy consumption in buildings. This appears to be because user attributes, which are sociological and humanistic elements, were included in the analysis in large quantities.

4.1.2. Elements Affecting Energy Consumption

Table 3 shows the physical elements and user attributes identified through the regression analysis as affecting residential energy consumption annually, as well as in the spring, summer, fall, and winter periods. The detailed results are given in the Appendix A Table A1, Table A2 and Table A3 the dummy-coded variables were analyzed through a comparison with the reference group. It was determined that energy consumption increased by the value of the non-standardized coefficient (B) when each of the continuous variables increased by 1.
In addition to the regional variables, it was found that among the physical elements of the building, the number of exterior walls, housing direction, and housing area were influential in all seasons. Among the user attributes, the number of years occupied, number of household members, and occupation of household head were found to be influential in all seasons. Figure 6, Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11 show the annual and seasonal energy consumption distribution charts for these six elements.
Figure 6 shows the seasonal energy consumption distribution charts for the number of exterior walls. Exterior walls refer to walls that directly face the outside air. The number of exterior walls is a continuous variable, from 0 (for the basement) to 6. As shown in the figure, energy consumption showed a tendency to increase as the number of exterior walls increased, because exterior walls are vulnerable to insulation and release heat to the surroundings [19].
The housing direction is the direction that the front of the building faces. In South Korea, it is usually determined based on the position of the living room [26]. This is because Korean users spend a considerable amount of time in the living room. In South Korea, north-facing walls have the lowest daily average solar radiation, resulting in low energy efficiency. Hence, houses are usually designed to avoid the north. The reference variable therefore considered buildings in a north facing direction. The analysis results in Figure 7 show that energy consumption was highest in the northwest direction and lowest in the south direction. This indicates that the northwest direction is more vulnerable to residential energy consumption than the north direction, which is the reference variable, in South Korea.
In South Korea, the “pyeong” unit is usually used in calculating the area of a building. 1 pyeong corresponds to 3.3 m2. Five groups were prepared with intervals of 33 m2 (10 pyeong) to analyze the effect of housing area. As the area relates to the volume of the building and therefore the volume of internal air, buildings with larger areas are more vulnerable to energy consumption [27]. Figure 8 shows that energy consumption generally increased as the building area increased.
Time-related variables, such as construction year and number of years occupied, may form a curved distribution. In such cases, general regression analysis for analyzing linear samples may not produce significant results. Nevertheless, regression analysis can be used if the data is converted to a linear distribution by squaring the value of each point. In the case of the construction year, significant results were not obtained. However, for the number of years occupied, a point of inflection appeared between 4 and 6 years, as shown in Figure 9, indicating that the variable had a curve-type distribution. This suggests that energy consumption increases with an increase in number of years occupied up to 4–6 years, but decreases afterward.
Energy consumption increases as the number of household members increases, because there are more users that directly consume energy. As shown in Figure 10, energy consumption showed a tendency to increase as the number of household members increased in the annual period and all seasons. Moreover, the number of household members appear to be more influential in winter and spring, when energy related to heating and hot water are consumed more frequently, than in summer, when cooling-related energy is consumed.
For ordinary households in South Korea, the household head is responsible for all household members. The occupation of the household head was applied as occupational data in this study; the occupation was classified as regular employees, temporary employees, business owners, and others. The analysis results showed that business owners had an influence on energy consumption in most periods as shown in Figure 11. This is because many business owners in South Korea use homes for business and residence, and thus their residence time is longer than that of other occupations.
In addition to the six common influential elements, the elements that had an influence in each season are as follows. The temperature was found to be more influential in spring and fall when considered in a comprehensive manner. Detached houses were found to be more vulnerable to higher energy consumption in all periods other than spring. Energy consumption showed a tendency to increase for all seasons as buildings became older, but the influence of building age was lower than that of other elements on an annual basis. The number of exterior walls was only influential in winter and annual periods, while the main heating method was influential in the spring, summer, and fall. The cooling method was influential in summer, fall, and the annual period, and energy consumption was affected by the air conditioner set temperature in fall and winter. Occupant gender had an influence on energy consumption in fall, winter, and the annual period, and the unusual features of household were influential in summer and winter. The annual gross income was influential in all periods except fall. Several elements exhibited an influence in specific seasons. In summer, energy consumption varied depending on the number of bedrooms and the number of household members aged 65 or older. In fall, it varied depending on the composition of household members. In winter, the number of floors, the age of household head, and the education level of the household head were found to be influential.

4.2. Energy Consumption Prediction Model

4.2.1. Input/Output Data

The ANN model was constructed for five periods (annual, spring, summer, fall, and winter). Elements that were found to be influential in each season were selected as input variables for the analysis. In the case of nominal variables, the ANN model was constructed by including the reference group as input variables after performing dummy coding. When the input variables of each model were examined based on the significant elements, it was found that the number of input variables were as follows: 42 for the annual model, 38 for the spring model, 46 for the summer model, 47 for the fall model, and 48 for the winter model.
As shown in Table 4, 4943 data were used for the analysis, excluding outliers. Among them, 3461 data were used for training, 741 for validation, and 741 for testing.

4.2.2. Hidden Layer and Node

Table 5 shows six cases with the highest model performances for each seasonal prediction model according to the number of layers. Gradient vanishing, in which prediction models cannot be used when the number of layers exceeds the number presented in the table, occurred.

4.2.3. ANN Simulation Result

Figure 12 shows the R-values of the training, validation, and test data of the highest performing annual energy consumption prediction model. The R-value was found to be 0.25745 for the training data, 0.30359 for the validation data, and 0.27886 for the test data. The MSE value was 10 7 × 2.5250. This model had one layer and 70 nodes. It was found that the neural network (NN) model exhibited the highest performance.
Figure 13 shows the R-values of the training, validation, and test data of the highest performing spring energy consumption prediction model. The R-value was found to be 0.39154 for the training data, 0.28441 for the validation data, and 0.27913 for the test data. The MSE value was 10 6 × 2.5981, and the model had two layers and 24 nodes. It was found that the DNN model with two layers exhibited the highest performance.
Figure 14 shows the R-values of the training, validation, and test data of the highest performing summer energy consumption prediction model. The R-value was 0.38204 for the training data, 0.31832 for the validation data, and 0.27092 for the test data. The MSE value was 10 5 × 5.1060, and the model had three layers and 16 nodes. It was found that the DNN model with three layers exhibited the highest performance.
Figure 15 shows the R-values of the training, validation, and test data of the highest performing fall energy consumption prediction model. The R-value was 0.33652 for the training data, 0.2783 for the validation data, and 0.21787 for the test data. The MSE value was 10 6 × 1.4318, and the model had one layer and 47 nodes. It was found that the NN model with one layer exhibited the highest performance.
Figure 16 shows the R-values of the training, validation, and test data of the highest performing winter energy consumption prediction model. The R-value was 0.28741 for the training data, 0.23453 for the validation data, and 0.21289 for the test data. The MSE value was 10 6 × 5.7602. This model had two layers and 24 nodes. It was found that the DNN model with two layers exhibited the highest performance.
For the annual and fall periods, the performances of the NN models with one layer were found to be excellent. However, the DNN models with two or more layers did not have significantly different MSE values. This indicates that if a sufficient amount of data from the Household Energy Standing Survey were used, the model performance would be improved and the application of DNN models will be possible.
Table 6 compares models with three input data types. The type A prediction models were constructed by conducting regression analysis on the household information elements of the original data, with the influential elements from each season set as input data. The type B prediction models were constructed using all 12 sets of household data as input data for all seasons. The type C prediction models were constructed by applying elements commonly derived as influential elements in previous studies as input data, i.e., the income, number of household members, occupation, composition of household members, etc.
To compare standards for household information, influential elements for each season were applied to models A, B, and C in the same manner as the physical elements of buildings. The same numbers of layers and neurons were also applied to types A, B, and C to control the influence of other factors except the input data of the model. The analysis results showed that type A, which used significant household information as input data in all seasons through regression analysis, exhibited the highest model accuracy. For type B, which used the original data, gradient vanishing occurred in the spring and summer models. Type C had higher prediction accuracy than type A, except in the annual period; however, there was a significant difference in prediction accuracy between the training and test models, resulting in an overfitting problem.
Based on the comparison and analysis results of each model, it was found that identifying the most significant variables through regression analysis could improve model performance, especially when variables that are difficult to quantify, such as household information, are included as input data.

5. Conclusions

An increasing number of studies have been conducted on the energy consumption of residential buildings. This study is different from previous studies in three aspects. First, this study was conducted based on the actual energy consumption of residential buildings in South Korea. Furthermore, the elements were derived by integrating the physical information of buildings with user information and reflecting mutual influence. Finally, energy consumption prediction models were implemented by dividing energy consumption into four seasons and deriving influential elements for each season.
This study found that user information has as much influence on energy consumption as the physical elements of buildings. In each season, the influence of the physical characteristics of buildings and household characteristics as well as the importance of each element was identified. The three representative elements that exhibited the highest influence in each season are as follows.
  • In spring, the building direction was found to be the most influential element, followed by the occupation and cooling method. Buildings facing northwest—the direction with the lowest annual average solar radiation—exhibited the highest energy consumption. Buildings inhabited by to business owners, who typically have longer residence times than other occupations, also consumed more energy. Households that used air conditioners for cooling consumed more energy.
  • In summer, the heating method was the most influential, followed by the cooling method and housing area. For residential buildings in South Korea, individual heating and central heating are the two representative heating methods. Households with individual heating were found to consume more energy. Heating energy was mostly concentrated on the use of hot water in summer, indicating that individual heating was more vulnerable to the use of energy related to hot water. As in spring, households that used air conditioners consumed more energy, as did households with larger areas.
  • In fall, the housing area was the most influential factor, followed by the housing type and occupation. Households with larger areas consumed more energy, as did detached houses that managed energy individually. As in spring, buildings inhabited by to business owners exhibited the highest energy consumption.
  • In winter, the building direction was the most influential, followed by the housing area and occupation. As in spring, the northwest direction with the lowest solar radiation exhibited the highest energy consumption. As the housing area increased, energy consumption increased. As for the occupation, to business owners with the longest residence time exhibited the highest energy consumption, as in spring and fall.
As seen above, different elements affected energy consumption in each season. The various elements had different influences depending on the season. This must be reflected when preparing systematic energy saving and management measures in the future. Providing data on seasonal energy usage will be possible when people with households displaying specific features will live in the target residential area. If enough information is obtained via matching these users with residential buildings, more sophisticated policies can be implemented, and greater awareness of energy use can be aroused individually. Such measures could be further developed by adding impact-reflecting factors and expanding the scope of the analysis in order to continuously reduce energy.
The suitability of the energy consumption prediction models implemented in this study was compared through a comparison of three types. For type A, a prediction model was constructed using the influential variables of each season derived through regression analysis as input data. For type B, a prediction model was constructed using the four representative user information elements derived in previous studies, i.e., the income, occupation, composition of household members, and number of household members. For type C, a prediction model was constructed using all the household information included in the original data. When the models were compared, it was found that type A exhibited the highest suitability. This indicates that prediction models with a higher performance can be implemented by verifying the influence of individual elements through regression analysis. It can then be applied to future prediction models that measure how atypical data affects energy consumption, such as the household member information. Predictive models of type B can identify influential factors and provide information that can be utilized when drafting a plan for continuous energy reduction from simple usage forecasting. Based on the influence of these individual factors, it is possible to formulate countermeasures in related fields when developing sustainable energy saving measure.
Currently, in South Korea, when evaluating the energy impact of users and buildings, the post-occupancy evaluation (POE) method is used to evaluate the energy impact of users and buildings. If the energy impact of the building and the user can be predicted using the model such as the one proposed in this study, a novel form of energy impact assessment can be conducted. Such assessments can reduce the unreasonable energy consumption of post-occupancy assessments and, furthermore, provide a way to create customized energy-saving residential spaces provided by both, the state or by individuals to create their own living environments.
In the model of this study, although the prediction rate increased by using only influential factors through regression analysis, the performance of the predictive model was limited because only two years’ worth of data were used. However, the dataset used in this study, the Household Energy Standing Survey, is conducted annually; hence, the current limitations owing to this lack of data will be eventually overcome.
The seasonal influential elements derived in this study are expected to be utilized as basic elements that can be used for further research on more accurate energy prediction if they are integrated with the information on detailed climate and building information, such as microclimate information, and information on the ownership and usage of home appliances. Such attempts will be useful as basic research to derive and predict common elements that have an influence on energy consumption on a national level, beyond residential energy research in the scope of single buildings and survey-based investigations in small areas.

Author Contributions

Conceptualization, M.K., and S.J.; Data curation M.K. and J.-w.K.; Formal analysis, M.K.; Methodology, M.K. and S.J.; Project administration, M.K.; Software, M.K.; Supervision, S.J. and J.-w.K.; Visualization, M.K.; Writing—Original draft preparation, M.K.; Writing−Review and editing, M.J., S.J. and J.-w.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (No. NRF–2018R1A2B2005528).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Results of deriving elements that affect energy consumption in the annual period and spring.
Table A1. Results of deriving elements that affect energy consumption in the annual period and spring.
AnnualSpring
Unstandardized CoefficientStandardized CoefficienttpUnstandardized CoefficientStandardized Coefficienttp
BStandard ErrorBStandard Error
item0.35720.72800.00000.0170.986430.03914.87900.0512.893*** 00.004
cityD2−1814.663364.398−00.080−4.980*** 00.000−907.482113.356−0.128−80.006*** 00.000
cityD3−1016.515400.446−00.039−2.538** 00.011−391.367126.351−00.048−30.097*** 00.002
cityD4814.586400.82000.03120.032** 00.042308.361127.42700.0382.420** 00.016
cityD5−11310.0714070.010−00.043−2.779*** 00.005−741.729126.291−00.090−5.873*** 00.000
cityD6−7080.094398.812−00.027−1.776* 00.076−300.561124.220−00.036−2.420** 00.016
cityD7433.798469.60300.0140.9240.356−83.532145.999−00.008−0.5720.567
cityD8291.769302.46100.0160.9650.335−830.08094.586−00.015−0.8780.380
cityD92138.199412.61600.0815.182*** 00.000219.506129.44800.0271.696* 0.090
cityD10−1339.813419.101−00.050−3.197*** 00.001−508.758130.348−00.061−3.903*** 00.000
cityD11−40.925376.899−00.002−0.1090.914−509.335124.624−00.071−40.087*** 00.000
cityD12271.9294190.08900.0100.6490.516−376.520138.285−00.046−2.723*** 00.006
cityD13−9540.0953770.056−00.041−2.53000.011−805.814119.814−0.112−6.726*** 00.000
cityD14−260.189346.897−00.013−0.7500.453−297.113109.483−00.046−2.714*** 00.007
cityD15−1647.383348.892−00.080−4.722*** 00.000−751.381109.655−0.117−6.852*** 00.000
cityD16−30600.076655.856−00.067−4.666*** 00.000−1345.139202.957−00.094−6.628*** 00.000
B_a1D1732.460244.13300.06430.000*** 00.00376.72175.92300.02110.0110.312
B_a1D2−734.310317.726−00.065−2.311** 00.021117.96398.80200.0341.1940.233
B_a233.17322.56100.0461.4700.1422.49470.01600.011.3560.722
B_a3−12.18622.855−00.011−0.5330.594−3.6107.105−00.010−0.5080.611
B_a4269.49077.12800.0613.494*** 00.00081.979240.00000.0603.416*** 00.001
B_a5D1−308.573421.521−00.018−0.7320.464−156.1181310.077−00.029−1.1910.234
B_a5D2173.7114700.05100.0080.3700.71251.469146.16400.0070.3520.725
B_a5D3−825.806378.932−00.073−2.179** 00.029−243.738117.820−00.069−20.069** 0.039
B_a5D4−577.1033980.033−00.041−1.4500.147−1270.065123.751−00.029−10.0270.305
B_a5D5−353.640431.583−00.019−0.8190.413−87.611134.185−00.015−0.6530.514
B_a5D635.501564.87400.00100.0630.950−42.340175.641−00.004−0.2410.810
B_a5D727400.077660.55900.0674.148*** 00.000798.281205.39700.0623.887*** 00.000
B_a6−64.80274.977−00.015−0.8640.38753.71123.30900.0402.304** 00.021
B_a7617.644135.28900.0964.565*** 00.0001800.037420.04700.0894.282*** 00.000
B_a8124.599147.85500.0160.8430.39948.55345.97500.02010.0560.291
B_a950.63922.10800.0362.291** 00.02210.9296.85400.0251.5950.111
H_a2146.845193.20900.0110.7600.447−0.575600.07700.000−0.0100.992
H_a181.18321.9990.1263.690*** 00.00031.4756.8390.1574.602*** 00.000
H_a1′−0.747.495−00.050−1.5090.131−0.4350.154−00.094−2.823*** 00.005
B_a10D1171.356295.26800.0090.5800.562393.85491.81400.0654.290*** 00.000
B_a11D1−6680.002385.133−00.055−1.734* 00.083−127.971110.536−00.034−1.1580.247
B_a126.38891.65200.00200.0700.9442.70825.29400.0030.1070.915
H_a3462.61186.7380.1035.333*** 00.000132.85426.96600.0944.927*** 00.000
H_a4−44.999124.266−00.006−0.3620.717−31.37438.633−00.014−0.8120.417
H_a5−28.372135.552−00.004−0.2090.834−39.40542.151−00.017−0.9350.350
H_a6D1−29.574396.759−00.001−00.0750.94170.830123.36700.0090.5740.566
H_a7D1341.354201.58200.0261.693* 00.09015.72662.69800.0040.2510.802
H_a81570.023102.47700.0291.5320.126220.07831.82800.0130.6940.488
H−a9D1−255.470193.122−00.022−1.3230.186−78.738600.050−00.022−1.3110.190
H_a10D1211.767250.47500.0190.8450.39852.68677.88300.0150.6760.499
H_a10D24670.0593730.08700.0191.2520.211126.9671160.00600.01710.0940.274
H_a10D3762.147262.91100.0572.899*** 00.004156.11081.75400.0381.910* 00.056
H_a11D1−430.564317.554−00.023−1.3560.175−143.32598.641−00.024−1.4530.146
H_a12192.45161.87100.0643.111*** 00.00253.37919.23700.0572.775*** 00.006
Note. * 90% Confidence interval (p-value < 0.10). ** 95% Confidence interval (p-value < 00.05). *** 99% Confidence interval (p-value < 00.01).
Table A2. Results of deriving elements that affect energy consumption in summer and fall.
Table A2. Results of deriving elements that affect energy consumption in summer and fall.
SummerFall
Unstandardized CoefficientStandardized CoefficienttpUnstandardized CoefficientStandardized Coefficienttp
BStandard ErrorBStandard Error
item1.7181.7510.0160.9810.327−6.69530.084−0.036−2.171** 0.030
cityD2−30.66453.397−0.009−0.5740.566−71.58087.106−0.013−0.8220.411
cityD3−9.14058.628−0.002−0.1560.876−204.944960.022−0.033−2.134** 0.033
cityD42360.06558.8260.06140.013*** 0.000181.35995.9660.0291.890* 0.059
cityD5340.31259.3590.0875.733*** 0.000400.27396.8960.0644.131*** 0.000
cityD647.88958.4280.0120.8200.412−75.23695.503−0.012−0.7880.431
cityD7391.71168.8830.0835.687*** 0.000255.787112.2130.0342.279** 0.023
cityD8103.61744.5920.0392.324** 0.020120.39272.3780.0281.663* 0.096
cityD9185.19560.7930.04730.046*** 0.002938.35098.907.1509.487*** 0.000
cityD10−1100.01261.396−0.028−1.792* 0.073−126.288100.152−0.020−1.2610.207
cityD11−64.59255.404−0.019−1.166.244388.833900.0430.0714.318*** 0.000
cityD121640.00761.5510.0422.665*** 0.008492.574100.2790.0794.912*** 0.000
cityD13162.85855.2920.0472.945*** 0.003292.50790.6360.0533.227*** 0.001
cityD14−46.58450.822−0.015−0.9170.359102.19382.8150.0211.2340.217
cityD1514.558510.0900.0050.2850.776−108.53083.428−0.022−1.3010.193
cityD16−201.90295.623−0.030−2.111** 0.035−94.801156.221−0.009−0.6070.544
B_a1D1−34.26035.755−0.020−0.9580.338260.59658.3460.0954.466*** 0.000
B_a1D2125.81146.5300.0752.704*** 0.007−220.31375.932−0.082−2.901*** 0.004
B_a2−0.1823.304−0.002−0.0550.9568.5675.3920.0501.5890.112
B_a31.8883.3470.0110.5640.5731.1675.4620.004.2140.831
B_a420.31811.2920.0311.799* 0.07259.34718.4290.0573.220*** 0.001
B_a5D1−1050.09261.738−0.041−1.702* 0.089−28.303100.742−0.007−0.2810.779
B_a5D2−89.25368.842−0.026−1.2960.19529.479112.3380.0050.2620.793
B_a5D3−144.40655.502−0.086−2.602*** 0.009−184.83090.565−0.068−20.041** 0.041
B_a5D4−101.95458.298−0.049−1.749* 0.080−1480.09195.132−0.044−1.5570.120
B_a5D5−20.27663.213−0.007−0.3210.748−106.717103.150−0.024−10.0350.301
B_a5D6−7.87082.730−0.002−0.0950.92493.1141350.0020.0120.6900.490
B_a5D7239.71996.7520.0392.478** 0.013550.689157.8740.0563.488*** 0.000
B_a628.78810.9790.0452.622*** 0.009−60.36817.921−0.059−3.369*** 0.001
B_a760.87919.8190.06330.072*** 0.002117.79132.3360.0773.643*** 0.000
B_a837.74221.6550.0331.743* 0.08120.89235.3370.0110.5910.554
B_a92.5483.2410.0120.7860.4323.2745.2840.0100.6200.535
H_a2D1−14.78428.296−0.008−0.5220.60134.19046.1760.0110.7400.459
H_a1120.0763.2220.1273.748*** 0.000120.0685.2580.0792.295** 0.022
H_a1′−0.2230.073−0.101−30.070*** 0.002−0.0760.118−0.021−0.6400.522
B_a10D1253.45743.2430.0885.861*** 0.000−242.13970.567−0.052−3.431*** 0.001
B_a11D1−207.91756.601−0.115−3.673*** 0.000−308.71292.132−0.107−3.351*** 0.001
B_a12−20.67513.500−0.051−1.5310.126−40.96521.942−0.063−1.867** 0.062
H_a369.21712.7030.1035.449*** 0.00076.51120.7320.0713.690*** 0.000
H_a47.81618.1990.0070.4290.66831.95729.7000.01810.0760.282
H_a5−33.42819.852−0.030−1.684* 0.092−10.47832.396−0.006−0.3230.746
H_a6D1−54.11558.107−0.014−0.9310.352−54.27494.823−0.009−0.5720.567
H_a7D112.50929.5220.0070.4240.672133.11748.1730.0432.763*** 0.006
H_a89.950150.0090.0130.6630.50723.40024.4930.0180.9550.339
H_a9D118.36928.2850.0110.6490.516−23.15946.154−0.009−0.5020.616
H_a10D18.14936.6830.0050.2220.82488.44259.8610.0331.4770.140
H_a10D20.10954.6410.0000.0020.998187.92689.1640.0322.108** 0.035
H_a10D370.70138.5060.0361.836* 0.066166.15262.8330.0532.644*** 0.008
H_a11D1−82.65446.515−0.030−1.777* 0.076−109.99575.904−0.025−1.4490.147
H_a1244.18690.0600.0994.877*** 0.000180.02614.7880.0251.2190.223
Note. * 90% Confidence interval (p-value < 0.10). ** 95% Confidence interval (p-value < 00.05). *** 99% Confidence interval (p-value < 00.01).
Table A3. Results of deriving elements that affect energy consumption in winter.
Table A3. Results of deriving elements that affect energy consumption in winter.
Winter
Unstandardized CoefficientStandardized Coefficienttp
BStandard Error
item−0.72316.868−0.001−0.0430.966
cityD2−7930.038197.362−0.072−40.018*** 0.000
cityD3−363.774199.398−0.028−1.824* 0.068
cityD4158.162202.8790.0120.7800.436
cityD5−1142.696223.802−0.089−5.106*** 0.000
cityD6−371.881197.825−0.029−1.880* 0.060
cityD7−113.413243.285−0.007−0.4660.641
cityD8179.2691510.0270.0211.1870.235
cityD9738.232214.6390.0583.439*** 0.001
cityD10−581.945213.632−0.045−2.724*** 0.006
cityD11263.793192.8440.0241.3680.171
cityD12118.786218.9150.0090.5430.587
cityD13−546.911195.663−0.049−2.795*** 0.005
cityD14−72.352188.881−0.007−0.3830.702
cityD15−758.888184.372−0.076−4.116*** 0.000
cityD16−1359.103341.562−0.061−3.979*** 0.000
B_a1D1432.979118.1220.0783.666*** 0.000
B_a1D2−761.329153.708−0.140−4.953*** 0.000
B_a222.48610.9150.06520.060** 0.039
B_a3−11.662110.057−0.021−10.0550.292
B_a4105.27337.3450.0502.819*** 0.005
B_a5D1−24.133203.936−0.003−0.1180.906
B_a5D2179.642227.4200.0160.7900.430
B_a5D3−255.944183.316−0.046−1.3960.163
B_a5D4−203.349192.537−0.030−10.0560.291
B_a5D5−143.873208.770−0.016−0.6890.491
B_a5D6−7.363273.2680.000−0.0270.979
B_a5D71141.306319.5470.0573.572*** 0.000
B_a6−86.52336.265−0.041−2.386** 0.017
B_a7256.88265.4250.0823.926*** 0.000
B_a818.17971.5360.0050.2540.799
B_a9340.05210.6770.0503.189*** 0.001
H_a2D1129.34593.4600.0201.3840.166
H_a125.48410.6420.0822.395** 0.017
H_a1’−0.010.240−0.001−0.0420.966
B_a10D1−232.919142.860−0.025−1.6300.103
B_a11D1−13.605176.178−0.002−0.0770.938
B_a1268.36740.6440.0521.682* 0.093
H_a3185.21941.9610.0854.414*** 0.000
H_a4−54.96360.132−0.016−0.9140.361
H_a5560.09865.5790.0150.8550.392
H_a6D16.379191.9590.0010.0330.973
H_a7D1175.73597.5600.0281.801* 0.072
H_a8103.51949.4910.04020.092** 0.037
H_a9D1−172.52993.428−0.031−1.847* 0.065
H_a10D164.481121.1690.0120.5320.595
H_a10D2151.951180.5180.0130.8420.400
H_a10D3371.304127.2000.0582.919*** 0.004
H_a11D1−94.230153.444−0.010−0.6140.539
H_a1277.45229.9310.0532.588** 0.010
Note. * 90% Confidence interval (p-value < 0.10). ** 95% Confidence interval (p-value < 00.05). *** 99% Confidence interval (p-value < 00.01).

References

  1. U.S. Energy Information Administration. Annual Energy Outlook 2019; Government Printing Office: Washington, DC, USA, 2019. [Google Scholar]
  2. Korea Energy Economics Institute. 2018 Energy Info. Korea; Yong-Sung Cho: Seoul, Korea, 2018. [Google Scholar]
  3. Santamouris, M. Cooling the buildings–past, present and future. Energy Build. 2016, 128, 617–638. [Google Scholar] [CrossRef]
  4. Kim, M.-K. An Estimation Model of Residential Building Electricity Consumption in Seoul. Seoul Stud. 2013, 14, 179–192. [Google Scholar]
  5. Kim, Y.-L.; Hong, W.-H.; Seo, Y.-K.; Jeon, G.-Y. A Study on the Electricity Consumption Propensity by Household Members in Apartment Houses. J. Korean Hous. Assoc. 2011, 22, 43–50. [Google Scholar] [CrossRef] [Green Version]
  6. Eum, M.-R.; Hong, W.-H.; Lee, J.-A. Deriving Factors Affecting Energy Usage for Improving Apartment Energy Consumption Evaluation. J. Archit. Inst. Korea Struct. Constr. 2018, 34, 27–34. [Google Scholar]
  7. Lee, K.-H.; Chae, C.-U. Estimation Model of the Energy Consumption under the Building Exterior Conditions in the Apartment Housing—Focused on the Maintenance Stage. J. Archit. Inst. Korea Plan. Des. 2008, 24, 85–92. [Google Scholar]
  8. Van den Brom, P.; Meijer, A.; Visscher, H. Performance gaps in energy consumption: Household groups and building characteristics. Build. Res. Inf. 2018, 46, 54–70. [Google Scholar] [CrossRef] [Green Version]
  9. Schipper, L.; Bartlett, S.; Hawk, D.; Vine, E. Linking life-styles and energy use: A matter of time? Annu. Rev. Energy 1989, 14, 273–320. [Google Scholar] [CrossRef]
  10. Noh, S.C.; Lee, H.Y. An analysis of the factors affecting the energy consumption of the household in Korea. J. Korea Plan. Assoc. 2013, 48, 295–312. [Google Scholar]
  11. Jung, J.; Yi, C.; Lee, S. An integrative analysis of the factors affecting the household energy consumption in Seoul. J. Korea Plan. Assoc. 2015, 50, 75. [Google Scholar] [CrossRef]
  12. Kim, K.; An, Y.; Seungil, L. Analysis of influencing factors of building and urban planning on building energy consumption considering income gap-focused on electricity consumption on August in Seoul. J. Korea Plan. Assoc. 2017, 52, 253–267. [Google Scholar]
  13. Neto, A.H.; Fiorelli, F.A.S. Comparison between detailed model simulation and artificial neural network for forecasting building energy consumption. Energy Build. 2008, 40, 2169–2176. [Google Scholar] [CrossRef]
  14. Seo, H.-C.; Hong, W.-H.; Nam, G.-M. Characteristics of Electric-Power Use in Residential Building by Family Composition and Their Income Level. J. Korean Hous. Assoc. 2012, 23, 31–38. [Google Scholar] [CrossRef]
  15. Lee, S.; Jung, S.; Lee, J. Prediction model based on an artificial neural network for user-based building energy consumption in South Korea. Energies 2019, 12, 608. [Google Scholar] [CrossRef] [Green Version]
  16. Domestic Climate Data. Available online: http://www.weather.go.kr/weather/climate/average_south.jsp (accessed on 5 October 2019).
  17. Benítez, J.M.; Castro, J.L.; Requena, I. Are artificial neural networks black boxes? IEEE Trans. Neural Netw. 1997, 8, 1156–1164. [Google Scholar] [CrossRef]
  18. McCulloch, W.S.; Pitts, W. A logical calculus of the ideas immanent in nervous activity. Bull. Math. Biol. 1990, 52, 99–115. [Google Scholar] [CrossRef]
  19. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
  20. Salakhutdinov, R.; Mnih, A.; Hinton, G. Restricted Boltzmann machines for collaborative filtering. In Proceedings of the 24th International Conference on Machine Learning, Corvalis, OR, USA, 20–24 June 2007; ACM: New York, NY, USA, 2007; pp. 791–798. [Google Scholar]
  21. Huang, W.; Foo, S. Neural network modeling of salinity variation in Apalachicola River. Water Res. 2002, 36, 356–362. [Google Scholar] [CrossRef]
  22. Wahid, F.; Kim, D.H. Short-term energy consumption prediction in korean residential buildings using optimized multi-layer perceptron. Kuwait J. Sci. 2017, 44, 67–77. [Google Scholar]
  23. Akhtar, M.; Corzo, G.; Van Andel, S.; Jonoski, A.; Sciences, E.S. River flow forecasting with artificial neural networks using satellite observed precipitation pre-processed with flow length and travel time information: Case study of the Ganges river basin. Hydrology 2009, 13, 1607–1618. [Google Scholar]
  24. Park, J.-A.; Kim, G.-S. Estimation of spatial distribution of soil moisture at Yongdam dam watershed using artificial neural networks. J. Korean Geogr. Soc. 2011, 46, 319–330. [Google Scholar]
  25. Pan, D.; Chan, M.; Deng, S.; Lin, Z.; Energy, J.A. The effects of external wall insulation thickness on annual cooling and heating energy uses under different climates. Appl. Energy 2012, 97, 313–318. [Google Scholar] [CrossRef]
  26. Suh, D.; Chang, S. A heuristic rule-based passive design decision model for reducing heating energy consumption of Korean apartment buildings. Energies 2014, 7, 6897–6929. [Google Scholar] [CrossRef] [Green Version]
  27. Yeo, M.-S.; Yang, I.-H.; Kim, K.-W. Historical changes and recent energy saving potential of residential heating in Korea. Energy Build. 2003, 35, 715–727. [Google Scholar] [CrossRef]
Figure 1. Research process.
Figure 1. Research process.
Sustainability 12 00109 g001
Figure 2. Monthly average temperature distribution in 16 regions of South Korea; (a) 2016; (b) 2017.
Figure 2. Monthly average temperature distribution in 16 regions of South Korea; (a) 2016; (b) 2017.
Sustainability 12 00109 g002
Figure 3. Household energy consumption distribution by season; (a) spring; (b) summer; (c) fall; (d) winter.
Figure 3. Household energy consumption distribution by season; (a) spring; (b) summer; (c) fall; (d) winter.
Sustainability 12 00109 g003aSustainability 12 00109 g003b
Figure 4. Artificial neural network (ANN) model structure.
Figure 4. Artificial neural network (ANN) model structure.
Sustainability 12 00109 g004
Figure 5. Structure of a seasonal energy consumption prediction model.
Figure 5. Structure of a seasonal energy consumption prediction model.
Sustainability 12 00109 g005
Figure 6. Seasonal energy consumption according to the number of exterior walls: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Figure 6. Seasonal energy consumption according to the number of exterior walls: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Sustainability 12 00109 g006
Figure 7. Seasonal energy consumption according to building direction: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Figure 7. Seasonal energy consumption according to building direction: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Sustainability 12 00109 g007aSustainability 12 00109 g007b
Figure 8. Seasonal energy consumption according to building area: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Figure 8. Seasonal energy consumption according to building area: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Sustainability 12 00109 g008
Figure 9. Seasonal energy consumption according to the number of years occupied: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Figure 9. Seasonal energy consumption according to the number of years occupied: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Sustainability 12 00109 g009
Figure 10. Seasonal energy consumption according to the number of household members: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Figure 10. Seasonal energy consumption according to the number of household members: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Sustainability 12 00109 g010aSustainability 12 00109 g010b
Figure 11. Seasonal energy consumption according to the occupation of household head: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Figure 11. Seasonal energy consumption according to the occupation of household head: (a) annual; (b) spring; (c) summer; (d) fall; (e) winter.
Sustainability 12 00109 g011aSustainability 12 00109 g011b
Figure 12. Annual energy consumption prediction model (layer: 1, node: 70); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Figure 12. Annual energy consumption prediction model (layer: 1, node: 70); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Sustainability 12 00109 g012
Figure 13. Spring energy consumption prediction model (layer: 2, node: 24); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Figure 13. Spring energy consumption prediction model (layer: 2, node: 24); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Sustainability 12 00109 g013
Figure 14. Summer energy consumption prediction model (layer: 3, node: 16); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Figure 14. Summer energy consumption prediction model (layer: 3, node: 16); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Sustainability 12 00109 g014
Figure 15. Fall energy consumption prediction model (layer: 1, node: 47); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Figure 15. Fall energy consumption prediction model (layer: 1, node: 47); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Sustainability 12 00109 g015
Figure 16. Winter energy consumption prediction model (layer: 2, node: 24); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Figure 16. Winter energy consumption prediction model (layer: 2, node: 24); (a) Training data regression result; (b) Validation data regression result; (c) Test data regression result.
Sustainability 12 00109 g016
Table 1. Data construction, descriptive statistics, and variance inflation factor (VIF) analysis results.
Table 1. Data construction, descriptive statistics, and variance inflation factor (VIF) analysis results.
CodeVariableCategoryMeanStandard DeviationNCommon DifferenceVIF
cityCity11Seoul *
21Pusan00.06470.2460949430.7031.423
22Daegu00.04750.2128249430.7781.285
23Incheon00.04770.2132549430.7741.292
24Gwangju00.04730.2123949430.7561.322
25Daejun00.04710.2119549430.7911.264
26Ulsan00.03240.1770049430.8181.222
31Kyunggi0.11070.3137449430.6281.593
32Kangwon00.04750.2128249430.7331.364
33Chungbuk00.04530.2080249430.7441.345
34Chungnam00.06370.2442949430.6671.500
35Jeonbuk00.04730.2123949430.7141.402
36Jeonnam00.06270.2424749430.6761.479
37Kyungbuk00.08030.2718149430.6361.573
38Kyungnam00.08070.2724349430.6261.598
39Jeju00.01520.1222549430.8791.137
B_a1Housing type1Detached0.38620.4869349430.4002.500
2Apartment0.45740.4982349430.2264.433
3Others *
B_a2Number of floorsNumeric8.227.80549430.1825.485
B_a3Floor numberNumeric4.344.94349430.4432.258
B_a4Number of exterior wallsNumeric4.331.27849430.5821.718
B_a5Housing direction1East0.11940.3242549430.3033.305
2West00.06410.2450149430.4262.346
3South0.42000.4936149430.1626.189
4North *
5Southeast0.19930.3994949430.2244.473
6Southwest0.10340.3044849430.32730.055
7Northeast00.02990.1704449430.6101.640
8Northwest00.01900.1366049430.6941.440
B_a6Construction yearNumeric3.821.29849430.5971.674
B_a7Housing areaNumeric3.170.86749430.4112.432
B_a8Number of bedrooms (rooms)Numeric2.720.72849430.48820.050
B_a9Number of exterior wall windowsNumeric8.113.96149430.7371.357
B_a10Main heating method1Individual0.90920.2874049430.7851.274
2Central heating *
B_a11Cooling method1Air conditioner0.30450.4602349430.1805.558
2No air conditioner *
B_a12Air conditioner set temperatureNumeric3.1320.04949430.1606.239
H_a1Number of years occupiedNumeric10.918.72249430.1546.513
H_a2Housing ownership1Owned0.75890.4278249430.8271.209
2Not owned *
H_a3Number of household membersNumeric2.971.24249430.48720.054
H_a4Number of economically active household membersNumeric1.450.76749430.6221.608
H_a5Number of household members aged 65 or olderNumeric0.470.74349430.5571.795
H_a6Composition of household members1Children00.05000.2179049430.7561.322
2No children *
H_a7Gender of household head1Male0.74990.4330949430.7421.348
2Female *
H_a8Age of household headNumeric3.7410.05249430.48620.056
H_a9Education level of household head1High school graduate or below0.58550.4926949430.6241.602
2University graduate or above *
H_a10Occupation of household head1Regular employee0.49650.5000449430.3602.775
2Temporary employee00.05580.2296349430.7701.298
3Owner operator0.23200.4221849430.4592.179
4Etc.
H_a11Unusual features of household1Unusual features00.09810.2975049430.6331.579
2General *
H_a12Annual gross incomeNumeric3.671.86249430.4262.347
Table 2. Model explanatory power and suitability analysis results.
Table 2. Model explanatory power and suitability analysis results.
RR2Revised R2Estimated Standard ErrorDurbin–WatsonFSignificant Probability
Annual0.3460.1200.1115284.3551.664130.08400.000
Spring0.3540.1250.1161643.5101.674140.03100.000
Summer0.3770.1420.1347740.0961.74916.25100.000
Fall0.3320.1100.1011263.1901.661120.08100.000
Winter0.3510.1230.11425,570.0531.67413.77800.000
Table 3. List of significant variables for seasonal residential building energy consumption.
Table 3. List of significant variables for seasonal residential building energy consumption.
SectionCodeVariableSpringSummerFallWinterAnnual
TempTemperatureO O
cityCityOOOOO
Building factors (12)B_a1Housing type OOOO
B_a2Number of floors O
B_a3Floor number
B_a4Number of exterior wallsOOOOO
B_a5Housing directionOOOOO
B_a6Construction yearOOOO
B_a7Housing areaOOOOO
B_a8Number of bedrooms (rooms) O
B_a9Number of exterior wall windows OO
B_a10Main heating methodOOO
B_a11Cooling method OO O
B_a12Air conditioner set temperature OO
User features (12)H_a1Number of years occupiedOOOOO
H_a2Housing ownership
H_a3Number of household membersOOOOO
H_a4Number of economically active household members
H_a5Number of household members aged 65 or older O
H_a6Composition of household members O
H_a7Gender OOO
H_a8Age of household head O
H_a9Education level of household head O
H_a10Occupation of household headOOOOO
H_a11Unusual features of household O O
H_a12Annual gross incomeOO OO
Total1115151712
Table 4. Information on the use of data.
Table 4. Information on the use of data.
No. of SamplesTrainingValidationTesting
494370%15%15%
3461741741
Table 5. Performances of seasonal energy consumption prediction models according to the hidden layers.
Table 5. Performances of seasonal energy consumption prediction models according to the hidden layers.
PeriodMinMaxLayerNeuronR Value of TrainingR Value of ValidationR Value of TestMSETerminated Epoch
Annual42851700.2570.3030.278 10 7 × 2.525012th
2350.3510.2790.306 10 7 × 2.75455th
3210.4190.2990.312 10 7 × 3.46745th
4110.3610.2880.262 10 7 × 2.66494th
5130.4270.2700.272 10 7 × 2.80777th
6140.5350.3480.256 10 7 × 3.143612th
Spring38771470.3630.2960.235 10 6 × 2.633914th
2240.3910.2840.279 10 6 × 2.59815th
580.4770.3120.246 10 6 × 2.73007th
6120.4570.2560.331 10 6 × 2.73044th
760.3760.2830.324 10 6 × 2.70688th
870.4040.2780.317 10 6 × 2.61939th
Summer46931580.3220.3150.283 10 5 × 5.315712th
2230.5430.2640.243 10 5 × 7.93805th
3160.3820.3180.270 10 5 × 5.10607th
4170.3820.2470.213 10 5 × 6.12922nd
5100.4500.3890.353 10 5 × 5.43418th
680.4420.3420.372 10 5 × 5.98186th
Fall47951470.3360.2780.217 10 6 × 1.431827th
2300.1520.1580.131 10 6 × 1.605419th
3160.3990.2730.201 10 6 × 1.65713rd
4170.2880.2080.167 10 6 × 1.80193rd
5100.4230.2480.293 10 6 × 1.61334th
6150.5650.2440.316 10 6 × 1.80637th
Winter48971840.3360.2870.261 10 6 × 6.173115th
2240.2870.2340.212 10 6 × 5.76028th
3240.7320.3590.336 10 6 × 5.95312nd
4180.4250.2930.256 10 6 × 5.906011th
6160.4020.1820.241 10 6 × 7.22933rd
7130.4850.2520.293 10 6 × 6.31344th
Table 6. Performances of seasonal energy consumption prediction models according to the input data type.
Table 6. Performances of seasonal energy consumption prediction models according to the input data type.
PeriodInput TypeLayerNeuronR Value of TrainingR Value of ValidationR Value of TestMSETerminated Epoch
AnnualA1700.2570.3030.278 10 7 × 2.525012th
B1700.3270.2380.262 10 7 × 3.16017th
C1700.3200.2150.270 10 7 × 402424th
SpringA2240.3910.2840.279 10 6 × 2.59815th
B224
C2240.7240.3630.297 10 6 × 3.24169th
SummerA3160.3820.3180.270 10 5 x 5.10607th
B316
C3160.6290.2790.339 10 5 × 7.69976th
FallA1470.3360.2780.217 10 6 × 1.431827th
B1470.2600.1680.297 10 6 × 1.506115th
C1470.3740.3260.245 10 6 × 1.647615th
WinterA2240.2870.2340.212 10 6 × 5.76028th
B2240.5880.3750.220 10 6 × 6.40054th
C2240.4770.2870.318 10 6 × 6.38536th

Share and Cite

MDPI and ACS Style

Kim, M.; Jung, S.; Kang, J.-w. Artificial Neural Network-Based Residential Energy Consumption Prediction Models Considering Residential Building Information and User Features in South Korea. Sustainability 2020, 12, 109. https://doi.org/10.3390/su12010109

AMA Style

Kim M, Jung S, Kang J-w. Artificial Neural Network-Based Residential Energy Consumption Prediction Models Considering Residential Building Information and User Features in South Korea. Sustainability. 2020; 12(1):109. https://doi.org/10.3390/su12010109

Chicago/Turabian Style

Kim, Mansu, Sungwon Jung, and Joo-won Kang. 2020. "Artificial Neural Network-Based Residential Energy Consumption Prediction Models Considering Residential Building Information and User Features in South Korea" Sustainability 12, no. 1: 109. https://doi.org/10.3390/su12010109

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop