Next Article in Journal
A Review on the Aging Behavior of BADGE-Based Epoxy Resin
Previous Article in Journal
Study on the Typological Chronology of Cantonese Palatial-Type (Diantang Shi) Timber Structures in Guangdong Province, China: An Interdisciplinary Approach of Building Archaeology
Previous Article in Special Issue
The Application and Optimization of a New Tunnel Ventilation Method for the Control Room of Electric Submersible Pump Systems on Jack-Up Offshore Platforms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comprehensive Assessment of Indoor Thermal in Vernacular Building Using Machine Learning Model with GAN-Based Data Imputation: A Case of Aceh Region, Indonesia

by
Muslimsyah Muslimsyah
1,*,
Safwan Safwan
1 and
Andri Novandri
2
1
Department of Architecture and Planning, Universitas Syiah Kuala, Banda Aceh 23111, Indonesia
2
Department of Computer Engineering, Universitas Serambi Mekkah, Banda Aceh 23249, Indonesia
*
Author to whom correspondence should be addressed.
Buildings 2025, 15(14), 2448; https://doi.org/10.3390/buildings15142448
Submission received: 17 May 2025 / Revised: 27 June 2025 / Accepted: 28 June 2025 / Published: 11 July 2025
(This article belongs to the Special Issue Thermal Environment in Buildings: Innovations and Safety Perspectives)

Abstract

This study introduces a predictive model for estimating indoor room temperatures in vernacular building using external environmental factors such as air temperature, humidity, sunshine duration, and wind speed. The dataset was sourced from the Meteorology, Climatology, and Geophysics Agency and supplemented with direct measurements collected from four rooms within a vernacular building in Aceh Province, Indonesia. A Generative Adversarial Network (GAN)-based imputation technique was implemented to address missing data during preprocessing. The prediction model adopts a hybrid framework that integrates Multiple Linear Regression (MLR) and Artificial Neural Networks (ANNs), with both models optimized using Support Vector Regression (SVR) to better capture the nonlinear dynamics between inputs and outputs. The evaluation results show that the ANN-SVR model achieved the lowest average M A E ¯ and R M S E ¯ values, at 0.164 and 0.218, respectively, and the highest average R ¯ and R 2 ¯ values, at 0.785 and 0.618. Evaluation results indicate that the ANN-SVR model consistently achieved the lowest error rates and the highest correlation coefficients across all four rooms, identifying it as the most effective model for forecasting indoor thermal conditions. These results validate the combined use of ANN-SVR for prediction and GAN for preprocessing as a powerful strategy to enhance data quality and model performance. The findings offer a scientific basis for architectural planning to improve thermal comfort in vernacular buildings such as the Rumoh Aceh.

1. Introduction

A vernacular building is a traditional house designed with an architectural style that evolves naturally, adapting to a particular region’s climatic characteristics, culture, and available resources, and is typically constructed by local craftsmen [1,2]. One of the prominent features of vernacular architecture is the use of natural materials readily available from the surrounding environment, such as wood for walls and floors and rumbia leaves for roofing [3,4]. These materials reflect local wisdom and contribute to passive thermal comfort due to their good heat insulation properties [5]. The structure of a vernacular building is adapted to the local climate and geographical conditions, commonly taking the form of an elevated stilt house with ventilation holes in the walls and roof to support air circulation [6]. This design reflects local wisdom and serves the community’s social, cultural, and spiritual functions while enhancing effective thermal quality [7,8]. The shape and orientation of the building are typically designed to maximize the natural benefits of the surrounding environment [9]. In addition, the elevation of the building allows for air circulation beneath the floor, facilitating airflow within the interior spaces [10,11]. This condition naturally helps reduce heat buildup and keeps indoor temperatures cool [12,13]. This cooling effect is supported by ventilation that allows continuous airflow in and out of the building [14]. Moreover, the wooden materials used also play an essential role in maintaining thermal conditions due to their ability to absorb heat during the day and release it gradually at night, contributing to a more stable indoor temperature [15,16]. The insulating thermal properties of wood make it effective in dampening temperature fluctuations from the environment [17]. In addition, wood is a natural, renewable material with a lower carbon footprint than modern materials [18,19]. The orientation of the building, adjusted according to wind direction and sunlight exposure, is also part of the strategy to improve thermal quality [20,21]. By orienting the ventilation toward the prevailing wind direction, the building can help reduce indoor temperatures [22]. One example of a vernacular building is the Rumoh Aceh in Aceh Province, Indonesia [23,24].
Thermal performance in vernacular building plays a vital role in assessing how effectively these buildings can ensure a comfortable indoor environment. Air temperature, humidity, wind speed, and sunlight exposure significantly influence a building’s thermal behavior. Numerous studies have explored these aspects in vernacular structures across tropical regions. For instance, the study in [25] examined traditional Malay houses in Penang, Malaysia, and found that their well-designed natural ventilation systems resulted in lower indoor air temperature and humidity than outdoor conditions despite reduced indoor wind speed. Similarly, research in [26] evaluated the thermal characteristics of vernacular buildings in a hot and humid climate in Bushehr, Iran. It concluded that materials with low thermal conductivity like wood improved indoor thermal quality. Wood, known for its insulating properties, contributed to maintaining stable indoor temperatures. In another case, ref. [27] investigated traditional buildings in Yazd, Iran, and discovered that their thermal quality was relatively poor, requiring mechanical ventilation to enhance comfort. These findings underscore the importance of architectural design, choice of materials, and ventilation systems in shaping the thermal performance of vernacular buildings. Predictive thermal models are essential to better assess and improve thermal quality.
Thermal prediction models are designed to estimate indoor temperature conditions based on environmental factors and building attributes [28,29]. This model makes predictions to enhance occupant comfort, reduce energy consumption, and design more optimal ventilation and cooling systems, particularly in buildings [30,31]. Their primary function is to simulate and forecast indoor thermal behavior by analyzing various external variables and structural features. Generally, these models fall into two categories: physical models and data-driven models. Physical models rely on thermodynamic and heat transfer equations to replicate thermal conditions but require detailed architectural and thermal input data [32,33]. Although physical models are instrumental in the technical design of modern buildings, their application in the study of vernacular buildings is limited due to the requirement for detailed input variables [34]. Conversely, data-driven models utilize historical data to identify patterns between environmental parameters and indoor temperatures, offering greater flexibility and efficiency [35,36]. This model does not require detailed technical information about the building structure, making it easier to apply to traditional buildings with minimal architectural documentation [37]. Many studies have explored data-driven approaches to thermal prediction. For example, [38] used an Adaptive Neuro-Fuzzy Inference System (ANFIS) to predict air temperature in a livestock barn based on six days of environmental data, enabling optimized ventilation control for thermal stability and animal well-being. The model performance results show that the R M S E , M A P E , and R 2 values are 0.182, 0.548%, and 0.997, respectively. In paper [39], a Nonlinear Autoregressive with External Input (NARX) model was developed to forecast indoor temperatures in a commercial building, aiming to enhance energy efficiency without compromising thermal comfort. The statistical results show that the M S E and R M S E values are 0.083 and 0.837%, respectively. Additionally, [40] evaluated the effectiveness of various machine learning methods, such as Autoregressive Exogenous (ARX), Robust Multiple Linear Regression (RMLR), Extreme Learning Machine (ELM), Multilayer Perceptron (MLP), and NARX, combined with traditional approaches to predicting room temperatures in different parts of buildings. The study demonstrated that these predictive models support energy management and contribute to designing more thermally efficient and comfortable indoor environments. The numerical results indicate that the MLP-NARX method is the most accurate indoor temperature prediction, with an average annual M A E of 0.111. This method outperforms the others due to its ability to capture nonlinear patterns and daily periodicity. The ARX method ranks second with an M A E of 0.171, followed by RMLR at 0.207, MLP at 0.242, and ELM at 0.286. Consequently, developing data-driven thermal prediction models is a valuable strategy for optimizing building design and enhancing indoor thermal comfort.
Multiple Linear Regression (MLR) is a traditional statistical technique that models the linear relationship between a single dependent variable and multiple independent variables. MLR has been applied in thermal condition prediction, as demonstrated in [41], where an MLR model was developed to estimate indoor temperature in a mobile container using inputs such as global solar irradiance, ambient temperature, and wind speed. The model achieved relatively high accuracy, reporting an error rate of 7.1%. A key advantage of MLR is its strong interpretability; however, it struggles to capture nonlinear relationships. Artificial Neural Networks (ANNs) have been introduced to address this limitation, as they can learn complex, nonlinear patterns. A related study in [42] explored using ANN models to predict indoor thermal conditions, including air temperature, humidity, and wind speed, based on building orientation and geographical location. The results of this study indicate that the ANN-based predictive model produced error levels ranging from 0.03 to 1.05, demonstrating the effectiveness of this method in supporting building design oriented toward thermal comfort. Support Vector Regression (SVR), an extension of the Support Vector Machine (SVM) framework tailored for regression problems, has also been investigated in this context. SVR works by finding a predictive function that tolerates minor deviations and can handle nonlinear relationships using kernel functions. In paper [43], SVR was applied to predict air temperature inside a greenhouse. This approach aimed to address the complexity of the greenhouse temperature system, which is both nonlinear and time-varying. The study results show that the SVR model achieved the best performance, with a maximum relative error of 1.73%, an M S E of 0.00019, and an R 2 of 0.99945. In comparison, the Backpropagation Neural Network (BPNN) model exhibited a higher error of up to 16.23%, an M S E of 0.019564, and an R 2 of 0.96425. These findings confirm that the proposed SVR model is more accurate, stable, and effective in representing the actual temperature conditions in the greenhouse. Furthermore, the Generative Adversarial Network (GAN) is a data preprocessing technique capable of generating realistic data that closely resembles the original data, primarily imputing missing data. In paper [44], GAN was used to enhance the quality of mammographic images through data augmentation. By applying GAN, the dataset used for modeling became more representative. The results of this study demonstrate that applying GAN in the data preprocessing stage significantly enriches the dataset, thereby increasing the prediction model’s accuracy to 100%, compared to only 57.5% without augmentation. Paper [45] also highlights the critical role of GAN in data augmentation. The study used GAN to generate synthetic data to enrich the existing dataset. The study results indicate that applying GAN data augmentation improved the model’s accuracy from 76.1% to 88.5%. This approach not only successfully addressed data imbalance issues but also enhanced the generalization of the model. In addition, GAN offers advantages over simpler methods such as K-Nearest Neighbor (KNN) or mean substitution. In the mean substitution method, missing data is replaced with a fixed value, which can eliminate natural variability and alter the data distribution. Meanwhile, the KNN method tends to be less effective when dealing with high-dimensional data and requires considerable computational resources [46,47]. In contrast to these approaches, GAN can learn and understand the data distribution patterns more deeply through the adversarial mechanism between two networks: the generator and the discriminator. Through this competitive training process, GAN can generate synthetic data that closely resembles the original data, making it more effective in imputing missing data by the actual data distribution characteristics [48].
This study presents a hybrid method for developing an indoor temperature prediction model tailored to the traditional Rumoh Aceh building, using external environmental variables such as temperature, humidity, sunshine duration, and wind speed. The dataset combines ecological data sourced from the Meteorology, Climatology, and Geophysics Agency with sample measurements taken from four rooms within Rumoh Aceh structures located in Aceh Province, Indonesia. To ensure the dataset’s quality, a data preprocessing stage was conducted using the GAN method as an imputation technique to address the issue of missing data. This process generated synthetic data that closely resembled the original. Next, two primary predictive models, MLR and ANN, were applied, with each model further optimized using the SVR approach. The MLR algorithm was chosen due to its ease of interpreting regression coefficients. It allows the contribution of each variable to room temperature to be directly understood, although only for linear relationships. The ANN algorithm was then adopted to capture complex nonlinear patterns that MLR cannot address. Finally, the SVR approach was applied to reduce the risk of overfitting in models built from limited datasets. The MLR-SVR model combines the interpretability of MLR with the generalization capability of SVR. In contrast, the ANN-SVR model integrates the nonlinear modeling flexibility of ANN with SVR’s predictive precision. The evaluation used statistical metrics to determine the best-performing model with high accuracy, ensuring applicability in real-world conditions. The findings of this study present a practical approach for evaluating thermal performance in vernacular buildings and support architectural planning in designing more comfortable living environments. The main contributions of this paper are as follows:
  • We propose a study to support passive thermal comfort in the Rumoh Aceh vernacular building by developing a predictive model to estimate indoor thermal conditions based on external environmental variables such as air temperature, humidity, sunshine duration, and wind speed.
  • We introduce the GAN method in the data preprocessing stage to impute missing data, generating synthetic data that closely resembles the original. This technique enriches the dataset and improves model generalization.
  • We propose two hybrid approaches: MLR optimized with SVR and ANN optimized with SVR. The goal is to compare model performance and determine the most accurate model for predicting indoor temperature in Rumoh Aceh based on external environmental parameters.

2. Materials and Methods

The simulation model developed in this study was designed using a computer with AMD Ryzen 5 4600H, 16 GB DDR4 RAM, and NVIDIA GeForce GTX 1650 Ti with 4 GB of memory. The software used included Python 3.10 with TensorFlow version 2.12.0, which was utilized for computation, analysis, and data visualization.

2.1. Data Acquisition

Rumoh Aceh is a vernacular building in the Aceh Province of Indonesia, characterized by its distinctive architecture designed to suit the region’s natural conditions and climate. These houses are built on raised platforms supported by wooden pillars, a structural approach that helps protect against flooding and enhances stability in earthquake-prone areas. The main framework is constructed using strong wood for durability. Key architectural features such as steeply sloped roofs and strategically placed ventilation openings support adequate airflow and help minimize indoor heat accumulation. The location of the Rumoh Aceh studied in this paper is in Banda Aceh, Indonesia, precisely at the coordinates 5°31′00.5″ N 95°16′18.1″ E, as shown in Figure 1. Meanwhile, Figure 2 presents a graph illustrating weather condition variations over the past three years in the study area.
This research incorporates meteorological data, including temperature, humidity, sunshine duration, and wind speed. The data were sourced from the Meteorology, Climatology, and Geophysics Agency of Aceh Province. Table 1 presents the specifications of the measuring instruments used. Data were recorded hourly throughout the day [49,50]. The probability distribution for each input parameter is assumed to follow a normal distribution. This assumption is based on the general characteristics of empirical data obtained from the Meteorology, Climatology, and Geophysics Agency over a long period. It is further supported by measurement data collected directly from four rooms in the Rumoh Aceh building.
Meanwhile, indoor air temperature measurements in the Rumoh Aceh building were carried out by the ASTM C-1046 standard [51]. Measurement device placement was strategically determined by analyzing heat flow directions to ensure that the data accurately reflected the building’s thermal environment. Each measurement location was required to exhibit one-dimensional heat flow to minimize errors from multidimensional heat transfer. Four rooms were selected for analysis: the lobby, a bedroom, and the left and right backrooms. Data collected from these areas served as representative samples for assessing the overall thermal distribution throughout the building. The layout for sensor placement is illustrated in Figure 3. The sensors were positioned in each corner of the rooms to achieve balanced data collection. They were placed away from ventilation openings to avoid interference from hot airflow that might compromise measurement accuracy [52,53]. Each sensor was mounted approximately 1.5 m above the floor, aligning with the typical human comfort zone. This positioning was chosen to ensure high-quality data by capturing evenly distributed heat, accurate thermal conditions, and dependable airflow measurements [54,55].
This paper’s modeling used six input variables and one output variable. This modeling aims to predict the indoor temperature in four different rooms. The input variables include minimum temperature ( T m i n ), maximum temperature ( T m a x ), average temperature ( T a v g ), average humidity ( H a v g ) from the surrounding environment, sunshine duration ( S t i m e ), and wind speed ( W s p e e d ). Meanwhile, the output variable is the indoor temperature ( T r o o m ) within each of the four rooms within the building. Further details regarding the input and output parameters are presented in Table 2.

2.2. Data Preprocessing

Data preprocessing is the initial stage in the data analysis and modeling process. One of the main challenges in this stage is missing data, which refers to incomplete entries in a dataset. Missing data can reduce model accuracy and performance if not properly handled [56,57]. One approach that can impute missing data is the Generative Adversarial Network (GAN) method, a machine learning architecture capable of generating missing estimates that closely resemble the original data distribution [58]. In the context of missing data reconstruction, the GAN modeling process begins by forming two neural network models: the generator ( G ) and the discriminator ( D ). The generator is responsible for producing estimates of the missing data based on the observed data, while the discriminator functions to distinguish between real data and imputed data. GAN is trained so that the generator produces data that is realistic enough to be indistinguishable from the original data. This process is carried out iteratively, where the generator and discriminator are trained simultaneously using the min-max optimization principle. Mathematically, the objective function of GAN is expressed as follows [59,60]:
min G max D V D , G = E x ~ p d a t a x log D x + E z ~ p z z log 1 D G z
where V D , G represents the value function that defines the training objective between the generator and discriminator, E x ~ p d a t a x is the expectation over variable x sampled from the original data distribution p d a t a of the dataset, E z ~ p z z is the expectation over variable z drawn from a random data distribution p z used as input to the generator, G z denotes the synthetic data generated by the generator, and D x represents the probability assigned by the discriminator. The generator’s input is not solely random data but rather a combination of observed data from the missing data entries and random noise z , resulting in an output x r e c o n representing the fully reconstructed data. The discriminator then evaluates whether the obtained data originates from the original distribution or is generated through imputation. A reconstruction loss function is added to the objective function to ensure the training process effectively reconstructs missing data. This component measures how accurately the imputed data matches the original, expected values. The reconstruction loss function is formulated as follows [61]:
L r e c o n = E x t r u e , x r e c o n M x t r u e x r e c o n 2
where E x t r u e , x r e c o n denotes the expectation over all pairs of original data x t r u e and imputed results x r e c o n , and M represents the masking matrix that indicates the positions of the missing data elements. After the GAN training process and the model’s convergence, the generator can fill in the missing data entries with estimated values that follow the original data distribution.

2.3. Algorithm

This study introduces predictive models utilizing MLR and ANN optimized through SVR. These models are designed to forecast indoor temperature using available environmental parameters. The SVR-based optimization process aims to improve model performance by effectively capturing the relationship between input and output variables.

2.3.1. Multiple Regression Linear

MLR is a statistical technique that models the linear relationship between a single dependent variable and two or more independent variables. It expands upon simple linear regression, involving only one predictor variable [62]. The primary goal of MLR is to estimate the dependent variable using a linear combination of the independent variables while also assessing the individual impact of each predictor on the outcome [63,64]. MLR is widely applied across disciplines such as economics, engineering, social sciences, and environmental studies because it effectively accommodates multiple influencing factors in analyzing complex systems. The standard mathematical representation of the MLR model is given as follows [65]:
y = a 0 + a 1 x 1 + a 2 x 2 + + a n x n + ε
where y denotes the dependent variable, while x 1 ,   x 2 ,   , x n represent the independent variables. The term a 0 is the intercept constant, and a 1 ,   a 2 ,   , a n are the regression coefficients that quantify the impact of each independent variable on the dependent variable. The error term ε accounts for the discrepancy between the actual value and the model’s predicted value. MLR aims to estimate the parameters a 1 ,   a 2 ,   , a n such that the difference between observed and predicted values is minimized. The most widely used technique for estimating these parameters is Ordinary Least Squares (OLS), which minimizes the Residual Sum of Squares (RSS), as defined below [65]:
min a i = 0 n y i a 0 j = 0 n a j x i j 2
After determining the parameter values, the MLR model can be applied to estimate the dependent variable using new input values for the independent variables.

2.3.2. Artificial Neural Network

ANN is an approach in machine learning that mimics the way biological neural networks process information and make decisions. ANN is adaptive, as it can adjust weight values during training to learn complex input–output relationships, including nonlinear ones [66,67,68]. Each node receives inputs from the previous layer’s nodes in its computational process. These inputs are multiplied by their respective weights, summed up, and added to a bias term. The result of this operation, known as the weighted sum, is then passed through an activation function to produce the node’s output. This activation function is crucial in introducing nonlinearity into the network, allowing it to learn complex patterns. Mathematically, the output of a node can be formulated as follows [69,70]:
z = i = 1 n w i . x i + b
y = f z
where x i denotes the input value at node i , w i is the weight associated with that input, b is the bias term, and f ( z ) is the activation function. In this paper, the activation function used is the ReLU (Rectified Linear Unit) function, which is defined as follows [71]:
f z = m a x 0 , x
The training of ANN focuses on determining the optimal weights and biases that minimize prediction errors. This begins with forward propagation, passing input data through the entire network to produce the predicted output. In the initial phase, the values of the hidden layer nodes are computed using the following equation [72]:
a ( n ) = f w ( n ) .   x + b ( n )
where x denotes the input node value and a ( n ) , w ( n ) , and b ( n ) represent the node values, weight matrix, and bias vector in the n hidden layer, respectively. The node values in the output layer are then computed using the following equation [73]:
y = f w ( n + 1 ) .   a ( n ) + b ( n + 1 )
where y is the node value in the output layer. The generated output is then compared with the target or actual value using a loss function, which measures the network’s error. One of the most commonly used loss functions is the Mean Squared Error ( M S E ), which is formulated as follows [74]:
L = 1 n i = 1 n y i y ^ i 2
where n denotes the total number of data samples while y i and y ^ i represent the network’s predicted output and the actual target value for the i data point, respectively. This loss value is utilized in backpropagation, a calculus-based technique used to compute the partial derivatives of the loss function concerning each network weight [75]. The ANN can determine the optimal direction to adjust its parameters and progressively minimize prediction errors through this method. The network’s weights are subsequently updated using the gradient descent algorithm or its variants, which follow the negative gradient of the loss function. The gradient concerning the weight w is calculated using the following equation [76]:
L w = L y ^   .   y ^ z   .   z w
where L y ^ represents the derivative of the loss function concerning the predicted output y ^ , which simplifies to the expression 2 y i y ^ i . Next, y ^ z refers to the derivative of the activation function, while z w denotes the derivative of the linear combination z concerning the weight w . The updated weights are then computed using the following equation [76]:
w n e w = w o l d μ L w
where μ denotes the learning rate, which determines the magnitude of each weight update step. The cycle of forward propagation, loss computation, backpropagation, and weight adjustment is iteratively performed across multiple epochs until the ANN reaches the target performance level.

2.3.3. Support Vector Regression

SVR is a variant of the SVM algorithm developed for regression tasks [77]. Unlike traditional linear regression, which minimizes the overall error between predicted and actual values, SVR seeks to identify a regression function that stays within a predefined margin of tolerance ( ε ) from the target values while maintaining minimal model complexity. In other words, SVR attempts to construct a line that can predict the target values with a certain tolerance for error, ignoring minor errors as long as they are within the ε margin. This approach makes SVR a highly reliable method when dealing with nonlinear data and cases involving noise. In general, the SVR model seeks to find a regression function in the following form [78]:
f x = w T x + b
where w is the weight vector and b is the bias. SVR aims to minimize the norm of the weight vector w 2 to prevent overfitting while ensuring that most of the training data lies within the deviation margin ε . To accommodate data that cannot be predicted precisely within the ε margin, two relaxation variables, ξ i and ξ i , are introduced, representing the positive and negative deviations from the margin boundary. The objective function to be minimized in SVR is expressed as follows [79]:
min w , b , ξ , ξ 1 2 w 2 + C i = 1 n ξ i + ξ i
with subject
y i f x i ε + ξ i f x i y i ε + ξ i ξ i , ξ i 0
where y i denotes the actual target value, and C represents the regularization parameter that balances the trade-off between the error tolerance outside the ε -margin and the margin itself. A higher value of C results in a stricter penalty for errors beyond the margin. To effectively model nonlinear relationships, SVR employs a kernel function that transforms the input data into a higher-dimensional space, enabling more precise linear regression. The regression function used in the SVR model is defined as follows [80]:
f x = i = 1 n α i α i K x i , x + b
where α i and α i represent the Lagrange multipliers derived from solving the dual optimization problem, while K x i , x denotes the kernel function, which manages nonlinear input–output relationships efficiently without explicitly transforming the data. In this study, the SVR model employs the Radial Basis Function (RBF) kernel, defined as follows [81]:
K x i , x = e γ x i x 2
where γ > 0 serves as a scaling parameter that determines how far the influence of each data point extends within the prediction function. The SVR model requires careful tuning of the parameters C , ε , and kernel parameter γ during training. After training, the SVR model predicts outputs for new inputs by leveraging the influence of the support vectors [82].

3. Modeling Process

The modeling process in this study is divided into three stages: data preprocessing, prediction modeling, and performance analysis. For a clearer understanding of the predictive modeling methodology, the process diagram is presented in Figure 4. During the data preprocessing stage, a GAN-based method handles missing values within the dataset. The procedure involves inputting real and synthetic data produced by the generator into the discriminator. The GAN framework comprises two core components: the generator, which produces synthetic data resembling actual data, and the discriminator, which evaluates and differentiates between real and generated samples. By fine-tuning between the generator and the discriminator, the algorithm produces synthetic data deemed authentic by the discriminator. These data are then used to fill in the missing parts, resulting in a complete dataset. Next, the dataset is used in the prediction modeling stage, which comprises two models: MLR and ANN. The prediction results from each model are then optimized using an SVR approach. This technique allows the base models to provide an initial understanding of the data, which SVR further refines to improve prediction accuracy. The final stage is performance analysis, where the prediction results are evaluated using various statistical metrics. The evaluation involves an in-depth analysis of each model’s performance in predicting data and assessing the contribution of the GAN-based data imputation process to enhancing predictive model accuracy.
Figure 5 presents the architecture of the GAN, which is composed of two primary components: the generator and the discriminator. The generator takes a 20-dimensional random vector sampled from a uniform distribution and processes it through four layers. Each layer includes a combination of dropout, a defined number of nodes, batch normalization, and an activation function. The generator produces synthetic data to mimic real data fed into the discriminator. The discriminator’s role is to assess whether the input data is genuine or generated. It also features four sequential layers for processing. Training occurs in an adversarial manner: while the generator strives to create increasingly realistic data to fool the discriminator, the discriminator simultaneously enhances its ability to distinguish between real and synthetic inputs.
The structure of the MLR model with SVR optimization is shown in Figure 6. In this model, the initial approach uses MLR, which utilizes six input variables connected to one output room temperature variable through a series of coefficients from a 0 to a 6 . Each input is multiplied by a specific coefficient and summed together with the common intercept ( a 0 ) to generate the initial prediction data. The coefficient values for each room are presented in Table 3. Subsequently, the output from the MLR model is further optimized using the SVR approach, as illustrated on the right side of Figure 6. In this process, the output from the MLR model is used as input for SVR, which maps the input into a higher-dimensional space using a kernel function to minimize prediction error. Key parameters in SVR include the regularization constant ( C ), kernel parameter ( γ ), error margin ( ε ), number of support vectors ( N S V ), and bias value ( b ). These parameters are adjusted to obtain optimal values, producing more accurate prediction results. The parameter values for each room are shown in Table 4.
Meanwhile, Figure 7 displays the architecture of the ANN model refined through the SVR method. The model uses an artificial neural network to learn the nonlinear relationship between six input features and a single output. Its structure comprises four hidden layers, each containing a different number of neurons, and applies the ReLU activation function to enhance training efficiency. Table 5 describes the network structure, including the number of layers, learning rate, and epochs adjusted for each room. After training, the ANN’s output is further refined using SVR. This step involves projecting the ANN predictions into a higher-dimensional space via a kernel function and determining the optimal hyperplane to boost prediction accuracy. The SVR parameters, shown in Table 6, are specifically tuned to suit the unique characteristics of each room.
Several strategies were implemented in the model to prevent overfitting. First, the GAN architecture was equipped with dropout and batch normalization layers, preventing the model from relying too heavily on specific neurons. Second, an SVR approach was added to the model’s output. Third, setting the number of epochs and applying early stopping during ANN training helped halt the training process before the model began fitting into the datasets’ noise.

4. Performance Analysis Method

This study seeks to evaluate the performance of the proposed model through metric-based assessment and statistical analysis. The accuracy of the prediction model is measured using several standard metrics, including Mean Absolute Error ( M A E ), Mean Squared Error ( M S E ), Root Mean Squared Error ( R M S E ), and Mean Absolute Percentage Error ( M A P E ). M A E calculates the average absolute difference between predicted and actual values, indicating the overall error magnitude. Both M S E and R M S E assess the expected, predicted, and actual variance outcomes. However, R M S E , the square root of M S E , is more responsive to larger errors, offering a deeper understanding of model accuracy. On the other hand, M A P E represents prediction errors as a percentage of actual values, allowing for relative performance evaluation. The formulas for these metrics are presented below [83,84]:
M A E = 1 n i = 1 n y i y ^ i
M S E = 1 n i = 1 n y i y ^ i 2
R M S E = 1 n i = 1 n y i y ^ i 2
M A P E = 1 n i = 1 n y i y ^ i y i . 100 %
where n denotes the total number of test data points, while y i and y ^ i represent the actual and predicted values for the i data point, respectively. In addition to the error metrics, statistical indicators such as the correlation coefficient ( R ) and the coefficient of determination ( R 2 ) are employed. The correlation coefficient R measures the strength and direction of the linear relationship between two variables. An R value close to 1 suggests a strong positive correlation, meaning both variables tend to increase or decrease together. Conversely, an R value near −1 signifies a strong negative correlation, where an increase in one variable corresponds with a decrease in the other. An R value close to 0 implies a weak or negligible linear relationship. The coefficient of determination R 2 evaluates the regression model’s performance by quantifying the proportion of variance in the dependent variable that the model can explain. The R 2 ranges from 0 to 1, where 0 indicates no explanatory power, and 1 indicates that the model accounts for all observed variance. The corresponding equations for these statistical indicators are provided below [85]:
R = n i = 1 n y i y ^ i i = 1 n y i i = 1 n y ^ i n i = 1 n y i 2 i = 1 n y i 2 n i = 1 n y ^ i 2 i = 1 n y ^ i 2
R 2 = 1 i = 1 n y i y ^ i 2 i = 1 n y i y ¯ 2
Uncertainty analysis using the Monte Carlo method is a statistical technique used to estimate the uncertainty of a system or model’s output by randomly simulating input variations based on known probability distributions. This analysis is conducted by defining a distribution for each input, followed by Monte Carlo sampling, which is expressed as follows [86]:
T ^ i ¯ = 1 M k = 1 M T ^ i ( k )
where T ^ i ¯ is the average prediction obtained from all samples, M is the number of Monte Carlo samples, and T ^ i ( k ) is the predicted value using the k sample at data point i . From all the generated predictions, the standard deviation of the output ( σ i n p u t , i ) is calculated using the following equation [87]:
σ i n p u t , i = 1 M 1 k = 1 M T ^ i ( k ) T ^ i ¯ 2
Then, the mean value is calculated using the following equation [88]:
σ i n p u t ¯ = 1 N i = 1 N σ i n p u t , i
There are two components of uncertainty: the standard deviation of the model prediction error ( σ m o d e l ), which is measured using R M S E , and the standard deviation of the prediction resulting from input variability ( σ i n p u t ). These two components are assumed to be independent, as R M S E originates from the inherent error of the model, while σ i n p u t arises from measurement noise in the input. Therefore, the total prediction uncertainty can be calculated using the following equation [89]:
σ t o t a l = σ i n p u t ¯ 2 + σ m o d e l 2
If the output distribution approximates a normal distribution, the 95% confidence interval can be expressed as ± 1.96 σ t o t a l .

5. Results and Discussion

This section presents a performance evaluation for each room and an assessment of the model’s overall effectiveness. The review involves several stages, including data preprocessing, model training, testing, and comparative analysis. The model was trained on a dataset comprising 385 records in this study. The model was validated by splitting the dataset into 80% training and 20% testing data, then evaluating its performance using statistical metrics such as M A E , M S E , R M S E , M A P E , R , and R 2 for each room. In addition, the quality of GAN imputation was assessed through the loss curve during training to ensure convergence and the ability to replicate the original data distribution. Further error analysis was conducted by comparing predicted and actual values on the test data to identify how well the model captures thermal patterns and the extent of deviation. Uncertainty analysis was also performed to assess the sensitivity level of the proposed model.

5.1. Data Preprocessing Results

Figure 8 depicts the loss trends observed during the GAN model’s training across four rooms. The graphs show that the generator initially exhibits a high loss, which gradually declines with increasing epochs. This trend indicates that the generator’s ability to produce data that resembles real data is improving. In contrast, the discriminator’s loss typically increases during the early training stages before stabilizing, suggesting it becomes more effective at distinguishing real data from generated data as the generator’s output quality improves. Minor fluctuations in both losses reflect the adversarial nature of the GAN training process. The consistent patterns across all four rooms demonstrate the model’s stable and reliable training behavior under varying environmental conditions.
Figure 9 presents the results of the missing data imputation. After the imputation process, a more complete dataset that closely resembles the original data pattern was obtained. The imputed data successfully follows the trend of the original data. This alignment demonstrates that the GAN model can learn the temporal characteristics of temperature data and generate accurate estimated values.

5.2. Performance Analysis of Prediction Models

Figure 10 presents a comparison of prediction errors across different rooms. Each plot displays the error values for 30 data indices, comparing four models. In general, the errors for each model fall within a relatively narrow range, between −2 and 2, indicating that all four models demonstrate fairly good prediction performance. The error variation remains relatively high in Room 1 and Room 2, especially in the MLR and MLR-SVR models, which exhibit a wider spread of errors than the ANN model. In contrast, Room 3 and Room 4 show errors more concentrated around zero, indicating more accurate and stable predictions, particularly in the ANN and ANN-SVR models. This pattern suggests that ANN-based models produce more precise results, especially when combined with nonlinear regression techniques such as SVR. Overall, the ANN-SVR model consistently delivers the best performance across all four rooms based on the distribution of error values.
Figure 11 compares actual and predicted temperatures from four different models. The closer the data points are to the reference line, the better the model’s performance in predicting the temperature. In Rooms 1 and 2, the data points from the ANN and ANN-SVR models appear more concentrated around the reference line than in the MLR and MLR-SVR models. The same pattern is observed in Room 3 and Room 4, where the ANN-SVR model consistently produces results closer to the actual temperature. On the other hand, the MLR model tends to show a wider spread from the reference line, indicating that the linear regression method is less capable of capturing the data’s complexity. Therefore, it can be concluded that the ANN-SVR model performs the best in predicting room temperatures with high precision across all four rooms tested.
Based on the performance evaluation results summarized in Table 7, the ANN-SVR model demonstrates the highest overall accuracy. In Room 1, it achieved the lowest error values, M A E of 0.117, R M S E of 0.087, and M A P E of 0.258, along with the highest correlation and determination coefficients, with R and R 2 values of 0.986 and 0.936, respectively. A similar pattern is evident in Room 2, where the ANN-SVR model again outperforms the others, achieving an M A E of 0.052 and R 2 of 0.971. These results surpass those of the other models, which generally produce higher error values and lower R 2 scores. In Room 3, although the SVR model records a slightly higher R value than the ANN-SVR model, the latter significantly reduces error metrics, confirming its superior predictive performance. In Room 4, the standalone ANN model shows marginally better R and R 2 values. However, the ANN-SVR model yields the lowest M A E , R M S E , and M A P E , indicating better error minimization. Overall, the ANN-SVR model consistently delivers the most accurate temperature predictions across all four rooms in the Rumoh Aceh building, making it the most effective model for this application.
The analysis also shows that the R 2 values cannot fully explain the temperature variations in some rooms, particularly where R 2 is below 0.8. This condition indicates that important variables are still not yet included in the model. Several steps can be taken to improve the R 2 values, including incorporating internal factors such as the type and thermal properties of building materials, the building’s orientation to the sun, and ventilation conditions, which significantly influence the thermal dynamics inside the building. In addition, increasing the size of the dataset is also necessary to enhance the model’s generalization capability.
Table 8 presents the uncertainty analysis results using the Monte Carlo method for the ANN-SVR model in predicting room temperature. The Monte Carlo iterations were conducted over 1000 epochs. The study shows that Room 1 has a relatively low uncertainty, with a confidence interval of ±0.33 °C. Meanwhile, Room 2 and Room 3 exhibit the same confidence interval of ±1.02 °C, although with different standard deviation compositions. This indicates that the contribution of each source of uncertainty may vary yet still result in a similar level of total uncertainty. For these two rooms, the confidence interval reaches ±1.02 °C. As for Room 4, the confidence interval is ±0.8 °C. Although the confidence interval values in Room 2 and Room 3 are relatively high, at ±1.02 °C, the ANN-SVR model still demonstrates statistically accurate predictive performance, as shown in Table 7. Furthermore, according to ASHRAE 55 Standard, temperature fluctuations should not exceed ±1.1 °C within 15 min or ±2.2 °C within 1 h [90]. Therefore, the uncertainty of ±1.02 °C remains within the permissible range of temperature variation dynamics, although it does not directly represent the level of thermal comfort. In addition, thermal comfort is not solely determined by the physical parameters of the environment but is also significantly influenced by individual psychological factors [91]. These psychological factors refer to a person’s perception of adjusting to thermal conditions based on experience, expectations, personal preferences, and social and cultural conditions [92].
As a form of validation for the proposed ANN-SVR model, a performance comparison was conducted with several models from previous highly relevant studies. The comparison results presented in Table 9 show that the ANN-SVR model developed in this study outperforms all the benchmark models. This reinforces the conclusion that the proposed combination of ANN and SVR provides high accuracy and is suitable for being considered a reliable predictive model.

5.3. Thermal Prediction Results

Based on the ANN-SVR model, which has proven to be the most optimal predictive model, the process of predicting room temperature based on variations in environmental parameters was carried out. Figure 12a shows that the expected temperature increases as the outdoor temperature rises, with Room 3 showing the highest predicted temperature compared to the other rooms. This indicates that the outdoor temperature significantly impacts the indoor temperature, especially in Room 3. Figure 12b shows that an increase in outdoor humidity tends to lower the predicted temperature in all four rooms, indicating a negative influence of humidity on the room temperature. However, Room 3 still maintains the highest predicted temperature. Next, in Figure 12c, it is shown that the longer the duration of sunlight, the higher the room temperature, especially in Room 3 and Room 2. This suggests that solar radiation plays a significant role in increasing the room temperature. Finally, Figure 12d shows that wind speed does not have as significant an effect as the other variables. However, there is a slight trend of temperature increase with the increasing wind speed, particularly in Room 3 and Room 1. Overall, the prediction results indicate that the ANN-SVR model is quite sensitive to changes in environmental condition parameters, with Room 3 consistently showing higher predicted temperatures in nearly all conditions. This is due to the absence of direct ventilation to the outside environment in that room, causing hot air to be trapped and accumulated inside. Meanwhile, variations in solar intensity were not tested because solar intensity is highly sensitive to microclimatic conditions and environmental obstructions, such as surrounding trees or buildings. As a result, it does not always accurately represent the overall level of solar exposure the building receives.
The developed model is general and can be applied to other vernacular buildings in tropical regions. However, to apply it to buildings with different architectural characteristics, the model needs to be retrained using appropriate local data to capture the specific influences of the local climate.

6. Conclusions

Based on the evaluation results, applying the GAN method has proven effective in addressing missing data issues and can enhance the overall quality of the dataset. On the other hand, the ANN-SVR predictive model demonstrated the most optimal performance in terms of accuracy and stability. Evaluation results show that this model achieved the lowest average M A E ¯ and R M S E ¯ values, at 0.164 and 0.218, respectively, and the highest average R ¯ and R 2 ¯ values, at 0.785 and 0.618, respectively. This performance surpasses the MLR, MLR-SVR, and conventional ANN models. The predictive analysis results also show that environmental temperature and solar radiation duration are the dominant factors contributing to the increase in room temperature. At the same time, humidity has a negative influence, and wind speed has a relatively minor impact. These findings emphasize the effectiveness of integrating the GAN method in the preprocessing stage with the ANN-SVR predictive model while also serving as a reference for planners and architects to improve thermal comfort in vernacular buildings such as the Rumoh Aceh. However, the model developed in this study still does not consider internal factors such as occupant behaviour, building materials, and the available ventilation system. This may result in prediction outcomes that do not fully reflect the actual conditions. In practice, these factors significantly impact the thermal dynamics within the building, so the obtained predictions still do not fully represent the thermal conditions in an absolute sense. However, this model still demonstrates high accuracy based on variations in external environmental conditions and can serve as a foundation for further development by incorporating internal factors. Therefore, additional research can be conducted to develop a model that dynamically adapts based on changes in environmental conditions, occupant activity, building materials, and available ventilation. This makes the results more applicable in supporting the design planning of vernacular buildings with high thermal quality. In addition, this study has not explicitly addressed the psychological aspects of occupants in evaluating thermal comfort. By incorporating these psychological factors, further research can better understand thermal conditions in vernacular buildings.

Author Contributions

Conceptualization, M.M., S.S. and A.N.; methodology, M.M., S.S. and A.N.; software, A.N.; validation, M.M., S.S. and A.N.; formal analysis, A.N.; investigation, A.N.; resources, M.M., S.S. and A.N.; data curation, M.M., S.S. and A.N.; writing—original draft preparation, A.N.; writing—review and editing, M.M., S.S. and A.N.; visualization, A.N.; supervision, M.M.; project administration, M.M.; funding acquisition, M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Acknowledgments

The authors would like to thank the Sentro Research Group, Universitas Syiah Kuala, which has become a place for researchers to develop this journal’s research.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ANNArtificial Neural Network
GANGenerative Adversarial Network
MAEMean Absolute Error
MAPEMean Absolute Percentage Error
MLRMultiple Linear Regression
MSEMean Squared Error
OLSOrdinary Least Squares
RBFRadial Basis Function
RMSERoot Mean Squared Error
RSSResidual Sum of Squares
SVMSupport Vector Machine
SVRSupport Vector Regression

References

  1. Muslimsyah, M.; Husin, H.; Akhyar, A. Numerical Analysis of Thermal Comfort Behavior in Acehnese Traditional Houses in Indonesia. Ecol. Eng. Environ. Technol. 2025, 26, 27–42. [Google Scholar] [CrossRef] [PubMed]
  2. Sahputra, Z.; Arie Edytia, M.H. A Comparison Study on Ornament of Rumoh Aceh in Aceh Besar and Umah Pitu Ruang in Aceh Tengah. Malays. J. Sustain. Environ. 2021, 8, 1. [Google Scholar] [CrossRef]
  3. Majid, N.H.A.; Denan, Z.; Rahim, Z.A.; Nawawi, N.M.; Hazman, S.N. Sustainability Concepts in Malay and Aceh Traditional Houses. Plan. Malaysia 2017, 15, 1–12. [Google Scholar] [CrossRef]
  4. Muslimsyah; Munir, A.; Away, Y.; Abdullah; Huda, K.; Salsabilah, M. Assessment of Indoor Thermal Environment of Aceh House Based on WBGT Index. In Proceedings of the 5th International Conference on Rebuilding Place, Banda Aceh, Indonesia, 15–16 September 2021; IOP Conference Series: Earth and Environmental Science. Volume 881, pp. 1–9. [Google Scholar]
  5. Munir, A.; Sofyan, M. Thermal Comfort In Naturally Ventilated and Air Conditioned Room: A Comparison between PMV and Actual Vote. In Proceedings of the Seminar on Environment and Architecture (SENVAR), Malang, East Java, Indonesia, 10–11 November 2011. [Google Scholar]
  6. Guo, Y.; Wang, J.; Chen, H.; Li, G.; Liu, J.; Xu, C.; Huang, R.; Huang, Y. Machine Learning-Based Thermal Response Time Ahead Energy Demand Prediction for Building Heating Systems. Appl. Energy 2018, 221, 16–27. [Google Scholar] [CrossRef]
  7. Sopa, M. Local Wisdom in the Cultural Symbol of Indonesian Traditional House. In Proceedings of the 1st Annual International Conference on Language and Literature, Medan, Indonesia, 18–19 April 2018; Knowledge E: Dubai, United Arab Emirates; Volume 3, p. 524. [Google Scholar]
  8. Yun, K.; Luck, R.; Mago, P.J.; Cho, H. Building Hourly Thermal Load Prediction Using an Indexed ARX Model. Energy Build. 2012, 54, 225–233. [Google Scholar] [CrossRef]
  9. Gesti Maria Ulfahmi, C.; Rohendi Rohidi, T. Triyanto Local Wisdom Values of Rumoh Aceh. Catharsis 2018, 3, 257–264. [Google Scholar]
  10. Okafor, M.U.; Awuzie, B.O.; Otasowie, K.; Aigbavboa, C. Traditional and Contemporary Building Thermal Sensations Prediction Models in a Warm-Humid Climate. Archit. Eng. Des. Manag. 2024, 1–16. [Google Scholar] [CrossRef]
  11. Li, N.; Yu, W.; Li, B. Assessing Adaptive Thermal Comfort Using Artificial Neural Networks in Naturally-Ventilated Buildings. Int. J. Vent. 2012, 11, 205–217. [Google Scholar] [CrossRef]
  12. Xuan, Z.; Qing-dian, L.; Guo-qiang, L.; Jun-wei, Y.; Jian-cheng, Y.; Lie-quan, L.; Wei, H. Multi-Variable Time Series Forecasting for Thermal Load of Air-Conditioning System on SVR. In Proceedings of the 34th Chinese Control Conference (CCC); Hangzhou, China, 28–30 July 2015, IEEE: Piscataway, NJ, USA, 2015; pp. 8276–8280. [Google Scholar]
  13. Kim, D.-J.; Kim, S.-I.; Kim, H.-S. Thermal Simulation Trained Deep Neural Networks for Fast and Accurate Prediction of Thermal Distribution and Heat Losses of Building Structures. Appl. Therm. Eng. 2022, 202, 117908. [Google Scholar] [CrossRef]
  14. Cui, B.; Fan, C.; Munk, J.; Mao, N.; Xiao, F.; Dong, J.; Kuruganti, T. A Hybrid Building Thermal Modeling Approach for Predicting Temperatures in Typical, Detached, Two-Story Houses. Appl. Energy 2019, 236, 101–116. [Google Scholar] [CrossRef]
  15. Shrestha, A.; Shimizu, T. Thermal Performance Assessment of Traditional Japanese Wooden Houses with Short-Term Measurements: Machine Learning Approaches. J. Build. Eng. 2024, 97, 110954. [Google Scholar] [CrossRef]
  16. Hermawan; Prianto, E.; Setyowati, E. Thermal Comfort of Wood-Wall House in Coastal and Mountainous Region in Tropical Area. Procedia Eng. 2015, 125, 725–731. [Google Scholar] [CrossRef]
  17. Febryano, I.G.; Harum, O.M.A.; Wulandari, C.; Hidayat, W.; Banuwa, I.S.; Prasetia, H.; Iswandaru, D.; Novriyanti, N.; Duadji, N.; Tresiana, N.; et al. Raw Material of Besemah Traditional House Construction in Indonesia. Folia For. Pol. Ser. A For. 2021, 63, 74–80. [Google Scholar] [CrossRef]
  18. Wibowo, A.P. Changes in the Use of Building Materials in Traditional Houses in Indonesia. In Lecture Notes in Civil Engineering; Springer: Singapore, 2020; Volume 59, pp. 71–78. [Google Scholar] [CrossRef]
  19. Leo Samuel, D.G.; Dharmasastha, K.; Shiva Nagendra, S.M.; Maiya, M.P. Thermal Comfort in Traditional Buildings Composed of Local and Modern Construction Materials. Int. J. Sustain. Built Environ. 2017, 6, 463–475. [Google Scholar] [CrossRef]
  20. Lu, S.; Cui, M.; Gao, B.; Liu, J.; Ni, J.; Liu, J.; Zhou, S. A Comparative Analysis of Machine Learning Algorithms in Predicting the Performance of a Combined Radiant Floor and Fan Coil Cooling System. Buildings 2024, 14, 1659. [Google Scholar] [CrossRef]
  21. Hu, J.; Zheng, W.; Zhang, S.; Li, H.; Liu, Z.; Zhang, G.; Yang, X. Thermal Load Prediction and Operation Optimization of Office Building with a Zone-Level Artificial Neural Network and Rule-Based Control. Appl. Energy 2021, 300, 117429. [Google Scholar] [CrossRef]
  22. Yu, J.; Lu, Y.; Hu, J.; Zhong, K.; Jia, T.; Yang, X. Prediction Models for Building Thermal Mass of Intermittently Heated Rooms for Balancing Energy Consumption and Indoor Thermal Comfort. Energy Build. 2024, 317, 114376. [Google Scholar] [CrossRef]
  23. Putra, R.A.; Ekomadyo, A.S. Transformation of Architecture of Rumoh Aceh: An Encoding Process Through Semiotic. Local Wisdom Sci. Online J. 2023, 15, 1–11. [Google Scholar] [CrossRef]
  24. Rizky, S.F.; Loebis, M.N.; Pane, I.F. The Continuity of Rumoh Aceh Spaces on the Development of Houses in Gampong Lambheu, Aceh Besar. IOP Conf. Ser. Earth Environ. Sci. 2018, 126, 012002. [Google Scholar] [CrossRef]
  25. Hassan, A.S.; Ramli, M. Natural Ventilation of Indoor Air Temperature: A Case Study of the Traditional Malay House in Penang. Am. J. Eng. Appl. Sci. 2010, 3, 521–528. [Google Scholar] [CrossRef]
  26. Shaeri, J.; Yaghoubi, M.; Aflaki, A.; Habibi, A. Evaluation of Thermal Comfort in Traditional Houses in a Tropical Climate. Buildings 2018, 8, 126. [Google Scholar] [CrossRef]
  27. Foruzanmehr, A. Thermal Comfort and Practicality: Separate Winter and Summer Rooms in Iranian Traditional Houses. Archit. Sci. Rev. 2014, 59, 1–11. [Google Scholar] [CrossRef]
  28. Xi, T.; Wang, M.; Cao, E.; Li, J.; Wang, Y.; Sa’ad, S.U. Preliminary Research on Outdoor Thermal Comfort Evaluation in Severe Cold Regions by Machine Learning. Buildings 2024, 14, 284. [Google Scholar] [CrossRef]
  29. Kapetanakis, D.-S.; Mangina, E.; Finn, D.P. Input Variable Selection for Thermal Load Predictive Models of Commercial Buildings. Energy Build. 2017, 137, 13–26. [Google Scholar] [CrossRef]
  30. Albatayneh, A.; Alterman, D.; Page, A.; Moghtaderi, B. The Impact of the Thermal Comfort Models on the Prediction of Building Energy Consumption. Sustainability 2018, 10, 3609. [Google Scholar] [CrossRef]
  31. Pérez-Sánchez, J.F.; Chavez-Vega, F.R.; Calvillo-Villicaña, M.E.; Suárez-Domínguez, K.; Estrada Castro, K.E.; Luna-Domínguez, J.H.; Gallegos-Villela, R. Thermal Conductivity Prediction and Comfort in Adobe Housing in Tamaulipas. Cogent Eng. 2022, 9, 2109321. [Google Scholar] [CrossRef]
  32. Roberts, B.M.; Allinson, D.; Diamond, S.; Abel, B.; Das Bhaumik, C.; Khatami, N.; Lomas, K.J. Predictions of Summertime Overheating: Comparison of Dynamic Thermal Models and Measurements in Synthetically Occupied Test Houses. Build. Serv. Eng. Res. Technol. 2019, 40, 512–552. [Google Scholar] [CrossRef]
  33. Chen, S.; Zhou, X.; Zhou, G.; Fan, C.; Ding, P.; Chen, Q. An Online Physical-Based Multiple Linear Regression Model for Building’s Hourly Cooling Load Prediction. Energy Build. 2022, 254, 111574. [Google Scholar] [CrossRef]
  34. Pompei, L.; Nardecchia, F.; Gugliermetti, L.; Cinquepalmi, F. Design of Three Outdoor Combined Thermal Comfort Prediction Models Based on Urban and Environmental Parameters. Energy Build. 2024, 306, 113946. [Google Scholar] [CrossRef]
  35. Abdel-Jaber, F.; Dirks, K.N. A Review of Cooling and Heating Loads Predictions of Residential Buildings Using Data-Driven Techniques. Buildings 2024, 14, 752. [Google Scholar] [CrossRef]
  36. Mirarabi, A.; Nassery, H.R.; Nakhaei, M.; Adamowski, J.; Akbarzadeh, A.H.; Alijani, F. Evaluation of Data-Driven Models (SVR and ANN) for Groundwater-Level Prediction in Confined and Unconfined Systems. Environ. Earth Sci. 2019, 78, 489. [Google Scholar] [CrossRef]
  37. Ferracuti, F.; Fonti, A.; Ciabattoni, L.; Pizzuti, S.; Arteconi, A.; Helsen, L.; Comodi, G. Data-Driven Models for Short-Term Thermal Behaviour Prediction in Real Buildings. Appl. Energy 2017, 204, 1375–1387. [Google Scholar] [CrossRef]
  38. Xie, Q.; Ni, J.-Q.; Bao, J.; Su, Z. A Thermal Environmental Model for Indoor Air Temperature Prediction and Energy Consumption in Pig Building. Build. Environ. 2019, 161, 106238. [Google Scholar] [CrossRef]
  39. Afroz, Z.; Urmee, T.; Shafiullah, G.M.; Higgins, G. Real-Time Prediction Model for Indoor Temperature in a Commercial Building. Appl. Energy 2018, 231, 29–53. [Google Scholar] [CrossRef]
  40. Mateo, F.; Carrasco, J.J.; Sellami, A.; Millán-Giraldo, M.; Domínguez, M.; Soria-Olivas, E. Machine Learning Methods to Forecast Temperature in Buildings. Expert Syst. Appl. 2013, 40, 1061–1068. [Google Scholar] [CrossRef]
  41. Patonai, Z.; Kicsiny, R.; Géczi, G. Multiple Linear Regression Based Model for the Indoor Temperature of Mobile Containers. Heliyon 2022, 8, e12098. [Google Scholar] [CrossRef]
  42. Muslimsyah; Munir, A.; Away, Y.; Abdullah; Arif, T.Y.; Novandri, A. A Model of Indoor Thermal Condition Based on Traditional Acehnese Houses Using Artificial Neural Network. Heliyon 2024, 10, e40644. [Google Scholar] [CrossRef]
  43. Fan, L.; Ji, Y.; Wu, G. Research on Temperature Prediction Model in Greenhouse Based on Improved SVR. J. Phys. Conf. Ser. 2021, 1802, 042001. [Google Scholar] [CrossRef]
  44. Alawee, W.H.; Al-Haddad, L.A.; Basem, A.; Al-Haddad, A.A. A Data Augmentation Approach to Enhance Breast Cancer Detection Using Generative Adversarial and Artificial Neural Networks. Open Eng. 2024, 14, 1–9. [Google Scholar] [CrossRef]
  45. Eltehewy, R.; Abouelfarag, A.; Saleh, S.N. Efficient Classification of Imbalanced Natural Disasters Data Using Generative Adversarial Networks for Data Augmentation. ISPRS Int. J. Geo-Inf. 2023, 12, 245. [Google Scholar] [CrossRef]
  46. Jäger, S.; Allhorn, A.; Bießmann, F. A Benchmark for Data Imputation Methods. Front. Big Data 2021, 4, 1–16. [Google Scholar] [CrossRef]
  47. Khan, M.A. A Comparative Study on Imputation Techniques: Introducing a Transformer Model for Robust and Efficient Handling of Missing EEG Amplitude Data. Bioengineering 2024, 11, 740. [Google Scholar] [CrossRef]
  48. Qin, R.; Wang, Y. ImputeGAN: Generative Adversarial Network for Multivariate Time Series Imputation. Entropy 2023, 25, 137. [Google Scholar] [CrossRef] [PubMed]
  49. Permana, D.S.; Hutapea, T.D.; Praja, A.S.; Paski, J.A.I.; Makmur, E.E.S.; Haryoko, U.; Umam, I.H.; Saepudin, M.; Adriyanto, R. The Indonesia In-House Radar Integration System (InaRAISE) of Indonesian Agency for Meteorology Climatology and Geophysics (BMKG): Development, Constraint, and Progress. IOP Conf. Ser. Earth Environ. Sci. 2019, 303, 012051. [Google Scholar] [CrossRef]
  50. Zakaria, A. Correlation of BMKG with TRMM for Daily and Monthly Rainfall Series in Banten Region. J. Eng. Sci. Res. 2022, 4, 1–7. [Google Scholar] [CrossRef]
  51. ASTM C1046-95; Standard Practice for In-Situ Measurement of Heat Flux and Temperature on Building Envelope Components. ASTM International: West Conshohocken, PA, USA, 2007; pp. 1–10.
  52. López Gómez, J.; Troncoso Pastoriza, F.; Fariña, E.A.; Eguía Oller, P.; Granada Álvarez, E. Use of a Numerical Weather Prediction Model as a Meteorological Source for the Estimation of Heating Demand in Building Thermal Simulations. Sustain. Cities Soc. 2020, 62, 102403. [Google Scholar] [CrossRef]
  53. Duan, X.; Yu, S.; Chu, J.; Chen, D.; Chen, Y. Evaluation of Indoor Thermal Environments Using a Novel Predicted Mean Vote Model Based on Artificial Neural Networks. Buildings 2022, 12, 1880. [Google Scholar] [CrossRef]
  54. Fan, C.; Ding, Y. Cooling Load Prediction and Optimal Operation of HVAC Systems Using a Multiple Nonlinear Regression Model. Energy Build. 2019, 197, 7–17. [Google Scholar] [CrossRef]
  55. Wang, Y.; Zheng, W.; Li, B. A Modified Discrete Grey Model with Improved Prediction Performance for Indoor Air Temperatures in Laying Hen Houses. Biosyst. Eng. 2022, 223, 138–148. [Google Scholar] [CrossRef]
  56. Zhao, T.; Liu, Y.; Neves, L.; Woodford, O.; Jiang, M.; Shah, N. Data Augmentation for Graph Neural Networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Online, 2–9 February 2021; Volume 35, pp. 11015–11023. [Google Scholar]
  57. Goel, T.; Murugan, R.; Mirjalili, S.; Chakrabartty, D.K. Automatic Screening of COVID-19 Using an Optimized Generative Adversarial Network. Cogn. Comput. 2024, 16, 1666–1681. [Google Scholar] [CrossRef]
  58. Marano, G.C.; Rosso, M.M.; Aloisio, A.; Cirrincione, G. Generative Adversarial Networks Review in Earthquake-Related Engineering Fields. Bull. Earthq. Eng. 2024, 22, 3511–3562. [Google Scholar] [CrossRef]
  59. Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. Adv. Neural Inf. Process. Syst. 2014, 27, 1–9. [Google Scholar]
  60. Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Networks. Commun. ACM 2020, 63, 139–144. [Google Scholar] [CrossRef]
  61. Yilmaz, B. Generative Adversarial Network for Load Data Generation: Türkiye Energy Market Case. Math. Model. Numer. Simul. Appl. 2023, 3, 141–158. [Google Scholar] [CrossRef]
  62. King, M.; Kim, B.-J.; Yune, C.-Y. Prediction Model of Undisturbed Ground Temperature Using Artificial Neural Network (ANN) and Multiple Regressions Approach. Geothermics 2024, 119, 102945. [Google Scholar] [CrossRef]
  63. Katić, D.; Krstić, H.; Ištoka Otković, I.; Begić Juričić, H. Comparing Multiple Linear Regression and Neural Network Models for Predicting Heating Energy Consumption in School Buildings in the Federation of Bosnia and Herzegovina. J. Build. Eng. 2024, 97, 110728. [Google Scholar] [CrossRef]
  64. Rao, V.V.; Garg, T.; Datta, S.P. Predictive Assessment from ANN and MLR Models to Optimize the Ideal Evaporative/Hybrid Cooler Based on Experimental Observations. J. Build. Eng. 2021, 44, 103256. [Google Scholar] [CrossRef]
  65. Qiang, G.; Zhe, T.; Yan, D.; Neng, Z. An Improved Office Building Cooling Load Prediction Model Based on Multivariable Linear Regression. Energy Build. 2015, 107, 445–455. [Google Scholar] [CrossRef]
  66. Moon, J.W. Performance of ANN-Based Predictive and Adaptive Thermal-Control Methods for Disturbances in and Around Residential Buildings. Build. Environ. 2012, 48, 15–26. [Google Scholar] [CrossRef]
  67. D’Amico, A.; Ciulla, G. An Intelligent Way to Predict the Building Thermal Needs: ANNs and Optimization. Expert Syst. Appl. 2022, 191, 116293. [Google Scholar] [CrossRef]
  68. Away, Y.; Novandri, A.; Raziah, I.; Melinda, M. A New Technique for Improving the Precision of Dual-Axis Tetrahedron-Based Sensor with Switching Between PID and ANN. IEEE Access 2023, 11, 89138–89151. [Google Scholar] [CrossRef]
  69. Ashtiani, A.; Mirzaei, P.A.; Haghighat, F. Indoor Thermal Condition in Urban Heat Island: Comparison of the Artificial Neural Network and Regression Methods Prediction. Energy Build. 2014, 76, 597–604. [Google Scholar] [CrossRef]
  70. Ruano, A.E.; Crispim, E.M.; Conceição, E.Z.E.; Lúcio, M.M.J.R. Prediction of Building’s Temperature Using Neural Networks Models. Energy Build. 2006, 38, 682–694. [Google Scholar] [CrossRef]
  71. Rohana, R.; Hardi, S.; Nasaruddin, N.; Away, Y.; Novandri, A. Multi-Stage ANN Model for Optimizing the Configuration of External Lightning Protection and Grounding Systems. Energies 2024, 17, 4673. [Google Scholar] [CrossRef]
  72. Attoue, N.; Shahrour, I.; Younes, R. Smart Building: Use of the Artificial Neural Network Approach for Indoor Temperature Forecasting. Energies 2018, 11, 395. [Google Scholar] [CrossRef]
  73. Fouladfar, M.H.; Soppelsa, A.; Nagpal, H.; Fedrizzi, R.; Franchini, G. Adaptive Thermal Load Prediction in Residential Buildings Using Artificial Neural Networks. J. Build. Eng. 2023, 77, 107464. [Google Scholar] [CrossRef]
  74. Dobbin, K.K.; Simon, R.M. Optimally Splitting Cases for Training and Testing High Dimensional Classifiers. BMC Med. Genom. 2011, 4, 31. [Google Scholar] [CrossRef]
  75. Wang, J.; Chen, B.; Yang, W.; Xu, D.; Yan, B.; Zou, E. BP Neural Network Multi-Module Green Roof Thermal Performance Prediction Model Optimized Based on Sparrow Search Algorithm. J. Build. Eng. 2024, 96, 110615. [Google Scholar] [CrossRef]
  76. Cho, J.H.; Moon, J.W. Integrated Artificial Neural Network Prediction Model of Indoor Environmental Quality in a School Building. J. Clean. Prod. 2022, 344, 131083. [Google Scholar] [CrossRef]
  77. Tajuddeen, I.; Sajjadian, S.M.; Jafari, M. Regression Models for Predicting the Global Warming Potential of Thermal Insulation Materials. Buildings 2023, 13, 171. [Google Scholar] [CrossRef]
  78. Wang, G.; Cai, C.; Pei, J.; Zhu, X. Prediction of Thermal Conductivity of Polymer-Based Composites by Using Support Vector Regression. Sci. China Phys. Mech. Astron. 2011, 54, 878–883. [Google Scholar] [CrossRef]
  79. Kavitha, R.; Mukesh Kumar, P.C. A Comparison between MLP and SVR Models in Prediction of Thermal Properties of Nano Fluids. J. Appl. Fluid Mech. 2018, 11, 7–14. [Google Scholar] [CrossRef]
  80. Suriya Prakash, J.; Annamalai Vignesh, K.; Ashok, C.; Adithyan, R. Multi Class Support Vector Machines Classifier for Machine Vision Application. In Proceedings of the 2012 International Conference on Machine Vision and Image Processing (MVIP), Coimbatore, India, 14–15 December 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 197–199. [Google Scholar]
  81. Ma, Y.; Guo, G. Support Vector Machines Applications. In Support Vector Machines Applications; Ma, Y., Guo, G., Eds.; Springer International Publishing: Cham, Switzerland, 2014; pp. 23–48. ISBN 978-3-319-02299-4. [Google Scholar]
  82. Nasution, E.S.; Away, Y.; Syahrial; Sara, I.D.; Novandri, A. SVR-Optimized ANN Model for Predicting Earthquake Risk in Electrical Substations Based on Disaster Datasets in the Aceh Region, Indonesia. IEEE Access 2025, 13, 105027–105043. [Google Scholar] [CrossRef]
  83. Wang, W.; Lu, Y. Analysis of the Mean Absolute Error (MAE) and the Root Mean Square Error (RMSE) in Assessing Rounding Model. IOP Conf. Ser. Mater. Sci. Eng. 2018, 324, 012049. [Google Scholar] [CrossRef]
  84. Chai, T.; Draxler, R.R. Root Mean Square Error (RMSE) or Mean Absolute Error (MAE)? Arguments Against Avoiding RMSE in the Literature. Geosci. Model Dev. 2014, 7, 1247–1250. [Google Scholar] [CrossRef]
  85. Chicco, D.; Warrens, M.J.; Jurman, G. The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE and RMSE in Regression Analysis Evaluation. PeerJ Comput. Sci. 2021, 7, e623. [Google Scholar] [CrossRef]
  86. Hu, X.; Chen, X.; Parks, G.T.; Yao, W. Review of Improved Monte Carlo Methods in Uncertainty-Based Design Optimization for Aerospace Vehicles. Prog. Aerosp. Sci. 2016, 86, 20–27. [Google Scholar] [CrossRef]
  87. Zhang, J. Modern Monte Carlo Methods for Efficient Uncertainty Quantification and Propagation: A Survey. WIREs Comput. Stat. 2021, 13, e1539. [Google Scholar] [CrossRef]
  88. Cox, M.G.; Siebert, B.R.L. The Use of A Monte Carlo Method for Evaluating Uncertainty and Expanded Uncertainty. Metrologia 2006, 43, S178–S188. [Google Scholar] [CrossRef]
  89. Harris, P.M.; Cox, M.G. On a Monte Carlo Method for Measurement Uncertainty Evaluation and Its Implementation. Metrologia 2014, 51, S176–S182. [Google Scholar] [CrossRef]
  90. Ansi/Ashrae ANSI/ASHRAE 55:2004; Thermal Environmental Conditions for Human Occupancy. ASHRAE: Atlanta, GA, USA, 2004; Volume 2004, p. 30.
  91. Özbey, M.F.; Turhan, C. A Novel Comfort Temperature Determination Model Based on Psychology of the Participants for Educational Buildings in a Temperate Climate Zone. J. Build. Eng. 2023, 76, 107415. [Google Scholar] [CrossRef]
  92. Özbey, M.F.; Turhan, C. Investigating the Relationship Between Thermal Comfort and Human Psychology: A Review. J. Build. Des. Environ. 2024, 3. [Google Scholar] [CrossRef]
  93. Feng, D.; Zhou, B.; Hassan, S.G.; Xu, L.; Liu, T.; Cao, L.; Liu, S.; Guo, J. A Hybrid Model for Temperature Prediction in a Sheep House. Animals 2022, 12, 2806. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The geographical location of Rumoh Aceh as the study area.
Figure 1. The geographical location of Rumoh Aceh as the study area.
Buildings 15 02448 g001
Figure 2. Variation of weather parameters in the study area: (a) temperature, (b) humidity, (c) sunshine duration, (d) wind speed.
Figure 2. Variation of weather parameters in the study area: (a) temperature, (b) humidity, (c) sunshine duration, (d) wind speed.
Buildings 15 02448 g002
Figure 3. Location of the study object at the Rumoh Aceh.
Figure 3. Location of the study object at the Rumoh Aceh.
Buildings 15 02448 g003
Figure 4. Modeling method diagram.
Figure 4. Modeling method diagram.
Buildings 15 02448 g004
Figure 5. The architecture of GAN with generator and discriminator models.
Figure 5. The architecture of GAN with generator and discriminator models.
Buildings 15 02448 g005
Figure 6. The structure of the MLR model with SVR optimization.
Figure 6. The structure of the MLR model with SVR optimization.
Buildings 15 02448 g006
Figure 7. The structure of the ANN model with SVR optimization.
Figure 7. The structure of the ANN model with SVR optimization.
Buildings 15 02448 g007
Figure 8. Generator and discriminator loss during training on the preprocessed data: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Figure 8. Generator and discriminator loss during training on the preprocessed data: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Buildings 15 02448 g008
Figure 9. The results of missing data imputation using GAN: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Figure 9. The results of missing data imputation using GAN: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Buildings 15 02448 g009aBuildings 15 02448 g009b
Figure 10. The results of the error analysis comparison for each model: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Figure 10. The results of the error analysis comparison for each model: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Buildings 15 02448 g010
Figure 11. The comparison result of model performance: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Figure 11. The comparison result of model performance: (a) room 1, (b) room 2, (c) room 3, (d) room 4.
Buildings 15 02448 g011
Figure 12. The prediction results using the ANN-SVR model based on various variables: (a) outdoor temperature, (b) outdoor humidity, (c) sunshine duration, (d) wind speed.
Figure 12. The prediction results using the ANN-SVR model based on various variables: (a) outdoor temperature, (b) outdoor humidity, (c) sunshine duration, (d) wind speed.
Buildings 15 02448 g012
Table 1. Measuring instrument specifications.
Table 1. Measuring instrument specifications.
InstrumentSensorTypeRangeResolutionAccuracy
EnvironmentTemperatureHMP155−80–60 °C~0.01 °C±0.5 °C
HumidityHMP1550–100%~0.1%±2%
SunshineCSD-3400–1100 nm-±5%
Wind SpeedMi90002–60 m/s~1.5 m/s±0.5 m/s
IndoorTemperatureDHT22−40–80 °C~0.1 °C±0.5 °C
Table 2. Dataset parameters.
Table 2. Dataset parameters.
ParameterVariableUnitMinMax
Input T m i n 21.624
T m a x 30.635
T a v g 25.929.2
H a v g %6288
S t i m e hour 2.510.4
W s p e e d m s 413
Output T r o o m 1 24.933.1
T r o o m 2 24.633.8
T r o o m 3 25.333.1
T r o o m 4 24.132.9
Table 3. The constant coefficient values used in the MLR model.
Table 3. The constant coefficient values used in the MLR model.
Room a 0 a 1 a 2 a 3 a 4 a 5 a 6
122.8782−0.2223−0.01330.4488−0.00650.0868−0.0495
227.5445−0.09920.01770.2019−0.03430.1244−0.0429
326.7997−0.2302−0.00350.3626−0.03660.1773−0.0165
414.91440.20160.13620.2366−0.02520.0612−0.0482
Table 4. The parameter values used in the SVR model to optimize the MLR model.
Table 4. The parameter values used in the SVR model to optimize the MLR model.
Room C γ ε N S V b
11000.10.052229.7149
210000.050.052331.0867
310000.050.052030.7571
410000.010.052130.6171
Table 5. The structure of the ANN model.
Table 5. The structure of the ANN model.
RoomFeatureValue
1Number of hidden layers4
Network structure6–512–256–128–64–1
Activation functionReLU
Learning rate0.002
Epoch817
2Number of hidden layers4
Network structure6–512–256–128–64–1
Activation functionReLU
Learning rate0.001
Epoch1207
3Number of hidden layers4
Network structure6–512–256–128–64–1
Activation functionReLU
Learning rate0.001
Epoch1017
4Number of hidden layers4
Network structure6–512–256–128–64–1
Activation functionReLU
Learning rate0.001
Epoch1590
Table 6. The parameter values used in the SVR model to optimize the ANN model.
Table 6. The parameter values used in the SVR model to optimize the ANN model.
Room C γ ε N S V b
11000.10.051528.1055
210000.10.051828.8924
3100.030.11628.2237
4100000.0010.1200.3554
Table 7. The performance evaluation of each model.
Table 7. The performance evaluation of each model.
RoomModel M A E M S E R M S E M A P E R R 2
1MLR0.2350.0820.2860.793%0.7890.622
MLR-SVR0.1960.0560.2370.661%0.7230.523
ANN0.1640.0570.2380.562%0.6670.445
ANN-SVR0.0750.0160.1250.256%0.6990.489
2MLR0.2780.1180.3440.948%0.630.397
MLR-SVR0.1050.0250.1570.356%0.5930.351
ANN0.1760.0680.260.56%0.8790.773
ANN-SVR0.1520.0460.2140.516%0.8440.712
3MLR0.20.0630.2510.67%0.90.81
MLR-SVR0.2050.0670.2590.686%0.870.757
ANN0.3240.1610.41.094%0.8150.664
ANN-SVR0.2790.1140.3380.938%0.8140.662
4MLR0.2190.0790.280.762%0.8190.671
MLR-SVR0.1760.0530.2290.609%0.8110.657
ANN0.250.0960.310.871%0.8010.641
ANN-SVR0.1480.0370.1930.515%0.7810.61
Table 8. The uncertainty analysis results.
Table 8. The uncertainty analysis results.
Room σ i n p u t ¯ (°C) σ m o d e l (°C) σ t o t a l (°C) 95 % C I ( ± 1.96 σ t o t a l )
10.110.1250.17±0.33 °C
20.470.2140.52±1.02 °C
30.390.3380.52±1.02 °C
40.360.1930.41±0.8 °C
Table 9. The comparison of the proposed model with other published models.
Table 9. The comparison of the proposed model with other published models.
Model M A E ¯ M S E ¯ R M S E ¯ M A P E ¯ R ¯ R 2 ¯
NARX [39]-0.083-0.837%--
ANFIS [38]--0.6772.529%-0.743
BPNN [43]-0.01956---0.99945
PB-MLR [33]34,113-52,2322.64%-0.9784
XGBoost [93]1.20881.57821.2562--0.8312
PSO-XGBoost [93]0.14950.02810.1676--0.9969
ANN-SVR
(Proposed)
0.1640.0530.2180.556%0.7850.618
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Muslimsyah, M.; Safwan, S.; Novandri, A. Comprehensive Assessment of Indoor Thermal in Vernacular Building Using Machine Learning Model with GAN-Based Data Imputation: A Case of Aceh Region, Indonesia. Buildings 2025, 15, 2448. https://doi.org/10.3390/buildings15142448

AMA Style

Muslimsyah M, Safwan S, Novandri A. Comprehensive Assessment of Indoor Thermal in Vernacular Building Using Machine Learning Model with GAN-Based Data Imputation: A Case of Aceh Region, Indonesia. Buildings. 2025; 15(14):2448. https://doi.org/10.3390/buildings15142448

Chicago/Turabian Style

Muslimsyah, Muslimsyah, Safwan Safwan, and Andri Novandri. 2025. "Comprehensive Assessment of Indoor Thermal in Vernacular Building Using Machine Learning Model with GAN-Based Data Imputation: A Case of Aceh Region, Indonesia" Buildings 15, no. 14: 2448. https://doi.org/10.3390/buildings15142448

APA Style

Muslimsyah, M., Safwan, S., & Novandri, A. (2025). Comprehensive Assessment of Indoor Thermal in Vernacular Building Using Machine Learning Model with GAN-Based Data Imputation: A Case of Aceh Region, Indonesia. Buildings, 15(14), 2448. https://doi.org/10.3390/buildings15142448

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop