Enhancing Office Comfort with Personal Comfort Systems: A Data-Driven Machine Learning Approach

Wegertseder-Martinez, Paulina; Restrepo-Medina, Silvia E.; Aedo-García, Roberto; Soto-Concha, Raul

doi:10.3390/buildings15101676

Open AccessArticle

Enhancing Office Comfort with Personal Comfort Systems: A Data-Driven Machine Learning Approach

by

Paulina Wegertseder-Martinez

^1,*

,

Silvia E. Restrepo-Medina

^2,3

,

Roberto Aedo-García

^4,*

and

Raul Soto-Concha

⁵

¹

Design and Theory of Architecture Department, Universidad del Bío-Bío, Av. Collao 1202, Concepción 4051381, Chile

²

Departamento de Ingeniería Eléctrica, Universidad Católica de la Santísima Concepción, Alonso de Ribera 2850, Concepción 4090541, Chile

³

Centro de Energía, Universidad Católica de la Santísima Concepción, Alonso de Ribera 2850, Concepción 4090541, Chile

⁴

Department of Physics, Faculty of Science, Universidad del Bío-Bío, Av. Collao 1202, Concepción 4051381, Chile

⁵

Department of Engineering Sciences, Universidad de Los Lagos, Puerto Montt 5480000, Chile

^*

Authors to whom correspondence should be addressed.

Buildings 2025, 15(10), 1676; https://doi.org/10.3390/buildings15101676

Submission received: 25 April 2025 / Revised: 11 May 2025 / Accepted: 13 May 2025 / Published: 15 May 2025

(This article belongs to the Section Building Energy, Physics, Environment, and Systems)

Download

Browse Figures

Versions Notes

Abstract

Personal Comfort Systems (PCS) have emerged as a flexible alternative to address the diversity of environmental perceptions in office environments. Unlike conventional HVAC systems, PCSs allow users to improve their satisfaction and comfort by exercising individualized control over their immediate environment without interfering with others around them. This study evaluated the use of machine learning models generated by H2O AutoML to predict the use of three PCSs in four office buildings with effective occupancy. These were a thermal wristband, a desk fan, and an adjustable lamp. Data collected through environmental sensors, perception surveys, and spatial and personal attributes were used. Synthetic data augmentation and automated variable selection were also used to optimize the models’ performance. The predictive models had a robust performance, with R² values in the test set of 0.86 for the wristband, 0.84 for the fan, and 0.52 for the lamp. The most influential variables included the BMI, CO₂ level, and thermal satisfaction, highlighting the importance of physiological and subjective factors. The results confirm that the models allow anticipating the use of PCS with high precision in most cases, laying the foundations for the future implementation of user-oriented adaptive systems. This preliminary approach contributes to the design of healthier, more personalized, and more energy-efficient work environments.

Keywords:

personal comfort; machine learning; AutoML; comfort models; offices

1. Introduction

Environmental comfort in indoor spaces has been regulated from the four traditional physical domains of Indoor Environmental Quality (IEQ): thermal, air quality, lighting, and acoustics. The thermal parameter, in particular, is one of the core parameters for building design and is the most studied within the IEQ parameters in the built environment [1,2,3]. This is partly because, according to surveys, occupants usually indicate that the thermal sensation is the most influential factor for their performance and behaviors [4]. These considerations have promoted regulations and standards that provide environments considered “neutral" or statistically acceptable for “average” people [5]. However, this approach tends to ignore the diversity of individual needs between users and within the same individual over time, which generates important limitations in people’s comfort and well-being [6,7].

Conventional thermal comfort models, such as the PMV/PPD and the Adaptive Comfort models, are based on population averages, but have a low power of prediction at an individual level [8,9,10], which is why new options for flexibility and customization of the environment have emerged with the use of Personal Comfort Systems (PCS). These systems are equipment that heats and/or cools occupants without affecting the surrounding environment, from commonly used devices to innovative technologies, usually controlled by people [11]. They have an individual approach that significantly improves people’s environmental comfort [8,12], with benefits related to health and productivity and possible energy savings for the building [13].

As these systems are integrated into real environments, the need arises to develop models capable of learning and predicting their use based on multiple environmental and behavioral variables. In this respect, machine learning, particularly the Automatic Machine Learning (AutoML) approach, has proven to be a powerful tool for automating the development of complex predictive models. Here, platforms such as H2O AutoML allow multiple algorithms, such as Gradient Boosting Machines (GBM), Deep Neural Networks (DNNs), and Random Forests, to be trained, compared, and assembled automatically without requiring expert intervention at each stage of the process [14,15]. In addition, integrating interpretative techniques improves the model’s transparency and facilitates understanding each variable’s contribution to the final result [16].

However, using AutoML is not without its challenges. Its high computational cost can represent a barrier in scenarios with limited resources, and its final performance is still highly dependent on the quality of the input data [17,18]. Similarly, its generalization ability can be affected in contexts with unstructured or noisy data, as is usually true in real-life environments [19]. Various studies have explored the use of regression algorithms, neural networks, and clustering techniques to model the relationship between environmental conditions, individual preferences, and comfort devices [8,20,21,22]. In particular, the combination of the Internet of Things (IoT) and machine learning has made it possible to implement adaptive systems capable of responding in real time to changes in the environment or to the needs of occupants, with benefits in terms of both comfort and energy efficiency [23]. In this context, this study aims to evaluate the use of models generated by H2O AutoML to predict the use of PCS in office buildings, considering a set of environmental, spatial, subjective perception, and personal characteristics variables. Through an empirical approach, the goal is to contribute to developing new personalized comfort models, evaluating their performance in real occupancy conditions, and using low-cost devices integrated into everyday office contexts [24,25]. This research aims to provide evidence on the feasibility of implementing personal comfort systems that respond dynamically to individual requirements, promoting healthier, adaptive, and energy-efficient work environments [26] and becoming an alternative to conventional comfort models. It is hypothesized that it is possible to design predictive models using AutoML techniques trained with data collected in real occupancy conditions to accurately anticipate portable personal comfort systems use, even when facing dynamic variations in environmental conditions and individual preferences. If confirmed, this hypothesis would support the potential of machine learning as an effective tool in developing a new occupant-centered environmental control paradigm, promoting the design of more adaptive, comfortable, and energy-efficient work environments.

2. Materials and Methods

This study is grounded in the hypothesis that the predominant design of office spaces characterized by shared environments with centralized HVAC systems lacks a holistic approach to IEQ. As a result, such configurations are often insufficient to meet the diverse comfort needs of occupants. To address this limitation, the integration of PCS is proposed as a means to enhance individual comfort perceptions without negatively affecting others in the same environment. In addition, the study seeks to explore the synergistic and/or antagonistic interactions among physical-environmental stimuli, personal characteristics, and subjective perceptions that influence individual behaviors and actions. Figure 1 illustrates the overall research framework, encompassing all phases from hypothesis formulation to data collection and analysis. However, the present article focuses specifically on the development and evaluation of predictive models for PCS usage. Additional findings related to environmental monitoring, user perception, and behavioral analysis are discussed in previous publications [12,27].

The field study was conducted in real office environments with active occupancy and involved the synchronous evaluation of three PCS: a desk fan, adjustable lighting lamps, and a wearable thermal regulation device, the Embr Wave wristband [28]. Environmental monitoring was carried out using Netatmo smart weather stations installed at selected workstations. These were configured to record temperature, relative humidity (RH), and CO₂ concentration every five minutes, with a measurement uncertainty of ±0.3 °C and ±0.3%RH. In parallel, custom-developed smart plugs were used to automatically log PCS energy consumption and usage patterns (Figure 2).

The research adopted a quantitative and correlational methodology, integrating continuous monitoring of environmental conditions and PCS operation with the administration of surveys and structured interviews. A validated questionnaire was used to collect data on occupants’ subjective perceptions of thermal comfort, indoor air quality, lighting, perceived control, and health, along with other relevant parameters.

Data collection was conducted in four office buildings located in different cities in Chile, during both summer and winter seasons, allowing the inclusion of varied climatic conditions in the analysis. Based on results obtained which indicated that PCS significantly improved immediate user satisfaction even under suboptimal environmental conditions, machine learning models were developed to predict PCS usage. These models, trained with field-collected data using AutoML techniques [29,30], enabled the identification of the key variables that influence the effective use of PCS devices.

2.1. Generation of Synthetic Data

A controlled data augmentation strategy was implemented to expand the available dataset for modeling, combining instance replication with the incorporation of random values generated from validated statistical distributions. This procedure began with a detailed statistical analysis of the original dataset, estimating key metrics, such as the mean, median, standard deviation, and the empirical probability distribution of each variable [31]. Based on these distributions, new values were generated using random sampling methods [32], aiming to preserve the statistical structure of the original database. This approach has been widely supported in the literature as an effective strategy in contexts with limited data availability [33,34]. Given that our original dataset included only 72 records, below the minimum of 200 observations recommended for stable AutoML performance according to LeDell and Poirier [14], this synthetic data generation process was essential to ensure robust model training using H2O AutoML.

To make the augmented data more representative of real-world conditions, randomly missing values were introduced into selected variables. These were imputed using several strategies, including linear regression, random imputation, and the k-nearest neighbors (KNN) algorithm, which have shown good performance in similar data environments [35,36].

To verify that no significant differences existed between the real and simulated data, statistical hypothesis tests were conducted. Specifically, Student’s t-tests were applied to compare the means of the original and synthetic datasets for the three target variables (wristband, fan, and lamp), using a significance level of

α = 0.05

. The results (all p-values > 0.88) indicated no statistically significant differences, thereby confirming the representativeness and integrity of the expanded dataset [37].

2.2. H2O Automated Machine Learning

Predictive modeling measures the percentage of use of PCS devices (wristband, fan, and lamp). This is performed through supervised regression techniques using the AutoML module of the H2O.ai (version 3.46.0.6) platform [38]. This tool fully automates the training and evaluation of multiple machine-learning algorithms, optimizing their performance according to predefined statistical metrics. In the first stage, models were trained using all available variables as predictive attributes. Subsequently, a variable selection strategy was applied based on two complementary criteria: (i) the relative importance of the variables, extracted directly from the generated models [39], and (ii) the Pearson correlation coefficient r between each predictor variable and the target variable, considering only those with absolute coefficients above 0.2 (|r| > 0.2), following the recommendations for exploratory analysis [40]. This process is illustrated in Figure 3.

Operation of the H2O AutoML Module

The AutoML module automatically executes a sequence of key tasks on the dataset:

Internal pre-processing, including cross-validation, detection of categorical variables, and standardization if necessary.
Parallel training of multiple base algorithms, such as Random Forest, Gradient Boosting Machine (GBM), XGBoost, Deep Learning, and Generalized Linear Models (GLM) [41].
Automatic search for hyperparameters using optimization methods such as grid search or random search.
Generation of Stacked Ensembles, which combines the models with the best individual performance to generate a more robust prediction [42].

2.3. General Methodological Flow

Figure 4 visually summarizes the methodology used to develop predictive models for the use of personal comfort systems in offices. The process began by expanding the database and generating new synthetic records that retained the statistical properties of the original data, which were validated by Student’s t-tests and distribution analysis. Regression models were built for each target variable in the initial modeling stage using all the available variables. Subsequently, relevant variables were identified, considering their importance in the generated models and their correlation with the target variables. Optimized modeling was conducted for each PCS with this selection, using only the most influential attributes to build more efficient and accurate models.

For each target variable, the one that obtained the best performance in the validation set was selected as the final model, evaluated using metrics such as the mean absolute error (MAE), the root mean square error (RMSE), and the coefficient of determination (R²). This strategy allowed the identification of the most effective algorithm in each case and capturing complex non-linear relationships between variables without the need to develop specific manual code for each model.

3. Results

3.1. Evaluation of Database Expansion

The comparison between the original (O) and the augmented database (A) for the target variables wristband, fan, and lamp shows that the data expansion maintains the general statistical characteristics of the original distribution. The sample size increased from 72 to 360 records for each variable, representing a five-fold expansion in the available data. The augmented database’s mean values and standard deviations are similar to the original ones, suggesting that the data distribution has not changed significantly after the expansion. In addition, the minimum and maximum values remain unchanged, indicating that no extreme values outside the original range were introduced. The percentiles (25%, 50% and 75%) show slight variations, confirming that the data structure has been preserved. To evaluate whether the differences observed between the original and augmented data are statistically significant, a Student’s t-test was applied for independent samples. The values of p, which represent the probability that the observed differences are random, were 0.998 for the wristband, 0.986 for the fan, and 0.88 for the lamp, indicating no significant differences between datasets (

p > 0.05

). This supports the validity of the data augmentation procedure, ensuring that no substantial biases were introduced into the distribution of the target variables. Table 1 presents a statistical summary of the target variables, differentiating between the original and augmented data.

Figure 5 compares the distributions of the original and the augmented database to visualize the similarity between the two databases. The data expansion respected the structure of the original distributions, reinforcing some frequencies without introducing significant biases.

The histograms show that the data expansion has respected the original distribution of each target variable, ensuring that the artificially generated data do not introduce significant biases. The structure of the distributions is maintained, although with a reinforcement in specific frequencies to improve the representativeness of the data.

3.2. Performance Assessment of the Initial Models

The initial predictive models were trained using all the attributes available in the augmented database (N = 360). The results obtained after this initial modeling process are summarized in Table 2, where the leading performance indicators for each target variable are presented. In all cases, it is observed that the models achieved a satisfactory fit in the training phase, with coefficients of determination (R²) above 0.95, indicating a high explanatory capacity with the complete set of attributes. In particular, the best performance was achieved through assembly strategies for the wristband and fan variables. For the lamp, the most efficient model was the Gradient Boosting Machine. Although a slight decrease in performance is observed in the test data, especially for the lamp variable, the mean absolute error (MAE) and root mean square error (RMSE) metrics remain within acceptable ranges. These results not only evidence the validity of the augmented data used in the training but also support the relevance of the approach adopted for the selection of variables using automated analysis.

3.3. Selected Variables and Optimization of the Model

A variables selection process was conducted to improve the models’ efficiency and accuracy based on their importance in the model and their correlation with the target variable. As a result, key attributes were identified into five categories: environmental, spatial, personal characteristics, perception of control, and satisfaction with comfort (Table 3). It was observed that the satisfaction and comfort stated by occupants in the surveys have a significant value in the design of the PCS use predictive model. Table A1 in Appendix A shows a description of the selected variables.

3.4. Metrics of Optimized Models

Table 4 presents the performance of the predictive models optimized for each target variable, using only the variables selected after the attribute reduction process.

In all cases, the model that achieved the best performance was an assembly strategy, which confirms this technique’s effectiveness in capturing complex relationships between variables. For the wristband use prediction, the optimized model obtained an MAE of 4.10 and an R² of 0.86 in the test set, showing a robust fit comparable to the complete model (MAE de 4.29 y R² de 0.87). In the case of the fan, the simplified model outperformed the model with all variables, reaching an R² of 0.84 vs. 0.73 and reducing the MAE from 9.69 to 8.11. For the lamp, although the performance of the optimized model was slightly lower (R² of 0.52 vs. 0.51), the overall behavior remained stable. These results show that simplified models maintain a sound predictive capacity, although the metrics do not improve significantly or even decrease slightly in some cases. In addition, by reducing the complexity of the set of attributes, the model’s interpretability is increased, favoring its applicability in real contexts. This simplification brings greater significance to the model by focusing on relevant variables, contributing to designing more efficient, understandable, and adaptable solutions in work environments. Based on the complete set of available attributes, a variables selection strategy was applied that combined the analysis of relative importance in the models generated by AutoML and the statistical correlation with the target variables. This approach made it possible to identify the most influential factors in predicting the use of each device. Table 5 presents the hierarchy of importance of the predictor variables for each target variable (wristband, fan, and lamp) according to the ranking automatically generated by the H2O AutoML module. This analysis provides empirical evidence on which attributes have the most significant weight in the prediction, which is key for developing optimized and explainable models.

In the wristband case, the most influential attributes that stand out are the BMI, the average CO₂ level, and the cold temperature satisfaction variable (ColdTempSat), suggesting that both personal characteristics and environmental parameters significantly influence the use of this device. For the fan, BMI again appears as the most relevant predictor, followed by variables associated with thermal perception and environmental quality, such as WarmTempSat and the average outdoor temperature. In the case of the lamp, the most important attributes are mainly related to environmental and personal conditions, highlighting the average CO₂, age, and BMI. Transversally, variables such as FurnComSat, PrivSat, and Ventil also appear in the ranking of importance, evidencing the subjective perception of comfort in predicting the use of personal comfort systems. These results allow the identification of differential patterns in the factors that condition the use of each type of device and reinforce the need to consider both objective and subjective variables in modeling user behavior.

3.5. Predictive Model Results for the Wristband

The residual graph of Figure 6, shows evidence that the residuals are moderately randomly distributed around zero throughout the range of fitted values. However, some outliers are observed, especially for higher predictions. This indicates a slight tendency towards underestimation at the upper extremes, but without obvious structural patterns that compromise the model’s validity. The slight inclination of the trend line suggests that, although the model is generally accurate, it could benefit from additional adjustment in extreme cases.

Figure 7 compares real and predicted values of the use of the wristband in a logarithmic scale. A remarkable match is observed between both axes, especially between the real values between 10 and 60, where most predictions align closely to the diagonal, evidencing that the model accurately represents the most frequent usage patterns. At the extremes, the dispersion increases slightly: at low values (less than 10), there is a slight tendency towards overestimation, while above 60, some underestimated cases are identified. Even with these variations, the distribution of points maintains a coherent trajectory, reflecting a clear proportional relationship between predictions and observed data. This confirms that the model adequately captures the dynamics of device usage, particularly in the moderate usage range, which concentrates most of the analyzed records.

Based on the graphical analysis and the results obtained for the wristband target variable, it can be concluded that the predictive model, based on an assembly strategy, effectively captured the relationship between the selected attributes and the device usage behavior. As shown in Table 4, the model achieved an MAE of 4.10, an RMSE of 7.88, and a coefficient of determination of R² = 0.86 in the test set, reflecting a good predictive ability with an acceptable margin of error. Ultimately, the importance analysis of variables (Table 5) supports the personal characteristics, such as the BMI, along with environmental conditions, such as the average CO₂, and perceptions of comfort, such as ColdTempSat (satisfaction with cold temperature), as the most influential factors in the prediction. The relevance of these variables suggests that physiological attributes, environmental conditions, and subjective thermal perception are key determinants to explain the use of this personal comfort system. Together, these results validate the usefulness of the combined variables selection approach, data augmentation, and automated modeling with H2O AutoML.

3.6. Predictive Model Results for Fan

The residual graph of Figure 8 shows a moderately dispersed distribution of the residuals concerning the fit values, with more significant variability as the predicted values increase. Some outliers indicate notable deviations in the predictions, especially in the middle and high ranges. However, the general trend of the residuals remains close to the zero axis, which suggests that the model does not have marked systematic biases, although it could benefit from improvements at the extremes of the prediction range.

Figure 9 shows the comparison between the actual values and those estimated by the model for the case of the fan, using a logarithmic scale in both axes. The analysis reveals that the model offers higher accuracy in the medium–high range, approximately between 10 and 60 percentage of use, where most points are grouped near the regression line, reflecting a stable match between the observed and the predicted. On the other hand, for values below 10, cases with higher predictions than the actual observations are identified, indicating a tendency towards overestimation in that range. Conversely, at the upper end, above 60, some points are located below the fit line, which suggests certain underestimations. Although limited, these deviations could be related to lower data representativeness at the extremes or external variables not included in the model, such as natural air circulation, specific hours of use, or seasonal particularities. Despite these particular discrepancies, the overall relationship between the predicted and actual values remains consistent, reflecting an adequate ability of the model to capture the main dynamics of device use in real conditions.

Based on the graphical analysis and the results obtained for the fan target variable, it can be concluded that the predictive model, also based on an assembly strategy, achieved a robust performance. According to Table 4, the model obtained an MAE of 8.11, an RMSE of 14.02, and a coefficient of determination of R² = 0.84 in the test set. These metrics indicate a good predictive ability, considering the complexity of the modeled phenomenon. Regarding the importance of the variables (Table 5), the BMI is highlighted again as the most relevant predictor, along with variables linked to thermal perception, WarmTempSat, and environmental conditions, such as the AvOutTemp. The presence of perception variables and personal characteristics reinforces the idea that the use of the fan is closely related to the perceived individual thermal comfort and not only to objective environmental conditions. These findings confirm the usefulness of the combined approach of data augmentation, automated attribute selection, and ensemble models within AutoML for understanding the use of personal comfort systems.

3.7. Predictive Model Results for the Lamp

The residual graph of Figure 10 shows a greater dispersion of the residuals compared to the other target variables, with several outliers, particularly in the intermediate ranges. Although most of the residuals are clustered near the zero axis, the inclination of the trend line and the amplitude of some errors suggest some instability of the model to predict certain cases accurately. This dispersion may be related to the highly asymmetric distribution of the data and the presence of many values close to zero, which makes it difficult to generalize the model in more diverse scenarios.

Figure 11 represents the relationship between the actual values and those estimated by the model for the use of the lamp, using a logarithmic scale in both axes. It is seen that a large part of the data is concentrated at low real values, below 1, which reflects the low frequency of use of the device in the analyzed context. The model exhibits a remarkable dispersion in this range, with predictions ranging from values close to zero to overestimations greater than one unit. On the contrary, in the range between 1 and 20, where the values are grouped with greater representativeness, the alignment with the regression line is clearer, indicating a more coherent proportional relationship between what was observed and what was predicted. However, towards the upper end of the horizontal axis, where the highest actual values are presented, a tendency of the model to slightly underestimate the use is observed, with points below the fitted line. This behavior suggests that the model’s accuracy decreases at the extremes of the distribution, which could be attributed to the small number of records in those ranges and the absence of relevant contextual variables in the training process. Even so, the model manages to capture the general structure of the usage pattern, especially in the middle ranges where the density of observations is higher.

As for the performance metrics, Table 4 shows that the model for the lamp obtained an MAE of 2.93, an RMSE of 7.18, and an R² = 0.52 in the test set, representing the lowest performance among the three target variables. This lower predictive capacity can be attributed, in part, to the low frequency of actual use of the device and the low informational variability between the records, which limits the model’s ability to learn robust patterns. Regarding the importance of the variables (Table 5), it is observed that the most influential attributes were the average CO₂, age, and BMI, followed by thermal perceptions, such as ColdTempSat and the perceived temperature level (Temp). The lower presence of direct or active control environmental variables could partly explain the model’s greater difficulty in capturing accurate usage behaviors. These results highlight the need to include a more balanced set of attributes or explore alternative modeling approaches that explicitly address the skewed nature of the distribution using personal lighting systems. Overall, the results show the effectiveness of the methodological approach adopted to predict the use of personal comfort systems in office environments. The models generated through AutoML, using selected variables in an automated and validated way, achieved a robust performance in the three target variables, with remarkable precision in the cases of the wristband and fan, where coefficients of determination (R²) of 0.86 and 0.84 were reached, respectively. Although the model associated with the lamp showed a more limited performance (R² = 0.52), this is related to the high concentration of low values in the distribution and a reduced wealth of information in the associated variables. Graphical analysis of residuals and comparison between actual and predicted values support these findings, confirming the overall validity of the approach and the areas where there are opportunities for improvement. Likewise, the importance analysis of variables consistently highlights the role of personal (such as BMI and age), environmental (such as CO₂ and outdoor temperature), and subjective perceptions of comfort factors, reaffirming the need to consider multiple objective and subjective dimensions when modeling adaptive behaviors in indoor spaces. These findings lay a solid foundation for developing more intelligent, efficient, and user-centered space climate control management strategies complemented by new environmental comfort models.

4. Discussion

4.1. Contributions of the Proposed Model and Validation of the Hypothesis

The results obtained validate the study’s central hypothesis, demonstrating that it is possible to predict the use of personal comfort systems (PCS) from data collected in real office environments using models generated by AutoML. In particular, assembly strategies allowed a high predictive performance for the wristband and fan devices (R² = 0.86 and 0.84, respectively), confirming this approach’s robustness in contexts characterized by environmental variability and heterogeneity in the subjective perception of comfort. This work appears as a preliminary proposal for the development of user-centered models, which not only seek to improve the accuracy in predicting the use of PCS but also to move towards more flexible environmental conditions in office contexts. Unlike widespread standard climate control, which aims to create a uniform and “neutral” thermal environment, the use of PCS emphasizes personal control to meet diversified individual demands. From the results and other research [9,43,44], a need for predictive control based on user preferences is observed, underlining the importance of personal experience in the design of comfort systems.

4.2. Relevance of Personal and Perceptual Attributes

One of this study’s most relevant findings is the weight that personal and perceptual variables acquire in predicting the use of PCS. For example, the body mass index (BMI), the average CO₂ level, and the thermal satisfaction variables emerged as key attributes in the three analyzed devices. This result aligns with previous research that indicates comfort cannot be explained exclusively by physical parameters but strongly depends on individual factors and the degree of control perceived by users [8,45]. This perspective reinforces the need to incorporate integrative modeling frameworks that combine objective data (environmental, spatial) with subjective and personal information, particularly in applications aimed at work environments, where the perception of comfort directly impacts productivity and well-being. The role of subjective perception concerning satisfaction and comfort in predicting personal comfort systems is key. However, these same answers can change at different times of the day, between seasons, in a working day, after it, on out-of-office days, etc. That is why implementing models that learn from a permanent data collection would make the prediction level even more accurate.

4.3. Differential Performance According to Device Type

Implementing PCS in offices significantly improves thermal comfort, lighting, and IAQ perception [12], with 85.5% of users reporting increased satisfaction when using the three PCS devices tested. This supports the development of comfort models that address individual variability in IEQ. Model performance varied by device type. The wristband and fan showed high predictive accuracy, while the lamp model had a notably lower R² (0.52). This can be attributed to the low frequency and variability of lamp usage, as confirmed by sensor data and interviews (see Table 2 in [12]). Additionally, lighting conditions varied across workstations, and in shared spaces, users often refrained from using lamps to avoid disturbing coworkers or because ambient light was sufficient. From a technical perspective, we tested several advanced models, including Gradient Boosting, Random Forest, and neural networks, in an effort to better capture these complex relationships. While these approaches slightly improved performance, the challenge of modeling lighting behavior remains, particularly due to the influence of subjective perception and contextual social factors that are not easily quantified.

Despite these limitations, the lamp model remains useful as a first approximation, especially in systems that require initial estimates of PCS usage. Future studies could benefit from incorporating desk level light sensors and richer contextual data to further refine predictions related to lighting comfort.

4.4. Practical Implications and Projection Towards Intelligent Environments

Advances in the Internet of Things (IoT) and ubiquitous computing have enabled efficient and cost-effective data collection on microenvironments and individual user preferences [46]. This ability drives the development of new comfort models that predict individual responses and not averages of a large population, thus providing estimates more adapted to the specific needs of each user. If the implementation is technically scalable and minimizes disruption for occupants, using humans as sensors in buildings could transform post-occupancy assessments and the design and management of automated systems and controls. This would open up new opportunities for users to provide feedback in point-in-time applications and the continuous monitoring and management of systems, facilitating automated decision-making based on real preferences, not population standards. From this perspective, this study lays a solid foundation for developing smart platforms that integrate machine learning with low-cost sensor technologies, targeting more adaptive offices with lower energy consumption and greater user satisfaction. This vision coincides with the emerging trends in personalized environmental comfort and automation in smart buildings [26,47].

4.5. Future Lines of Research

This research had some limitations regarding the people who would initially participate, as it was carried out during the pandemic caused by the SARS-CoV-2 coronavirus. Therefore, the size of the initial dataset was limited, and the need for models to work effectively with scarce data is a significant barrier in this and other studies that collect data from real contexts and in use. However, it was possible to increase the database of the results obtained from the fieldwork. Although the results are promising, this work should be considered a first step towards the integral modeling of personalized comfort. Among the future research lines, the following stand out:

Incorporating time series and more granular data to capture daily usage dynamics.
Evaluating the model’s performance in different seasons and different types of buildings (e.g., co-working offices, university residences).
Extending the perceptual dimension, incorporating assessments of visual fatigue, subjective productivity, or cognitive load associated with different comfort levels.
Including new relevant parameters to compare survey responses, such as “chair sensors” showing effective workplace use, or light sensors for each desk.

These projections seek to consolidate a vision of flexible and user-centered comfort, contributing to the creation of more sustainable, healthy, and personalized work environments.

5. Conclusions

This study explored the feasibility of predicting the use of Personal Comfort Systems (PCS) in office environments using models developed with H2O AutoML, using environmental, spatial, physiological, and perceptual data collected in real occupancy conditions. The results obtained validate the potential of automated machine learning strategies to capture complex interactions between user characteristics and the physical and environmental conditions of the indoor environment, allowing accurate and interpretable predictions about the use of PCS devices. This translates into the improvement of people’s comfort and well-being, promoting, at the same time, an efficient and optimized energy use in terms of air conditioning. Among the devices evaluated, the predictive models associated with the thermal wristband and the desk fan achieved the best performance, with R² values of 0.86 and 0.84, respectively, which confirms their high explanatory capacity. In contrast, the model for the lamp showed a lower generalization capacity (R² = 0.52), possibly due to the concentration of use values in low ranges and a lower wealth of explanatory attributes. Despite these differences, the assembly-based modeling strategy proved effective in all three cases, especially when combined with previous variable selection processes. The constant presence of personal variables (such as BMI and age), environmental conditions (for example, the level of CO₂ and the outdoor temperature), and subjective satisfaction indicators (such as perceived comfort with furniture or thermal satisfaction) highlights the multifactorial character of individual comfort. These findings support the hypothesis that comfort cannot be explained solely through physical measurements and must incorporate perceptual and behavioral dimensions. Ultimately, this work provides a preliminary but solid basis for developing flexible user-centered control systems. By allowing the use of PCS to be anticipated, the generated models can inform more responsive and energy-efficient building management strategies, particularly in contexts with high climatic variability or limited infrastructure. Future efforts should focus on incorporating temporal dynamics with real-time control systems, expanding data sources, and validating the transferability of models to promote scalable and equitable solutions to improve indoor environmental quality.

Author Contributions

P.W.-M. conceptualized and designed this study; P.W.-M. conducted the data collection; R.A.-G. and S.E.R.-M. were in charge of the methodology for developing and validating the models; R.S.-C. statistically validated the results; P.W.-M., R.A.-G. and S.E.R.-M. were responsible for writing, reviewing, and editing; R.S.-C. worked on visualization. All authors have read and agreed to the published version of the manuscript.

Funding

The National Research and Development Agency of Chile (ANID) funded this research and P.W.M. through the Fondecyt Initiation Project, grant number 11200667. S.E.R was supported by the National Research and Development Agency of Chile (ANID) through the Fondecyt Initiation Project, grant number 11221231. R.A.G acknowledges the support from the Chilean National Research and Development Agency, grant number EQM220137, and the Universidad del Bío-Bío.

Institutional Review Board Statement

The study was conducted according to the principles of the Declaration of Helsinki and approved by the Institutional Review Board (or Ethics Committee) of Universidad del Bío-Bío.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data are contained within the article.

Acknowledgments

We acknowledge support from FONDECYT Iniciación 11200667 and 11221231.

Conflicts of Interest

The authors declares no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PMV	Predicted mean vote
PPD	Predicted percentage of dissatisfied
IEQ	Indoor environmental quality
PCS	Personal comfort system
AutoML	Automatic machine learning
ML	Machine learning
IAQ	Indoor air quality

Appendix A

Table A1 presents the set of variables considered in the study, organized into five main categories: environmental, spatial, personal, perception of control, and satisfaction with comfort. This classification allowed a more structured understanding of the factors that influence the use of personal comfort systems, incorporating both objective data and subjective perceptions of users in their workplaces. Each variable was selected for its relevance in previous studies and its possible contribution to the predictive capacity of the developed models.

Table A1. Description of the variables used in the study.

Categoría	Tag	Variable	Descripción
Environmental Characteristics	AvCO₂	Average CO₂	Average CO₂ measured in the workplace
	AvIndTemp	Average Indoor Temperature	Average temperature measured in the workplace
	AvOutTem	Average Outdoor Temperature	Average temperature measured outside the building
Spatial Characteristics	Floor	Floor	Floor workstation is on
	OffTyp	Office Type	Private or Shared
	WindDist	Distance to closest window	Next to window, <4 m direct, >4 direct, indirect light
	WindPos	Position to the window	Facing, behind, or beside occupant
	WindOrient	Window orientation	North, South, East, West, Northeast, Southeast
	GroupType	Desk grouping type	Isolated or attached to another
	DeskSpace	Workstation space	Minimum, average, large
	LightRec	Light received	Almost none, little, enough to not turn on artificial lighting
Personal Characteristics	Age	Occupant Age	Age range (18–65+)
	Gen	Occupant Gender	Female, Male, Other, No answer
	BMI	Body Mass Index	Occupant information
	WorkTyp	Type of Work	Individual, customer service, supervision, other
	Perman	Permanence	Percentage of time at workstation
Percepción de Control	Temp	Temperature control	Perceived by the user (Likert 1–7 scale)
	Ventil	Ventilation control	Perceived by the user
	SolarProt	Solar/Sunlight Protection	Perceived by the user
	Light	Lighting control	Perceived by the user
	Noise	Noise control	Perceived by the user
Satisfaction and Comfort	WarmTempSat	Satisfaction with temperature (Warm)	Perceived on warm days (Likert 1–7 scale)
	ColdTempSat	Satisfaction with temperature (Cold)	Perceived on cold days (Likert 1–7 scale)
	NoiseSat	Satisfaction with noise	Perceived by the user
	LightSat	Satisfaction with lighting	Perceived by the user
	AirQualSat	Satisfaction with air quality	Perceived by the user
	PrivSat	Satisfaction with privacy	Perceived by the user
	WindProxSat	Satisfaction with proximity to window	Perceived by the user
	WindViewSat	Satisfaction with view from window	Perceived by the user
	FurnComfSat	Satisfaction with furniture	Perceived by the user

Grouping by variable category.

References

Song, C.; Liu, Y.; Zhou, X.; Wang, D.; Wang, Y.; Liu, J. Identification of local thermal conditions for sleeping comfort improvement in neutral to cold indoor thermal environments. J. Therm. Biol. 2020, 87, 102480. [Google Scholar] [CrossRef]
Hu, J.; He, Y.; Hao, X.; Li, N.; Su, Y.; Qu, H. Optimal temperature ranges considering gender differences in thermal comfort, work performance, and sick building syndrome: A winter field study in university classrooms. Energy Build. 2022, 254, 111554. [Google Scholar] [CrossRef]
Wu, Z.; Wagner, A. Effect of short-term thermal history on thermal comfort and physiological responses: A pilot study. Energy Build. 2023, 298, 113510. [Google Scholar] [CrossRef]
Frontczak, M.; Wargocki, P. Literature survey on how different factors influence human comfort in indoor environments. Build. Environ. 2011, 46, 922–937. [Google Scholar] [CrossRef]
Park, J.Y.; Ouf, M.M.; Gunay, B.; Peng, Y.; O’Brien, W.; Kjærgaard, M.B.; Nagy, Z. A critical review of field implementations of occupant-centric building controls. Build. Environ. 2019, 165, 106351. [Google Scholar] [CrossRef]
Fanger, P.O. Thermal Comfort: Analysis and Applications in Environmental Engineering; McGraw-Hill Inc.: New York, NY, USA, 1970. [Google Scholar]
Lan, L.; Lian, Z. Use of neurobehavioral tests to evaluate the effects of indoor environment quality on productivity. Build. Environ. 2009, 44, 2208–2217. [Google Scholar] [CrossRef]
Kim, J.; Schiavon, S.; Brager, G. Personal comfort models—A new paradigm in thermal comfort for occupant-centric environmental control. Build. Environ. 2018, 132, 114–124. [Google Scholar] [CrossRef]
Zhang, H.; Arens, E.; Zhai, Y. A review of the corrective power of personal comfort systems in non-neutral ambient environments. Build. Environ. 2015, 91, 15–41. [Google Scholar] [CrossRef]
Parkinson, T.; De Dear, R. Thermal pleasure in built environments: Physiology of alliesthesia. Build. Res. Inf. 2015, 43, 288–301. [Google Scholar] [CrossRef]
Exss, K.; Wegertseder-Martínez, P.; Trebilcock, M. A systematic review of Personal Comfort Systems from a post-phenomenological view. Ergonomics 2025, 68, 163–186. [Google Scholar] [CrossRef]
Wegertseder-Martinez, P.; Berges-Alvarez, I.; Piderit-Moreno, B. Evaluation of Synchronous Use of Portable Personal Comfort and Environment Conditioning Systems in Real Office Occupancy Conditions. Buildings 2024, 14, 1820. [Google Scholar] [CrossRef]
Rawal, R.; Schweiker, M.; Kazanci, O.B.; Vardhan, V.; Jin, Q.; Duanmu, L. Personal comfort systems: A review on comfort, energy, and economics. Energy Build. 2020, 214, 109858. [Google Scholar] [CrossRef]
LeDell, E.; Poirier, S. H2O AutoML: Scalable Automatic Machine Learning. In Proceedings of the 7th ICML Workshop on Automated Machine Learning, Online, 18 July 2020. [Google Scholar]
Gijsbers, P.; LeDell, E.; Thomas, J.; Poirier, S.; Bischl, B.; Vanschoren, J. An Open Source AutoML Benchmark. In Proceedings of the 6th ICML Workshop on Automated Machine Learning, Long Beach, CA, USA, 14–15 June 2019. [Google Scholar]
Madni, H.A.; Umer, M.; Ishaq, A.; Abuzinadah, N.; Saidani, O.; Alsubai, S.; Hamdi, M.; Ashraf, I. Water-Quality Prediction Based on H2O AutoML and Explainable AI Techniques. Water 2023, 15, 475. [Google Scholar] [CrossRef]
Salehin, I.; Islam, M.S.; Saha, P.; Noman, S.M.; Tuni, A.; Hasan, M.M.; Baten, M.A. AutoML: A systematic review on automated machine learning with neural architecture search. J. Inf. Intell. 2024, 2, 52–81. [Google Scholar] [CrossRef]
Xue, C.; Hu, M.; Huang, X.; Li, C.G. Automated search space and search strategy selection for AutoML. Pattern Recognit. 2022, 124, 108474. [Google Scholar] [CrossRef]
Omar, I.; Khan, M.; Starr, A.; Abou Rok Ba, K. Automated Prediction of Crack Propagation Using H2O AutoML. Sensors 2023, 23, 8419. [Google Scholar] [CrossRef]
Chennapragada, A.; Periyakoil, D.; Das, H.P.; Spanos, C.J. Time series-based deep learning model for personal thermal comfort prediction. In Proceedings of the Thirteenth ACM International Conference on Future Energy Systems, Virtual Event, 28 June–1 July 2022; pp. 552–555. [Google Scholar]
Markarian, E.; Azar, E. Predicting energy consumption and thermal comfort in buildings using a hybrid machine learning and building performance simulation approach. In Proceedings of the 18th IBPSA Conference, Shanghai, China, 4–6 September 2022. [Google Scholar]
Zhang, W.; Wu, Y.; Calautit, J.K. A review on occupancy prediction through machine learning for enhancing energy efficiency, air quality and thermal comfort in the built environment. Renew. Sustain. Energy Rev. 2022, 167, 112704. [Google Scholar] [CrossRef]
Marmolejo Duarte, C.R. La relevancia de la eficiencia energética entre los atributos arquitectónicos residenciales. Arquitetura Rev. 2021, 17, 90–110. [Google Scholar] [CrossRef]
Goecks, J.; Jalili, V.; Heiser, L.M.; Gray, J.W. How machine learning will transform biomedicine. Cell 2020, 181, 92–101. [Google Scholar] [CrossRef]
Baldominos, A.; Cervantes, A.; Saez, Y.; Isasi, P. A comparison of machine learning and deep learning techniques for activity recognition using mobile devices. Sensors 2019, 19, 521. [Google Scholar] [CrossRef]
Laing, S.; Kühl, N. Comfort-as-a-service: Designing a user-oriented thermal comfort artifact for office buildings. arXiv 2020, arXiv:2004.03323. [Google Scholar]
Wegertseder-Martínez, P. The Need for a Paradigm Shift toward an Occupant-Centered Environmental Control Model. Sustainability 2023, 15, 5980. [Google Scholar] [CrossRef]
Wang, Z.; Warren, K.; Luo, M.; He, X.; Zhang, H.; Arens, E.; Chen, W.; He, Y.; Hu, Y.; Jin, L.; et al. Evaluating the comfort of thermally dynamic wearable devices. Build. Environ. 2020, 167, 106443. [Google Scholar] [CrossRef]
Li, C.; Zhu, H.; Lian, X.; Liu, Y.; Li, X.; Feng, Y. Study of “time-lag” of occupant behavior occurrences for establishing an occupant-centric building control system. Build. Environ. 2022, 216, 109005. [Google Scholar] [CrossRef]
Soleimanijavid, A.; Konstantzos, I.; Liu, X. Challenges and opportunities of occupant-centric building controls in real-world implementation: A critical review. Energy Build. 2024, 308, 113958. [Google Scholar] [CrossRef]
Jain, A.K.; Murty, M.N.; Flynn, P.J. Data clustering: A review. ACM Comput. Surv. (CSUR) 1999, 31, 264–323. [Google Scholar] [CrossRef]
Goodfellow, I. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased boosting with categorical features. Adv. Neural Inf. Process. Syst. 2018, 31, 6639–6649. [Google Scholar]
Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
Hao, X.; Liu, L.; Yang, R.; Yin, L.; Zhang, L.; Li, X. A review of data augmentation methods of remote sensing image target recognition. Remote Sens. 2023, 15, 827. [Google Scholar] [CrossRef]
Farhangfar, A.; Kurgan, L.; Dy, J. Impact of imputation of missing values on classification error for discrete data. Pattern Recognit. 2008, 41, 3692–3705. [Google Scholar] [CrossRef]
Sheskin, D.J. Handbook of Parametric and Nonparametric Statistical Procedures; Chapman and Hall/CRC: Boca Raton, FL, USA, 2003. [Google Scholar]
H2O.ai. H2O.ai AutoML: Scalable Automatic Machine Learning; H2O.ai: Mountain View, CA, USA, 2020. [Google Scholar]
Molnar, C. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable; Leanpub: Victoria, BC, Canada, 2020. [Google Scholar]
Shmueli, G.; Bruce, P.C.; Gedeck, P.; Patel, N.R. Data Mining for Business Analytics: Concepts, Techniques, and Applications in R, 3rd ed.; Wiley: Hoboken, NJ, USA, 2017. [Google Scholar]
H2O.ai. H2O AutoML User Guide; H2O.ai: Mountain View, CA, USA, 2021. [Google Scholar]
Caruana, R.; Niculescu-Mizil, A.; Crew, G.; Ksikes, A. Ensemble selection from libraries of models. In Proceedings of the Twenty-First International Conference on Machine Learning, Banff, AB, Canada, 4–8 July 2004; p. 18. [Google Scholar] [CrossRef]
Zhou, Y.; Su, Y.; Xu, Z.; Wang, X.; Wu, J.; Guan, X. A hybrid physics-based/data-driven model for personalized dynamic thermal comfort in ordinary office environment. Energy Build. 2021, 238, 110790. [Google Scholar] [CrossRef]
Tomat, V.; Ramallo-González, A.P.; Skarmeta Gómez, A.F. A comprehensive survey about thermal comfort under the IoT paradigm: Is crowdsensing the new horizon? Sensors 2020, 20, 4647. [Google Scholar] [CrossRef] [PubMed]
Salamone, F.; Belussi, L.; Currò, C.; Danza, L.; Ghellere, M.; Guazzi, G.; Lenzi, B.; Megale, V.; Meroni, I. Integrated method for personal thermal comfort assessment and optimization through users’ feedback, IoT and machine learning: A case study. Sensors 2018, 18, 1602. [Google Scholar] [CrossRef] [PubMed]
Shetty, S.S.; Hoang, D.C.; Gupta, M.; Panda, S. Learning desk fan usage preferences for personalised thermal comfort in shared offices using tree-based methods. Build. Environ. 2019, 149, 546–560. [Google Scholar] [CrossRef]
Kim, J.; Zhou, Y.; Schiavon, S.; Raftery, P.; Brager, G. Personal comfort models: Predicting individuals’ thermal preference using occupant heating and cooling behavior and machine learning. Build. Environ. 2018, 129, 96–106. [Google Scholar] [CrossRef]

Figure 1. Flow chart of the research design. Source: Developed by the authors.

Figure 2. Devices implemented in the field study. Source: Developed by the authors.

Figure 3. Variable selection criteria.

Figure 4. Predictive modeling methodology for office comfort systems.

Figure 5. Comparison of the original and augmented database distributions for (a) Wristband, (b) Fan, and (c) Lamp.

Figure 6. Residual analysis of the best model for the wristband. The blue line represents linear fit.

Figure 7. Scatter plot between real values and predictions for the wristband.

Figure 8. Residual analysis of the best model for the fan. The blue line represents linear fit.

Figure 9. Scatter plot between real values and predictions for the fan.

Figure 10. Residual analysis of the best model for the lamp. The blue line represents linear fit.

Figure 11. Scatter plot between real values and predictions for the lamp.

Table 1. Comparison of the distributions between the original (O) and the augmented database (A) for the target variables: (a) Wristband, (b) Fan, and (c) Lamp.

Statistics	Wristband (O)	Wristband (A)	Fan (O)	Fan (A)	Lamp (O)	Lamp (A)
N (count)	72	360	72	360	72	360
Mean	24.89	24.27	36.00	36.21	6.42	6.31
Standard deviation (SD)	21.41	21.15	32.40	32.33	10.81	10.65
Minimum (min)	0.00	0.00	0.00	0.00	0.00	0.00
25% percentile	7.75	6.75	9.00	9.00	0.02	0.02
Median (50%)	19.50	19.00	27.00	27.00	1.95	1.59
75% percentile	35.25	35.00	53.50	55.00	7.53	7.06
Maximum (max)	84.00	84.00	100.00	100.00	47.67	47.67
p-value (t-test)	0.998		0.986		0.880

Table 2. Summary of the models for each target variable with all the attribute variables.

Target Variable	Model	MAE		RMSE		R²
		Train	Test	Train	Test	Train	Test
Wristband	Ensemble models	1.53	4.29	2.97	7.45	0.98	0.87
Fan	Ensemble models	3.81	9.69	1.34	3.75	0.95	0.73
Lamp	Gradient Boosting Machine	0.84	2.64	1.34	4.10	0.97	0.51

Table 3. Summary of selected variables for each target variable.

Category	Wristband	Fan	Lamp
Environmental	Av.CO₂	Av.CO₂	Av.CO₂
Characteristics	AvOutTemp	AvOutTemp
Spatial	WindOrient	GroupType
Characteristics	WindDist
Personal	BMI	BMI	BMI
Characteristics		Age	Age
Perception of	Light	Light
Control		Ventil	Ventil
		Temp	Temp
	FurnComfSat	FurnComfSat	FurnComfSat
	ColdTempSat	ColdTempSat	ColdTempSat
Comfort	PrivSat	PrivSat	PrivSat
Satisfaction	WindViewSat	WarmTempSat	WarmTempSat
		LightSat	LightSat
		WindProxSat
		AirQualSat

Table 4. Summary of the models for each target variable with the selected attribute variables.

Target Variable	Optimized Model	MAE		RMSE		R²
		Train	Test	Train	Test	Train	Test
Wristband	Ensemble models	1.60	4.10	3.72	7.88	0.97	0.86
Fan	Ensemble models	3.83	8.11	7.44	14.02	0.94	0.84
Lamp	Ensemble models	0.83	2.93	1.87	7.18	0.97	0.52

Table 5. Importance of attribute variables from highest to lowest.

Wristband	Fan	Lamp
BMI	BMI	Av.CO₂
Av.CO₂	WarmTempSat	Age
ColdTempSat	AvOutTemp	BMI
WindOrient	LightSat	ColdTempSat
WindDist	Av.CO₂	Temp
AvOutTemp	FurnComfSat	FurnComfSat
Light	AirQualSat	PrivSat
FurnComfSat	GroupType	LightSat
WindViewSat	Age	WarmTempSat
PrivSat	Ventil	Ventil

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wegertseder-Martinez, P.; Restrepo-Medina, S.E.; Aedo-García, R.; Soto-Concha, R. Enhancing Office Comfort with Personal Comfort Systems: A Data-Driven Machine Learning Approach. Buildings 2025, 15, 1676. https://doi.org/10.3390/buildings15101676

AMA Style

Wegertseder-Martinez P, Restrepo-Medina SE, Aedo-García R, Soto-Concha R. Enhancing Office Comfort with Personal Comfort Systems: A Data-Driven Machine Learning Approach. Buildings. 2025; 15(10):1676. https://doi.org/10.3390/buildings15101676

Chicago/Turabian Style

Wegertseder-Martinez, Paulina, Silvia E. Restrepo-Medina, Roberto Aedo-García, and Raul Soto-Concha. 2025. "Enhancing Office Comfort with Personal Comfort Systems: A Data-Driven Machine Learning Approach" Buildings 15, no. 10: 1676. https://doi.org/10.3390/buildings15101676

APA Style

Wegertseder-Martinez, P., Restrepo-Medina, S. E., Aedo-García, R., & Soto-Concha, R. (2025). Enhancing Office Comfort with Personal Comfort Systems: A Data-Driven Machine Learning Approach. Buildings, 15(10), 1676. https://doi.org/10.3390/buildings15101676

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Enhancing Office Comfort with Personal Comfort Systems: A Data-Driven Machine Learning Approach

Abstract

1. Introduction

2. Materials and Methods

2.1. Generation of Synthetic Data

2.2. H2O Automated Machine Learning

Operation of the H2O AutoML Module

2.3. General Methodological Flow

3. Results

3.1. Evaluation of Database Expansion

3.2. Performance Assessment of the Initial Models

3.3. Selected Variables and Optimization of the Model

3.4. Metrics of Optimized Models

3.5. Predictive Model Results for the Wristband

3.6. Predictive Model Results for Fan

3.7. Predictive Model Results for the Lamp

4. Discussion

4.1. Contributions of the Proposed Model and Validation of the Hypothesis

4.2. Relevance of Personal and Perceptual Attributes

4.3. Differential Performance According to Device Type

4.4. Practical Implications and Projection Towards Intelligent Environments

4.5. Future Lines of Research

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI