Thermal Comfort-Based Personalized Models with Non-Intrusive Sensing Technique in O ffi ce Buildings

Heating, ventilation and air-conditioning (HVAC) systems play a key role in shaping the built environment. However, centralized HVAC systems cannot guarantee the provision of a comfortable thermal environment for everyone. Therefore, a personalized HVAC system that aims to adapt thermal preferences has drawn much more attention. Meanwhile, occupant-related factors like skin temperature have not had standardized measurement methods. Therefore, this paper proposes to use infrared thermography to develop individual thermal models to predict thermal sensations using three different feature sets with the random forest (RF) and support vector machine (SVM). The results have shown the correlation coefficients between clothing surface temperature and thermal sensation are 11% and 3% higher than those between skin temperature and thermal sensation of two subjects, respectively. With cross-validation, SVM with a linear kernel and penalty number of 1, as well as RF with 50 trees and the maximum tree depth of 3 were selected as the model configurations. As a result, the model trained with the feature set, consisting of indoor air temperature, relative humidity, skin temperature and clothing surface temperature, and with linear kernel SVM has achieved 100% recall score on test data of female subjects and 95% recall score on that of male subjects.


Introduction
Most people spend 90% of their time indoors [1] since buildings can provide satisfactory environments for human beings.Indoor environment quality (IEQ) is always used to evaluate the quality of the built environments created with different building systems such as acoustic quality, air quality, spatial quality, visual quality and thermal quality.As one of the key building systems, Heating, ventilation and air-conditioning (HVAC) systems affect all of these qualities, particularly indoor air quality (IAQ) and indoor thermal quality in commercial buildings such as open-plan office buildings.Moreover, open-plan offices have become a trend nowadays since they increase communication between employees and reduce construction cost.However, open-plan office buildings are also faced with problems like unsatisfactory shared indoor temperature and humidity due to different occupants' thermal preferences.Therefore, many researchers have developed the advanced technologies for the open-plan workplace so as to improve individual thermal comfort.

Background of Thermal Models for HVAC System of Office Buildings
As defined by ANSI/ASHRAE 55: Thermal Environmental Conditions for Human Occupancy [2] and ISO7730: Ergonomics of the thermal environment-Analytical determination and interpretation of thermal comfort using calculation of the PMV and PPD indices and local thermal comfort criteria [3], thermal comfort is the condition of mind that expresses satisfaction with the thermal environment and is assessed by subjective evaluation.Therefore, thermal comfort is the combined result of physical environment and psychological activities.In the course of thermal comfort theory, static thermal comfort and adaptive thermal comfort have become two main categories.In terms of static thermal comfort, predicted mean vote (PMV) developed by Fanger [4] is the most widely accepted thermal model where seven variables are accepted to be the indicators of thermal comfort, which are indoor air temperature, indoor relative humidity, indoor air velocity, mean radiant temperature, clothing insulation, metabolic rate and the external work (normally 0).However, PMV was derived from a strictly controlled climate chamber where the thermal environment is static and is different from the real office environment.Moreover, since PMV predicts the average vote of a large group of people based on the seven-point thermal sensation scale, instead of individual thermal comfort, it only describes the overall thermal sensation of multiple occupants in a shared thermal environment.
In order to overcome the disadvantages of static thermal comfort, adaptive thermal comfort aims to provide insights in increasing opportunities for personal and responsive control, thermal comfort enhancement, energy consumption reduction and climatically responsive and environmentally responsible building design [5].One of the milestone projects of adaptive thermal comfort is the ASHRAE RP-884, which collected a total of 22,000 sets of data from the real office environments across the world.This project has been widely used to investigate various adaptive thermal models, which have been integrated into personalized HVAC controls.With RP884 dataset, Lee et al. [6] proposed a method for learning personalized thermal preference profiles by formulating a combined classification and inference problem with 5-cluster models.Moreover, the thermal preference of a new user is inferred by mixture of sub-models for each cluster where clusters are used to group occupants with similar thermal preferences.However, instead of predicting 7-point thermal sensation described in ASHRAE 55, the paper predicts thermal preferences with three classes, namely "want cooler", "want warmer" and "no change".Auffenberg et al. [7] proposed a personalized thermal model using a Bayesian network to learn individual thermal comfort by predicting thermal sensation of occupants within a specific area, such as San Francisco with ASHRAE RP-884 dataset.However, the results were not promising that the highest accuracy of the model was only 30%.In addition to ASHRAE RP884 dataset, other data-driven studies have also been conducted for the development of adaptive thermal model comfort model.For instance, a unified framework was presented by Kim et al. [8] for personal comfort models and challenges were discussed for the application of personal comfort models in the real world.In the framework, instead of survey, the feedback regarding thermal preferences was collected with the interactions between personal a comfort system (PCS) and occupants.Moreover, machine learning algorithms were proposed for the development of personal comfort models.
Thermoregulation is a dynamic equilibrium with the environment where body regulates its internal core temperature in terms of heat generation and heat exchange with the environment [9].In detail, if the ambient temperature is below skin temperature, heat is dissipated from the body by radiation and convection.On the other hand, if the skin temperature is lower than ambient temperature, heat will be absorbed by the body so that the skin temperature increases [10].Even if heat exchange also occurs due to the metabolic process of the human body, since occupants usually sit or walk without too much metabolic heat production in office buildings, whether the individual is thermally comfortable or not is more related to skin temperature.
Motivated by thermoregulation theory, many researchers have proposed thermal models with the mean skin temperature or the most representative local skin temperature.Liu et al. [11] used the mean skin temperature of 10 measurement locations, including forehead, chest, upper arm, back, abdomen, elbow, hand, thigh, calf and foot to evaluate thermal comfort for a person in a sleeping posture under steady thermal environment.The results turned out that the mean skin temperature at the three different thermal comfort levels was statistically significant.In terms of local skin temperature, Choi [12] tested skin temperatures of 10 body locations to select the best location for thermal comfort inference, including forehead, upper arm, belly, wrist, hand, thigh, calf and foot.The results turned out that wrist was the most responsive area to infer thermal comfort with the lowest p-value.Moreover, Sim et al. [13] established multiple linear regression models to estimate thermal sensations with variables based on skin temperature.As a result, the models using temperatures of the fingertips and wrist showed the highest accuracy.Li et al. [14] also proposed multiple linear regression models to estimate thermal sensation under various active states using wrist skin temperature-related variables.The study indicated skin temperature, its time differential and heart rate could be used for estimating individual thermal sensation.Besides regression, thermal sensation prediction can also be seen as a classification problem where various classification algorithms can be implemented.
In order to investigate other statistical methods to establish thermal models besides regression, a recent research on thermal comfort inference utilized the wearable device of a pair of eyeglass with infrared (IR) arrays, and demonstrated that 82.8% of prediction accuracy for detection uncomfortable conditions was obtained with hidden Markov model (HMM) [15].Moreover, Huang et al. [16] used wearable devices like wearable fitness trackers to measure physical movements, sweat level and skin temperature so as to infer individual thermal comfort with machine learning algorithms including random forest (RF) and support vector machine (SVM).In addition, Dai et al. [17] implemented SVM to predict thermal demands using skin temperature collected from various locations with wearable sensors.As a result, SVM classifiers with linear kernel were preferred to Gaussian kernel, which achieved over 90% accuracy.Based on the literature review, classification models have performed so well as regression models.However, instead of a 7-point thermal sensation scale, most of those models only predict thermal sensations with three classes, which are uncomfortably warm, comfortable and uncomfortably cool.

Background of Sensing Technique for Developing Thermal Models in Office Buildings
Besides different methods of thermal models, the development of an advanced sensing technique has also promoted the development of adaptive thermal comfort.The current sensing technique used for thermal comfort inference can be mainly divided into two categories-wearable sensing devices and contactless sensing devices.For wearable devices, wrist band has been the most popular one [11][12][13][14][15][16][17][18][19][20].Moreover, Ghahramani et al. proposed a wearable infrared eyeglass frame to measure skin temperature at different points of the skin, including nose, front face, back of ear and cheek bone so as to infer thermal preferences [18].
Even if wearable devices can directly measure variables like skin temperature to indicate thermal comfort, the major disadvantage is intrusiveness.Therefore, contactless measurement methods have drawn much more attention nowadays, especially infrared (IR) thermography.IR thermography utilizes the emitted infrared radiation to measure surface temperature of an object.In order to measure surface temperature with IR thermography correctly, emissivity of the object has to be known since most of actual objects are grey bodies.Nowadays, IR thermography has been widespread and applied into various functions [21], such as building diagnostics [22], fault detection as well as thermal comfort analysis [23][24][25][26].Ranjan and Scott [24] have used IR cameras to dynamically detect and predict thermal comfort.They classified thermal preferences based on skin temperature of forehead, cheek, jaws, upper neck, lower neck, palm core, palm and back of hand, and found that the face outperformed other body regions.Han et al. [25] utilized infrared imaging to measure skin temperature and control the indoor environment with self-learning algorithms.As a result, 98% of the occupants' feedback demonstrated the control system was able to achieve satisfactory thermal environments.Additionally, an empirical study utilized an infrared (IR) sensor called Lepton to estimate occupant thermal comfort level by measuring skin temperature measured from different face regions.The results have shown that ears, nose and cheeks are most indicative to thermal comfort [26].
To summarize, contactless sensing technique can solve the issue of intrusiveness while maintain high performance of thermal comfort inference.Moreover, the adaptive thermal comfort is more realistic to the actual indoor environment than static thermal comfort and the performance can be enhanced with machine learning algorithms.Therefore, developing adaptive thermal models with contactless sensing techniques have become an interesting research area recently.

Research Contributions
Based on a literature review of thermal comfort research, it can be concluded that the recent studies related to thermal comfort have been focused on the development of different sensing techniques to build individual thermal models and apply such models into occupant-responsive HVAC system [12][13][14][15][16][17][18][19][20][24][25][26].Compared to sensing approaches like wearable devices, contactless vision-based occupancy sensors such as IR camera are much less intrusive and have drawn much more attention than before [24][25][26].However, most recent studies using IR camera for thermal comfort use regression analysis with only skin temperature, which may not have high performance when predicting occupant thermal comfort.Therefore, taking both environment-related and occupant-related factors into account, this research aims to develop adaptive thermal models with three different feature sets trained with RF and SVM.In order to collect occupant data, a six day experimental study was conducted in an open-plan office in Shanghai with the non-intrusive sensing system consisting of indoor air temperature and relative humidity sensor and an IR camera.The study contributes to the field of the personalized thermal comfort by proposing new feature sets and prediction algorithms to develop adaptive thermal models, which could further be integrated into personalized system control as well as task ambient conditioning system in open-plan offices.The rest of the paper consists of methodology, including the experimental setup and the development of thermal models, result analysis and discussions about the prediction performances as well as the conclusion and future work.

Methodology
In this study, the development of the thermal models can be divided into two parts.One is the field study to collect data for the development of thermal models with the proposed algorithms and the other is the real-time thermal model predictions, as shown in Figure 1.The data collection was implemented in C++ with ESP8266 while the models were developed in Scikit learn with Python [27].

Experimental Setup
Since this study aims to develop individual thermal models which capture the fine-tuned differences between different people, instead of developing a generic thermal model based on large group of people, it makes more sense to develop individual thermal models.Moreover, for machine learning algorithms, the performance will increase with a greater amount of data.However, even if

Experimental Setup
Since this study aims to develop individual thermal models which capture the fine-tuned differences between different people, instead of developing a generic thermal model based on large group of people, it makes more sense to develop individual thermal models.Moreover, for machine learning algorithms, the performance will increase with a greater amount of data.However, even if the common datasets like RP884 collected data from large group of participants, only less than 10 instances were collected for each individual.Therefore, unlike RP884, in this study, individual thermal models were developed based on a six-day experimental study to collect certain amount of data with two healthy subjects in their mid-twenties (one male and one female) between 3/11/2018 and 3/16/2018 in an office building in Tongji University, Shanghai.
Based on the design of non-intrusive sensing systems and a literature review of principle factors related to thermal comfort, the measured variables are shown in the following Table 1.The interval of data collection is five minutes, which corresponds with the survey responses.The experiment used the IR camera to measure the side face skin temperature and clothing surface temperature by manually selecting the face area and clothing area in the high-resolution thermal images, and then calculating the average surface temperature within those areas in FLIR tools, respectively.Figure 2 shows the participants' status in the experimental study and the thermographic photos, respectively.Before each session, the participants' clothes were reported, and heart rates were measured by a free mobile app called Instant Heart Rate so as to ensure the participants have similar thermal status in the beginning.Lastly, clothing insulation was manually estimated based on ASHRAE 55 and was used for calculating PMV values.In order to get the accurate skin temperature, the emissivity was set to be 0.98.Even if neutral sensation doesn't necessarily mean thermal comfort, it is a precondition of comfort [28].Therefore, the statement that an occupant feels thermally comfortable when they feel the thermal environment is neutral was utilized in this study, and a 7-point thermal sensation scale as a classification label was used, as shown in Table 2.In order to create different thermal environment conditions, an overhead air conditioner and four convective heaters were controlled in transient conditions where the air temperature increases no more than 0.2 • C/min on average, as shown Figure 3.Moreover, Figure 4 shows the floorplan of the testbed where two subjects were seated in two separate desks and the camera has two locations to take photos of each subject, respectively.To avoid discomfort due to high air velocity and asymmetric radiation, the participants were asked to adjust the locations of their working areas if they could feel a draft or if they could feel the sunlight before measurement.In addition, there were a total of 14 different sessions and each session lasted for at least two hours.Table 3 shows the temperature changes during each session.Since the participants attended the study for all sessions, all levels of thermal sensations were reported at least once by participants even if the majority votes belonged to neutral sensation.To avoid discomfort due to high air velocity and asymmetric radiation, the participants were asked to adjust the locations of their working areas if they could feel a draft or if they could feel the sunlight before measurement.In addition, there were a total of 14 different sessions and each session lasted for at least two hours.Table 3 shows the temperature changes during each session.Since the participants attended the study for all sessions, all levels of thermal sensations were reported at least once by participants even if the majority votes belonged to neutral sensation.To avoid discomfort due to high air velocity and asymmetric radiation, the participants were asked to adjust the locations of their working areas if they could feel a draft or if they could feel the sunlight before measurement.In addition, there were a total of 14 different sessions and each session lasted for at least two hours.Table 3 shows the temperature changes during each session.Since the participants attended the study for all sessions, all levels of thermal sensations were reported at least once by participants even if the majority votes belonged to neutral sensation.

Development of Thermal Models
The development of thermal models follows the typical machine learning pipeline, as shown in Figure 5.In this study, three different feature sets were used for developing the thermal models.The base feature set consists of indoor air temperature, indoor relative humidity and side face skin temperature.Since clothing plays an important role in thermal comfort, clothing surface temperature was selected as additional feature to construct feature set A. Moreover, since thermal comfort is also affected by temperature gradient, particularly in the transient conditions, side face skin temperature difference between two consecutive measurements was also selected as an additional feature to construct feature set B.

Development of Thermal Models
The development of thermal models follows the typical machine learning pipeline, as shown in Figure 5.In this study, three different feature sets were used for developing the thermal models.The base feature set consists of indoor air temperature, indoor relative humidity and side face skin temperature.Since clothing plays an important role in thermal comfort, clothing surface temperature was selected as additional feature to construct feature set A. Moreover, since thermal comfort is also affected by temperature gradient, particularly in the transient conditions, side face skin temperature difference between two consecutive measurements was also selected as an additional feature to construct feature set B.
As for the baseline, PMV was used to compare with the two classification models.However, several assumptions were set when calculating PMV.Firstly, since the mean radiant temperature (MRT) and indoor air velocity were not measured and the experimental environment avoided direct radiation, MRT was approximated with indoor air temperature when calculating PMV and the indoor air velocity was calculated as 0.1 m/s since there was no draft risk.In addition, since the subjects were seated during the whole study, the metabolic rate was calculated as 1 met and there was no external work.The whole dataset was split into two parts where 80% of data is the training set and 20% is the testing data.In addition, before training the best model, the hyper-parameters used in the best models were selected with 5-fold cross-validation, as shown in Table 4. Cross-validation is one of the common methods of model selection, which aims to find the model that gives the best prediction.The models fit on the training set is used to predict the test set.Usually, many such splits are used, and the results are averaged over splits [29].In terms of classification algorithms, rather than neural network, SVM and RF were implemented for classification since they can perform well with a relatively small-scale dataset.As for the baseline, PMV was used to compare with the two classification models.However, several assumptions were set when calculating PMV.Firstly, since the mean radiant temperature (MRT) and indoor air velocity were not measured and the experimental environment avoided direct radiation, Appl.Sci.2019, 9, 1768 8 of 16 MRT was approximated with indoor air temperature when calculating PMV and the indoor air velocity was calculated as 0.1 m/s since there was no draft risk.In addition, since the subjects were seated during the whole study, the metabolic rate was calculated as 1 met and there was no external work.
The whole dataset was split into two parts where 80% of data is the training set and 20% is the testing data.In addition, before training the best model, the hyper-parameters used in the best models were selected with 5-fold cross-validation, as shown in Table 4. Cross-validation is one of the common methods of model selection, which aims to find the model that gives the best prediction.The models fit on the training set is used to predict the test set.Usually, many such splits are used, and the results are averaged over splits [29].In terms of classification algorithms, rather than neural network, SVM and RF were implemented for classification since they can perform well with a relatively small-scale dataset.SVM can be used to develop supervised classification models with high-dimensional and non-linear data.Since the data is unlikely to be separated linearly, with SVM classification, soft margin usually performs better than hard margin [30].Generally, penalty parameter of C is used to control how soft of the margin can be selected with cross-validation.The Equations ( 1)-( 3) show the optimization framework in SVM.Moreover, compared to quadratic programming to solve the optimization problem, kernelized SVM can be computed much more efficiently.The common kernel functions include linear kernel, polynomial kernel and Gaussian kernel.
Inputs : S = (x 1 , y 1 ), (x 2 , y 2 ) . . ., (x n , y n )}, where x i ∈ X train and y i ∈ Y train Objective: Subject to: where w is a weight vector, C is a penalty parameter controlling how much you want to avoid misclassifying each training example and ξ i is a slack variable indicating if the sample is misclassified.With larger C, the optimization will select smaller margin of the hyper plane while with smaller C, the optimization will select larger margin of the hyperplane.Unlike SVM, RF is an ensemble learning method that averages the accuracy of a number of decision trees constructed with bagging algorithm at training time.However, random forest adds additional randomness to the model in that it looks for the most important feature among a random subset of features, a random subset of the whole dataset instead of all features, and the whole dataset while splitting the node.Compared to building deep decision trees to train high-dimensional data, owing to randomness, RF builds several smaller trees, thus suffering less from overfitting [31].As a result, it generally can develop better model than decision trees.

Assessment of Thermal Models
The prediction results of SVM and RF were compared with PMV model and all the thermal models were evaluated based on recall, precision and F1 scores [32], which were calculated with the confusion relationships based on the data collected with the two subjects.As a result, for female subject, the correlation coefficient between skin temperature and thermal sensation is 0.747 while that between clothing surface temperature and thermal sensation is 0.768.In addition, for male subject, the correlation coefficient between skin temperature and thermal sensation is 0.688 while that between clothing surface temperature and thermal sensation is 0.763.Therefore, clothing surface temperature has a higher correlation with thermal sensation than skin temperature for both subjects.

Results of Thermal Models with Classification Algorithms
Based on a 5-fold cross-validation with both female and male data, Table 7 shows the recall scores with cross-validation based on different parameter values of SVM and RF, respectively.As shown in the table, in terms of SVM model configuration, SVM with linear kernel has much better performance than Gaussian kernel with feature set A while there are few differences among all SVM configurations with the base feature set and feature set B for both subjects.Meanwhile, in terms of RF model configuration, there are few differences among all combinations of RF configurations and features sets.Since a larger penalty number will result in a slower SVM model and larger tree depth, and the number of trees will also result in a slower RF model, SVM with a linear kernel and penalty number of 1 were selected as the best model configuration SVM, and RF with maximum depth of 3 and 50 trees were selected as RF best model configuration.Tables 8 and 9 show the performance benchmark among the thermal models of the two subjects with the three feature sets trained by SVM, RF and PMV, respectively.As a result, for the female subject, precision, recall and F1 scores of PMV on test data are 48.6%,48.6% and 48.3%, respectively.Compared to the baseline of PMV, SVM and RF have better performances with all three different feature sets.Among the three feature sets, feature set A comprised of indoor air temperature, relative humidity, side face skin temperature and clothing surface temperature has 100% recall, precision and F1 score with SVM, which significantly outperforms the baseline.This may be because the effect of clothing insulation on thermal comfort has been taken into account since the dynamic clothing insulation can be estimated with clothing surface temperature and skin temperature [30], and the causal effect analysis in Section 3.1 also illustrates the clothing surface temperature has a higher correlation with thermal sensations than the skin temperature.In addition, SVM performs better than RF with all three feature sets.For the male subject, precision, recall and F1 score of PMV on test data is only 35%, 33% and 31.8%,respectively.Moreover, even if outperforming the baseline with the recall score, both SVM and RF have much worse performances in predicting thermal sensation of the male subject with the base feature set and the feature set B than those of female subject.This indicates individual differences with respect to thermal comfort such as gender, thus the personalized heating and cooling system being important to develop.However, for the feature set A, SVM still achieves high precision, recall and f1 scores which indicates that SVM can be the best classifier with the feature set consisting of indoor air temperature, relative humidity, skin temperature and clothing surface temperature for both the female and male subject.Therefore, such a combination is recommended to be the thermal model for real-time prediction for automatic personalized heating and cooling system in open-plan offices.
Moreover, Figures 7 and 8 show the normalized confusion matrices on test data with the three feature sets for the two subjects by using SVM and RF, respectively.The normalization equation is shown below: where x norm_ij is the normalized value of cell x ij , which is the element in the ith row and jth column.In addition, m is the number of the column of the confusion matrix.
subject, same as female subject, the test data with neutral label has the highest accuracy for each feature set.Moreover, as shown in Table 5, the number of uncomfortable sensations is much smaller than that of neutral sensation.Therefore, it is more difficult to predict uncomfortable thermal sensations, including cold/hot (−2/2) and very cold/very hot (−3/3) than neutral sensation.However, as shown in the figures, SVM performs better in predicting uncomfortable thermal sensations than RF for both subjects.

Discussions
With profound studies on the relationship between building energy performance and human dimensions, i.e., that occupant behaviors actually play a significant role in energy consumption in the buildings [33], the occupant-responsive HVAC system has been seen as future trend for developing intelligent building systems due to the outstanding performance of energy savings and occupant comfort improvements.Moreover, compared to theoretical physical models, the datadriven approaches to studying building system and thermal comfort have emerged to mitigate the As to the female subject, the test data with neutral label has the highest accuracy for each feature set.With both SVM and RF, for base feature set, the test data with cold (−2) label has the lowest accuracy while for feature set B, the test data with cool (−1) label has the lowest accuracy.As to male subject, same as female subject, the test data with neutral label has the highest accuracy for each feature set.Moreover, as shown in Table 5, the number of uncomfortable sensations is much smaller than that of neutral sensation.Therefore, it is more difficult to predict uncomfortable thermal sensations, including cold/hot (−2/2) and very cold/very hot (−3/3) than neutral sensation.However, as shown in the figures, SVM performs better in predicting uncomfortable thermal sensations than RF for both subjects.

Discussions
With profound studies on the relationship between building energy performance and human dimensions, i.e., that occupant behaviors actually play a significant role in energy consumption in the buildings [33], the occupant-responsive HVAC system has been seen as future trend for developing intelligent building systems due to the outstanding performance of energy savings and occupant comfort improvements.Moreover, compared to theoretical physical models, the data-driven approaches to studying building system and thermal comfort have emerged to mitigate the difficulty in developing complex models and still maintain high accuracy [8,[11][12][13][14][15][16][17][18][19][20][23][24][25][26]34].Moreover, compared to wearable sensing techniques, a non-intrusive sensing technique has drawn more attention for occupant-related factors related to thermal comfort.Therefore, this paper proposes to use the non-intrusive infrared camera in combination with temperature and humidity sensors to infer individual thermal comfort in real-time with two classification algorithms of SVM and RF.
Compared to existing methods, besides skin temperature, the proposed method also takes clothing surface temperature and skin temperature difference into consideration.Moreover, unlike ASHRAE RP884 where only a few instances were collected for each individual, the proposed experimental study collected over 300 instances for each individual so that the amount of data is enough for those classification algorithms.Additionally, besides using machine learning algorithms for thermal comfort inference like existing literature, this study also conducted a causal analysis with correlation coefficient to interpret the machine learning classification results.The results have shown the model with the feature set consisting of indoor air temperature, indoor relative humidity, side face skin temperature and clothing surface temperature perform best for both subjects.This can be explained by the fact that the correlation coefficient between clothing surface temperature and thermal sensation is higher than that between skin temperature and thermal sensation.Additionally, the model performance with respect to the female subject is much better than that of the male subject with the base feature set and the feature set B, which reveals individual differences and indicates it is necessary to update the individual thermal models with more data in real-time when adapting new users.
Moreover, besides office buildings, the proposed non-intrusive sensing system could also be applied to the air-conditioning system control during the night.During the period when occupants are asleep, whether the temperature set-point of the air-conditioning system is comfortable or not will affect sleep quality [35,36].However, since it is unlikely for occupants to manually adjust set-points of the system during sleep, the non-intrusive sensing system could measure thermal environment and skin temperature without interruptions.In addition, the sensing system could also be applied to vehicle cabins [37].Due to the much smaller size of the inner space, the non-intrusive sensing system could be embedded into the HVAC system where it does not affect the driver's views.However, since the current infrared camera is expensive to use, it may not be ideal for the real application in terms of thermal comfort.Therefore, more cost-effective sensing technique shall be investigated.AMG8833 is an 8 × 8 infrared thermal array used for occupancy detection where each pixel measures a surface temperature.Under normal operating temperature and the measured surface temperature, a calibrated AMG8833 has the resolution of 0.1 • C and the accuracy of +/−0.5 • C [38].Since AMG8833 is not only much more cost-effective than the infrared camera but also more easily integrated with temperature and humidity sensor such as DHT22, it has great potential to be used as a non-intrusive sensing system for the personalized HVAC system in different scenarios such as open-plan office buildings, residential buildings and vehicles.

Conclusions
This paper proposes a non-intrusive sensing technique, which consists of temperature, humidity sensor called DHT22, and an infrared camera named FLIR B8400.In order to improve prediction performance of thermal comfort, individual thermal models for female subject and male subject were developed with SVM and RF based on a six day experimental study conducted in an open-plan office in Shanghai.
A total of three different feature sets were selected to develop individual thermal models with the labels based on 7-point thermal sensation scale.As a result, all proposed feature sets have achieved much better performances than the baseline of the PMV model.In addition, the model trained with the feature set consisting of indoor air temperature, indoor relative humidity, side face skin temperature and clothing surface temperature, and with linear kernel SVM has achieved the best performance.The precision, recall and f1 scores of the best model was 100% on test data of female subjects and 97.5%, 96.1% and 95% on that of male subjects, respectively.Besides classification, the causal analysis has also shown that clothing surface temperature has a higher correlation coefficient with thermal sensations than side face skin temperature for both subjects, which explains why the individual thermal models with feature sets containing clothing surface temperature have the best performance.Last but not least, the individual difference in thermal comfort indicates the importance of personalized heating and cooling systems.
Moreover, the proposed sensing system could be improved further by using smaller-sized infrared thermal array such as AMG8833 to realize large-scale real deployment in open-plan office buildings.In future, the personalized cooling system could also be applied to various scenarios besides open-plan offices, such as sleep mode and driving mode to improve thermally comfortable indoor environments and energy performance.

16 Figure 1 .
Figure 1.The over diagram of the proposed methodology.

Figure 1 .
Figure 1.The over diagram of the proposed methodology.

Figure 2 .
Figure 2. The RGB and IR images of the two subjects.

Figure 2 .
Figure 2. The RGB and IR images of the two subjects.

Figure 4 .
Figure 4. Floorplan of the test bed.

Figure 4 .
Figure 4. Floorplan of the test bed.

Figure 4 .
Figure 4. Floorplan of the test bed.

Figure 5 .
Figure 5.The development pipeline of thermal models.

Figure 5 .
Figure 5.The development pipeline of thermal models.

Figure 6 .
Figure 6.Relationships between thermal sensations and (a) skin temperature of female subject (b) clothing surface temperature of female subject (c) skin temperature of male subject (d) clothing surface temperature of male subject.

Figure 6 .
Figure 6.Relationships between thermal sensations and (a) skin temperature of female subject (b) clothing surface temperature of female subject (c) skin temperature of male subject (d) clothing surface temperature of male subject.

Figure 7 .
Figure 7. Normalized confusion matrix of the female subject.(a) Base Feature set with SVM, (b) Feature set A with SVM, (c) Feature set B with SVM, (d) Base Feature set with RF, (e) Feature set A with RF, (f) Feature set B with RF.

Figure 7 .
Figure 7. Normalized confusion matrix of the female subject.(a) Base Feature set with SVM, (b) Feature set A with SVM, (c) Feature set B with SVM, (d) Base Feature set with RF, (e) Feature set A with RF, (f) Feature set B with RF.Appl.Sci.2019, 9, x 13 of 16

Figure 8 .
Figure 8. Normalized confusion matrix of the male subject.(a) Base Feature set with SVM, (b) Feature set A with SVM, (c) Feature set B with SVM, (d) Base Feature set with RF, (e) Feature set A with RF, (f) Feature set B with RF.

Figure 8 .
Figure 8. Normalized confusion matrix of the male subject.(a) Base Feature set with SVM, (b) Feature set A with SVM, (c) Feature set B with SVM, (d) Base Feature set with RF, (e) Feature set A with RF, (f) Feature set B with RF.

Table 1 .
Measured variables and sensor tools.

Table 1 .
Measured variables and sensor tools.

Table 3 .
Temperature changes during each session.

Table 3 .
Temperature changes during each session.

Table 3 .
Temperature changes during each session.

Table 4 .
The parameters of SVM and RF to be tuned with cross-validation.

Table 7 .
Best parameters selected with cross-validation.

Table 8 .
Performance benchmark among the thermal models of the female subject.

Table 9 .
Performance benchmark among the thermal models of the male subject