Model-Free HVAC Control in Buildings: A Review
Abstract
:1. Introduction
1.1. Literature Analysis Approach
- Criteria for articles: The related articles were selected based on the following themes: reinforcement learning HVAC control in buildings; neural network HVAC control in buildings; fuzzy logic HVAC control in buildings; and hybrid HVAC control in buildings.
- Keyword selection: Relevant terms related to our subject were examined in the recent literature. Search phrases included building HVAC reinforcement learning control; building HVAC deep reinforcement learning control; building HVAC artificial neural network control; building HVAC fuzzy logic control; building HVAC hybrid control; and building HVAC model-free control. These terms were chosen considering the unique challenges and facets of HVAC systems in buildings.
- Article selection: Our literature search, primarily in Google Scholar and Scopus, led us to numerous articles. After a quick scan of their abstracts, we selected the most relevant ones for our detailed review.
- Data collection: We categorized the information in each article, focusing on the method used for HVAC control and the application’s context by considering various factors like its benefits, limitations, and practical implications, especially regarding ideal HVAC control scenarios.
- Quality assessment: Each article selected was assessed for quality based on numerous criteria. These criteria included the number of citations of the paper, the scientific contributions of the authors, and the methodologies employed in the research. This helped gauge each article’s relevance and impact.
- Data analysis: Our findings were organized into clear categories, allowing for easy comparison and understanding.
1.2. Previous Literature Works
1.3. Novelty and Contributions
1.4. Paper Structure
2. General Description of HVAC Systems
HVAC Operations and Types
- Cooling Operation: The cooling operation of an HVAC system starts with the compressor, where the refrigerant is pressurized and heated, converting it into a high-pressure, high-temperature gas. This gas then flows through the condenser coils, typically located outside the building. As outdoor air is blown over these coils by a fan, the heat from the refrigerant dissipates into the environment, causing the refrigerant to condense into a high-pressure liquid. This liquid then passes through the expansion valve, where its pressure drops suddenly, leading to a significant decrease in temperature.
- Heating Operation: The cold refrigerant flows into the evaporator coil situated inside the building. As indoor air is circulated over these coils by another fan, the refrigerant absorbs the heat from the air, thereby cooling it. The refrigerant, now warmed, returns to the compressor, and the cycle repeats. On the other hand, the heating operation essentially reverses this process. The system extracts heat from the outdoor air even when it is cold, amplifies it using the compressor, and then transfers this heat indoors through the evaporator coil, thereby warming the interior space.
- Air-Conditioners (A/C): These are designed to cool the air in a space and include central air-conditioners, window units, or split systems. The control challenge involves precise temperature regulation while optimizing energy consumption, especially for central systems that need to account for the entire building’s thermal dynamics.
- Heat Pumps: These pumps provide both heating and cooling by transferring heat energy from one place to another and include types like air-source, ground-source, and water-source pumps. The control challenge usually concerns the optimization of heat transfer, especially during transitional seasons when temperature differences are minimal.
- Air-Handling Units (AHUs): These units condition and circulate air as part of an HVAC system and consist of components like blowers, heating or cooling elements, and filters. The control challenge lies in the coordination of these components to ensure optimal air circulation and conditioning while minimizing energy use.
- Variable Air-Volume (VAV) Systems: These systems supply variable airflow rates to save energy and better control comfort. In order to potentially optimize their operation, the adjustment of airflow rates in real time based on occupancy and thermal demand is necessary.
- Radiant Heating Devices: These devices transfer thermal energy for space heating through connections to boilers or operate using electricity. The control challenge is to maintain consistent heat output and ensure efficient heat transfer.
- Boilers: These produce hot water or steam for heating, which is then circulated through pipes. The potential control challenge in this type of equipment usually concerns the preservation of the desired temperature and pressure, ensuring efficient fuel combustion.
- Coolers: Evaporative coolers work by evaporating water to cool the air, which is effective in dry regions. For their efficient operation, it is necessary to optimize the evaporation process and manage water consumption.
- Furnaces: These are high-temperature heating devices used for central heating. The control challenge is to achieve high-temperature heating without wasting fuel and ensure even distribution of heated air.
- Multi-HVAC Systems: These systems integrate multiple types of HVAC equipment into a single framework, enabling zoning. Here, the control challenge is significantly more demanding than single-HVAC units. The coordination of various components to work harmoniously while considering the distinct thermal demands of different zones presents a significantly more complicated task.
3. Conceptual Background of Model-Free Methodologies for HVAC Control
3.1. Reinforcement Learning
3.1.1. Value-Based RL Approach
3.1.2. Policy-Based RL Approach
3.1.3. Actor–Critic RL Approach
3.2. Artificial Neural Networks
3.2.1. Feedforward Neural Networks (FNNs)
3.2.2. Recurrent Neural Networks (RNNs)
3.3. Fuzzy Logic Control
3.3.1. Mamdani FLC Approach
3.3.2. Sugeno FLC Approach
4. Literature Review of Model-Free Applications in HVAC Control
- Reference: Denoted as Ref. in the first column of each table.
- Year: The publication year of each research application.
- Methodology: The specific RL/ANN/FLC/hybrid/other type of control methodology applied in the related work.
- Agent: Indicates whether the applied control strategy utilizes a single- or multi-agent control philosophy.
- HVAC: The specific HVAC equipment type of each application, as described in the published work. Air-conditioning is denoted as AC; heat pumps are denoted as heat pumps; radiant heating is denoted as radiators; cooling devices are denoted as coolers; variable air-volume equipment is denoted as VAV; air-handling units are denoted as AHUs; and multi-HVAC equipment frameworks integrating more than a single device for control are denoted as multi.
- Single-zone: An “x” in this column indicates that the testbed application concerns a single-zone building control application.
- Multi-zone: An “x” in this column indicates that the testbed application concerns a multi-zone building control application.
- Simulation: An “x” in this column indicates that the testbed application concerns a simulation building control application.
- Real-life: An “x” in this column indicates that the testbed application concerns a real-world or real-life building control application.
- Residential: An “x” in this column indicates that the testbed application concerns a residential building control application.
- Commercial: An “x” in this column indicates that the testbed application concerns a commercial building control application.
- Citations: Indicates the number of citations of the related work according to Scopus.
4.1. Literature Review of Reinforcement Learning Control Applications
Ref. | Year | Methodology | Agent | HVAC | Single-Zone | Multi-Zone | Simulation | Real-Life | Residential | Commercial | Citations |
---|---|---|---|---|---|---|---|---|---|---|---|
[46] | 2015 | Q-learning | Single | NaN | x | x | 74 | ||||
[47] | 2017 | Q-learning | Single | VAV | x | x | 246 | ||||
[48] | 2017 | OPMCAC | Multi | VAV | x | x | x | 79 | |||
[49] | 2018 | Q-learning | Single | Multi | x | x | x | 189 | |||
[50] | 2019 | Gnu-RL | Single | VAV | x | x | x | x | 73 | ||
[51] | 2019 | DQN | Single | Cooler | x | x | x | 92 | |||
[52] | 2019 | DDPG | Single | Heat Pump | x | x | 54 | ||||
[53] | 2019 | PPO | Single | Multi | x | x | x | 64 | |||
[54] | 2019 | DDPG | Single | NaN | x | x | 95 | ||||
[55] | 2020 | PPO/TRPO | Single | Multi | x | x | x | 88 | |||
[56] | 2020 | DDPG | Single | AHU | x | x | x | x | 73 | ||
[57] | 2020 | Q-learning | Multi | AC | x | x | x | x | 46 | ||
[58] | 2020 | MAAC | Multi | AHU | x | x | x | 114 | |||
[59] | 2021 | SAC/TD3 | Single | Multi | x | x | x | 47 | |||
[60] | 2021 | DDPG | Multi | Heat Pump | x | x | x | 114 | |||
[61] | 2021 | DQN | Single | Radiator | x | x | x | x | 52 |
4.2. Literature Review of Artificial Neural Network Control Applications
Ref. | Year | Methodology | Agent | HVAC | Single-Zone | Multi-Zone | Simulation | Real-Life | Residential | Commercial | Citations |
---|---|---|---|---|---|---|---|---|---|---|---|
[66] | 2015 | MLP/FNN | Single | AHU | x | x | x | 99 | |||
[67] | 2016 | TDNN/FNN | Multi | Multi | x | x | x | 104 | |||
[68] | 2016 | RandNN/FNN | Single | NaN | x | x | x | 67 | |||
[69] | 2016 | MLP/FNN | Single | NaN | x | x | x | 277 | |||
[70] | 2017 | RNN | Single | Heat Pump | x | x | x | 67 | |||
[71] | 2017 | FNN | Single | NaN | x | x | x | 544 | |||
[72] | 2018 | MLP/FNN | Single | Multi | x | x | x | x | 100 | ||
[73] | 2018 | MLP/FNN | Single | AHU | x | x | x | 51 | |||
[74] | 2018 | MLP/FNN | Single | Heat Pump | x | x | x | 70 | |||
[75] | 2019 | LSTM/RNN | Single | NaN | x | x | x | 60 | |||
[76] | 2019 | MLP/FNN | Single | NaN | x | x | x | 60 | |||
[77] | 2020 | LSTM/RNN | Single | Heat Pump | x | x | x | 67 | |||
[78] | 2021 | LSTM/RNN | Single | VAV | x | x | x | 56 |
4.3. Literature Review of Fuzzy Logic Control Applications
4.4. Literature Review of Hybrid Model-Free Control Applications
Ref. | Year | Methodology | Agent | HVAC | Single-Zone | Multi-Zone | Simulation | Real-Life | Residential | Commercial | Citations |
---|---|---|---|---|---|---|---|---|---|---|---|
[85] | 2015 | GA-Fuzzy | Single | Multi | x | x | x | 61 | |||
[86] | 2015 | PSO-ANN | Single | AHU | x | x | x | 111 | |||
[87] | 2016 | EC-ANN | Single | AHU | x | x | 54 | ||||
[88] | 2017 | k-NN DTBF | Single | VAV | x | x | x | 61 | |||
[89] | 2018 | Neuro-Fuzzy | Single | AHU | x | x | x | 31 | |||
[91] | 2019 | GA-RL | Single | Radiator | x | x | x | 138 | |||
[90] | 2019 | GA-Fuzzy | NaN | VAV | x | x | x | 136 |
4.5. Literature Review of Other Model-Free Control Applications
5. Evaluation
5.1. Evaluation of Model-Free Control Strategies
5.1.1. Evaluation of Reinforcement Learning Control Strategies
5.1.2. Evaluation of Artificial Neural Network Control Strategies
5.1.3. Evaluation of Fuzzy Logic Control Strategies
5.1.4. Evaluation of Hybrid Model-Free Control Strategies
5.1.5. Evaluation of Other Model-free Control Strategies
5.2. Evaluation of Agent-Based Optimization Strategies
5.3. Evaluation of HVAC Equipment Types
5.4. Evaluation of Building Zones
5.5. Evaluation of Building Testbeds
5.6. Evaluation of Building Use
6. Future Directions in HVAC Control Using Model-Free Strategies
- Reinforcement Learning (RL): According to our evaluation, the primary utilized strategy considering the model-free HVAC control framework is RL, specifically DRL. The potential of RL in HVAC control is vast, but there are several areas ripe for exploration. One of the primary challenges is the sample inefficiency of many RL algorithms such as Q-learning, deep Q-networks (DQNs), and vanilla policy gradient methods. These algorithms often require a large number of samples (interactions with the environment) to learn a satisfactory policy, which can be costly and time-consuming in real-world HVAC scenarios. On the other hand, more advanced algorithms and techniques have been developed to address this inefficiency. Algorithms like proximal policy optimization (PPO) and trust region policy optimization (TRPO) have been shown to be more sample-efficient and stable in various applications. Their approaches are limited in the literature and thus more efforts are needed in order to further advance the concept of efficient HVAC control in buildings. Additionally, model-based RL approaches, where a model of the environment is learned and then used to simulate and optimize the policy, may also prove particularly beneficial for HVAC systems. This is because they can leverage the model to generate “synthetic” samples, reducing the need for real-world interactions. To this end, future research should focus on developing algorithms that can learn more effectively from limited interactions with the environment. Transfer learning, where knowledge gained in one environment is applied to another, could be a key technique for addressing this.Additionally, multi-agent RL systems, where multiple agents collaboratively learn and operate, could be employed to manage large-scale HVAC systems with numerous components. According to our evaluation, multi-agent RL approaches are limited compared to single-agent techniques, hindering the efficient application of RL control in large-scale buildings, where the potential for energy savings is vast. There is also a need for the development of RL algorithms that are robust to uncertainties such as unpredictable weather changes or equipment malfunction. Incorporating Bayesian approaches into RL might offer a solution by allowing the system to reason about its uncertainty and make more informed decisions.
- Artificial Neural Networks (ANNs): ANNs have shown great promise in predicting and optimizing HVAC operations. However, the black-box nature of ANNs can be a hindrance, especially in critical applications where interpretability is crucial. Future research should explore the development of interpretable neural network architectures for HVAC applications. Techniques like attention mechanisms, which highlight the importance of different inputs, could be integrated into HVAC-specific ANNs. Furthermore, the integration of temporal convolutional networks or recurrent architectures can better capture the time-dependent dynamics of HVAC systems. It should be noted that the vast majority of ANN applications consider FNN architectures, highlighting MLP as the most prominent, whereas RNN applications are more sparse. To this end, the examination of RNNs may provide a useful approach to providing upgraded and more sophisticated HVAC control in buildings. Moreover, comparisons between FNN and RNN architectures may provide a valuable research perspective. In addition, other architectures should be examined more intensively in future works, such as attention mechanisms and Transformer architectures for forecasting tasks where certain time points might be more relevant than others; hybrid models that combine the strengths of different architectures, like CNNs for spatial data and LSTMs for temporal data, or even hybrid models that combine ANNs with other techniques such as RL to leverage the strengths of both approaches; and neural ordinary differential equations (neural ODEs) for efficiently modeling HVAC systems using continuous dynamics, offering a differentiable way to solve differential equations.
- Fuzzy Logic Control (FLC): FLC offers a more understandable approach to HVAC control, as it makes use of linguistic rules. However, the manual design of membership functions and rule bases can be tedious and may not always capture the complexities of real-world scenarios. In comparison to reinforcement learning (RL) and artificial neural network (ANN) model-free frameworks, FLC applications are limited in the literature between 2015 and 2023. One primary reason is the manual design requirement of FLC systems. Designing membership functions and rule bases can be intricate, especially for multifaceted systems, whereas ANNs and RL inherently learn from data, diminishing the need for manual intervention. Additionally, the scalability of FLC becomes a concern with complex HVAC systems. As these systems grow in complexity, the number of rules in an FLC system can surge exponentially, making it challenging to design and maintain. In contrast, ANN and RL algorithms offer better scalability, thereby adeptly handling larger systems. Furthermore, although FLC can be adaptive, it often necessitates extra mechanisms to modify its rules based on real-time data.Future research work should focus on adaptive fuzzy systems that can evolve their rules and membership functions based on real-time data. The key is to integrate FLC with data-driven approaches like ANNs for the creation of hybrid models that combine the interpretability of FLC with the predictive power of ANNs. This is currently taking place in numerous research efforts and is known as neuro-fuzzy control approaches, which offer greater adaptability to changing environments compared to simple FLC. Moreover, neuro-fuzzy systems can handle complex, nonlinear relationships more efficiently, making them more robust and versatile. Last but not least, future research should also focus on the development of standardized frameworks for designing, testing, and validating FLC systems in HVAC applications.
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
AC | Air-Conditioner |
ADMM | Alternating Direction Method of Multipliers |
AFLM | Adaptive Fuzzy Logic Model |
AHU | Air-Handling Unit |
ANN | Artificial Neural Network |
BCNN | Bayesian Convolutional Neural Network |
BCVTB | Building Control Virtual Testbed |
BDQ | Branching Dueling Q-Network |
BEMS | Building Energy Management System |
CAO | Cognitive Adaptive Optimization |
CV | Coefficient of Variation |
DDPG | Deep Deterministic Policy Gradient |
DQN | Deep Q-Network |
DR | Demand Response |
DRL | Deep Reinforcement Learning |
DT | Decision Tree |
FIS | Fuzzy Inference System |
FLC | Fuzzy Logic Control |
FNN | Feedforward Neural Network |
GA | Genetic Algorithm |
HVAC | Heating Ventilation and Air-Conditioning |
IAQ | Indoor Air Quality |
IoT | Internet of Things |
k-NN | k-Nearest Neighbor |
LSTM | Long Short-Term Memory |
LTPC | Learning-Based Thermal Preference Control |
MAAC | Multi-Agent Actor–Critic |
MAD | Mean Absolute Deviation |
MADRL | Multi-Agent Deep Reinforcement Learning |
MAE | Mean Absolute Error |
MAPE | Mean Absolute Percentage Error |
MDP | Markov Decision Process |
MLP | Multi-Layer Perceptron |
MLR | Multiple Linear Regression |
MPC | Model Predictive Control |
NRMSE | Normalized Root-Mean-Square Error |
PCM | Personal Comfort System |
PIR | Passive Infrared Sensor |
PMV | Predicted Mean Vote |
PPD | Predicted Percentage Dissatisfied |
PPO | Proximal Policy Optimization |
PSO | Particle Swarm Optimization |
RBC | Rule-Based Control |
RBFNN | Radial Basis Function Neural Network |
RL | Reinforcement Learning |
RMSE | Root-Mean-Square Error |
RNN | Recurrent Neural Network |
RTP | Real-Time Pricing Program |
RandNN | Random Neural Network |
Rsquare or R2 | Coefficient of Determination |
SAC | Soft Actor–Critic |
SARSA | State–Action–Reward–State–Action |
SFLL | Supervised Fuzzy Logic Learning |
SGD | Stochastic Gradient Descent |
SVM | Support Vector Machine |
TD | Temporal Difference |
TD3 | Twin Delayed Deep Deterministic Policy Gradient |
TDNN | Time-Delay Neural Network |
TRNSYS | Transient System Simulation Program |
TRPO | Trust Region Policy Optimization |
VAV | Variable Air-Volume |
WSN | Wireless Sensor Array |
References
- Global Alliance for Buildings and Construction. Global Status Report for Buildings and Construction; Global Alliance for Buildings and Construction: Paris, France, 2020. [Google Scholar]
- Li, Y.; Wang, W.; Wang, Y.; Xin, Y.; He, T.; Zhao, G. A review of studies involving the effects of climate change on the energy consumption for building heating and cooling. Int. J. Environ. Res. Public Health 2021, 18, 40. [Google Scholar] [CrossRef] [PubMed]
- Behrooz, F.; Mariun, N.; Marhaban, M.H.; Mohd Radzi, M.A.; Ramli, A.R. Review of control techniques for HVAC systems—Nonlinearity approaches based on Fuzzy cognitive maps. Energies 2018, 11, 495. [Google Scholar] [CrossRef]
- Serale, G.; Fiorentini, M.; Capozzoli, A.; Bernardini, D.; Bemporad, A. Model predictive control (MPC) for enhancing building and HVAC system energy efficiency: Problem formulation, applications and opportunities. Energies 2018, 11, 631. [Google Scholar] [CrossRef]
- Seyam, S. Types of HVAC Systems. In HVAC System; InTech Open: London, UK, 2018; pp. 49–66. Available online: https://www.intechopen.com/chapters/62059 (accessed on 12 October 2023).
- Rafati, A.; Shaker, H.R.; Ghahghahzadeh, S. Fault detection and efficiency assessment for hvac systems using non-intrusive load monitoring: A review. Energies 2022, 15, 341. [Google Scholar] [CrossRef]
- Michailidis, P.; Pelitaris, P.; Korkas, C.; Michailidis, I.; Baldi, S.; Kosmatopoulos, E. Enabling optimal energy management with minimal IoT requirements: A legacy A/C case study. Energies 2021, 14, 7910. [Google Scholar] [CrossRef]
- Belic, F.; Hocenski, Z.; Sliskovic, D. HVAC control methods-a review. In Proceedings of the 2015 19th International Conference on System Theory, Control and Computing (ICSTCC), Cheile Gradistei, Romania, 14–16 October 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 679–686. [Google Scholar]
- Yao, Y.; Shekhar, D.K. State of the art review on model predictive control (MPC) in Heating Ventilation and Air-conditioning (HVAC) field. Build. Environ. 2021, 200, 107952. [Google Scholar] [CrossRef]
- Akram, M.W.; Mohd Zublie, M.F.; Hasanuzzaman, M.; Rahim, N.A. Global prospects, advance technologies and policies of energy-saving and sustainable building systems: A review. Sustainability 2022, 14, 1316. [Google Scholar] [CrossRef]
- Kim, D.; Lee, J.; Do, S.; Mago, P.J.; Lee, K.H.; Cho, H. Energy modeling and model predictive control for HVAC in buildings: A review of current research trends. Energies 2022, 15, 7231. [Google Scholar] [CrossRef]
- Michailidis, I.T.; Sangi, R.; Michailidis, P.; Schild, T.; Fuetterer, J.; Mueller, D.; Kosmatopoulos, E.B. Balancing energy efficiency with indoor comfort using smart control agents: A simulative case study. Energies 2020, 13, 6228. [Google Scholar] [CrossRef]
- Ali, S.; Zheng, Z.; Aillerie, M.; Sawicki, J.P.; Pera, M.C.; Hissel, D. A review of DC Microgrid energy management systems dedicated to residential applications. Energies 2021, 14, 4308. [Google Scholar] [CrossRef]
- Paone, A.; Bacher, J.P. The impact of building occupant behavior on energy efficiency and methods to influence it: A review of the state of the art. Energies 2018, 11, 953. [Google Scholar] [CrossRef]
- Korkas, C.D.; Baldi, S.; Michailidis, P.; Kosmatopoulos, E.B. A cognitive stochastic approximation approach to optimal charging schedule in electric vehicle stations. In Proceedings of the 2017 25th Mediterranean Conference on Control and Automation (MED), Valletta, Malta, 3–6 July 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 484–489. [Google Scholar]
- Michailidis, I.T.; Michailidis, P.; Rizos, A.; Korkas, C.; Kosmatopoulos, E.B. Automatically fine-tuned speed control system for fuel and travel-time efficiency: A microscopic simulation case study. In Proceedings of the 2017 25th Mediterranean Conference on Control and Automation (MED), Valletta, Malta, 3–6 July 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 915–920. [Google Scholar]
- Michailidis, I.T.; Michailidis, P.; Alexandridou, K.; Brewick, P.T.; Masri, S.F.; Kosmatopoulos, E.B.; Chassiakos, A. Seismic Active Control under Uncertain Ground Excitation: An Efficient Cognitive Adaptive Optimization Approach. In Proceedings of the 2018 5th International Conference on Control, Decision and Information Technologies (CoDIT), Thessaloniki, Greece, 10–13 April 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 847–852. [Google Scholar]
- Michailidis, I.T.; Manolis, D.; Michailidis, P.; Diakaki, C.; Kosmatopoulos, E.B. Autonomous self-regulating intersections in large-scale urban traffic networks: A Chania City case study. In Proceedings of the 2018 5th International Conference on Control, Decision and Information Technologies (CoDIT), Thessaloniki, Greece, 10–13 April 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 853–858. [Google Scholar]
- Michailidis, I.T.; Manolis, D.; Michailidis, P.; Diakaki, C.; Kosmatopoulos, E.B. A decentralized optimization approach employing cooperative cycle-regulation in an intersection-centric manner: A complex urban simulative case study. Transp. Res. Interdiscip. Perspect. 2020, 8, 100232. [Google Scholar] [CrossRef]
- Michailidis, I.T.; Kapoutsis, A.C.; Korkas, C.D.; Michailidis, P.T.; Alexandridou, K.A.; Ravanis, C.; Kosmatopoulos, E.B. Embedding autonomy in large-scale IoT ecosystems using CAO and L4G-CAO. Discov. Internet Things 2021, 1, 1–22. [Google Scholar] [CrossRef]
- Keroglou, C.; Kansizoglou, I.; Michailidis, P.; Oikonomou, K.M.; Papapetros, I.T.; Dragkola, P.; Michailidis, I.T.; Gasteratos, A.; Kosmatopoulos, E.B.; Sirakoulis, G.C. A Survey on Technical Challenges of Assistive Robotics for Elder People in Domestic Environments: The ASPiDA Concept. IEEE Trans. Med. Robot. Bionics 2023, 5, 196–205. [Google Scholar] [CrossRef]
- Karatzinis, G.D.; Michailidis, P.; Michailidis, I.T.; Kapoutsis, A.C.; Kosmatopoulos, E.B.; Boutalis, Y.S. Coordinating heterogeneous mobile sensing platforms for effectively monitoring a dispersed gas plume. Integr. Comput.-Aided Eng. 2022, 29, 1–19. [Google Scholar] [CrossRef]
- Salavasidis, G.; Kapoutsis, A.C.; Chatzichristofis, S.A.; Michailidis, P.; Kosmatopoulos, E.B. Autonomous trajectory design system for mapping of unknown sea-floors using a team of AUVs. In Proceedings of the 2018 European Control Conference (ECC), Limassol, Cyprus, 12–15 June 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1080–1087. [Google Scholar]
- Gao, C.; Wang, D. Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems. J. Build. Eng. 2023, 74, 106852. [Google Scholar] [CrossRef]
- Vamvakas, D.; Michailidis, P.; Korkas, C.; Kosmatopoulos, E. Review and Evaluation of Reinforcement Learning Frameworks on Smart Grid Applications. Energies 2023, 16, 5326. [Google Scholar] [CrossRef]
- Macieira, P.; Gomes, L.; Vale, Z. Energy Management Model for HVAC Control Supported by Reinforcement Learning. Energies 2021, 14, 8210. [Google Scholar] [CrossRef]
- Boodi, A.; Beddiar, K.; Benamour, M.; Amirat, Y.; Benbouzid, M. Intelligent systems for building energy and occupant comfort optimization: A state of the art review and recommendations. Energies 2018, 11, 2604. [Google Scholar] [CrossRef]
- Gholamzadehmir, M.; Del Pero, C.; Buffa, S.; Fedrizzi, R. Adaptive-predictive control strategy for HVAC systems in smart buildings—A review. Sustain. Cities Soc. 2020, 63, 102480. [Google Scholar] [CrossRef]
- Ahmad, M.W.; Mourshed, M.; Yuce, B.; Rezgui, Y. Computational intelligence techniques for HVAC systems: A review. In Building Simulation; Tsinghua University Press: Beijing, China, 2016; Volume 9, pp. 359–398. [Google Scholar]
- Aqilah, N.; Rijal, H.B.; Zaki, S.A. A review of thermal comfort in residential buildings: Comfort threads and energy saving potential. Energies 2022, 15, 9012. [Google Scholar] [CrossRef]
- Lamsal, P.; Bajracharya, S.B.; Rijal, H.B. A Review on Adaptive Thermal Comfort of Office Building for Energy-Saving Building Design. Energies 2023, 16, 1524. [Google Scholar] [CrossRef]
- American Society of Heating, Refrigerating and Air-Conditioning Engineers, Inc. HVAC Systems and Equipment; American Society of Heating, Refrigerating, and Air Conditioning Engineers: Atlanta, GA, USA, 1996; Volume 39. [Google Scholar]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction, 2nd ed.; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
- Bertsekas, D.P.; Tsitsiklis, J.N. Neuro-Dynamic Programming; Athena Scientific: Nashua, NH, USA, 1996; Volume 27. [Google Scholar] [CrossRef]
- Williams, R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 1992, 8, 229–256. [Google Scholar] [CrossRef]
- Watkins, C.; Dayan, P. Q-learning. Mach. Learn. 1992, 8, 279–292. [Google Scholar] [CrossRef]
- Rummery, G.; Niranjan, M. On-Line Q-Learning Using Connectionist Systems; Technical Report CUED/F-INFENG/TR 166; University of Cambridge, Department of Engineering: Cambridge, UK, 1994. [Google Scholar]
- Boutalis, I.S.; Syrakoulis, G.C. Computational Intelligence & Applications, 1st ed.; Krikos: Xanthi, Greece, 2008. [Google Scholar]
- Haykin, S.S. Neural Networks and Learning Machines; Pearson Education India: Tamil Nadu, India, 2009; Available online: https://lps.ufrj.br/~caloba/Livros/Haykin2009.pdf (accessed on 12 October 2023).
- Krose, B.; van der Smagt, P. An Introduction to Neural Networks; MIT Press: Cambridge, MA, USA, 1996. [Google Scholar]
- Russell, S.; Norvig, P.; Chang, M.W.; Devlin, J.; Dragan, A.; Forsyth, D.; Goodfellow, I.; Malik, J.M.; Mansinghka, V.; Pearl, J.; et al. Artificial Intelligence a Modern Approach, 4th ed.; Pearson: New York, NY, USA, 2022. [Google Scholar]
- Michailidis, P.; Michailidis, I.T.; Gkelios, S.; Karatzinis, G.; Kosmatopoulos, E.B. Neuro-distributed cognitive adaptive optimization for training neural networks in a parallel and asynchronous manner. Integr. Comput.-Aided Eng. 2023, 1–23. [Google Scholar] [CrossRef]
- Zadeh, L. Fuzzy sets. Inf. Control 1965, 8, 338–353. [Google Scholar] [CrossRef]
- Mamdani, E.; Assilian, S. An experiment in linguistic synthesis with a fuzzy logic controller. Int. J. Man-Mach. Stud. 1975, 7, 1–13. [Google Scholar] [CrossRef]
- Takagi, T.; Sugeno, M. Fuzzy identification of systems and its applications to modeling and control. IEEE Trans. Syst. Man Cybern. 1985, 15, 116–132. [Google Scholar] [CrossRef]
- Barrett, E.; Linder, S. Autonomous hvac control, a reinforcement learning approach. In Proceedings of the Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2015, Porto, Portugal, 7–11 September 2015; Proceedings, Part III 15. Springer: Berlin/Heidelberg, Germany, 2015; pp. 3–19. [Google Scholar]
- Wei, T.; Wang, Y.; Zhu, Q. Deep reinforcement learning for building HVAC control. In Proceedings of the 54th Annual Design Automation Conference, Austin, TX, USA, 18 June 2017; pp. 1–6. [Google Scholar]
- Wang, Y.; Velswamy, K.; Huang, B. A long-short term memory recurrent neural network based reinforcement learning controller for office heating ventilation and air conditioning systems. Processes 2017, 5, 46. [Google Scholar] [CrossRef]
- Chen, Y.; Norford, L.K.; Samuelson, H.W.; Malkawi, A. Optimal control of HVAC and window systems for natural ventilation through reinforcement learning. Energy Build. 2018, 169, 195–205. [Google Scholar] [CrossRef]
- Chen, B.; Cai, Z.; Bergés, M. Gnu-rl: A precocial reinforcement learning solution for building hvac control using a differentiable mpc policy. In Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New York, NY, USA, 13–14 November 2019; pp. 316–325. [Google Scholar]
- Valladares, W.; Galindo, M.; Gutiérrez, J.; Wu, W.C.; Liao, K.K.; Liao, J.C.; Lu, K.C.; Wang, C.C. Energy optimization associated with thermal comfort and indoor air control via a deep reinforcement learning algorithm. Build. Environ. 2019, 155, 105–117. [Google Scholar] [CrossRef]
- Liu, T.; Xu, C.; Guo, Y.; Chen, H. A novel deep reinforcement learning based methodology for short-term HVAC system energy consumption prediction. Int. J. Refrig. 2019, 107, 39–51. [Google Scholar] [CrossRef]
- Zhang, C.; Kuppannagari, S.R.; Kannan, R.; Prasanna, V.K. Building HVAC scheduling using reinforcement learning via neural network based model approximation. In Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, New York, NY, USA, 13–14 November 2019; pp. 287–296. [Google Scholar]
- Gao, G.; Li, J.; Wen, Y. Energy-efficient thermal comfort control in smart buildings via deep reinforcement learning. arXiv 2019, arXiv:1901.04693. [Google Scholar]
- Azuatalam, D.; Lee, W.L.; de Nijs, F.; Liebman, A. Reinforcement learning for whole-building HVAC control and demand response. Energy AI 2020, 2, 100020. [Google Scholar] [CrossRef]
- Zou, Z.; Yu, X.; Ergan, S. Towards optimal control of air handling units using deep reinforcement learning and recurrent neural network. Build. Environ. 2020, 168, 106535. [Google Scholar] [CrossRef]
- Lork, C.; Li, W.T.; Qin, Y.; Zhou, Y.; Yuen, C.; Tushar, W.; Saha, T.K. An uncertainty-aware deep reinforcement learning framework for residential air conditioning energy management. Appl. Energy 2020, 276, 115426. [Google Scholar] [CrossRef]
- Yu, L.; Sun, Y.; Xu, Z.; Shen, C.; Yue, D.; Jiang, T.; Guan, X. Multi-agent deep reinforcement learning for HVAC control in commercial buildings. IEEE Trans. Smart Grid 2020, 12, 407–419. [Google Scholar] [CrossRef]
- Biemann, M.; Scheller, F.; Liu, X.; Huang, L. Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control. Appl. Energy 2021, 298, 117164. [Google Scholar] [CrossRef]
- Du, Y.; Zandi, H.; Kotevska, O.; Kurte, K.; Munk, J.; Amasyali, K.; Mckee, E.; Li, F. Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning. Appl. Energy 2021, 281, 116117. [Google Scholar] [CrossRef]
- Gupta, A.; Badr, Y.; Negahban, A.; Qiu, R.G. Energy-efficient heating control for smart buildings with deep reinforcement learning. J. Build. Eng. 2021, 34, 101739. [Google Scholar] [CrossRef]
- Li, Z.; Sun, Z.; Meng, Q.; Wang, Y.; Li, Y. Reinforcement learning of room temperature set-point of thermal storage air-conditioning system with demand response. Energy Build. 2022, 259, 111903. [Google Scholar] [CrossRef]
- Lei, Y.; Zhan, S.; Ono, E.; Peng, Y.; Zhang, Z.; Hasama, T.; Chong, A. A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings. Appl. Energy 2022, 324, 119742. [Google Scholar] [CrossRef]
- Deng, X.; Zhang, Y.; Qi, H. Towards optimal HVAC control in non-stationary building environments combining active change detection and deep reinforcement learning. Build. Environ. 2022, 211, 108680. [Google Scholar] [CrossRef]
- Yu, L.; Xu, Z.; Zhang, T.; Guan, X.; Yue, D. Energy-efficient personalized thermal comfort control in office buildings based on multi-agent deep reinforcement learning. Build. Environ. 2022, 223, 109458. [Google Scholar] [CrossRef]
- Huang, H.; Chen, L.; Hu, E. A neural network-based multi-zone modelling approach for predictive control system design in commercial buildings. Energy Build. 2015, 97, 86–97. [Google Scholar] [CrossRef]
- Sholahudin, S.; Han, H. Simplified dynamic neural network model to predict heating load of a building using Taguchi method. Energy 2016, 115, 1672–1678. [Google Scholar] [CrossRef]
- Javed, A.; Larijani, H.; Ahmadinia, A.; Emmanuel, R.; Mannion, M.; Gibson, D. Design and implementation of a cloud enabled random neural network-based decentralized smart controller with intelligent sensor nodes for HVAC. IEEE Internet Things J. 2016, 4, 393–403. [Google Scholar] [CrossRef]
- Chae, Y.T.; Horesh, R.; Hwang, Y.; Lee, Y.M. Artificial neural network model for forecasting sub-hourly electricity usage in commercial buildings. Energy Build. 2016, 111, 184–194. [Google Scholar] [CrossRef]
- Chen, Y.; Shi, Y.; Zhang, B. Modeling and optimization of complex building energy systems with deep neural networks. In Proceedings of the 2017 51st Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 29 October–1 November 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1368–1373. [Google Scholar]
- Ahmad, M.W.; Mourshed, M.; Rezgui, Y. Trees vs Neurons: Comparison between random forest and ANN for high-resolution prediction of building energy consumption. Energy Build. 2017, 147, 77–89. [Google Scholar] [CrossRef]
- González-Briones, A.; Prieto, J.; De La Prieta, F.; Herrera-Viedma, E.; Corchado, J.M. Energy optimization using a case-based reasoning strategy. Sensors 2018, 18, 865. [Google Scholar] [CrossRef]
- Deb, C.; Lee, S.E.; Santamouris, M. Using artificial neural networks to assess HVAC related energy saving in retrofitted office buildings. Sol. Energy 2018, 163, 32–44. [Google Scholar] [CrossRef]
- Kim, Y.J. Optimal price based demand response of HVAC systems in multizone office buildings considering thermal preferences of individual occupants buildings. IEEE Trans. Ind. Inform. 2018, 14, 5060–5073. [Google Scholar] [CrossRef]
- Wang, Z.; Hong, T.; Piette, M.A. Data fusion in predicting internal heat gains for office buildings through a deep learning approach. Appl. Energy 2019, 240, 386–398. [Google Scholar] [CrossRef]
- Peng, Y.; Nagy, Z.; Schlüter, A. Temperature-preference learning with neural networks for occupant-centric building indoor climate controls. Build. Environ. 2019, 154, 296–308. [Google Scholar] [CrossRef]
- Sendra-Arranz, R.; Gutiérrez, A. A long short-term memory artificial neural network to predict daily HVAC consumption in buildings. Energy Build. 2020, 216, 109952. [Google Scholar] [CrossRef]
- Elmaz, F.; Eyckerman, R.; Casteels, W.; Latré, S.; Hellinckx, P. CNN-LSTM architecture for predictive indoor temperature modeling. Build. Environ. 2021, 206, 108327. [Google Scholar] [CrossRef]
- Saepullah, A.; Wahono, R.S. Comparative analysis of mamdani, sugeno and tsukamoto method of fuzzy inference system for air conditioner energy saving. J. Intell. Syst. 2015, 1, 143–147. [Google Scholar]
- Keshtkar, A.; Arzanpour, S.; Keshtkar, F.; Ahmadi, P. Smart residential load reduction via fuzzy logic, wireless sensors, and smart grid incentives. Energy Build. 2015, 104, 165–180. [Google Scholar] [CrossRef]
- Ulpiani, G.; Borgognoni, M.; Romagnoli, A.; Di Perna, C. Comparing the performance of on/off, PID and fuzzy controllers applied to the heating system of an energy-efficient building. Energy Build. 2016, 116, 1–17. [Google Scholar] [CrossRef]
- Keshtkar, A.; Arzanpour, S. An adaptive fuzzy logic system for residential energy management in smart grid environments. Appl. Energy 2017, 186, 68–81. [Google Scholar] [CrossRef]
- Ain, Q.u.; Iqbal, S.; Khan, S.A.; Malik, A.W.; Ahmad, I.; Javaid, N. IoT operating system based fuzzy inference system for home energy management system in smart buildings. Sensors 2018, 18, 2802. [Google Scholar] [CrossRef] [PubMed]
- Li, W.; Zhang, J.; Zhao, T. Indoor thermal environment optimal control for thermal comfort and energy saving based on online monitoring of thermal sensation. Energy Build. 2019, 197, 57–67. [Google Scholar] [CrossRef]
- Hussain, S.; Gabbar, H.A.; Bondarenko, D.; Musharavati, F.; Pokharel, S. Comfort-based fuzzy control optimization for energy conservation in HVAC systems. Control Eng. Pract. 2014, 32, 172–182. [Google Scholar] [CrossRef]
- Wei, X.; Kusiak, A.; Li, M.; Tang, F.; Zeng, Y. Multi-objective optimization of the HVAC (heating, ventilation, and air conditioning) system performance. Energy 2015, 83, 294–306. [Google Scholar] [CrossRef]
- Attaran, S.M.; Yusof, R.; Selamat, H. A novel optimization algorithm based on epsilon constraint-RBF neural network for tuning PID controller in decoupled HVAC system. Appl. Therm. Eng. 2016, 99, 613–624. [Google Scholar] [CrossRef]
- Ghahramani, A.; Karvigh, S.A.; Becerik-Gerber, B. HVAC system energy optimization using an adaptive hybrid metaheuristic. Energy Build. 2017, 152, 149–161. [Google Scholar] [CrossRef]
- Sala-Cardoso, E.; Delgado-Prieto, M.; Kampouropoulos, K.; Romeral, L. Activity-aware HVAC power demand forecasting. Energy Build. 2018, 170, 15–24. [Google Scholar] [CrossRef]
- Satrio, P.; Mahlia, T.M.I.; Giannetti, N.; Saito, K. Optimization of HVAC system energy consumption in a building using artificial neural network and multi-objective genetic algorithm. Sustain. Energy Technol. Assess. 2019, 35, 48–57. [Google Scholar]
- Zhang, Z.; Chong, A.; Pan, Y.; Zhang, C.; Lam, K.P. Whole building energy model for HVAC optimal control: A practical framework based on deep reinforcement learning. Energy Build. 2019, 199, 472–490. [Google Scholar] [CrossRef]
- Ren, M.; Liu, X.; Yang, Z.; Zhang, J.; Guo, Y.; Jia, Y. A novel forecasting based scheduling method for household energy management system based on deep reinforcement learning. Sustain. Cities Soc. 2022, 76, 103207. [Google Scholar] [CrossRef]
- Cai, J.; Kim, D.; Jaramillo, R.; Braun, J.E.; Hu, J. A general multi-agent control approach for building energy system optimization. Energy Build. 2016, 127, 337–351. [Google Scholar] [CrossRef]
- Wang, W.; Chen, J.; Huang, G.; Lu, Y. Energy efficient HVAC control for an IPS-enabled large space in commercial buildings through dynamic spatial occupancy distribution. Appl. Energy 2017, 207, 305–323. [Google Scholar] [CrossRef]
- Peng, Y.; Rysanek, A.; Nagy, Z.; Schlüter, A. Occupancy learning-based demand-driven cooling control for office spaces. Build. Environ. 2017, 122, 145–160. [Google Scholar] [CrossRef]
- Peng, Y.; Rysanek, A.; Nagy, Z.; Schlüter, A. Using machine learning techniques for occupancy-prediction-based cooling control in office buildings. Appl. Energy 2018, 211, 1343–1358. [Google Scholar] [CrossRef]
- Li, W.; Wang, S. A multi-agent based distributed approach for optimal control of multi-zone ventilation systems considering indoor air quality and energy use. Appl. Energy 2020, 275, 115371. [Google Scholar] [CrossRef]
- Michailidis, I.T.; Schild, T.; Sangi, R.; Michailidis, P.; Korkas, C.; Fütterer, J.; Müller, D.; Kosmatopoulos, E.B. Energy-efficient HVAC management using cooperative, self-trained, control agents: A real-life German building case study. Appl. Energy 2018, 211, 113–125. [Google Scholar] [CrossRef]
- Lymperopoulos, G.; Ioannou, P. Building temperature regulation in a multi-zone HVAC system using distributed adaptive control. Energy Build. 2020, 215, 109825. [Google Scholar] [CrossRef]
- Apostolidis, S.; Koutras, D.; Orfanidis, G.; Michailidis, P.; Ioannidis, K.; Kapoutsis, A.; Vrochidis, S.; Kompatsiaris, I.; Kosmatopoulos, E. D3.5 Dynamic and Adaptive Swarm Optimization V1. 2020. Available online: https://aresibo.eu/sites/default/files/documents/d3.5.pdf (accessed on 29 September 2020).
- Shahnazari, H.; Mhaskar, P.; House, J.M.; Salsbury, T.I. Modeling and fault diagnosis design for HVAC systems using recurrent neural networks. Comput. Chem. Eng. 2019, 126, 189–203. [Google Scholar] [CrossRef]
- Elnour, M.; Meskin, N.; Al-Naemi, M. Sensor data validation and fault diagnosis using Auto-Associative Neural Network for HVAC systems. J. Build. Eng. 2020, 27, 100935. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Michailidis, P.; Michailidis, I.; Vamvakas, D.; Kosmatopoulos, E. Model-Free HVAC Control in Buildings: A Review. Energies 2023, 16, 7124. https://doi.org/10.3390/en16207124
Michailidis P, Michailidis I, Vamvakas D, Kosmatopoulos E. Model-Free HVAC Control in Buildings: A Review. Energies. 2023; 16(20):7124. https://doi.org/10.3390/en16207124
Chicago/Turabian StyleMichailidis, Panagiotis, Iakovos Michailidis, Dimitrios Vamvakas, and Elias Kosmatopoulos. 2023. "Model-Free HVAC Control in Buildings: A Review" Energies 16, no. 20: 7124. https://doi.org/10.3390/en16207124
APA StyleMichailidis, P., Michailidis, I., Vamvakas, D., & Kosmatopoulos, E. (2023). Model-Free HVAC Control in Buildings: A Review. Energies, 16(20), 7124. https://doi.org/10.3390/en16207124