Next Article in Journal
Revolutionizing Green Electricity Certificates: A Real-Time Traceability Framework for Credible Renewable Energy Attribution in China
Previous Article in Journal
Wind–Storage Coordinated Control Strategy for Suppressing Repeated Voltage Ride-Through of Units Under Extreme Weather Conditions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Digitalization for Sustainable Heat Pump Operation: Review on Smart Control and Optimization Strategies

Process Equipment Design Laboratory, Department of Mechanical Engineering, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
*
Author to whom correspondence should be addressed.
Energies 2026, 19(1), 66; https://doi.org/10.3390/en19010066
Submission received: 12 November 2025 / Revised: 17 December 2025 / Accepted: 19 December 2025 / Published: 22 December 2025

Abstract

This review provides a comprehensive analysis of advanced control strategies and operational optimization of energy systems, focusing on heat pumps, with an emphasis on their role in enhancing energy efficiency and operational flexibility. The study concentrates on methods supported by artificial intelligence algorithms, highlighting Model Predictive Control (MPC), Reinforcement Learning (RL), and hybrid approaches that combine the advantages of both. These methods aim to optimize both the operation of heat pumps and their interaction with thermal energy storage (TES) systems, renewable energy sources, and power grids, thereby enhancing the flexibility and adaptability of the systems under real operating conditions. Through a systematic analysis of the existing literature, 95 studies published after 2019 were examined to identify research trends, key challenges such as computational requirements and algorithm interpretability, and future opportunities. Furthermore, significant benefits of applying advanced control compared to conventional practices were highlighted, such as reduced operational costs and lower CO2 emissions, emphasizing the importance of heat pumps in the energy transition. Thus, the analysis highlights the need for digital solutions, robust and adaptive control frameworks, and holistic techno-economic evaluation methods in order to fully exploit the potential of heat pumps and accelerate the transition to sustainable and flexible energy systems.

1. Introduction

The impacts of climate change are already affecting humanity through more frequent extreme weather events, rising global temperatures, and disruptions to ecosystems, agriculture, and public health. These pressing challenges necessitate the transformation of energy systems, specifically by reducing reliance on fossil fuels and promoting sustainable energy solutions to secure a resilient global future. In response to these urgent needs, the European Commission announced the European Green Deal in December 2019, aiming for climate neutrality by 2050 in line with the commitments of the Paris Agreement [1].

1.1. Heat Pumps as a Key Technology for Decarbonization

The demand for thermal energy is a critical area of this transition, as heating and cooling account for half of global energy consumption [2] and are expected to increase further in the coming years due to climate change. In this framework, heat pumps powered by low-emission electricity emerge as one of the most critical technologies for achieving climate targets and facilitating the transition to a sustainable energy future [3]. By harnessing low-temperature renewable sources, heat pumps offer high energy efficiency and can significantly reduce dependence on fossil fuels. Their ability to be integrated with renewable energy systems, thermal storage, and smart grids makes them a strategic element for the decarbonization of both the building and industrial sectors. A main challenge in the energy transition concerns the electrification of systems and their smooth integration into the grids, resulting from the variability of generation and the imbalance between supply and demand. In this context, digitalization emerges as a key enabler for transforming energy systems, as it can address these issues by facilitating energy management, improving energy efficiency, and reducing CO2 emissions [3]. In this framework of digitalization and artificial intelligence development, a growing research focus is emerging on the control and optimization of heat pump systems, aiming to enhance energy efficiency and enable a more sustainable energy future.

1.2. Literature Analysis Approach

This review offers a comprehensive and critical examination of the current state of optimal control for heat pump systems within the context of advanced digitalization of energy systems. The objective is to identify the capabilities, challenges, and future directions that can enhance energy efficiency, flexibility, and the contribution of heat pumps to the transition towards a sustainable energy system. To enhance the organization of the article collection and the overall research, three research questions were identified, serving as a roadmap and aiding the entire process:
  • RQ1: What are the main advanced control strategies for heat pump systems?
  • RQ2: What are the advantages and limitations of each strategy?
  • RQ3: What are the reported outcomes of implementing advanced control strategies in heat pump systems, and what are the main challenges for their widespread adoption?
Guided by these three research questions, the methodology followed for the literature review is illustrated in Figure 1.
Initially, a search was conducted to identify various scientific studies in the literature database Scopus and the academic search engine Google Scholar, which were selected for their broad representation of topics in science and engineering. A search was conducted for various papers using a range of carefully selected keywords. These phrases included “model-free control heat pump systems,” “model predictive control heat pump systems,” “digitalization heat pump systems,” as well as terms related to flexibility, smart control, and other aspects of heat pump system optimization and digitalization. This approach ensured a comprehensive coverage of the scientific literature on the topic. After a rapid screening of the abstracts, the most relevant studies were selected for detailed analysis. Subsequently, the selected articles were further refined using specific filters: review papers were excluded, only peer-reviewed articles written in English were retained, and the selection was limited to articles published from 2019 and later. Furthermore, each chosen article was assessed for quality, considering the authors’ scientific contributions and the research methodologies applied, with priority given to papers published in reputable journals and conferences (e.g., Elsevier, MDPI, IEEE). Finally, 95 papers were selected for further analysis. The results were structured into categories based on the control strategy applied in each system and the application domain. This structure enabled the identification of the advantages of each method and technology, as well as the highlighting of future directions and the main obstacles to their implementation.
Additionally, a bibliometric analysis was conducted using VOSviewer (v.1.6.20) on the 95 selected articles, as shown in Figure 2. VOSviewer was used to extract and visualize the keywords of the articles, displaying their frequency and co-occurrence over time. The temporal mapping highlighted emerging trends, shifts in research areas, and the evolution of key topics related to heat pump control, optimization, digitalization, and related fields.
This visualization enabled a clearer understanding of the concepts that have gained prominence in recent years, thus facilitating the interpretation and categorization of control strategies and application domains. As observed, model-free control, such as deep reinforcement learning, has attracted significant research interest in recent years, indicating that it still holds considerable potential for further investigation. Moreover, additional studies are needed to verify and evaluate the performance of new control strategies for heat pump systems. For this reason, the hardware-in-the-loop approach is gaining increasing importance in the current literature, while district energy systems have also emerged as a recent area of interest within the framework of decarbonizing energy systems. All the findings from the VOSviewer analysis are described, analyzed, and discussed in the following sections.

1.3. Paper Structure

The structure of the article is illustrated in Figure 3. Section 1 served as the introduction, presenting the motivation behind this review and the literature analysis approach that was adopted. Subsequently, Section 2 provides a concise overview of the fundamental principles of heat pump systems, as well as the system characteristics and parameters that influence their performance and enable adaptable operation, which are topics increasingly gaining importance in contemporary research. In Section 3, the results of the extensive literature review on the control and optimization of various heat pump systems are presented and discussed, alongside a structured categorization and analysis of the selected articles. Section 4 presents a critical analysis of the reviewed control strategies, highlighting the relationships between control methods, application fields, and performance indicators, and identifying key insights, challenges, and research gaps. Finally, Section 5 presents the conclusions of this review, summarizing the main findings, highlighting the key contributions, and outlining future research directions for heat pump systems.

2. Technologies and Application of Heat Pumps

This chapter provides a comprehensive overview of heat pump systems, focusing on their classification, main technologies, and practical applications. It begins with an examination of the classification of heat pumps, differentiating them according to operating principles, energy sources, and system configurations. Next, some of the key applications of heat pumps with significant market presence are highlighted. Finally, the chapter discusses the potential of heat pumps to participate in demand response strategies, illustrating how these systems can support grid balancing, facilitate the integration of variable renewable energy sources, and optimize overall energy consumption. This overview lays the essential technical foundation for the understanding of subsequent chapters on control strategies, system optimization, and digitalization of heat pump technologies.

2.1. Classification of Heat Pump Systems

A heat pump is a device that transfers heat from a lower temperature to a higher temperature, contrary to the natural flow of heat, using energy. Heat pumps can be classified into electrically driven heat pumps and thermally driven heat pumps [4]. Electrically driven heat pumps use electricity to operate the compression cycle, while thermally driven heat pumps use heat or an engine to drive the sorption or compression cycle. Gas sorption heat pump, thermal compression heat pump, and gas engine heat pump are the three main types of thermally driven heat pumps. Both electrically driven and thermally driven heat pumps can operate with various cold sources and heat sinks, such as air, water, and ground [5]. Furthermore, heat pumps can also be classified according to their supply temperature. Low-temperature supply heat pumps typically provide around 35 °C, while intermediate-temperature systems operate at approximately 45 °C. Medium-temperature heat pumps deliver heat at about 55 °C, and high-temperature units can reach up to 200 °C [4], mainly used in industrial applications.

2.2. Key Developments and Applications of Heat Pumps

This section presents certain heat pump technologies and applications that play a significant role in both the market and current research activity, without being considered the only or dominant ones. The most notable among these include:
  • Heat pumps combined with energy storage
  • Solar-assisted heat pumps
  • Heat pumps in district heating and cooling networks

2.2.1. Heat Pumps Combined with Energy Storage

Regarding heat pump systems combined with energy storage, this is divided into thermal energy storage (TES) or electrical energy storage (EES). Integrating a heat pump system with TES can provide multiple benefits, such as increased energy efficiency, reduced energy costs, enhanced energy security, and improved flexibility. The incorporation of TES with heat pump systems decouples heat production from its use, allowing high-power heat generation to be shifted to off-peak hours [6]. TES systems can be classified according to various criteria, such as the temperature level of the stored energy (hot or cold storage), the storage duration (short-term or long-term), and the mechanism applied for energy storage (sensible heat storage, latent heat storage, chemical reaction heat storage) [7]. Beyond thermal energy storage, electrical energy storage also provides numerous benefits in reducing energy costs and enhancing the flexibility of a heat pump system. A common form of electrical energy storage is the use of batteries to store electricity generated from photovoltaic (PV) systems. In this way, a Battery Energy Storage System (BESS) can supply power not only to the heat pump but also to other loads, such as lighting, in a residential building. Since PV systems play a significant role in the operation and efficiency of heat pumps, this study briefly refers to solar-assisted heat pump systems.

2.2.2. Solar Assisted Heat Pump Systems

Solar-assisted heat pump systems lead to a significant reduction in their energy consumption and achieve a satisfactory level of energy independence from the national electricity grid, thereby contributing to the overall reduction in primary energy consumption and CO2 emissions [8]. In solar-assisted heat pump systems, the solar assistance usually comes from PV panels, solar thermal collectors, or PV/T collectors. In recent years, the installation of photovoltaic systems integrated with heat pumps for heating and cooling has gained significant popularity, not only in residential buildings but also in commercial and other applications. The efficiency of these systems increases when combined with a BESS; however, the initial installation cost also rises sharply due to the high cost of purchasing batteries. Regarding heat pump systems combined with solar thermal collectors or PV/T, they can be classified into two main types: Direct Expansion (DX) and Indirect Expansion (IDX) systems [9]. In DX systems, the solar collector functions as the evaporator of the heat pump, with the working fluid in the collector acting as the refrigerant. Finally, the combination of different solar collectors with thermally driven cooling technologies also attracts significant research interest, as these systems can efficiently utilize low-temperature sources such as solar energy, while exhibiting lower power consumption compared to conventional vapor compression cooling systems, since they do not require a mechanical compressor [10].

2.2.3. Heat Pumps in District Heating and Cooling Networks

District energy can significantly contribute to lowering carbon emissions in heating and cooling by maximizing energy efficiency, integrating renewable energy sources, and facilitating sector coupling, according to the Heat Roadmap Europe [11]. In recent years, sustained research efforts have been directed toward developing new technologies for district network systems. The evolution of district heating and cooling systems is commonly characterized by a series of “generations,” where each successive generation attains improved efficiency through the incorporation of advanced technologies, as shown in Figure 4.
In the past, heat in district heating networks was produced from industrial processes or from combined heat and power (CHP) plants, where waste heat from the combustion of fossil fuels was utilized by the district heating system. Nowadays, as the supply temperatures to end users are lower, the use of heat pumps plays a crucial role. Overall, heat pumps can contribute to the more sustainable development of district heating systems while increasing their operational flexibility, enabling them to better adapt to fluctuating demand and the integration of renewable energy sources. The level of flexibility in district networks is largely determined by the control strategies applied within the system [13]. The exploration of various potential strategies is a major focus of both the research and academic community, as will be illustrated in the following chapters of this paper.

2.3. Flexibility and Demand Response Capabilities of Heat Pumps

In order to achieve the decarbonization targets of the building stock and beyond, the penetration of heat pumps is considered essential. However, electrifying all buildings could significantly increase electricity demand and introduce mismatches between supply and demand, complicating grid management. Therefore, investigating how such systems can provide flexibility to the grid, facilitating the integration of renewable energy sources, represents an important research issue. An effective measure to address this issue is the demand response (DR) mechanism [14]. The DR mechanism falls under demand-side management and allows the adjustment of energy consumption according to the requirements of the grid, utilizing the variability of power and capacity of energy systems. The integration of DR technologies can help stabilize the grid by providing services such as load shedding and demand redistribution. Heat pump systems are capable of providing demand response (DR) services to the power system since their electricity consumption is inherently flexible [15]. For residential heat pumps in particular, the deployment of suitable control schemes and communication links between the heat pump, the building energy management system, and the power grid is essential. The main factors affecting the flexibility of heat pumps are the thermal demand, the size of the heat pump, the storage capacity, and the dynamic properties of the system [16].
An important role in reducing real-time imbalances in the electricity grid is expected to be played by advanced control strategies for heat pump systems [17]. The development of artificial intelligence algorithms for the control and optimization of these systems has become a key area of current research. The following chapters focus on modern control and optimization methods for heat pump systems within the context of digitalization and AI.

3. Control Approaches for Heat Pump Systems

For the optimization of heat pump systems, optimal control is essential in combination with reliable system modeling and characterization of the parameters affecting it, utilizing appropriate optimization techniques. System control strategies, whether conventional or advanced, aim to ensure system stability and improve energy efficiency, increasingly leveraging the benefits of artificial intelligence. This chapter presents control methods for various energy systems with a focus on heat pumps and provides an extensive literature review of these approaches. Emphasis is placed on modern and highly promising methods that integrate artificial intelligence to achieve greater energy efficiency and flexibility in heat pump systems. Initially, the essential concepts for each control system are introduced.
The control of a system consists of two subsystems: a controlling system and a controlled system. They are also referred to as the controller and the plant, respectively. Furthermore, control is categorized as closed-loop control (CLC) when the controlling system receives feedback from the controlled system. Conversely, it is termed open-loop control (OLC) if no such feedback is received. This feedback is generated by the input control signal sent to the controlled system and typically reflects the result of the optimization performed [18]. Various control strategies for heat pump systems can be grouped into three main categories [19], as illustrated in Figure 5.

3.1. Classical Control

The main strategy applied in the classical control approach is Rule-Based Control (RBC). The RBC strategy is a heuristic method that has been extensively used for the control of heat pump systems. This strategy is based on a set of if–then rules, such as, for example, turning on the heat pump when the temperature of a heated space falls below a predetermined setpoint. The RBC strategy is widely used in building applications, often in combination with PID controllers. RBC techniques typically operate at the supervisory level to manage overall operational strategies, while PID controllers handle local HVAC components and internal building dynamics [20]. In such systems, significant time response delays are observed in PID control compared to strategies with adaptive and predictive control capabilities. RBC techniques also lack continuous adaptation to changing conditions [21]. Consequently, these control systems are unable to provide optimal real-time control for HVAC systems, as their operation is influenced by numerous factors, such as external weather conditions, building occupancy, and equipment variations, which are often uncertain or difficult to predict based on predefined rules [22]. These limitations highlight the need for extensive research into the development of advanced control strategies, such as model-based and model-free approaches, particularly with the assistance of artificial intelligence. However, under the RBC framework, focus should be given to the development of predictive RBC (PRBC), which can potentially strike a balance between the higher computational demands of advanced control strategies and the advantages offered in terms of energy savings for heat pump systems. In this context, studies have been conducted highlighting the usefulness of PRBC in enhancing the flexibility of residential heat pump systems and demonstrating the significant economic benefits achieved by reducing electricity consumption by heat pumps [23,24]. Notably, the benefits of using PRBC were also observed in an office building, both from an environmental perspective, with a consequent reduction in CO2 emissions, and from an economic standpoint, with a recorded decrease in heating costs [25]. However, studies that combine the classical RBC strategy with artificial intelligence algorithms are of particular interest, such as the study by Hu et al. [26], where an ANN is employed to predict the building’s load and energy usage, and is combined with a rule-based control optimization model to improve the performance of the HVAC system. Similarly, the study by Meng et al. employed the RBC strategy combined with LSTM (Long Short-Term Memory) for energy load forecasting and the implementation of a demand response peak regulation strategy for the winter [27]. Finally, Betzold et al. [28] conducted a comparative study of a PRBC and an MPC for space heating and DHW provision in a residential building, with results showing that MPC offers a comparative advantage in terms of cost savings. Thus, Table 1 summarizes the aforementioned studies.
However, the limitations described earlier and the rapid development of artificial intelligence have led to the emergence of advanced control strategies, which have received increasing attention from the scientific community in recent years. The following section presents the key elements that constitute a model-based control approach, as well as a number of relevant studies on its application in heat pump systems.

3.2. Model-Based Control

In a model-based approach, controller design relies on the dynamic model of the system being controlled. The most widely used model-based control technique is Model Predictive Control (MPC), which incorporates system constraints and carries out optimization using predictive system behavior. As shown in Figure 6, a typical structure of a Model Predictive Controller. An MPC consists of a prediction model and an optimizer. At each time step k, the prediction model uses disturbance forecasts (e.g., internal loads, dynamic prices, etc.) to estimate the future system states at every step of the prediction horizon. The optimizer, taking into account the objective function, constraints, and the model predictions, determines the sequence of optimal control actions, of which only the first is applied to the real system. These control actions are implemented in the real system, which is also subject to external disturbances. The system returns the measured outputs to the MPC, allowing the control process to be continuously updated.
The optimization problem that the MPC optimizer is required to solve at each time step depends on the nature of the controlled system and the defined cost function. A common approach for classifying optimization methods is to distinguish between mathematical programming methods and metaheuristic methods [29], as illustrated in Figure 7.
Mathematical programming employs various numerical techniques to solve optimization problems, and the choice of the appropriate method depends on the functions and dynamics of the system being modeled. The processes and components of a system may be linear in theory; however, in practice, complex systems such as heat pumps are typically nonlinear. Linear algorithms are generally simpler and faster to implement, but their formulation requires careful consideration, as approximating nonlinear behavior can affect the accuracy. Consequently, nonlinear systems are often represented using linear algorithms to obtain practical and effective solutions. In general, the main techniques used for solving optimization problems in energy control systems include Linear Programming (LP) [30], Mixed-Integer Linear Programming (MILP) [31], Mixed-Integer Quadratic Programming (MIQP) [32], Mixed-Integer Nonlinear Programming (MINLP) [33], and Dynamic Programming (DP) [34]. Moreover, nonlinear optimization methods are also required when the objective function includes nonlinear terms, such as the nonlinear formulation of the heat pump COP. For this purpose, metaheuristic methods, such as Particle Swarm Optimization (PSO) [35] and Genetic Algorithms (GA) [36], are often employed. In general, the metaheuristic approach is a higher-level process that seeks the optimal solution through trial and error. These methods provide relatively fast solutions with low computational effort. However, they are often approximate solutions [29].
Generally, MPC has been an especially popular control strategy in recent years, as it offers the capabilities to model and counteract disturbances, control varying operating points under time-dependent system dynamics, and consider multiple optimization variables with different priorities. As evidenced by the conducted literature review, MPC has been applied in a wide range of heat pump system applications, from building applications to industrial systems, as well as district heating and cooling systems.
The results from the application of MPC in building-related applications, such as office buildings, are particularly encouraging [37,38,39], as they demonstrated that significant energy cost savings can be achieved in offices [37], a notable reduction in CO2 emissions while maintaining thermal comfort conditions within the spaces [38], and also substantial benefits in heat pump flexibility with the grid, as the approach by Tang et al. [39] showed that the proposed MPC not only provides demand response services by exploiting the energy flexibility of the heat pump and thermal energy storage system but can also operate safely and efficiently to deliver adequate domestic hot water.
Thereafter, the implementation of MPC has shown particularly promising results in enhancing the energy flexibility of residential buildings [40,41,42], thereby enabling effective peak load shaving during peak periods. Building upon this concept, studies [43,44,45] applied MPC algorithms to increase the self-consumption of PV-generated energy, consequently reducing the cost associated with heating, cooling, and domestic hot water (DHW) provision. However, with the increasing penetration of heat pumps and other temperature-controlled loads, the frequency regulation capability from the generation side is reduced, making it necessary to develop frequency regulation strategies from the load side. Model Predictive Control (MPC) offers a highly promising approach to addressing these challenges, as demonstrated by studies [46,47].
Especially noteworthy are studies on the integration of MPC in the control of ground source heat pump (GSHP) systems. Since GSHP systems require a high initial investment, the effective optimization of their design and control is fundamental for enhancing their energy performance and reducing the payback period. Various studies have focused on the use of model predictive control [48,49,50,51]. Notably, study [49] utilizes aquifer thermal energy storage (ATES), which is distinguished by its high storage capacity, exploiting groundwater-saturated aquifers. However, long-term operation may reduce the geothermal potential of the soil and cause thermal imbalance. In this context, study [50] focuses on the development of MPC, where the model emphasizes the spatial distribution of temperature variations in the subsurface, while study [51] applies MPC by defining the objective function in a way that also accounts for the long-term effects on the subsurface caused by the boreholes used for the system installation.
Another significant field with strong research interest is the study of district heating and cooling networks. To achieve the targets set for climate neutrality in buildings, heat pumps play a central role in such systems. Consequently, in the optimal control of such systems, MPC plays a critical role, as it can provide significant benefits, such as minimizing energy use by exploiting system flexibility and optimizing district heating and cooling (DHC) network operation while reducing electricity consumption through the flexibility of thermal energy storage (TES), as demonstrated by studies [52,53]. Moreover, the integration of heat pumps with thermal energy storage systems can serve as a key strategy for decarbonizing industrial energy supply, particularly when processes do not operate continuously. In this context, the study by Agner et al. [54] employed MPC to ensure the reliable operation of non-continuous industrial processes under real-world disturbances. Fuhrmann et al. [55] also employed predictive control to prevent potential bottlenecks in energy supply and to ensure the smooth operation of batch processes, thereby demonstrating in turn the value of MPC in maintaining production reliability in industrial batch-process applications.
Furthermore, regarding large-scale heat pumps, it is important to note that in many industrial activities, 30–70% of the input energy is lost as heat. Therefore, utilizing this waste heat can significantly reduce energy consumption and serve as a powerful decarbonization tool. Heat recovery directly reduces fuel consumption, CO2 emissions, and operating costs, but at the same time further increases the complexity of the process. Also, residual heat usually refers to lower-temperature heat (<100–150 °C) that is harder to reuse but still valuable. Exploiting the benefits of MPC, studies [56,57] have shown not only an improvement in the operational performance of district heating systems utilizing industrial waste heat, but also the ability of MPC to satisfactorily ensure the operational constraints of the systems. However, with the integration of heat pumps into industrial processes, it is also necessary to ensure system stability and correct power distribution [58]. In this context, the study by Rasmussen et al. [59] proposes an MPC-coordinated primary frequency control for both small- and large-scale heat pumps (HPs). Finally, regarding large-scale heat pump systems, notable are the results of a study on a high-power air-to-water transcritical CO2 air-source heat pump, which demonstrated that MPC can regulate the heat rejection pressure close to its optimal value, with a very small relative error (0.3–0.8%), thereby ensuring optimal heat pump performance [60].
Depending on the specific problem, the targeted objective function, the imposed constraints, and the modeling complexity, MPC can be classified into different types, such as Adaptive MPC, Robust MPC, and Stochastic MPC. Adaptive MPC offers the capability to dynamically adjust model parameters, such as the COP function, ensuring more efficient and flexible real-time control [61,62]. Robust MPC is used when stability and performance criteria must be satisfied despite all possible model variations and noise signals, provided that the uncertainty remains within set limits [63]. This approach often achieves reductions in operating costs and improvements in system reliability, particularly when combined with machine learning techniques for demand or load prediction [64,65]. Finally, Stochastic MPC, which employs a stochastic dynamic model of the process to predict its potential future evolution, also deals with uncertainties—but through probabilistic distributions—and can be applied both to residential systems and to larger-scale energy networks [66,67].
Beyond the various studies highlighting the interest in using MPC in heat pump systems, significant results have also emerged from field studies and different experimental setups. Field studies have managed to achieve reductions in operational costs [68,69] and significant improvements in efficiency [70], thereby generally supporting the further application of predictive control and advancing the technology toward greater emission reductions [71,72]. Similarly, laboratory and experimental studies, including hardware-in-the-loop tests [73,74], achieved significant energy savings, demonstrated the ability of the MPC controller to perform load shifting and exploit system flexibility [75], and were also applied to pioneering systems, such as thermal energy storage integrated into a ceiling with PCM, embedded in an air-source integrated heat pump (ASIHP) system [76].
The challenges faced by all modern energy systems concern both the impacts of climate change and operational parameters that require optimization, such as heat pump noise. For this reason, the present review also presents studies that are expected to play a central role in research over the coming years. In the context of climate change, the design and operation of heat pump systems must take into account that we are moving towards conditions where cooling demand clearly dominates [77]. Within this framework, both simulation studies [36,78,79] and experimental studies [80] examine the application of an MPC controller exclusively for addressing cooling loads. Finally, an area of growing research interest concerns the reduction in noise generated by heat pumps. Both [81] and [82] demonstrate that Model Predictive Control (MPC) can effectively reduce noise emissions from heat pumps while maintaining or improving energy efficiency. These studies highlight the potential of MPC to manage compressor and fan speeds, leveraging system flexibility to minimize acoustic discomfort in building applications. Table 2 summarizes all the aforementioned studies.
Despite the extensive research conducted on the use of MPC in heat pump systems, its implementation remains limited. First, the development of such models is a laborious process, as these approaches rely heavily on the mathematical models of the system under consideration. The development of accurate models is demanding due to their complexity and nonlinear characteristics [19]. Data-driven MPC attempts to address this issue; nevertheless, it remains a challenging task for MPC-based operation. Moreover, MPC-based control is a computationally demanding process, as it requires solving the optimization problem at each control step. This often prevents its real-time implementation, especially in more complex systems. Thus, given the continuously growing research activity in the field of artificial intelligence, model-free control approaches for heat pump systems have gained significant attention because of their flexibility and their ability to adapt to complex and dynamic systems without relying on explicit mathematical models.

3.3. Model-Free Control

The model-free control approach can implement control strategies in a system without the need to develop its mathematical model, as it interacts with the environment and adapts to changes in the system’s dynamics. The main categories observed in the literature for controlling heat pump systems are Neural Networks, Reinforcement Learning, Fuzzy Logic, and Extremum Seeking Control.

3.3.1. Neural Network Control

Neural networks represent an attractive option for controlling complex and nonlinear systems, such as heat pump systems. Control via neural networks is considered a model-free approach, as they learn directly from data and dynamically adapt to capture the underlying system dynamics without requiring explicit knowledge of its internal workings [19]. The general structure of a neural network consists of interconnected neurons, organized into three layers: the input layer, the hidden layer, and the output layer. The outputs (control commands) are calculated based on the system inputs, the network weights, and the network transfer function [83]. The training of neural networks involves adjusting the internal parameters in order to minimize a function that quantifies the difference between the predicted actions and the truly optimal control actions. A typical structure of a neural network controller is shown in Figure 8, which illustrates the control of the optimal mass flow rate in a solar system for domestic hot water supply in a residential building.
The accuracy of the actions performed by a neural network depends on its structure, the number of neurons, and its training algorithm. Thus, there are various neural network structures, with the most used architectures for controlling heat pump systems being recurrent neural networks (RNNs) such as LSTMs and feedforward neural networks (FNNs). One of the main challenges faced by neural networks and other model-free approaches is that the learning process requires a significant volume of reliable data, which is frequently difficult to collect in real-world settings.
Nevertheless, neural network controllers can significantly contribute to reducing the operational energy costs of heat pump systems and enhance their flexibility in managing electricity consumption. Particularly in residential buildings, they allow the adaptation of heat pump operation to changing external conditions, ensuring that energy consumption dynamically responds to actual demand, resulting in significant improvements in the performance of a designed heat pump system [85,86,87].

3.3.2. Reinforcement Learning Control

Research on the application of Reinforcement Learning for heat pump system control has been increasingly active in recent years. Reinforcement Learning (RL) is a subfield of Machine Learning (ML) and represents a learning method in which an agent learns to make decisions based on the current state of the environment by trying different actions within it. The agent receives feedback in the form of rewards for each action it performs. Its goal is to learn which actions, in each state, lead to the best outcomes in order to maximize the overall reward in the long term. An illustrative example of applying RL to a heat pump system is shown in Figure 9, where a residential heat pump system with a hot water storage tank (environment) is controlled by a central controller (agent). The agent’s actions relate to the decisions taken by the agent to control the environment, such as the supply temperature to the heating radiators. The states feed the agent and are retrieved from the environment. Typical examples include the outdoor air temperature and the COP of the heat pump. Finally, the reward represents the evaluation of the environment by the agent based on the action taken and may include a penalty for failing to achieve thermal comfort conditions.
Other key components of RL include the value function, the policy, and the model of the environment [88]. The value function of a state estimates the total expected reward that can be accumulated in the future starting from that state, indicating how “beneficial” or desirable the state is in the long term. The policy determines the agent’s decision-making process, defining the chances of selecting each action in a given state. Finally, the environment model is designed to replicate the behavior of the environment. However, the existence of a model is optional and is used only in Model-based RL approaches. In the Model-free RL approach, the agent learns through trial and error by interacting with the environment, without requiring a model of the environment. It operates by having the agent learn the best actions for each state, without establishing probabilities for transitions between states [88]. The Reinforcement Learning (RL) problem is typically formulated based on Markov Decision Processes (MDPs), which are mathematical tools used to describe the interaction between the agent and the environment, as well as how the agent’s actions are rewarded [61]. Furthermore, there are three primary approaches for executing RL: value-based, policy-based, and actor-critic. In value-based reinforcement learning, the key aim is to determine the value function. In the policy-based approach, the main objective is to optimize the policy that the agent uses to determine its actions, while the actor–critic approach combines both methods: the actor learns to select actions that maximize rewards, and the critic evaluates the chosen actions using the value function. Finally, it should be noted that many applications in studies concerning the control and optimization of heat pump systems implement Deep Reinforcement Learning (DRL). DRL is a modern Machine Learning method that combines the ability of Deep Learning to handle nonlinear states with the reliable decision-making capability of Reinforcement Learning [88].
In the literature, Reinforcement Learning is the most widely used model-free control approach. Numerous studies, including simulations as well as experimental and laboratory setups, have been carried out to highlight the effectiveness of this approach. The majority of studies focus on small-scale heat pump systems, primarily in residential buildings. RL-based control plays a key role in the intelligent management of such systems, offering energy savings and enhancing their flexibility, as it can shift heating loads to periods where the heat pump can operate most efficiently, as demonstrated in the study by Rohrer et al. [89]. Additionally, it contributes to the prevention and protection against decreased efficiency or even faults, as demonstrated by the study of Klingebiel et al. [90], who developed a self-optimizing defrost start controller using DRL. Significant energy savings have been reported in previous studies [91,92] on residential heating. Furthermore, the application of reinforcement learning (RL) control has demonstrated notable advantages, including enhanced self-consumption, increased energy flexibility, and improved energy efficiency while maintaining thermal comfort, in the context of residential heat pump systems integrated with photovoltaic (PV) [93,94,95,96,97] or even photovoltaic-thermal (PVT) systems [98]. Significant research has also been conducted in non-residential buildings, such as university campuses [99,100,101], as well as in larger-scale energy systems, including district heating networks [102,103] and the industrial sector [104]. Heat pump technologies that utilize waste heat [105,106,107] are of particular interest for improving system performance, promoting their faster adoption, and replacing energy generation units based on fossil fuels. The control of these systems using DRL significantly contributes to accelerating their deployment. Also, a notable study in the international literature is the study by Diaz et al. [108], which applied an RL-based control approach to optimize the operation of an absorption solar cooling system.
Finally, experimental studies also have an important role in demonstrating the efficiency and operational optimization of systems through Reinforcement Learning (RL)-based control. In the present review, several experimental studies published in recent years are highlighted, such as the study by Klingebiel et al. [109], which focused on the development of a self-optimizing defrost controller for air-to-water heat pumps (ASHPs). The study was conducted on a hardware-in-the-loop laboratory setup, where the RL agent demonstrated high control performance in this experimental environment. In the same context, studies [110,111,112] also confirmed the applicability and effectiveness of RL or DRL controllers in heat pump systems. However, the study by Boutahri et al. [110] is of particular interest, as the results obtained from the real-world case study showed smaller performance gains compared to those achieved in the simulations.

3.3.3. Fuzzy Logic Controller

Another control approach that has attracted considerable research interest is Fuzzy Logic Control. Fuzzy logic control is a model-free approach, as it does not require a mathematical model of the controlled system but relies on human reasoning and a linguistic model, applying simple mathematics in cases of nonlinear and complex systems [83]. Fuzzy Logic Controllers (FLCs) use fuzzy sets to handle imprecise and uncertain data. Fuzzy sets are defined through membership functions, which map input values to a degree of membership within the range [0, 1]. An FLC is based on a predefined rule base, usually constructed from empirical knowledge or historical data. Such rule bases may include, for example, rules like “If the room is cold, then activate heating”. In general, the typical structure of an FLC is illustrated in Figure 10 and consists of fuzzification, rule base, inference engine, and defuzzification [19].
Specifically, during fuzzification, crisp input data are transformed into fuzzy sets, while the rule base contains a set of “if–then” rules that define the controller’s behavior. Next, the inference engine, taking the crisp inputs and the rule base as input, calculates the fuzzy output set. Fuzzy inference systems are generally divided into two types: Mamdani and Sugeno. The Mamdani type is typically used for all kinds of systems, whereas the Sugeno approach is applied mainly to dynamic, nonlinear systems. Finally, the last step is defuzzification, where the fuzzy output sets are converted into crisp control commands. As conditions change, the system continually evaluates the rules, ensuring efficient operation. However, the frequent need for expert knowledge to define rules and membership functions in Fuzzy Logic Control (FLC) highlights the importance of integrating fuzzy logic with other methods. A relatively common and effective solution is the Adaptive Neuro-Fuzzy Inference System (ANFIS), which merges the learning abilities of artificial neural networks with the interpretability and flexibility of fuzzy logic, enabling enhanced system performance and smoother adaptation to optimal operating conditions.
Encouraging findings regarding the use of FLC for heat pump systems emerge from the present literature review. An increase in the energy efficiency of such systems is observed both in heat pump systems providing heating and cooling to buildings [114,115] and in other applications, such as the study by Rodríguez-Valderrama et al. [116] on a domestic refrigerator, the study by Chuensiri et al. [117] on a Hybrid Ground Source Heat Pump (HGSHP) system, and the study by Kim et al. [118] on a solar geothermal hybrid heat pump system. Finally, experimental studies on the application of FLC control in heat pump systems appear to be limited. However, the results from the experimental work of Langner et al. [119] indicate that cost reduction and the maintenance of thermal comfort can be effectively achieved through the use of such a model-free control approach.

3.3.4. Extremum Seeking Control

Finally, another commonly used control approach is Extremum Seeking Control. Extremum Seeking Control (ESC) is a real-time, adaptive, model-free control strategy capable of adapting to a system’s unknown dynamics [2]. ESC is a dynamic gradient-search method. Using pairs of dither-demodulation signals along with proper filtering, the gradient estimation is performed in real time. As shown in Figure 11, which illustrates a simple form of an Extremum Seeking Control, the basic idea is to introduce a signal to the system. Then, by observing and acting on the output of the performance function, it becomes possible to estimate the gradient, which can be used by a state regulator (e.g., an integrator) to iteratively search for an extremum point.
An important advantage of using ESC is that the optimum search process is linked to the dither frequencies, making it nearly independent of process variations, external disturbances, and measurement noise at other frequencies. However, the convergence to the optimal operating point using the classical perturbation-based ESC scheme may occur at a rate where the time scales are slower than the process dynamics. A common approach to address this slow convergence is the development of a proportional–integral ESC (PI-ESC). The final category of model-free control discussed in the paper is Extremum Seeking Control (ESC).
From the literature review, it was found that most studies investigating ESC in heat pump systems primarily focus on high-temperature heat pumps [121,122,123]. Gong et al. [121] use ESC to achieve a reduction in power consumption of a high-temperature vapor-injection heat pump system. Liu et al. [122] implemented a multivariable ESC for a high-temperature heat pump system with double refrigerant injection technology, while Wang et al. [123] applied ESC to a Cascade Heat Pump system (CASHP) aiming to minimize power consumption and maximize COP, achieving satisfactory convergence under both steady-state and transient conditions, thereby confirming the feasibility of the proposed control strategy. Finally, beyond the application of ESC in high-temperature heat pump systems, there is significant interest in its use for residential heat pumps, as shown by Vering et al. [124], aiming to optimize both energy performance and acoustic behavior. Simultaneous energy and acoustic optimization is expected to attract considerable research attention, as, according to a study [124], the acoustically optimal configuration does not coincide with the energetically optimal one. Table 3 summarizes all the aforementioned studies based on model-free control approaches. It should be noted that in some studies, the application field of the investigated system is not specified.
In summary, model-free control approaches rely solely on data and do not require an accurate mathematical model of the system. However, they require a large volume of data to train their algorithms. With the increasing research activity around sensors and IoT devices, the amount of available data for training model-free control approaches is expected to grow significantly. However, two major challenges that remain in the operation of these approaches are: the difficulty of enforcing constraints during system operation and the limited interpretability of the decisions they sometimes produce. The next section presents recent trends and studies that aim to combine the advantages of model-based and model-free approaches in order to improve the performance, safety, and flexibility of control systems.

3.4. Recent Advances in Predictive and Learning-Based Control

As shown in the literature review of model-free control approaches, the dominant technique, both in simulations and experimental studies, was Reinforcement Learning (RL). Among the main advantages of model-free RL were its ability to handle complex systems as well as its high adaptability. However, a challenge lies in the extensive data training required by RL and its dependence on sample efficiency. An alternative approach for accelerating learning and promoting the application of RL in heat pump systems is model-based RL. With the incorporation of a model into RL, the dependence on real-world interactions is reduced, and the convergence speed is improved [125]. In this context, an important comparative study of model-free and model-based RL control was presented by Gao et al. [126], where they compared the performance of model-free and model-based RL for heat supply from a heat pump in a residential building. The comparison was also made against a baseline control strategy, and it was shown that both RL control approaches outperformed it. Notably, model-based RL achieved almost the same performance as model-free RL but with a shorter training time due to its higher sample efficiency. Additionally, field tests for the application of model-based RL in a residential heat pump system were presented by Soares et al. [127], where the model-based RL control approach contributes to the maximization of instantaneous self-consumption of local photovoltaic generation. Nevertheless, model-based RL also has limitations due to the need for accurate system modeling.
Model Predictive Control (MPC) is also a well-established approach, with numerous studies conducted both at the simulation level and through experimental investigations. Despite several experimental studies on heat pump systems, MPC has not been widely adopted in practice, mainly due to the high requirements for data infrastructures as well as the lack of know-how among practitioners. A solution that addresses these issues is approximate MPC. The goal of approximate MPC is to find explicit control laws by approximating the input–output relations of the optimal controller (teacher). The logic of approximate MPC is based on the process of imitating the optimal controller, which is usually carried out by applying machine learning algorithms. These models substitute the MPC optimization and replicate its outputs for the same inputs. In this way, a new controller is obtained, consisting of the trained model, known as the approximator, and consequently, no optimization during closed-loop operation is required [3]. As shown in Figure 12, the development and evaluation of an approximate MPC starts with the creation of the MPC controller to control the system. Next, the controller is applied to the system, and the system’s operational data are collected as input/output data. These input/output data are then used to train the approximator. The most popular approximators in the literature are decision tree-based algorithms and artificial neural networks (ANN). Finally, the approximate MPC controller is implemented in the system, and its performance is evaluated [128].
In recent years, there has been growing research interest in the development of approximate MPC (approximator MPC) for heat pump systems. For example, Maier et al. [129] investigated the use of random forest (RF) algorithms to extract control rules from optimal MPC strategies in heat pump systems with storage, successfully mimicking the MPC performance while also improving the comprehensibility and practical applicability of the rules. Löhr et al. [130] applied an approximate MPC using neural networks for a heat pump system with thermal energy storage to provide heating and domestic hot water (DHW) in a building, while Maier et al. [131] implemented an approximate MPC for a hydronic heat pump system based on the BOPTEST framework, performing a comparison of various advanced approximators, specifically between artificial neural networks (ANNs), random forest (RF), linear, and logistic regression. The study demonstrates that the feature selection of the approximator significantly affects performance, while also showing the superiority of ANNs and RF.
Although approximate MPC requires less sophisticated hardware and software since the control laws are explicitly computed through model inferences, it nevertheless presents a significant drawback, which is its reliance on datasets collected from closed-loop experiments. A new control method combines machine learning techniques with MPC, aiming to reveal system behaviors that cannot be extracted from collected data, while also eliminating the need for a complete redesign of the system, which MPCs often face. A particularly promising combination is that of RL with MPC, where MPC can function as a function approximator, in which the actions are explicitly imposed on the model during the prediction horizons. This new control method is called differentiable MPC [20]. The differentiable MPC has the capability to optimize the model parameters end-to-end. This specific method was also adopted by Chen et al. [132] for the planning and control of HVAC systems. Another possible combination of RL with MPC focuses mainly on the explicit imposition of the action in the first control step, using the controller model, and employs the RL value function for the remaining prediction horizon. Such a study was presented by Arroyo et al. [133], where they investigated an RL-MPC control combination for providing heating in a residential building. Furthermore, a notable study on the combination of MPC and RL for advanced control of residential heat pumps was presented by Lee et al. [134]. In particular, they employed a Reinforcement Learning-based method, Behavioral Cloning, to model an MPC policy using a neural network. Unlike approximate MPC, this approach allows the controller to generalize across different buildings and electricity tariffs, whereas approximate MPC requires separate training for each building. Finally, also interesting is the approach of Cai et al. [135], who proposed an MPC-based RL approach for home energy management systems, specifically the parameterization of the MPC model, the cost function, and the constraints, and the training of these parameters through RL, with the aim of obtaining an optimal policy that satisfies both economy and comfort. All the aforementioned studies are summarized in Table 4.
The potential for applying these methods to different heat pump systems appears particularly high, especially within the ongoing transition toward the digitalization of energy systems. In this context, research in energy systems is increasingly focusing on the development of Digital Twins (DTs). A Digital Twin represents a modern approach for system validation and experimental assessment, bridging the gap between conventional simulations and real-world implementation. Specifically, a Digital Twin consists of three components. The first component refers to the physical system (Physical Twin), the second to the digital representation of the physical system and its continuous operations (Digital Twin), and the last to a bidirectional exchange of data and information, which allows the Digital Twin to be updated and provides feedback to the Physical System [136]. The integration of these advanced control strategies with the capabilities of Digital Twins enables real-time monitoring and simulation of systems, making predictive optimization possible, supporting performance evaluation under different operational and environmental conditions, and facilitating informed decision-making for energy-efficient operation [137].

4. Discussion

This article examines advanced control approaches for heat pump systems, taking into account the demands of the current energy transition. This section summarizes the main general trends identified in the study, aiming to highlight future research directions on the advanced control of heat pump systems.
Specifically, a critical analysis of the findings from this review is conducted, offering a better-grounded overview of prevailing trends, recurring design patterns, and underexplored areas. This critical analysis is carried out through the visualization of the relationships between control strategies, application domains, and corresponding benefits, as illustrated in Figure 13 and Figure 14. In Figure 13, a diagram is presented that directly maps each control type to the dominant performance indicators optimized in each study, allowing for a clear comparison of their research focus.
It should be noted, however, that the results of a study may simultaneously involve multiple performance indicators in the case of multi-objective optimization. From the diagram in Figure 13, it is evident that the primary objective of all problems is the minimization of energy consumption and, consequently, operational costs. Furthermore, there is a greater preference for using MPC to enhance system flexibility, primarily due to its predictive nature, while hybrid MPC-RL approaches have not yet been investigated in a sufficient number of studies to allow for conclusive findings. However, it should be noted that, in contrast to indicators related to cost, energy, and comfort, environmental metrics such as CO2 emissions do not demonstrate a clear research preference, as energy and cost savings are often directly equated with environmental benefits, without accounting for variations in grid emissions over time, which conflicts with carbon reduction goals. Thus, a pathway opens for studies that will take into account both this variation and the parameter of heat pump noise, which remains relatively unexplored, mainly of course for residential applications, as is also evident in Figure 14.
The second diagram (Figure 14) provides a more detailed representation, incorporating the intermediate level of application domains and revealing how different control approaches are distributed across various building and system environments. As illustrated, the advanced control methods explored to date have been mainly applied in residential settings and have yet to be extended to larger-scale systems, including district heating and cooling networks and industrial applications. It is noted that the N/A category mainly refers to experimental studies, in which the reason for meeting the required load is not sufficiently clarified. Furthermore, from this diagram, it is interesting to observe that applications with model-based control are more widespread in residential and non-residential contexts. The figure also confirms that industrial and district systems are more suited to model-free approaches due to the computational complexity required to model these systems. Additionally, the literature analysis highlights the research opportunities emerging in the field of intelligent control for large-scale and industrial applications, such as solar cooling systems, as well as in district heating systems, particularly with an emphasis on meeting the required thermal comfort conditions. Notably, according to the International Energy Agency (IEA) [138], heat pumps are increasingly emerging as a key technology for utilizing waste heat from data centers in district heating networks, opening new opportunities for energy-efficient and sustainable applications, as also demonstrated by the relevant studies in the review [56,107].
Also, ensuring system frequency (frequency regulation) is a key challenge for a resilient and efficient energy network, particularly as the electrification of energy systems expands. The increased pressure on existing infrastructure and the growing need for flexibility to balance fluctuations in electricity generation are expected to intensify. In this context, energy storage technology (TES) plays a central role in managing the mismatch between supply and demand. According to the review, many studies have examined the use of TES in their research, as illustrated in Figure 15. It is noteworthy that TES usage is particularly prevalent in industrial and DHC network applications, mainly due to the flexibility it provides, especially under batch process conditions. It is important to note, however, that underground thermal energy storage (UTES) differs from conventional thermal energy storage in terms of its dynamics, a factor that must be taken into account when designing an advanced control strategy for each system under study. In UTES, the thermal response of the ground is slow and depends on long-term heat accumulation or extraction, necessitating models with extended horizons and the prediction of thermal imbalance. In contrast, conventional thermal energy storage exhibits more predictable thermal behavior, enabling immediate corrective actions and more accurate short-term optimization. In this context, study [50] is particularly relevant, as the controller is designed so that the model focuses on the spatial distribution of temperature changes in the ground.
However, although the aforementioned control strategies for heat pump systems show highly promising results, they still lack experimental validation, as the collected studies indicate that only 14% are based on experimental data, while the remaining 86% rely on simulation-based studies. The implementation of advanced control methods through IoT devices is accompanied by several technical challenges. The diversity in the characteristics of IoT devices and the effort to integrate them with traditional building management systems lead to significant technical obstacles [139]. Moreover, the effective management of large amounts of data collected from various sensors and ensuring the reliability of the data against loss or tampering are critical requirements [140]. The decisions of AI algorithms for controlling heat pump systems rely on data from IoT networks, which may contain errors due to sensor battery depletion, device malfunctions, or communication loss. Failure to ensure the quality of this data can lead to incorrect decisions and reduced performance of the controlled system [141]. Therefore, for the successful integration of advanced control methods in heat pump systems, the accuracy of the sensors used must be particularly high, which, besides being rarely achieved, also increases implementation costs [69,71]. Specifically, regarding MPC, regular and computationally intensive optimization is required, particularly in real-time building applications where fast response times are crucial [139], while they also face unexpected transient phenomena that are usually not predicted by the model, negatively affecting their efficiency [75]. On the other hand, model-free control strategies require large amounts of data, making extensive and complete datasets necessary to achieve effective learning. Furthermore, the validation of a model in a real environment is critical for evaluating its performance, as it accounts for the real dynamics and complexity of the system during the learning process. However, implementing this validation faces significant challenges, such as limitations in computational infrastructure, security issues, and the need for access to real data [142]. Research efforts are directed towards addressing all the aforementioned issues to enable the successful implementation of advanced control in heat pump systems.
Finally, in this context, as also highlighted by the present review, the metrics related to computation and implementation, such as execution time performance and control loop frequency [143], remain significantly limited, serving as a factor that continues to hinder the widespread application of these control strategies in real-world systems, while it is also worth noting that most research efforts focus on optimizing heat pump systems during their operational phase, overlooking the impact that the initial system sizing and primary design have on their overall performance.

5. Conclusions

This review article provides a comprehensive analysis of the current state of optimal control of heat pump systems within the context of the ongoing energy transition. The findings emphasize the crucial role of heat pumps as a key technology for decarbonizing the heating and cooling sectors, supported by recent technological advancements. Compared to existing reviews, the present study provides a holistic and comparative presentation of advanced control strategies across all energy systems that focus on heat pumps, and it focuses not only on energy performance but also on operational flexibility, renewable energy integration, and demand management challenges. Its goal is to identify significant imbalances, limitations, and research gaps, thereby providing clear directions for the future development and optimization of these systems.
Thus, the study revealed that in the field of control strategies and operational optimization of energy systems with heat pumps, recent research increasingly focuses on methods supported by artificial intelligence algorithms. Model Predictive Control (MPC) is extensively examined; however, its application is often limited by high computational complexity. At the same time, Reinforcement Learning (RL) has emerged in the literature as the dominant model-free approach for advanced heat pump control, despite challenges related to processing large data volumes and enforcing constraints. Recent trends aim to combine the advantages of MPC and RL (e.g., Differentiable MPC, Approximate MPC) with the goal of developing more flexible, effective, and less computationally demanding controllers. A key direction for future research is the integration of advanced control strategies into the simultaneous optimization of the design of each energy system. This approach emphasizes that control and operational strategies should not be considered independently, but should be taken into account from the early stages of system design. In this way, the design can incorporate variable operating conditions as well as the interactions between components from the initial design phase.
The review indicates that significant barriers, such as high computational complexity and the requirements for data and model validation, greatly limit the practical implementation of experimental testing of advanced control methods in heat pump systems. Despite the existing challenges in the practical implementation of advanced control methods, developments in digitalization and artificial intelligence are expected to further enhance the efficiency and sustainability of heat pumps, accelerating their role as a key technology for the decarbonization of energy systems.
Summarizing all the above, and taking into account the points discussed in Section 4 of the discussion, future research directions could focus on the following:
  • Integrated Design and Operation Optimization using multi-objective optimization, taking into account the entire lifecycle of the system. This approach considers both energetic and economic aspects, aiming to develop sustainable and efficient energy systems.
  • Experimental evaluation of control algorithms in real-world engineering applications, combined with an analysis of their economic viability.
  • Further research is needed on advanced control methods for thermally driven heat pump systems and large-scale applications, such as solar cooling systems and district heating/cooling networks.
  • In 5th generation district heating and cooling (5GDHC) networks, heat pumps are installed at each consumer to adjust the supply temperature according to specific requirements, whether for heating or cooling purposes. This setup creates significant opportunities for further research, as the energy flexibility and resilience provided through advanced control represent additional benefits that become increasingly important in scenarios with high penetration of distributed renewable energy sources.
  • An emerging research direction is the investigation of heat pumps from an acoustic perspective. Evaluating the acoustic impact of heat pumps is crucial, especially in residential settings, where noise emissions can greatly influence the quality of life. Therefore, future research on the control of heat pump systems to optimize their acoustic performance is recommended.
These future research directions can be further supported and validated through experimental studies and practical demonstrations, which provide insights into real-world system behavior and performance. In conclusion, the integration of predictive, adaptive, and learning-based strategies, combined with digitalization and real-time data collection, can significantly enhance the performance, flexibility, and sustainability of heat pump systems.

Author Contributions

Conceptualization, K.S. and E.G.; methodology, K.S. and E.G.; validation, K.S.; data curation, K.S. and E.G.; writing—original draft preparation, K.S. and E.G.; writing—review and editing, K.S. and G.P.; supervision, E.G.; project administration, E.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

ANFISAdaptive-Network-based Fuzzy Inference System
ANNArtificial Neural Network
ASHPAir-Source Heat Pump
BESSBattery Energy Storage System
BOPTESTBuilding Optimization Testing Framework
CASHPCascade Heat Pump
CHPCombined Heat and Power
CLCClosed-Loop Control
DHWDomestic Hot Water
DHCDistrict Heating and Cooling
DPDynamic Programming
DRDemand Response
DRLDeep Reinforcement Learning
DXDirect Expansion
ESCExtremum Seeking Control
FLCFuzzy Logic Controller
FNNFeedforward Neural Networks
GAGenetic Algorithms
HGSHPHybrid Ground Source Heat Pump
HILHardware-In-the-Loop
HVACHeating, Ventilation, and Air Conditioning
IDXIndirect Expansion
IEAInternational Energy Agency
IoTInternet of Things
LPLinear Programming
LSTMLong Short-Term Memory networks
MBRLModel-Based Reinforcement Learning
MDPsMarkov Decision Processes
MIQPMixed Integer Quadratic Programming
MILPMixed-Integer Linear Programming
MINLPMixed-Integer Nonlinear Programming
MLMachine Learning
MPCModel Predictive Control
NNNeural Network
OLCOpen-Loop Control
PIDProportional-Integral-Derivative
PI-ESCProportional-Integral Extremum Seeking Control
PSOParticle Swarm Optimization
PVPhotovoltaic
PVTPhotovoltaic-Thermal
RBCRule-Based Control
RFRandom Forest
RLReinforcement Learning
RNNRecurrent Neural Networks
PRBCPredictive Rule-Based Control
STCSSolar Thermal Cooling Systems
TESThermal Energy Storage
T-STakagi-Sugeno

References

  1. European Council. European Green Deal, Consilium. 2019. Available online: https://www.consilium.europa.eu/en/policies/european-green-deal/ (accessed on 1 October 2025).
  2. IEA. Perspectives for the Clean Energy Transition–Analysis. 2019. Available online: https://www.iea.org/reports/the-critical-role-of-buildings (accessed on 1 October 2025).
  3. IEA. The Future of Heat Pumps–Analysis. 2022. Available online: https://www.iea.org/reports/the-future-of-heat-pumps (accessed on 1 October 2025).
  4. European Heat Pump Association. Thermally Driven Heat Pumps How They Work and Why They Matter. 2022. Available online: https://www.ehpa.org/wp-content/uploads/2022/10/220928_Thermally-Driven-Heat-Pumps_technology-report_online-1.pdf (accessed on 1 October 2025).
  5. Olympios, A.V.; Alonso, M.J.; Pantaleo, A.M. Heat pumps in Europe: Perspectives and innovation challenges. Prog. Energy 2025, 7, 032004. [Google Scholar] [CrossRef]
  6. Zhou, J.; Wu, W.; Bellamy, L.; Bishop, D. Thermal energy storage–coupled heat pump systems: Review of configurations and modelling approaches. Renew. Sustain. Energy Rev. 2025, 226, 116226. [Google Scholar] [CrossRef]
  7. Dheep, G.R.; Sreekumar, A. Influence of nanomaterials on properties of latent heat solar thermal energy storage materials—A review. Energy Convers. Manag. 2014, 83, 133–148. [Google Scholar] [CrossRef]
  8. Giama, E.; Sittas, K.; Chantzis, G.; Papadopoulos, A. Carbon neutrality in the building sector: Optimization of heat pumps operation in combination to PV/T as auxiliary renewable energy system. Energy 2025, 330, 136726. [Google Scholar] [CrossRef]
  9. Jonas, D.; Theis, D.; Meiers, J.; Frey, G. Model-Based Analysis of Solar Thermal and Heat Pump Systems Using TRNSYS. In Proceedings of the IEA SHC International Conference on Solar Heating and Cooling for Buildings and Industry, Abu Dhabi, United Arab Emirates (UAE), 29 October–2 November 2017. [Google Scholar] [CrossRef]
  10. Papingiotis, T.; Korres, D.N.; Koulocheris, V.; Koronaki, I.P.; Papaefthimiou, V.D. A comparative performance study of concentrating PV/T and ETC for solar adsorption cooling system in Greece. Renew. Energy 2025, 251, 123395. [Google Scholar] [CrossRef]
  11. Mathiesen, B.V.; Bertelsen, N.; Schneider, N.C.A.; García, L.S.; Paardekooper, S.; Thellufsen, J.Z.; Djørup, S.R. Towards a Decarbonised Heating and Cooling Sector in Europe: Unlocking the Potential of Energy Efficiency and District Energy; Aalborg Universitet: Aalborg, Denmark, 2019; p. 97. [Google Scholar]
  12. Simpson, J.G.; Long, N.; Zhu, G. Decarbonized district energy systems: Past review and future projections. Energy Convers. Manag. X 2024, 24, 100726. [Google Scholar] [CrossRef]
  13. Zouloumis, L.; Ploskas, N.; Panaras, G. Quantifying flexibility potential on district heating local thermal substations. Sustain. Energy Grids Netw. 2023, 35, 101135. [Google Scholar] [CrossRef]
  14. Chantzis, G.; Giama, E.; Nižetić, S.; Papadopoulos, A.M. The potential of demand response as a tool for decarbonization in the energy transition. Energy Build. 2023, 296, 113255. [Google Scholar] [CrossRef]
  15. Gaur, A.S.; Fitiwi, D.Z.; Curtis, J. Heat Pumps and Our Low-Carbon Future: A Comprehensive Review. Energy Res. Soc. Sci. 2021, 71, 101764. [Google Scholar] [CrossRef]
  16. Fischer, D.; Madani, H. On heat pumps in smart grids: A review. Renew. Sustain. Energy Rev. 2017, 70, 342–357. [Google Scholar] [CrossRef]
  17. Schibuola, L.; Scarpa, M.; Tambani, C. Demand response management by means of heat pumps controlled via real time pricing. Energy Build. 2015, 90, 15–28. [Google Scholar] [CrossRef]
  18. Schulz, M.; Kemmler, T.; Kumm, J.; Hufendiek, K.; Thomas, B. A More Realistic Heat Pump Control Approach by Application of an Integrated Two-Part Control. Energies 2020, 13, 2752. [Google Scholar] [CrossRef]
  19. Michailidis, P.; Michailidis, I.; Vamvakas, D.; Kosmatopoulos, E. Model-Free HVAC Control in Buildings: A Review. Energies 2023, 16, 7124. [Google Scholar] [CrossRef]
  20. Balali, Y.; Chong, A.; Busch, A.; O’kEefe, S. Energy modelling and control of building heating and cooling systems with data-driven and hybrid models—A review. Renew. Sustain. Energy Rev. 2023, 183, 113496. [Google Scholar] [CrossRef]
  21. Maddalena, E.T.; Lian, Y.; Jones, C.N. Data-driven methods for building control—A review and promising future directions. Control Eng. Pract. 2020, 95, 104211. [Google Scholar] [CrossRef]
  22. Michailidis, P.; Pelitaris, P.; Korkas, C.; Michailidis, I.; Baldi, S.; Kosmatopoulos, E. Enabling Optimal Energy Management with Minimal IoT Requirements: A Legacy A/C Case Study. Energies 2021, 14, 7910. [Google Scholar] [CrossRef]
  23. Naumann, G.; Schropp, E.; Gaderer, M. Predictive rule-based control for an air-source heat pump system: Environmental and economic impacts in future energy scenarios. Appl. Energy 2025, 396, 126298. [Google Scholar] [CrossRef]
  24. Avik, G.; Lu, X.; Adetola, V. Advanced Predictive Rule-based Control for HVAC Cost Reduction Under Dynamic Electricity Pricing in Residential Buildings. In Proceedings of the International High Performance Buildings Conference, West Lafayette, IN, USA, 15–18 July 2024; p. 438. Available online: https://docs.lib.purdue.edu/ihpbc/438 (accessed on 1 October 2025).
  25. Chantzis, G.; Giama, E.; Papadopoulos, A.M. Building Energy Flexibility Assessment in Mediterranean Climatic Conditions: The Case of a Greek Office Building. Appl. Sci. 2023, 13, 7246. [Google Scholar] [CrossRef]
  26. Hu, J.; Zheng, W.; Zhang, S.; Li, H.; Liu, Z.; Zhang, G.; Yang, X. Thermal load prediction and operation optimization of office building with a zone-level artificial neural network and rule-based control. Appl. Energy 2021, 300, 117429. [Google Scholar] [CrossRef]
  27. Meng, Q.; Wei, Y.; Fan, J.; Li, Y.; Zhao, F.; Lei, Y.; Sun, H.; Jiang, L.; Yu, L. Peak regulation strategies for ground source heat pump demand response of based on load forecasting: A case study of rural building in China. Renew. Energy 2024, 224, 120059. [Google Scholar] [CrossRef]
  28. Betzold, C.; Dentel, A. Comparison and implementation of MPC and simple predictive control into a heat pump system. In Proceedings of the CLIMA 2022 Conference, Rotterdam, The Netherlands, 22–25 May 2022. [Google Scholar] [CrossRef]
  29. Behzadi, A.; Holmberg, S.; Duwig, C.; Haghighat, F.; Ooka, R.; Sadrizadeh, S. Smart design and control of thermal energy storage in low-temperature heating and high-temperature cooling systems: A comprehensive review. Renew. Sustain. Energy Rev. 2022, 166, 112625. [Google Scholar] [CrossRef]
  30. Wirtz, M.; Kivilip, L.; Remmen, P.; Müller, D. Bidirectional low temperature networks in urban districts: A novel design methodology based on mathematical optimization. J. Phys. Conf. Ser. 2019, 1343, 012111. [Google Scholar] [CrossRef]
  31. Roure, R.; Chèze, D.; Vallée, M. Model predictive control of solar-coupled innovative heat pump: A comparison of economic and environmental optimizations in Latvia. Open Res. Eur. 2023, 3, 17. [Google Scholar] [CrossRef]
  32. Maier, L.; Schönegge, M.; Henn, S.; Hering, D.; Müller, D. Assessing mixed-integer-based heat pump modeling approaches for model predictive control applications in buildings. Appl. Energy 2022, 326, 119894. [Google Scholar] [CrossRef]
  33. Vonžudaitė, O.; Martišauskas, L.; Bakas, R.; Urbonienė, S.; Urbonas, R. Optimization of Heat Pump Systems in Buildings by Minimizing Costs and CO2 Emissions. Appl. Sci. 2023, 13, 4864. [Google Scholar] [CrossRef]
  34. Siddiqui, S.; Macadam, J.; Barrett, M. The operation of district heating with heat pumps and thermal energy storage in a zero-emission scenario. Energy Rep. 2021, 7, 176–183. [Google Scholar] [CrossRef]
  35. Wei, Z.; Ren, F.; Yue, B.; Ding, Y.; Zheng, C.; Li, B.; Zhai, X.; Wang, R. Data-driven application on the optimization of a heat pump system for district heating load supply: A validation based on onsite test. Energy Convers. Manag. 2022, 266, 115851. [Google Scholar] [CrossRef]
  36. Pintaldi, S.; Li, J.; Sethuvenkatraman, S.; White, S.; Rosengarten, G. Model predictive control of a high efficiency solar thermal cooling system with thermal storage. Energy Build. 2019, 196, 214–226. [Google Scholar] [CrossRef]
  37. Tomás, L.; Lämmle, M.; Pfafferott, J. Demonstration and Evaluation of Model Predictive Control (MPC) for a Real-World Heat Pump System in a Commercial Low-Energy Building for Cost Reduction and Enhanced Grid Support. Energies 2025, 18, 1434. [Google Scholar] [CrossRef]
  38. Yang, Y.; Jradi, M. Multi-objective optimization for balancing CO2 emissions and thermal comfort in a central heat pump system: A case study of a Danish office building. Energy Build. 2025, 342, 115870. [Google Scholar] [CrossRef]
  39. Tang, W.; Li, Y.; Walker, S.; Keviczky, T. Model Predictive Control Design for Unlocking the Energy Flexibility of Heat Pump and Thermal Energy Storage Systems. In Proceedings of the 2024 IEEE Conference on Control Technology and Applications (CCTA), Newcastle upon Tyne, UK, 21–23 August 2024; pp. 433–439. [Google Scholar] [CrossRef]
  40. Strauch, P.; Wang, W.; Langner, F. Model predictive control for demand flexibility of a residential building with multiple distributed energy resources. Energy Build. 2024, 305, 113884. [Google Scholar] [CrossRef]
  41. Klepic, V.; Wolf, M.; Pröll, T. Extension of a low-tech Model Predictive Control (MPC) algorithm for grid-supportive heat pump operation. Energy Build. 2024, 323, 114733. [Google Scholar] [CrossRef]
  42. Péan, T.; Costa-Castelló, R.; Salom, J. Price and carbon-based energy flexibility of residential heating and cooling loads using model predictive control. Sustain. Cities Soc. 2019, 50, 101579. [Google Scholar] [CrossRef]
  43. Baumann, C.; Wohlgenannt, P.; Streicher, W.; Kepplinger, P. Optimizing Heat Pump Control in an NZEB via Model Predictive Control and Building Simulation. Energies 2024, 18, 100. [Google Scholar] [CrossRef]
  44. Cid, J.M.; Mylonas, A.; Péan, T.Q.; Pascual, J.; Salom, J. Energy optimization algorithms for multi-residential buildings: A model predictive control application. Energy Build. 2024, 319, 114562. [Google Scholar] [CrossRef]
  45. Kuboth, S.; Heberle, F.; König-Haagen, A.; Brüggemann, D. Economic model predictive control of combined thermal and electric residential building energy systems. Appl. Energy 2019, 240, 372–385. [Google Scholar] [CrossRef]
  46. Lee, Z.; Gupta, K.; Kircher, K.J.; Zhang, K.M. Mixed-integer model predictive control of variable-speed heat pumps. Energy Build. 2019, 198, 75–83. [Google Scholar] [CrossRef]
  47. Yang, W.; Song, L.; Hong, Z. Research on Low-Frequency Load Shedding Control Strategy Based on Model Predictive Control. In Proceedings of the 2025 IEEE 3rd International Conference on Power Science and Technology (ICPST), Kunming, China, 16–18 May 2025; pp. 800–804. [Google Scholar] [CrossRef]
  48. Chen, Y.; Liang, E.; Berardi, U.; Yang, J.; Jiang, X.; Cui, G. A Model-Based Predictive Control Method of the Ground-Source Heat Pump System for Maintaining Thermal Comfort in Low-Energy Buildings. In Proceedings of the 2022 7th International Conference on Power and Renewable Energy (ICPRE), Shanghai, China, 23–26 September 2022; pp. 1080–1085. [Google Scholar] [CrossRef]
  49. Van Randenborgh, J.; Schulze Darup, M. MPC for Aquifer Thermal Energy Storage Systems Using ARX Models. arXiv 2025, arXiv:2509.10154. [Google Scholar] [CrossRef]
  50. Van Randenborgh, J.; Darup, M.S. MPC using mixed-integer programming for aquifer thermal energy storages. IFAC-PapersOnLine 2024, 58, 21–28. [Google Scholar] [CrossRef]
  51. Figueroa, I.C.; Cimmino, M.; Helsen, L. A Methodology for Long-Term Model Predictive Control of Hybrid Geothermal Systems: The Shadow-Cost Formulation. Energies 2020, 13, 6203. [Google Scholar] [CrossRef]
  52. Hermans, L.; Boydens, W.; Helsen, L. Development of a Mixed-Integer Nonlinear Model Predictive Controller for 5th Gen-eration District Heating and Cooling Networks. In Proceedings of the SimBuild Conference 2024, IBPSA-USA Building Simulation Conference, IBPSA-USA, Denver, CO, USA, 21–23 May 2024; Volume 11, pp. 705–714. Available online: https://publications.ibpsa.org/conference/paper/?id=simbuild2024_2198 (accessed on 1 October 2025).
  53. Hering, D.; Cansev, M.E.; Tamassia, E.; Xhonneux, A.; Müller, D. Temperature control of a low-temperature district heating network with Model Predictive Control and Mixed-Integer Quadratically Constrained Programming. Energy 2021, 224, 120140. [Google Scholar] [CrossRef]
  54. Agner, R.; Gruber, P.; Wellig, B. Model Predictive Control of Heat Pumps with Thermal Energy Storages in Industrial Processes. Energies 2024, 17, 4823. [Google Scholar] [CrossRef]
  55. Fuhrmann, F.; Schirrer, A.; Kozek, M. Model-predictive energy management system for thermal batch production processes using online load prediction. Comput. Chem. Eng. 2022, 163, 107830. [Google Scholar] [CrossRef]
  56. Hou, J.; Li, H.; Nord, N.; Huang, G. Model predictive control for a university heat prosumer with data centre waste heat and thermal energy storage. Energy 2023, 267, 126579. [Google Scholar] [CrossRef]
  57. Reimann, A.; Rose, M.; Küpper, J.; Schiffer, J. Nonlinear System identification and Predictive Control for Waste Heat Recovery with Heat Pumps. IFAC-PapersOnLine 2024, 58, 130–135. [Google Scholar] [CrossRef]
  58. Qin, X.; Lestas, I. Frequency Control and Optimal Power Sharing in Combined Power and Heating Networks with Heat Pumps. arXiv 2025, arXiv:2509.24051. [Google Scholar] [CrossRef]
  59. Rasmussen, T.B.H.; Wu, Q.; Moller, J.G.; Zhang, M. MPC Coordinated Primary Frequency Support of Small- and Large-Scale Heat Pumps. IEEE Trans. Smart Grid 2022, 13, 2000–2010. [Google Scholar] [CrossRef]
  60. Wang, W.; Zhao, Z.; Zhou, Q.; Qiao, Y.; Cao, F. Model predictive control for the operation of a transcritical CO2 air source heat pump water heater. Appl. Energy 2021, 30, 117339. [Google Scholar] [CrossRef]
  61. Aschidamini, G.L.; Pavlovic, M.; Reinholz, B.A.; Metcalfe, M.S.; Niet, T.; Resener, M. Comprehensive Review on the Control of Heat Pumps for Energy Flexibility in Distribution Networks. IEEE Access 2025, 13, 45678–45699. [Google Scholar] [CrossRef]
  62. Zhao, Z.; Wang, B.; Li, X.; Shi, W. Adaptive model predictive control of a residential solar-air hybrid heat pump system. Energy Convers. Manag. 2024, 321, 119026. [Google Scholar] [CrossRef]
  63. Serale, G.; Fiorentini, M.; Capozzoli, A.; Bernardini, D.; Bemporad, A. Model Predictive Control (MPC) for Enhancing Building and HVAC System Energy Efficiency: Problem Formulation, Applications and Opportunities. Energies 2018, 11, 631. [Google Scholar] [CrossRef]
  64. Bünning, F.; Warrington, J.; Heer, P.; Smith, R.S.; Lygeros, J. Robust MPC with data-driven demand forecasting for frequency regulation with heat pumps. Control Eng. Pract. 2022, 122, 105101. [Google Scholar] [CrossRef]
  65. Chen, W.-H.; You, F. Sustainable energy management and control for Decarbonization of complex multi-zone buildings with renewable solar and geothermal energies using machine learning, robust optimization, and predictive control. Appl. Energy 2024, 372, 123802. [Google Scholar] [CrossRef]
  66. Golmohamadi, H. Stochastic energy optimization of residential heat pumps in uncertain electricity markets. Appl. Energy 2021, 303, 117629. [Google Scholar] [CrossRef]
  67. Rasku, T.; Hasan, A. Stochastic model-predictive control of district-scale building energy systems using SpineOpt: 18th International IBPSA Building Simulation Conference, BS2023. In Proceedings of the Building Simulation 2023, Shanghai, China, 4–6 September 2023; pp. 893–900. [Google Scholar] [CrossRef]
  68. Mbuwir, B.V.; Geysen, D.; Kosmadakis, G.; Pilou, M.; Meramveliotakis, G.; Toersche, H. Optimal control of a heat pump-based energy system for space heating and hot water provision in buildings: Results from a field test. Energy Build. 2024, 310, 114116. [Google Scholar] [CrossRef]
  69. Drgoňa, J.; Picard, D.; Helsen, L. Cloud-based implementation of white-box model predictive control for a GEOTABS office building: A field test demonstration. J. Process. Control. 2020, 88, 63–77. [Google Scholar] [CrossRef]
  70. Fallmann, M.; Stanger, L.; Fischer, M.; Kureck, M.; Schirrer, A.; Hofmann, R.; Jakubek, S.; Kozek, M. Experimental validation of mixed-integer Model Predictive Control for energy management in an industrial food processing plant. Case Stud. Therm. Eng. 2025, 75, 106988. [Google Scholar] [CrossRef]
  71. Pergantis, E.N.; Priyadarshan; Al Theeb, N.; Dhillon, P.; Ore, J.P.; Ziviani, D.; Groll, E.A.; Kircher, K.J. Field demonstration of predictive heating control for an all-electric house in a cold climate. Appl. Energy 2024, 360, 122820. [Google Scholar] [CrossRef]
  72. Blum, D.; Wang, Z.; Weyandt, C.; Kim, D.; Wetter, M.; Hong, T.; Piette, M.A. Field demonstration and implementation analysis of model predictive control in an office HVAC system. Appl. Energy 2022, 318, 119104. [Google Scholar] [CrossRef]
  73. Göbel, S.; Stoffel, P.; Will, F.; Vering, C.; Müller, D. Maximizing operational efficiency of heat pumps with model predictive control: An experimental case study for residential application. In Proceedings of the 14th IEA Heat Pump Conference 2023 (HPC2023) Conference Proceedings—Full Papers, Chicago, IL, USA, 15–18 May 2023; p. 12. Available online: https://heatpumpingtechnologies.org/publications/paper-no-805-maximizing-operational-efficiency-of-heat-pumps-withmodel-predictive-control-an-experimental-case-study-for-residential-application-14th-iea-heat-pump-conference-chicago-usa/ (accessed on 1 October 2025).
  74. Sawant, P.; Mier, O.V.; Schmidt, M.; Pfafferott, J. Demonstration of Optimal Scheduling for a Building Heat Pump System Using Economic-MPC. Energies 2021, 14, 7953. [Google Scholar] [CrossRef]
  75. Pean, T.; Costa-Castello, R.; Fuentes, E.; Salom, J. Experimental Testing of Variable Speed Heat Pump Control Strategies for Enhancing Energy Flexibility in Buildings. IEEE Access 2019, 7, 37071–37087. [Google Scholar] [CrossRef]
  76. Hlanze, P.; Jiang, Z.; Cai, J.; Shen, B. Model-based predictive control of multi-stage air-source heat pumps integrated with phase change material-embedded ceilings. Appl. Energy 2023, 336, 120796. [Google Scholar] [CrossRef]
  77. Giama, E.; Chantzis, G.; Kontos, S.; Keppas, S.; Poupkou, A.; Liora, N.; Melas, D. Building Energy Simulations Based on Weather Forecast Meteorological Model: The Case of an Institutional Building in Greece. Energies 2022, 16, 191. [Google Scholar] [CrossRef]
  78. Ascione, F.; De Masi, R.F.; Festa, V.; Mauro, G.M.; Vanoli, G.P. Optimizing space cooling of a nearly zero energy building via model predictive control: Energy cost vs comfort. Energy Build. 2022, 278, 112664. [Google Scholar] [CrossRef]
  79. Coccia, G.; Mugnini, A.; Polonara, F.; Arteconi, A. Artificial-neural-network-based model predictive control to exploit energy flexibility in multi-energy systems comprising district cooling. Energy 2021, 222, 119958. [Google Scholar] [CrossRef]
  80. Cox, S.J.; Kim, D.; Cho, H.; Mago, P. Real time optimal control of district cooling system with thermal energy storage using neural networks. Appl. Energy 2019, 238, 466–480. [Google Scholar] [CrossRef]
  81. Li, Y.; Shi, J.; Jones, C.N.; Yorke-Smith, N.; Keviczky, T. Model predictive building climate control for mitigating heat pump noise pollution. Eur. J. Control. 2025, 86, 101323. [Google Scholar] [CrossRef]
  82. Klingebiel, J.; Beckschulte, M.; Will, F.; Vering, C.; Müller, D. Multi-objective model predictive control for air-source heat pumps: Leveraging system flexibility to simultaneously reduce noise and increase efficiency. Appl. Therm. Eng. 2025, 278, 126926. [Google Scholar] [CrossRef]
  83. Behrooz, F.; Mariun, N.; Marhaban, M.H.; Radzi, M.A.M.; Ramli, A.R. Review of Control Techniques for HVAC Systems—Nonlinearity Approaches Based on Fuzzy Cognitive Maps. Energies 2018, 11, 495. [Google Scholar] [CrossRef]
  84. Friese, J.; Brandt, N.; Schulte, A.; Kirches, C.; Tegethoff, W.; Köhler, J. Quasi-optimal control of a solar thermal system via neural networks. Energy AI 2023, 12, 100232. [Google Scholar] [CrossRef]
  85. Chae, S.; Bae, S.; Nam, Y. Performance improvement of air-source heat pump via optimum control based on artificial neural network. Energy Rep. 2023, 10, 460–472. [Google Scholar] [CrossRef]
  86. Dengiz, T.; Kleinebrahm, M. Imitation learning with artificial neural networks for demand response with a heuristic control approach for heat pumps. arXiv 2024, arXiv:2407.11561. [Google Scholar] [CrossRef]
  87. Maiorino, A.; Del Duca, M.G.; Aprea, C. ART.I.CO. (ARTificial Intelligence for COoling): An innovative method for optimizing the control of refrigeration systems based on Artificial Neural Networks. Appl. Energy 2022, 306, 118072. [Google Scholar] [CrossRef]
  88. Bousnina, D. Deep Reinforcement Learning for Optimal Energy Management in Smart Multi-Energy Systems. Ph.D. Thesis, Université Paris Sciences et Lettres, Paris, France, 2023. [Google Scholar]
  89. Rohrer, T.; Frison, L.; Kaupenjohann, L.; Scharf, K.; Hergenröther, E. Deep Reinforcement Learning for Heat Pump Control. In Lecture Notes in Networks and Systems; Springer: Cham, Switzerland, 2023; pp. 459–471. [Google Scholar] [CrossRef]
  90. Klingebiel, J.; Salamon, M.; Bogdanov, P.; Venzik, V.; Vering, C.; Müller, D. Towards maximum efficiency in heat pump operation: Self-optimizing defrost initiation control using deep reinforcement learning. Energy Build. 2023, 297, 113397. [Google Scholar] [CrossRef]
  91. Qin, H.; Yu, Z.; Li, T.; Liu, X.; Li, L. Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning. Energy 2022, 264, 126209. [Google Scholar] [CrossRef]
  92. Schmitz, S.; Brucke, K.; Kasturi, P.; Ansari, E.; Klement, P. Forecast-based and data-driven reinforcement learning for residential heat pump operation. Appl. Energy 2024, 371, 123688. [Google Scholar] [CrossRef]
  93. Heidari, A.; Maréchal, F.; Khovalyg, D. Reinforcement Learning for proactive operation of residential energy systems by learning stochastic occupant behavior and fluctuating solar energy: Balancing comfort, hygiene and energy use. Appl. Energy 2022, 318, 119206. [Google Scholar] [CrossRef]
  94. Bachseitz, M.; Sheryar, M.; Schmitt, D.; Summ, T.; Trinkl, C.; Zörner, W. PV-Optimized Heat Pump Control in Multi-Family Buildings Using a Reinforcement Learning Approach. Energies 2024, 17, 1908. [Google Scholar] [CrossRef]
  95. Xu, Y.; Li, Y.; Gao, W. Comparative Analysis of Reinforcement Learning Approaches for Multi-Objective Optimization in Residential Hybrid Energy Systems. Buildings 2024, 14, 2645. [Google Scholar] [CrossRef]
  96. Lissa, P.; Deane, C.; Schukat, M.; Seri, F.; Keane, M.; Barrett, E. Deep reinforcement learning for home energy management system control. Energy AI 2021, 3, 100043. [Google Scholar] [CrossRef]
  97. Langer, L.; Volling, T. A reinforcement learning approach to home energy management for modulating heat pumps and photovoltaic systems. Appl. Energy 2022, 327, 120020. [Google Scholar] [CrossRef]
  98. John, D.; Kaltschmitt, M. Control of a PVT-Heat-Pump-System Based on Reinforcement Learning–Operating Cost Reduction through Flow Rate Variation. Energies 2022, 15, 2607. [Google Scholar] [CrossRef]
  99. Friansa, K.; Pradipta, J.; Nanda, R.M.; Haq, I.N.; Mangkuto, R.A.; Iskandar, R.F.; Wasesa, M.; Leksono, E. Enhancing University Building Energy Flexibility Performance Using Reinforcement Learning Control. IEEE Access 2024, 12, 192377. [Google Scholar] [CrossRef]
  100. Buscemi, G.; Cuomo, F.P.; Razzano, G.; Cappiello, F.L.; Brandi, S. Deep reinforcement learning-based control of thermal energy storage for university classrooms: Co-Simulation with TRNSYS-Python and transfer learning across operational scenarios. Energy Rep. 2025, 14, 1349–1367. [Google Scholar] [CrossRef]
  101. Wang, Z.; Qiu, Y.; Zhou, S.; Tian, Y.; Zhu, X.; Liu, J.; Lu, S. Enhancing Air Conditioning System Efficiency Through Load Prediction and Deep Reinforcement Learning: A Case Study of Ground Source Heat Pumps. Energies 2025, 18, 199. [Google Scholar] [CrossRef]
  102. Elomari, Y.; Aspetakis, G.; Mateu, C.; Shobo, A.; Boer, D.; Marín-Genescà, M.; Wang, Q. A hybrid data-driven Co-simulation approach for enhanced integrations of renewables and thermal storage in building district energy systems. J. Build. Eng. 2025, 104, 112405. [Google Scholar] [CrossRef]
  103. Ghane, S.; Jacobs, S.; Casteels, W.; Brembilla, C.; Mercelis, S.; Latre, S.; Verhaert, I.; Hellinckx, P. Supply temperature control of a heating network with reinforcement learning. In Proceedings of the 2021 IEEE International Smart Cities Conference (ISC2), Manchester, UK, 7–10 September 2021; pp. 1–7. [Google Scholar] [CrossRef]
  104. Han, G.; Joo, H.-J.; Lim, H.-W.; An, Y.-S.; Lee, W.-J.; Lee, K.-H. Data-driven heat pump operation strategy using rainbow deep reinforcement learning for significant reduction of electricity cost. Energy 2023, 270, 126913. [Google Scholar] [CrossRef]
  105. Chen, Z.; Xing, T.; Wang, Y.; Zhuang, Y.; Zheng, M.; Zhao, Q.; Jia, Q.-S. Coupling time-scale reinforcement learning methods for building operational optimization with waste heat. Appl. Energy 2025, 391, 125851. [Google Scholar] [CrossRef]
  106. Riebel, A.; Cardemil, J.M.; López, E. Multi-objective deep reinforcement learning for a water heating system with solar energy and heat recovery. Energy 2024, 291, 130296. [Google Scholar] [CrossRef]
  107. Zhang, Y.; Wang, Z.; Filonenko, K.; Dominković, D.F.; Wang, S. Physics-guided deep reinforcement learning for optimized data center cooling and waste heat recovery utilizing aquifer thermal energy storage. Appl. Energy 2025, 402, 126984. [Google Scholar] [CrossRef]
  108. Diaz, J.J.; Fernández, J.A. The Potential of Control Models Based on Reinforcement Learning in the Operating of Solar Thermal Cooling Systems. Processes 2022, 10, 1649. [Google Scholar] [CrossRef]
  109. Klingebiel, J.; Höges, C.; Horst, J.; Nießen, O.; Venzik, V.; Vering, C.; Müller, D. A Self-Optimizing Defrost Initiation Controller for Air-Source Heat Pumps: Experimental Validation of Deep Reinforcement Learning. Appl. Energy 2025, 398, 126400. [Google Scholar] [CrossRef]
  110. Boutahri, Y.; Tilioua, A. Reinforcement learning for HVAC control and energy efficiency in residential buildings with BOPTEST simulations and real-case validation. Discov. Comput. 2025, 28, 45. [Google Scholar] [CrossRef]
  111. Zhang, W.; Yu, Y.; Yuan, Z.; Tang, P.; Gao, B. Data-driven pre-training framework for reinforcement learning of air-source heat pump (ASHP) systems based on historical data in office buildings: Field validation. Energy Build. 2025, 332, 115436. [Google Scholar] [CrossRef]
  112. Heidari, A.; Girardin, L.; Dorsaz, C.; Maréchal, F. A trustworthy reinforcement learning framework for autonomous control of a large-scale complex heating system: Simulation and field implementation. Appl. Energy 2024, 378, 124815. [Google Scholar] [CrossRef]
  113. Belman-Flores, J.M.; Rodríguez-Valderrama, D.A.; Ledesma, S.; García-Pabón, J.J.; Hernández, D.; Pardo-Cely, D.M. A Review on Applications of Fuzzy Logic Control for Refrigeration Systems. Appl. Sci. 2022, 12, 1302. [Google Scholar] [CrossRef]
  114. Sánta, R.; Simon, J.; Garbai, L. The Advantages of Fuzzy Control for Heat Pumps Systems. Period. Polytech. Mech. Eng. 2023, 67, 214–226. [Google Scholar] [CrossRef]
  115. Kovačević, J.; Süß, A.-C.; Çakmak, H.K.; Hagenmeyer, V. A fuzzy logic approach for economic energyefficient heat pump operation in thermal building control. In Proceedings of the 2024 IEEE PES Innovative Smart Grid Technologies Europe (ISGT EUROPE), Dubrovnik, Croatia, 14–17 October 2024; pp. 1–6. [Google Scholar] [CrossRef]
  116. Rodríguez-Valderrama, D.; Belman-Flores, J.; Hernández-Fusilier, D.; Pardo-Cely, D.; Gómez-Martínez, M.; Méndez-Díaz, S. Implementation of fuzzy control in a domestic refrigerator considering the influence of the thermal load. Int. J. Refrig. 2022, 149, 23–34. [Google Scholar] [CrossRef]
  117. Chuensiri, S.; Katchasuwanmanee, K.; Wisessint, A.; Jotisankasa, A.; Soralump, C.; Siriyakorn, V.; Kerdphol, T.; Sanposh, P. Implementation of Adaptive Network-Based Fuzzy Inference for Hybrid Ground Source Heat Pump. IEEE Access 2024, 12, 21052–21069. [Google Scholar] [CrossRef]
  118. Kim, Y.J.; Entchev, E.; Na, S.I.; Kang, E.C.; Baik, Y.-J.; Lee, E.J. Investigation of system optimization and control logic on a solar geothermal hybrid heat pump system based on integral effect test data. Energy 2023, 284, 129308. [Google Scholar] [CrossRef]
  119. Langner, F.; Kovačević, J.; Spatafora, L.; Dietze, S.; Waczowicz, S.; Çakmak, H.K.; Matthes, J.; Hagenmeyer, V. Experimental evaluation of model predictive control and fuzzy logic control for demand response in buildings. Appl. Energy 2025, 401, 126666. [Google Scholar] [CrossRef]
  120. Alexander, S. An Introduction to Extremum-Seeking Control. Sweden, 2017. Available online: https://alexanderskafte.com/esc.pdf (accessed on 1 October 2025).
  121. Gong, Y.; Liu, G.; Lu, Z. Extremum seeking control for real-time optimization of high temperature heat pump systems incorporating vapor injection. Therm. Sci. Eng. Prog. 2023, 42, 101867. [Google Scholar] [CrossRef]
  122. Liu, G.; Lu, Z.; Yao, Y.; Gong, Y. Multi-variable extremum seeking control for high temperature heat pump system incorporating double refrigerant injection. Int. J. Refrig. 2023, 152, 1–15. [Google Scholar] [CrossRef]
  123. Wang, W.; Li, Y.; Hu, B. Real-time efficiency optimization of a cascade heat pump system via multivariable extremum seeking. Appl. Therm. Eng. 2020, 176, 115399. [Google Scholar] [CrossRef]
  124. Vering, C.; Klingebiel, J.; Reichl, C.; Emhofer, J.; Nürenberg, M.; Müller, D. Simultaneous energy efficiency and acoustic evaluation of heat pump systems using dynamic simulation models. In Proceedings of the 13th IEA Heat Pump Conference, Seoul, Republic of Korea, 26–29 April 2021; pp. 393–403. [Google Scholar]
  125. Ding, X.; An, Z.; Rathee, A.; Du, W. A Safe and Data-Efficient Model-Based Reinforcement Learning System for HVAC Control. IEEE Internet Things J. 2025, 12, 8014–8032. [Google Scholar] [CrossRef]
  126. Gao, C.; Wang, D. Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems. J. Build. Eng. 2023, 74, 106852. [Google Scholar] [CrossRef]
  127. Soares, A.; Geysen, D.; Spiessens, F.; Ectors, D.; De Somer, O.; Vanthournout, K. Using reinforcement learning for maximizing residential self-consumption—Results from a field test. Energy Build. 2020, 207, 109608. [Google Scholar] [CrossRef]
  128. Yang, S.; Wan, M.P.; Chen, W.; Ng, B.F.; Dubey, S. Experiment study of machine-learning-based approximate model predictive control for energy-efficient building control. Appl. Energy 2021, 288, 116648. [Google Scholar] [CrossRef]
  129. Maier, L.M.; Henn, S.; Mehrfeld, P.; Müller, D. Approximate Optimal Control for Heat Pumps in Building Energy Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2021. [Google Scholar] [CrossRef]
  130. Lohr, Y.; Monnigmann, M.; Klauco, M.; Kaluz, M. Mimicking Predictive Control with Neural Networks in Domestic Heating Systems. In Proceedings of the 2019 22nd International Conference on Process Control (PC19), Strbske Pleso, Slovakia, 11–14 June 2019; pp. 19–24. [Google Scholar] [CrossRef]
  131. Maier, L.; Brillert, J.; Zanetti, E.; Müller, D. Approximating model predictive control strategies for heat pump systems applied to the building optimization testing framework (BOPTEST). J. Build. Perform. Simul. 2023, 17, 338–360. [Google Scholar] [CrossRef]
  132. Chen, B.; Cai, Z.; Bergés, M. Gnu-RL: A Practical and Scalable Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy. Front. Built Environ. 2020, 6, 562239. [Google Scholar] [CrossRef]
  133. Arroyo, J.; Manna, C.; Spiessens, F.; Helsen, L. Reinforced model predictive control (RL-MPC) for building energy management. Appl. Energy 2022, 309, 118346. [Google Scholar] [CrossRef]
  134. Lee, Z.E.; Zhang, K.M. Generalized reinforcement learning for building control using Behavioral Cloning. Appl. Energy 2021, 304, 117602. [Google Scholar] [CrossRef]
  135. Cai, W.; Sawant, S.; Reinhardt, D.; Rastegarpour, S.; Gros, S. A Learning-Based Model Predictive Control Strategy for Home Energy Management Systems. IEEE Access 2023, 11, 145264–145280. [Google Scholar] [CrossRef]
  136. Walther, K.; Molinari, M.; Voss, K. Controls of HVAC systems in digital twins—Comparative framework and case study on the performance gap. J. Build. Perform. Simul. 2025, 1–18. [Google Scholar] [CrossRef]
  137. Kyriaki, E.; Giama, E.; Theodoridou, I. Machine learning, artificial intelligence and digital twins: An up-to-date review analysis of the latest- era technologies in the urban building sector. Int. J. Sustain. Energy 2025, 44, 2544238. [Google Scholar] [CrossRef]
  138. IEA. Energy and AI; Licence: CC BY 4.0; IEA: Paris, France, 2025; Available online: https://www.iea.org/reports/energy-and-ai (accessed on 30 October 2025).
  139. Wei, Z.; Calautit, J.K. Field experiment testing of a low-cost model predictive controller (MPC) for building heating systems and analysis of phase change material (PCM) integration. Appl. Energy 2024, 360, 122750. [Google Scholar] [CrossRef]
  140. Martín-Garín, A.; Millán-García, J.; Baïri, A.; Millán-Medel, J.; Sala-Lizarraga, J. Environmental monitoring system based on an Open Source Platform and the Internet of Things for a building energy retrofit. Autom. Constr. 2018, 87, 201–214. [Google Scholar] [CrossRef]
  141. Noye, S.; Martinez, R.M.; Carnieletto, L.; De Carli, M.; Aguirre, A.C. A review of advanced ground source heat pump control: Artificial intelligence for autonomous and adaptive control. Renew. Sustain. Energy Rev. 2022, 153, 111685. [Google Scholar] [CrossRef]
  142. Ajifowowe, I.; Chang, H.; Lee, C.S.; Chang, S. Prospects and challenges of reinforcement learning- based HVAC control. J. Build. Eng. 2024, 98, 111080. [Google Scholar] [CrossRef]
  143. Evens, M.; Arteconi, A. Heat pump digital twin: An accurate neural network model for heat pump behaviour prediction. Appl. Energy 2024, 378, 124816. [Google Scholar] [CrossRef]
Figure 1. Literature Analysis Approach.
Figure 1. Literature Analysis Approach.
Energies 19 00066 g001
Figure 2. Overlay visualization map of keyword occurrences.
Figure 2. Overlay visualization map of keyword occurrences.
Energies 19 00066 g002
Figure 3. Paper structure.
Figure 3. Paper structure.
Energies 19 00066 g003
Figure 4. Temperature and efficiency of various generations of district energy systems [12].
Figure 4. Temperature and efficiency of various generations of district energy systems [12].
Energies 19 00066 g004
Figure 5. Control strategies for heat pump systems.
Figure 5. Control strategies for heat pump systems.
Energies 19 00066 g005
Figure 6. Schematic representation of the general MPC.
Figure 6. Schematic representation of the general MPC.
Energies 19 00066 g006
Figure 7. General classification of optimization methods.
Figure 7. General classification of optimization methods.
Energies 19 00066 g007
Figure 8. Neural network controller [84].
Figure 8. Neural network controller [84].
Energies 19 00066 g008
Figure 9. Reinforcement Learning control framework for a heat pump system.
Figure 9. Reinforcement Learning control framework for a heat pump system.
Energies 19 00066 g009
Figure 10. General scheme of a Fuzzy controller [113].
Figure 10. General scheme of a Fuzzy controller [113].
Energies 19 00066 g010
Figure 11. A general extremum-seeking scheme [120].
Figure 11. A general extremum-seeking scheme [120].
Energies 19 00066 g011
Figure 12. Schematic view of approximate MPC methodology.
Figure 12. Schematic view of approximate MPC methodology.
Energies 19 00066 g012
Figure 13. Sankey diagram between control approaches and optimization objectives in heat pump systems.
Figure 13. Sankey diagram between control approaches and optimization objectives in heat pump systems.
Energies 19 00066 g013
Figure 14. Sankey diagram between control approaches, application domains, and optimization objectives in heat pump systems.
Figure 14. Sankey diagram between control approaches, application domains, and optimization objectives in heat pump systems.
Energies 19 00066 g014
Figure 15. Percentage of studies examining TES across application categories.
Figure 15. Percentage of studies examining TES across application categories.
Energies 19 00066 g015
Table 1. Summary of reviewed studies on Predictive Rule-Based Control (PRBC) in heat pump systems.
Table 1. Summary of reviewed studies on Predictive Rule-Based Control (PRBC) in heat pump systems.
Ref.Control ApproachHP UseApplication FieldObjective
RNRDHCI
[23]PRBCSH and DHW Reduce electricity costs and greenhouse gas emissions from the heat pump’s electricity consumption
[24]PRBCSH, SC Reduce electricity costs
[25]PRBCSH Reduce heat energy costs and CO2eq emissions
[26]PRBCSH, SC Minimize operating costs
[27]PRBCSH, SC Peak shaving
[28]PRBC-MPCSH and DHW Optimize HP-TES operation for PV self-consumption and cost savings
Abbreviation—R: Residential, NR: Non-Residential, DHC: District Heating/Cooling Network, I: Industrial.
Table 2. Summary of reviewed studies on Model Predictive Control (MPC) in heat pump systems.
Table 2. Summary of reviewed studies on Model Predictive Control (MPC) in heat pump systems.
Ref.Control ApproachHP UseApplication FieldObjective
RNRDHCI
[37]MPCSH, SC Cost reduction, grid support enhancement via load shifting
[38]MPCSH Minimize the operational costs and minimize the marginal CO2 emissions
[39]MPCDHW Flexibility assessment and flexibility exploitation
[40]MPCSH, SC, and DHW Minimize energy cost with demand flexibility and thermal comfort
[41]MPCSH Heating load shifting and heating costs reduction
[42]MPCSH, SC, and DHW Minimization of thermal energy, electricity costs, and marginal CO2 emissions
[43]MPCSH, SC, and DHW Minimize electricity costs, increase PV self-consumption, and enhance indoor comfort
[44]MPCSH and DHW PV self-consumption maximization, CO2 emissions minimization, economic cost minimization, and final energy use minimization
[45]MPCSH and DHW Reduction in operational costs and increase in photovoltaic self-consumption through the heat pump
[46]MPCSH Minimization of energy costs and CO2 emissions
[47]MPCSH Low-frequency load shedding control
[48]MPCSH Εnsuring thermal comfort while minimizing energy consumption
[49]MPCSH Optimization of the operation of a geothermal heat pump with an ATES system.
[50]MPCSH Sustainable and cost-efficient operation of ATES systems
[51]MPCSH Optimization of a GSHP system taking into account the long-term effects occurring in the borefield.
[52]MPCSH, SC Minimization of energy use by exploiting system flexibility
[53]MPCSH, SC Optimize DHC network operation and reduce electricity consumption via TES flexibility
[54]MPCProvision of heating and cooling to process streams Ensure reliable operation under real-world disturbances
[55]MPCProvision of heating Minimization of energy cost in thermal batch processes, while simultaneously ensuring production reliability
[56]MPCHeating, Data Center Cooling Minimization of the total energy cost
[57]MPCSH Optimization of the heat pump’s operational efficiency
[59]MPCSH Minimize the frequency deviation from the nominal frequency
[60]MPCProvision of hot water Optimize the energy performance and operation of a transcritical CO2 air-source heat pump
[62]Adaptive MPCDHW Ensure effective and adaptive system control
[64] Robust MPCSH Frequency regulation and flexible heating operation
[65]Robust MPCSH, SC Reducing energy cost while ensuring thermal comfort
[66]Stochastic MPCSH and DHW Enhance power-to-heat flexibility under uncertainty in electricity prices, weather, and occupancy patterns
[67]Stochastic MPCSH, SC, and DHW Reduce energy costs
[68]ΜPCSH, SC, and DHW Minimize electricity costs through thermal storage flexibility
[69]MPCSH, SC Energy savings and improvement of thermal comfort conditions
[70]MPCHeating, Cooling Optimal scheduling of heat pumps and thermal storage units
[71]MPCSH Reduce the house’s total annual heating costs
[72]MPCSH, SC Reduce HVAC energy consumption
[73]MPCSH Minimization of heat pump power consumption within comfort constraints
[74]MPCSH, SC Utilize the technical flexibility of a building polygeneration system
[75]MPCSH, SC, and DHW Minimize operational costs and marginal CO2 emissions
[76]MPCSH, SC Load shifting and minimizing energy costs
[78] MPCSC Trade-off between thermal comfort and energy costs
[79]MPCSC Minimize electricity consumption from the grid by exploiting the building’s energy flexibility
[36]MPCSC Minimize backup energy consumption while satisfying the cooling demand
[80]MPCSC Reduce the operating costs of the district cooling network
[81]MPCSH Minimizing the noise nuisance generated by heat pumps
[82]MPCSH Improve energy efficiency and reduce noise emissions
Abbreviation—R: Residential, NR: Non-Residential, DHC: District Heating/Cooling Network, I: Industrial.
Table 3. Summary of reviewed studies on model-free control in heat pump systems.
Table 3. Summary of reviewed studies on model-free control in heat pump systems.
Ref.Control ApproachHP UseApplication FieldObjective
RNRDHCI
[85]ANNSH, SC Increase the system performance
[86]ANNSH Minimize the electricity cost with a time-variable electricity tariff
[87]ANNCooling----Performance optimization
[89]DRLSH Shift the heating to low price periods
[90]DRLSH Maximize heat pump performance through autonomous defrost optimization.
[91]DRLSH Minimize energy cost while considering thermal comfort
[92]RLSH Minimize electricity costs
[93]RLSH and DHW Save energy while maintaining the health and comfort of occupants.
[94]RLSH and DHW Μaintain the required temperatures in the thermal storage and enhance PV self-consumption
[95]DRLSH, SC, and DHW Minimize energy costs and maximize PV self-consumption in a home energy management system
[96]DRLSH and DHW Minimize energy consumption and shift loads while maintaining thermal comfort
[97]DRLSH and DHW Optimize the trade-off between energy consumption cost and comfort of living
[98]RLSH and DHW Minimize the operating costs
[99]RLSC Increase self-consumption, energy flexibility, and energy savings while maintaining thermal comfort
[100]DRLSH, SC Reduce heat pump energy consumption
[101]DRLSH, SC Control optimization
[102]DRLSH and DHW Enhance energy efficiency and minimize unnecessary heating
[103]DRLSH Reduce the electrical energy consumption while maintaining thermal comfort
[104]Rainbow DRLGreenhouse Heating Minimize electricity costs
[105]RLSH, SC Minimize energy consumption and power fluctuations while maintaining thermal comfort in multi-zone building HVAC systems
[106]DRLHeating Optimization of energy use
[107]DRLHeating, Cooling Reduction in energy consumption and maintenance of the annual thermal balance of the Aquifer TES
[108]RLCooling Optimize the operation of solar thermal cooling systems (STCS)
[109]DRLSH----Heat pump defrost control
[110]RLSH Improves indoor comfort while optimizing energy use
[111]RLSH, SC Optimize heat pump RL control
[112]DRLSH, SC Minimizing energy costs while maintaining occupant comfort
[114]FLCHeating/Cooling----Enhance energy efficiency and temperature control
[115]FLCSH Βalance between thermal comfort and electricity load reduction
[116] FLCDomestic refrigerator----Reduce energy consumption
[117]ANFISHeating/Cooling----Control the water flow rate with a Variable Speed Drive (VSD) to improve overall system performance
[118]FLCSH, SC Minimize energy costs
[119]FLCSH Minimize energy costs while maintaining thermal comfort
[121]ESCHeating Reduce power consumption
[122]ESCHeating Reduce power consumption and optimize system operating point
[123]ESCHeating Minimize power consumption
[124]ESCHeating----Optimize heat pump operation for energy efficiency and noise reduction
Abbreviation—R: Residential, NR: Non-Residential, DHC: District Heating/Cooling Network, I: Industrial.
Table 4. Summary of reviewed studies on recent advances in predictive and learning-based control in heat pump systems.
Table 4. Summary of reviewed studies on recent advances in predictive and learning-based control in heat pump systems.
Ref.Control ApproachHP UseApplication FieldObjective
RNRDHCI
[126]MBRLSH Enhance energy efficiency and maintain indoor comfort in the HVAC system
[127]MBRLDHW Maximize the instantaneous self-consumption of the local photovoltaic production
[129]Approximate MPCSH, SC Optimize the operation schedule for minimizing annual energy consumption and operational costs
[130]Approximate MPCSH, DHW Satisfy thermal comfort
[131]Approximate MPCSH Evaluate the performance of Approximate MPC for a hydronic heat pump system
[132]Differentiable MPCSH, SC Minimize energy consumption while maintaining occupant comfort
[133]Differentiable MPCSH Minimize operational costs while ensuring thermal comfort
[134]Behavioral cloning RLSH, SC Minimize the time-varying electricity cost while maintaining thermal comfort
[135]MPC-RLSH Minimize operational costs while ensuring thermal comfort
Abbreviation—R: Residential, NR: Non-Residential, DHC: District Heating/Cooling Network, I: Industrial.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sittas, K.; Giama, E.; Panaras, G. Digitalization for Sustainable Heat Pump Operation: Review on Smart Control and Optimization Strategies. Energies 2026, 19, 66. https://doi.org/10.3390/en19010066

AMA Style

Sittas K, Giama E, Panaras G. Digitalization for Sustainable Heat Pump Operation: Review on Smart Control and Optimization Strategies. Energies. 2026; 19(1):66. https://doi.org/10.3390/en19010066

Chicago/Turabian Style

Sittas, Konstantinos, Effrosyni Giama, and Giorgos Panaras. 2026. "Digitalization for Sustainable Heat Pump Operation: Review on Smart Control and Optimization Strategies" Energies 19, no. 1: 66. https://doi.org/10.3390/en19010066

APA Style

Sittas, K., Giama, E., & Panaras, G. (2026). Digitalization for Sustainable Heat Pump Operation: Review on Smart Control and Optimization Strategies. Energies, 19(1), 66. https://doi.org/10.3390/en19010066

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop