Intelligent Virtual Machine Scheduling Based on CPU Temperature-Involved Server Load Model

Zhou, Huan; Zhu, Jiebei; Chen, Binbin; Yu, Lujie; Luo, Heyu

doi:10.3390/en18143611

Open AccessArticle

Intelligent Virtual Machine Scheduling Based on CPU Temperature-Involved Server Load Model

by

Huan Zhou

¹,

Jiebei Zhu

^1,*,

Binbin Chen

²,

Lujie Yu

¹ and

Heyu Luo

¹

School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China

²

Energy Research Institute of China Southern Power Grid Co., Ltd., Guangzhou 510530, China

^*

Author to whom correspondence should be addressed.

Energies 2025, 18(14), 3611; https://doi.org/10.3390/en18143611

Submission received: 18 April 2025 / Revised: 23 May 2025 / Accepted: 29 May 2025 / Published: 8 July 2025

Download

Browse Figures

Versions Notes

Abstract

To reduce the significant energy consumption in data centers, virtual machine scheduling optimization and server consolidation are deployed. However, existing server power load (SPL) models typically adopt linear approximations for model developments, which results in inaccuracy with actual SPL characteristics, hindering the optimal solution of virtual machine scheduling. Therefore, intelligent virtual machine scheduling (IVMS) is proposed based on a CPU temperature-involved server load model for data center energy conservation. The IVMS establishes a novel server power load model considering the influence of CPU temperature to capture the actual server load characteristics. Based on the model, the Q-learning method is utilized to solve the problem with the advantage of global optimization to obtain the scheduling solution that further improves calculation accuracy. The performance of the proposed IVMS is evaluated and compared to existing methods by both simulation and experiments in data centers, proving that the IVMS can better predict SPL characteristics and further reduce server energy consumption.

Keywords:

virtual machine scheduling; server power load model; data center; energy conservation; Q-learning

1. Introduction

With the rapid development of cloud computing and big data technologies, the scales of data centers (DCs) are growing worldwide. Global DC energy consumption reached 2.4% of the total global energy consumption in 2022 [1]. The increase in DC high energy consumption adds up to operational costs and does not meet energy efficiency requirements such as power usage efficiency and carbon emission requirements. DC energy consumption is primarily attributed to computing systems and cooling systems, where cooling energy consumption largely depends on the computing system consumption. Therefore, reducing computing resource energy consumption is of a high priority for DC energy conservation.

A survey in [2] indicated that up to 30% of physical machines (PMs) in the investigated DCs in the USA are idle and do not perform any work. Moreover, with various hardware components consuming electrical energy, idle PMs may take up approximately 70% of their energy consumption at full CPU speeds [3,4]. Therefore, to improve DC energy efficiency, DC computing resource systems can be optimally scheduled by using the schemes of virtual machine scheduling (VMS) [5,6] and server consolidation [7,8].

In these schemes, virtual machines (VMs) created in lightly loaded PMs are migrated to more appropriate PMs, and idle PMs are set to sleep mode or off mode to save more energy. In terms of VMS algorithms, migrating VMs and consolidating PMs have been evolving from early classical packaging algorithms (such as Best Fit Decreasing, BFD [7]) to heuristic algorithms (such as greedy algorithms [9], genetic algorithms [10], firefly swarm algorithms [11], etc.). Recently, reinforcement learning as an intelligent optimization tool has been adopted in VMS (such as Q-learning [12] and SARSA [13]). Amongst these reinforcement learning methods, to solve the optimization scheduling problem of VMS, which involves discrete states and action spaces in essence, the Q-learning method is considered fit for this purpose and obtains more accurate VMS actions [12,14,15,16,17]. However, these Q-learning-based VMS schemes employ oversimplified ‘linear’ server load models which lead to other errors.

As a basic tool for DC energy consumption analysis and optimization, server power load (SPL) characteristics typically adopt linear approximated models [18,19] that estimate SPL against various CPU utilization rates, considering that CPU accounts for the largest proportion in total server energy consumption and its power load is highly coupled with server total load [20]. In [21], a linear SPL model using power load data at empty and full CPU utilizations, respectively, was established. In [22], a piecewise-linearized approach was proposed to model SPL. However, the simple linear approximations in these models may fail to capture the non-linear properties of SPL characteristics, particularly under CPU fluctuation and dynamic frequency scaling, resulting in significant inaccuracies [23].

The aforementioned linear-approximated models can be revised to provide a more reliable solution for VMS [24,25,26,27,28,29,30,31]. References [24,25] propose SPL models using power exponent fitting and multivariate polynomial fitting, respectively. As server power density increases, the impact of server internal temperature on its power cannot be ignored [26,27]. References [26,27] introduce temperature data measured in real time to derive a data-driven SPL model for better accuracy. But this black-box modeling approach does not explore the mechanism of temperature influence on server power, leading to potential errors and low credibility in server power estimation results. In fact, among server components, CPUs and cooling fans are strongly coupled with temperature [28]. In terms of CPU–temperature coupling, references [29,30], through experimental analysis, conclude that CPU dynamic power consumption (the other one is leakage power consumption) is closely related to server internal temperature. In terms of fan–temperature coupling, reference [31], studying the commonly used intelligent temperature control and speed regulation for computer fans, obtains the mathematical relationship between their speeds and measured temperatures. However, compared to accessible CPU utilizations, real-time server internal temperatures are hard to obtain, and additional measurement devices, whose performances are subject to errors, are required to carry out SPL estimations using the models in [29,30,31]. Therefore, there is an increasing need to accurately ‘estimate’ CPU temperatures instead of measuring CPU temperatures.

To save DC server electrical energy, this paper proposes intelligent virtual machine scheduling (IVMS) based on a CPU temperature-involved server load model, which has the following features:

(1): It addresses the neglect of the impact of temperature on model accuracy in [21,22,24,25], IVMS develops a novel two-stage SPL model with better accuracy, which estimates CPU temperature variations based on CPU utilization change in the first stage and further computes server load power by virtue of estimated temperature in the second stage;
(2): To select an appropriate VMS algorithm over traditional methods such as BFD [6,7] and to promote algorithm accuracy and efficiency, IVMS proposes a Q-learning-based VMS strategy, with the developed model in (1) embedded, which designs specific state spaces, action spaces, and reward functions. This also addresses the model deficit of Q-learning-based VMS in [12,14,15,16,17], which neglects the aforementioned temperature impact.
(3): Experiments were conducted on a physical platform and validated based on the CloudSim simulation environment, demonstrating that the optimization framework composed of the model and method proposed in this paper exhibits higher energy efficiency.

The rest of this paper is organized as follows: Section 2 presents the fundamentals of VMS models and optimization problems. Section 3 proposes IVMS, which establishes the novel SPL model considering temperature influence and its application in Q-learning algorithms. In Section 4, experiments in a physical DC and simulations in CloudSim are carried out to evaluate IVMS performance. Finally, Section 5 concludes the paper.

2. Problem Formulation and Relevant Fundamentals

This section firstly introduces the problem and research objectives studied in this article, which is the virtual machine scheduling problem aimed at energy conservation, and constructs a VMS model. Then, the research method of this article is introduced using a server power consumption model that considers the influence of CPU temperature as the basis for solving the problem. The Q-learning method used to solve this optimization problem is also introduced.

2.1. VMS Framework

The VM set in DC, which takes shape from the end user’s workload requests, was admitted and placed to appropriate DC PMs based on VM utilizations on PM hardware resources as well as PM prevailing throughputs. The actual CPU utilizations in VMs may differ from their originally requested CPU resources, providing opportunities to migrate VMs and shut down idle PMs for energy saving, as illustrated in Figure 1.

To carry out VMS, the first constraint is that the total virtual CPU capacity requested by the total VMs should be less than or equal to the available capacity of one PM [5]. The second constraint is VM cycle and timing, which are defined by adopted operation patterns that can be periodic, event-driven, hybrid, or threshold-based [7,15]. This paper selects a periodical VMS operation pattern, which schedules the VM at preset fixed 6 h time intervals, similarly to [28,32].

The PM workload statuses, which are generally characterized by CPU utilization, accounting for the largest proportion (e.g., 70% or above [20]) in total PM energy consumption [33], are typically monitored for VMS execution. Therefore, similarly to [5,7], this paper only considers the CPU. This paper classifies PM workload statuses into four degrees defined by three CPU utilization thresholds [34], namely T_L, T_M, and T_H (0 ≤ T_L < T_M < T_H ≤ 1), corresponding to PM workload statuses of ‘lightly loaded’, ‘normally loaded’, ‘medium-loaded’ and ‘heavily loaded’, respectively.

Obtaining a specific migration plan usually involves three steps, as follows: (1) source PM selection; (2) source VM selection; and (3) target PM selection. Typically, a PM may migrate a certain amount of its VMs to other target PM(s) with sufficient capacities to ‘heavily loaded’ and may migrate all its VMs to ‘lightly loaded’ [5,6,7]. The source VM usually refers to all VMs on lightly loaded PMs, as well as some VMs on heavily loaded PMs. The target PM is normally loaded and medium-loaded as usual, and the selection is obtained through optimization algorithms, including sequential optimization [5,7] and global heuristic optimization [10,11], as well as reinforcement learning-based methods [14,15,16].

2.2. System Modeling and Problem Formulation

In this subsection, a VMS model for a DC, which consists of m PMs that host n VMs, is established to optimize VM configurations and PM statuses. The VMS-requested VMs are deployed in specific PM(s), provided that these VMS actions meet the PM resource constraints (such as the CPU, memory, and disk bandwidths). In this article, PM = {1, 2, …, m} and VM = {1, 2, …, n} are used to represent m PM and n VM sequences, respectively. A PM CPU capacity PM_i is represented by PC_i, whereas CPU capacity required by VM_j is represented by VC_j. VMS configurations are described by elements in a binary matrix X, in which x_ij indicates the allocations of VM_j to PM_i (x_ij = 0 indicates ‘allocated’ and x_ij = 0 indicates ‘not allocated’). The set Y describes PM statuses, in which y_i = 1 indicates open and y_i = 0 indicates closed.

The VMS problem for minimizing DC energy consumption E_DC can be expressed [5,8,12] as follows:

\begin{array}{l} \min E_{DC} (X) = \sum_{i = 1}^{m} E_{i} & (1.1) \\ s . t . \\ \begin{array}{l} x_{i j} = \{\begin{cases} 1, {if VM}_{j} {is assigned to PM}_{i} \\ 0, otherwise \end{cases} \forall i \in P M and \forall j \in V M \end{array} & (1.2) \\ y_{i} = \{\begin{cases} 1, if \sum_{j = 1}^{n} x_{i j} \geq 1 \\ 0, otherwise \end{cases} \forall i \in P M & (1.3) \\ \sum_{i = 1}^{m} x_{i j} = 1 \forall j \in V M & (1.4) \\ \sum_{j = 1}^{n} VC j \cdot x_{i j} \leq α \cdot PC i \cdot y_{i} \forall i \in P M & (1.5) \end{array}

(1)

where (1.1) is the problem objective function for DC energy consumption minimization, E_DC is the total energy consumption of the data center computing system, E_i is PM_i energy consumption, (1.2) describes VM_j mappings to PM_i, (1.3) describes PM open/closed states, (1.4) ensures that each VM is allocated to a ‘single’ PM, and (1.5) ensures that the resource capacities of each PM are equal or greater than the resource requirements of allocated VMs (a certain margin is typically reserved to ensure performance, e.g., α ∈ (0.8,1) [5]).

In (1.1), E_i during period t₁ to t₂ can be calculated as

E i = \int_{t 1}^{t 2} P i (U_{CPU}^{t}) d t

(2)

where

U_{CPU}^{t}

refers to the CPU utilization of a PM at time t. Conventionally, the power consumption P_i of PM_i is regarded to change ‘linearly’ with CPU utilization change as in [5,6,7,8,9,10,14,15]:

P i (U_{CPU}^{t}) = k \cdot P i \max + (1 - k) \cdot P i \max \cdot U_{CPU}^{t} = P i \max + (P i \max - P i \min) \cdot U_{CPU}^{t}

(3)

where

P i \min

is the power consumption under the minimum utilization of PM_i;

P i \max

is the power consumption under the maximum utilization; and k is the ratio of

P i \min

to

P i \max

. However, as a key finding of this paper, using such a linear model and ignoring the CPU temperature impact as discussed in Section 1 may not capture the precise characteristics of the SPL, leading to non-optimal VMS outcomes. This will be effectively addressed in the proposed IVMS in Section 3.

2.3. CPU Temperature Impact on SPL

An SPL model with CPU temperature influence, including CPU power consumption and fan power consumption, is established in this subsection.

The total power consumption of

P^{t_{n}}

mainly consists of CPU power consumption

P_{CPU}^{t_{n}}

, cooling fan power consumption

P_{fan}^{t_{n}}

, and other forms of power consumption (such as memory and disk)

P_{other}

, as follows [12]:

P^{t_{n}} = P_{CPU}^{t_{n}} + P_{fan}^{t_{n}} + P_{other}

(4)

where

P_{other}

is usually considered as a constant value due to its relatively small power consumption.

P_{CPU}^{t_{n}}

can be further expressed as

P_{CPU}^{t_{n}} = P_{idle} + a U_{CPU}^{t_{n}} + b_{0} \exp (- \frac{b_{1}}{T_{CPU}^{t_{n}}})

(5)

where the idle power consumption refers to

P_{idle}

as a fixed constant [24];

U_{CPU}^{t_{n}}

is the CPU utilization rate; and

T_{CPU}^{t_{n}}

is the CPU temperature. Coefficients a,

b_{0}

, and

b_{1}

are fitting coefficients derived from processing experimental data. In (5), the second term represents CPU dynamic power consumption, which is generated by CPU transistor switching and is related to CPU workload executions. The third term represents the CPU leakage power consumption, which is generated by the leakage current of transistors, related to

T_{CPU}^{t_{n}}

.

P_{fan}^{t_{n}}

can be further expressed as

P_{fan}^{t_{n}} = c_{0} + c_{1} f_{fan}^{t_{n}} + c_{2} {(f_{fan}^{t_{n}})}^{2} + c_{3} {(f_{fan}^{t_{n}})}^{3}

(6)

where

f_{fan}^{t_{n}}

is the cooling fan speed. Coefficients

c_{0} - c_{3}

are fitting coefficients derived from processing experimental data. In (6), the cubic polynomial relationship corresponding to the fan power consumption and fan speed can be obtained by fitting, in which the fan speed can be expressed as (7) according to CPU temperature [22]:

f_{fan}^{t_{n}} = \{\begin{matrix} f_{\min} & T_{CPU}^{t_{n}} \leq T_{\min} \\ f_{\min} + k_{fan} (T_{CPU}^{t_{n}} - T_{\min}) & T_{\min} < T_{CPU}^{t_{n}} \leq T_{\max} \\ f_{\max} & T_{CPU}^{t_{n}} > T_{\max} \end{matrix}

(7)

where f_min and f_max are the fan speeds under the lowest load and highest load of the server, respectively; T_min and T_max are the temperature thresholds when the fan speed is switched between the constant speed state and the variable speed state; and k_fan is the rate of change in speed with temperature in a variable speed area.

Since the server can be regarded as a heat transfer system in which certain temperature differences and thermal resistances exist, the temperature change while the server works can be described using the equivalent thermal parameter model commonly used in thermodynamic analysis [22]. Assuming that the CPU utilization

U_{CPU}^{t_{n - 1}}

remains unchanged during the sampling interval [t_n−1,t_n), the CPU temperature at the sampling time can be expressed by [22]

T_{CPU}^{t_{n}} = T_{CPU, s}^{t_{n - 1}} + (T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}}) \exp (\frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}})

(8)

where

T_{CPU}^{t_{n - 1}}

is the CPU temperature at moment t_n−1;

τ_{CPU}^{t_{n - 1}}

is the time constant of temperature change when the CPU utilization rate is equal to

U_{CPU, s}^{t_{n - 1}}

; and

T_{CPU, s}^{t_{n - 1}}

is the steady-state solution value of the CPU temperature under

U_{CPU, s}^{t_{n - 1}}

, which can be solved using an equivalent thermal parameter model [22].

2.4. Q-Learning Algorithm for VMS

According to the analysis above, VMS is a NP-hard problem which is difficult to solve in polynomial time. To solve this kind of question, the reinforcement learning (RL)-based algorithm is a novel method that demonstrates efficient and precise performance [14,15]. In this paper, VMS is developed based on Q-learning, as shown in Figure 2. The computing cluster is recognized as the environment, and the PM is considered as the individual in the Q-learning algorithm.

During the whole iteration of VMS based on Q-learning, after each state–action–reward–state cycle, the estimate called the Q-value is calculated [13]:

Q (s t, a t) \leftarrow Q (s t, a t) + α [r t + γ Max Q ((s t + 1, a t + 1) - Q (s t, a t))]

(9)

where α is the learning rate, which determines the speed of learning and is set between 0 and 1. The α close to one ensures that the latest information obtained is utilized, γ reflects the extent to which the future rewards obtained influence the actions, which is also set between 0 and 1. When γ is closer to one, the weight of future rewards is greater while a γ close to zero means only the latest rewards are considered.

Max Q ((s t + 1, a t + 1) - Q (s t, a t))

returns the maximum estimate of the future state operation. Once the Q-value is calculated, it is stored in the Q-table, and each action is selected according to the strategy and Q-table.

3. IVMS Scheme

Based on the research methods and tools mentioned in the previous section, a server SPL model based on CPU temperature estimation is constructed. A new IVMS scheme is proposed, which successfully embeds the Q-learning algorithm for a solution.

3.1. CPU Temperature Estimation-Based SPL Model

A novel CPU temperature estimation-based SPL model is established in this subsection. Firstly, it is necessary to divide two typical working states of the CPU, because different working states will affect the changes in CPU temperature, and predictions refer to (8).

Reference [35] divided the CPU into two typical working states through experiments, namely ‘stepping’ and ‘fluctuating’. The ‘stepping’ state is generally due to significant changes in CPU utilization caused by load migration or new load allocation. In the ‘fluctuating’ state, the CPU executes fixed tasks and maintains relatively stable utilization near specific values, accompanied by unpredictable short-term fluctuations.

Distinguishing and judging the working status of the CPU can accurately predict its temperature changes and obtain an accurate SPL model based on CPU temperature estimation. Therefore, a scientific method is needed to determine the working state of the CPU. To judge the working status of the CPU and estimate the CPU temperature, a CPU utilization time series U={

U_{CPU}^{t_{n} - Δ t}

,

U_{CPU}^{t_{n} - Δ t + 1}

, …,

U_{CPU}^{t_{n}}

} in the time period Δt (take 8s in this paper) before time t_n is first extracted. Second, the SPL model calculates the mean value dispersion (MVD) of the CPU utilization time series U, explaining the difference between the average values of the first half and the second half. The MVD degree

R_{MVD}^{t_{n}}

is given in (10).

R_{MVD}^{t_{n}} = |\frac{2}{Δ t} \sum_{t = t_{n} - Δ t / 2}^{t_{n}} U_{CPU}^{t} - \frac{2}{Δ t} \sum_{t = t_{n} - Δ t}^{t_{n} - Δ t / 2} U_{CPU}^{t}|

(10)

Next, the obtained

R_{MVD}^{t_{n}}

determines CPU working states, which comprises the ‘stepping’ state and ‘fluctuating’ state. Then, the proposed SPL model sets an MVD degree threshold

R_{MVD, th}

, below which a CPU fluctuating working state is deemed without T_CPU change and above which a CPU stepping working state with T_CPU change is identified. The estimated CPU core temperature

T_{pre}^{t_{n}}

at the current time can be determined.

To avoid the measurement of T_CPU, which is subjected to the addition of measurement devices and measurement errors,

T_{pre}^{t_{n}}

can be estimated instead in the proposed SPL model as

T_{pre}^{t_{n}} = \{\begin{array}{l} T_{pre}^{t_{n - 1}} & R_{MVD}^{t_{n}} \leq R_{MVD, th} \\ \begin{array}{l} \frac{μ_{0}}{Δ t} \sum_{t = t_{n} - Δ t}^{t_{n}} U_{CPU}^{t} + μ_{1} + \\ \sqrt{{(\frac{μ_{0}}{Δ t} \sum_{t = t_{n} - Δ t}^{t_{n}} U_{CPU}^{t} + μ_{1})}^{2} - μ_{2}} \end{array} & R_{MVD}^{t_{n}} > R_{MVD, th} \end{array}

(11)

where

μ_{0}

−

μ_{2}

refer to parameters obtained by fitting obtained CPU utilizations, and the correction sequence length

Δ t

is set to eliminate the transient error.

After obtaining the CPU temperature that can be accurately predicted, the next step is to continue to improve the SPL model. By substituting the dynamic temperature model (8) into (5), the dynamic characteristics of CPU power consumption can be established by incorporating temperature variables in (12).

P_{CPU}^{t_{n}} = P_{idle} + a U_{CPU}^{t_{n}} + b_{0} \exp (\frac{- b_{1}}{T_{CPU, s}^{t_{n - 1}} + (T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}}) \exp (\frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}})})

(12)

Then, by substituting (8) into (6) and (7), we can derive the dynamic characteristics of fan power consumption:

P_{fan}^{t_{n}} = λ_{0} + λ_{1} \exp (\frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}}) + λ_{2} \exp (2 \frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}}) + λ_{3} \exp (3 \frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}})

(13)

where λ₀−λ₃ are constant coefficients, defined and calculated in (A1).

Via combining (12) and (13) into (4), the dynamic characteristics of SPL, taking CPU temperature variables into account, is given in (14).

\begin{array}{l} P^{t_{n}} = P_{idle} + a U_{CPU}^{t_{n}} + b_{0} \exp (\frac{- b_{1}}{T_{CPU, s}^{t_{n - 1}} + (T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}}) \exp (\frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}})}) + \\ λ_{0} + λ_{1} \exp (\frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}}) + λ_{2} \exp (2 \frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}}) + λ_{3} \exp (3 \frac{t_{n - 1} - t_{n}}{τ_{CPU}^{t_{n - 1}}}) + P_{other} \end{array}

(14)

To ascertain the power consumption of the server at time t_n, the CPU utilization time series U is first used as input. The CPU temperature

T_{pre}^{t_{n}}

is estimated using (10) and (11). Subsequently,

T_{pre}^{t_{n}}

(as

T_{CPU}^{t_{n - 1}}

) and the current

U_{CPU}^{t_{n}}

are used in (14) to derive the final output of server power consumption. By continuously acquiring the time series U in real time, the model enables real-time calculations and monitoring of power consumption during server operations.

3.2. Q-Learning-Based Solution

As mentioned earlier, the Q-learning algorithm is used to solve the VMS problem in this paper; meanwhile, the state space and action space are defined. The key issue at present is to define the reward function. A reasonable selection of the reward function can effectively guide the algorithm to converge and obtain optimal calculation results.

State space: The state space is a set of resource utilization of all PMs in each time step, expressed as S = {S₀, S₁, …, S_n}, where n is the number of VMs. The state S_t∈S represents each PM resource utilization at time step t and the state S_n corresponds to the final state after all VMs are placed. Each state S_t is represented by S_t = {s_t₁, s_t₂, …, s_tm}, where m is the number of PMs.

Action space: The action space is a set of all placement schemes for VM, represented as A = {A₁, A₂, …, A_m}, where A_j means to place the VM on PM_j, for instance, A₁ = (1, 0, …, 0), which means to place the VM on PM₁.

Reward function: The reward function R(S_t, A_j) is determined as an A_j action for the current state S_t of PMs. In the next step, the environment returns new feedback of the current state s_t+1 and a reward r_t based on the previously performed action. The best action to be performed when the agent is at a certain s_t must be determined, which can be completed through the reward function. The proposed IVMS scheme considers energy consumption and performance scoring to guide algorithm implementation. Energy consumption is to be directly calculated according to (2) and (12) after taking an action decision. Thus, the reward function is given in (15).

R = f (E_{DC}) = \frac{\sum_{1}^{m} \int_{t} P d t}{\sum_{1}^{m} \int_{t} P \max d t}

(15)

Equation (15) calculates the overall energy consumption of the system under different actions. If the energy consumption is smaller, the reward R is larger, which can guide the agent to follow this idea and try to make optimal choices.

After defining the state space, action space, and reward function of Q-learning in VMS, a VMS algorithm flow based on Q-learning can be designed.

3.3. Proposed IVMS

In this subsection, the implementation of the Q-learning algorithm for VMS, which is the proposed IVMS, is shown as pseudocode below (Algorithm 1).

Algorithm 1: IVMS method

First, the Q-table is initialized to 0 or 1 depending on whether the PMs can meet the VM resource requirements. Then, each VM in the VM(t) is selected in turn (lines 1–4). For the current PM set PMs(t), the PM pm_s is selected by using the ε-greedy strategy (lines 5–7). After that, the next state will be observed to calculate its fitness value after assigning vm_q to PM pm_s to obtain the reward, and the Q-value of the current state is calculated by (15) (lines 8–11). Finally, after updating the Q-table value, the next VM will be selected at the next iteration and the algorithm flow will be executed again until the termination condition is satisfied (lines 12–15).

4. Experiment and Simulation

In this section, we describe the experiments we conducted on virtual machine optimization consolidation using a real private cloud platform experimental environment and a larger scale simulation by using the cloud computing simulation tool CloudSim 3.0.3.

4.1. Experiment Based on Cloud Platform

The experiment was conducted on the experiment cloud platform of the School of Electrical Automation and Information Engineering of Tianjin University. This cloud platform was built according to the cloud data center, which consists of six servers (NF5280F5, Inspur, Jinan, China) and one network switch (S6730-S24X6Q, Huawei, Beijing, China). There was a cloud management platform, which was used to manage PM clusters and loads. The process of designing the entire cloud platform and implementing virtual machine scheduling is shown in Figure 3. Based on this cloud platform PM cluster, this study conducted experiments of larger scales with the data from the PlanetLab dataset.

The experiment selected the running data of a cluster containing 60 PMs and 210 virtual machines in the PlanetLab dataset as the raw data for the experiment, which includes the CPU usage of each PM within 24 h. Subsequently, 60 sets of raw data were randomly averaged into six sets of CPU usage datasets, which were used as inputs for six PMs in the experimental platform. The experiment used load running tools to occupy the CPU usage of the PMs and simulate their actual resource usage when running cloud services. The experiment conducted four virtual machine scheduling sessions throughout the entire experimental cycle, with a scheduling policy execution cycle of 6 h. The virtual machine scheduling was performed at the 2nd, 8th, 14th, and 20th hours of the experiment.

At the same time, in order to characterize cloud business environments with different levels of busyness, we selected four sets of typical load situations as the input for the experiment. The difference between them lies in the difference in the average load level of the cluster, which can represent four typical working conditions of the data center cluster, namely lightly loaded, normally loaded, medium-loaded, and heavily loaded. The specific settings are shown in Table 1.

Based on the above experimental setup, three different methods were used to generate virtual machine integration strategies. The first method is to combine the traditional linear server power load model with the classic BFD method (L-B), the second method is to combine the SPL model constructed in this paper with the BFD method (SPL-B), and the third method is the IVMS presented in this paper, which combines the SPL model with the Q-learning method (SPL-Q). Based on these three methods of computational optimization, different virtual machine layout strategies will be provided. Compared to the original situation without migration, the strategies obtained by the three methods have significant energy-saving effects. Figure 4 shows the power consumption of a PM cluster optimized for a virtual machine layout using the three methods within 24 h at the four load rate levels mentioned above.

Figure 4, as well as Figure A1, show the changes in system power consumption after virtual machine integration using L-B, SPL-B, and IVMS under four different loads during a 24 h experimental period. Figure 4a shows that under the lightly loaded situation, at 2:00 and 8:00, our virtual machine scheduling framework monitors and determines that PM2 and PM4 are in a low-load state based on the current system resource usage, and the other four PMs have sufficient resources to accommodate all virtual machines on PM2 and PM4. Therefore, we have calculated a virtual machine migration plan, guided the virtual machine migration, and put the empty PM to sleep to achieve resource reallocation. It can be seen that at the above two moments, the virtual machine scheduling scheme is generated and executed. After a short period of time, PM2 and PM4 are shut down one by one. In the following time, the power consumption of the entire cluster system is significantly reduced. It is worth noting that the significant drop in power did not occur at the two sharp points of 2:00 and 8:00. This is because at sharp points, the scheduling program only completed the judgment of whether the system needs to perform virtual machine migration and integration based on system operation information. The judgment result is that optimization calculations and virtual machine migration after migration require a certain amount of time. In our laboratory, this process usually takes about 10 min.

From Figure 4b, it can be seen that under constant load conditions, the VMS framework proposed in this paper produces significantly different results compared to other VMS frameworks. Specifically, although the virtual machine migration integration schemes provided by the three VMS frameworks will perform some VM migration and shut down PM4 at the scheduled time of 2:00, at the scheduled time of 8:00, IVMS will further migrate the virtual machine and make PM2 idle and shut down. However, other VMS frameworks did not make such judgments and migration instructions but instead provided a certain VM migration scheduling scheme and operation at 14:00 in the next scheduling time, ultimately resulting in significantly higher overall system power consumption than the scheme provided by IVMS.

The experiment compared three different combinations of models and algorithms in terms of energy efficiency improvement. It can be seen that when the system is in a light-load state, using our scheduling framework to integrate computing resources has the most significant energy-saving effect, and as the load level increases, the energy-saving effect weakens. The reason is that when the load is low, the scheduling framework can integrate and concentrate more virtual machines, effectively reducing the use of real PMs. For example, in a light-load scenario with an average load rate of 38%, resource integration was achieved through scheduling virtual machines during the experiment. Two PMs are shut down sequentially without redundancy, greatly reducing system energy consumption. When the load is high, only effective virtual machine migration can achieve resource rearrangement, which may not reduce the actual number of running PMs. Therefore, the results of integrated optimization are only reflected in the resource layout, and it is difficult to significantly reduce energy consumption by reducing the number of PMs. For example, in an overload situation with an average load rate of 88%, even optimizing the resource layout through scheduling frameworks can only improve energy efficiency by about 2%.

In addition, as shown by the line in Figure 4, the virtual machine integration strategy that uses IVMS has significant energy-saving advantages compared to using only the more accurate SPL model or Q-learning algorithm, with an energy-saving effect of about 2.6–10.9%.

4.2. Simulation Based on CloudSim

To verify the effectiveness of the proposed framework and methodology in large-scale data center environments, most studies choose to conduct experiments on large-scale virtualized data center infrastructure simulation tools. A Java-based simulator developed by the CLOUDS Laboratory at the University of Melbourne called CloudSim provides an excellent platform. We conducted extensive experiments using CloudSim 3.0.3 to simulate real cloud data centers. It included a total of three physical machine models and four types of virtual machines. The specific configuration of PMs is shown in Table 2. Similarly to Amazon EC2, we adopted several types of VMs according to [36].

Considering the complex environment in cloud computing, this experiment set up three types of workloads with different characteristics according to [37], assuming that the jobs are independent.

Figure 5 shows the comparison of system energy consumption after optimizing computing resources using L-B, SPL-B, and IVMS for large-scale simulation using CloudSim. It can be intuitively seen that the method proposed in this paper has the best effect in reducing energy consumption, significantly better than the other two methods. The green part in Figure 5 represents the system energy consumption optimized by IVMS, which is generally the lowest within 30 days. The red part indicates that SPL-B consumes more electricity than IVMS, the blue part indicates that L-B consumes more electricity than SPL-B, and the black part indicates the difference in energy consumption between using the traditional L-B method and not migrating.

Specifically, compared to not migrating virtual machines, the IVMS proposed in this paper saves about 17.8% of energy consumption, about 10.3% compared to the SPL-B method, and 7.5% compared to the traditional L-B method. The fundamental reason is that the server power consumption model considering temperature used in this paper is closer to the power consumption patterns in actual server workload fluctuations, which can provide assistance for more accurate decision-making in optimization. In addition, in the decision-making process of optimizing the layout of migrating virtual machines, the Q-learning method is used for global optimization, which greatly reduces the degree of resource fragmentation, makes the integrated PM cluster as close to the maximum limit as possible, ensures that computing resources are fully utilized, reduces redundancy, and thus improves the overall energy efficiency of the system.

5. Conclusions

In response to the energy-saving optimization problem of data centers, this paper improves the existing research that rarely uses accurate server power load models and constructs an SPL model. Based on this precise model, a virtual machine layout optimization problem model for cloud data centers was established. The new model can be solved using various algorithms. This paper uses the Q-learning method for solving this problem and proposes IVMS. The effectiveness of the new method is verified through experiments and simulations. At the same time, it is compared with traditional SPL models and traditional algorithms to prove the superiority of the new method in energy efficiency improvement. The generality of the method is verified through simulations of different scales.

The main research objective of this article is the virtual machine scheduling problem aimed at energy conservation. Through the scheduling of virtual machines and the integration of servers, the energy consumption of the entire system is minimized to the greatest extent possible. Energy conservation and consumption reduction are the focus and optimization goals of the system, so to some extent, the impact of system performance changes is ignored. The method proposed in this article has a good effect on the energy-saving operation of data centers, but there are certain limitations in optimizing data center operation. When considering the service quality and performance provided by the data center, the method proposed in this paper needs further optimization and adjustment, such as in the design of the penalty function in the Q-learning algorithm. In the future, we will adopt other more advanced solving algorithms or introduce resource utilization prediction as an input to the virtual machine scheduling framework, breaking through the limitations of the method proposed in this paper and achieving multi-objective optimization of energy efficiency and performance for computing clusters and data center systems.

Author Contributions

Conceptualization, H.Z.; methodology, H.Z. and L.Y.; software, H.Z. and B.C.; validation, H.Z. and H.L.; formal analysis, H.Z. and J.Z.; investigation, H.Z. and B.C.; resources, H.Z. and B.C.; data curation, H.Z.; writing—original draft preparation, H.Z. and J.Z.; writing—review and editing, H.Z. and B.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China, grant number 2018YFA0702200.

Data Availability Statement

The data are available within the article, and any additional inquiries regarding the findings should be addressed to the corresponding author.

Conflicts of Interest

Binbin Chen was employed by the Energy Research Institute of China Southern Power Grid Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A

In Section 3.1, when studying the power consumption characteristics of server fans, substituting (8) into (6) and (7), the dynamic characteristics of fan power consumption over time can be obtained. For the variable speed working area of the fan (T_min <

T_{CPU}^{t_{n}}

≤ T_max), the dynamic characteristics of power consumption can be organized into the form shown in (13), where the specific expression of parameters λ₀−λ₃ are shown in (A1):

\{\begin{cases} λ_{0} = c_{0} + c_{1} (f_{\min} - k_{fan} T_{\min} + T_{CPU, s}^{t_{n - 1}}) + c_{2} {(f_{\min} - k_{fan} T_{\min} + T_{CPU, s}^{t_{n - 1}})}^{2} + c_{3} {(f_{\min} - k_{fan} T_{\min} + T_{CPU, s}^{t_{n - 1}})}^{3} \\ λ_{1} = c_{1} (T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}}) + 2 c_{2} (f_{\min} - k_{fan} T_{\min} + T_{CPU, s}^{t_{n - 1}}) (T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}}) + 3 c_{3} {(f_{\min} - k_{fan} T_{\min} + T_{CPU, s}^{t_{n - 1}})}^{2} (T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}}) \\ λ_{2} = c_{2} {(T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}})}^{2} + 3 c_{3} (f_{\min} - k_{fan} T_{\min} + T_{CPU, s}^{t_{n - 1}}) {(T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}})}^{2} \\ λ_{3} = c_{3} {(T_{CPU}^{t_{n - 1}} - T_{CPU, s}^{t_{n - 1}})}^{3} \end{cases}

(A1)

Within a specific sampling interval, all parameters in (A1) are constant values. Therefore, λ₀−λ₃ are constant parameters.

Appendix B

Figure A1. Comparison of system power consumption within 24 h under different cluster load levels. (a) Medium-loaded. (b) Heavily loaded.

References

International Energy Agency. Supermicro Computer Inc Data Center and Environment 2022 Green Data Center Status Analysis Report; International Energy Agency: Paris, France, 2022. [Google Scholar]
Bahati, R.M.; Bauer, M.A. Towards adaptive policy-based management. In Proceedings of the 2010 IEEE Network Operations and Management Symposium-NOMS, Osaka, Japan, 19–23 April 2010; pp. 511–518. [Google Scholar] [CrossRef]
Barrett, E.; Howley, E.; Duggan, J. Applying reinforcement learning towards automating resource allocation and application scalability in the cloud. Concurr. Comput. Pract. Exp. 2013, 25, 1656–1674. [Google Scholar] [CrossRef]
Barroso, L.A.; Hölzle, U. The case for energy-proportional computing. Computer 2007, 40, 33–37. [Google Scholar] [CrossRef]
Beloglazov, A.; Buyya, R. Adaptive threshold-based approach for energy-efficient consolidation of virtual machines in cloud data centers. MGC@ Middlew. 2010, 4, 1890799–1890803. [Google Scholar] [CrossRef]
Yu, Q.; Wan, H.; Zhao, X.; Gao, Y.; Gu, M. Online scheduling for dynamic VM migration in multicast time-sensitive networks. IEEE Trans. Ind. Inform. 2019, 16, 3778–3788. [Google Scholar] [CrossRef]
Beloglazov, A.; Buyya, R. Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in cloud data centers. Concurr. Comput. Pract. Exp. 2012, 24, 1397–1420. [Google Scholar] [CrossRef]
Brown, R. Report to Congress on Server and Data Center Energy Efficiency: Public Law 109–431; Lawrence Berkeley National Laboratory: Berkeley, CA, USA, 2008. [Google Scholar]
Liang, Y.; Hu, Z.; Li, K. Power consumption model based on feature selection and deep learning in cloud computing scenarios. IET Commun. 2020, 14, 1610–1618. [Google Scholar] [CrossRef]
Satpathy, A.; Sahoo, M.N.; Swain, C.; Bilal, M.; Bakshi, S.; Song, H. GAMap: A genetic algorithm-based effective virtual data center re-embedding strategy. IEEE Trans. Green Commun. Netw. 2024, 8, 791–801. [Google Scholar] [CrossRef]
Ranganna, P.K.K.; Matt, S.G.; Chen, C.-L.; Jayachandra, A.B.; Deng, Y.-Y. Fitness sharing chaotic particle swarm optimization (FSCPSO): A metaheuristic approach for allocating dynamic virtual machine (VM) in fog computing architecture. Comput. Mater. Contin. 2024, 80, 2557–2578. [Google Scholar] [CrossRef]
Shaw, R.; Howley, E.; Barrett, E. Applying reinforcement learning towards automating energy efficient virtual machine consolidation in cloud data centers. Inf. Syst. 2022, 107, 101722. [Google Scholar] [CrossRef]
Hummaida, A.R.; Paton, N.W.; Sakellariou, R. Scalable virtual machine migration using reinforcement learning. J. Grid Comput. 2022, 20, 15. [Google Scholar] [CrossRef]
Zhu, X.; Xia, R.; Zhou, H.; Zhou, S.; Liu, H. An intelligent decision system for virtual machine migration based on specific Q-learning. J. Cloud Comput. 2024, 13, 122. [Google Scholar] [CrossRef]
Tong, Z.; Wang, J.; Wang, Y.; Liu, B.; Li, Q. Energy and performance-efficient dynamic consolidate vms using deep-q neural network. IEEE Trans. Ind. Inform. 2023, 19, 11030–11040. [Google Scholar] [CrossRef]
Ma, X.; Xu, H.; Gao, H.; Bian, M.; Hussain, W. Real-time virtual machine scheduling in industry IoT network: A reinforcement learning method. IEEE Trans. Ind. Inform. 2022, 19, 2129–2139. [Google Scholar] [CrossRef]
Wang, D.; Zhang, W.; Han, X.; Lin, J.; Tian, Y.-C. A Multi-objective virtual network migration algorithm based on reinforcement learning. IEEE Trans. Cloud Comput. (TCC) 2023, 11, 18. [Google Scholar] [CrossRef]
Cleveland, W. Robust locally weighted regression and smoothing scatterplots. J. Am. Stat. Assoc. 2012, 74, 829–836. [Google Scholar] [CrossRef]
Dutreilh, X.; Kirgizov, S.; Melekhova, O.; Malenfant, J.; Rivierre, N.; Truck, I. Using reinforcement learning for autonomic resource allocation in clouds: Towards a fully automated workflow. In Proceedings of the ICAS 2011, The Seventh International Conference on Autonomic and Autonomous Systems, Venice, Italy, 22–27 May 2011; pp. 67–74. [Google Scholar]
Verma, A.; Ahuja, P.; Neogi, A. pMapper: Power and migration cost aware application placement in virtualized systems. In Proceedings of the ACM/IFIP/USENIX International Conference on Distributed Systems Platforms and Open Distributed Processing, Leuven, Belgium, 1–4 December 2008; pp. 243–264. [Google Scholar] [CrossRef]
Farahnakian, F.; Liljeberg, P.; Plosila, J. Energy-efficient virtual machines consolidation in cloud data centers using reinforcement learning. In Proceedings of the 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Torino, Italy, 12–14 February 2014; pp. 500–507. [Google Scholar] [CrossRef]
Hieu, N.T.; Di Francesco, M.; Ylä-Jääski, A. Virtual machine consolidation with multiple usage prediction for energy-efficient cloud data centers. IEEE Trans. Serv. Comput. 2017, 13, 186–199. [Google Scholar] [CrossRef]
Reda, S.; Nowroz, A.N. Power modeling and characterization of computing devices. Found. Trends Electron. Des. Autom. 2012, 6, 121–216. [Google Scholar] [CrossRef]
Kumar, N.; Zeadally, S.; Chilamkurti, N.; Vinel, A. Performance analysis of Bayesian coalition game-based energy-aware virtual machine migration in vehicular mobile cloud. IEEE Netw. 2015, 29, 62–69. [Google Scholar] [CrossRef]
Baron, B.; Campista, M.; Spathis, P.; Costa, L.H.M.; de Amorim, M.D.; Duarte, O.C.M.; Viniotis, Y. Virtualizing vehicular node resources: Feasibility study of virtual machine migration. Veh. Commun. 2016, 4, 39–46. [Google Scholar] [CrossRef]
Qiu, M.; Chen, Z.; Ming, Z.; Qin, X.; Niu, J. Energy-aware data allocation with hybrid memory for mobile cloud systems. IEEE Syst. J. 2014, 11, 813–822. [Google Scholar] [CrossRef]
Gai, K.; Qiu, L.; Zhao, H.; Qiu, M. Cost-aware multimedia data allocation for heterogeneous memory using genetic algorithm in cloud computing. IEEE Trans. Cloud Comput. 2016, 8, 1212–1222. [Google Scholar] [CrossRef]
Zhang, L.; Han, T.; Ansari, N. Energy-aware virtual machine management in inter-datacenter networks over elastic optical infrastructure. IEEE Trans. Green Commun. Netw. 2017, 2, 305–315. [Google Scholar] [CrossRef]
Fu, X.; Zhou, C. Virtual machine selection and placement for dynamic consolidation in Cloud computing environment. Front. Comput. Sci. 2015, 9, 322–330. [Google Scholar] [CrossRef]
Hussain, M.; Wei, L.F.; Lakhan, A.; Wali, S.; Ali, S.; Hussain, A. Energy and performance-efficient task scheduling in heterogeneous virtualized cloud computing. Sustain. Comput. Inform. Syst. 2021, 30, 100517. [Google Scholar] [CrossRef]
Zhou, Z.; Abawajy, J.H.; Li, F.; Hu, Z.; Chowdhury, M.U.; Alelaiwi, A.; Li, K. Fine-grained energy consumption model of servers based on task characteristics in cloud data center. IEEE Access 2017, 6, 27080–27090. [Google Scholar] [CrossRef]
Lv, P.; Zhang, Z.; Deng, Y.; Cui, L.; Lin, L. HVMM: A holistic virtual machine management strategy for cloud data centers. IEEE Trans. Netw. Serv. Manag. 2023, 21, 574–589. [Google Scholar] [CrossRef]
Beloglazov, A.; Buyya, R. Managing overloaded hosts for dynamic consolidation of virtual machines in cloud data centers under quality of service constraints. IEEE Trans. Parallel Distrib. Syst. 2012, 24, 1366–1379. [Google Scholar] [CrossRef]
Omoniwa, B.; Hussain, R.; Javed, M.A.; Bouk, S.H.; Malik, S.A. Fog/edge computing-based IoT (FECIoT): Architecture, applications, and research issues. IEEE Internet Things J. 2018, 6, 4118–4149. [Google Scholar] [CrossRef]
Zhu, J.; Liu, D.; Li, B. Power consumption model of server in data center based on temperature estimating by CPU operating statuses. Autom. Electr. Power Syst. 2023, 47, 140–148. (In Chinese) [Google Scholar]
Zhou, Z.; Shojafar, M.; Alazab, M.; Abawajy, J.; Li, F. AFED-EF: An energy-efficient vm allocation algorithm for iot applications in a cloud data center. IEEE Trans. Green Commun. Netw. 2021, 5, 658–669. [Google Scholar] [CrossRef]
Lin, W.; Wu, W.; He, L. An on-line virtual machine consolidation strategy for dual improvement in performance and energy conservation of server clusters in cloud data centers. IEEE Trans. Serv. Comput. 2022, 15, 766–777. [Google Scholar] [CrossRef]

Figure 1. Procedure of VM placement and migration.

Figure 2. Structure of VMS based on Q-learning [14].

Figure 3. Private cloud experimental platform and virtual machine scheduling algorithm implementation.

Figure 4. The 24 h power consumption of PM clusters under different loads. (a) Lightly loaded. (b) Normally loaded.

Figure 5. Energy consumption comparison under different loads (within 30 days).

Table 1. Average load levels of clusters selected for the experiment.

Cluster Average Load Level	Cluster Average Load Rate (CPU)
① lightly loaded	28%
② normally loaded	51%
③ medium-loaded	74%
④ heavily loaded	88%

Table 2. Configuration of PMs.

PMs	Cores	CPU	RAM	Operating System
A	8	Intel Xeon 3040 1.86 (GHz)	4GB DDR4 Dual Channel	Linux
B	8	Intel Xeon 3075 2.66 (GHz)	4GB DDR4 Dual Channel	Linux
C	16	Intel Xeon 3040 1.86 (GHz)	64GB DDR4 Dual Channel	Linux

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhou, H.; Zhu, J.; Chen, B.; Yu, L.; Luo, H. Intelligent Virtual Machine Scheduling Based on CPU Temperature-Involved Server Load Model. Energies 2025, 18, 3611. https://doi.org/10.3390/en18143611

AMA Style

Zhou H, Zhu J, Chen B, Yu L, Luo H. Intelligent Virtual Machine Scheduling Based on CPU Temperature-Involved Server Load Model. Energies. 2025; 18(14):3611. https://doi.org/10.3390/en18143611

Chicago/Turabian Style

Zhou, Huan, Jiebei Zhu, Binbin Chen, Lujie Yu, and Heyu Luo. 2025. "Intelligent Virtual Machine Scheduling Based on CPU Temperature-Involved Server Load Model" Energies 18, no. 14: 3611. https://doi.org/10.3390/en18143611

APA Style

Zhou, H., Zhu, J., Chen, B., Yu, L., & Luo, H. (2025). Intelligent Virtual Machine Scheduling Based on CPU Temperature-Involved Server Load Model. Energies, 18(14), 3611. https://doi.org/10.3390/en18143611

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Intelligent Virtual Machine Scheduling Based on CPU Temperature-Involved Server Load Model

Abstract

1. Introduction

2. Problem Formulation and Relevant Fundamentals

2.1. VMS Framework

2.2. System Modeling and Problem Formulation

2.3. CPU Temperature Impact on SPL

2.4. Q-Learning Algorithm for VMS

3. IVMS Scheme

3.1. CPU Temperature Estimation-Based SPL Model

3.2. Q-Learning-Based Solution

3.3. Proposed IVMS

4. Experiment and Simulation

4.1. Experiment Based on Cloud Platform

4.2. Simulation Based on CloudSim

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI