Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings

Korkidis, Panagiotis; Dounis, Anastasios; Kofinas, Panagiotis

doi:10.3390/en14164971

Open AccessArticle

Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings

by

Panagiotis Korkidis

^*,†,

Anastasios Dounis

^*,† and

Panagiotis Kofinas

^†

Department of Biomedical Engineering, Egaleo Park Campus, University of West Attica, 12243 Athina, Greece

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Energies 2021, 14(16), 4971; https://doi.org/10.3390/en14164971

Submission received: 30 June 2021 / Revised: 8 August 2021 / Accepted: 10 August 2021 / Published: 13 August 2021

(This article belongs to the Section G: Energy and Buildings)

Download

Browse Figures

Versions Notes

Abstract

:

This paper focuses on the development of a multi agent control system (MACS), combined with a stochastic based approach for occupancy estimation. The control framework aims to maintain the comfort levels of a building in high levels and reduce the overall energy consumption. Three independent agents, each dedicated to the thermal comfort, the visual comfort, and the indoor air quality, are deployed. A stochastic model describing the CO₂ concentration has been studied, focused on the occupancy estimation problem. A probabilistic approach, as well as an evolutionary algorithm, are used to provide insights on the stochastic model. Moreover, in order to induce uncertainty, parameters are treated in a fuzzy modelling framework and the results on the occupancy estimation are investigated. In the control framework, to cope with the continuous state-action space, the three agents utilise Fuzzy

Q

-learning. Simulation results highlight the precision of parameter and occupancy estimation, as well as the high capabilities of the control framework, when taking into account the occupancy state, as energy consumption is reduced by

55.9 %

, while the overall comfort index is kept in high levels, with values close to one.

Keywords:

stochastic processes; fuzzy reinforcement learning; multi agent control systems; occupancy estimation; evolutionary algorithms; buildings; fuzzy modelling; comfort control

1. Introduction

The problem of occupancy estimation and prediction, in the context of comfort control in buildings, is of crucial importance and has been studied by many authors [1]. Our motivation for this work is to combine an algorithm that estimates the number of occupants inside a room, given the CO₂ state concentration, with a multi agent control system that utilises the available information and controls the comfort indexes: thermal comfort, visual comfort, and indoor air quality. Most heating, ventilation, and air conditioning systems (HVAC) function on the assumption of maximum occupancy, not taking into account the variations of the occupancy level. There are various factors to be studied, in order for an estimation algorithm to be developed, that computes the actual number of occupants that entered a room during a time period.

In [2], Dong et al. study the control of HVAC systems using three approaches for the occupancy prediction: the expectation maximisation algorithm, a finite state automata method and introduces a method based on uncertain basis functions. A method based on a hybrid neural network is proposed in [3], where an extreme learning machine tuned with differential evolution is proposed. Models of occupancy in a single person office are studied in [4] using a non-homogeneous Poisson process to simulate the occupancy sequence. In [5] Feng et al. study the simulation of occupancy in buildings in four levels: the number of occupants in a building, the occupancy status in a space, the number of occupants in a space and the space location of an occupant. The correlation between occupancy and energy performance of a building is presented in [6], together with the impact on the quality of the indoor environment. A study that takes into account the stochastic nature of occupants behaviour can be found in [7], where a Markov chain is proposed. The Markov process is also studied in the context of occupancy models in [8]. Occupancy prediction based on the carbon dioxide concentration is studied in [9], where Jiang et al. study a Bayesian filtering approach. In [10], authors study models for CO₂ concentration prediction: A deterministic mass balance differential equation is firstly used and, moreover, a linear model is constructed using system identification techniques. The importance of occupancy for building control is highlighted in [11], where a model predictive controller utilises the occupancy information.

Occupancy estimation models can be classified into three categories. (a) statistical (black-box) models, (b) physical/deterministic (white-box) models, and (c) grey-box models. The first category includes hidden Markov models, machine learning methods, and artificial neural networks [2,3,4]. The second category, the physical models, are most often based on a CO₂ mass balance equation [9,12,13]. The benefits of the first and second category, i.e., black and white box models, are incorporated in the category of grey-box models. The latter models synthesise physical knowledge with statistical and data-driven parameter estimation. A grey-box model can describe variations in a room’s carbon dioxide level and estimate room occupancy [3,14,15,16,17].

Multi agent control systems are a group of agents which, in order to solve a problem, interact with the environment and with each other. MACS have been proposed for achieving high levels of thermal comfort inside a building and reducing the consumption of the heating/cooling system by efficiently controlling the indoor temperature. In [18], a MACS for a building with two rooms is proposed in order to control the thermal comfort of the two rooms through a central HVAC system. Intelligent control of HVAC systems [19] is also studied in [20] using a fuzzy based approach for personal comfort satisfaction. In [21] a MACS with multiple layers and a central agent for coordination is proposed. The system aims to reduce the overall efficiency of the building and simultaneously to satisfy the thermal needs of the occupants. Additionally, MACS have been proposed for achieving air quality, and thermal and visual comfort in the interior of the buildings with minimisation of energy consumption [22]. The MACS controls the actuators regarding the CO₂ concentration, the illumination, and the temperature in conjunction with reducing the energy consumption. These approaches are based on multi-layer MACS with local agents and supervisors which can lead to failure if the central agents fail. Moreover, they are based on the expert knowledge or on offline optimisation algorithms for adjusting parameters and they do not use any learning mechanism [23]. Only recently, few methods using reinforcement learning have been proposed but these methods are only limited to achieve thermal comfort [24]. The main drawback of this method is that it cannot be applied for continuous state-action space. Fuzzy logic systems can be used in order to overcome this drawback. Moreover, none of the above mentioned papers, combine occupancy estimation models with a multi agent control system, in order to reduce energy waste.

In the proposed system, a grey-box model consists of stochastic differential equation (SDE) and fuzzy sets is able to quantify uncertainties, which improves the occupancy estimation. The contribution of this study is to:

Provide a fully decentralised control architecture;
Provide a control architecture based on reinforcement learning;
Incorporate a fuzzy logic system with reinforcement learning;
Combine a multi agent control system with a grey-box modelling;
Combine a multi agent control system with occupancy estimation;
Combine fuzzy modelling with occupancy estimation.

The paper is organised as follows: The problem under study is stated in Section 2, in addition to a discussion on general notions of stochastic processes and the analysis of the CO₂ adopted stochastic model. Stochastic differential equations’ parameter estimation and differential evolution are also analysed. The use of the occupancy estimation algorithm, proposed in [14] is being studied for the estimated and the fuzzy parameters of the stochastic model, using two different occupancy profiles. Furthermore, a thorough analysis of Fuzzy

Q

-learning and the MACS is presented, together with the mathematical modelling of the building under study. Simulation results are demonstrated in Section 3 for each of the methods used and conclusions are discussed.

2. Materials and Methods

2.1. Problem Statement

The problem studied in the present work focuses on the development of a multi agent control system for the control of the artificial lighting, ventilation, and heating/cooling subsystems, in order for the comfort indexes to be maintained in the desired levels. The MACS should utilise the occupancy information, estimated using carbon dioxide concentration data. The carbon dioxide generation is considered to be given by the solution of a stochastic differential equation. By numerically solving the stochastic differential equation the training data are generated, representing the CO₂ levels in the room under study. Following that, an occupancy estimation algorithm is implemented to give information on the number of occupants. Two main assumptions have been made:

The training set consists of 1440 state measurements, corresponding to one working day, 24 h, times 60 min per hour;
The state, i.e., the CO₂ concentration level, is fully observable.

The work flow of the proposed method, graphically illustrated in Figure 1, is the following: Extending the CO₂ mass balance equation to a stochastic differential equation. The stochastic mass balance equation is numerically solved, to generate data of CO₂ concentration. Assuming that the solution time-series represents a stochastic process with unknown parameters, the parameters are estimated by maximising the likelihood function of the data with a differential evolution algorithm. Given the data and the estimated parameters, the occupancy estimation algorithm is implemented to compute the unknown number of occupants, that generated the data. Since the method of maximum likelihood provides only point estimates of the unknown parameters, fuzzy modelling is employed to study the impact of the parameters’ uncertainty on the occupancy estimation. The estimated profile of the building occupancy is going to be used as input in the MACS system. The MACS system consists of three decentralised local agents and controls the actuators, each responsible for the indoor temperature, the indoor horizontal illuminance, and the carbon dioxide concentration. Occupancy estimation defines whether the building is unoccupied and whether the actuators need not to operate, in order to reduce energy waste in time periods where there are no occupants.

2.2. Stochastic Processes and CO₂

The generation and decay of CO₂ is governed by the mass balance ordinary differential equation [25,26] given by:

V \frac{d x^{C O_{2}} (t)}{d t} = q (x_{o u t}^{C O_{2}} - x^{C O_{2}} (t)) + G_{C O_{2}}

(1)

where V denotes the volume of the room,

x^{C O_{2}} (t)

the carbon dioxide concentration,

x_{o u t}^{C O_{2}}

is the outdoor carbon dioxide concentration, q is a term associated with the ventilation and infiltration and window air exchange.

G_{C O_{2}}

is a term associated with the generation of carbon dioxide in the room. The latter contains two terms: the carbon dioxide generation per occupant

c_{o c c}^{C O_{2}}

multiplied by a positive integer valued number

n_{o c c} (t)

, denoting the number of persons, coming from the fact that the more the persons the higher the concentration in the room. If a closed window and mechanical ventilation free room are considered, the q term in Equation (1) is characterised only by the infiltration rate

q_{i n f}

.

In order to adopt a more realistic model for the carbon dioxide concentration in the room we must treat the CO₂ time series as a stochastic process. By adopting a stochastic model, e.g., considering the CO₂ concentration data as a stochastic process, uncertain factors and other random unmodelled phenomena are taken into account [26]. The stochastic nature emerges not only from known factors, e.g., varying exhalation rates of the occupants in the room, the air exchange through doors, the non-homogeneous nature of the room’s air, as well as unknown factors that cannot be described in mathematical terms. As referred to in [26], the CO₂ differential equation contains difficult to exact modelling factors, even in the deterministic case, such as surface area on which indoor pollutants are deposited and the deposition velocity of the pollutant, among others. Such terms can be considered as unmodelled, since their exact mathematical expression in the equation may lack accuracy. The Brownian motion, which will be introduced to the mass balance differential equation, can model both known but uncertain, and unknown or unmodelled phenomena, that govern the indoor carbon dioxide concentration.

A formal definition of a stochastic process is given below:

Definition 1

([27]). Let the probability space (Ω,

F

,

P

), the ordered set T and the measurable space (E,

G

). A stochastic process is a collection of random variables

X = {X_{t}; t \in T}

, where for each fixed

t \in T

,

X_{t}

is a random variable in (Ω,

F

,

P

) to (E,

G

). Ω denotes the sample space, and E denotes the state space of the stochastic process

X_{t}

.

Stochastic processes can be viewed as a function of both

t \in T

and

ω \in Ω

, e.g., for fixed t we have a random variable,

X_{t}

, and for fixed

ω

in the sample space

Ω

we find the function

X_{t} (ω) : T \to E

, which is the realisation of the process. One of the most important stochastic processes is the Brownian motion [28].

Therefore, considering the CO₂ generation as a stochastic process the concentration is associated to a random variable, due to the unknown influencing random effects. Since a deterministic differential equation model has been used, Equation (1), the randomness must be encapsulated in the model. By adding a noise term, the ordinary differential equation can be extended to a stochastic differential equation. Stochastic differential equations are used to model dynamics which exhibit random behaviour. It is now important to state the connection of stochastic differential equations and stochastic processes, in order to justify why adopting such an approach makes perfect sense.

Our purpose is to solve the following stochastic differential equation of the general form

d X (t) = \underset{drift}{\underset{⏟}{μ (t, X (t); θ) d t}} + \underset{diffusion}{\underset{⏟}{g (t, X (t); θ) d B_{t}}}

(2)

where

X (t) \in R

,

μ : [0, \infty) \times R \times Θ \to R

is the drift term and

g : [0, \infty) \times R \times Θ \to R

is the diffusion term.

B_{t}

is a 1-dimensional standard Brownian motion. An initial condition is given

X (t) = x

, where

t = 0

.

Let the filtration

F_{t}

generated by the Brownian motion

B_{t}

, i.e.,

F_{t} : = σ ({B_{s} : s \in [0, t]})

, for every

t \geq 0

. Under mild conditions (Similarly to Lipschitz continuity and linear growth) on the functions

μ (t, X (t); θ)

and

g (t, X (t); θ)

the solution of the stochastic differential equation exists and it is a unique process,

X (t), t \geq 0

satisfying:

It has continuous paths;
It is $F_{t}$ -adapted;
Almost surely, it holds that

$X (t) = x_{0} + \int_{0}^{t} μ (s, X (s); θ) d s + \int_{0}^{t} g (s, X (s); θ) d B_{s}$

(3)

The solutions of stochastic differential equations are stochastic processes called diffusion processes. The mathematical treatment of stochastic differential equations is quite technical, however in the case of linear one-dimensional equations, closed form solutions could be derived. Moreover, as discussed in the next section, the solutions of stochastic differential equations are Markov processes [29], a property that turns out to be crucial in practical implementations.

2.3. The Hull–White Model as Mass Balance Equation

Since, we wish to treat the unknown process of carbon dioxide indoor generation and decay as a stochastic process, it is useful to model the mass balance equation with a stochastic differential equation. A model which originates from the financial mathematics literature, called the Hull–White Model (HWM) [29,30], has the suitable mathematical form in order for Equation (1) to be extended into a stochastic framework. This model has been introduced by John Hull and Alan White in 1994, as an extension to the Vasicek model for interest rates modelling. The Hull–White model is given by the following equation:

d R (t) = [Φ (t) - α R (t)] d t + σ d B_{t}

(4)

where

R (t)

denotes the interest rate,

Φ (t)

denotes a time varying function, and

α

is the rate at which the rate is pushed towards the long term level.

Borrowing the mathematical structure of the HWM, a stochastic differential equation that is a generalisation of the Ornstein–Uhlenbeck model, can be written:

d X^{C O_{2}} (t) = [\underset{Φ (t)}{\underset{⏟}{q_{i n f} X_{o u t}^{C O_{2}} + c_{o c c}^{C O_{2}} n_{o c c} (t)}} - q_{i n f} X^{C O_{2}} (t)] d t + σ d B_{t}

(5)

where

X^{C O_{2}} (t)

is the carbon dioxide concentration in ppm (Appendix A),

q_{i n f}

denotes the infiltration rate in the room, i.e., the air change per hour,

σ

denotes the constant volatility of changes in CO₂, due to the Gaussian white noise (Brownian motion’s increments are

N

distributed.)

d B_{t}

,

X_{o u t}^{C O_{2}}

is the outdoor CO₂ concentration in ppm and

c_{o c c}^{C O_{2}}

is the CO₂ generation per person in ppm/h. Equation (5) is the stochastic mass balance differential equation.

We are dealing with a first order linear non autonomous stochastic differential equation, thus an analytical solution can be derived. The solution of Equation (5) can be found by applying Itô’s lemma on the function

e^{- q_{i n f}} X^{C O_{2}} (t)

and then integrating to get:

X^{C O_{2}} (t) = X^{C O_{2}} (s) e^{- q_{i n f} (t - s)} + \int_{s}^{t} e^{- q_{i n f} (t - τ)} Φ (τ) d τ + σ \int_{s}^{t} e^{- q_{i n f} (t - τ)} d B_{τ}

(6)

therefore, the expectation, mean solution, conditioned on the

σ

-algebra

F_{s}

is given by:

E [X^{C O_{2}} (t) | F_{s}] = X^{C O_{2}} (s) e^{- q_{i n f} (t - s)} + \int_{s}^{t} e^{- q_{i n f} (t - τ)} Φ (τ) d τ

(7)

since,

E [σ \int_{s}^{t} e^{- q_{i n f} (t - τ)} d B_{τ} | F_{s}] = 0

.

F_{s}

is the filtration generated by the process up to time s. Moreover, the variance of the process is

\begin{matrix} Var [X^{C O_{2}} (t) | F_{s}] & = E [{(X^{C O_{2}} (t) - E [X^{C O_{2}} (t) | F_{s}])}^{2} | F_{s}] \\ = E [{(σ \int_{s}^{t} e^{- q_{i n f} (t - τ)} d B_{τ})}^{2}] \\ = E [σ^{2} \int_{s}^{t} e^{- 2 q_{i n f} (t - τ)} d_{τ}], due to Itô ’ s Isometry \\ = \frac{σ^{2}}{2 q_{i n f}} (1 - e^{- 2 q_{i n f} (t - s)}) \end{matrix}

(8)

The mean solution of the stochastic differential equation, describing

X^{C O_{2}} (t)

, is composed by an exponential term which forces the state to decay and an integral term which is due to the non autonomous nature of the differential equation: this term reflects the presence of occupants in the room at each time t, i.e., a piecewise constant function, such that

n_{o c c} : [0, \infty) \to Z^{+}

. The integral is fully tractable, thus a closed form solution is derived. The presence of occupants inside the room should, from a physics perspective, increase the levels of the carbon dioxide concentration, a fact that is reflected naturally by the above mathematical modelling.

The computation of the mean and variance describing the behaviour of the stochastic process

X^{C O_{2}} = {X^{C O_{2}} (t); t \geq 0}

allows to write analytical expressions of the Likelihood function as the product of the transition densities from a state at time s to the state at time t, with

t \geq s

.

The original linear stochastic differential equation is equivalent [31] to the following discrete time system:

X_{t_{k + 1}}^{C O_{2}} = f_{k} X_{t_{k}}^{C O_{2}} + u_{k} + q_{k}, where q_{k} \sim N (0, Σ_{k})

(9)

where the coefficients

f_{k}

,

u_{k}

and

Σ_{k}

reads as:

\begin{array}{l} (10) & f_{k} & = ψ (t_{k + 1}, t_{k}) \\ (11) & u_{k} & = \int_{t_{k}}^{t_{k} + 1} ψ (t_{k + 1}, τ) Φ (τ) d τ \\ (12) & Σ_{k} & = σ^{2} \int_{t_{k}}^{t_{k} + 1} ψ^{2} (t_{k + 1}, τ) d τ \end{array}

where

ψ_{k} (t_{k + 1}, t_{k}) = e^{- q_{i n f} Δ t}

, and

Δ t = t_{k + 1} - t_{k}

These expressions can be analytically evaluated, to give the mean and variance of the original stochastic differential equation.

As discussed earlier, our purpose is to model the carbon dioxide concentration by a stochastic process. This is performed through the assumption that the generation and decay of carbon dioxide is governed by a linear non autonomous stochastic differential equation. The solution of the latter is a stochastic process and can be obtained either by using the discrete representation Equation (9), or by solving the original stochastic differential Equation (5) using a numerical integration scheme. To generate the training data, on which the parameters and occupancy estimation algorithm is based, the differential equation is solved using the method of Euler–Maruyama (EM), see Algorithm 1 [31].

Algorithm 1 Euler Maruyama

1:: Given a stochastic differential equation of the form $d X (t) = μ (t, X (t); θ) d t + g (t, X (t); θ) d B_{t}$ , with known parameters $θ$ , an initial condition $X (t_{0})$ and a time interval $[t_{0}, t]$ .
2:: Divide the time interval $[t_{0}, t]$ into N steps of $Δ t$ length.
3:: for Every time step k do
4:: Draw a random variable such as $Δ B_{k} \sim (0, Δ t)$
5:: Compute the approximation: $\hat{X} (t_{k + 1}) = \hat{X} (t_{k}) + μ (t_{k}, X (t_{k}); θ) Δ t + g (t_{k}, X (t_{k}); θ) Δ B_{k}$
6:: end for
7:: return The approximation $\hat{X} (t_{k})$ of the solution.

In this paper, CO₂ data are generated by simulating Equation (5), using the EM method. In order to be able to perform this, the parameters

θ

, are considered to be known. In Figure 2, the evolution of the stochastic differential equation’s solution, with a single occupant, is depicted: Ten solution paths, each associated with a realisation of the Brownian motion, the mean solution and the

95 %

quantiles.

In the following section, the method for the estimation of true parameters will be analysed, if an assumption of unknown parameters lies.

2.4. Estimation of the Stochastic Process Parameters

Since in real world applications the data of the carbon dioxide concentration may come in a time series, i.e., a set of available data, one might ask what the parameters of the process would be. Thus, an algorithm must be implemented in order to be able to estimate the true parameters of the problem. Henceforth, it is assumed that the solution time series of the carbon dioxide concentration is given, and the unknown parameters

θ

of the stochastic process are to be estimated.

We, therefore, have a problem of parameter estimation on stochastic differential equations: Given the stochastic differential Equation (5), the task is to estimate the parameters vector

θ

from a finite sample of available observations of the stochastic process

X^{C O_{2}} (t) = {X^{C O_{2}} (t), where t = t_{0}, \dots, t_{N}}

. The model is parametrised by the following quantities: the infiltration rate

q_{i n f}

, the outdoor carbon dioxide concentration

X_{o u t}^{C O_{2}}

, the generation rate of carbon dioxide

c_{o c c}^{C O_{2}}

by a single occupant and the volatility of the noise

σ

.

Parameter estimation problem in probabilistic models has been investigated by many authors and several methods have been proposed. For our task, the well established method of maximum likelihood in the context of stochastic differential equations, is adopted. The likelihood function must be maximised, with respect to the unknown parameter vector

θ

.

In light of the Markov property of the stochastic differential equation’s solution, an analytical expression of the likelihood of the observed data given the parameters is written by:

p (X^{C O_{2}} (t_{0}) \dots X^{C O_{2}} (t_{N}) | θ) = \prod_{k = 0}^{N - 1} p (X^{C O_{2}} (t_{k + 1}) | X^{C O_{2}} (t_{k}), θ)

(13)

where, in general,

p (X (t) | X (s), θ)

is the transition density, describing the probability of the stochastic process to be from

X (s)

, to the state

X (t)

in time

t - s

. Since the solution of the stochastic differential equation is a linear transformation of the Brownian motion, which is a Gaussian Process, it follows that the solution will also be Gaussian, thus, the transition densities are given by:

p (X^{C O_{2}} (t) | X^{C O_{2}} (s), θ) = N (E (X^{C O_{2}} (t) | X^{C O_{2}} (s)), Var (X^{C O_{2}} (t) | X^{C O_{2}} (s)))

(14)

where the Markov property is explicitly used, and the expressions for the distribution’s mean and variance are given by Equations (7) and (8), respectively.

The parameter vector

\hat{θ} = ({\hat{θ}}_{1}, {\hat{θ}}_{2}, {\hat{θ}}_{3}, {\hat{θ}}_{4}) = ({\hat{q}}_{i n f}, {\hat{X}}_{o u t}^{C O_{2}}, {\hat{c}}_{o c c}^{C O_{2}}, \hat{σ})

must be computed, such that the likelihood to be maximised. In simple terms, the parameter space

Θ

, is searched, for those parameters that generate the actual data, which in our case are the data from the numerical solution of the stochastic mass balance differential equation. For practical reasons, the parameters are determined in order for the negative log-likelihood function to be minimised, i.e.,

\hat{θ} = \underset{\begin{matrix} θ_{i} \in [l_{θ_{i}}, u_{θ_{i}}] \\ i = 1 \dots 4 \end{matrix}}{arg min} L (X^{C O_{2}} (t_{0}) \dots X^{C O_{2}} (t_{N}), θ)

(15)

where

l_{θ_{i}}, u_{θ_{i}}

denote the lower and upper bounds of the feasible domain for each of the four parameters.

L (.)

denotes the negative log-likelihood function given by:

L (X^{C O_{2}} (t_{0}) \dots X^{C O_{2}} (t_{N}), θ) = - \sum_{k = 0}^{N - 1} log p (X^{C O_{2}} (t_{k + 1}) | X^{C O_{2}} (t_{k}), θ)

(16)

In this paper a differential evolution algorithm is proposed, in order to numerically minimise the negative log-likelihood, without the need of derivative information.

Differential Evolution (DE) [32], belongs to the class of evolutionary algorithms (EAs), that stochastically explore the space of the optimisation problem’s decision variables. DE uses a structure of a population of candidate solutions and guides the search procedure through a number of generations, until termination criteria have been met. The algorithm is parametrised by a minimum number of hyper-parameters: the mutation scaling F, the crossover probability

p_{c r}

and the population size

n_{p}

.

The mutation of each individual comes in terms of perturbation of a target vector from the population with two weighted differential, described by:

θ_{i, Mutant}^{g + 1} = θ_{r_{1}}^{g} + F (θ_{r_{2}}^{g} - θ_{r_{3}}^{g}) + F (θ_{r_{4}}^{g} - θ_{r_{5}}^{g})

(17)

for

i = 1, \dots, n_{p}

, where

r_{1}

to

r_{5}

\in {1, 2, \dots, n_{p}}

and

i \neq r_{1} \neq r_{2} \neq r_{3} \neq r_{4} \neq r_{5}

. The mutation operator produces a population of perturbed model parameter vectors, called mutant vectors. Crossover is described by:

\begin{matrix} θ_{j i, Trial}^{g + 1} = \{\begin{matrix} θ_{j i, Mutant}^{g + 1} if U (0, 1) \leq p_{c r} or j = U_{i} \\ θ_{j i}^{g} otherwise \end{matrix} \end{matrix}

(18)

where

U_{i}

denotes the uniform distribution in integers

{1, \dots, 4}

and

U

the uniform distribution in

(0, 1)

. The candidate solutions will be represented by a population of individuals. For every generation g the population of the parameters is described:

\begin{matrix} P^{g} = {P_{i}}_{i = 1 \dots M} = \end{matrix} \begin{matrix} \begin{matrix} P_{1}^{g} P_{2}^{g} \dots P_{n_{p}}^{g} \end{matrix} \\ ( & \begin{matrix} θ_{1}^{1} & θ_{1}^{2} & \dots & θ_{1}^{n_{p}} \\ ⋮ & ⋮ & ⋮ & ⋮ \\ θ_{4}^{1} & θ_{4}^{2} & \dots & θ_{4}^{n_{p}} \end{matrix} & ) \end{matrix}

(19)

The proposed method is summarised in Algorithm 2.

Algorithm 2 Parameter estimation using likelihood function and differential evolution

1:: Generate Data by numerically integrating the stochastic mass balance differential Equation (6) using Euler–Maruyama method.
2:: Compute The mean and variance solutions of the SDE analytically
3:: Compute The transition densities and the negative log-likelihood function.
4:: Given Data from EM solution of the Stochastic Differential Equation:
$D = {(t_{k}, X^{C O_{2}} (t_{k}))}_{k = 1, \dots, N}$ .
5:: Initialise Differential evolution variant DE/Rand/2/Bin, where Rand describes the mutation operator, 2 indicates the number of differentials, and Bin the crossover scheme.
6:: Set Mutation parameter F, crossover probability $p_{c r}$ , population size $n_{p}$ and a maximum number of generations: MaxGen.
7:: Generate Initial population of candidate solutions
8:: $P^{0} \leftarrow U n i f o r m (l_{θ_{i}}, u_{θ_{i}}, n_{p})$
9:: forEvery Generation gdo
10:: for all individuals in $P^{g}$ do
11:: Mutate
12:: $P_{m u t a t i o n}^{g + 1} \leftarrow$ Mutation( $P^{g}, F$ )
13:: if Bound constraints $[l_{θ_{i}}, u_{θ_{i}}]$ are violated then
14:: $P_{m u t a t i o n}^{g + 1} \leftarrow$ Resampling( $P_{m u t a t i o n}^{g + 1}, l_{θ_{i}}, u_{θ_{i}})$
15:: end if
16:: Crossover
17:: $P_{t r i a l}^{g + 1} \leftarrow$ Crossover( $P^{g}, P_{m u t a t i o n}^{g + 1}, p_{c r}$ )
18:: Selection
19:: if $L (P^{g}) < L (P_{trial}^{g + 1})$ then $P^{g + 1} \leftarrow P^{g}$
20:: else $P^{g + 1} \leftarrow P_{trial}^{g + 1}$
21:: end if
22:: end for
23:: $g \leftarrow g + 1$
24:: end for
25:: Return The Estimated Parameter vector $\hat{θ} = ({\hat{q}}_{i n f}, {\hat{X}}_{o u t}^{C O_{2}}, {\hat{C}}_{o c c}^{C O_{2}}, \hat{σ})$

In the latter algorithm, it is assumed that the data are available to us. To achieve this, we synthetically generate them, by solving the stochastic differential equation using the Euler–Maruyama method. The function Resampling implements the resampling of the DE solutions into the feasible domain, since the mutation operation may produce solutions that are outside of the parameter’s bound constraints. The maximum number of generations, MaxGen is set to 100, the population size

n_{p} = 80

, the crossover probability

p_{c r} = 0.2

and a mutation scaling F sampled uniformly in

[0.2, 2]

, in every generation.

The justification for the choice of the hyper parameters is based on [32]: Empirical studies on the simulations performed, show that the convergence of the algorithm occurs at around 70th generation, hence a maximum number of 100 generations would suffice. The probability of crossover,

p_{c r}

is chosen to be small in order to avoid premature convergence and moreover, to increase the search robustness. The mutation scaling is chosen to be given by a uniform distribution in the range

[0.2, 2]

, in order to achieve a trade-off between exploring tight valleys and search diversity. As far as the population size is concerned, the choice of 80 achieves a reasonably fast convergence, and it is chosen on a trial and error basis.

2.5. Fuzzy Occupancy Estimation

It should be clearly stated, that up to this point the unknown (to be estimated) parameters were considered to be: the infiltration rate

q_{i n f}

, the outdoor carbon dioxide concentration

X_{o u t}^{C O_{2}}

, the carbon dioxide generation per occupant

c_{o c c}^{C O_{2}}

and the volatility of the noise term

σ

. Since the number of occupants in Equation (5), can be considered as a parameter too, the problem is addressed by likewise asking the same question as in the parameter estimation context: which occupancy vector generates the observed data.

Hence, if the model is re-parametrised with a single unknown parameter

n_{o c c} (t)

and the estimated parameters

\hat{θ}

are considered to generate the true data in high accuracy, then a likelihood approach can be used to determine the occupancy vector. We followed and implemented the occupancy estimation algorithm proposed in [14] and estimated the occupancy vector at every time instance. The occupancy vector

n_{o c c}

is very high dimensional, thus an optimisation based approach even in terms of evolutionary algorithms has, to our knowledge, not been studied. Wolf et al. propose a greedy algorithm, based on maximum likelihood’s principle, which initially considers the occupancy vector as a zero vector, representing an empty room, and iteratively adds integers values to the vector points, i.e., adds an occupant in each time step, where the likelihood function is maximised. For completeness, an implementation of the algorithm is given in Algorithm 3.

The reason that the estimated parameters

\hat{θ}

serve as inputs in the algorithm, is that in a real world setting the data observation will be given as a time-series, where the model’s parameters will be unknown. In the case when the estimated parameters are close to the ground truth, then the estimated model will generate very similar data and the only parameter that will be to be determined is the occupancy vector. The performance of the algorithm has been tested under two different occupancy profiles, with very accurate results. It is crucial to note that the method of parameter estimation, in our case the differential evolution algorithm, must result in close values of the true parameters

θ

otherwise, as explored next, the occupancy estimation algorithm will not output the correct estimation vector.

Algorithm 3 [14] Occupancy estimation algorithm

1:: Inputs:
Observed Data $X^{C O_{2}} (t)$ , Estimated Parameters $\hat{θ}$ , Likelihood Function $L (n_{o c c} (t))$
2:: Initialise
${\hat{n}}_{o c c} \leftarrow 0_{1 \times Number of Observations}$
$A \leftarrow I_{Number of Observations \times Number of Observations}$
Compute the value of the negative log-likelihood function:
$L_{{\hat{n}}_{o c c}} \leftarrow L (X^{C O_{2}} (t_{0}) \dots X^{C O_{2}} (t_{N}), \hat{θ}, {\hat{n}}_{o c c})$
3:: for a number of iterations do
4:: for every time step, $i = 1 \dots N$ do
5:: $n_{o c c}^{i} \leftarrow {\hat{n}}_{o c c} + A (i, :)$
6:: $L_{n_{o c c}^{i}} \leftarrow L (X^{C O_{2}} (t_{0}) \dots X^{C O_{2}} (t), \hat{θ}, n_{o c c}^{i})$
7:: end for
8:: $n_{o c c}^{m i n} \leftarrow {arg min}_{\begin{matrix} n_{o c c}^{i} \end{matrix}} L_{n_{o c c}^{i}}$
9:: Greedy Selection
10:: if $L_{n_{o c c}^{m i n}} < L_{{\hat{n}}_{o c c}}$ then ${\hat{n}}_{o c c} \leftarrow n_{o c c}^{m i n}$
11:: else ${\hat{n}}_{o c c} \leftarrow {\hat{n}}_{o c c}$
12:: end if
13:: end for
14:: Output: Estimation of the occupancy vector ${\hat{n}}_{o c c}$

The methodology of parameter

θ

estimation, via the maximum likelihood approach, results in point estimates. Since the true parameters that generated the data of carbon dioxide concentration are unknown, a question of how certain about these values we could be, emerges. As earlier discussed there are two kinds of uncertainty: the uncertainty that stems from random effects and unmodelled dynamics and the structural uncertainty, which corresponds to possible fluctuations of the model parameters. Stochastic differential equations can be considered to model the first case. Pertaining to the second type of uncertainty, a solution can be the use of fuzzy modelling [33]. The subject of stochastic differential equations with fuzzy parameters, leading to Fuzzy stochastic differential equations, have been studied, from a solution point of view, in [34,35].

In this paper, the effect of this structural parametric uncertainty onto the occupancy estimation algorithm, is investigated. The estimated parameters are mapped onto the fuzzy space using triangular fuzzy numbers, thus an uncertainty is induced on the model’s parametric space. Fuzzy numbers of the following form have been used:

μ (x; x_{L}, x_{C}, x_{R}) = max (0, min (\frac{x - x_{L}}{x_{C} - x_{L}}, \frac{x_{R} - x}{x_{R} - x_{C}}))

(20)

where x denotes a point in the domain

X

of the parameter to be fuzzified,

[x_{L}, x_{R}]

denotes the support of the fuzzy number and

x_{C}

denotes the core of the fuzzy number. Note that

μ

here symbolises the membership function of the fuzzy set, not to be confused with the drift term in Equation (2).

The assumption of structural uncertainty has been made for two of the estimated parameters: the infiltration rate

q_{i n f}

and the carbon dioxide generation per occupants

c_{o c c}^{C O_{2}}

. This reflects the fact that in a real scenario the infiltration may have small deviations from the true value [36]. Similarly, the CO₂ generation per occupant in the room,

c_{o c c}^{C O_{2}}

, in any case cannot be a crisp value, since it strongly depends on factors, such as the type of work that an occupant performs during its presence in the room, the age of the occupant, the gender of the occupant, and various physical characteristics [37,38]: a young child’s carbon dioxide generation differs from an adult heavy worker’s.

Smaller uncertainty bounds are induced on the infiltration parameter, and larger on the CO₂ generation per occupant. The fuzzification of the latter parameters is implemented by a symmetric triangular fuzzy number, given by Equation (18), where the supports have been calculated by

{\hat{θ}}_{1} \pm 10 %

and

{\hat{θ}}_{3} \pm 30 %

for the infiltration parameter and the carbon dioxide generation, respectively. The estimated parameters

{\hat{θ}}_{1}

and

{\hat{θ}}_{3}

are the cores of the fuzzy sets that describe the uncertainty of the infiltration and CO₂ generation. Figure 3 depicts the fuzzy sets, their support and the crisp uncertainty set, corresponding to the

0.5

-level.

To study the result of the fuzzy parameters in the occupancy estimation algorithm, the a-cuts

A_{a}

have been computed using the definition:

A_{a} = {x \in X, m (x) \geq a}

. A

0.5

-level is considered, meaning that a trust region is given, in the values of the fuzzified parameters with membership

μ \geq 0.5

.

The limit points of the

a - c u t s

are given by:

\begin{array}{l} (21) & {\underset{̲}{\hat{θ}}}_{1} & = a ({\hat{θ}}_{1} - {\tilde{l}}_{{\hat{θ}}_{1}}) + {\tilde{l}}_{{\hat{θ}}_{1}} \\ (22) & {\bar{\hat{θ}}}_{1} & = {\tilde{r}}_{{\hat{θ}}_{1}} - a ({\tilde{r}}_{{\hat{θ}}_{1}} - {\hat{θ}}_{1}) \\ (23) & {\underset{̲}{\hat{θ}}}_{3} & = a ({\hat{θ}}_{3} - {\tilde{l}}_{{\hat{θ}}_{3}}) + {\tilde{l}}_{{\hat{θ}}_{3}} \\ (24) & {\bar{\hat{θ}}}_{3} & = {\tilde{r}}_{{\hat{θ}}_{3}} - a ({\tilde{r}}_{{\hat{θ}}_{3}} - {\hat{θ}}_{3}) \end{array}

where

{\tilde{l}}_{{\hat{θ}}_{3}}, {\tilde{r}}_{{\hat{θ}}_{3}}

are the left and right limits of the support for the parameter

{\hat{c}}_{o c c}^{C O_{2}}

and

{\tilde{l}}_{{\hat{θ}}_{1}}, {\tilde{r}}_{{\hat{θ}}_{1}}

are the left and right limits of the support for the parameter

{\hat{q}}_{i n f}

. It should be clear that the sets

A_{a}

are uncountable and we are focusing on investigating the boundary case, i.e., the model with parameters chosen firstly as

{\underset{̲}{\hat{θ}}}_{1}, {\underset{̲}{\hat{θ}}}_{3}

and secondly

{\bar{\hat{θ}}}_{1}, {\bar{\hat{θ}}}_{3}

.

The implementation of the occupancy estimation algorithm, results in different occupancy vectors

n_{o c c} (t)

, reflecting the fact that when fuzzy parameters are considered, the estimation gives an upper and lower estimation of the number of occupants in the room at each time.

2.6. Fuzzy $Q$ -Learning and Multi Agent Control System

Reinforcement learning refers to a group of algorithms that have been inspired from the learning mechanism of humans or other leaving beings. Reinforcement learning algorithms aim to extract a policy via exploring the possible state-action pairs. The term policy refers to the mapping between states and actions. All reinforcement learning algorithms need to utilise a signal called reward, which defines a measure of how well an agent performs at a given state.

A tabular form of reinforcement learning, is

Q

-learning [39]. In the

Q

-learning framework, a

Q

-function, that tries to estimate the future discounted rewards for actions in given states, is gradually built by the agent. The

Q

-function output for a given state x and an action a is defined as

Q (x, a)

and the values of the

Q

-function are stored in a

Q

-table.

Q

-learning’s main drawback is that it cannot be applied for continuous state-action space. Fuzzy

Q

-learning [40] can be used to overcome this difficulty and it is summarised in the following steps:

Statexobservation;
Output selection, for each fired fuzzy rule, based on the exploration/exploitation algorithm;
Global output’s $α (x)$ and corresponding value’s of $Q (x, α)$ computation by:

$α (x) = \frac{\sum_{i = 1}^{M} α_{i} (x) α_{i}}{\sum_{i = 1}^{M} α_{i} (x)}$

(25)

$Q (x, α) = \frac{\sum_{i = 1}^{M} α_{i} (x) α_{i} q [i, i^{†}]}{\sum_{i = 1}^{M} α_{i} (x)}$

(26)

where M is the number of the fired fuzzy rules, $α_{i} (x)$ is the fired degree for each fuzzy rule, $α_{i}$ is the action that is selected for each fuzzy rule and $q [i, i^{†}]$ is the corresponding $Q$ value for the pair of the fuzzy rule i and the action $i^{†}$ . The action corresponds to the action that the exploration/exploitation algorithm selects.
The global action $α (x)$ is applied and the next state $x^{'}$ is observed;
Reward R computation;
$Q$ -values are updated according to the formula:

$Δ q [i, i^{*}] = η Δ Q \frac{α_{i} (x)}{\sum_{i = 1}^{M} α_{i} (x)}$

(27)

where $Δ Q = R + γ V (x^{'}) - Q (x, α)$ and

$V (x^{'}) = \frac{\sum_{i = 1}^{M} α_{i} (x^{'}) α_{i} q [i, i^{*}]}{\sum_{i = 1}^{M} α_{i} (x^{'})}$

(28)

and $q [i, i^{*}]$ is the corresponding $Q$ value for the pair of the fuzzy rule i and the action $i^{*}$ . γ denotes the discount factor and η the learning rate. The action $i^{*}$ corresponds to the action that has the maximum corresponding $Q$ value for the fuzzy rule i.

For solving complex or physically distributed problems, a multi agent control system can be used.

Q

-learning and consequently fuzzy

Q

-learning can be applied to MACS with the most common approach being independent learners [41], where each agent ignores the existence of other agents and interacts independently with the environment.

2.7. System Modelling and Description

2.7.1. Heating and Cooling System

The space is a building office with a surface of 50 m² and volume 150 m³ (5 m × 10 m × 3 m) and a 3 m² (3 m × 1 m) window which is located in the north side of the office. The heating/cooling system of the building introduce either hot or cold air into the space [42,43]. The heat flow into the space can be computed by the next equation:

\frac{d Q (t)}{d t} = \pm (T_{\frac{h e a t}{c o o l}} - T_{r o o m}) \dot{M} c + H G_{o} n_{o c c} (t) + H G_{E} n_{o c c} (t) + H G_{L} + H G_{S}

(29)

where the left hand side denotes the heat flow rate into the space.

T_{\frac{h e a t}{c o o l}}

denotes the temperature of the hot and the cold air 40 °C or 16 °C, respectively.

T_{r o o m}

is the temperature of the room’s air,

\dot{M}

is the mass air flow of the air from the heating/cooling system (2600 kg/h) and c is the air heat capacity at constant pressure 1005.4 J/kg °C.

H G_{o}

denotes the heating gain from the occupants (115 W/occupant) [44],

n_{o c c} (t)

is the number of occupants and

H G_{E}

is the heating gain from the electrical loads (5.38 W/m

^{2} \times 50

m

^{2} = 269

W) [36],

H G_{L}

is the heating gain from the artificial lighting

(0.5 \times 800 = 400 W)

and

H G_{S}

is the heating gain from the solar irradiance through the window [45] which is computed as:

H G_{S} = A_{w} \times r_{w} \times D \times I_{w} - g_{w} \times A_{w} \times (T_{r o o m} - T_{o u t})

(30)

where

A_{w}

denotes the area of window,

r_{w}

denotes the transparency of the window,

D = 1

denotes the window shadowing factor,

I_{w}

is the diffused solar irradiance that reaches the window,

g_{w}

is the glass conductance (2 W/m² °C) and

T_{o u t}

is the temperature of the outdoor air. The thermal model of the building does not take into consideration the effect of the solar irradiance through the building opaque surfaces. The derivative of the space temperature over time, is expressed as:

\frac{d T_{r o o m}}{d t} = \pm \frac{1}{M_{a} \cdot c} (\frac{d Q (t)}{d t} - \frac{d Q_{l} (t)}{d t})

(31)

where

M_{a}

is the air mass inside the building (183.75 kg) and:

\frac{d Q_{l} (t)}{d t} = \frac{T_{r o o m} - T_{o u t}}{R_{e q}}

(32)

where

R_{e q}

is the equivalent thermal resistance of the building envelope (

1.0306 \times 10^{- 6}

°C/W). The equivalent thermal resistance is computed by the following set of equations [42].

\begin{matrix} R_{e q} = \frac{R_{w α} R_{w i}}{R_{w α} + R_{w i}}, R_{w α} = \frac{T h_{w α}}{k_{w α} A_{w α}}, R_{w i} = \frac{T h_{w i}}{k_{w i} A_{w}} \end{matrix}

(33)

where

A_{w}

is the area of the windows as given in Equation (25),

T h_{w i}

is the thickness of the windows (0.01 m),

k_{w i}

is the thermal conductivity of the windows (0.78 W/m °C),

A_{w α}

is the area of the walls (187 m₂),

T h_{w α}

is the thickness of the walls (0.2 m),

k_{w α}

is the thermal conductivity of the walls (0.038 W/m °C).

The heat loss/gain through the building envelope (walls area, windows area, and type of insulation) is embedded inside the equivalent thermal resistance [42]. The positive or negative sign on Equations (24) and (26), refers to the process of heating or cooling, respectively.

2.7.2. Lighting System

In order to compute the diffused illuminance provided by the sky inside the building the following equation is used. The horizontal illuminance in a point p [46] which is located in the interior of the building is computed as:

\begin{matrix} \begin{matrix} E_{p, d} = \frac{r_{w} L_{s}}{2} \frac{z}{\sqrt{h_{p}^{2} + z^{2}}} (t a n^{- 1} \frac{x_{w} + w_{w} - x_{p}}{\sqrt{h_{p}^{2} + z^{2}}} + t a n^{- 1} \frac{x_{w} + x_{p}}{\sqrt{h_{p}^{2} + z^{2}}}) \\ - \frac{z}{\sqrt{{(h_{p} + h_{w})}^{2} + z^{2}}} (t a n^{- 1} \frac{x_{w} + w_{w} - x_{p}}{\sqrt{{(h_{p} + h_{w})}^{2} + z^{2}}} \\ + t a n^{- 1} \frac{x_{p} + x_{w}}{\sqrt{{(h_{p} + h_{w})}^{2} + z^{2}}}) \end{matrix} \end{matrix}

(34)

where

E_{p, d}

is the horizontal diffused illuminance of the point p,

L_{s}

is the illuminance from the sky, z is the horizontal distance between the point p and the window (10 m close to the south wall of the building which is amongst the most shading points inside the building).

h_{p}

denotes the height between the lower edge of the window and the point p (0.5 m),

h_{w}

is the height of the window (1 m),

w_{w}

is the width of the window (3 m),

x_{w}

is the distance between the left wall and the left edge of the window (3.5 m) and

x_{p}

is the distance between the point p and the left wall (2.5 m).

It is assumed that the horizontal diffused illuminance, provided by the sky, is distributed uniformly and equals to the value of the point p, and consequently the control decisions are taken according to this value. Fluorescence lamps are used for the artificial lighting and the ratio between the horizontal luminance and the consumed power is 100 lm/W [47]. The electro-chromic window [48], can change its transparency by providing a small amount of voltage. The nominal power of the artificial lighting is 400 W which can provide a maximum illuminance of 800

l u x

in a surface of 50 m

^{2}

. The relation between the

l u x

and the consumption is assumed to be linear. Moreover, it is assumed that the horizontal illuminance provided by the lamps is distributed uniformly all over the surface of the office.

2.7.3. Ventilation System

The CO₂ mass balance with no noise consideration and with a given ventilation rate is given by:

\frac{d x^{C O_{2}} (t)}{d t} = \frac{(q_{v e n t} + q_{i n f}) (x_{o u t}^{C O_{2}} - x^{C O_{2}} (t)) + c_{o c c}^{C O_{2}} n_{o c c} (t)}{V}

(35)

where V is the volume of the space (150 × 10

^{3}

L), and

q_{v e n t}

is the ventilation rate, which is now introduced by the MACS. According to [36,44] for a building office, the air change rate must be between 2 and 8. For a space of

150 \times 10^{3}

L the ventilation rate of the ventilation system can be

600 \times 10^{3}

L/h. The installed power for such a system is approximately 165 W by assuming a relationship of 0.5 W per flow rate (L/s). As we will discuss in the simulation section, the generation rate per person has been estimated to

26.9427

L/h, the outdoor carbon dioxide concentration has been estimated to 359.4813 ppm and the infiltration rate has been estimated to 0.1167 air change rate which equals to

17.535

L/h. At this point, it should be noted that mechanical ventilation affects the indoor temperature, however nowadays mechanical ventilation systems incorporate heat recovery that can reach efficiency of almost

90 %

. Based on that fact, the effect of the ventilation flow rate to the indoor temperature is neglected from the building’s thermal model.

2.8. Multi Agent Control System

The MACS consists of a group of three agents:

A = {A G_{T}, A G_{L}, A G_{C D}}

, where

A G_{T}

is the temperature agent,

A G_{L}

is the illuminance agent, and

A G_{G D}

the CO₂ agent. The states of the agents are defined by a total number of six fuzzy state variables

X_{i}

, i.e., two variables for each agent. For each positive values input, five triangular membership functions (MFs) have been used. In addition to that, for each both positive and negative values input, seven triangular MFs have been used (Figure 4). Membership functions are denoted as linguistic variables: PVB, PB, PM, PS, Z, NS, NM, and NB, e.g., positive very big, positive big, positive medium, positive small, zero, negative small, and negative medium, respectively.

A G_{T}

has two variable inputs: the outdoor temperature which is in the range

[0, 1]

, and the error

e_{T}

between the set point and the indoor temperature, normalised in the range

[- 1, 1]

. The total states are 35, which are represented by an equal number of fuzzy rules. The output vector of the temperature agent has five fuzzy singleton sets

A_{T} = {\frac{1}{- 0.05} + \frac{1}{- 0.1} + \frac{1}{0} + \frac{1}{0.01} + \frac{1}{0.05}}

.

The global action is the control signal, that defines the percentage of the air flow rate by the heating/cooling system, according to its maximum capacity. Positive signal actuate the heating system while negative signal actuate the cooling system. The reward

R_{A_{T}}

of this agent is defined as follows:

R_{A_{T}} (x, α, x^{'}) = - | e_{T} | - 0.1 \cdot P_{H C}

(36)

where

P_{H C}

is the power consumption of the heating/cooling system.

A G_{L}

has two variable inputs, the indoor horizontal illuminance, normalised in the range

[0, 1]

and the error

e_{L}

between the set point and the indoor illuminance normalised in the range

[- 1, 1]

. The total states are 35, which are represented by an equal number of rules. The output vector of the

A G_{L}

has five fuzzy singleton sets,

A_{L} = {\frac{1}{- 0.3} + \frac{1}{- 0.02} + \frac{1}{0} + \frac{1}{0.02} + \frac{1}{0.3}}

. The global action defines the percentage of the power change to be consumed by the artificial lighting system according to its nominal operating power for positive signal, while negative signal change the transparency of the electro-chromic window. The reward

R_{A_{L}}

of this agent is defined as follows:

R_{A_{L}} (x, α, x^{'}) = - | e_{L} | - 0.1 \cdot P_{A L}

(37)

where

P_{A L}

is the power consumption of the artificial lighting system.

A G_{C D}

has two variable inputs, the number of the occupants normalised in the range

[0, 1]

and the error

e_{C O}

between the set point and the indoor CO₂ concentration, normalised in the range

[0, 1]

. The total states (Five membership functions for each input. See plot (a) on Figure 4) are 25, which are represented by an equal number of rules. The output vector of the agent has five fuzzy singleton sets,

A_{C D} = {\frac{1}{- 0.4} + \frac{1}{- 0.02} + \frac{1}{0} + \frac{1}{0.02} + \frac{1}{0.04}}

. The global action is the control signal which represents the percentage of the power, change to be consumed by the ventilation system according to its maximum power. The reward

R_{A_{C D}}

of this agent is defined as follows:

R_{A_{C D}} (x, α, x^{'}) = - | e_{c o} | - 0.1 \cdot P_{V}

(38)

where

P_{V}

is the power consumption of the ventilation system.

All three agents use the same exploration/exploitation algorithm. When a new state is visited, the agent performs exploration for 500 rounds. Following the 500 rounds, the agent checks and performs the actions that has not been performed at all. By this way it is assured that all possible actions are explored. Consequently, the agent performs exploitation for

99 %

and exploration for

1 %

for a given state.

Additionally,

A G_{T}

and

A G_{L}

operate only when occupancy is detected inside the building space. For periods, where the occupancy vector is zero, i.e.,

{\hat{n}}_{o c c} (t) = 0

, the agent’s control signal is set to zero and the learning mechanism stops functioning. Thus, when the building is empty a waste of energy is avoided.

3. Results

3.1. Parameters and Occupancy Estimation

The proposed methodology has been implemented to estimate the parameters

θ

, as well as the occupancy vector for a set of given data, that stem from the EM solution of the stochastic differential Equation (5). All mentioned algorithms have been programmed in MATLAB. Figure 5 illustrates ten solution paths of the stochastic differential equation, where the parameters are chosen as:

q_{i n f} = 0.12 h^{- 1}

[36],

X_{o u t}^{C O_{2}} = 400

(ppm) [10],

c_{o c c}^{C O_{2}} = 180

(ppm/h), [14], and

σ = 2

[14]. A profile

n_{o c c} (t)

materialises a simple scenario of presence in a room.

In order to replicate a realistic problem, a single realisation of the solution is considered, and the time series is used to estimate the parameters using the method of evolutionary maximisation of the likelihood function. In the last figure, it is clear that the solution of the stochastic mass balance equation reflects the fact that when occupants are present in the room, e.g., approximately at 13:40 to almost 21:00, the CO₂ increases as expected. When

n_{o c c} (t)

vanishes, then the solution has a mean reverting behaviour (Ornstein–Uhlenbeck Process.), analogous to the infiltration rate.

The estimation algorithm has been implemented using a maximum number of generations

M a x G e n = 100

, a population of

n_{p} = 80

, a crossover probability

p_{c r} = 0.2

and a mutation scaling F sampled uniformly from

[0.2, 2]

. The estimated parameters of the proposed method are given in Table 1.

Figure 6, illustrates the solution of the stochastic model, i.e., the training data, and the model’s solution corresponding to the estimated parameters. The solution of the differential equation using the estimated parameters

\hat{θ}

is in very close vicinity to the true solution.

The occupancy estimation algorithm, has been implemented in the latter case and the estimation

{\hat{n}}_{o c c} (t)

is identical to the ground truth,

n_{o c c} (t)

. The result is illustrated in Figure 7.

The performance of both the parameter estimation method, as well as the occupancy estimation algorithm is excellent. For the implementation of the occupancy algorithm, 1440 points of CO₂ concentration are used, after having generated them from the numerical solution of the mass balance equation; using the method of Euler–Maruyama. Furthermore, the estimated parameter

\hat{θ}

has been used, as described in the section of occupancy estimation algorithm. The number of iterations for the algorithm has been set to 1000.

To study the result of parametric uncertainty on the occupancy estimation algorithm, the estimated infiltration rate and the estimated CO₂ generation per occupant, are modelled using fuzzy numbers. The parameters of the fuzzy case are computed using the a-cut method, as previously described, and are illustrated in Figure 3. The stochastic differential equation solutions using the estimated parameters, as well as the fuzzy estimated parameters (Table 2) are illustrated in Figure 8. It is clear that a parametric uncertainty bound is introduced in the model, since fuzzy parameters are considered.

The occupancy estimation algorithm is implemented in the case of both

\underset{̲}{\hat{θ}}

and

\bar{\hat{θ}}

parameters. The results are depicted in Figure 9. The estimation result produced by the algorithm differs from the real number of occupants, that generate the CO₂ data. By considering the infiltration and the carbon dioxide generation by the occupants as fuzzy, a further uncertainty in the model other than the noise, is induced. The occupancy vector in the case of

\underset{̲}{\hat{θ}}

is, in some time points, overestimated. This reflects the fact that since CO₂ generation is much lower than the estimated value, then the algorithm, based on the maximum likelihood principle, tries to determine what

n_{o c c} (t)

should be in order for the training data to be generated. Therefore, it computes an estimation vector

{\hat{n}}_{o c c} (t)

, so that the solution, if considered parameters

\underset{̲}{\hat{θ}}

, would match the true data. In the case of

\bar{\hat{θ}}

the estimated occupancy vector is, in some time points, underestimated reflecting the fact that in some time points the number of occupants should be smaller than 4, in order for the true data to be produced.

For the MACS development, the proposed methodology is used, for the estimation of an occupancy profile that resembles a working office: an increasing number of occupants are present in the room’s interior until late afternoon. Following to that, a single occupant is present for two hours, until midnight. The occupancy profile and the associated CO₂ concentration, are illustrated in Figure 10.

The true parameters used in the simulation, as well as their estimation, using differential evolution, are given in Table 3.

The algorithm of occupancy estimation, performs very accurately and the true occupant vector

n_{o c c} (t)

is identical to the estimated,

{\hat{n}}_{o c c} (t)

. The results are given in Figure 11.

Similarly to the previous case, the parametric uncertainty on the parameters is addressed, by assuming that the infiltration and the CO₂ generation per occupant are modelled by triangular fuzzy numbers, with core given by

{\hat{θ}}_{1}

for

q_{i n f}

and

{\hat{θ}}_{3}

for

c_{o c c}^{C O_{2}}

. Applying the a-cut methodology the left and right parameters were computed to be (Table 4).

The application of the occupancy algorithm in the case of fuzzy parameters is shown in Figure 12. The analysis follows the same philosophy as previously described. It is quite interesting, that fuzzy parameters increase the uncertainty in the model, and the occupancy estimation algorithm results in overestimated or underestimated vectors. However, it should be clearly stated that this is not a weak point of the algorithm, but a direct result from the maximum likelihood nature of it.

3.2. Multi Agent Control Framework

As far as the MACS is concerned, the simulation time has been set to one year with a simulation step time of 0.001 h. The data, regarding the ambient temperature, the horizontal diffused illuminance, and the solar irradiance are acquired from the database of the Institute for Environmental Research and Sustainable Development (IERSD), National Observatory of Athens, and they are associated with the year 2019, for Athens, Greece with a sample time of 1 h. For the outdoor CO₂ concentration, the constant value of 359.4813 ppm that has been estimated, is used. The set point for the heating/cooling system has been set to 23 °C, when the outdoor temperature falls under 21 °C, and 26 °C when the outdoor temperature exceeds 29 °C. The set point for the indoor illuminance has been set to 700

l u x

and the set point for the CO₂ concentration has been set to 380 ppm.

All the sub-models lighting, thermal, CO₂ concentration models and the algorithms of the MACS have been implemented in MATLAB/Simulink, each sub-model and each agent is an embedded MATLAB function with multiple inputs and outputs. The relation between inputs and outputs are described by the corresponding algorithms and equations. For example, the CO₂ concentration sub-model is a two input (number of occupants and ventilation rate) and one output system (CO₂ concentration).

The following figures illustrate the outdoor temperature, the indoor temperature, the control signal of the heating/cooling system and the occupancy of the building. After the extensive exploration, the indoor temperature remains almost constant at 23 °C for the winter and at 26 °C for the summer, during the hours when occupants are present inside the building. Figure 13 and Figure 14 are associated with one random day of winter and one random day of summer, respectively. When the occupancy level of the building is zero, the control signal of the agent is zero and consequently the indoor temperature is lower from the set point for the winter and above the set point for the summer. When occupants are entering the building the agent starts to operate again and the indoor temperature is reaching the set point with small oscillations.

A similar philosophy follows

A G_{L}

. At first, the corresponding agent visits new states and mainly explores the state-action space. Following the exploration, the indoor illuminance remains almost constant at 700

l u x

, at the hours of existing occupancy. Figure 15 depicts the outdoor horizontal diffused illuminance, the total indoor illuminance, the control signal, the transparency of the electro-chromic window, the provided illuminance by the artificial lighting system and the number of people inside the building, for a random day. When the occupancy of the building is zero, the control signal of the agent is zero, and consequently the indoor illuminance is proportional to the outdoor horizontal illuminance. When occupants enter the building, the agent starts to operate again and the indoor horizontal illuminance is reaching the set point with small oscillations, by either changing the transparency of the window or by changing the illuminance of the artificial lighting.

Figure 16 illustrate the occupancy of the building, the control signal of the agent and the indoor CO₂ concentration, for one random day. The agent explores in the beginning and after the exploration, the indoor concentration of the CO₂ remains lower than the value of 800 ppm. The ventilation system operates when there are people inside the building, and either reduces or stop its operation, when the occupancy level vanishes, and consequently the concentration of the CO₂ is reduced.

The overall energy consumption of the three systems: heating/cooling (H/C) system, artificial lighting (A/L) system and ventilation (V) system for one year period is computed as 1663 kWh in relation to the much larger value 3770 kWh, which is the overall consumption of these systems for the same year without taking into account the occupancy of the building: the heating/cooling system, as well as the lighting system, operate only for hours where

{\hat{n}}_{o c c} \neq 0

. This concludes an overall reduction of

55.9 %

.

The consumptions of each system, with and without regarding the occupancy of the building, are presented in Table 5 and graphically illustrated in Figure 17.

The overall comfort index

C_{i}

arises from three indexes; the thermal comfort index

T_{i}

, the visual comfort index

L_{i}

, and the indoor air quality index

A_{i}

. The trapezoidal membership functions, that calculate the respecting indexes according to the indoor temperature, the indoor horizontal illuminance and the CO₂ concentration, are depicted in Figure 18.

The overall comfort index is computed by the following equation:

\begin{matrix} C_{i} (x) = \{\begin{matrix} 0.4 T_{i} + 0.4 L_{i} + 0.2 A_{i}, & {\hat{n}}_{o c c} (t) > 0 \\ 0, & otherwise \end{matrix} \end{matrix}

(39)

Figure 19 illustrates

C_{i}

and the number of occupants for a random day.

C_{i}

is zero when no occupants are inside the building and equals, for most of the time, to one when there are occupants inside the building.

The MACS performance, pertaining to the case of fuzzy parameters, is presented in the following figures. Figure 20 and Figure 21, illustrate the outdoor temperature, the indoor temperature, the control signal, and the occupancy level estimated when using

\underset{̲}{\hat{θ}}

and

\bar{\hat{θ}}

, respectively.

Figure 22 depicts the MACS results, associated with the occupancy estimation, pertaining to

\underset{̲}{\hat{θ}}

. The outdoor illuminance, the total indoor illuminance, the control signal, the transparency of the window, the artificial lighting and the occupancy level, are illustrated. The results when using the

\bar{\hat{θ}}

-based estimated occupancy vector, are given in Figure 23.

Figure 24 and Figure 25, illustrate the occupancy of the building, the control signal of the agent and the indoor CO₂ concentration, for one random day for the different occupancy vectors.

The total consumption, as well as the consumptions of the heating/cooling system, the artificial lighting system and the ventilation system for two different occupancy profiles, are presented in Table 6. In the first profile, the overall consumption is 1533 kWh, meaning a reduction of 130 kWh (Comparison with the first column of Table 5). This reduction is a result of the ventilation system. The agent of the ventilation system learns a policy that reduces the consumption of the actuator but, as it is illustrated in Figure 24, the indoor air of the building, is of lower quality, as the CO₂ concentration reaches values up to 900 ppm. Small reduction is also observed to the heating/cooling system (Figure 20), but it is justified by the increased number of the occupants: More occupants are heating the thermal space, due to the occupant’s heating gains.

In the second profile, the overall consumptions is 1528 kWh, meaning a reduction of 135 kWh (Comparison with the first column of Table 5). In this case, the reduction is a result from the heating/cooling system (Figure 21). The agent of this system learns a better policy that reduces the consumption of the system, without particularly reducing the thermal comfort. A small reduction of 2 kWh is observed to the ventilation system (Figure 25), which is justified by the lower number of the occupants. The consumption of the lighting system, as it is illustrated in Figure 22 and Figure 23, remains the same, since the lighting system is not affected by the occupant number.

4. Conclusions

A MACS with a parameter and occupancy estimation algorithm has been studied, with respect to the control performance and maintenance of the building’s comfort levels. A stochastic process models the CO₂ generation and decay, by including the unmodelled dynamics. Given CO₂ concentration data, the parameters of the model, as well as the occupancy level, are estimated. Maximum likelihood methodology follows the estimation. In order to induce further uncertainty in the assumed model, the infiltration and carbon dioxide concentration are treated using fuzzy numbers, and the effect of the occupancy estimation algorithm in the new model is studied. By using two boundary cases of the

α

-cut method, for the fuzzy parameters, two occupancy estimations are produced. These occupancy estimated profiles can be interpreted to describe an uncertainty bound on the occupancy level which stems from the parametric uncertainty on the infiltration rate and the carbon dioxide generation rate per occupant.

The simulation results highlight the successful application of the independent learners approach of the MACS. This control framework succeeds stability over the desired set points for all the control parameters, in a fully decentralised framework. Each agent acts independently, thus a possible failure in one agent will not lead to a failure of the total control system. The enhanced approach, which takes into account the occupancy of the building, leads to even better performance because not only the set points of the parameters are achieved, but also an energy reduction of about

55.9 %

is observed. This reduction is a result of the integration of the estimated occupancy vector, which dictates which time periods the thermal and lighting agents will function. Therefore, a waste of energy is avoided for the time periods where there are no present occupants in the building. This approach is also tested for different occupancy profiles arising by the fuzzy parameters with similar successful results.

In conventional building control techniques the control algorithm receives immediate feedback and generates a control action. In the proposed MACS for building control, the control algorithm receives a delayed feedback, which results each agent to incorporate previous knowledge. Thus, an agent applies the optimum policy depending on the long term reward.

The over/under estimated occupancy level, when considering fuzzy parameters, indicate that further research should focus on the field of parametric uncertainty and on fuzzy stochastic differential equations. The incorporation of uncertainty bounds on occupancy estimation, could be included in the MACS control design to further reduce the energy consumption. Future research directions will also focus on the occupancy prediction problem, as well as the development of evolutionary based algorithms for occupancy estimation based on CO₂ concentration data.

A quiet interesting topic that we surely plan to investigate involves more complex approaches of cooperative MAS, which combine the indoor environmental quality parameters which, in combination with artificial general intelligence (Deep AI), may lead to further improvement of the proposed system. Furthermore, the impact of the ventilation system to the indoor temperature will be introduced into the thermal model of the building for providing more accurate results. Finally, the effect of ventilation systems, with and without heat recovery, on thermal comfort and energy consumption, will be studied and compared.

Author Contributions

Conceptualisation, P.K. (Panagiotis Korkidis), P.K. (Panagiotis Kofinas) and A.D.; methodology, P.K. (Panagiotis Korkidis), P.K. (Panagiotis Kofinas) and A.D.; software, P.K. (Panagiotis Korkidis) and P.K. (Panagiotis Kofinas); validation, P.K. (Panagiotis Korkidis), P.K. (Panagiotis Kofinas) and A.D.; formal analysis, P.K. (Panagiotis Korkidis), P.K. (Panagiotis Kofinas) and A.D.; investigation, P.K. (Panagiotis Korkidis), P.K. (Panagiotis Kofinas) and A.D.; resources, A.D.; writing—original draft preparation, P.K. (Panagiotis Korkidis) and P.K. (Panagiotis Kofinas); writing—review and editing, P.K. (Panagiotis Korkidis), P.K. (Panagiotis Kofinas) and A.D.; visualisation, P.K. (Panagiotis Korkidis) and P.K. (Panagiotis Kofinas); supervision, A.D.; project administration, A.D.; funding acquisition, A.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research is co-financed by Greece and the European Union (European Social Fund—ESF) through the Operational Programme Human Resources Development, Education and Lifelong Learning 2014–2020 in the context of the project “Intelligent Control Techniques for Comfort and Prediction of Occupancy in Buildings—Impacts on Energy Efficiency” (MIS 5050544).

Acknowledgments

The authors wish to express their sincere gratitude, for the availability of the temperature and irradiance data, to the Institute for Environmental Research and Sustainable Development (IERSD), National Observatory of Athens.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

$α (x)$	Global action
$α_{i} (x)$	Fired degree of fuzzy rules
$\bar{\hat{θ}}$	Fuzzy right estimated parameters
$\hat{θ}$	Estimated parameters
$θ$	True parameters
$\underset{̲}{\hat{θ}}$	Fuzzy left estimated parameters
$\dot{M}$	Mass air flow of the heating/cooling system
$η$	Learning rate
$γ$	Discount factor
${\hat{n}}_{o c c} (t)$	Estimated number of occupants as a function of time
$P^{g}$	Population of individuals at generation g
$Q (x, α)$	$Q$ values of state x and action $α$
$σ$	Volatility of the noise
$a_{i}$	Consequent of fuzzy rule
$A_{w}$	Area of window
$B_{t}$	Brownian Motion at time t
c	Air heat capacity
$c_{o c c}^{C O_{2}}$	CO₂ generation per occupant
D	Window shadowing factor
$e_{c o}$	CO₂ Concentration error
$e_{L}$	Illuminance error
$E_{p, d}$	Horizontal diffused illuminance of the point p
$e_{T}$	Temperature error
F	Mutation scaling
$G_{C O_{2}}$	Term associated with CO₂ generation
$g_{w}$	Glass conductance
$h_{p}$	Height between p and the lower edge of the window
$h_{w}$	Height of the window
$H G_{E}$	Electrical loads heating gain
$H G_{L}$	Artificial lighting heating gain
$H G_{o}$	Occupants heating gain
$H G_{S}$	Solar irradiance heating gain
$I_{w}$	Diffused solar irradiance
$L (θ)$	Negative Log Likelihood function
$L_{s}$	Illuminance from the sky
$M_{α}$	Air mass inside the building
$n_{o c c} (t)$	Number of occupants as a function of time
$P_{A L}$	Artificial lighting system power consumption
$p_{c r}$	Crossover probability
$P_{H C}$	Heating/Cooling system power consumption
$P_{V}$	Ventilation system power consumption
q	Term associated with air exchange
$q_{i n f}$	Infiltration rate
$q_{v e n t}$	Ventilation rate
R	Reward
$R_{e q}$	Equivalent thermal resistance of the building envelope
$r_{w}$	Transparency of window
$T_{o u t}$	Temperature of the outdoor air
$T_{r o o m}$	Temperature of the room
V	Volume of the room
$w_{w}$	Width of the window
$X^{C O_{2}} (t)$	Random variable associated with CO₂ concentration at time t
$x^{C O_{2}} (t)$	Solution of the deterministic mass balance equation at time t
$x_{p}$	Distance between p and left wall
$x_{w}$	Distance between left wall and left edge of the window
z	Horizontal distance between p and window

Appendix A

The conversion of pollutant concentration (in m³) in

p p m

, is provided for the readers.

x_{p p m} = \frac{10^{6} \times C}{S}

(A1)

where

x_{p p m}

denotes a value in

p p m

, C denotes the pollutant component in m³ and S denotes the solvent in m³.

References

Jung, W.; Jazizadeh, F. Human-in-the-loop HVAC operations: A quantitative review on occupancy, comfort, and energy-efficiency dimensions. Appl. Energy 2019, 239, 1471–1508. [Google Scholar] [CrossRef]
Dong, J.; Winstead, C.; Nutaro, J.; Kuruganti, T. Occupancy-Based HVAC Control with Short-Term Occupancy Prediction Algorithms for Energy-Efficient Buildings. Energies 2018, 11, 2427. [Google Scholar] [CrossRef] [Green Version]
Bielskus, J.; Motuziene, V.; Vilutiene, T.; Indriulionis, A. Occupancy Prediction Using Differential Equation Online Sequential Extreme Learning Machine Model. Energies 2020, 13, 4033. [Google Scholar] [CrossRef]
Wang, D.; Federspiel, C.V.; Rubinstein, F. Modelling occupancy in single persons offices. Energy Build. 2005, 37, 121–126. [Google Scholar] [CrossRef]
Feng, X.; Yan, D.; Hong, T. Simulation of occupancy in buildings. Energy Build. 2015, 87, 348–359. [Google Scholar] [CrossRef] [Green Version]
Assimakopoulos, M.N.; Barmbaresos, N.; Pantazaras, A.; Karlessi, T.; Lee, S.E. On the comparison of occupancy in relation to energy consumption and indoor environmental quality: A case study. In Proceedings of the 9th International Conference on Sustainability in Energy and Buildings, Chania, Greece, 5–7 July 2017. [Google Scholar]
Wang, C.; Yan, D.; Jiang, Y. A novel approach for building occupancy simulation. Build. Simul. 2011, 4, 149–167. [Google Scholar] [CrossRef]
Wilke, U.; Haldi, F.; Scartezzini, J.L.; Robinson, D. A bottom-up stochastic model to predict building occupants’ time-dependent activities. Build. Environ. 2013, 60, 254–264. [Google Scholar] [CrossRef]
Jiang, C.; Chen, Z.; Su, R.; Masood, M.K.; Soh, Y.S. Bayesian filtering for occupancy estimation from carbon dioxide concentration. Energy Build. 2020, 206, 2–10. [Google Scholar] [CrossRef]
Pantazaras, A.; Lee, S.E.; Santamouris, M.; Yang, J. Predicting the CO₂ levels in buildings using deterministic and identified models. Energy Build. 2016, 127, 774–785. [Google Scholar] [CrossRef]
Oldewurtel, F.; Sturzenegger, D.; Morari, M. Importance of occupancy information for building climate control. Appl. Energy 2013, 101, 521–532. [Google Scholar] [CrossRef]
Ansanay, A.G. Estimating occupancy using indoor carbon dioxide concentrations only in an office building: A method and qualitative assessment. In Proceedings of the REHVA World Congress on Energy Efficient, Smart and Healthy Buildings (CLIMA), Prague, Czech Republic, 16–19 June 2013; pp. 1–8. [Google Scholar]
Cali, D.; Matthes, P.; Huchtemann, K.; Streblow, R.; Muller, D. CO₂ based occupancy detection algorithm: Experimental analysis and validation for office and residential buildings. Build. Environ. 2015, 86, 39–49. [Google Scholar] [CrossRef]
Wolf, S.; Cali, D.; Krogstie, J.; Madsen, H. Carbon dioxide-based occupancy estimation using stochastic differential equations. Appl. Energy 2019, 236, 32–41. [Google Scholar] [CrossRef]
Chen, Z.; Jiang, C.; Xie, L. Building occupancy estimation and detection: A review. Energy Build. 2018, 169, 260–270. [Google Scholar] [CrossRef]
Krisensen, N.R.; Madsen, H.; Jorgensen, S.B. Identification of continuous time models using discrete time data. In Proceedings of the 13th IFAC Symposium on System Identification (SYSID-2003), Rotterdam, The Netherlands, 27–29 August 2003. [Google Scholar]
Ebadat, A.; Bottegal, G.; Molinari, M.; Varagnolo, D. Multi-room occupancy estimation through adaptive gray-box models. In Proceedings of the IEEE 54th Annual Conference on Decision and Control (CDC), Osaka, Japan, 15–18 December 2015. [Google Scholar]
Pennings, L.W.H.A.; Boxem, G.; Van Houten, M.A.; Zelier, W. Multi Agent System to Optimize Comfort and Energy flows in the Built Environment. In Proceedings of the Tenth International Conference for Enhanced Building Operations, Eindhoven, The Netherlands, 26–28 October 2010. [Google Scholar]
Jazizadeh, F.; Ghahramani, A.; Becerik-Gerber, B.; Kichkaylo, T.; Orosz, M. User-led decentralized thermal comfort driven HVAC operations for improved efficiency in office buildings. Energy Build. 2014, 70, 398–410. [Google Scholar] [CrossRef]
Pazhoohesh, M.; Zhang, C. A satisfaction-range approach for achieving thermal comfort level in a shared office. Build. Environ. 2018, 142, 312–326. [Google Scholar] [CrossRef]
Zupancic, D.; Lusterk, M.; Gams, M. Multi-Agent Architecture for Control of Heating and Cooling in a Residential Space. Comput. J. 2014, 58, 1314–1329. [Google Scholar]
Wang, Z.; Wang, L.; Dounis, A.I.; Yang, R. Multi-agent control system with information fusion based comfort model for smart buildings. Appl. Energy 2012, 99, 247–254. [Google Scholar] [CrossRef]
Wang, Z.; Yang, R.; Wang, L. Multi-agent control system with intelligent optimization for smart and energy-efficient buildings. In Proceedings of the 36th Annual Conference on IEEE Industrial Electronics Society, Glendale, AZ, USA, 7–10 November 2010. [Google Scholar]
Gao, G.; Li, J.; Wen, Y. Energy-Efficient Thermal Comfort Control in Smart Buildings via Deep Reinforcement Learning. arXiv 2019, arXiv:1901.04693v1. [Google Scholar]
Wang, Z.; Wang, L. Intelligent Control of Ventilation System for Energy-Efficient Buildings With CO₂ Predictive Model. IEEE Trans. Smart Grid 2013, 4, 686–693. [Google Scholar] [CrossRef]
Kusiak, A.; Li, M. Optimal decision making in ventilation control. Energy 2009, 34, 1835–1845. [Google Scholar] [CrossRef]
Pavliotis, G.A. Stochastic Processes and Applications; Springer: New York, NY, USA, 2014. [Google Scholar]
Karatzas, I.; Shreve, S. Brownian Motion and Stochastic Calculus; Springer: New York, NY, USA, 1998. [Google Scholar]
Shreve, S. Stochastic Calculus for Finance II: Continuous-Time Models; Springer: New York, NY, USA, 2004. [Google Scholar]
Brigo, D.; Mercurio, F. Interest Rate Models—Theory and Practice; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
Sarkka, S.; Solin, A. Applied Stochastic Differential Equations; Cambridge University Press: Cambridge, UK, 2019. [Google Scholar]
Storn, R.; Price, K. Differential Evolution—A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
Dubois, D.; Prade, H. Fuzzy Sets and Systems: Theory and Applications; Academic Press: Cambridge, UK, 1980. [Google Scholar]
Sprungk, B.; van den Boogart, K.G. Stochastic differential equations with fuzzy drift and diffusion. Fuzzy Sets Syst. 2013, 230, 53–64. [Google Scholar] [CrossRef]
Schmelzer, B. On solutions of stochastic differential equations with parameters modeled by random sets. Int. J. Approx. Reason. 2010, 51, 1159–1171. [Google Scholar] [CrossRef] [Green Version]
Ashrae, ANSI/ASHRAE Standard 62.1-2019, Ventilation for Acceptable Indoor Air Quality. Available online: https://www.ashrae.org/technical-resources/bookstore/standards-62-1-62-2 (accessed on 20 March 2021).
Rodero, A.; Krawczyk, D.A. Carbon Dioxide Human Gains—A New Approach of the Estimation. Sustainability 2019, 11, 7128. [Google Scholar] [CrossRef] [Green Version]
Yalc, N.; Balta, D.; Ozmen, A. A modeling and simulation study about CO₂ amount with web-based indoor air quality monitoring. Turk. J. Elec. Eng. Comp. Sci. 2018, 26, 1390–1402. [Google Scholar]
Watkins, C. Learning From Delayed Rewards. Ph.D. Thesis, University of Cambridge, Cambridge, UK, 1989. [Google Scholar]
Glorennec, Y.; Jouffe, L. Fuzzy Q-Learning. In Proceedings of the 6th International Fuzzy Systems Conference, Barcelona, Spain, 1–5 July 1997; pp. 659–662. [Google Scholar]
Claus, C.; Boutilier, C. The dynamics of reinforcement learning in cooperative multi-agent systems. In Proceedings of the National Conference on Artificial Intelligence (AAAI), Monoma Terrace, WI, USA, 26–30 July 1998. [Google Scholar]
MathWorks, Thermal Model of a House, MATLAB Documentation. Available online: https://www.mathworks.com/help/simulink/slref/thermal-model-of-a-house.html (accessed on 21 August 2020).
Dejvisesa, J.; Tanthanuchb, N. A Simplified Air-conditioning Systems Model with Energy Management. In Proceedings of the International Electrical Engineering Congress, iEECON2016, Chiang Mai, Thailand, 2–4 March 2016. [Google Scholar]
Schell, M.B.; Turner, S.C.; Shim, R.O. Application of -based demand-controlled ventilation using ASHRAE standard 62: Optimizing energy use and ventilation. ASHRAE Trans 1998, 104, 1213–1225. [Google Scholar]
Dounis, A.I.; Manolakis, D.E. Design of a fuzzy system for living space thermal-comfort regulation. Appl. Energy 2001, 69, 119–144. [Google Scholar] [CrossRef]
Kim, C.H.; Kim, K.-S. Development of Sky Luminance and Daylight Illuminance Prediction Methods for Lighting Energy Saving in Office Buildings. Energies 2019, 12, 592. [Google Scholar] [CrossRef] [Green Version]
Avella, J.M.; Souza, T.; Silveira, J.L. A Comparative Analysis between Fluorescent and LED Illumination for Improve Energy Efficiency at IPBEN Building. In Proceedings of the The XI Latin-American Congress Electricity Generation and Transmission CLAGTEE, Sao Paolo, Brazil, 8–11 November 2015. [Google Scholar]
Dounis, A.; Leftheriotis, G.; Stavrinidis, S.; Syrrokostas, G. Electrochromic device modeling using an adaptive neuro-fuzzy inference system: A model-free approach. Energy Build. 2016, 110, 182–194. [Google Scholar] [CrossRef]

Figure 1. Proposed methodology in a graphical representation.

Figure 2. (a) The stochastic nature of the CO₂ model. Mean solution is illustrated in red. Grey band depicts the

95 %

quantiles. (b) The occupancy vector.

Figure 2. (a) The stochastic nature of the CO₂ model. Mean solution is illustrated in red. Grey band depicts the

95 %

quantiles. (b) The occupancy vector.

Figure 3. Fuzzy numbers representation of the parameters (a)

{\hat{θ}}_{1} = {\hat{q}}_{i n f}

and (b)

{\hat{θ}}_{3} = {\hat{c}}_{o c c}^{C O_{2}}

and their corresponding

0.5

-cuts.

Figure 3. Fuzzy numbers representation of the parameters (a)

{\hat{θ}}_{1} = {\hat{q}}_{i n f}

and (b)

{\hat{θ}}_{3} = {\hat{c}}_{o c c}^{C O_{2}}

and their corresponding

0.5

-cuts.

Figure 4. Membership functions of the agents’ inputs. (a) MFs for positive valued inputs. (b) MFs for positive and negative valued inputs.

Figure 5. Ten realisations of the stochastic mass balance differential equation solution. The CO₂ concentration is depicted in grey and the profile

n_{o c c} (t)

in blue.

Figure 5. Ten realisations of the stochastic mass balance differential equation solution. The CO₂ concentration is depicted in grey and the profile

n_{o c c} (t)

in blue.

Figure 6. Solution of the True model and the Estimated model, i.e., the solution of the differential equation using the estimated parameters.

Figure 7. Estimation of the number of occupants in the room as a function of time t:

{\hat{n}}_{o c c} (t)

.

Figure 7. Estimation of the number of occupants in the room as a function of time t:

{\hat{n}}_{o c c} (t)

.

Figure 8. Solutions of the stochastic differential equation when using

\underset{̲}{\hat{θ}}

and

\bar{\hat{θ}}

. The uncertainty bound induced by the fuzzification of the parameters is the band between the red and blue solution.

Figure 8. Solutions of the stochastic differential equation when using

\underset{̲}{\hat{θ}}

and

\bar{\hat{θ}}

. The uncertainty bound induced by the fuzzification of the parameters is the band between the red and blue solution.

Figure 9. (a) Occupancy estimation when fuzzy parameters

\underset{̲}{\hat{θ}}

are considered. (b) Occupancy estimation when fuzzy parameters

\bar{\hat{θ}}

are considered.

Figure 9. (a) Occupancy estimation when fuzzy parameters

\underset{̲}{\hat{θ}}

are considered. (b) Occupancy estimation when fuzzy parameters

\bar{\hat{θ}}

are considered.

Figure 10. Solution of the true model and the estimated model, i.e., the solution of the differential equation using the estimated parameters.

Figure 11. Estimation of the number of occupants in the room as a function of time t:

{\hat{n}}_{o c c} (t)

.

Figure 11. Estimation of the number of occupants in the room as a function of time t:

{\hat{n}}_{o c c} (t)

.

Figure 12. (a) Occupancy estimation when fuzzy parameters

\underset{̲}{\hat{θ}}

are considered. (b) Occupancy estimation when fuzzy parameters

\bar{\hat{θ}}

are considered.

Figure 12. (a) Occupancy estimation when fuzzy parameters

\underset{̲}{\hat{θ}}

are considered. (b) Occupancy estimation when fuzzy parameters

\bar{\hat{θ}}

are considered.

Figure 13. (a) Outdoor temperature. (b) Indoor temperature. (c) Control signal and (d) Occupancy level. The plots are associated with a random day of winter.

Figure 14. (a) Outdoor temperature. (b) Indoor temperature. (c) Control signal and (d) Occupancy level. The plots are associated with a random day of summer.

Figure 15. (a) Outdoor horizontal diffused illuminance. (b) Total indoor illuminance. (c) Control signal. (d) Transparency of the electro-chromic window. (e) Provided illuminance by the artificial lighting system and (f) Occupancy level. The plots are associated with a random day.

Figure 16. (a) Number of occupants. (b) Control signal. (c) CO₂ concentration. The plots are associated with a period of one day.

Figure 17. Energy consumption for each subsystems and their sum.

Figure 18. (Top right) Membership functions of the thermal comfort index

T_{i}

. (Bottom left) Membership function of the visual comfort index

L_{i}

. (Bottom right) Membership function of the indoor air quality index

A_{i}

.

Figure 18. (Top right) Membership functions of the thermal comfort index

T_{i}

. (Bottom left) Membership function of the visual comfort index

L_{i}

. (Bottom right) Membership function of the indoor air quality index

A_{i}

.

Figure 19. (a) Comfort index and (b) Number of occupant for a random day.

Figure 20. (a) Outdoor temperature. (b) Indoor temperature. (c) Control signal and (d) Occupancy level.

Figure 21. (a) Outdoor temperature. (b) Indoor temperature. (c) Control signal and (d) Occupancy level.

Figure 22. (a) Outdoor horizontal diffused illuminance. (b) Total indoor illuminance. (c) Control signal. (d) Transparency of the electro-chromic window. (e) Provided illuminance by the artificial lighting system and (f) Occupancy level.

Figure 23. (a) Outdoor horizontal diffused illuminance. (b) Total indoor illuminance. (c) Control signal. (d) Transparency of the electro-chromic window. (e) Provided illuminance by the artificial lighting system and (f) Occupancy level.

Figure 24. (a) Number of occupants. (b) Control signal. (c) CO₂ concentration.

Figure 25. (a) Number of occupants. (b) Control signal. (c) CO₂ concentration.

Table 1. True and Estimated model parameters.

Parameters	True Values $θ$	Estimated Values $\hat{θ}$
Infiltration rate	0.12 h $^{- 1}$	0.1166 h $^{- 1}$
Outdoor CO₂ concentration	400 ppm	391.4339 ppm
CO₂ generation per occupant	180 ppm/h	179.1345 ppm/h
$σ$	2	3.8800

Table 2. Estimated and Fuzzy parameters.

Estimated Parameters $\hat{θ}$	Fuzzy Parameters $\underset{̲}{\hat{θ}}$	Fuzzy Parameters $\bar{\hat{θ}}$
${\hat{θ}}_{1} = 0.1166$ h $^{- 1}$	$0.1107$ h $^{- 1}$	$0.1224$ h $^{- 1}$
${\hat{θ}}_{2} = 391.4339$ ppm	$391.4339$ ppm	$391.4339$ ppm
${\hat{θ}}_{3} = 179.1345$ ppm/h	$152.2644$ ppm/h	$206.0047$ ppm/h
${\hat{θ}}_{4} = 3.8800$	$3.8800$	$3.8800$

Table 3. True and Estimated Model parameters.

Parameters	True Values $θ$	Estimated Values $\hat{θ}$
Infiltration rate	0.12 h $^{- 1}$	0.1167 h $^{- 1}$
Outdoor CO₂ concentration	420 ppm	359.4813 ppm
CO₂ generation per occupant	180 ppm/h	179.6181 ppm/h
$σ$	2	2.8917

Table 4. Estimated and Fuzzy parameters.

Estimated Parameters $\hat{θ}$	Fuzzy Parameters $\underset{̲}{\hat{θ}}$	Fuzzy Parameters $\bar{\hat{θ}}$
${\hat{θ}}_{1} = 0.1167$ h $^{- 1}$	$0.1108$ h $^{- 1}$	$0.1225$ h $^{- 1}$
${\hat{θ}}_{2} = 359.4813$ ppm	$359.4813$ ppm	$359.4813$ ppm
${\hat{θ}}_{3} = 179.6181$ ppm/h	$152.6754$ ppm/h	$206.5608$ ppm/h
${\hat{θ}}_{4} = 2.8917$	$2.8917$	$2.8917$

Table 5. Energy consumption of the three sub systems in numbers.

	MACS Taking into Account Occupancy	MACS without Taking into Account Occupancy
Heating / Cooling System	747 kWh	1478 kWh
Artificial Lighting System	414 kWh	1780 kWh
Ventilation System	502 kWh	502 kWh
Total	1663 kWh	3770 kWh

Table 6. Energy consumption of the three sub systems in numbers, for the MACS using the occupancy profiles estimated using the fuzzy parameter.

	MACS Taking into Account Occupancy, Estimated Using $\underset{̲}{\hat{θ}}$	MACS Taking into Account Occupancy, Estimated Using $\bar{\hat{θ}}$
Heating / Cooling System	740 kWh	614 kWh
Artificial Lighting System	414 kWh	414 kWh
Ventilation System	379 kWh	500 kWh
Total	1533 kWh	1528 kWh

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Korkidis, P.; Dounis, A.; Kofinas, P. Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings. Energies 2021, 14, 4971. https://doi.org/10.3390/en14164971

AMA Style

Korkidis P, Dounis A, Kofinas P. Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings. Energies. 2021; 14(16):4971. https://doi.org/10.3390/en14164971

Chicago/Turabian Style

Korkidis, Panagiotis, Anastasios Dounis, and Panagiotis Kofinas. 2021. "Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings" Energies 14, no. 16: 4971. https://doi.org/10.3390/en14164971

APA Style

Korkidis, P., Dounis, A., & Kofinas, P. (2021). Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings. Energies, 14(16), 4971. https://doi.org/10.3390/en14164971

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings

Abstract

1. Introduction

2. Materials and Methods

2.1. Problem Statement

2.2. Stochastic Processes and CO₂

2.3. The Hull–White Model as Mass Balance Equation

2.4. Estimation of the Stochastic Process Parameters

2.5. Fuzzy Occupancy Estimation

2.6. Fuzzy $Q$ -Learning and Multi Agent Control System

2.7. System Modelling and Description

2.7.1. Heating and Cooling System

2.7.2. Lighting System

2.7.3. Ventilation System

2.8. Multi Agent Control System

3. Results

3.1. Parameters and Occupancy Estimation

3.2. Multi Agent Control Framework

4. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Nomenclature

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Computational Intelligence Technologies for Occupancy Estimation and Comfort Control in Buildings

Abstract

1. Introduction

2. Materials and Methods

2.1. Problem Statement

2.2. Stochastic Processes and CO2

2.3. The Hull–White Model as Mass Balance Equation

2.4. Estimation of the Stochastic Process Parameters

2.5. Fuzzy Occupancy Estimation

2.6. Fuzzy Q -Learning and Multi Agent Control System

2.7. System Modelling and Description

2.7.1. Heating and Cooling System

2.7.2. Lighting System

2.7.3. Ventilation System

2.8. Multi Agent Control System

3. Results

3.1. Parameters and Occupancy Estimation

3.2. Multi Agent Control Framework

4. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Nomenclature

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.2. Stochastic Processes and CO₂

2.6. Fuzzy $Q$ -Learning and Multi Agent Control System