The Application of Cyber Physical System for Thermal Power Plants : Data-Driven Modeling

Optimal operation of energy systems plays an important role to enhance their lifetime security and efficiency. The determination of optimal operating strategies requires intelligent utilization of massive data accumulated during operation or prediction. The investigation of these data solely without combining physical models may run the risk that the established relationships between inputs and outputs, the models which reproduce the behavior of the considered system/component in a wide range of boundary conditions, are invalid for certain boundary conditions, which never occur in the database employed. Therefore, combining big data with physical models via cyber physical systems (CPS) is of great importance to derive highly-reliable and -accurate models and becomes more and more popular in practical applications. In this paper, we focus on the description of a systematic method to apply CPS to the performance analysis and decision making of thermal power plants. We proposed a general procedure of CPS with both offline and online phases for its application to thermal power plants and discussed the corresponding methods employed to support each sub-procedure. As an example, a data-driven model of turbine island of an existing air-cooling based thermal power plant is established with the proposed procedure and demonstrates its practicality, validity and flexibility. To establish such model, the historical operating data are employed in the cyber layer for modeling and linking each physical component. The decision-making procedure of optimal frequency of air-cooling condenser is also illustrated to show its applicability of online use. It is concluded that the cyber physical system with the data mining technique is effective and promising to facilitate the real-time analysis and control of thermal power plants.


Introduction
Thermal power plants convert chemical energies of fossil fuels, e.g., coal, natural gas, petroleum or oil, into electricity.Fossil fuels still play the most important role in energy generation sector and contributes 80% of world's power generation [1].Nowadays, more and more aspects [2] are considered for the operation strategy of thermal power plants to improve key performance indicators, e.g., stability, safety, efficiency [3,4], flexibility [5,6] and emission level.The controlling and optimization of thermal power plants are facing new challenges under various boundary conditions and operation modes, which may be potentially handled by the application of cyber physical systems (CPS).
The concept of CPS has been proven to be an effective method for design and development of future systems [7].CPS refers to the conjoining and coordination between the cyber layers Energies 2018, 11, 690 2 of 16 (e.g.information, models, simulation) and the physical layers (e.g., components, sensors and actuators).The cyber and physical layers can be operated at different spatial and temporal scales and can be tightly interacted with each other.There are three issues to be addressed for the application of CPS (as illustrated in Figure 1): (1) The cyber layers can receive real-time information of the state and performance of the physical layers; (2) CPS should include well-established models of each component, so that the physical layers can be accurately represented in the cyber layers; (3) Decision-making of the physical layers should be based on the analysis and results of the cyber layers.The handling of these issues relies on data mining technique for data analysis as well as the communication network for bidirectional data transmission.The whole resolution procedure involves the analysis of big data, the method of machine learning, and the concept of intellectual control and analysis.With such features, CPS has been widely applied in the field of energy systems [8], automotive systems [9], medical systems [10] and transportation systems [11].Specific to the applications in energy systems, CPS has been intensively applied to, e.g., the smart grid [12,13], distribute power systems [14], smart buildings and cities [15][16][17], and peak shaving of renewable energy [18,19].
Energies 2017, 10, x FOR PEER REVIEW 2 of 16 The concept of CPS has been proven to be an effective method for design and development of future systems [7].CPS refers to the conjoining and coordination between the cyber layers (e.g.information, models, simulation) and the physical layers (e.g., components, sensors and actuators).The cyber and physical layers can be operated at different spatial and temporal scales and can be tightly interacted with each other.There are three issues to be addressed for the application of CPS (as illustrated in Figure 1): (1) The cyber layers can receive real-time information of the state and performance of the physical layers; (2) CPS should include well-established models of each component, so that the physical layers can be accurately represented in the cyber layers; (3) Decision-making of the physical layers should be based on the analysis and results of the cyber layers.The handling of these issues relies on data mining technique for data analysis as well as the communication network for bidirectional data transmission.The whole resolution procedure involves the analysis of big data, the method of machine learning, and the concept of intellectual control and analysis.With such features, CPS has been widely applied in the field of energy systems [8], automotive systems [9], medical systems [10] and transportation systems [11].Specific to the applications in energy systems, CPS has been intensively applied to, e.g., the smart grid [12,13], distribute power systems [14], smart buildings and cities [15][16][17], and peak shaving of renewable energy [18,19].

Physical layer
Database Server Information analysis: controlling, efficiency, stability, deterioration...The generic structure of cyber physical system for the application of thermal power plants.

• • •
However, there are few papers on the application of CPS in power generation sector and there is a lack of general framework and standard application procedure.In this paper, the CPS for the application of thermal power plants is intensively discussed.Typical CPS structure is illustrated in Figure 1: The components, such as boiler, turbines, heat exchangers and pumps, are linked in the physical layer.For each component, corresponding submodules are established and linked to the cyber layer.The interaction between both layers relies on the communication network.The motivation of CPS application for thermal power plants is, based on accurate simulation models in the cyber layer, to deeply investigate the information generated in the physical layer for the online analysis and decision-making under any operation mode or any boundary conditions.The iterative procedure between the physical and cyber layers continuously gathers information from the physical layer, processes and analyzes data in the cyber layer, and sends analysis results to the physical layer to support the decision-making.
This paper is organized as follows: In Section 2, we propose a general framework for applying the CPS to thermal power plants after a review of various types of modeling methodologies and a follow-up discussion on how to choose the most proper method.Section 3 introduces a case study to describe the physical and cyber layers specifically for a traditional supercritical coal-fired power plant.A data-driven model of air-cooling condenser is established to further support the application of the proposed idea.Section 4 discusses the relationships between the theoretical modeling and However, there are few papers on the application of CPS in power generation sector and there is a lack of general framework and standard application procedure.In this paper, the CPS for the application of thermal power plants is intensively discussed.Typical CPS structure is illustrated in Figure 1: The components, such as boiler, turbines, heat exchangers and pumps, are linked in the physical layer.For each component, corresponding submodules are established and linked to the cyber layer.The interaction between both layers relies on the communication network.The motivation of CPS application for thermal power plants is, based on accurate simulation models in the cyber layer, to deeply investigate the information generated in the physical layer for the online analysis and decision-making under any operation mode or any boundary conditions.The iterative procedure between the physical and cyber layers continuously gathers information from the physical layer, processes and analyzes data in the cyber layer, and sends analysis results to the physical layer to support the decision-making.
This paper is organized as follows: In Section 2, we propose a general framework for applying the CPS to thermal power plants after a review of various types of modeling methodologies and a follow-up discussion on how to choose the most proper method.Section 3 introduces a case study to describe the physical and cyber layers specifically for a traditional supercritical coal-fired power plant.A data-driven model of air-cooling condenser is established to further support the application of the proposed idea.Section 4 discusses the relationships between the theoretical modeling and machine learning based modeling.Finally, in Section 5, the conclusions and key innovations are further emphasized.

General Methodology for Applying CPS to Thermal Power Plants
In this section, the general approach for applying cyber physical system to thermal power plants is discussed.The concept of cyber physical system emphasizes the integration between the knowledge derived by machine learning algorithms from field data and theoretical analysis.The cyber layer first uses the historical data collected from the physical layer (e.g., the actual system layout and measurements) to establish the accurate regression model to represent the behavior of the considered components and systems under the system boundary and component states occurred in the past and to predict closely the system/component performance under different conditions.Then, the established models are employed as a parallel simulation platform, which receives real-time measured boundary conditions to obtain the optimal controlling strategy and/or the analysis of the efficiency, stability or safety of the physical layer.The general procedure of such CPS for thermal power plants is shown in Figure 2, where the steps A1-A3 represent the offline phase, while the steps B1-B3 stand for the online phase.The offline and online phases can be combined to provide an iterative online application.
Energies 2017, 10, x FOR PEER REVIEW 3 of 16 machine learning based modeling.Finally, in Section 5, the conclusions and key innovations are further emphasized.

General Methodology for Applying CPS to Thermal Power Plants
In this section, the general approach for applying cyber physical system to thermal power plants is discussed.The concept of cyber physical system emphasizes the integration between the knowledge derived by machine learning algorithms from field data and theoretical analysis.The cyber layer first uses the historical data collected from the physical layer (e.g., the actual system layout and measurements) to establish the accurate regression model to represent the behavior of the considered components and systems under the system boundary and component states occurred in the past and to predict closely the system/component performance under different conditions.Then, the established models are employed as a parallel simulation platform, which receives real-time measured boundary conditions to obtain the optimal controlling strategy and/or the analysis of the efficiency, stability or safety of the physical layer.The general procedure of such CPS for thermal power plants is shown in Figure 2, where the steps A1-A3 represent the offline phase, while the steps B1-B3 stand for the online phase.The offline and online phases can be combined to provide an iterative online application.

Preprocessing Historical Data (A1)
The CPS relies highly on the accuracy and quality of data sources.The preprocessing of historical data mainly includes the following three aspects:

Attribute Selection Based on the Business Target
A good understanding of the business target is the cornerstone for the proper selection of necessary input attributes to build the cyber-representation model for the considered components or subsystems.The physical layer also needs to be considered for the selection of the attributes.The sensors and actuators need to be well placed and, particularly, additional sensors or actuators for offering enough measurement redundancy might be added for monitoring and controlling of the components.With the redundancy measurements, some difficult-to-measure but necessary

Preprocessing Historical Data (A1)
The CPS relies highly on the accuracy and quality of data sources.The preprocessing of historical data mainly includes the following three aspects:

Attribute Selection Based on the Business Target
A good understanding of the business target is the cornerstone for the proper selection of necessary input attributes to build the cyber-representation model for the considered components or subsystems.The physical layer also needs to be considered for the selection of the attributes.The sensors and actuators need to be well placed and, particularly, additional sensors or actuators for offering enough measurement redundancy might be added for monitoring and controlling of the components.With the redundancy measurements, some difficult-to-measure but necessary attributes can be obtained indirectly based on the basic mass and energy balance established with physical knowledge.For instance, most Rankine-cycle based thermal power plants do not equip sensors for direct measurement of mass flowrate of the main steam, which is usually calculated based on the mass balance (e.g., the outlet mass flowrate of the condensate pump) and the energy balance of the feedwater preheaters.Therefore, the measurement system needs to be well defined and optimized to ensure that all the information required for the cyber layer is obtained with certain redundancy.

The Selection of the Historical Samples Based on Timeliness
For establishing a data-driven cyber physical model, it is important to select a proper period of historical samples.The latest samples should be considered with a high priority, since they represent the current component/system states more reasonably.There are mainly two issues for the selection of the historical samples: (1) How large the training dataset should be and (2) when the model should be updated to accurately reflect the current system/component states.Generally, a larger training dataset would cover more different boundary conditions so that the model established based on this data is expected to be valid for various situations and does not need to be updated frequently.However, a larger training dataset not only means a large computation time but also involves the representativeness of the data outdated.For a small dataset, it might not contain enough information for calibrating a model for those conditions not covered in the dataset.

The Enhancement of Data Quality
The data-driven model established at the cyber phase is based on the measurements, therefore the data quality may not be good enough due to measurement noises and sensors faults.However, the accuracy and validity of the model highly relays on the quality of the employed training data.Therefore, it is highly recommended to perform rigorous and conservative preprocessing of measurement data to ensure high data quality.Several data processing techniques are available with their advantages and disadvantages highlighted below. (

1) Data reconciliation
The well-known data reconciliation method is an excellent choice to enhance data quality.Data reconciliation technique utilizes redundant measurements from the employed sensor network to correct the measured value of each sensor by considering basic mass and energy balance of the flowsheet represented by the sensor network.Data reconciliation enhances the reliability of the original measured values by reconciled values obtained by systematically minimizing the system error of the sensor network, which is formulated as Equations ( 1) and ( 2 where the terms xi and x i are the reconciled and measured values, respectively.The term σ i is the standard deviation of the corresponding measured variable, while the objective function γ is the sum of all errors between the reconciled and measured values.The constraint functions f l mainly refer to the mass balance equations, energy balance equations, and other physical equations or even the empirical equations.In the constraint functions f l , the term ûk is the calculated (also named as reconciled) value of the unmeasured variables.The terms m, p and q represent the number of measured and unmeasured variables, and the number of constraint equations, respectively.As measurement redundancy is required for data reconciliation, the number of constraint equations should be larger than that of unmeasured variables (p > q) to solve the optimization problem of Equation ( 1), while the degrees of freedom can be described as r = p − q.In addition, data reconciliation is not compulsory but is highly recommended for building a highly-reliable model of the considered thermal power plant.
Energies 2018, 11, 690 5 of 16 (2) Outlier detection The outliers, wrong measurements, in the recorded operating data may significantly reduce the reliability of the reconciled values.The outliers occur mainly because of the faults of the sensors themselves, the errors during data transmission, and the changes of component states due to, e.g., unit maintenance.Many useful approaches based on, e.g., statistical hypothesis [20], distance, density or frequency characteristics of the dataset [21][22][23], or machine learning [24][25][26], have been developed to detect outliers.As the outliers may occur at different locations and may not share the common natures, the choice of a suitable detection method based on the data characteristics is of great significance.
(3) Normalization Most attributes for modeling thermal power plants are continuous variables.Each of these attributes has its own unit and scaling.The normalization is a way of ensuring that the structure of the dataset is appropriate for the employed mathematical algorithms (discussed in detail in Section 2.2).For example, for the normalization in support vector regression method, the original data is usually scaled into [0,1] or [−1,1], while in Auto-Associative Kernel Regression (AAKR) method [27], the dataset is usually normalized to have zero mean and standard deviation equal to one.The lack of normalization may lead to longer computation time or even computational divergence.

Regression Model Establishment (A2)
This step essentially regresses the component/system states, which can predict the corrected measured variables of components of subsystems for given boundary conditions.The real component and the cyber layer can be directly connected with the regression model and the feedback from the cyber layer can assist the CPS to perform more reasonable and beneficial controlling of physical components.With advanced artificial intelligence to determine the optimal operating conditions for current boundary conditions and component states, the physical controlling procedure can be optimized to enhance the competitiveness.

Physical Modeling or Thermodynamic Modeling
If the behavior of the considered component has already been studied intensively in the literature, a physical model can usually be established in the cyber layer with physical equations or empirical equations for simulating the components [28][29][30].Component characteristics, such as efficiency curve, heat transfer coefficient or pressure drop, can be acquired by analyzing the historical data recorded by the sensor network with parameter estimation technique.This is of great importance since the component performance usually decays after a certain time of operation and the original design performance cannot be valid for long-term operation.This ensures that the component states are always up-to-date.

Machine Learning
Machine learning is also a technique to identify and correct the parameters representing the component behaviors.Generally, machine learning is classified as supervised and unsupervised learning algorithms.The monitoring and controlling of thermal power plants always depend on various attributes and the relationships between the input variables and the targets (outputs).Thus, data mining in cyber physical system mainly refers to supervised learning algorithms, among which the support vector regression, neural network, fuzzy rough sets and decision tree are the most recommended methods for the application of thermal power plants.Here we focus on the support vector regression for machine learning.Support vector regression (SVR) investigates mathematically the best-possible relationship between inputs attributes x and target variable y.The regression function of SVR algorithm, which is more concise compared with neural network, is formulated as Equation (3): where the terms w = [w 1 , w 2 , . . . ,w l ] T and b are the weight factor of each support vector and the function bias, respectively.The function ϕ(x) = [ϕ 1 (x 1 ), . . . ,ϕ l (x l )] T maps the input attributes x into a high-dimension space ϕ(x).The training process of SVR model contains a trade of between the structural risk and confidence interval, which means SVR method usually has a good generalization performance.Besides, the concept of support vectors refers to the historical samples on the boundary of the decision surface.These samples are similar as the baseline, which shows the generalization of the established model and gives a further understanding of the component performance.These features make SVR method highly suitable for modelling thermal power plants.However, the shortage of SVR method is that it becomes computational expensive with an increasing data size.

Offline Model Validation (A3)
Before real-time optimization to offer the optimal controlling strategies, the established regression model should be validated to ensure high accuracy for various conditions.A commonly used technique is to employ 90% of the chosen dataset (after step A1) for the modeling calibration (step A2) and to use the remaining of the dataset for model validation.If the established model is with high accuracy for different operating conditions, it is then possible to be applied for the online phase.

Preprocessing of the Real-Time Data (B1)
This step is in fact similar to the step A1: preprocessing of historical data.However, only one sample or several samples are needed for each iteration, depending on whether the temporal information is needed in the model, thus less computation time is possible to achieve for real-time control.

Online Performance Analysis and Decision Making (B2)
With the established regression models, CPS can mainly use for performance analysis and decision making [31].Specific to thermal power plants, CPS is available for monitoring the component conditions or system performances (e.g., thermal efficiency [4,5] or heat transfer efficiency), fault diagnosis (e.g., online advanced exergy analysis [32,33]) and life cycle analyze (e.g., [34]).Moreover, CPS can also support decision making and parameter optimization [35] by automatically determining the optimal controlling strategy and the corresponding operating parameters according to historical data and present state, which is further explained by the case study in Section 3.2.2.

Model Updates (B3)
The offline model was built by the recent, most representative historical data, while various situations, e.g., significant changes of boundary conditions, the failure of key components and continuous performance degradation of components, may cause a significant mismatch between the prediction of the regressed model and the actual component/system performance.Regular inspections of the target variables are needed to check when the regressed model should be updated to reflect the current component/system status: (1) Model rebuilding Component overhaul or the change of operation modes leads to the requirement of rebuilding the model.The old datasets are discarded, and the current information is needed to rebuild the model.For example, the flowsheets of turbine subsystem are different in the heating period and non-heating period, thus the input attributes of extracted steam are different for modeling the turbine subsystem.
(2) Incremental learning Incremental learning can employ newly-added information and discard the outdated to prevent the burden of the huge dataset.This method is mainly used for the situation of performance degradation, under which the efficiency or the value of character parameter decays gradually and slowly.The difference between the predicted and the actual values increases if without a regular update.Incremental learning and support vector regression is the most popular combined techniques.Incremental support vector machine uses partial support vectors as the "candidate vectors", while the final support vectors are selected depending on the newly-added samples [36].Incremental learning is useful for support vector regression method, especially giving the consideration that the computational memory of the SVR increases significantly for a large dataset.Similarly, the incremental supervised learning method with neural network is also an effective method for a large dataset (e.g., [37]).
(3) Model unchanged The incremental learning and the rebuild of the model are optional.Usually, new information just continuously passes through the steps B1 and B2, and then the online analysis and decision making are realized in real time.

Case Study: The Application to Turbine Subsystem and Air Cooling Condenser
Section 2 discussed the general steps and the methodologies for establishing a cyber physical system of thermal power plants.However, for different components or subsystems, the design of data-driven analysis systems can be complicated.Take the optimal controlling strategy of air-cooling condenser (ACC) as an example: The increase in fan frequency enhances the cycle efficiency and accordingly boosts the gross power output .E gross ; however, the air fan power consumption .E fan is also increased.Thus, there exists a maximum net power gain with a corresponding optimal fan frequency.Below the optimal fan frequency, the net power gain increases with the increase in the fan frequency.With the fan frequency increases continuously, the net power gain decreases as the fan consumption .E fan increases dramatically.
The optimal controlling strategy of air fans under various load rates, ambient temperatures and wind conditions is one of the most important issues of such power plants.In this section, the procedure of how to solve this decision-making problem based on the proposed CPS idea in Figure 2 is discussed.The turbine subsystem of a traditional 660 MW single-reheat supercritical coal-fired power plant is described first in Section 3.1.The maximization of the net power gain is carried out in Section 3.2 based on the real-time data.Only the modeling of air cooling condenser, the key component of such a system, is given in detail below rather than all the turbine subsystem components.

The Physical Layer
The physical layer of the thermal power plant is described in terms of components, flowsheet, sensors and actuators.The major physical components involved are the boiler, generator (G), turbine (divided into several stages), feedwater preheaters, air cooling condenser (ACC), feedwater pump (FWP), condensate pump (CP) and deaerator (DA).The flowsheet of the turbine subsystem is illustrated in Figure 3.The main steam generated in the boiler is expanded in the high-pressure turbine (HPT1-2) and is reheated afterward.The reheated steam then expands through the intermediate-pressure (IPT1-2) and low-pressure turbines (LPT1-4) and is finally condensed in the air-cooling condenser (ACC).For each turbine stage, certain amount of steam is extracted for heating up feedwater in the ith feedwater preheater (FWPHi) before entering the boiler.In addition, a de-aerator (DA) is also equipped to remove dissolved gas in the feedwater.
The sensor network placed in the turbine subsystem is also drawn in Figure 3, where the red, green and white marks represent the sensors for pressure, temperature and mass flowrate, respectively.A temperature sensor and a pressure sensor are applied for each extracted steam.The inlet and outlet positions of the feedwater pump are applied with temperature sensors.The ambient temperature and ambient pressure are also recorded, which is mainly used for analyzing the performance of air cooling condenser.Mass flowrate sensors are placed at three different locations: the outlet of condensate pump, the inlet of the deaerator and the outlet of feedwater pump.The electric output of the generator and the power consumption of air fans are also measured.Besides, there are multiple sensors of each measurement point for a mutual check.The main actuators in turbine subsystem include the control valve of turbines and pumps, and the frequency of air fans, etc.
Energies 2017, 10, x FOR PEER REVIEW 8 of 16 inlet and outlet positions of the feedwater pump are applied with temperature sensors.The ambient temperature and ambient pressure are also recorded, which is mainly used for analyzing the performance of air cooling condenser.Mass flowrate sensors are placed at three different locations: the outlet of condensate pump, the inlet of the deaerator and the outlet of the feedwater pump.The electric output of the generator and the power consumption of air fans are also measured.Besides, there are multiple sensors of each measurement point for a mutual check.The main actuators in turbine subsystem include the control valve of turbines and pumps, and the frequency of air fans, etc.

The Cyber Layer
The cyber system is responsible for performance analysis and decision making of each component.The modeling of turbine subsystem, especially the ACC, in the offline phase is given in Section 3.2.1.Based on the offline model, the decision making of optimal air fan frequency in online phase is solved and discussed in Section 3.2.2.
With the analysis of two-week real operation data (Figure 4), we have found that the present controlling strategy of fan frequency can only deliver qualitative control: The higher the load factor and the higher the ambient temperature, the larger the fan frequency will be.When the ambient temperature is higher than 27 C ° or the load factor is higher than 90%, the fan frequency is adjusted to the maximum (55 Hz).For low load factors and low ambient temperatures, the fan frequency is varied within a wide range.Such control strategy follows the normal logics but is not necessarily the optimal in terms of overall-plant performance.A quantitative guide is needed to make sure the optimal plant operation under any combinations of ambient temperature and load factor.Therefore, following the procedure described in Figure 2, the quantitative relationship between the optimal fan frequency and the boundary conditions is established and discussed.

The Cyber Layer
The cyber system is responsible for performance analysis and decision making of each component.The modeling of turbine subsystem, especially the ACC, in the offline phase is given in Section 3.2.1.Based on the offline model, the decision making of optimal air fan frequency in online phase is solved and discussed in Section 3.2.2.
With the analysis of two-week real operation data (Figure 4), we have found that the present controlling strategy of fan frequency can only deliver qualitative control: The higher the load factor and the higher the ambient temperature, the larger the fan frequency will be.When the ambient temperature is higher than 27 • C or the load factor is higher than 90%, the fan frequency is adjusted to the maximum (55 Hz).For low load factors and low ambient temperatures, the fan frequency is varied within a wide range.Such control strategy follows the normal logics but is not necessarily the optimal in terms of overall-plant performance.A quantitative guide is needed to make sure the optimal plant operation under any combinations of ambient temperature and load factor.Therefore, following the procedure described in Figure 2, the quantitative relationship between the optimal fan frequency and the boundary conditions is established and discussed.

The Offline Phase for Performance Prediction of the Turbine Subsystem and ACC
The cyber layer of the turbine subsystem is illustrated in Figure 5.Following the actual flowsheet in the physical layer, the models of each turbine stage (ℱ ), feedwater preheaters (ℱ ), pumps (ℱ ), ACC (ℱ ), boiler, and DA are linked in the cyber layer.For given boundary conditions, the gross power output of the turbine and the power consumption of air fans can be calculated by ℱ .The difficulty of accurate calculation of both terms is the determination of backpressure , a crucial parameter for performance prediction of the ACC.Here, only the modeling of ℱ by machine learning (support vector regression) is discussed in detail.Air cooling condenser aims at removing heat from the exhaust steam.For ACC controlling, the main actuator is the AC asynchronous motors for adapting the air-fan frequency.There are sensors recording ambient temperature and pressure, ambient wind speed and direction, inlet and outlet thermodynamic state of the ambient air and exhaust steam, etc.The key issue is to accurately calculate the heat transfer between exhausted steam and cooling air under various conditions, e.g., air flowrate and temperature.
Step A1: Preprocessing of the historical data.One-month operation data, which was recorded by Supervisory Information System (SIS) every 60 s, is employed and only quasi-steady state points are selected.The attributes for modeling are selected according to the physical knowledge, including , , , , and ℎ .Data reconciliation is also applied to enhance the data quality.
Step A2: Regression model establishment.The traditional physical equations for modeling (e.g., [38,39]) lack the consideration of ambient wind and fouling.In this case, the ε-SVR method [40] is applied for training the back pressure : The present operation strategy of fan frequency.

The Offline Phase for Performance Prediction of the Turbine Subsystem and ACC
The cyber layer of the turbine subsystem is illustrated in Figure 5.Following the actual flowsheet in the physical layer, the models of each turbine stage (F turbine ), feedwater preheaters (F FWPH ), pumps (F pump ), ACC (F ACC ), boiler, and DA are linked in the cyber layer.For given boundary conditions, the gross power output of the turbine .E gross and the power consumption of air fans .E fan can be calculated by F ACC .The difficulty of accurate calculation of both terms is the determination of backpressure p back , a crucial parameter for performance prediction of the ACC.

The Offline Phase for Performance Prediction of the Turbine Subsystem and ACC
The cyber layer of the turbine subsystem is illustrated in Figure 5.Following the actual flowsheet in the physical layer, the models of each turbine stage (ℱ ), feedwater preheaters (ℱ ), pumps (ℱ ), ACC (ℱ ), boiler, and DA are linked in the cyber layer.For given boundary conditions, the gross power output of the turbine and the power consumption of air fans can be calculated by ℱ .The difficulty of accurate calculation of both terms is the determination of backpressure , a crucial parameter for performance prediction of the ACC.Here, only the modeling of ℱ by machine learning (support vector regression) is discussed in detail.Air cooling condenser aims at removing heat from the exhaust steam.For ACC controlling, the main actuator is the AC asynchronous motors for adapting the air-fan frequency.There are sensors recording ambient temperature and pressure, ambient wind speed and direction, inlet and outlet thermodynamic state of the ambient air and exhaust steam, etc.The key issue is to accurately calculate the heat transfer between exhausted steam and cooling air under various conditions, e.g., air flowrate and temperature.
Step A1: Preprocessing of the historical data.One-month operation data, which was recorded by Supervisory Information System (SIS) every 60 s, is employed and only quasi-steady state points are selected.The attributes for modeling are selected according to the physical knowledge, including , , , , and ℎ .Data reconciliation is also applied to enhance the data quality.
Step A2: Regression model establishment.The traditional physical equations for modeling (e.g., [38,39]) lack the consideration of ambient wind and fouling.In this case, the ε-SVR method [40] is applied for training the back pressure : Here, only the modeling of F ACC by machine learning (support vector regression) is discussed in detail.Air cooling condenser aims at removing heat from the exhaust steam.For ACC controlling, the main actuator is the AC asynchronous motors for adapting the air-fan frequency.There are sensors recording ambient temperature and pressure, ambient wind speed and direction, inlet and outlet thermodynamic state of the ambient air and exhaust steam, etc.The key issue is to accurately calculate the heat transfer between exhausted steam and cooling air under various conditions, e.g., air flowrate and temperature.
Step A1: Preprocessing of the historical data.One-month operation data, which was recorded by Supervisory Information System (SIS) every 60 s, is employed and only quasi-steady state points are selected.The attributes for modeling p back are selected according to the physical knowledge, including f fan , t air,in , V wind , .m ex and h ex .Data reconciliation is also applied to enhance the data quality.
Step A2: Regression model establishment.The traditional physical equations for modeling p back (e.g., [38,39]) lack the consideration of ambient wind and fouling.In this case, the ε-SVR method [40] is applied for training the back pressure p back : Energies 2018, 11, 690 10 of 16 where the input attributes, f fan , t air,in , V wind , .m ex and h ex are fan frequency, ambient temperature, wind speed, the mass flowrate and enthalpy of the exhaust steam.
Step A3: Offline validation.90% of the selected samples are employed for training, while the remaining 10% is used to validate the accuracy of the trained SVR model.The comparison between the model prediction and measured values for these 10% samples is illustrated in Figure 6.Around full load operation, the fan frequency reaches its maximum and the ambient temperature is the major driving force for such a control, since around that load the back pressure increases with the increase in the ambient temperature.For instance, under full-load conditions with an ambient temperature higher than 27 • C, the back pressure is higher than 20 kPa even if the air fans run at full speed.Under partial-load condition or with a low ambient temperature, however, it is much easier to maintain a lower back pressure by using a relatively low fan frequency, which potentially reduces the total fan power and enhances the overall plant performance.The validation in Figure 6 illustrates that the root means square error (RMSE) between the actual value and the prediction of back pressure is around 0.8 kPa under different operating conditions.
Energies 2017, 10, x FOR PEER REVIEW 10 of 16 = ℱ , , , where the input attributes, , , , , and ℎ are fan frequency, ambient temperature, wind speed, the mass flowrate and enthalpy of the exhaust steam.
Step A3: Offline validation.90% of the selected samples are employed for training, while the remaining 10% is used to validate the accuracy of the trained SVR model.The comparison between the model prediction and measured values for these 10% samples is illustrated in Figure 6.Around full load operation, the fan frequency reaches its maximum and the ambient temperature is the major driving force for such a control, since around that load the back pressure increases with the increase in the ambient temperature.For instance, under full-load conditions with an ambient temperature higher than 27 C °, the back pressure is higher than 20 kPa even if the air fans run at full speed.Under partial-load condition or with a low ambient temperature, however, it is much easier to maintain a lower back pressure by using a relatively low fan frequency, which potentially reduces the total fan power and enhances the overall plant performance.The validation in Figure 6 illustrates that the root means square error (RMSE) between the actual value and the prediction of back pressure is around 0.8 kPa under different operating conditions.Ambient temperature (°C) Figure 6.The validation between the calculated and the measured values (also discussed in [41]).

The Online Phase and Decision Making of Optimal Air-Fan Frequency
The online phase continuously executes the steps B1-B3 for real-time performance analysis and decision making.Preprocessing of the real-time data (Step B1) is similar to preprocessing of the historical data (step A1) and the corresponding input attributes are shown in Figure 5.With these inputs and the regression function of Equation ( 4), and can be predicted under any boundary condition.For online analysis and decisions (step B2), the optimal controlling strategy of air fan frequency is an optimization problem and can be formulated as follows: Figure 6.The validation between the calculated and the measured values (also discussed in [41]).

The Online Phase and Decision Making of Optimal Air-Fan Frequency
The online phase continuously executes the steps B1-B3 for real-time performance analysis and decision making.Preprocessing of the real-time data (Step B1) is similar to preprocessing of the historical data (step A1) and the corresponding input attributes are shown in Figure 5.With these inputs and the regression function of Equation (4), .E fan and .E gross can be predicted under any boundary condition.For online analysis and decisions (step B2), the optimal controlling strategy of air fan frequency is an optimization problem and can be formulated as follows:  where the term ∆ .
E net is the net power gain, which should be maximized.
. E gross,0 and .E fan,0 are the reference power with a fan frequency of 35 Hz.The terms p stream , t stream and .m stream represent the thermodynamic state of the streams for calculating turbine performance, feedwater preheaters, pumps, etc.
Figure 7 shows the optimal fan frequency and the corresponding net power gain ∆ .
E net under different load factors.Note that the input values of t air,in and V wind are set as constant, and the p stream , t stream and .m stream are set as the design values in each load factor.Figure 7 illustrates that the optimal fan frequency increases with the increasing in the load factor.When the load factor reaches around 90%, the fan frequency has reached the maximum allowed frequency (55 Hz).Note that the optimal controlling strategy can also be calculated for any given t air,in , V wind , p stream , t stream and .m stream .
Energies 2017, 10, x FOR PEER REVIEW 11 of 16 where the term ∆ is the net power gain, which should be maximized.
, and , are the reference power with a fan frequency of 35 Hz.The terms , and represent the thermodynamic state of the streams for calculating turbine performance, feedwater preheaters, pumps, etc.
Figure 7 shows the optimal fan frequency and the corresponding net power gain ∆ under different load factors.Note that the input values of , and are set as constant, and the , and are set as the design values in each load factor.Figure 7 illustrates that the optimal fan frequency increases with the increasing in the load factor.When the load factor reaches around 90%, the fan frequency has reached the maximum allowed frequency (55 Hz).Note that the optimal controlling strategy can also be calculated for any given , , , , and .A further comparison of actual and the optimal controlling strategy at the conditions (load factor 59.6%, air inlet temperature 23.8 °C) is given in Table 1.It is found that a higher of 49.0 Hz is preferred compared to the actual applied frequency of 37.1 Hz to enhance the heat transfer effect, which is mostly due to the performance degradation of ACC after years of running.With the optimal fan frequency, although the increases from 3.2 MW (37.1 Hz) to 7.3 MW (49.0 Hz) when reaching the optimal condition, the gross power output increases more than the increment of power consumption.The trained model of ℱ can be kept as long as the boundary conditions of newly recorded data have already been considered in the training data.However, the ℱ model should be rebuilt with the corresponding record data, when the boundary conditions have been changed dramatically, e.g., in winter, one or more columns of air-fan clusters will be stopped for anti-freezing and the ambient temperature is decreases significantly (even out of the temperature range covered by the currently-employed data).

Discussion: The Combination of Mathematical Algorithms and Physical Knowledge
With data mining technique and big data in the cloud, the CPS enables performance analysis and the feedback of the controlling strategy in a quantitative manner from the cyber layer.However, there are some drawbacks associated with the mathematical algorithms, where physical knowledge is needed as the complementary.A further comparison of actual and the optimal controlling strategy at the conditions (load factor 59.6%, air inlet temperature 23.8 • C) is given in Table 1.It is found that a higher f fan of 49.0 Hz is preferred compared to the actual applied frequency of 37.1 Hz to enhance the heat transfer effect, which is mostly due to the performance degradation of ACC after years of running.With the optimal fan frequency, although the .E fan increases from 3.2 MW (37.1 Hz) to 7.3 MW (49.0 Hz) when reaching the optimal condition, the gross power output increases more than the increment of power consumption.The trained model of F ACC can be kept as long as the boundary conditions of newly recorded data have already been considered in the training data.However, the F ACC model should be rebuilt with the corresponding record data, when the boundary conditions have been changed dramatically, e.g., in winter, one or more columns of air-fan clusters will be stopped for anti-freezing and the ambient temperature is decreases significantly (even out of the temperature range covered by the currently-employed data).

Discussion: The Combination of Mathematical Algorithms and Physical Knowledge
With data mining technique and big data in the cloud, the CPS enables performance analysis and the feedback of the controlling strategy in a quantitative manner from the cyber layer.However, there are some drawbacks associated with the mathematical algorithms, where physical knowledge is needed as the complementary.
When establishing a complex model, mathematical algorithms with, e.g., gray relation analysis, correlation coefficients and fuzzy rough sets, are often used for selecting the input attributes.However, it is worth noting that once the target is clear, the related attributes are determined, regardless which algorithms are used.In this case, the most proper algorithm for calculating the related input attributes can be chosen according to the prior physical knowledge.For example, approach with correlation coefficient does not consider the combination of the attributes, which makes it not proper when there is high interaction between the input attributes.The method with fuzzy rough sets considers the combination of the attributes, which indicates its applicability for the dataset with classification characteristics.For the models with well-established physical knowledge or empirical equations, necessary attributes can be directly determined.A collection of literature with different methods for selecting the input attributes is listed in the 3rd column of Table 2.
The data-driven models are mostly employed as a black-box, which hardly illustrates the physical relationships behind the inputs and outputs, since machine learning algorithms rely highly on the data set rather than the mass and energy balance or constraint equations.A review of recent modeling methods for energy systems is listed in the 4th column of Table 2. Based on these experience, it is advised that machine learning algorithm should be used for a complex thermodynamic process, of which the internal characteristics cannot be straightforwardly revealed by a set of equations.A hybrid model combining machine learning and thermodynamics is suggested for modeling the components in thermal power plants.A comparison between Support vector machine, neural network and decision tree

Conclusions
In this paper, we introduced a general framework and the standard procedure for applying CPS to thermal power plants.According to the characteristics of the data recorded for the considered Energies 2018, 11, 690 13 of 16 thermal power plants, the proposed CPS is designed with offline modeling phase and online analysis phase.This work concerns the basic modeling and connection of each component in the cyber layer, and performs a case study to further explain the details of each involved step.The idea of hybrid modeling combining physical knowledge with data mining techniques is highlighted.
The case study focuses on the application of CPS to investigate the optimal controlling strategy of ACC in an air-cooled supercritical thermal power plant with the proposed data-driven approach.Prior knowledge of the ACC system is needed for the selection of the input attributes and for the enhancement of the data quality.For the case study, fan frequency, ambient temperature, wind speed, the mass flowrate and enthalpy of the exhaust steam are selected as the input attributes.The data preprocessing is performed with data reconciliation method by combining mass and energy balance equations.Then, the off-design model to predict the back pressure under different conditions is established with SVR.The RMSE of the back-pressure prediction model is only 0.81 kPa, which is accurate enough power plant application.Finally, the optimal fan frequency under various boundary condition is acquired for online application.The CPS model gives a quantitative relationship between the optimal fan frequency and the boundary conditions rather than a qualitative controlling strategy according to the operators' experience.The proposed framework is also capable of the performance analysis or decision making of any other component in thermal power plants.

Figure 1 .
Figure 1.The generic structure of cyber physical system for the application of thermal power plants.

Figure 1 .
Figure 1.The generic structure of cyber physical system for the application of thermal power plants.

Figure 2 .
Figure 2. The procedure to establish a data-driven model based on the concept of cyber physical systems (CPS).

Figure 2 .
Figure 2. The procedure to establish a data-driven model based on the concept of cyber physical systems (CPS).

Figure 3 .
Figure 3.The flowsheet of the turbine subsystem investigated.

Figure 3 .
Figure 3.The flowsheet of the turbine subsystem investigated.

Figure 4 .
Figure 4.The present operation strategy of fan frequency.

Figure 5 .
Figure 5.The cyber layer of turbine subsystem and the inputs for calculating and .

Figure 4 .
Figure 4.The present operation strategy of fan frequency.

Figure 5 .
Figure 5.The cyber layer of turbine subsystem and the inputs for calculating and .

Figure 5 .
Figure 5.The cyber layer of turbine subsystem and the inputs for calculating .E gross and .E fan .

Figure 7 .
Figure 7. Schematic of the optimal control of air fan frequency based on the cyber layer.

Figure 7 .
Figure 7. Schematic of the optimal control of air fan frequency based on the cyber layer.

Table 1 .
A comparison between the real operation value and optimal value.

Table 1 .
A comparison between the real operation value and optimal value.

Table 2 .
A review of recent data-driven modeling for components or subsystems in energy systems.