Performance Assessment of Large Photovoltaic (PV) Plants Using an Integrated State-Space Average Modeling Approach

: Distributed power converters represent a technical solution to improve the performance of large or utility-scale photovoltaic (PV) plants. Unfortunately, evaluation of the yield obtained in large PV ﬁelds by using distributed converters is a di ﬃ cult task because of recurring partial unavailability, inaccuracy of power analyzers, operating constraints imposed by the Power Plant Controller and so on. To overcome such issues in real operating scenarios, a new modeling strategy has been introduced and validated in terms of computational complexity and accuracy. This approach is based on the state-space averaging technique which is applied to large PV plants with multiple conversion stages by performing some elaborations in order to get a ﬁnal integrated model. The new modeling strategy has been tested in MatLab Simulink environment using data coming from a 300 MW PV plant located in Brazil representing the case study of this work. In this plant, one subﬁeld is equipped with central inverters while another is with string inverters. The proposed model, whose accuracy is in the range from 2.2 to 2.7% with respect to the measured energy, e ﬀ ectively supports data analysis leading to a consistent performance assessment for the distributed conversion system. Final results highlight that string inverters ensure a gain of about 2% in terms of produced energy. the monitoring system of the real PV plant in case of data issues, unavailability or external constraints.


Introduction
Distributed power converters in PV fields play a key role in the solar industry for new constructions and for retrofitting activities as well. Many tests are carried out in laboratory as well as in small size and medium size PV plants with the purpose to compare the performance of string and central inverters. Some examples are reported in papers [1][2][3][4]. On the contrary, in literature a comprehensive analysis about test results for large PV plants is missing.
A first reason for this lack of information deals with the low number of large PV fields using distributed converters. In fact, installation of central inverters is actually the most common option for large plants. At the same time, the number of plants with distributed converters is growing fast. In most cases, installation of distributed converters is aimed to get energy recovery related to mismatch losses occurring in case of non-uniform aging of PV modules, tracker faults and so on [5].
Focusing on performance evaluation for long time periods in large PV fields, some issues often occur in big data analysis. In fact, monitoring system can be affected by faults in dataloggers and sensors leading to missing or wrong data. More generally, the number of unavailabilities is expected to be quite high because of the high number of components in which a fault can occur. Moreover, in some cases there are additional operating constraints to take into account, for example thresholds set by the power plant controller (PPC) or by the inverters fixing the maximum power that a PV subfield can deliver to grid.
These problems can be solved by exploiting suitable models able to track the expected behavior of the PV plant in any operating conditions while filling the gaps due to faults and missing data.
Generally, modeling approach and simulation platform for large or utility-scale PV plants are chosen on the basis of specific analysis targets taking into account requirements in terms of acceptable computational effort. If target is the yield estimation for a long time period, modeling of power converters is usually neglected in order to limit the computational effort. In this case, converters are replaced by functional blocks considering only the efficiency value [14,15]. On the contrary, if some specific operating conditions need to be investigated including details on converters operation and losses, accurate models need to be implemented [16]. In the latter case, computational effort could be very high. Under this perspective, in case of energy assessment analysis for months or years, detailed models cannot be exploited to simulate the behaviour of a large PV system having thousands of modules and hundreds of distributed converters.
Some authors tried to overcome these limitations by introducing simplified modeling approaches usually known as behavioural models. For example, model presented in [14] simulates the electrical behaviour of commercial grid-connected PV inverters in accordance with regulations on power quality. Simulation results show the waveform of injected AC current in case of power dynamics or grid voltage disturbances. In [17] a non-parametric approach is used to evaluate the energy delivered to grid by six PV fields creating a forecast method by means of meteorological variables.
Although behavioural approaches fulfill the requirement of low computational burden, the actual physical configuration of the PV system could be completely neglected. In some cases, this is a relevant drawback. For example, if the performance analysis aims to evaluate losses over time in DC and AC cables, some technical details about converters (e.g., their topology and control strategy) are strictly necessary to calculate voltage and current values in both DC side and AC side. In many real PV plants, electrical quantities in some sections of the conversion system are not acquired by dataloggers, consequently an accurate estimation of losses is impossible.
The behavioural model described in this paper consists of an integrated state-space average model. The latter computes all the electrical quantities in each section of a PV plant. In such a way, it is possible to calculate losses, voltage drops, etc. Satisfactory accuracy is obtained while large computational effort is avoided. The advantages of the proposed approach are listed here:

•
Significant reduction of simulation time, see Section 4.2 • Basic model structure can be easily adapted to different system configurations (e.g., central inverters, string inverters, string optimizers combined with central inverters, etc.) performing minimal modifications • Several common identification methods can be used to identify parameters being part of the state-space representation of the PV system • Technical details about converters (e.g., their topology and control strategy) become no longer necessary. Furthermore, in many cases such information are covered by policies on industrial secrets • Behavioural model can be easily integrated in monitoring systems and exploited for forecasting purposes and for fault detection • The proposed state-space model can be implemented in any simulation platform The introduced model has been validated in terms of computational complexity and accuracy. It has been applied to a specific case study represented by the performance comparison between central and string converters installed in two different PV subfields which form a 5 MW experimental cluster of a 300 MW PV plant located in Brazil. Comparison criteria deal with energy production. Data analysis is supported by the proposed model developed in the MatLab/Simulink environment. The case study is briefly described in Section 2. Our modeling approach is introduced in Section 3 providing information on integrated state-space average method, parameters identification and control system. Section 4 reports model validation in terms of accuracy and computational effort. Section 5 describes the implementation of the introduced model for the case study. The same Section shows the aggregate results regarding the performance assessment of distributed converters. Figure 1 shows two pictures of the 300 MW PV plant representing the case study of this paper. PV field is connected to the 34.5 kV 60 Hz grid using power transformers, one for each 2.5 MW subfield. PV modules are mounted on horizontal single-axis tracking systems.

Case Study
Energies 2020, 13, x FOR PEER REVIEW 3 of 27

•
The proposed state-space model can be implemented in any simulation platform The introduced model has been validated in terms of computational complexity and accuracy. It has been applied to a specific case study represented by the performance comparison between central and string converters installed in two different PV subfields which form a 5 MW experimental cluster of a 300 MW PV plant located in Brazil. Comparison criteria deal with energy production. Data analysis is supported by the proposed model developed in the MatLab/Simulink environment. The case study is briefly described in Section 2. Our modeling approach is introduced in Section 3 providing information on integrated state-space average method, parameters identification and control system. Section 4 reports model validation in terms of accuracy and computational effort. Section 5 describes the implementation of the introduced model for the case study. The same Section shows the aggregate results regarding the performance assessment of distributed converters. Figure 1 shows two pictures of the 300 MW PV plant representing the case study of this paper. PV field is connected to the 34.5 kV 60 Hz grid using power transformers, one for each 2.5 MW subfield. PV modules are mounted on horizontal single-axis tracking systems. A 5 MW cluster was realized for testing purpose with the objective to compare the performance of central inverters and string inverters manufactured by international companies.

Case Study
In the 2.5 MW subfield with central converters there are two inverters whose rated power is 1025 kVA, fan cooled, mounted into an electric cabin. The actual AC power reaches about 1045 kVA in case of unity power factor.
In the 2.5 MW subfield with string inverters each converter has a rated power of 60 kVA. Thanks to a particular design, maximum power rises up to 66 kVA in case of unity power factor (66 kW) and of ambient temperature below 30 °C. String converters, mounted into the field without using cabinets and without fan cooling, are grouped in a cabin close to transformers using AC parallel switchboards named QPCA.
The number of PV modules connected to each conversion system is exactly the same (total DC rated power is 2570.4 kW for both subfields) so that comparison is performed in the same conditions. PV strings are composed of 30 PV modules in series. Rated power of modules is 315 W. There are 272 strings in each subfield, 136 for each central inverter and 8 for each string inverter.
The configuration of subfields under test is represented in Figures 2 and 3. Technical data for the main power components are given in Table 1.
System monitoring is realized using global irradiance sensors mounted on trackers, ambient temperature sensors, module temperature sensors, inverter temperature sensors, power analyzers and power meters. Energy flowing in central inverters is measured by a meter connected to voltage and current sensors placed at the AC side of inverters. In subfields with string inverters a meter provides the value of AC energy. At the DC side, string monitoring is embedded in each converter. Accuracy of data provided by the monitoring system can be assumed in the range 2.0-2.5%. The criterion selected for the performance assessment of distributed converters is basically the average gain in terms of energy produced by using string inverters with respect to central inverters. A 5 MW cluster was realized for testing purpose with the objective to compare the performance of central inverters and string inverters manufactured by international companies.
In the 2.5 MW subfield with central converters there are two inverters whose rated power is 1025 kVA, fan cooled, mounted into an electric cabin. The actual AC power reaches about 1045 kVA in case of unity power factor.
In the 2.5 MW subfield with string inverters each converter has a rated power of 60 kVA. Thanks to a particular design, maximum power rises up to 66 kVA in case of unity power factor (66 kW) and of ambient temperature below 30 • C. String converters, mounted into the field without using cabinets and without fan cooling, are grouped in a cabin close to transformers using AC parallel switchboards named QPCA.
The number of PV modules connected to each conversion system is exactly the same (total DC rated power is 2570.4 kW for both subfields) so that comparison is performed in the same conditions. PV strings are composed of 30 PV modules in series. Rated power of modules is 315 W. There are 272 strings in each subfield, 136 for each central inverter and 8 for each string inverter.
The configuration of subfields under test is represented in Figures 2 and 3. Technical data for the main power components are given in Table 1.
System monitoring is realized using global irradiance sensors mounted on trackers, ambient temperature sensors, module temperature sensors, inverter temperature sensors, power analyzers and power meters. Energy flowing in central inverters is measured by a meter connected to voltage and current sensors placed at the AC side of inverters. In subfields with string inverters a meter provides the value of AC energy. At the DC side, string monitoring is embedded in each converter. Accuracy of data provided by the monitoring system can be assumed in the range 2.0-2.5%. The criterion selected for the performance assessment of distributed converters is basically the average gain in terms of energy produced by using string inverters with respect to central inverters.

Modeling Approach
A generic representation of a large PV plant is shown in Figure 4. This basic configuration can be adapted to simulate PV fields with central inverters as well as the ones with string inverters or with power optimizers at module or string level. Moreover, it is suitable for the whole PV plant under

Modeling Approach
A generic representation of a large PV plant is shown in Figure 4. This basic configuration can be adapted to simulate PV fields with central inverters as well as the ones with string inverters or with power optimizers at module or string level. Moreover, it is suitable for the whole PV plant under

Modeling Approach
A generic representation of a large PV plant is shown in Figure 4. This basic configuration can be adapted to simulate PV fields with central inverters as well as the ones with string inverters or with power optimizers at module or string level. Moreover, it is suitable for the whole PV plant under investigation or for a specific subfield (strings, string boxes, etc.) by performing minimal modifications.  About the implementation of this kind of model in a simulation platform, it is possible to distinguish three main approaches: • Detailed models, in this case each component (array, converter, grid, etc.) is inserted in simulation considering its physical description, circuit topology and operation mode. For example, converters are modeled reproducing their detailed topology and their switching modulation technique [16] • Models in which physical description of components is neglected [17] • Intermediate models, which are a trade-off between the previous categories [14] The behavioural model introduced in this work belongs to the third category. It is based on a modified state-space averaging model which has been selected following the primary target to get a simplified representation for the multi-stage conversion system of large PV fields with distributed converters. In fact, in this circumstance there are usually both DC-DC converters (e.g., first stage of string inverters or power optimizers) and DC-AC converters (grid connected inverters, usually multilevel inverters).
Models of power converters for PV applications are designed in different ways. In the case of stringent requirements on computational effort for energy assessment in a long-term time horizon, the common approach to model DC-DC converters and their control system is the state-space averaging method, some examples are in [18][19][20]. Grid connected inverters are usually modeled using relationships coming from energy balances [14,19,21] or from equivalent circuits [10]. In the latter case, electrical quantities are sometimes expressed as phasors [22,23].
In the literature there is a lack of information about integrated modeling approaches able to represent multiple conversion stages in large PV fields. This work contributes to fill this gap. In fact, the main novelty is related to the creation of a complete model in which each component of the PV field is included using its state-space representation. Integration of different conversion stages is obtained thanks to the development of a direct way for the analytical calculation of current flowing in the DC-link.
All the components of the generation and conversion system are mixed into a single state-space average model obtaining an integrated representation for the entire PV system. In other words, the PV plant becomes a single state-space system. Inputs of such system are irradiance and cell temperature while outputs are the energy production and all the electric quantities in every part of the system. About the implementation of this kind of model in a simulation platform, it is possible to distinguish three main approaches: • Detailed models, in this case each component (array, converter, grid, etc.) is inserted in simulation considering its physical description, circuit topology and operation mode. For example, converters are modeled reproducing their detailed topology and their switching modulation technique [16] • Models in which physical description of components is neglected [17] • Intermediate models, which are a trade-off between the previous categories [14] The behavioural model introduced in this work belongs to the third category. It is based on a modified state-space averaging model which has been selected following the primary target to get a simplified representation for the multi-stage conversion system of large PV fields with distributed converters. In fact, in this circumstance there are usually both DC-DC converters (e.g., first stage of string inverters or power optimizers) and DC-AC converters (grid connected inverters, usually multilevel inverters).
Models of power converters for PV applications are designed in different ways. In the case of stringent requirements on computational effort for energy assessment in a long-term time horizon, the common approach to model DC-DC converters and their control system is the state-space averaging method, some examples are in [18][19][20]. Grid connected inverters are usually modeled using relationships coming from energy balances [14,19,21] or from equivalent circuits [10]. In the latter case, electrical quantities are sometimes expressed as phasors [22,23].
In the literature there is a lack of information about integrated modeling approaches able to represent multiple conversion stages in large PV fields. This work contributes to fill this gap. In fact, the main novelty is related to the creation of a complete model in which each component of the PV field is included using its state-space representation. Integration of different conversion stages is obtained thanks to the development of a direct way for the analytical calculation of current flowing in the DC-link.
All the components of the generation and conversion system are mixed into a single state-space average model obtaining an integrated representation for the entire PV system. In other words, the PV plant becomes a single state-space system. Inputs of such system are irradiance and cell temperature while outputs are the energy production and all the electric quantities in every part of the system.

Conversion System: Basic Converter Models
This section refers to a specific multi-stage topology, represented in Figure 5. Such topology has been chosen to better explain the modeling approach.

Conversion System: Basic Converter Models
This section refers to a specific multi-stage topology, represented in Figure 5. Such topology has been chosen to better explain the modeling approach. Focusing on the DC-DC converter stage and representing the same as a single block with a fictitious voltage source vin and a fictitious load R as in Figure 6, its state-space average form is obtained following the well-known procedure for which ON and OFF states have to be analyzed separately by building Kirchhoff equations for state variables and then mixed together to obtain the final average state-space system. An example is discussed in [18].  Focusing on the DC-DC converter stage and representing the same as a single block with a fictitious voltage source v in and a fictitious load R as in Figure 6, its state-space average form is obtained following the well-known procedure for which ON and OFF states have to be analyzed separately by building Kirchhoff equations for state variables and then mixed together to obtain the final average state-space system. An example is discussed in [18].

Conversion System: Basic Converter Models
This section refers to a specific multi-stage topology, represented in Figure 5. Such topology has been chosen to better explain the modeling approach. Focusing on the DC-DC converter stage and representing the same as a single block with a fictitious voltage source vin and a fictitious load R as in Figure 6, its state-space average form is obtained following the well-known procedure for which ON and OFF states have to be analyzed separately by building Kirchhoff equations for state variables and then mixed together to obtain the final average state-space system. An example is discussed in [18].  Variation of duty cycle d controlled by the MPPT algorithm can be modeled as a perturbation d superimposed to the steady-state duty cycle D [20]. The basic form of a state-space average model is in Equation (1): where: The extension of this state-space system, in presence of d, is: In this way, a simple state-space average system models the behavior of any DC-DC converter topology in presence of variations of duty cycle forced by the MPPT control system.
About the inverter and filtering stage in Figure 7, for sake of clarity in this section load is assumed to be three-phase inductive-resistive in delta connection without grid sources. Basic modeling approach is described in [23] exploiting a generalized state-space averaging method based on Fortescue symmetrical components and on Fourier transform. Variation of duty cycle d controlled by the MPPT algorithm can be modeled as a perturbation d  superimposed to the steady-state duty cycle D [20]. The basic form of a state-space average model is in Equation (1): where: The extension of this state-space system, in presence of d  , is: In this way, a simple state-space average system models the behavior of any DC-DC converter topology in presence of variations of duty cycle forced by the MPPT control system.
About the inverter and filtering stage in Figure 7, for sake of clarity in this section load is assumed to be three-phase inductive-resistive in delta connection without grid sources. Basic modeling approach is described in [23] exploiting a generalized state-space averaging method based on Fortescue symmetrical components and on Fourier transform.  The state-space form derives from Kirchhoff laws, similarly to what shown in [23]: where i ab , i bc and i ca are virtual line currents, m is the modulation index. The other parameters are shown in Figure 7.
The AC currents and voltages are represented as the sum of their Fortescue symmetrical components as follows: For each term, neglecting the presence of inverse and homopolar components as a first approximation, state variables are the real and imaginary parts of the direct component: Energies 2020, 13, 4777 The other components can be easily included as described in [23], if necessary. AC currents and voltages become: Starting from these relationships, state-space equations of the DC-AC converter can be integrated in the PV plant model described in Section 3.6. Quantities (x 1 , . . . , x 18 ) are state variables.

Analytical Calculation of DC-Link Current
From the description of conversion stages, merging of different modeling approaches becomes necessary to obtain a comprehensive representation for the entire PV system. The key parameter for obtaining such merging is the DC-link current i DCinv .
Average or RMS value of this current is usually calculated from power balances [24] or from integral calculation [24,25]. In some cases the latter method is applied using a reduced-order Fourier transform. Computational effort related to data storage in integrals is the main drawback of these methods.
In this work, calculation of the average value of DC-link current has been developed in a direct way using the AC current components. To explain this achievement, starting point is this equation: where s is a function representing PWM modulation signals [23]: Using prosthaphaeresis formulas to calculate the differences (s a −s b ), (s b −s c ), (s c −s a ), and focusing on zero-sequence components of DC-link current, initial relationship in Equation (11) becomes: The first term into square parenthesis is rewritten in this form: thanks to the following Fourier transform property: In Equation (14), the second term on the right is null. On the contrary, the other terms are calculated using the Euler formulas: Energies 2020, 13, 4777 10 of 27 that can be rewritten as follows: so that: Repeating the same elaboration for the other zero-sequence components in Equation (13), final algebraic equation linking the DC-link current to AC current components is: This is a straightforward way to calculate the average value of DC-link current from the direct-sequence components of AC currents which are state variables in this study. No any data storage is required to calculate integral terms. In other words, a significant simplification is obtained without decreasing the accuracy level.

PV Array Model
Accuracy of the PV array model is crucial for the accuracy of the entire system. Basic formulation selected to model PV modules is in [26]. Since large temperature variations take place in PV plants, a preliminary sensitivity analysis has been conducted in order to detect those parameters causing large modifications in I-V and P-V characteristics. In case of stringent requirements on accuracy level, these temperature-dependent parameters are tuned on-line by means of look-up tables. In this work, satisfactory accuracy has been obtained taking into account the influence of temperature on short-circuit current and on no-load voltage through the coefficients I sc /T module and V oc /T module listed in the datasheet of PV modules.
Equations in (20) are used to extend the model of a PV module to a PV array considering N s modules in series per string and N p strings in parallel at the stringboxes [27]: where R s is the series resistance and R p is the parallel resistance of the single-diode model [26]. For the sake of simplicity, in the following sections the terms in Equation (20) refer to PV arrays without using the subscript "array". As for the other components of the PV system, also PV arrays are modeled in state-space form: The basic equivalent circuit is shown in Figure 8. R LOAD is a fictitious load connected to the PV array.
Energies 2020, 13, 4777 The basic equivalent circuit is shown in Figure 8. RLOAD is a fictitious load connected to the PV array. Performing simple calculations, Equation (21) becomes: by assuming that: so that: and: Current iph is a function of irradiance G and of module temperature Tmodule as follows: where Kt is the temperature coefficient: The normal operating cell temperature (NOCT) is usually reported in a PV module datasheet. Also the current id is a function of G and of Tmodule [26]. Performing simple calculations, Equation (21) becomes: by assuming that: so that: and: Current i ph is a function of irradiance G and of module temperature T module as follows: where K t is the temperature coefficient: The normal operating cell temperature (NOCT) is usually reported in a PV module datasheet. Also the current i d is a function of G and of T module [26].

Transformers and Grid Model
The implementation of a detailed equivalent circuit for the LV-MV transformer as well as of a distributed parameters model for the AC grid is not feasible for computational effort reasons. Anyway, a satisfactory accuracy is obtained using a minimum order model as the one shown in Figure 5. Values of grid voltage sources, load resistors and load inductances are calculated referring to the primary side of the transformer.

PV Plant Model
Combination of models built for each component of the PV system in Figure 5 leads to a comprehensive state-space representation for the entire plant or for a specific subfield in the form of Equation (28). In this way, power and energy values at the power meter can be calculated from irradiance and cell temperature data by running such integrated state-space average model. In this representation, an input is the RMS value of phase-to-phase grid voltage V grid together with irradiance and module temperature. Alternatively, it can be assigned as a constant term into matrices. Control variables d and m are set by the control system described in Section 3.7.
Considering the whole PV system in Figure 5, its state-space representation is: where: Matrix A is: with: Non-zero elements in matrix A ON are listed here: sin ϕ a − π 6 (33) and: The elements in A OFF whose expression is different from the corresponding A ON terms are: Matrix B is: with: whose non-zero elements are: Each state variable can be extracted as output of the system thanks to a proper assignment of matrix C. Alternatively, the latter matrix is set as an identity matrix. Matrix D is null.
Evaluation of the energy produced by the plant is performed by building simple equations using variables from x 1 to x 12 i.e., by AC voltage and current components.
Values of DC and AC cables resistance, useful to calculate the distribution losses and to evaluate the same in case of different plant configurations, are added to resistive elements included in state-space matrices or inserted in the form of new resistive elements.

Control System
DC-DC converter stage is controlled in P&O MPPT mode. Control system of the grid connected inverter is implemented in qd reference frame using a common Phase Locked Loop (PLL) algorithm to regulate active and reactive power. Control subsystem for the d-axis current sets the DC-link voltage [28].
Time sampling of quantities in control system can be set larger than time step used for the simulation of PV plant e.g., 10x or more in order to allow a fast computational time. With reference to the specific case study discussed in this paper, design of control system needs to take into account additional constraints: maximum power internal threshold of each converter, power limitation strategy related to IGBT stack temperature and PPC limitation. The latter depends on thresholds fixed by the local utility company for the maximum power that the PV plant can deliver to the grid.

Parameters Identification
PV array parameters are identified exploiting data listed in the datasheet of modules and considering the power configuration of strings and stringboxes. On the contrary, some parameters e.g., R s and R p need to be identified. A suitable identification method, used for this work, is in [26]. Temperature coefficients of V oc and I sc reported in PV module datasheet and in Table 1 are also exploited.
About converters, detailed information on topology, hardware components and control system are usually not available due to know-how protection policies. It is worth noting that the modeling approach described in this paper allows to overcome this issue giving the opportunity to identify an equivalent behavioral model. To perform such identification, one of the many methods in literature for state-space functions can be used. In this work, identification process is performed using data collected by the monitoring system and applying a constrained minimums formulation focusing on the deviation between model outputs and real measurements [29]. Generally, let y measured (k) be a given electrical variable measured by plant datalogger in the form of a timeseries having N time samples: Corresponding quantity calculated by model is named y model (k). Each y model (k) sample can be expressed as the linear composition of parameters p n and of terms h ki . The latters fix the relation between y model (k) and each p i parameter for a given system input: y model (k) = h k1 p 1 + h k2 p 2 + . . . . +h kn p n (55) which is, for N time samples: Implementation of the constrained minimums formulation gives the optimal set of parameters reducing the deviation between measured values and the ones calculated by model:

Model Validation
The MatLab/Simulink environment is the software platform for testing the introduced model. Proposed approach is compared to the detailed model in which high-frequency switching and related phenomena are included. Validation process involves both accuracy and complexity performances of the proposed model for different operating scenarios. Figures 9 and 10 show time plots of main electric quantities in a 18.9 kW PV subfield whose topology is the one shown in Figure 5. Referring to case study, this PV array is the basic generation unit to which a single MPPT in string inverters is applied. It is composed by two strings in parallel. Each string is the series connection of 30 PV modules, see datasheet in Table 1.

Model Running
State-space average model is implemented in a straightforward way exploiting equations in Section 3.6 while in the detailed model all the components (PV modules, converters, filters, etc.) are placed into the simulation platform using their physical description. In Figures 9 and 10 time plots obtained by the proposed model running in two operating scenarios are superimposed to the ones provided by detailed model in the same conditions. Figure 9 depicts the waveforms of the main electric quantities obtained in case of 1000 W/m 2 as irradiance and of 25 • C as module temperature (standard test conditions, STC). Values of parameters for this simulation are in Table 2. The integrated state-space average model well matches the detailed model at both DC and AC side. Control system is able to force a null reactive power while active power at the power meter reaches about 18.5 kW i.e., close to the PV array STC rated power with a slight difference caused by power losses in DC and AC side.
In Figure 10 irradiance is 800 W/m 2 and module temperature is 45 • C. Main parameters used in this simulation are listed in Table 3. At time 0.1 s control system forces a non null reactive power requested by grid. Also in this case the average model is compliant to the detailed one.
Repeating similar comparisons for several scenarios, the average error caused by the proposed model with respect to the detailed one is always zero or very close to zero.
Finally, the integrated state-space average approach, built as described in previous sections, provides the average values of all the electric quantities in every operating condition. It can replace the detailed one in most analysis addressed to establish the performance of PV plants. Table 2. Parameters list for the simulation shown in Figure 9.

PV Module
See 110 f s,inv (kHz) *** 20 L 1 (mH) 10 f n (Hz) **** 50 r L1 (Ω) 0.5 ϕ a (t 0 ) (rad) 2π/15 . Simulation of a 18.9 kW PV subfield (2 strings in parallel, each string is composed by the series connection of 30 315 W PV modules). PV system topology is represented in Figure 5, technical data of components are listed in Table 2. Irradiance is 1000 W/m 2 , module temperature is 25 °C. Comparison between detailed model and integrated state-space average model. Figure 10. Simulation of a 18.9 kW PV subfield (2 strings in parallel, each string is composed by the series connection of 30 315 W PV modules). PV system configuration is represented in Figure 5, technical data of components are listed in Table 3. Irradiance is 800 W/m 2 , module temperature is 45 °C. At time 0.1 s control system forces non null reactive power. Comparison between detailed model and integrated state-space average model. Figure 9. Simulation of a 18.9 kW PV subfield (2 strings in parallel, each string is composed by the series connection of 30 315 W PV modules). PV system topology is represented in Figure 5, technical data of components are listed in Table 2. Irradiance is 1000 W/m 2 , module temperature is 25 • C. Comparison between detailed model and integrated state-space average model. Table 3. Parameters list for the simulation shown in Figure 10. 110 f s,inv (kHz) *** 20 L 1 (mH) 10 f n (Hz) **** 50 r L1 (Ω) 0.5 ϕ a (t 0 ) (rad) 2π/23

PV Module
223 m(t 0 ) 0.75 Figure 9. Simulation of a 18.9 kW PV subfield (2 strings in parallel, each string is composed by the series connection of 30 315 W PV modules). PV system topology is represented in Figure 5, technical data of components are listed in Table 2. Irradiance is 1000 W/m 2 , module temperature is 25 °C. Comparison between detailed model and integrated state-space average model. Figure 10. Simulation of a 18.9 kW PV subfield (2 strings in parallel, each string is composed by the series connection of 30 315 W PV modules). PV system configuration is represented in Figure 5, technical data of components are listed in Table 3. Irradiance is 800 W/m 2 , module temperature is 45 °C. At time 0.1 s control system forces non null reactive power. Comparison between detailed model and integrated state-space average model. Figure 10. Simulation of a 18.9 kW PV subfield (2 strings in parallel, each string is composed by the series connection of 30 315 W PV modules). PV system configuration is represented in Figure 5, technical data of components are listed in Table 3. Irradiance is 800 W/m 2 , module temperature is 45 • C. At time 0.1 s control system forces non null reactive power. Comparison between detailed model and integrated state-space average model.

Execution Time Performance
Referring to the same PV system analyzed in the previous Section and shown in Figure 5, the advantages of the proposed approach have been evaluated in terms of execution time performance by implementing two different cases: For both A and B cases, four working scenarios have been simulated. To get a significant statistical database, each scenario has been executed 1000 times using two different processors named computer 1 and computer 2.
The evaluation of the execution time has been performed using some proper stopwatch functions in MatLab/Simulink environment. Relative difference in execution time is calculated by applying this equation: where, considering a given operating scenario, t issa is the execution time of the integrated state-space average model while t det is the execution time of the detailed model. Tables 4-7 summarize the results in terms of execution time performance using a statistical approach based on mean value of ∆t % and on its standard deviation.
As expected, the novel behavioural model ensures a significant reduction in execution time for both the processors. In case A, the proposed model is about 3 times faster in comparison to the detailed model. In case B this difference rises to 25x. where, considering a given operating scenario, tissa is the execution time of the integrated state-space average model while tdet is the execution time of the detailed model. Tables 4-7 summarize the results in terms of execution time performance using a statistical approach based on mean value of Δt% and on its standard deviation.  where, considering a given operating scenario, tissa is the execution time of the integrated state-space average model while tdet is the execution time of the detailed model. Tables 4-7 summarize the results in terms of execution time performance using a statistical approach based on mean value of Δt% and on its standard deviation.  where, considering a given operating scenario, tissa is the execution time of the integrated state-space average model while tdet is the execution time of the detailed model. Tables 4-7 summarize the results in terms of execution time performance using a statistical approach based on mean value of Δt% and on its standard deviation.

Accuracy Evaluation
Evaluation of the accuracy of the proposed model is carried out by comparing power curves and daily energy with the measurements registered by power meters in the real plant.
Irradiance and module temperature timeseries, recorded by the monitoring system in the PV plant become inputs of the model together with the RMS value of the grid voltage. Figures 11 and 12 show the PV plant operation during some days. Power curve of the model is superimposed to the measured one. During these days no unavailability, missing data or external constraints occur, so that comparison can be consistent.
For each day, the calculation of the percentage relative error for the daily energy is pointed out using this equation:     In Figure 11, the maximum value of ε % is registered for the bottom chart which refers to QPCA 2. In this case, the difference in daily energy between model and real data is 77.2 kWh corresponding to +2.32%. The minimum value of ε % , −0.37%, is for QPCA 1 chart.
In Figure 12, the maximum value of ε % is registered for central inverter 1. In this case, the difference in daily energy between model and real data is −101.5 kWh corresponding to −2.16%. The minimum value of ε % , −0.81%, is for QPCA 2 chart.
Repeating similar analysis for all the days in the period from December 2017 to October 2019, it can be stated that maximum error caused by the model is in the range of 2.2-2.7% on a daily basis. This is an accuracy index for the model also for those cases in which the model is exploited to simulate plant behaviour in presence of missing data, PPC limitation, etc.
In case of partial shading as seen in the left picture of Figure 1, the consequent mismatch effects could worsen the ability of model to simulate the plant operation. Anyway, from a practical point of view, such effects can be usually neglected on the basis of the following points: • Sampling time of quantities acquired by dataloggers in large PV plants is typically 1 min or more. This is a limit for the detection of fast variations of irradiance or temperature • Very fast variations of irradiance or temperature (e.g., in the order of seconds) could perturb the acquired values but their effect is low. They also have limited effects on production because of the electrical and thermal capacity of the PV system. On the contrary, medium fast variations (in the order of minutes) create a fluctuation in the energy production while causing a variation in the model output which keeps very close to reality • Since the proposed model has a full scalability, an alternative could be running the model for every string exploiting local irradiance data. Unfortunately, this requires the presence of thousands of irradiance sensors. Because of the high costs, in existing plants there are no more than three sensors per MW • Evaluating the accuracy by applying Equation (60) for many days during which partial shading phenomena occur, the maximum error keeps around 2.5-2.7%

Sensitivity Analysis
With the purpose to evaluate the sensitivity of the model to parameters variation or uncertainty, the relative error ε % calculated by using Equation (60) when identification is performed as shown in Section 3.8, is compared to ε p% calculated with the same equation but considering two different cases: • ε p1.1% , in this case the value of a certain parameter is 1.1 times the optimal value identified by using Equation (58), optimal values are assigned to the other parameters • ε p0.9% , in this case the value of a certain parameter is 0.9 times the optimal value identified by using Equation (58), optimal values are assigned to the other parameters Value of sensitivity for both cases is: Taking into account main parameters for every section of the PV plant and perturbing their basic values one by one as described above, Tables 8 and 9 report the average results obtained by running the model for several days in the period from December 2017 to October 2019.
As expected, it emerges that parameters belonging to PV array model have a large sensitivity causing the worsening of accuracy in case of deviation from optimal values. The same is for r L1 . The other parameters feature a low sensitivity.

Relevance of the Introduced Model in Supporting Data Analysis
Referring to the case study, the usefulness of the proposed modeling approach is highlighted in this Section by showing some examples. In Figure 13 the power measured at the meter of the string inverters group QPCA 1 is compared to the power curve obtained in simulation for the same subfield (whose rated DC power is 1209.6 kW) in a day during which a power limitation occurs because of the inverter internal maximum power threshold. Power plots indicate that, thanks to the model simulation, theoretical extra-energy is estimated with a good accuracy.
In Figure 14 model is applied to estimate the energy lost in central inverter 1 subfield caused by a fault in solar trackers power supply. Also in this case, the model is able to overcome wrong data due to the abnormal operation.
0.08 C 2 0.16 R LOAD 0.01 L LOAD 0.01 Table 9. Sensitivity in case of perturbation equal to 0.9 times the optimal parameter value.

Relevance of the Introduced Model in Supporting Data Analysis
Referring to the case study, the usefulness of the proposed modeling approach is highlighted in this Section by showing some examples. In Figure 13 the power measured at the meter of the string inverters group QPCA 1 is compared to the power curve obtained in simulation for the same subfield (whose rated DC power is 1209.6 kW) in a day during which a power limitation occurs because of the inverter internal maximum power threshold. Power plots indicate that, thanks to the model simulation, theoretical extra-energy is estimated with a good accuracy.
In Figure 14 model is applied to estimate the energy lost in central inverter 1 subfield caused by a fault in solar trackers power supply. Also in this case, the model is able to overcome wrong data due to the abnormal operation.

Relevance of the Introduced Model in Supporting Data Analysis
Referring to the case study, the usefulness of the proposed modeling approach is highlighted in this Section by showing some examples. In Figure 13 the power measured at the meter of the string inverters group QPCA 1 is compared to the power curve obtained in simulation for the same subfield (whose rated DC power is 1209.6 kW) in a day during which a power limitation occurs because of the inverter internal maximum power threshold. Power plots indicate that, thanks to the model simulation, theoretical extra-energy is estimated with a good accuracy.
In Figure 14 model is applied to estimate the energy lost in central inverter 1 subfield caused by a fault in solar trackers power supply. Also in this case, the model is able to overcome wrong data due to the abnormal operation.
These examples highlight how the introduced model is a very useful tool to track the behaviour of PV plant in normal operation as well as in case of missing data, PPC limitation due to grid capability and so on.  These examples highlight how the introduced model is a very useful tool to track the behaviour of PV plant in normal operation as well as in case of missing data, PPC limitation due to grid capability and so on.

Central vs String Inverters: Comparison Results
Performances of central and string inverters in case study have been compared looking at the post-commissioning period (5 months long) from December 2017 to April 2018 i.e., during summer and autumn in Brazil. Basic equation used to evaluate the gain obtained by string converters Gstring_inv is: Figure 13. Model output vs. measured data of string inverters subsection QPCA 1 during a specific day. The integrated state-space average model is able to estimate the theoretical extra energy that the subfield could produce in absence of inverter internal maximum power threshold.  These examples highlight how the introduced model is a very useful tool to track the behaviour of PV plant in normal operation as well as in case of missing data, PPC limitation due to grid capability and so on.

Central vs String Inverters: Comparison Results
Performances of central and string inverters in case study have been compared looking at the post-commissioning period (5 months long) from December 2017 to April 2018 i.e., during summer and autumn in Brazil. Basic equation used to evaluate the gain obtained by string converters Gstring_inv is: Figure 14. Model output vs. measured data of central inverter 1 subsection during a day in which a fault occurs in trackers power supply. The integrated state-space average model is able to estimate the theoretical extra energy that the subfield could produce in absence of such fault.

Central vs. String Inverters: Comparison Results
Performances of central and string inverters in case study have been compared looking at the post-commissioning period (5 months long) from December 2017 to April 2018 i.e., during summer and autumn in Brazil. Basic equation used to evaluate the gain obtained by string converters G string_inv is:

100
(62) where, on a daily basis, E string_inv is the energy produced by string inverters (or by a QPCA), E central_inv is the energy produced by central inverters (or by a single central inverter), P is the DC rated power of each subfield. Such comparison criterion corresponds to the comparison between performance ratio values. Figure 15 shows the energy produced in subfields with central and string inverters measured during a cloudy day, in absence of power derating, unavailability, PPC limitation or other constraints. In this specific case, the gain of string inverters subfield in comparison to central inverters is +1.5%, calculated by Equation (62). the energy produced by central inverters (or by a single central inverter), P is the DC rated power of each subfield. Such comparison criterion corresponds to the comparison between performance ratio values. Figure 15 shows the energy produced in subfields with central and string inverters measured during a cloudy day, in absence of power derating, unavailability, PPC limitation or other constraints. In this specific case, the gain of string inverters subfield in comparison to central inverters is +1.5%, calculated by equation (62). This calculation has been repeated for all the days in the considered period. In case of issues caused by wrong data, PPC limitation, partial unavailability, etc. occurring for one or more converters, model presented in this paper has been used to replace data coming from the real PV plant as shown in the previous Section.
Referring to the fixed time period, aggregated results are reported in Figure 16 with distribution charts including a normal density function fitting curve. Finally, the average gain obtained by the string converters is around +2.1% in terms of daily production, with a standard deviation of about 1.3%.  This calculation has been repeated for all the days in the considered period. In case of issues caused by wrong data, PPC limitation, partial unavailability, etc. occurring for one or more converters, model presented in this paper has been used to replace data coming from the real PV plant as shown in the previous Section.
Referring to the fixed time period, aggregated results are reported in Figure 16 with distribution charts including a normal density function fitting curve. Finally, the average gain obtained by the string converters is around +2.1% in terms of daily production, with a standard deviation of about 1.3%.   . The integrated state-space average model presented in this paper is used to replace data recorded by the monitoring system of the real PV plant in case of data issues, unavailability or external constraints.

Conclusions
This work deals with behavioral modeling of large PV plants. To evaluate the performance of distributed multi-stage converters in presence of issues caused by erroneous data, unavailability or external constraints, a new integrated state-space average model addressed to large or utility-scale PV plants has been introduced getting a satisfactory tradeoff between computational effort and simulation accuracy. A straightforward way for the calculation of the average DC-link current from the direct-sequence components of AC currents has been developed and exploited to reduce the computational complexity without decreasing the accuracy level.
Based on the calculation of the daily energy production in the period from December 2017 to October 2019, the maximum error caused by the model is in the range of 2.2-2.7% with respect to the energy measured by power analyzers. A case study has been investigated focusing on performance comparison between central and string converters installed for testing purpose in two subfields of a 300 MW PV plant in Brazil. Elaboration of data acquired by the monitoring system for 5 months in the PV plant is effectively supported by the developed model. Aggregate results highlight that string inverters ensure a gain of about 2% in terms of produced energy.
Future works will be addressed to the identification of phenomena leading to mismatch effects compensated by string inverters. Moreover, the same modeling approach will be applied for other real PV plants with distributed converters having different size and power configuration.  . The integrated state-space average model presented in this paper is used to replace data recorded by the monitoring system of the real PV plant in case of data issues, unavailability or external constraints.

Conclusions
This work deals with behavioral modeling of large PV plants. To evaluate the performance of distributed multi-stage converters in presence of issues caused by erroneous data, unavailability or external constraints, a new integrated state-space average model addressed to large or utility-scale PV plants has been introduced getting a satisfactory tradeoff between computational effort and simulation accuracy. A straightforward way for the calculation of the average DC-link current from the direct-sequence components of AC currents has been developed and exploited to reduce the computational complexity without decreasing the accuracy level.
Based on the calculation of the daily energy production in the period from December 2017 to October 2019, the maximum error caused by the model is in the range of 2.2-2.7% with respect to the energy measured by power analyzers. A case study has been investigated focusing on performance comparison between central and string converters installed for testing purpose in two subfields of a 300 MW PV plant in Brazil. Elaboration of data acquired by the monitoring system for 5 months in the PV plant is effectively supported by the developed model. Aggregate results highlight that string inverters ensure a gain of about 2% in terms of produced energy.
Future works will be addressed to the identification of phenomena leading to mismatch effects compensated by string inverters. Moreover, the same modeling approach will be applied for other real PV plants with distributed converters having different size and power configuration.