Development of a Hybrid Modeling Framework for the Optimal Operation of Microgrids

Lee, Jaekyu; Park, Eunseop; Lee, Sangyub

doi:10.3390/en18082102

Open AccessArticle

Development of a Hybrid Modeling Framework for the Optimal Operation of Microgrids

by

Jaekyu Lee

^†

,

Eunseop Park

^†

and

Sangyub Lee

^*

Energy IT Convergence Research Center, Korea Electronics Technology Institute, 25 Saenari-ro, Bundang-gu, Seongnam-si 13509, Gyeonggi-do, Republic of Korea

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Energies 2025, 18(8), 2102; https://doi.org/10.3390/en18082102

Submission received: 24 February 2025 / Revised: 4 April 2025 / Accepted: 16 April 2025 / Published: 18 April 2025

(This article belongs to the Special Issue Advances in Power Distribution Systems)

Download

Browse Figures

Versions Notes

Abstract

This paper presents a study on the development of a hybrid modeling framework for the optimal operation of microgrids based on renewable energy resources. Accurate prediction of both renewable energy generation and consumer demand is crucial for the efficient management of renewable energy-based microgrids. The proposed hybrid modeling framework integrates a high-resolution physical model for forecasting renewable energy sources (solar and wind), a data-driven model for renewable energy prediction, and a hybrid forecasting model that combines both physical and data-driven approaches. Additionally, the framework incorporates a consumer demand model to further optimize grid operations. In this research, a hybrid prediction model was developed to enhance the accuracy of solar and wind power generation forecasts. The hybrid model leverages the complementary strengths of both physical and data-driven models. When historical data are insufficient, the physical model generates synthetic training data to improve the learning process of the data-driven model. Moreover, in cases where the data-driven model exhibits limited predictive accuracy due to insufficient training data, the physical model provides reliable forecasts, ensuring robust performance under various conditions. When sufficient real-world data are available, the Weighted Inverse Error Weighting (WIEW) strategy is applied to dynamically integrate the outputs of both models, significantly enhancing forecasting accuracy. Furthermore, a digital twin platform was implemented to operate and simulate each model, and a validation system for the digital twin platform and models was established using Software-in-the-Loop Simulation (SILS) and Power Hardware-in-the-Loop Simulation (PHILS) techniques. This study focuses on the development and validation of a hybrid model designed to improve the accuracy of solar and wind power generation forecasts for renewable energy microgrids.

Keywords:

hybrid model; digital twin framework; digital twin; optimal operation of microgrid; PHILS (power hardware-in-the-loop simulation); SILS (software-in-the-loop simulation)

1. Introduction

The global climate crisis is accelerating, with increasingly severe impacts such as rising temperatures, extreme weather events, and shifting ecosystems. As a response, many countries have committed to achieving carbon neutrality by mid-century, driven by the urgent need to reduce greenhouse gas emissions and mitigate the effects of climate change [1,2]. The transition to renewable energy resources, particularly solar and wind power, is essential for reducing reliance on fossil fuels and curbing carbon emissions [3]. However, the integration of these renewable energy sources into power grids presents significant challenges due to their intermittent and variable nature [4]. Microgrids, which are localized energy systems capable of operating independently or alongside the main grid, have emerged as a key solution to improve the reliability and efficiency of renewable energy integration [5]. By incorporating distributed renewable energy resources, such as solar and wind, microgrids enhance grid resilience and contribute to the broader goals of carbon neutrality and climate stabilization [6]. However, the optimal operation of renewable energy-based microgrids relies heavily on accurate forecasting of both energy generation and consumer demand. In particular, solar and wind power generation can be highly variable due to changing weather conditions, making reliable prediction models essential for ensuring the stability of the microgrid [7,8].

The efficient operation of microgrids depends not only on the accurate forecasting of renewable energy generation but also on the precision of electricity demand predictions. Inaccurate forecasting of both generation and demand can lead to supply–demand imbalances, which, in turn, reduce the stability of energy supply, increase unnecessary power waste, and incur additional operational costs [9]. In particular, when forecasting errors are significant, microgrid operators must secure additional energy storage systems (ESSs) and increase the capacity of auxiliary power generation, leading to higher initial investment and maintenance costs [10]. Additionally, mismatches between power generation and demand necessitate frequent system adjustments to maintain stability, further escalating overall operational costs [11]. Conversely, improving forecasting accuracy can maximize annual energy savings in microgrids while minimizing the infrastructure costs required for system deployment and operation. Enhanced prediction accuracy allows for optimal sizing of ESSs and power generation units, thereby preventing excessive capital investment in equipment [12]. Moreover, from a maintenance perspective, higher forecasting precision reduces unnecessary system operations and maintenance activities, leading to improved operational efficiency and long-term cost savings. Therefore, to ensure the stable and cost-effective operation of microgrids, it is essential to enhance the accuracy of forecasting models for solar and wind power generation while reducing errors in electricity demand prediction.

This paper presents a hybrid modeling framework designed to optimize the operation of microgrids by improving the accuracy of renewable energy forecasts. The framework integrates high-resolution physical models, data-driven models, and a combination of both, allowing for enhanced prediction of solar and wind power generation. Specifically, hybrid models for solar and wind energy generation forecasting are crucial for ensuring the stability and efficiency of microgrids. Furthermore, integrating these models into digital twin platforms allows for real-time simulation and validation, ensuring that the models are both accurate and practical for real-world applications.

Building on this foundation, this paper focuses on the development of a hybrid modeling framework designed to optimize the operation of microgrids by enhancing the accuracy of renewable energy forecasts. The framework integrates physical and data-driven models for solar and wind energy forecasting, as well as consumer demand modeling, within a digital twin platform. Additionally, we established a validation system for the digital twin platform and each model based on SILS and Hardware-in-the-Loop Simulation (HILS). By improving forecasting accuracy, this research contributes to the broader goals of carbon neutrality and provides valuable insights into the future of sustainable energy systems.

This paper is structured as follows. In Section 2, we review existing studies on hybrid models and the enhancement of renewable energy generation forecasting using hybrid models. Section 3 provides a detailed description of the proposed system’s architecture, including the development of physical models and data models for predicting renewable energy (solar and wind) generation. Additionally, it discusses the development of the hybrid model, which combines the physical and data models to improve renewable energy forecasting accuracy. Section 4 addresses the implementation of the renewable energy digital twin model, along with the simulation environment and performance validation. Finally, Section 5 concludes with a summary of the developed work and suggestions for future research directions.

2. Related Work

Forecasting the variability of renewable energy sources has become increasingly critical for achieving reliable and efficient power system operation. Various artificial intelligence techniques have been actively studied to improve the accuracy of renewable energy generation forecasting, including solar and wind power prediction. Conventional forecasting models have primarily relied on data-driven approaches using collected historical and meteorological data. When data availability is limited, physics-based models have been adopted to simulate the physical behavior of energy systems. Recently, an increasing number of studies have focused on hybrid modeling frameworks that integrate both data-driven and physics-based methods to enhance predictive performance and generalizability. Ref. [13] proposed a hybrid modeling structure that integrates machine learning with simulation-based models. Their study outlines several hybridization strategies and emphasizes the potential of combining physically interpretable and data-adaptive components for improved forecasting accuracy. Unlike this general-purpose framework, the current study constructs a task-specific hybrid model tailored for renewable energy forecasting by embedding a computational fluid dynamics (CFD) model with transfer learning-based data-driven modules. Similarly, Ref. [14] presented a comprehensive review of hybrid modeling methodologies, emphasizing their strengths in combining data-driven learning with first-principles knowledge to enhance model accuracy, generalizability, and interpretability. Their study highlights the applicability of hybrid approaches across various domains, including energy systems, and provides methodological guidance for their effective implementation. This perspective further reinforces the value of hybrid models in renewable energy forecasting, where capturing both physical system dynamics and data-driven patterns is critical for accurate and reliable predictions. Ref. [15] proposed a deep learning-based hybrid model for short-term load forecasting and smart grid information management. The model combines GRU, TCN, and attention mechanisms to effectively learn long-term dependencies and complex temporal patterns in load data. It demonstrated high prediction accuracy and computational efficiency across four public datasets, including GEFCom2014. This framework, which integrates multiple deep learning architectures, represents a strong example of balancing predictive performance, efficiency, and interpretability. This methodological approach, which is similar to our research, highlights the growing importance of hybrid modeling in energy systems, particularly in the context of smart grid operation and intelligent energy management. Ref. [16] introduced a hybrid framework combining machine learning with finite element analysis to enhance the accuracy of a blade-bearing test bench model. Using random forest algorithms to estimate non-measurable parameters, their work demonstrated the effectiveness of integrating data-driven and physics-based methods.

Ref. [17] developed a hybrid CNN-LSTM architecture in which CNNs extract spatial features from weather inputs, and LSTMs capture temporal patterns. Their model achieved a 21% improvement in forecasting accuracy over standalone deep learning models. However, their approach is limited to data-driven deep learning methods and does not incorporate physical system knowledge, which may affect generalization under unseen environmental conditions. Similarly, Ref. [18] proposed an adaptive learning-based hybrid model for solar irradiance forecasting. This model integrates a Time-Series Linear Model (TMLM), a Genetic Algorithm-Based Neural Network (GABP), and an adaptive online learning algorithm. The TMLM analyzes short-term fluctuations in solar irradiance, while the GABP optimizes the neural network’s weights using a genetic algorithm. Additionally, the adaptive online learning algorithm continuously updates the model in real time to enhance performance in response to environmental changes. This approach demonstrated superior performance compared to conventional models in both short-term and long-term solar irradiance forecasting, maintaining high reliability, even under highly variable environmental conditions. Compared to this work, the present study emphasizes hybridization across modeling paradigms, combining physics-informed predictions with data-driven learning. Ref. [19] introduced a pipeline integrating clustering, classification, and regression to improve PV forecasting under weather variability. This study aimed to address the issue of PV power fluctuations caused by varying weather conditions and emphasized the importance of accurate forecasting for effective grid operation management. The proposed method focused on improving PV forecasting accuracy by analyzing multiple machine learning models and weather parameters. Clustering and classification algorithms were utilized to group data points with similar weather conditions, followed by the application of segmented regression to these groups. The forecasting model was designed to utilize next-day weather predictions to determine the most suitable model for forecasting. Their approach proved effective in handling data-driven variability. However, it does not incorporate physical modeling. In contrast, our framework addresses this gap by combining synthetic data generation through CFD simulations with machine learning in a unified forecasting structure. Furthermore, whereas their model relies on post-clustering regression for prediction, our method integrates real-time physical simulation outputs directly into the learning process, enabling improved generalization under diverse and unseen environmental conditions. Ref. [20] investigated short-term forecasting techniques for PV power generation, analyzing the impact of various weather conditions, such as solar irradiance and cloud cover, on power output prediction. To enhance forecasting accuracy, five different prediction models were developed and tested. These models included an Autoregressive Integrated Moving Average (ARIMA), a Support Vector Machine (SVM), an Artificial Neural Network (ANN), an Adaptive Neuro-Fuzzy Inference System (ANFIS), and a hybrid model utilizing a genetic algorithm (GA). The study results demonstrated that the GA-based hybrid model outperformed individual forecasting models in terms of accuracy and efficiency. However, all the methods evaluated were purely data-driven, whereas our framework incorporates physics-based simulations for better reliability under sparse data conditions. Ref. [21] proposed an advanced hybrid ensemble learning framework for solar power forecasting, providing a reliable predictive model for grid stability and renewable energy integration. This study applied the Ensemble Averaging technique, which leverages the strengths of multiple machine learning models and integrates them into a single predictive model. The Ensemble Averaging model combined five individual forecasting models using a weighted-averaging approach to predict solar power generation. The models used in the forecasting process included the Nonlinear Autoregressive Neural Network (NAR-NN), Nonlinear Autoregressive Neural Network with Exogenous Signal (NARX-NN), Least Squares Boosted Decision Tree Model, Support Vector Regressor (SVR) with RBF Kernel, and Extreme Learning Machine (ELM). Ref. [22] proposed a hybrid deep learning model combining Wavelet Packet Decomposition (WPD) and LSTM networks to forecast PV power generation one hour ahead at five-minute intervals. The proposed approach first utilizes WPD to decompose the original PV power time series into multiple sub-series. Then, four independent LSTM networks are trained separately on these sub-series. Finally, the predicted results from each LSTM model are reconstructed, and a linear weighting method is applied to obtain the final forecasting output. Although effective in handling temporal variation, the absence of physical insights limits the robustness of their approach in extreme weather scenarios. Moreover, in newly deployed or data-scarce sites where sufficient historical data are not available, the lack of a simulation-based physical model hinders the ability to ensure accurate forecasting and system reliability. Ref. [23] proposed a novel forecasting method for solar irradiance, which is directly correlated with PV power generation, considering both short-term and medium-term forecasting horizons. The study introduced a hybrid framework based on the truncated-regularized kernel ridge regression model, applying a fast-trainable statistical learning technique to enhance prediction performance. By incorporating multiple weather parameters as input variables, the model demonstrated improved scalability and exhibited high forecasting accuracy, particularly in highly variable weather conditions. The proposed model was validated using real-world datasets collected from Seattle and Medford, USA, including both clear and cloudy weather conditions. In contrast, our model leverages simulation-driven data to supplement real-world measurements, addressing scenarios where field data are insufficient. Ref. [24] conducted a study to enhance the accuracy of three-hour accumulated solar irradiance forecasts provided by Numerical Weather Prediction (NWP) systems using SVR, Gradient Boosted Regression (GBR), Random Forest Regression (RFR), and a hybrid approach combining these techniques. This study explores the potential application of machine learning (ML) and hybrid artificial intelligence systems in solar irradiance forecasting, demonstrating that ML-based hybrid models effectively complement NWP forecasts and improve the accuracy of solar irradiance predictions. Unlike post hoc correction methods, our model tightly integrates forecasting and simulation components within a unified predictive loop. Ref. [25] developed a hybrid PV forecasting model that combines irradiance-based physical features with a two-stage data-driven framework, improving accuracy and efficiency in microgrid applications. This approach closely aligns with our method of integrating physical and machine learning components for renewable energy forecasting.

Ref. [26] developed a machine learning-based hybrid model to improve the accuracy of wind power forecasting. This study proposed a hybrid forecasting approach integrating Gray Relational Analysis (GRA) and wind speed distribution characteristics. The model dynamically adjusted the weights of individual forecasting models based on wind speed intervals and similar wind speed frequencies, enabling more precise wind power predictions. The results demonstrated that the proposed forecasting model exhibited high applicability in very-short-term (15-minute-ahead) wind power forecasting. Their method has shown promise in improving predictive accuracy for wind power in ultra-short-term horizons; however, it does not incorporate model-based simulation or dynamic weighting—two central contributions of our framework. Without these components, the robustness and adaptability of the model may be limited, particularly in data-scarce environments or under dynamic system conditions. Ref. [27] proposed the ESMD-PSO-ELM hybrid forecasting model, which integrates Extreme-Point Symmetric Mode Decomposition (ESMD), ELM, and Particle Swarm Optimization (PSO). The proposed approach first applies ESMD to decompose wind power data into multiple Intrinsic Mode Functions (IMFs) and a residual component (R). Then, the PSO-ELM method is employed to predict each IMF and R individually. Finally, the predicted values of these components are combined to generate the final wind power forecast. This study validated the model using real-world wind power data collected in Yunnan, China, from 1 April 2016 to 30 April 2016, comprising a total of 2880 observations. The empirical results confirmed that the ESMD-PSO-ELM model outperforms traditional forecasting methods, demonstrating greater robustness and higher prediction accuracy. Their modular design parallels our use of component-wise integration but differs in lacking a real-time digital twin validation environment. While their approach effectively decomposes complex time series and optimizes prediction through evolutionary algorithms, it does not incorporate real-time feedback from simulated system behavior. In contrast, our framework embeds the hybrid prediction model within a digital twin infrastructure, enabling continuous validation, monitoring, and adaptation of model predictions based on evolving system dynamics. Ref. [28] proposed a hybrid ultra-short-term wind power forecasting model by integrating the strengths of data-driven and physics-based models. In this study, a wind power conversion model was developed to address periods of significant wind speed fluctuations, incorporating the inertial operating characteristics of wind turbines as a physics-based forecasting model. Conversely, during stable power output phases, a data-driven ultra-short-term forecasting framework utilizing a temporal attention mechanism was introduced. The proposed model was validated using real-world wind farm data from Inner Mongolia, China, and the experimental results demonstrated that the root mean square error (RMSE) remained below 13% for four-hour-ahead predictions. This approach shares conceptual similarities with our work; however, our model extends the principle further by applying it across both solar and wind energy forecasting domains. In addition, we employ the Weighted Inverse Error Weighting (WIEW) strategy for the real-time integration of physical and data-driven model outputs, allowing for enhanced adaptability and robustness under varying environmental conditions. Ref. [29] proposed a high-precision hybrid approach that integrates historical wind farm data and NWP data for wind power forecasting. The forecasting process consists of three sequential stages: wind direction prediction, wind speed prediction, and wind power prediction, where the same hybrid methodology is applied in each stage, differing only in the input dataset. The proposed approach comprises outlier detection, time-series decomposition using wavelet transform, effective feature selection, and prediction using a Multilayer Perceptron (MLP) neural network. The model was validated using real-world data from the Sotavento wind farm in Spain, and the experimental results demonstrated exceptionally high forecasting accuracy. However, while the multi-stage design enhances the precision of wind power forecasts, it lacks a mechanism for integrating physical modeling or enabling real-time system feedback. In contrast, our framework embeds hybrid forecasting models within a digital twin environment, allowing for continuous performance validation and adaptation in response to dynamic environmental and operational changes. Unlike their static structure, our model adapts dynamically to data quality and availability, ensuring robust performance in both high-data and low-data regimes. Ref. [30] proposed a novel hybrid forecasting approach by integrating the ARIMA model with a backpropagation neural network (BPNN). Their hybrid model was designed to enhance prediction performance by incorporating wind speed, wind direction, and the physical constraints of wind farms. The effectiveness of the proposed method was validated using real-world wind farm data, and the results indicated improved forecasting accuracy. However, the model lacks an adaptive mechanism that incorporates real-time physical simulation or continuous validation, limiting its scalability and flexibility under dynamic grid or weather conditions. In contrast, our framework leverages a digital twin structure to iteratively refine predictions based on ongoing system states, thereby enhancing robustness and adaptability in operational environments. Ref. [31] proposed a hybrid approach that integrates elements of physical and statistical models to improve forecasting accuracy. This study developed three forecasting systems using Artificial Neural Networks (ANNs) to predict wind power generation at 1 h, 3 h, 6 h, 12 h, and 24 h horizons. Additionally, the study analyzed forecasting errors across different time horizons and evaluated the statistical distribution of the prediction errors. The results demonstrated that the hybrid model, which combines NWP data and real-time measurements, outperforms purely statistical models, particularly in long-term wind power forecasting. However, their framework lacks systematic integration with real-time simulation feedback and does not implement adaptive learning strategies that respond to changing system dynamics. In contrast, our proposed model operates within a digital twin environment, enabling continuous refinement of forecasting outputs based on real-time measurements and simulation feedback using the SILS and PHILS methodologies. Ref. [32] proposed a hybrid forecasting model that integrates chaotic analysis and granular computing (GrC) for wind power prediction. The study conducted a physical analysis of the chaotic characteristics of wind power time series and applied data reconstruction in the chaotic phase space, generating low-dimensional input data with richer information to improve modeling accuracy. The proposed approach was validated using real-world wind farm data, demonstrating its effectiveness in enhancing short-term wind power forecasting accuracy. Our model differentiates itself by integrating these perspectives within a digital twin framework using SILS and PHILS for system-level validation. Ref. [33] developed a hybrid wind power forecasting model combining EMD and ESN, which improved accuracy and efficiency across multiple datasets. Their approach aligns with our goal of integrating physical features and machine learning to enhance wind forecasting in microgrid and regional settings.

These related studies collectively demonstrate the importance of hybrid approaches in renewable energy forecasting. However, most prior works focus on model-level hybridization without leveraging simulation for training or real-time integration. Building on these studies, the proposed research advances the field by introducing a digital twin-based hybrid framework that not only combines data-driven and physical models but also utilizes simulation outputs as synthetic data and adaptively fuses predictions using the WIEW strategy.

3. Development of a Hybrid Modeling Framework

This chapter presents the development of solar and wind power forecasting models, which are the core components of the hybrid modeling framework. It details the construction of data-driven prediction models and simulation-based physical models for solar and wind power generation. Furthermore, it describes the process of integrating these models to develop a hybrid forecasting model for improved prediction accuracy. Additionally, this chapter discusses the establishment of a high-availability server infrastructure to efficiently collect and store data from the hybrid forecasting model and demonstration site.

3.1. Development Structure and Data Flow of the Hybrid Modeling Framework

The development structure and data flow of the hybrid modeling framework are illustrated in Figure 1. This architecture consists of two main components: the hybrid modeling framework and the energy digital twin service that utilizes this framework. The hybrid modeling framework incorporates a hybrid forecasting model for predicting renewable energy generation. Additionally, it provides key components such as:

A simulation library for integration with the digital twin (SILS);
Hybrid forecasting model for solar and wind power generation prediction;
A PHILS module for validating simulation data;
An Energy Management System (EMS) algorithm for managing electrical energy within microgrids;
A data acquisition module for collecting electrical energy data from renewable energy generation sites;
A visualization toolkit for real-time monitoring and analysis.

Figure 1. Development structure and data flow of the hybrid modeling framework.

The energy digital twin service leverages the developed hybrid modeling framework to integrate both actual and virtual power generation assets, enabling energy balancing simulations. This allows the framework to perform pre-simulations of power generation and demand balancing across various microgrids or to receive real-time data from actual microgrids for continuous monitoring of grid conditions. The proposed hybrid modeling framework integrates predictive simulation and real-time system monitoring capabilities, maximizing microgrid operational efficiency and contributing to the optimization of power generation and consumption balance.

3.2. Development of a Physical Model

CFD-based physical models can numerically analyze complex fluid flows. It can simulate heat transfer, radiative heat effects, and wake effects, enabling the assessment of their impact on solar panels and wind turbines. Based on these analyses, the physical model can provide explanations for the predicted results and generate high-quality data when direct data collection is challenging. Additionally, CFD models allow for the generation of results under various conditions by adjusting parameters, offering significant advantages in predictive modeling.

3.2.1. Solar Power Physical Model

Figure 2 is a satellite image of an actual solar power plant. The solar physics model utilizes the design drawings of the solar panels from the real-world plant to create a 3D model for numerical analysis. By analyzing solar irradiance, the model estimates power generation. Consequently, the solar physics model can predict power generation values based on forecasted irradiance data.

The prediction procedure based on the physical model begins by utilizing NWP to forecast solar irradiance. The predicted solar irradiance values are then used as input to the solar physical model to estimate solar power generation. The solar irradiance prediction model was developed using one year of NWP data from 2022 and was validated using the 2023 NWP dataset. Figure 3a presents the results of the solar irradiance prediction, where the x-axis represents the timestamp and the y-axis represents irradiance. The blue line indicates the measured solar irradiance, while the orange line represents the predicted values. Figure 3b shows the predicted solar power generation results. The x-axis denotes the timestamp, and the y-axis corresponds to the solar power generation. The blue line indicates the actual measured power output, whereas the orange line represents the predicted values calculated by the physical model.

3.2.2. Wind Turbine Physical Model

The wind turbine physical model interprets the blade rotation speed and wind speed to calculate the rotational torque and uses the resulting power coefficient for modeling wind power generation. Figure 4 illustrates the geometry used for the wind turbine analysis. This model utilizes the wind speed measured at the nacelle of the wind turbine as input to estimate the wind power output. Similar to the solar physical model, NWP data are used to forecast the nacelle wind speed, and the predicted wind speed is then used to estimate the wind power generation. The nacelle wind speed prediction model was developed using one year of NWP data from 2022, and its performance was validated using the 2023 NWP dataset. Figure 5a presents the prediction results for the nacelle wind speed, where the x-axis represents the timestamp, and the y-axis indicates the wind speed at the nacelle. The blue line shows the measured wind speed, while the orange line represents the predicted wind speed. Figure 5b illustrates the wind power prediction results. The x-axis represents the timestamp, and the y-axis indicates the wind power generation. The blue line corresponds to the actual measured power, while the orange line indicates the predicted values calculated using the physical model.

3.3. Development of a Data Model

The data model uses datasets collected from solar inverters, wind turbines, power generation data, and NWP data and employs machine learning to analyze the pattern and predict power generation. The performance of the data model improves with the availability of high-quality data, and it has the advantage of faster execution compared to the physical model. The data model has been developed as either an integrated prediction model or individual models, depending on the size of the power generation plant.

3.3.1. Solar Power Data Model

To develop an effective solar power data model, we compared and analyzed various machine learning models and constructed a dataset that included NWP data and additional solar information to enhance prediction accuracy. Several algorithms were evaluated based on key performance metrics such as the normalized MAE (nMAE), the normalized RMSE (nRMSE), and the normalized Mean Absolute Percentage Error (nMAPE) to identify the model with the highest performance. Through iterative optimization and data preprocessing, we improved the dataset to maximize the predictive capability of the selected model. As a result, we successfully developed a high-performance solar power data model optimized for accurate and reliable predictions.

In this study, various machine learning models were selected and applied for solar power generation forecasting, including eXtreme Gradient Boosting (XGB) [34], Support Vector Regression (SVR) [35], Gradient Boosting Machine (GBM) [36], Light Gradient Boosting Machine (LGBM) [37], and a hybrid deep learning model combining Convolutional Neural Networks and Long Short-Term Memory (CNN-LSTM) [17]. These models represent different algorithmic families. XGB, GBM, and LGBM are based on boosting ensemble learning techniques. SVR is a kernel-based regression model, while CNN-LSTM is a deep learning architecture specifically designed to capture spatiotemporal dependencies in time-series data. All of these models are suitable for learning complex nonlinear relationships between meteorological variables and solar power output.

XGB is recognized for its high prediction accuracy, regularization mechanisms that help prevent overfitting, and its ability to interpret feature importance. However, it requires complex hyperparameter tuning and can be computationally intensive when working with large datasets. GBM provides competitive performance but has a relatively slow training speed and can be prone to overfitting if not properly tuned. LGBM is appropriate for large-scale datasets due to its fast training speed and low memory consumption. However, it is sensitive to noise, which makes thorough data preprocessing essential. SVR performs well on small datasets and handles high-dimensional features effectively, but it has high computational costs for large datasets and is sensitive to kernel type and parameter settings. CNN-LSTM integrates the spatial feature extraction capability of a CNN with the sequential learning strength of LSTM. For example, a CNN can identify spatial patterns in input variables such as temperature, humidity, and solar irradiance. LSTM then models the temporal dependencies present in the data. This hybrid approach is particularly effective at capturing the complex spatiotemporal interactions involved in solar power generation and can significantly improve forecasting accuracy.

In particular, XGB has demonstrated strong predictive performance in previous studies. Rodriguez-Leguizamon et al. (2023) compared a statistical model (SARIMA), a deep learning model (LSTM), and a machine learning model (XGB) for photovoltaic forecasting and reported that XGB achieved the best [38]. Building on these findings, we conducted comparative experiments using the above machine learning models. The results confirmed that XGB outperformed the other models, achieving the lowest prediction errors across all evaluation metrics: nMAE, nRMSE, and nMAPE.

The dataset used for model training consisted of one year of NWP data from 2022, while testing was conducted using the NWP dataset from 2023. Figure 6 visualizes the prediction results of the five models during the test period from June to August 2023. The x-axis represents the timestamp, and the y-axis indicates solar power generation. The blue line shows the actual measured power output, while the orange lines represent the predicted outputs of each model. Table 1 presents a comparison of the three metrics (nMAE, nRMSE, nMAPE) across the entire test period. The lower values in these metrics indicate higher prediction accuracy. As shown, the XGB model (a) achieved the lowest error values across all metrics, confirming its superior predictive performance. Furthermore, the visual results in Figure 6 show that the XGB model (a) more accurately captures the peak generation periods compared to the other models, demonstrating its strength in tracking rapid fluctuations in solar power output.

3.3.2. Wind Turbine Data Model

Unlike the solar demonstration site, where installations are concentrated within a 400 m radius, the wind power demonstration site consists of 14 wind turbines spaced approximately 1.5 km apart. Figure 7 is a satellite image of the demonstration site, showing that the spacing between turbines is 1.5 km. Each wind turbine has unique characteristics that are influenced by factors such as terrain, wear, and location-specific conditions. Developing a unified data model for all turbines would make it challenging to account for these individual variations. Therefore, we developed separate data models for each wind turbine, enabling the models to learn and capture the distinct characteristics of each turbine more effectively. To develop the wind turbine data model, we employed CNN [39] and XGB, both of which demonstrated strong predictive performance. Additionally, to assess the impact of time information on model performance in irregular time-series wind data, we conducted experiments by comparing the results with and without Time Embedding preprocessing. The findings revealed that the best performance was achieved when Time Embedding was not applied and the CNN model was used.

For model development, NWP data from 2022 were used for training, and data from 2023 were used for testing. We compared the total predicted power from individual wind turbine models with that from a unified model using three evaluation metrics. Table 2 presents the nMAPE results across four scenarios, showing that the CNN model achieved the highest accuracy in 9 out of 14 turbines. Table 3 compares the overall performance using nMAE, nRMSE, and nMAPE, confirming that individual models consistently outperformed the unified model. Figure 8 visualizes the December test results, where individual models (orange) more closely follow actual power (blue) than the unified model (green), demonstrating higher accuracy.

3.4. Development of a Hybrid Model

A hybrid model combines physical and data-driven models to complement the limitations of each model and effectively utilize their respective strengths. Based on the advantages of these two approaches, we developed a hybrid model that improves the inference speed and enhances reliability through explainable results. This study is based on the Simulation-Assisted Machine Learning method, one of the techniques proposed in Combining Machine Learning and Simulation to a Hybrid Modeling Approach by the Fraunhofer Center for Machine Learning. Referring to this method, we trained the hybrid model using both the predictions from the data-driven model and the outputs from the physical model [13].

3.4.1. Solar Power Hybrid Model

The solar power hybrid model is constructed by integrating the power generation predictions from both a physical model and a data-driven model. The overall structure of the solar hybrid forecasting model is illustrated in Figure 9.

First, NWP data are used to forecast solar irradiance, which serves as input to the simulation model. The predicted irradiance is then used in the physics-based model to estimate power generation. In parallel, both the NWP data and real-time solar inverter measurements are used as input features to train the data-driven model based on the XGB algorithm. The outputs from both the physics-based and data-driven models are subsequently fed into the hybrid model to compute the final PV power forecast. Accordingly, the proposed solar power forecasting model adopts a hybrid approach that combines the physical consistency of simulation-based modeling with the learning capability of XGB. The hybrid model is formulated as follows:

{\hat{P}}^{P V} (t) = α_{P V} (t) \cdot {\hat{P}}_{p h y s}^{P V} + (1 - α_{P V} (t)) \cdot {\hat{P}}_{X G B}^{P V} (t)

(1)

Additionally, the physics-based model for solar power forecasting is calculated as follows, considering the solar irradiance, temperature, and power generation efficiency characteristics.

{\hat{P}}_{p h y s}^{P V} (t) = η_{P V} \cdot A \cdot G (t) \cdot (1 - β \cdot (T (t) - T_{r e f}))

(2)

The XGB model learns from various meteorological and historical power generation data to predict future power output. The model is formulated as follows:

{\hat{P}}_{X G B}^{P V} (t) = f_{X G B} (x^{P V} (t); θ)

(3)

The weighting coefficient α of the hybrid model for solar power forecasting is dynamically adjusted based on the reliability of the physics-based and machine learning models. Instead of assigning a fixed weight, a weighted inverse error strategy is employed, where α is dynamically adjusted using the prediction errors of each model. To dynamically integrate the outputs, the weighting coefficient α(

t

) is computed using a weighted inverse error strategy, as described below. This approach enables automatic weight adjustments when the reliability of a specific model decreases due to factors such as weather changes, ensuring stable forecast values.

α_{P V} (t) = \frac{1 / σ_{M L} (t)}{1 / σ_{p h y s} (t) + 1 / σ_{M L} (t)}

(4)

The parameters used in Equations (1) through (4), including the model inputs, coefficients, and prediction-related variables, are detailed in Table 4 to facilitate a clear understanding and interpretation of the PV hybrid forecasting model formulation.

3.4.2. Wind Turbine Power Hybrid Model

The wind power hybrid prediction model, similar to the solar power hybrid model, is constructed by integrating the power output forecasts from both a physics-based model and a data-driven model. The overall structure of the wind power hybrid prediction model is illustrated in Figure 10. First, NWP data are utilized to forecast the nacelle wind speed. The predicted nacelle wind speed is then used as an input to the physical model to estimate power generation.

Subsequently, to generate the output of the data-driven model, both the NWP data and wind turbine data are used as input features to predict the power output. Unlike the solar power data-driven model, the wind power data-driven model aggregates the predicted results from the individual models trained for each turbine in the wind farm, reflecting the heterogeneous characteristics of each turbine.

Accordingly, the wind power forecasting model adopts a hybrid approach that integrates a physics-based model with a Convolutional Neural Network (CNN) machine learning model. This method is designed to compensate for the high variability of wind power generation and effectively capture the nonlinear characteristics of wind patterns. The hybrid model for wind turbine power generation forecasting can be formulated as follows:

{\hat{P}}^{W i n d} (t) = α_{W i n d} (t) \cdot {\hat{P}}_{p h y s}^{W i n d} + (1 - α_{W i n d} (t)) \cdot {\hat{P}}_{C N N}^{W i n d} (t)

(5)

Additionally, the previously developed physics-based model can be formulated as follows, and wind power generation is calculated based on turbine operating conditions as follows:

{\hat{P}}_{p h y s}^{W i n d} (t) = \{\begin{matrix} 0, v (t) < v_{c u t - i n} \\ P_{r a t e d} \cdot (\frac{{v (t)}^{3} - v_{c u t - i n}^{3}}{v_{r a t e d}^{3} - v_{c u t - i n}^{3}}), v_{c u t - i n} \leq v (t) < v_{r a t e d} \\ P_{r a t e d}, v_{r a t e d} \leq v (t) < v_{c u t - o u t} \\ 0, v (t) > v_{c u t - o u t} \end{matrix}

(6)

The CNN-based model learns from time-series wind speed data to predict the power output by identifying wind patterns effectively. The model is defined as follows:

{\hat{P}}_{C N N}^{W i n d} (t) = f_{C N N} (x^{W i n d} (t); θ)

(7)

The primary computational operation in CNNs is the convolution operation. The output of the l-th convolutional layer is expressed as follows:

h^{(l)} = σ (W^{(l)} * h^{(l)} + b^{(l)})

(8)

Finally, the weighting coefficient α of the hybrid model for wind turbine power forecasting is also dynamically adjusted based on the reliability of the physics-based and machine learning models. Instead of assigning a fixed weight, a weighted inverse error strategy is adopted, where α is dynamically adjusted using the prediction errors of each model.

α_{w i n d} (t) = \frac{1 / σ_{M L} (t)}{1 / σ_{p h y s} (t) + 1 / σ_{M L} (t)}

(9)

The parameters used in Equations (5) through (9), including the turbine operation thresholds, model inputs, and prediction-related variables, are detailed in Table 5 to facilitate a clear understanding and interpretation of the wind power hybrid forecasting model formulation.

3.5. Development of a High Available Data Server

The development of the hybrid model requires the real-time collection and processing of big data, including NWP data, solar inverter data, and wind turbine data, at both minute and hourly intervals. Since these data must not be lost—even in the event of collection errors—high availability and failover must be ensured. For this reason, a high-availability data server was implemented using a Kafka cluster and a Galera cluster using Max Scale to establish the hybrid framework. First, we deployed a Kafka cluster to enable the real-time, lossless, and distributed processing of big data. Additionally, we built a Relational Data Base Management System (RDBMS) cluster using MariaDB’s Max Scale and Galera Cluster, ensuring high availability by replicating the collected data across multiple nodes.

Structure

Figure 11 illustrates the architecture of the high-availability hybrid modeling framework. Data collected from various sources, including the NWP, wind turbines, and solar inverters, are ingested in real-time via Kafka producers and transmitted to the Kafka cluster. The transmitted data are replicated and sequentially stored in topics within Kafka brokers. The stored data are then consumed by Kafka consumers, which process the data and store it in an RDBMS cluster.

The Kafka-based messaging system ensures real-time, lossless, and distributed data processing, while the RDBMS cluster, built using MariaDB’s Max Scale and Galera Cluster, guarantees high availability and data integrity by synchronizing the data across multiple nodes. This framework ensures reliable data collection, efficient processing, and fault tolerance, which are crucial for hybrid modeling applications. Additionally, during the development of the hybrid model, a rigorous data preprocessing process was implemented to ensure the quality of the collected data and to enhance the reliability of predictions. Exploratory Data Analysis (EDA) was performed to identify the outliers that exceeded the expected range of the values generated by power generators and sensors. When such anomalies were detected, data refinement and correction modules were applied to improve the reliability of the model. However, since physical models generate high-fidelity data through simulations, and the demonstration site operates a moderate-scale renewable energy facility, the collected data exhibit high accuracy with minimal noise. As a result, the dataset used in this study contains negligible data gaps or noise, allowing for reliable forecasting and model validation. Nevertheless, in future applications involving new demonstration sites, high-fidelity data may not always be readily available. To address this potential issue, a data imputation module was developed to reconstruct missing or incomplete data based on the existing high-quality dataset. For this purpose, a Generative Adversarial Network (GAN)-based data imputation model was introduced. This model was designed to learn the inherent patterns of the collected data and reconstruct missing values to improve data quality. By applying the GAN-based imputation technique, the hybrid modeling framework can maintain its predictive performance and reliability, even in environments where data gaps occur. This approach enhances the framework’s robustness and adaptability, ensuring its practical applicability in various environments with fluctuating data quality.

4. Performance Evaluation and Implementation of the Hybrid Modeling Framework

4.1. Performance Evaluation

4.1.1. Metrics

In this study, the predicted power generation values are large in scale, which makes conventional evaluation metrics such as the RMSE, Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE) potentially misleading. Specifically, the RMSE and MAE produce large error magnitudes simply due to the scale of the data, making model performance comparisons less interpretable. Furthermore, the MAPE becomes unstable when actual values approach zero, and it cannot be calculated when actual values are zero. This situation commonly occurs in renewable energy generation, particularly during nighttime or periods of low wind. To overcome these limitations and ensure fair, scale-independent evaluation, we adopted the following normalized error metrics: nRMSE, nMAE, and nMAPE. These normalized metrics provide intuitive, relative error assessments that allow meaningful comparisons across different time periods, power scales, and energy sources. Additionally, these metrics have been widely used in previous studies on wind [40] and solar power forecasting [41] for these same reasons, ensuring that our evaluation methodology aligns with established best practices in renewable energy forecasting research.

The nMAE normalizes the MAE by dividing the average absolute difference between the predicted and actual values by the range of actual values. The formula is as follows:

n M A E = \frac{1}{n \cdot (y_{m a x} - y_{m i n})} \sum_{i = 1}^{n} |y_{i} - {\hat{y}}_{i}|

(10)

The nRMSE normalizes the RMSE by first computing the mean squared error, taking its square root, and then dividing it by the range of actual values. The formula is as follows:

n R M S E = \frac{1}{n \cdot (y_{m a x} - y_{m i n})} \sqrt{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(11)

The nMAPE normalizes the MAPE by dividing the absolute difference between the predicted and actual values by the range of actual values and then averaging the results. The formula is as follows:

n M A P E = \frac{1}{n} \sum_{i = 1}^{n} |\frac{y_{i} - {\hat{y}}_{i}}{y_{m a x} - y_{m i n}}| \times 100

(12)

4.1.2. Evaluation

In this study, a hybrid modeling approach was applied to accurately predict solar and wind power generation for the stable operation of renewable energy-based microgrids. The proposed hybrid model was designed to integrate the learning capability of data-driven models with the interpretability of physics-based models. For training and evaluation, one year of real-world measurement data from operational solar and wind power plants in 2023 was used, along with prediction results from the data-driven and physics-based models for the same year. To account for the seasonal variability in renewable energy generation and the influence of meteorological conditions, the dataset was structured to ensure robust prediction performance across all seasons. Specifically, data from the first to the third week of each month were used for training, while data from the last week of each month were reserved for testing [42].

The prediction results were compared not only with the actual power generation values but also with the forecasts produced by the individual data-driven and physics-based models. This comparison quantitatively demonstrated the superior performance of the hybrid model. Figure 12 and Figure 13 present the seasonal prediction results for solar and wind power generation, respectively, using representative months for each season: March for spring, June for summer, September for autumn, and December for winter. By selecting these representative months, the analysis effectively reflects the seasonal variations in meteorological conditions. The x-axis in both figures represents the timestamp, while the y-axis corresponds to the predicted values of solar and wind power generation. For instance, spring is characterized by unstable solar irradiance, summer by high temperature and humidity, autumn by decreasing temperatures and changing wind patterns, and winter by low irradiance and irregular wind speeds. Across all seasons, the hybrid model consistently predicted values that were closer to the actual power generation than those produced by either the data-driven or the physics-based models. These results indicate that the hybrid model successfully captures complex seasonal variability in renewable energy generation.

Table 6 presents a comparison of three evaluation metrics for the hybrid model, data-driven model, and physical model. These three metrics indicate higher prediction accuracy when their values are lower. According to the results, the hybrid model consistently achieves the lowest values across all metrics for both solar and wind power generation forecasting, indicating the best overall performance. For solar power forecasting, the physical model demonstrates the second-best performance. In contrast, for wind power forecasting, the data-driven model ranks second in accuracy.

4.2. Implementation and Simulation of the Hybrid Modeling Framework

In this chapter, we simulate and analyze a microgrid environment within a digital twin framework using the developed hybrid modeling framework. The hybrid forecasting model is utilized to predict wind and solar power generation, while a demand model is developed to simulate the balancing of electricity production and consumption. Extra electricity is stored in the energy storage system (ESS) and used when renewable energy sources, such as solar and wind, are insufficient. The primary objective of this simulation is to minimize the required ESS capacity within the microgrid. This is crucial for ensuring stable microgrid operation while reducing the reliance on costly ESS infrastructure. Figure 14 illustrates the simulation environment utilizing the hybrid modeling framework. As shown in the figure, the system consists of a solar power plant, three wind turbines, a demand model representing a university, and an ESS. The solar power plant was modeled using actual data collected from an operational solar facility, while the three wind turbines were similarly modeled based on real-world wind power generation data.

The forecasting of renewable energy generation is performed using the hybrid solar/wind power forecasting model. The demand model was developed based on the electricity consumption data from a university, ensuring an accurate representation of real-world energy usage patterns. Additionally, the ESS capacity is dynamically adjusted in the simulation to determine the optimal storage size for efficient energy management. Figure 15 illustrates the digital twin model of the university as a demand-side resource. The model is designed so that the electricity generated from the solar and wind power plants is consumed in a manner that mirrors the actual electricity consumption of the university.

Figure 16 illustrates the hourly wind power generation in the digital twin environment developed based on the hybrid modeling framework. The simulation allows users to select a specific year, month, and day for analysis. For time periods where measured data are available, both the actual measured data and the forecasted data are displayed simultaneously. In Figure 16, the purple streamlines visualize the influence of wind. By incorporating the effects of the wind speed and direction, the hybrid model predicts wind power generation.

Figure 17 illustrates the simulation interface of the microgrid modeled within the digital twin environment. The system enables the real-time monitoring of electricity generation from the solar power plant and wind power plant, as well as electricity consumption in the demand model (university).

The surplus electricity, generated when power production exceeds consumption, is stored in the energy storage system (ESS). The key parameters for the ESS simulation are presented in Table 7, and the detailed explanation of the algorithm and parameters is as follows.

The ESS algorithm is designed to explore the optimal ESS capacity by dynamically adjusting the capacity of the energy storage system (ESS). This enables the effective balancing of power demand and renewable energy generation. The algorithm evaluates the balance of energy supply at regular intervals based on power demand (objective), renewable energy generation (generationPower), and the current state of charge of the ESS (ESSCurAmount). In this process, the generation power is calculated by incorporating the hourly mean (mean) and standard deviation (std) of solar and wind power generation. Additionally, the ESS capacity (ESSCap) is set as a variable, allowing the simulation to be iteratively performed under various capacity conditions. For each scenario, key performance metrics, such as the imbalance between supply and demand, the frequency of ESS charging and discharging, and the number of generator start-up and shut-down events, are defined. These metrics are then used to analyze the impact of the ESS capacity variations on microgrid operational performance. In the ESS simulation process, the algorithm compares the hourly power demand and generation to determine whether there is a surplus or shortage of supply, thereby formulating an operational strategy for the ESS and renewable energy generators. When the power supply from the existing renewable energy generators in the microgrid is insufficient, the ESS discharges the stored electrical energy to the microgrid, and the generator operates at full capacity. Conversely, if the renewable energy generation alone results in an excess power supply to the microgrid, the ESS charges and the generator is curtailed (stopped). During this process, ESS charging and discharging are conducted while considering the maximum charging rate (ESSMaxCharge) and maximum discharging rate (ESSMaxDischarge), ensuring that ESS operations comply with the physical constraints. By iteratively performing these simulations, the optimal ESS capacity range is derived to ensure a stable balance between supply and demand with a minimal ESS installation capacity. Moreover, if the ESS charge level exceeds 80% of its capacity, a curtailment command is issued to the renewable energy generation units to limit their output. Additionally, considering that solar power generation ceases during nighttime, the system maximizes ESS charging before sunset while ensuring that the ESS charge level does not exceed 80%. The primary objective of this simulation is to determine the optimal ESS capacity that minimizes the storage requirements while preventing blackouts. Additionally, it aims to optimize energy management within the microgrid by achieving efficient energy balancing.

5. Conclusions

In this study, a hybrid forecasting model was developed by integrating physics-based and data-driven models to improve the accuracy of renewable energy generation forecasting. Additionally, a hybrid modeling framework was designed to enable the operation of a digital twin system, facilitating real-time simulations and validation. The proposed framework was validated using real-world data from a renewable energy demonstration site, confirming its applicability in a digital twin environment. The experimental results demonstrated that the hybrid forecasting model outperformed conventional data-driven model approaches, achieving an accuracy improvement of 8.56% for PV power generation forecasting and 11.39% for wind power generation forecasting. These findings confirm the effectiveness of the proposed methodology in enhancing the precision and reliability of renewable energy predictions. Furthermore, the high-resolution hybrid model (combining physics-based and data-driven models) was applied to optimize the balance between power generation and consumption within a microgrid system. The study verified that the proposed model is effective in maintaining grid stability and holds significant potential for improving the reliability of microgrid operations. Although this study simulated a single microgrid based on the hybrid modeling framework, the developed digital twin model and virtual microgrid environment possess significant scalability potential. Future research will focus on constructing a complex microgrid that integrates various renewable energy generation and demand-side resources for simulation. Additionally, the hybrid modeling framework will be further refined through comparative validation using data from a demonstration site. Moreover, since the digital twin platform must integrate real-time data streams for continuous simulation and validation, it is essential to efficiently process large volumes of real-time data while addressing potential issues such as data latency, packet loss, and real-time processing constraints. To mitigate these challenges, edge computing and distributed processing techniques will be implemented to minimize latency and enhance system resilience against communication failures. Furthermore, research will be conducted to optimize the utilization of surplus power, such as hydrogen production via water electrolysis, and to develop strategies for the optimal operation of multiple interconnected microgrids. Through these advancements, this research is expected to contribute to the development of a more robust and scalable framework for real-world microgrid operations.

Author Contributions

J.L., E.P. and S.L. took part in the discussion and development of the work described in this paper. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Institute of Korea Institute of Energy Technology Evaluation and Planning (KETEP) grant funded by the Korea government (MOTIE) (No. RS-2023-00232017, Advancement of e-Demonstration Complex based on Renewable Energy Digital Twin). This work was also supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korean government (MSIT) (No. RS-2022-II220074, the Development of High-Resolution Hybrid Modeling Framework for Energy Digital Twin).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).

Conflicts of Interest

The authors declare that they have no conflicts of interest to report regarding the present study.

References

Chen, J.M. Carbon neutrality: Toward a sustainable future. Innovation 2021, 2, 100127. [Google Scholar] [CrossRef] [PubMed]
Lifshits, I.M.; Smbatyan, A.S.; Saliya, M.R. Implementation of the Paris Climate Agreement in the Legal Systems of the EAEU Member States. LEX 2024, 77, 104. [Google Scholar] [CrossRef]
Yuan, X.; Su, C.W.; Umar, M.; Shao, X.; Lobonţ, O.R. The race to zero emissions: Can renewable energy be the path to carbon neutrality? J. Environ. Manag. 2022, 308, 114648. [Google Scholar] [CrossRef] [PubMed]
Almehizia, A.A.; Al-Masri, H.M.K.; Ehsani, M. Integration of renewable energy sources by load shifting and utilizing value storage. IEEE Trans. Smart Grid 2018, 10, 4974–4984. [Google Scholar] [CrossRef]
Reed, E. Improvement of Reliability Indices in a Micro-grid System involving Renewable Generation and Energy Storage. Bachelor’s Thesis, The Ohio State University, Columbus, OH, USA, 2017. [Google Scholar]
Agupugo, C.P.; Kehinde, H.M.; Manuel, H.N.N. Optimization of microgrid operations using renewable energy sources. Eng. Sci. Technol. J. 2024, 5, 2379–2401. [Google Scholar] [CrossRef]
Aslam, S.; Herodotou, H.; Mohsin, S.M.; Javaid, N.; Ashraf, N.; Aslam, S. A survey on deep learning methods for power load and renewable energy forecasting in smart microgrids. Renew. Sustain. Energy Rev. 2021, 144, 110992. [Google Scholar] [CrossRef]
Alamo, D.H.; Medina, R.N.; Ruano, S.D.; García, S.S.; Moustris, K.P.; Kavadias, K.K.; Zafirakis, D.; Tzanes, G.; Zafeiraki, E.; Spyropoulos, G.; et al. An advanced forecasting system for the optimum energy management of island microgrids. Energy Procedia 2019, 159, 111–116. [Google Scholar] [CrossRef]
Husein, M.; Chung, I.Y. Impact of solar power and load demand forecast uncertainty on the optimal operation of microgrid. In Proceedings of the 2019 IEEE PES/IAS PowerAfrica, Abuja, Nigeria, 20–23 August 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 199–203. [Google Scholar]
Sone, A.; Kato, T.; Shimakage, T.; Suzuoki, Y. Influence of forecast accuracy of photovoltaic power output on capacity optimization of microgrid composition under 30-minute power balancing control. Electr. Eng. Jpn. 2013, 182, 20–29. [Google Scholar] [CrossRef]
Tomic, S.D. A study of the impact of load forecasting errors on trading and balancing in a microgrid. In Proceedings of the 2013 IEEE Green Technologies Conference (GreenTech), Denver, CO, USA, 4–5 April 2013; pp. 443–450. [Google Scholar]
Zhao, B.; Xue, M.; Chen, R.; Lin, S.; Liu, H. An economic dispatch model for microgrid with high renewable energy resource penetration considering forecast errors. Autom. Electr. Power Syst. 2014, 14, 1–8. [Google Scholar]
von Rueden, L.; Mayer, S.; Sifa, R.; Bauckhage, C.; Garcke, J. Combining machine learning and simulation to a hybrid modelling approach: Current and future directions. In Proceedings of the Advances in Intelligent Data Analysis XVIII, Proceedings of the 18th International Symposium on Intelligent Data Analysis, IDA 2020, Konstanz, Germany, 27–29 April 2020; Proceedings 18. Springer International Publishing: Berlin/Heidelberg, Germany, 2020. [Google Scholar]
Schweidtmann, A.M.; Zhang, D.; von Stosch, M. A review and perspective on hybrid modeling methodologies. Digit. Chem. Eng. 2024, 10, 100136. [Google Scholar] [CrossRef]
Wen, X.; Liao, J.; Niu, Q.; Shen, N.; Bao, Y. Deep learning-driven hybrid model for short-term load forecasting and smart grid information management. Sci. Rep. 2024, 14, 13720. [Google Scholar] [CrossRef]
Faller, L.; Graßmann, M.; Lichtenstein, T. Machine learning based parameter estimation for an adapted finite element model of a blade bearing test bench. Energy AI 2024, 18, 100436. [Google Scholar] [CrossRef]
Lim, S.C.; Huh, J.H.; Hong, S.H.; Park, C.Y.; Kim, J.C. Solar power forecasting using CNN-LSTM hybrid model. Energies 2022, 15, 8233. [Google Scholar] [CrossRef]
Wang, Y.; Shen, Y.; Mao, S.; Cao, G.; Nelms, R.M. Adaptive learning hybrid model for solar intensity forecasting. IEEE Trans. Ind. Inform. 2018, 14, 1635–1645. [Google Scholar] [CrossRef]
Bajpai, A.; Duchon, M. A hybrid approach of solar power forecasting using machine learning. In Proceedings of the 2019 3rd International Conference on Smart Grid and Smart Cities (ICSGSC), Berkley, CA, USA, 25–28 June 2019; pp. 108–113. [Google Scholar]
Wu, Y.-K.; Chen, C.-R.; Rahman, H.A. A novel hybrid model for short-term forecasting in PV power generation. Int. J. Photoenergy 2014, 2014, 569249. [Google Scholar] [CrossRef]
Nayak, A.; Heistrene, L. Hybrid machine learning model for forecasting solar power generation. In Proceedings of the 2020 International Conference on Smart Grids and Energy Systems (SGES), Perth, Australia, 23–26 November 2020. [Google Scholar]
Li, P.; Zhou, K.; Lu, X.; Yang, S. A hybrid deep learning model for short-term PV power forecasting. Appl. Energy 2020, 259, 114216. [Google Scholar] [CrossRef]
Almarzooqi, A.M.; Maalouf, M.; El-Fouly, T.H.M.; E Katzourakis, V.; El Moursi, M.S.; Chrysikopoulos, C.V. A hybrid machine-learning model for solar irradiance forecasting. Clean Energy 2024, 8, 100–110. [Google Scholar] [CrossRef]
Gala, Y.; Fernández, Á.; Díaz, J.; Dorronsoro, J.R. Hybrid machine learning forecasting of solar radiation values. Neurocomputing 2016, 176, 48–59. [Google Scholar] [CrossRef]
Habib, A.; Hossain, M. Advanced feature engineering in microgrid PV forecasting: A fast computing and data-driven hybrid modeling framework. Renew. Energy 2024, 235, 121258. [Google Scholar] [CrossRef]
Shi, J.; Ding, Z.; Lee, W.-J.; Yang, Y.; Liu, Y.; Zhang, M. Hybrid forecasting model for very-short term wind power forecasting based on grey relational analysis and wind speed distribution features. IEEE Trans. Smart Grid 2013, 5, 521–526. [Google Scholar] [CrossRef]
Zhou, J.; Yu, X.; Jin, B. Short-term wind power forecasting: A new hybrid model combined extreme-point symmetric mode decomposition, extreme learning machine and particle swarm optimization. Sustainability 2018, 10, 3202. [Google Scholar] [CrossRef]
Wang, D.; Shi, Y.; Deng, W.; Guan, X.; Yang, M.; Yu, X. An Ultra-Short-Term Wind Power Forecasting Method Based on Data-Physical Hybrid-Driven Model. In Proceedings of the 2023 IEEE/IAS Industrial and Commercial Power System Asia (I&CPS Asia), Chongqing, China, 7–9 July 2023. [Google Scholar]
Khazaei, S.; Ehsan, M.; Soleymani, S.; Mohammadnezhad-Shourkaei, H. A high-accuracy hybrid method for short-term wind power forecasting. Energy 2022, 238, 122020. [Google Scholar] [CrossRef]
Xue, X.; Zhao, D.; Dong, H.; Wang, J.; Zhao, W. A novel hybrid approach for wind power forecasting. In Proceedings of the Unifying Electrical Engineering and Electronics Engineering, Proceedings of the 2012 International Conference on Electrical and Electronics Engineering, Phetchaburi, Thailand, 16–18 May 2012; Springer: New York, NY, USA, 2014. [Google Scholar]
De Giorgi, M.G.; Tarantino, M.; Ficarella, A. Comparisons of different wind power forecasting systems. Eng. Syst. Des. Anal. 2010, 49156, 105–113. [Google Scholar]
Wang, Y.; Xiong, W.; E., S.; Liu, Q.; Yang, N.; Fu, P.; Gong, K.; Huang, Y. Wind power prediction based on a hybrid granular chaotic time series model. Front. Energy Res. 2022, 9, 823786. [Google Scholar] [CrossRef]
Yuzgec, U.; Dokur, E.; Balci, M. A novel hybrid model based on Empirical Mode Decomposition and Echo State Network for wind power forecasting. Energy 2024, 300, 131546. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016. [Google Scholar]
Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. LightGBM: A highly efficient gradient boosting decision tree. In Proceedings of the Advances in Neural Information Processing Systems 30, Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; NIPS Foundation: San Diego, CA, USA, 2017. [Google Scholar]
Rodriguez-Leguizamon, C.K.; López-Sotelo, J.A.; Cantillo-Luna, S.; López-Castrillón, Y.U. PV Power Generation Forecasting Based on XGBoost and LSTM Models. In Proceedings of the 2023 IEEE Workshop on Power Electronics and Power Quality Applications (PEPQA), Cali, Colombia, 5–6 October 2023. [Google Scholar]
Simard, P.Y.; Steinkraus, D.; Platt, J.C. Best practices for convolutional neural networks applied to visual document analysis. In Proceedings of the Seventh International Conference on Document Analysis and Recognition, Edinburgh, UK, 6 August 2003; Volume 3. [Google Scholar]
Hanifi, S.; Liu, X.; Lin, Z.; Lotfian, S. A critical review of wind power forecasting methods—Past, present and future. Energies 2020, 13, 3764. [Google Scholar] [CrossRef]
Phan, Q.-T.; Wu, Y.-K.; Phan, Q.-D. An approach using transformer-based model for short-term PV generation forecasting. In Proceedings of the 2022 8th International Conference on Applied System Innovation (ICASI), Nantou, Taiwan, 22–23 April 2022. [Google Scholar]
Kreuwel, F.P.; Knap, W.; Schmeits, M.; de Arellano, J.V.-G.; van Heerwaarden, C.C. Forecasting day-ahead 1-minute irradiance variability from numerical weather predictions. Sol. Energy 2023, 258, 57–71. [Google Scholar] [CrossRef]

Figure 2. Satellite image of an actual solar power plant.

Figure 3. This figure shows the predicted values of irradiance and the result of the solar physical model. (a) Shows the result of the irradiance predicted using NWP data, and (b) shows the result of the solar physical model using the predicted irradiance as the input.

Figure 4. The wind turbine analysis geometry.

Figure 5. This figure shows the predicted values of nacelle wind speed and the result of the wind physical model. (a) Shows the result of the nacelle wind speed predicted using NWP data, and (b) shows the result of the wind physical model using the predicted nacelle wind speed as the input.

Figure 6. Comparison of the prediction results of the four models: (a) shows the results of XGB; (b) depicts the results of SVR; (c) illustrates the results of GBM; (d) displays the results of LGBM; and (e) illustrates the results of CNN-LSTM.

Figure 7. Satellite image of an actual wind power plant.

Figure 8. Comparison of the prediction results of the individual model and unified model.

Figure 9. Solar hybrid model prediction process.

Figure 10. Wind hybrid model prediction process.

Figure 11. High available data server structure.

Figure 12. Comparison of hybrid model, data-driven, and physical models for solar power generation forecasting. The blue line represents the measured values, and the red line indicates the predictions from the hybrid model: (a) spring, (b) summer, (c) autumn, and (d) winter.

Figure 13. Comparison of hybrid model, data-driven, and physical models for wind power generation forecasting. The blue line represents the measured values, and the red line indicates the predictions from the hybrid model: (a) spring, (b) summer, (c) autumn, and (d) winter.

Figure 14. Microgrid simulation environment based on the hybrid modeling framework.

Figure 15. Demand resource digital twin model (university).

Figure 16. Wind power site digital twin model.

Figure 17. Energy Balancing Monitoring Based on a Digital Twin.

Table 1. Comparison of the prediction results of the five models.

Models	nMAE	nRMSE	nMAPE
XGB	0.14231	0.18374	14.23
SVR	0.18467	0.34831	18.24
GBM	0.16395	0.20892	16.40
LGBM	0.15836	0.19170	15.84
CNN-LSTM	0.14486	0.18433	14.49

Table 2. Comparison of the prediction results of the four cases.

Turbines	CNN	XGB	CNN (TimeEmbedding)	XGB (TimeEmbedding)
WT01	18.13	18.79	18.12	18.85
WT02	18.63	18.84	18.28	18.82
WT03	17.08	18.08	17.54	17.98
WT04	17.34	18.75	17.37	18.71
WT05	19.33	19.87	19.41	19.75
WT06	18.96	19.7	19.08	19.7
WT07	17.85	19.39	18.24	19.32
WT08	17.45	19.3	18.04	19.04
WT09	17.56	18.67	17.38	18.61
WT10	18.44	18.47	18.56	18.71
WT11	16.89	18.13	17.63	18.29
WT12	18.21	18.37	17.79	18.39
WT13	17.5	18.62	18.03	18.6
WT14	23.77	20.37	22.65	21.08
1st	9	1	4	0

Table 3. Comparison of the prediction results of the individual model and unified model.

Metric	Individual Model	Unified Model
nMAE	0.18486	0.27988
nRMSE	0.24531	0.33017
nMAPE	18.48751	27.98757

Table 4. Definition of symbols and parameters for the PV hybrid forecasting model.

Parameter	Description
${\hat{P}}^{P V} (t)$	Final forecast of PV power generation at time $t$
${\hat{P}}_{p h y s}^{P V} (t)$	Power output predicted by the physics-based model
${\hat{P}}_{X G B}^{P V} (t)$	Power output predicted by the XGB model
$α_{P V} (t)$	Time-varying hybrid weighting coefficient
$η_{P V}$	Efficiency of the PV module
$A$	Total PV module installation area
$G (t)$	Global Horizontal Irradiance (GHI) at time $t$
$T (t)$	Ambient temperature at time $t$
$β$	Temperature coefficient of the PV module
$T_{r e f}$	Reference temperature (typically 25 °C)
$f_{X G B} (x)$	Trained XGB prediction function
$x^{P V} (t)$	Input feature vector for XGB at time $t$
$θ$	Model parameters for XGB
$σ_{p h y s} (t)$	Prediction error of the physics-based model (nMAPE)
$σ_{M L} (t)$	Prediction error of the machine learning model (XGB)

Table 5. Definition of symbols and parameters for the wind power hybrid forecasting model.

Parameter	Description
${\hat{P}}^{W i n d} (t)$	Final forecast of wind power generation at time $t$
${\hat{P}}_{p h y s}^{W i n d} (t)$	Power output predicted by the physics-based model
${\hat{P}}_{C N N}^{W i n d} (t)$	Power output predicted by the CNN model
$α_{W i n d} (t)$	Dynamically optimized hybrid weighting coefficient
$v (t)$	Wind speed at time $t$
$v_{c u t - i n}$	Cut-in wind speed (minimum speed for turbine operation)
$v_{r a t e d}$	Rated wind speed (speed at which the turbine reaches maximum power)
$v_{c u t - o u t}$	Cut-out wind speed (speed beyond which the turbine shuts down for safety)
$P_{r a t e d}$	Rated power output of the wind turbine
$f_{C N N}$	Trained CNN model function
$x^{W i n d} (t)$	Input feature vector for CNN-based wind power forecasting
$θ$	CNN model parameters
$h^{(l)}$	Output of the l-th convolutional layer
$W^{(l)}$	Convolutional kernel weights in the l-th layer
$b^{(l)}$	Bias term of the l-th layer
$σ$	Activation function (ReLU)
$*$	Convolution operation
$σ_{p h y s} (t)$	Prediction error of the physics-based model at time $t$
$σ_{M L} (t)$	Prediction error of the machine learning model (CNN) at time $t$

Table 6. Comparison of forecasting performance for solar and wind power using normalized error metrics.

	Model		Metrics
	Model	nMAE	nRMSE	nMAPE
Solar	Data model	0.14231	0.18374	14.23
	Physical model	0.13181	0.15447	13.18
	Hybrid model	0.05575	0.08618	5.58
Wind	Data model	0.18486	0.24531	18.49
	Physical model	0.21138	0.27752	21.14
	Hybrid model	0.07098	0.09748	7.10

Table 7. The key parameters for the ESS simulation.

	Parameters	Description
ESS	ESSCap (kWh)	Capacity of the ESS
	ESSCurAmount (kWh)	Current state of charge (SoC) of ESS
	ESSInit (kWh)	Initial state of charge (SoC) of ESS
	ESSMaxCharge (kW)	Maximum charging rate of ESS
	ESSMaxDischarge (kW)	Maximum discharging rate of ESS
Demand Resources	objective (kW)	Power demand of the microgrid
Renewable Energy (PV, Wind Turbine)	generationPower (kW)	Power generation of renewable energy generators
	mean (kW)	Average power generation of renewable energy generators
	std (kW)	Standard deviation of power generation of renewable energy generators

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lee, J.; Park, E.; Lee, S. Development of a Hybrid Modeling Framework for the Optimal Operation of Microgrids. Energies 2025, 18, 2102. https://doi.org/10.3390/en18082102

AMA Style

Lee J, Park E, Lee S. Development of a Hybrid Modeling Framework for the Optimal Operation of Microgrids. Energies. 2025; 18(8):2102. https://doi.org/10.3390/en18082102

Chicago/Turabian Style

Lee, Jaekyu, Eunseop Park, and Sangyub Lee. 2025. "Development of a Hybrid Modeling Framework for the Optimal Operation of Microgrids" Energies 18, no. 8: 2102. https://doi.org/10.3390/en18082102

APA Style

Lee, J., Park, E., & Lee, S. (2025). Development of a Hybrid Modeling Framework for the Optimal Operation of Microgrids. Energies, 18(8), 2102. https://doi.org/10.3390/en18082102

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Development of a Hybrid Modeling Framework for the Optimal Operation of Microgrids

Abstract

1. Introduction

2. Related Work

3. Development of a Hybrid Modeling Framework

3.1. Development Structure and Data Flow of the Hybrid Modeling Framework

3.2. Development of a Physical Model

3.2.1. Solar Power Physical Model

3.2.2. Wind Turbine Physical Model

3.3. Development of a Data Model

3.3.1. Solar Power Data Model

3.3.2. Wind Turbine Data Model

3.4. Development of a Hybrid Model

3.4.1. Solar Power Hybrid Model

3.4.2. Wind Turbine Power Hybrid Model

3.5. Development of a High Available Data Server

Structure

4. Performance Evaluation and Implementation of the Hybrid Modeling Framework

4.1. Performance Evaluation

4.1.1. Metrics

4.1.2. Evaluation

4.2. Implementation and Simulation of the Hybrid Modeling Framework

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI