Photovoltaic Decomposition Method Based on Multi-Scale Modeling and Multi-Feature Fusion

Xu, Zhiheng; Chen, Peidong; Cheng, Ran; Duan, Yao; Luo, Qiang; Zhang, Huahui; Pan, Zhenning; Xiao, Wencong

doi:10.3390/en18195271

Open AccessArticle

Photovoltaic Decomposition Method Based on Multi-Scale Modeling and Multi-Feature Fusion

by

Zhiheng Xu

¹,

Peidong Chen

¹,

Ran Cheng

¹,

Yao Duan

¹,

Qiang Luo

¹,

Huahui Zhang

²,

Zhenning Pan

²

and

Wencong Xiao

^2,*

¹

Power System Planning Research Center of Guangdong Power Grid Co., Ltd., Guangzhou 510623, China

²

School of Electric Power Engineering, South China University of Technology, Guangzhou 510641, China

^*

Author to whom correspondence should be addressed.

Energies 2025, 18(19), 5271; https://doi.org/10.3390/en18195271

Submission received: 1 September 2025 / Revised: 26 September 2025 / Accepted: 29 September 2025 / Published: 4 October 2025

Download

Browse Figures

Versions Notes

Abstract

Deep learning-based Non-Intrusive Load Monitoring (NILM) methods have been widely applied to residential load identification. However, photovoltaic (PV) loads exhibit strong non-stationarity, high dependence on weather conditions, and strong coupling with multi-source data, which limit the accuracy and generalization of existing models. To address these challenges, this paper proposes a multi-scale and multi-feature fusion framework for PV disaggregation, consisting of three modules: Multi-Scale Time Series Decomposition (MTD), Multi-Feature Fusion (MFF), and Temporal Attention Decomposition (TAD). These modules jointly capture short-term fluctuations, long-term trends, and deep dependencies across multi-source features. Experiments were conducted on real residential datasets from southern China. Results show that, compared with representative baselines such as SGN-Conv and MAT-Conv, the proposed method reduces MAE by over 60% and SAE by nearly 70% for some users, and it achieves more than 45% error reduction in cross-user tests. These findings demonstrate that the proposed approach significantly enhances both accuracy and generalization in PV load disaggregation.

Keywords:

non-intrusive load monitoring; photovoltaic disaggregation; multi-scale modeling; multimodal fusion; time series representation

1. Introduction

With the continuous rise in demand for energy efficiency [1] and intelligent management [2,3], distributed photovoltaic (PV) systems have gradually become an integral part of residential energy systems. This trend imposes higher demands on distribution-side energy management, particularly requiring high-precision monitoring and optimization of distributed energy resources without additional hardware investments. Non-Intrusive Load Monitoring (NILM), a technology that relies solely on smart meter data, offers a viable solution to this challenge [4,5]. This study hypothesizes that multi-scale modeling and multi-feature fusion can effectively capture the non-stationary characteristics and multi-source feature dependencies of PV loads, thereby enhancing decomposition accuracy and generalization. Based on this, the research objective is to design and validate a PV decomposition framework suitable for residential scenarios, while also exploring its scalability across different users and contexts. As shown in Table 1, for clarity and consistency, a comprehensive table of symbols is provided to assist the reader in following the technical presentation.

Traditional load decomposition methods have made progress in identifying household appliances (e.g., refrigerators, washing machines) [6,7], but decomposing PV output remains more challenging. Driven by solar irradiation and diurnal variations, PV signals exhibit high volatility and nonlinear characteristics while often overlapping with other loads, significantly increasing task complexity [8,9]. Existing research has attempted improvements through deep Auto-Transformer models [10], enhanced clustering methods [11], or ensemble learning approaches [12]. Nevertheless, three major limitations remain: First, single-timescale approaches struggle to balance short-term fluctuations with long-term trends; Second, insufficient utilization of meteorological data prevents full exploitation of cross-dimensional feature interactions; Third, reliance on complex models or prior knowledge restricts scalability across diverse scenarios. Therefore, there is an urgent need for an innovative framework that simultaneously incorporates multi-scale modeling and multi-source feature fusion to address these limitations.

To tackle these challenges, this paper proposes a multi-scale and multi-feature fusion photovoltaic decomposition framework specifically tailored for residential scenarios. The framework consists of three modules: First, Multi-Scale Time Series Decomposition (MTD), which captures both local fluctuations and long-term trends through hierarchical down-sampling and fusion; Second, Multi-Feature Fusion (MFF), which models semantic interactions between load and meteorological variables across temporal and channel dimensions; Third, Temporal Attention Decomposition (TAD), which employs attention mechanisms to identify key time segments strongly correlated with PV output, thereby enhancing robustness under low-sampling conditions. Compared with existing approaches, the proposed framework achieves joint modeling across temporal scales and feature dimensions, while avoiding reliance on high-frequency sampling and complex prior knowledge, thus substantially improving accuracy and generalization. It should be noted that the dataset used in this study originates from residential load data, and further validation is required for commercial or industrial applications.

The contributions of this paper are threefold:

Proposing and validating a multi-scale and multi-feature fusion PV decomposition framework based on the hypothesis that joint modeling across temporal scales and feature dimensions enhances accuracy and generalization;
Organically integrating multi-scale decomposition, feature interaction modeling, and attention mechanisms in the framework design, which differs from traditional NILM methods that rely on single-scale or shallow fusion approaches;
Demonstrating significant improvements over SGN-Conv and MAT-Conv on real residential user data from southern China, while maintaining robust performance across different user experiments. It is important to note that the design and validation of this method are based on residential scenarios, and additional testing is necessary for commercial or industrial PV systems.

2. Related Work

In recent years, deep learning has demonstrated remarkable capabilities in feature extraction and sequence modeling, significantly advancing the performance of NILM. Particularly under low-frequency sampling conditions, traditional statistical methods have been progressively replaced by end-to-end neural network architectures such as Long Short-Term Memory Networks (LSTM) [13], Convolutional Neural Networks (CNN) [14], and Transformer-based models [15,16]. These approaches automatically capture the nonlinear dynamics and long-range dependencies inherent in aggregate load data, achieving relatively good results on various public NILM datasets. Figure 1 shows that the number of searches for the keyword “NILM” has continued to grow over the past six years, reflecting the research enthusiasm and attention in this field. With the increasing penetration of distributed PV systems in residential settings, researchers have begun integrating PV generation into the NILM framework. This integration enables the estimation of PV output directly from the total load curve recorded by smart meters, eliminating the need for additional sensors. To provide a clearer overview, recent studies are systematically compared in terms of their methodological characteristics and input features, with the results summarized in Table 2.

Beyond directly decomposing PV data from the total load curve, some studies have attempted modeling and control parameter estimation at the inverter level. For instance, ref. [17] employed Gaussian mixture models and curve fitting to reconstruct reactive power parameters, thereby avoiding reliance on distribution network topology. However, this method is highly sensitive to sunrise and sunset times, resulting in significantly diminished performance in distributed scenarios with multiple parallel inverters. In contrast, the proposed method requires no additional inverter-level modeling; it achieves stable decomposition solely based on aggregate load and meteorological data, thereby overcoming the applicability limitations of such approaches in distributed scenarios.

Similarly, ref. [18] combined multiscale modeling with a meteorological attention mechanism to enhance dynamic PV modeling capabilities. However, this approach heavily relies on high-quality meteorological inputs and experiences significant performance degradation under data-deficient or noisy conditions. It also lacks robustness in non-stationary load scenarios involving energy storage and electric vehicles. In contrast, our method employs a Multi-Feature Fusion (MFF) module for deep interactive modeling of electrical and meteorological features, maintaining robust performance even with low-quality meteorological data and thereby offering greater practicality. Ref. [19] proposed a Transformer–CNN hybrid architecture incorporating a “time–application attention” mechanism to capture complex device dependencies, primarily targeting industrial scenarios. Its multi-scale attention offers some reference value for non-stationary signals, but its transferability is limited in residential PV decomposition due to diverse user loads and energy storage interventions. In contrast, the proposed method’s TAD module is specifically designed for residential PV scenarios. It focuses on PV-related time segments under low-sampling conditions, better aligning with residential user needs.

With advances in deep learning, some studies have attempted to reduce dependence on meteorological data. For instance, ref. [20] proposed a real-time framework based on Universal Adaptive Stabilization (UAS), estimating PV and residential loads solely from net load data with rapid convergence. However, this method requires manual parameter tuning and performs poorly under extreme weather conditions. In contrast, the proposed method automatically models the relationship between meteorological data and load through Multi-Scale Time Series Decomposition (MTD) and the MFF module, eliminating the need for complex parameter tuning and demonstrating stable performance across diverse weather conditions.

Ref. [21] introduced a “month-to-hour mapping” strategy, employing Gaussian mixture models and reference PV curves, which is suitable for low-resolution data. However, it relies heavily on reference curves and is prone to errors in cross-climate or energy-storage scenarios. Ref. [22] proposed the Consumer Mixture Model (CMM), which estimates PV output using neighboring users’ nighttime curves and supports multi-user aggregation. However, it relies on region-specific characteristics, leading to significant errors in cases of energy storage discharge or cross-regional deployment. In contrast, the proposed method does not depend on specific reference curves or regional features. Instead, it automatically adapts to different scenarios through multiscale decomposition and cross-feature interaction modeling, thereby enhancing its generalization capability.

In summary, existing NILM approaches for PV disaggregation face several critical limitations. First, most models are confined to a single temporal resolution, making it difficult to simultaneously capture sudden variations and long-term trends in PV output. Second, the integration of meteorological and electrical features is often superficial, lacking structured mechanisms for multi-dimensional interaction. Third, many methods involve complex architectures or require prior domain knowledge, which compromises generalization and limits real-world applicability.

To address these challenges, this paper proposes a deep learning-based PV disaggregation framework that integrates multi-scale temporal modeling with semantic feature fusion. Specifically, the method employs a down-sampling-based strategy to construct multi-resolution temporal sequences and introduces a dual-channel architecture that models interactions between electrical loads and meteorological variables across temporal and channel dimensions. This design enhances the model’s capability to represent complex PV dynamics while improving its adaptability to heterogeneous environmental conditions.

3. Method

As illustrated in Figure 2, this paper proposes a non-intrusive modeling framework for PV load disaggregation, consisting of three core modules: Multi-scale Time-series Decomposition (MTD), Multi-feature Fusion (MFF), and Temporal Attention-based Decomposition (TAD). The MTD module takes the aggregated load and associated environmental variables

X \in R^{T \times C}

as input, where T denotes the sequence length and C denotes the number of features, extracts multi-scale features via down-sampling, and integrates them through an attention mechanism to produce a unified representation

U_{f u s e d}

. The MFF module then models temporal and channel-level dependencies to enhance the representation, yielding feature sequence Z. Finally, the TAD module applies a Transformer-based attention mechanism to capture contextual information and estimate the PV load sequence

{\hat{y}}_{t}^{P V}

. These three modules work in a coordinated manner, enabling accurate modeling and separation of PV components from the total load. Implementation details of each module are elaborated in the following sections.

3.1. Multi-Scale Time-Series Decomposition Module

To capture dynamic patterns across different time scales in photovoltaic load curves, this study proposes an MTD module. Through down-sampling operations, X is divided into multiple sub-sequences

{x_{1}, x_{2}, \dots, x_{k}}

with varying temporal resolutions, where k denotes the number of down-sampling layers. This approach simultaneously preserves rapid fluctuations and long-term trends, avoiding the omission of critical patterns common in traditional single-scale methods.

To effectively fuse features across scales, MTD employs an attention-guided feature fusion mechanism. First, each sub-sequence

x_{i}

undergoes average pooling to yield a compressed context vector

y_{i}

representing the core information at that scale. Next, all

{y_{1}, y_{2}, \dots, y_{k}}

are concatenated and fed into a fully connected layer. A Softmax function generates attention weights

α_{i}

to measure the importance of each scale. Finally, MTD obtains the fused temporal feature representation through weighted summation:

U_{fused} = \sum_{i = 1}^{k} α_{i} x_{i}

(1)

Here,

a_{i}

represents the weight learned by the attention mechanism, determining each scale’s contribution to the final representation. The resulting

U_{f u s e d}

simultaneously captures local fluctuations and global trends, providing more robust inputs for subsequent modeling.

Compared to existing NILM methods, this study innovates by “first performing explicit multi-scale decomposition, then applying attention fusion.” Traditional multi-scale approaches typically rely on fixed rules, lacking adaptability; attention mechanisms often operate directly on raw sequences, struggling to fully leverage multi-scale features. MTD combines both, enabling the model to dynamically select scales based on varying household and meteorological conditions, thereby achieving greater robustness in photovoltaic decomposition tasks.

3.2. Multi-Feature Fusion Module

To model the dynamic dependencies between load and meteorological variables, this study proposes a MFF module. Taking

U_{f u s e d}

as input, this module employs a Gated Recurrent Unit (GRU) network to capture the temporal evolution of each variable while utilizing a channel attention mechanism to characterize interactions between variables. This achieves deep fusion of multi-source features, enhancing feature representation capabilities.

In the temporal modeling component, each input variable sequence

U [: c]

is fed into a GRU with shared parameters to extract its temporal dependency feature

h_{c}

. Representations of all variables are then stacked into a unified tensor

U_{t i m e}

, preserving the dynamic patterns of each variable over time. This enables the model to recognize synergistic effects on PV output, such as high temperature and strong irradiation.

To further characterize variable interactions, MFF introduces a channel attention mechanism at each time step. This mechanism assigns weights

a_{t}

to each variable, quantifying its importance in the global representation. Based on the attention distribution, a channel-weighted feature representation

U_{c h a n n e l}

is obtained. Additionally, a learnable gating coefficient β ∈ [0,1] is introduced to regulate the contribution of interaction features, preventing certain redundant variables from amplifying noise.

During the fusion stage, temporal features

U_{t i m e}

and channel features

U_{c h a n n e l}

are integrated according to their weights, while residual connections preserve low-level features from the original input U. The resulting fused representation Z incorporates both single-variable dynamic information and explicitly models contextual dependencies among multiple variables.

Z = U + U_{t i m e} + β \cdot U_{c h a n n e l}

(2)

Compared to existing NILM methods, MFF’s novelty lies in its “dual-layer modeling of time dependency and channel interaction.” Traditional approaches often model only the load curve itself, neglecting the influence of meteorological factors, or treat meteorological variables merely as external inputs without deeply modeling their semantic relationship with load. MFF, however, combines temporal modeling with channel modeling, enabling load and meteorological features to interact at a deep semantic level. This significantly enhances the model’s interpretability and robustness in photovoltaic decomposition tasks.

3.3. Temporal Attention-Based Decomposition Module

After obtaining the fused feature sequence

Z \in R^{T \times C}

, the TAD module further leverages a Transformer-based structure to model temporal dependencies. By incorporating learnable positional encodings and a multi-head self-attention mechanism, this module extracts contextual features related to PV generation, enabling accurate PV load disaggregation.

The module first applies a linear projection to the input sequence Z, followed by the addition of learnable positional embeddings to provide temporal order awareness. This process is formulated as:

X_{t} = Z W + E_{t}

(3)

where

W \in R^{D \times d}

is the embedding matrix that maps the fused features into the Transformer modeling space, and

E_{t}

is the positional encoding at time step t. Here, D denotes the feature dimension, while d represents the model embedding dimension within the transformer. The resulting embedded sequence

X_{t} \in R^{T \times d}

is then fed into the temporal modeling module.

To capture dependencies among time steps, a multi-head self-attention mechanism is employed. This mechanism enables each time step to interact with all others through global attention. The attention computation is given by:

α_{t} = softmax (\frac{Q K^{Τ}}{\sqrt{d}})

(4)

C_{t} = α_{t} V_{t}

(5)

where

Q = X_{t} W_{Q}, K = X_{t} W_{K}, V = X_{t} W_{V}

represent the query, key, and value matrices, respectively, with learnable parameters

W_{Q}, W_{K}, W_{V} \in R^{d \times d}

.

α_{t}

denotes the attention weight, and

C_{t}

is its context-enhanced representation.

Subsequently, a time-distributed feedforward neural network maps each context vector

C_{t}

to its corresponding PV load estimate:

{\hat{y}}_{t}^{P V} = R e l u (w_{0} C_{t} + b_{0})

(6)

where

W_{0} \in R^{d \times 1}

,

b_{0} \in R

, and ReLu(·) is the activation function. The final output sequence

{\hat{y}}^{P V} \in R^{T}

represents the disaggregated PV load component.

To guide the model to focus more on the structural changes of the PV load, especially during periods of significant magnitude fluctuation, a gradient-sensitive time-weighted loss function is introduced:

L_{P V} = \frac{1}{T} \sum_{t}^{T} w_{t} {({\hat{y}}_{t}^{P V} - y_{t}^{P V})}^{2}

(7)

where the weight

w_{t}

is determined based on the local gradient of the ground-truth PV load. When

|y_{t} - y_{t - 1}| > θ

, a larger weight is assigned to enhance the model’s focus on abrupt changes.

By introducing attention-based temporal modeling, this module enables the model to utilize global temporal context in the disaggregation process. It effectively overcomes the equal-weight limitations of traditional methods and enhances the model’s ability to capture time-sensitive structures, making it a core component of the NILM disaggregation pipeline.

4. Experiment

4.1. Dataset Description

The experiments in this paper are conducted using a real-world residential energy consumption dataset collected from a city in southern China. This dataset includes net load and distributed PV generation data from four residential users, along with corresponding meteorological information such as temperature and solar irradiance, all aligned with timestamp records. To further evaluate cross-user generalization, we included two additional households, not used in training, as independent test subjects in our experiments. The data is sampled at an hourly resolution, spanning nearly two years, resulting in approximately 17,000 samples per user. These sequences are then split into training, validation, and test sets using a 7:2:1 ratio. It should be explicitly noted that the dataset contains a limited number of users and is sourced exclusively from residential scenarios within a single city. Consequently, the conclusions drawn in this paper are subject to limitations in terms of geographical scope and scale.

4.2. Parameter and Hardware Configuration

For model configuration, the input sequence length is set to 720, with predictions made at the same temporal resolution. The overall architecture comprises three layers of multi-scale fusion, each with a scale width of 2; the patch size is 16, and the dropout rate is 0.1. The Adam optimizer is employed with an initial learning rate of 0.001. Training is conducted for up to 100 epochs with early stopping enabled, using a batch size of 128. To enhance robustness under low-output conditions, the loss function incorporates a zero-output penalty term in addition to the standard MSE, penalizing non-zero predictions when the actual PV output approaches zero. All experiments are conducted on a CUDA-accelerated NVIDIA RTX 3090 GPU. It should be noted that the RTX 3090 is used solely to accelerate training and is not a computational requirement for the model, which can also be deployed on an RTX 3060 GPU or a GTX 1080 GPU.

For comparative methods, SGN-Conv [23] and MAT-Conv [24] are implemented according to their original papers and publicly available code. SGN-Conv maintains the same number of convolutional layers and channels as the original paper, using the Adam optimizer with an initial learning rate of 0.001. MAT-Conv’s multi-head attention layers and embedding dimensions also follow the original settings. Across all experiments, the learning rate is uniformly set to 0.001 and the batch size to 128. Both methods employ early stopping to prevent overfitting. Under these unified hardware and training conditions, the configurations ensure a fair comparison between different approaches.

4.3. Evaluation Metrics

To comprehensively evaluate model performance in PV disaggregation, three commonly used error metrics are employed: mean absolute error (MAE), mean squared error (MSE), and symmetric absolute error (SAE).

M A E = \frac{1}{T} \sum_{t = 1}^{T} | {\hat{y}}_{t} - y_{t} |

(8)

M S E = \frac{1}{T} \sum_{t = 1}^{T} {({\hat{y}}_{t} - y_{t})}^{2}

(9)

SAE = \frac{1}{M} \sum_{m = 1}^{M} \frac{1}{T} | \sum_{t = 1}^{T} y_{t} - \sum_{t = 1}^{T} y_{t} |

(10)

where

{\hat{y}}_{t}

denotes the disaggregated PV output at time step t,

y_{t}

denotes the ground-truth PV generation, M represents the batch size, and T represents the number of samples contained in each batch.

4.4. Main Experimental Results

Table 3 presents quantitative performance comparisons between the proposed method and two representative existing models SGN-Conv [23] and MAT-Conv [24] for four residential user scenarios. Overall, the proposed method demonstrates significant advantages across most metrics, effectively validating its accuracy and robustness in photovoltaic load decomposition tasks. On User 1 and User 2 datasets, our model achieves MAE values of 0.1819 and 0.0883, respectively—approximately 60% lower than SGN-Conv and 50% lower than MAT-Conv. Visualization results in Figure 3 further demonstrate high consistency between predicted and actual curves, particularly in peak-valley intervals.

In User 3’s experiment, the MAE and MSE of our method were slightly higher than those of MAT-Conv. This discrepancy did not stem from an overall performance degradation but rather from the alignment between signal characteristics and model mechanisms. Specifically, User 3’s photovoltaic output curve exhibited pronounced periodicity and highly stable diurnal variation patterns, with minimal short-term fluctuations (manifested as a low signal-to-noise ratio and small rolling variance). Under such conditions, the MTD module tends to generate redundant components unrelated to the dominant daily cycle during multi-scale decomposition. These components are subsequently amplified in feature fusion and attention mechanisms, leading to overfitting to local noise or minor disturbances and consequently increasing MAE and MSE. Figure 4 illustrates this phenomenon: The upper panel shows that while our model’s predictions (red dashed line) generally align with actual PV output (blue solid line), a sharp overprediction peak emerged around March 19. The residual plot below corroborates this, revealing isolated large errors that primarily drove the increased MAE and MSE values, supporting our hypothesis of localized overfitting anomalies. In contrast, MAT-Conv directly captures subtle differences in temporal neighborhoods through its local attention mechanism, giving it superior performance in “low-fluctuation, strong-periodicity” scenarios. Nevertheless, our method demonstrates stronger preservation of the primary cycle in spectral energy concentration analysis and maintains superiority in SAE metrics, indicating greater robustness in capturing long-term energy trends and suppressing cumulative error propagation. This suggests that even in stable, highly periodic scenarios, our approach retains clear advantages in modeling long-term structures and preserving overall energy patterns.

In user 4’s experiments, our method achieved lower MAE and MSE but slightly underperformed SGN-Conv in SAE. This is mainly because User 4’s PV curves contain a few sharp anomalous peaks. Our framework, emphasizing global trends through multi-scale decomposition, smooths such local spikes, leading to weaker recovery of extreme values and thus higher SAE. By contrast, SGN-Conv’s convolutional structure is more sensitive to short-term fluctuations, enabling better peak capture but at the expense of stability in overall trend modeling. In summary, our method is stronger in balancing global accuracy and long-term error control, while SGN-Conv excels at capturing isolated extreme peaks.

4.5. Ablation Experiment

As shown in Table 4, we further validated the contribution of each module to overall performance through ablation experiments. Case 1 represents removing meteorological features, while Case 2 denotes removing the MTD module. Overall, both ablation approaches resulted in a significant increase in error, indicating that the proposed modules play a crucial role in enhancing both model accuracy and robustness. Particularly in the results for User 1 and User 2, the complete model significantly outperformed the control group across all three metrics: MAE, MSE, and SAE. Moreover, Case 1 generally exhibited higher errors than Case 2, indicating that meteorological information holds significant value in capturing the non-stationary characteristics of PV output, while the MTD module is indispensable for modeling multi-timescale dynamic variations.

It is worth noting that in User 3’s results, Case 2 outperformed the full model in both MAE and MSE, consistent with the anomalous findings in previous experiments. Further analysis revealed that this user’s PV curve exhibited high stability and low volatility. In such scenarios, multi-scale decomposition may introduce redundant features or noise interference, thereby weakening the model’s ability to capture fine-grained variations. Nevertheless, the full model still delivers the best performance on the SAE metric, indicating its continued advantage in modeling long-term energy trends and controlling cumulative error. Thus, the anomalous results do not negate the MTD module’s utility but rather reveal its potential limitations in specific stable scenarios.

To verify the robustness of the ablation experiment results, we further calculated the 95% confidence intervals of three metrics (MAE, MSE, SAE) for User 1. As shown in the table, both Case 1 and Case 2 exhibited significantly higher errors compared to the complete model, with their confidence interval bounds showing little overlap with those of the full model. For example, the complete model achieved an MAE of 0.1819 [0.1709, 0.1976], which is substantially lower than 0.4736 [0.3980, 0.5358] in Case 1 and 0.2589 [0.2238, 0.3056] in Case 2. Similar trends are observed for the MSE and SAE metrics. These findings demonstrate that the complete model not only outperforms the ablation configurations in terms of average performance but also maintains a stable advantage within the uncertainty range, further confirming the critical role of the MTD module and meteorological features in enhancing both the accuracy and robustness of the model.

4.6. Generalization Experiment

To further validate the model’s generalization capability on unseen users, we evaluated it using newly added User 5 and User 6 as independent test subjects, while retaining the original four users as the training set. Table 5 presents the comparison results against two baseline models. The results demonstrate that the proposed method exhibits significant advantages for both new users. For User 5, our approach achieves MAE and MSE values of 0.1778 and 0.1007, respectively, representing reductions of approximately 45% and 60% compared to SGN-Conv, and 52% and 53% compared to MAT-Conv, significantly improving point prediction accuracy. Simultaneously, it maintains a leading position in the SAE metric, indicating greater robustness in cumulative energy prediction. In the User 6 test, our method achieves MAE and MSE values of 0.1530 and 0.0834, respectively, representing substantial reductions compared to both SGN-Conv and MAT-Conv. The advantage is particularly pronounced in the SAE metric (0.0724, approximately 71% lower than SGN-Conv and 63% lower than MAT-Conv), further validating the model’s reliability in modeling long-term energy trends.

Overall, the proposed method demonstrates consistent performance with previous experimental results on new users, not only improving single-point prediction accuracy but also exhibiting enhanced robustness in long-term energy estimation. This indicates that the multi-scale modeling and feature fusion strategy effectively captures key dynamic characteristics of PV output and possesses a degree of cross-user transferability. Although validation remains confined to the same region, the results provide positive support for the model’s generalizability.

4.7. Sensitivity Analysis and Time Cost Comparison

In this study, we primarily examined two hyperparameters—learning rate and dropout—that most significantly impact model stability and generalization capability. Sensitivity analysis results, as shown in Table 6, indicate that learning rate exerts the greatest influence on model performance: when set excessively high (0.01), model performance declines sharply, while remaining stable within the range of 0.001 to 0.0001. In contrast, dropout enhances robustness; moderate settings effectively suppress overfitting while further optimizing metrics. Overall, learning rate is the key factor determining model convergence stability, while appropriately applied dropout plays a supplementary role in boosting generalization.

Regarding computational efficiency, as shown in Table 7, experiments compared SGN-Conv, MAT-Conv, and the proposed method under identical hardware (NVIDIA RTX 3090 GPU), dataset (User 1), and hyperparameter conditions. Using the average training time over the first 100 epochs as the metric, results show that our method achieves significantly higher accuracy while averaging only 7.75 s per epoch. This outperforms SGN-Conv and is substantially lower than MAT-Conv’s 15.64 s, demonstrating a strong efficiency advantage.

5. Conclusions

This paper proposes a specialized NILM method for distributed PV systems. To address the non-stationary characteristics, weather dependency, and strong coupling of PV loads with total residential demand, we design a framework that integrates multiscale decomposition, multi-feature fusion, and temporal attention mechanisms. The proposed framework effectively decomposes PV loads by balancing short-term fluctuations with long-term trends, without requiring additional hardware or domain-specific prior knowledge.

Experimental validation on real residential datasets confirms the effectiveness of the proposed method. The results demonstrate superior performance over representative baseline models across multiple user scenarios, with strong generalization potential observed in cross-user tests. These findings highlight the framework’s significant practical value for residential PV decomposition tasks.

Despite these positive outcomes, several limitations remain: First, the limited number of users in the generalization validation is insufficient to demonstrate robustness at larger scales or across diverse regions; second, the MTD module may introduce redundant representations under stable load patterns, indicating the need for adaptive structural optimization; and third, robustness under extreme weather conditions has not yet been systematically evaluated, while PV volatility during abnormal meteorological events continues to pose a major challenge.

Future research will advance along three directions: (1) expanding to larger-scale and cross-regional datasets to improve generalization across diverse user groups and climatic conditions; (2) integrating with forecasting models and adaptive deployment strategies within smart grid management systems to support more complex energy optimization scenarios; and (3) conducting systematic studies on resilience under extreme weather and heterogeneous loads (e.g., energy storage, electric vehicles), while evaluating scalability for commercial and industrial PV systems. These efforts aim to further enhance both the scalability and the practical value of the proposed NILM framework in real-world distributed energy environments.

Author Contributions

Conceptualization, Z.P. and W.X.; methodology, Z.X., Y.D. and Z.P.; software, P.C., R.C. and H.Z.; validation, P.C.; formal analysis, Z.X., R.C. and Q.L.; investigation, P.C., Y.D. and H.Z.; resources, Q.L.; data curation, P.C. and R.C.; writing—original draft preparation, Z.X., P.C. and R.C.; visualization, R.C., Y.D. and Q.L.; supervision, Z.X. and W.X.; funding acquisition, Z.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the China Southern Power Grid Corporation Technology Project (037700KC23120014, GDKJXM20231346).

Data Availability Statement

Due to considerations of user privacy, the datasets described herein cannot be made available at any time. Should you require access to the datasets, please contact the corresponding author.

Conflicts of Interest

Authors Zhiheng Xu, Peidong Chen, Ran Cheng, Yao Duan and Qiang Luo were employed by the company Power System Planning Research Center of Guangdong Power Grid Co., Ltd., Guangzhou, China. Authors Huahui Zhang, Zhenning Pan and Wencong Xiao from South China University of Technology, Guangzhou, China. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Tsoumplekas, G.; Athanasiadis, C.; Doukas, D.I.; Chrysopoulos, A.; Mitkas, P. Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach. Energies 2025, 18, 742. [Google Scholar] [CrossRef]
Gkalinikis, N.V.; Nalmpantis, C.; Vrakas, D.; Chatzigeorgiou, S.; Athanasiadis, C.; Doukas, D. RHEA: Residential Home Energy Advisor. In Proceedings of the 2025 10th International Conference on Smart and Sustainable Technologies (SpliTech), Bol and Split, Croatia, 16–20 June 2025; pp. 1–6. [Google Scholar] [CrossRef]
Athanasiadis, C.L.; Papadopoulos, T.A.; Kryonidis, G.C.; Doukas, D.I. A review of distribution network applications based on smart meter data analytics. Renew. Sustain. Energy Rev. 2024, 191, 114151. [Google Scholar] [CrossRef]
Hart, G.W. Nonintrusive Appliance Load Monitoring. Proc. IEEE 1992, 80, 1870–1891. [Google Scholar] [CrossRef]
Dai, S.; Meng, F.; Wang, Q.; Chen, X. DP²-NILM: A Distributed and Privacy-Preserving Framework for Non-Intrusive Load Monitoring. Renew. Sustain. Energy Rev. 2024, 191, 114091. [Google Scholar] [CrossRef]
Athanasiadis, C.L.; Doukas, D.I.; Papadopoulos, T.A.; Barzegkar-Ntovom, G.A. Real-time non-intrusive load monitoring: A machine-learning approach for home appliance identification. In Proceedings of the 2021 IEEE Madrid PowerTech, Madrid, Spain, 28 June–2 July 2021; pp. 1–6. [Google Scholar] [CrossRef]
Li, X.; Wang, Y.; Gao, Y.; Jie, B. Federated Sequence-to-Sequence Learning for NILM with Heterogeneous Data. In Proceedings of the 2025 IEEE International Conference on Power Systems and Smart Grid Technologies (PSSGT), Chongqing, China, 11–13 April 2025; pp. 344–349. [Google Scholar] [CrossRef]
Zai, Z.; Zhao, S.; Zhang, Z.; Li, H.; Sun, N. Non-Intrusive Load Monitoring Based on the Combination of Gate-Transformer and CNN. Electronics 2023, 12, 2824. [Google Scholar] [CrossRef]
Chen, X.; Ardakanian, O. Data efficient energy disaggregation with behind-the-meter energy resources. Sustain. Energy Grids Netw. 2022, 32, 100813. [Google Scholar] [CrossRef]
Chen, J.; Peng, T.; Qian, S.; Ge, Y.; Wang, Z.; Nazir, M.S.; Zhang, C. An error-corrected deep Autoformer model via Bayesian optimization algorithm and secondary decomposition for photovoltaic power prediction. Appl. Energy 2025, 377, 124738. [Google Scholar] [CrossRef]
Fu, X.; Lu, N.; Sun, H.; Zhang, Y. Extraction of representative scenarios for photovoltaic power with shared weight graph clustering. IEEE Trans. Smart Grid 2024, 15, 6158–6170. [Google Scholar] [CrossRef]
Moreno, S.; Teran, H.; Villarreal, R.; Vega-Sampayo, Y.; Paez, J.; Ochoa, C.; Espejo, C.A.; Chamorro-Solano, S.; Montoya, C. An Ensemble Method for Non-Intrusive Load Monitoring (NILM) Applied to Deep Learning Approaches. Energies 2024, 17, 4548. [Google Scholar] [CrossRef]
Zhou, X.; Feng, J.; Li, Y. Non-Intrusive Load Decomposition Based on CNN–LSTM Hybrid Deep Learning Model. Energy Rep. 2021, 7, 5762–5771. [Google Scholar] [CrossRef]
He, J.; Liu, J.; Zhang, Z.; Chen, Y.; Liu, Y.; Khoussainov, B.; Zhu, L. MSDC: Exploiting Multi-State Power Consumption in Non-Intrusive Load Monitoring Based on a Dual-CNN Model. Proc. AAAI Conf. Artif. Intell. 2023, 37, 5078–5086. [Google Scholar] [CrossRef]
Çavdar, İ.H.; Feryad, V. Efficient Design of Energy Disaggregation Model with BERT-NILM Trained by AdaX Optimization Method for Smart Grid. Energies 2021, 14, 4649. [Google Scholar] [CrossRef]
Sykiotis, S.; Kaselimi, M.; Doulamis, A.; Doulamis, N. ELECTRIcity: An Efficient Transformer for Non-Intrusive Load Monitoring. Sensors 2022, 22, 2926. [Google Scholar] [CrossRef] [PubMed]
Talkington, S.; Grijalva, S.; Reno, M.J.; Azzolini, J.A. Solar PV Inverter Reactive Power Disaggregation and Control Setting Estimation. IEEE Trans. Power Syst. 2022, 37, 4773–4784. [Google Scholar] [CrossRef]
Chen, X.; Huang, C.; Zhang, Y.; Wang, H. Season-Independent PV Disaggregation Using Multi-Scale Net Load Temporal Feature Extraction and Weather Factor Fusion. In Proceedings of the 2024 IEEE 8th Conference on Energy Internet and Energy System Integration (EI2), Shenyang, China, 29 November–2 December 2024; pp. 850–855. [Google Scholar] [CrossRef]
He, G.; Huang, Y.; Zhang, Y.; Zhu, Y.; Leng, Y.; Shang, N.; Zeng, J.; Pu, Z. Hybrid Transformer–Convolutional Neural Network Approach for Non-Intrusive Load Analysis in Industrial Processes. Energies 2025, 18, 2464. [Google Scholar] [CrossRef]
Usman, H.M.; ElShatshat, R.; El-Hag, A.H. A Novel Non-Intrusive Framework for Real-Time Disaggregation of behind-the-Meter Solar Generation from Smart Meter Data. Electr. Power Syst. Res. 2023, 225, 109831. [Google Scholar] [CrossRef]
Bu, F.; Dehghanpour, K.; Yuan, Y.; Wang, Z.; Guo, Y. Disaggregating Customer-Level Behind-the-Meter PV Generation Using Smart Meter Data and Solar Exemplars. IEEE Trans. Power Syst. 2021, 36, 5417–5427. [Google Scholar] [CrossRef]
Cheung, C.M.; Kuppannagari, S.R.; Srivastava, A.; Kannan, R.; Prasanna, V.K. Behind-the-Meter Solar Generation Disaggregation at Varying Aggregation Levels Using Consumer Mixture Models. IEEE Trans. Sustain. Comput. 2023, 8, 43–55. [Google Scholar] [CrossRef]
Shin, C.; Joo, S.; Yim, J.; Lee, H.; Moon, T.; Rhee, W. Subtask Gated Networks for Non-Intrusive Load Monitoring. Proc. AAAI Conf. Artif. Intell. 2019, 33, 1150–1157. [Google Scholar] [CrossRef]
Xiong, J.; Hong, T.; Zhao, D.; Zhang, Y. MATNilm: Multi-appliance-task non-intrusive load monitoring with limited labeled data. IEEE Trans. Ind. Inform. 2023, 20, 3177–3187. [Google Scholar] [CrossRef]

Figure 1. The chart displays the search volume for the keyword “NILM” over the past six years, with data sourced from Google Scholar.

Figure 2. The main framework of the model.

Figure 3. PV decomposition effect of four users.

Figure 4. Error Analysis of User 3’s Photovoltaic Output.

Table 1. Symbol Name and Meaning.

Symbol	Meaning	Symbol	Meaning
NILM	Non-Intrusive Load Monitoring	$y_{i}$	The compressed representation of the $x_{i}$
PV	Photovoltaic	$α_{i}$	weights
MTD	Multi-scale Time-series Decomposition	$⊙$	elemental multiplication
MFF	Multi-feature Fusion	$U$	Multi-scale fusion feature representation
TAD	Temporal Attention Decomposition	$U_{t i m e}$	Convergence characteristics of the time dimension
MAE	Mean Absolute Error	$U_{c h a n n e l}$	Convergence characteristics of channel dimensions
MSE	Mean Squared Error	Z	Dual-channel fusion features
SAE	Symmetric Absolute Error	$h_{c}$	Time representation for each channel
CNN	Convolutional Neural Network	W	mapping matrix
LSTM	Long Short-Term Memory	D	Feature Dimension
GRU	Gated Recurrent Unit	d	Model Embedding Dimension
UAS	Universal Adaptive Stabilization	$W_{Q}, W_{K}, W_{V}$	Learning parameters for Q, K, V
CMM	Consumer Mixture Model	$C_{t}$	context-enhanced representation
FNN	Feedforward Neural Network	$E_{t}$	position code
ReLu	Rectified Linear Unit	$b_{0}$	bias value
X	Inputs to the model	${\hat{y}}^{P V}$	Projected value of PV
T	Length of time series	$w_{0}$	projection matrix
C	Number of channels	$L_{P V}$	loss function
k	Number of down-sample	Softmax (·)	activation function
$x_{i}$	Subsequent sequences of multiscale decompositions	MeanPool (·)	Average pooling

Table 2. Comparative Analysis of Existing Methods, “✕” indicates that the method is not used, and “✓” indicates that the method is used.

Methods	Weather Data Fusion	Multi-Scale Modeling	Dual-Channel Fusion
[13,15]	✕	✕	✕
[6]	✓	✕	✕
[14]	✓	✓	✕
Ours	✓	✓	✓

Table 3. The experimental results are shown below, bold text indicates the best results.

User	Model	MAE	MSE	SAE
User 1	SGN-Conv	0.4754	0.4646	0.2350
	MAT-Conv	0.4420	0.4711	0.1968
	Ours	0.1819	0.1007	0.0781
User 2	SGN-Conv	0.2112	0.0913	0.1078
	MAT-Conv	0.2087	0.0981	0.0622
	Ours	0.0883	0.0239	0.0191
User 3	SGN-Conv	0.3644	0.2643	0.1888
	MAT-Conv	0.2090	0.1233	0.0879
	Ours	0.2318	0.2380	0.0436
User 4	SGN-Conv	0.2332	0.0744	0.0257
	MAT-Conv	0.2719	0.2220	0.2111
	Ours	0.1467	0.0695	0.0664

Table 4. Four users’ ablation test results, bold text indicates the best results.

User	Model	MAE	MSE	SAE
User 1	Case 1	0.4736 [0.3980, 0.5358]	0.4669 [0.3679, 0.5758]	0.2257 [0.1806, 0.3376]
	Case 2	0.2589 [0.2238, 0.3056]	0.1775 [0.1358, 0.2091]	0.1037 [0.0883, 0.1311]
	Ours	0.1819 [0.1709, 0.1976]	0.1007 [0.0845, 0.1179]	0.0781 [0.0635, 0.0895]
User 2	Case 1	0.2108	0.0918	0.1074
	Case 2	0.1258	0.0386	0.0468
	Ours	0.0883	0.0239	0.0191
User 3	Case 1	0.2584	0.1534	0.1153
	Case 2	0.1912	0.1083	0.0875
	Ours	0.2318	0.2380	0.0436
User 4	Case 1	0.2047	0.0799	0.0776
	Case 2	0.1693	0.0883	0.1610
	Ours	0.1467	0.0754	0.0664

Table 5. Generalization experiment result, bold text indicates the best results.

User	Model	MAE	MSE	SAE
User 5	SGN-Conv	0.3238	0.2522	0.2584
	MAT-Conv	0.3696	0.2147	0.1636
	Ours	0.1778	0.1007	0.1369
User 6	SGN-Conv	0.3817	0.3098	0.2534
	MAT-Conv	0.2316	0.1526	0.1977
	Ours	0.1530	0.0834	0.0724

Table 6. Sensitivity Analysis.

Lr	Dropout	MAE	MSE	SAE
0.001	0	0.1896	0.1048	0.0990
0.001	0.1	0.1907	0.1013	0.0828
0.001	0.3	0.1819	0.1007	0.0781
0.0001	0.1	0.1876	0.1018	0.1659
0.01	0.1	0.5805	0.4347	0.4347

Table 7. Time Cost Comparison, bold text indicates the best results.

Model	Time/s
SGN-Conv	8.19
MAT-Conv	15.64
Ours	7.75

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xu, Z.; Chen, P.; Cheng, R.; Duan, Y.; Luo, Q.; Zhang, H.; Pan, Z.; Xiao, W. Photovoltaic Decomposition Method Based on Multi-Scale Modeling and Multi-Feature Fusion. Energies 2025, 18, 5271. https://doi.org/10.3390/en18195271

AMA Style

Xu Z, Chen P, Cheng R, Duan Y, Luo Q, Zhang H, Pan Z, Xiao W. Photovoltaic Decomposition Method Based on Multi-Scale Modeling and Multi-Feature Fusion. Energies. 2025; 18(19):5271. https://doi.org/10.3390/en18195271

Chicago/Turabian Style

Xu, Zhiheng, Peidong Chen, Ran Cheng, Yao Duan, Qiang Luo, Huahui Zhang, Zhenning Pan, and Wencong Xiao. 2025. "Photovoltaic Decomposition Method Based on Multi-Scale Modeling and Multi-Feature Fusion" Energies 18, no. 19: 5271. https://doi.org/10.3390/en18195271

APA Style

Xu, Z., Chen, P., Cheng, R., Duan, Y., Luo, Q., Zhang, H., Pan, Z., & Xiao, W. (2025). Photovoltaic Decomposition Method Based on Multi-Scale Modeling and Multi-Feature Fusion. Energies, 18(19), 5271. https://doi.org/10.3390/en18195271

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Photovoltaic Decomposition Method Based on Multi-Scale Modeling and Multi-Feature Fusion

Abstract

1. Introduction

2. Related Work

3. Method

3.1. Multi-Scale Time-Series Decomposition Module

3.2. Multi-Feature Fusion Module

3.3. Temporal Attention-Based Decomposition Module

4. Experiment

4.1. Dataset Description

4.2. Parameter and Hardware Configuration

4.3. Evaluation Metrics

4.4. Main Experimental Results

4.5. Ablation Experiment

4.6. Generalization Experiment

4.7. Sensitivity Analysis and Time Cost Comparison

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI