3.3. Authors
Figure 4 shows the top five most prolific researchers on the MLSG research landscape. The results show that 160 authors have published one or more publications over this study’s time frame. As observed, Nadeem Javaid, based at the COMSATS University Islamabad in Pakistan, is the most prolific researcher, with 23 publications cited 1256 times over the years. The researcher’s most notable publication is “
A survey on deep learning methods for power load and renewable energy forecasting in smart microgrids,” with 599 citations to date [
65]. Published in the open-access journal “
Renewable and Sustainable Energy Reviews,” the paper proposed an FA-XGBoost classifier for detecting electricity theft. The classier accomplished an F1-score of ~94%, precision (~93%), and recall (97%). The integrated VGG-16 module was observed to have higher generalized performance for training and testing data at precision values of ~87% and ~84%, respectively. Lastly, the authors reported that the suggested FA-XGBoost accurately recognized real electricity thieves with 97% recall value. In another study,
Nadeem Javaid and co-workers examined two proposed MIMO DRNN models (ESAENARX and DE-RELM) for estimating the prices and loads of electricity in a smart city [
66]. The findings revealed that the ESAENARX and DE-RELM models outperformed a benchmark model and their sub-models. Lastly, the study reported that the refined and informative characteristics obtained from ESAE enhanced the predicting precision in ESAENARX, whereas optimization enhanced the accuracy of DE-RELM. Another notable study by Javaid and co-workers is “
Towards Efficient Energy Utilization Using Big Data Analytics in Smart Cities for Electricity Theft Detection,” which examined the use of big data and machine learning in the detection of electricity theft in smart cities [
67].
Another significant contributor to the field of MLSG is Alsabaan, M of King Saud University, in Saudi Arabia. The group of Alsabaan, M has published 18 papers on MLSG, which have been cited 234 times over the years. The most notable publication by the researcher is “
Electricity Theft Detection Using Deep Reinforcement Learning in Smart Power Grids,” which has been cited 55 times to date [
68]. The study, published in the journal “IEEE Access”, highlighted that electricity theft cyberattacks can be launched by fraudulent customers by compromising their SMs to report false readings to pay less for their electricity usage. In addition, the study noted that these attacks harmfully affect the power sector since they cause substantial financial loss and degrade the grid performance because the readings are used for energy management. However, the authors warn that using ML and BD from smart grids will also present challenges shortly. Another notable publication by Alsabaan, M and co-authors is “
Clustering and Ensemble Based Approach for Securing Electricity Theft Detectors Against Evasion Attacks,” cited 32 times [
69]. The
IEEE Access publication highlighted that in smart power grids, electricity theft causes huge economic losses to electrical utility companies. Machine learning (ML), especially deep neural network (DNN) models, hold state-of-the-art performance in detecting electricity theft cyberattacks. The findings indicate that the cluster-based detector is not only more robust against evasion attacks but also enhances normal classification accuracy because its training data has more consumption pattern similarity compared to the training data of the global detector which requires a higher level of regularization.
Other notable researchers in the top five of the field include Badr M.M. (SUNY Polytechnic Institute, Utica, United States), and Muhammad Ismail (Tennessee Technological University, United States) and Fouda M.M. (Idaho State University, Pocatello, United States) with 17, 17, and 16 publications, respectively. Overall, the findings indicate high productivity, scientific impact, and technological development in the field, which could be ascribed to numerous factors. For example, the collaboration between scientific stakeholders (e.g., authors/researchers, affiliations, and countries) is considered an integral part of research productivity and scholarly advancement in many fields. This research examined the extent of collaboration between researchers through bibliometric analysis (BA).
Figure 5 shows the network visualization map (NVM) of the co-authorship on the MLSG research landscape using the VOSviewer software.
In
Figure 5, based on a minimum of five publications with at least 50 citations, the NVM displays the seven clusters of collaborations between 30 of the top 141 researchers in MLSG research. Further analysis shows that Alsabaan, M, Badr M.M. and Refaat Shady have the highest total link strength (TLS) of 62, 54, and 46, respectively. The findings indicate that they have the strongest collaboration links among the major researchers on the topic. Furthermore, the top 10 corresponding authors’ countries in the MLSG research are displayed in
Figure 6 and represented in
Table 4 respectively using the RStudio tool (Biblioshiny).
The researchers and institutions in China have produced 157 published documents and have a frequency of 0.107% of the total publications (TP) in the literature. This indicates that China is the prime mover in MLSG, and its corresponding authors are the most relevant and prolific as earlier shown by the findings in
Figure 6. In second place is the United States which has a corresponding authorship frequency of 0.067% and has been a long-term front-runner in the research and development of machine learning worldwide. The impact of US companies such as Samsara (San Francisco, CA, USA), Vates (Atlanta, GA, USA), Andersen Inc. (Bayport, MN, USA), and Oxagile (New York City, NY, USA) is widely reported in the media in the ML world. Other top nations such as India, South Korea, and Pakistan are also important players in the MLSG industry with frequencies of 0.056%, 0.026% and 0.016% respectively on the topic in Scopus. The dominance of the top five nations in MLSG research could be attributed to numerous factors including the quest for improved living standards as well as increased efficiency, cost savings, and the environmental friendliness of processes, products, goods, and services worldwide.
3.9. Review of the MLSG Literature
The papers discussed in the following thematic review were selected from the 4156-document MLSG corpus using a two-stage process. In the first stage, citation-ranked lists were generated for each of the three identified thematic clusters (Smart Grid Security, Power Load Forecasting, and Advanced Energy Management) using the VOSviewer-derived cluster assignments. The top 30 most-cited publications within each cluster were identified as the primary candidate pool. In the second stage, the research team independently reviewed the titles, abstracts, and full texts of candidate papers, applying the same inclusion criteria used for the broader bibliometric analysis. Papers were included in the narrative review if they: (i) explicitly addressed ML applications within the cluster theme, (ii) contributed a methodologically distinct approach or finding not already represented, and (iii) were published in peer-reviewed venues. This process yielded 12–18 anchor papers per cluster, supplemented by additional studies referenced in the highest-cited works. The selection prioritizes citation impact and methodological diversity rather than exhaustive coverage, consistent with the aim of synthesizing key developments rather than cataloging all contributions. The review of the scientific literature on the scientific growth and technological development of the MLSG research landscape was carried out based on the technique described in the literature [
48,
83]. Therefore, the studies on the three identified themes or hotspots in MLSG were critically reviewed.
- (i)
Smart Grid Security (SGS)
The term “
smart grid security” or SGS refers to the policies, procedures, and tools to defend and safeguard the smart grid infrastructure against potential physical and cyberattacks [
84]. With the help of contemporary information and communication technology, the smart grid effectively manages power generation, transmission, distribution, and consumption. However, incorporating these technologies also brings new risks and weaknesses [
85,
86]. The significance of SGS increases as it becomes increasingly digitally dependent and networked. Addressing potential security vulnerabilities is essential to maintain the smart grid infrastructure’s integrity, stability, and reliability and ensure a secure and sustainable energy future.
Security for the smart grid includes several elements, such as authentication and access control, resilience and dependability, monitoring and incident response, data privacy, physical security, cybersecurity, and standards and compliance [
87,
88,
89]. Due to its significance, research into the security of smart grids has been extensively investigated in the literature. The study by Ozay et al. [
71] revealed that ML algorithms are more effective in detecting attacks than attack detection algorithms that use state vector estimation techniques in the proposed attack detection framework. Likewise, machine learning algorithms are more effective in detecting attacks than state vector estimation techniques. Furthermore, the measurements in the smart grid can be classified as secure or under attack using ML methods. Lastly, the attack detection problem can be modeled using decision- and feature-level fusion. In another study on the security of smart grids, Ahmed et al. [
80] demonstrated that detection accuracy is increased using a feature selection method based on genetic algorithms compared to conventionally used ML-based techniques. Therefore, a supervised ML-based method is suggested to identify covert cyber-deception attacks in communication networks for SGs. The performance evaluation demonstrated that when compared to current machine learning-based methods, the suggested scheme significantly improved the accuracy of covert cyber-deception assault detection [
80]. In a separate study, Ahmed et al. [
77] also showed that a covert data integrity attack on the smart grid communications network might seriously jeopardize its security and dependability. A novel unsupervised machine learning-based strategy is suggested to solve this problem. This technique employs the isolation forest algorithm to identify covert data integrity threats in the communication networks of the smart grid. The strategy assumes the attack would provide the generated random forest’s smallest average path length. Using industry-standard IEEE 14-bus, 39-bus, 57-bus, and 118-bus systems to test the proposed method greatly boosts attack detection efficiency [
77]. Wang et al. [
31] proposed a smart grid DoS attack detection methodology based on ML. The authors showed that with the KDD99 dataset, SVM performs exceptionally well. The proposed smart grid DoS attack detection model uses machine learning, features are chosen, PCA is used to reduce the number of dimensions, and SVM outperforms Decision Tree and Naive Bayesian Network in terms of performance [
31]. Panthi [
90] adopted ML techniques to detect anomalies in smart grids. The study evaluated many cutting-edge ML approaches. The findings showed that it is possible to identify and distinguish between natural and artificial disruptions in power systems using ML techniques. Overall, the study showed that ML approaches can reliably identify cyberattacks, including those that use dishonest methods to obscure their tracks. Guihai and Sikdar [
91] examined false data injection attack detection using adversarial machine learning for smart grid demand response. The authors showed that adversarial ML attacks can target deep learning-based attack detectors in distributed DR settings. Therefore, AML attacks can exploit deep learning-based FDI attack detectors in DR settings. To trick deep learning-based FDI attack detection, a new black-box FDI assault methodology is provided. It can create power demands in distributed DR scenarios. The suggested AML framework surpasses existing AML methods in the literature and can drastically reduce the accuracy of FDI detection models. More recently, Aziz et al. [
92] explored the use of effective and unique machine learning models to protect a smart grid by detecting cyber-malware assaults. The results indicated that by using supervised learning and hybrid techniques in a simulated exercise, classification systems that detect FDI assaults function better. False data injection (FDI) attacks can be found using ML in SGs. Therefore, six alternative boosting and feature selection (FS) strategies were used to analyze six supervised learning (SVM-FS) hybrid techniques. It was reported that the application of supervised learning and hybrid methods enhanced the performance of classification algorithms used to identify FDI attacks. Other studies have highlighted the importance of ML methods in protecting SGs. Various themes and topics, including cyber-physical attack development, fake data injection, attack detection, and anomaly detection have been critically examined over the years. Furthermore, the effectiveness of numerous algorithms, such as the SVM, Decision Tree, Naive Bayesian Network, and Convolution approaches, have been explored and exploited in detail. The studies suggest that performance can be enhanced by employing feature selection and PCA to reduce the dimensionality of the data. The emphasis has also been to promptly identify cyberthreats and preserve the integrity of communication networks for SG. These studies have also sought to emphasize the value of unsupervised learning with isolation forests for detecting covert data integrity attacks. Overall, the objective is to utilize cutting-edge and effective ML-based algorithms to protect smart grids from cyberattacks.
Critically, the SGS literature reveals important methodological trade-offs that merit explicit discussion. Supervised ML approaches, such as SVM and random forests, consistently achieve high detection accuracy on benchmark datasets (e.g., IEEE 14-bus, KDD99) but are vulnerable to adversarial perturbations, as demonstrated by Guihai and Sikdar [
91], who showed that deep learning-based FDI detectors can be compromised by carefully crafted adversarial inputs. More recent approaches employing adversarial machine learning (relevance score: 1.54;
Table 7) attempt to address this limitation by incorporating adversarial training, though at the cost of increased computational overhead. Meanwhile, unsupervised approaches using isolation forests [
77] offer the practical advantage of requiring no labeled attack data, but their detection sensitivity may be lower for novel, previously unseen attack patterns. A further emerging tension exists between centralized and federated ML architectures: while centralized models benefit from richer training datasets, federated learning approaches (relevance score: 1.84;
Table 7) address critical data privacy concerns in smart grid deployments, particularly in contexts where utility companies are unwilling to share raw consumer data. The application of graph neural networks (GNNs) (relevance score: 1.72;
Table 7) to grid topology-aware intrusion detection represents a promising frontier but remains under-studied relative to its potential, with fewer than 100 publications identified in the sensitivity analysis.
- (ii)
Power Load Forecasting (PLF)
The term “
power load forecasting” or PLF estimates future electricity demand or consumption for a certain area or power system over a specified time frame. The objective of PLF is to effectively plan and manage resources, ensure grid stability, and prevent shortages or overproduction of energy. Due to its critical importance, energy suppliers, utility companies, and grid operators strongly depend on PLF for smooth operations. Typically, PLF is performed using various methods, including statistics and time-series analysis. Recently, machine/deep learning algorithms and artificial intelligence models have been proposed to carry out PLF. The PLF approach’s major advantage is that it considers past load data, weather patterns, seasonal trends, economics, and other pertinent variables. Based on the foregoing, utilities can effectively execute demand response programs, manage maintenance schedules, and optimize electricity generation and distribution through accurate PLF. Various studies have examined the use of ML to execute PLF. Ahmad and Chen [
81] examined the potential of three different ML models for predicting district-level long-term and medium-term energy demand in SGs. The study showed that district-level medium- and long-term energy demand can be accurately and precisely forecast using ML. The ML-based models examined in the study were artificial neural networks with nonlinear autoregressive exogenous multivariable inputs, multivariate linear regression, and adaptive boosting models. Furthermore, the accuracy of the models was enhanced using various data tests and accomplished feature extraction, data modification, and outlier detection. The study showed that the models offer suitable forecasting intervals and support the consolidation of district-level variances and spatiotemporal energy usage inconsistencies in a smart grid setting. Ungureanu et al. [
93] examined the use of ML for industrial load forecasting in an SG. The findings showed that although forecasting the energy behavior in the industry is challenging, using ML can assist particularly with forecasting and optimizing the loads. In addition, the authors showed that ML could help to reduce balancing costs and foresee network issues to improve load forecasting for industrial consumers. The forecasts of detailed energy behavior can enhance the integration of industrial users into SGs. For example, data on high-frequency recording intervals and real-time processing can be used to justify investments in the SG [
93]. Syed et al. [
94] evaluated the performance of distributed ML for load forecasting in a smart grid. The study employed big data platforms such as Apache Spark and Apache Hadoop for PLF. The findings showed that Spark produced excellent accuracy while parallelizing the load forecasting process [
94]. Bahaghighat et al. [
95] employed ML and computer vision to remotely estimate the angular velocity of wind turbines in an SG. The authors demonstrated that the angular velocity of wind turbines in SGs can be reliably predicted using ML techniques and vision sensor networks with 95.4% accuracy. Furthermore, the study showed that computer vision algorithms (such as FAST, SIFT, SURF, BF, FLANN, AE, and SVM) can be used to precisely pinpoint the hub and track the existence of the blade in successive frames of a video stream. Another earlier study, Bahaghighat et al. [
96], showed that convolutional neural networks and video mining could be utilized to remotely evaluate wind turbines’ angular velocity. In Tiwari et al. [
25], the authors opined that using ML-based models for predicting energy use is a clever step toward creating a smart city. In addition, the study observed that the support vector machine (SVM) method produced the most accurate results when used with the dataset for Smart Grid Stability. Overall, the authors observed and reported that transforming traditional grids into smart grids using sensors and ML algorithms can assist in smart city creation. Cebekhulu et al. [
97] also demonstrated the potential of ML algorithms for predicting energy consumption. The study carried out a performance evaluation of ML algorithms for SG energy demand–supply prediction. Overall, the studies on FLP in the literature have focused on using ML in SGs to classify energy imbalances, identify energy consumers, estimate wind turbine velocities, and forecast load. ML-based algorithms have also sought to maximize energy efficiency and enable smart city applications. The topics typically cover energy demand–supply forecasting to short-term load forecasting. The ultimate objective is to create effective models and algorithms that promote smart grid technology and energy optimization.
Across the PLF literature, a persistent tension exists between model accuracy and interpretability. Deep learning architectures, particularly LSTM networks [
75] and transformer models (relevance score: 1.68;
Table 7), consistently outperform classical ML methods on benchmark load forecasting tasks but operate as black boxes, limiting their practical adoption in regulated utility environments where decision auditability is required. Explainable AI (XAI) techniques (relevance score: 1.64;
Table 7) have begun to bridge this gap, though their application in PLF remains nascent. Federated learning approaches to load forecasting (relevance score: 1.84;
Table 7) offer an attractive solution to the data-sharing barrier among competing utilities, but existing results suggest a measurable accuracy penalty relative to centralized models trained on pooled data. Physics-informed neural networks (PINNs) (relevance score: 1.61;
Table 7) represent an emerging paradigm that constrains model outputs to comply with known physical laws of power systems, potentially improving generalization to out-of-distribution load conditions, an important practical consideration given the increasing penetration of electric vehicles and distributed energy resources. These trade-offs suggest that future PLF research should move beyond benchmark accuracy comparisons toward multi-criteria evaluation frameworks that explicitly account for interpretability, communication efficiency, and physical plausibility.
- (iii)
Advanced Energy Management
The concept of “
Advanced Energy Management,” or AEM, involves using cutting-edge technologies to maximize efficient and sustainable production, delivery, and energy use. AEM integrates advanced analytics, energy storage systems, renewable energy sources, and smart grid technology. Selected components of AEM include demand response initiatives, incorporating renewable energy sources, energy storage options, energy analytics, and grid optimization. AEM strategy aims to promote a cleaner, greener energy future through reduced reliance on fossil fuels, fewer greenhouse gas emissions, grid stability, and increased overall energy efficiency. Due to its importance to global sustainability, various researchers have extensively examined AEM through various empirical and numerical investigations reported in the literature. Li et al. [
98] examined the potential of ML in predicting the comfort level of users in smart grid environments using three widely used supervised learning algorithms. The findings revealed that ML algorithms can forecast consumer comfort levels for novel gadget usage patterns. The prediction accuracy of the algorithms was influenced by the number of training samples (Li et al. [
98]). The study by Azad et al. [
99] highlighted the potential of ML in transforming smart grids. The study highlighted that ML can help SGs intelligently adapt to unexpected changes, for instance changes in consumer demand, power outages, unexpected decreases/increases, intermittencies in the output of renewable energy sources, or catastrophic events. In addition, reinforced learning can help with energy dispatch decisions and trigger demand management signals, which could balance the supply and demand for electricity. Other ML applications include data authentication and identifying and preventing aberrant behavior, intrusion, cyberattacks, and criminal actions. Babar et al. [
78] proposed an ML-based, secure, and robust engine for demand-side management of an IoT-enabled smart grid. The study showed that a safe demand-side management engine is recommended for the grid powered by the Internet of Things. For the Internet of Things (IoT)-enabled grid, a secure demand-side management (DSM) engine is proposed. Additionally, the study showed that an ML classifier could predict dishonest entities in a resilient model, which could help manage intrusions into the smart grid. Hence, the authors showed that advanced energy management and interface-regulating agents can ensure the best possible energy utilization. Ahmed et al. [
79] proposed an ML-based energy management strategy for renewable energy districts and smart grids. The findings showed that an effective energy management model (EMM) that incorporates renewable energy sources with smart grids can be created using machine learning (ML) and Gaussian process regression (GPR). Energy consumers and the grid gain from the proposed adaptive service level agreement (SLA) between these two parties. To demonstrate the proposed model’s validity, its outcomes are carefully compared with those of traditional optimization (GA and PSO)-based EMM [
79]. Min et al. [
100] proposed an innovative technique for enhancing the observability of automated SGs based on stochastic ML. Simulations and numerical results on a real system confirmed that the suggested method ensured the distribution network’s visibility before and during reconfiguration in the planning time frame. Krč et al. [
101] employed ML-based node characterization to assess an SG’s flexibility in demand responses. The study findings showed that the network flexibility potential could be measured using ML-based node characterization [
102]. In contrast, artificial neural networks could categorize historical demand data from network substations. Other studies have also demonstrated that ML techniques are useful for uncertainty quantification in SG applications with algorithms comparing stability prediction signal processing, condition monitoring, and observability enhancement have also been extensively reported.
Although this study focuses on smart grid applications, it is important to contextualize MLSG research within the wider electrical systems domain to appreciate the breadth of ML’s impact. Machine learning has been extensively applied across power system components beyond smart grids, including high-voltage transmission infrastructure, distribution networks, power electronics, and rotating electrical machinery.
In the area of power system fault detection and protection, ML classifiers, particularly random forests, support vector machines, and deep learning architectures, have demonstrated high accuracy in detecting and classifying faults in transmission lines, underground cables, and power transformers [
103]. These models analyze current, voltage, and impedance waveforms in real-time, enabling faster protective relay coordination and reducing fault clearance times compared to conventional threshold-based schemes [
104]. Transformer condition monitoring represents another critical ML application in electrical systems. Dissolved gas analysis (DGA) combined with ML classifiers has enabled early identification of incipient faults including partial discharge, arcing, and overheating in power transformers, which are among the most critical and expensive assets in electrical networks [
105]. ML-enhanced DGA overcomes limitations of traditional IEC ratio methods by learning complex, nonlinear relationships between dissolved gas concentrations and fault types.
In power quality monitoring, ML models have been applied to automatically detect, classify, and localize power quality disturbances, including voltage sags, swells, interruptions, harmonics, and flicker phenomena that affect both grid-connected and off-grid electrical installations [
106]. These capabilities are particularly relevant in the context of increasing penetration of nonlinear loads and distributed generation, which intensify power quality challenges across distribution networks. The application of ML to electric motor drives and industrial systems has focused on induction motor fault diagnosis, bearing defect detection, and predictive maintenance scheduling [
107]. Vibration signals, stator current spectra, and thermal imaging data have been combined with ML algorithms including convolutional neural networks and long short-term memory networks to identify mechanical and electrical faults at early stages, supporting condition-based maintenance strategies in industrial facilities.
From the results, the bibliometric findings not only reveal the evolution of research trends but also provide important insights into the technological trajectory of smart grid systems. The dominance of research clusters such as Smart Grid Security, Load Forecasting, and Energy Management reflects the critical technological priorities required for next-generation grid infrastructures. The strong emphasis on machine learning-based cybersecurity, particularly in areas such as false data injection and intrusion detection, highlights the increasing vulnerability of smart grids to cyberthreats. This trend suggests that future smart grid architectures must integrate real-time, intelligent security frameworks capable of adaptive threat detection. The growing use of deep learning models further implies a shift toward automated, data-driven security systems, although challenges related to interpretability and deployment in real-time environments remain.
Similarly, the prominence of load forecasting research, particularly using deep learning models such as LSTM and hybrid architecture, indicates a technological transition toward predictive and proactive grid management. Accurate forecasting enables better demand-response strategies, renewable energy integration, and operational efficiency. However, the reliance on large datasets and computational resources suggests that future systems must incorporate edge computing and scalable data infrastructures to support real-time forecasting capabilities. In the domain of advanced energy management, the increasing adoption of reinforcement learning reflects a shift toward autonomous and self-optimizing smart grid systems. These approaches enable dynamic decision-making in energy distribution, storage, and consumption. Nevertheless, practical deployment remains constrained by issues such as training instability, model reliability, and integration with existing grid infrastructure.
Furthermore, the geographical distribution of publications, with leading contributions from countries such as China and the United States, indicates that technological advancements in smart grids are closely tied to national investments in digital infrastructure and energy innovation. This highlights the need for global collaboration and standardization to ensure interoperability and scalability of smart grid technologies. Overall, the bibliometric analysis suggests that the future of smart grid development will be characterized by the convergence of artificial intelligence, big data analytics, and distributed computing, enabling more resilient, efficient, and intelligent energy systems. However, addressing challenges related to data quality, model interpretability, cybersecurity risks, and real-time implementation remains essential for translating these research advancements into practical, large-scale deployments.
The AEM literature similarly exhibits important trade-offs between solution optimality and real-time feasibility. Reinforcement learning, particularly deep reinforcement learning (relevance score: 1.76;
Table 7), has demonstrated superior long-term reward optimization for energy dispatch and demand response tasks [
73], but convergence times and sample complexity remain practical barriers for real-time grid management. Model-based approaches incorporating digital twin frameworks (relevance score: 1.58;
Table 7) offer a promising avenue for accelerating RL training in simulated environments before deployment, though the fidelity of digital twin representations to real grid dynamics is an open challenge. A key unresolved tension in the AEM literature concerns the centralized versus decentralized architecture debate: centralized optimization achieves global optimality but is computationally intractable at scale, while multi-agent reinforcement learning and federated approaches enable scalability at the cost of sub-optimal coordination. Explainability also emerges as a critical gap: while recent XAI methods have been applied to fault detection, their use in AEM decision-support systems particularly in regulatory or consumer-facing applications remains limited. Future research should prioritize hybrid approaches that combine the optimality of model-based methods with the adaptability of data-driven techniques.
3.10. Critical Thematic Synthesis of the MLSG Research Landscape
The keyword co-occurrence analysis presented in
Section 3.7 identified three structurally coherent thematic clusters within the MLSG corpus, each characterized by a distinct set of high-frequency keywords, dominant citation networks, and concentrated authorship activity. The present section synthesizes the most significant literature within each cluster, explicitly anchoring the review to the bibliometric structure of the field rather than providing a general narrative account. For each cluster, the dominant papers are identified by citation count within the 4156-document corpus, the core keywords defining the cluster boundary are noted, and cross-cluster relationships are highlighted where they reflect genuine intellectual interdependence in the literature.
- (i)
Cluster 1—Smart Grid Security (SGS)
Bibliometric cluster profile: Cluster 1 is the largest and most densely connected thematic cluster in the MLSG keyword co-occurrence map, anchored by the keywords
smart grid security,
false data injection,
cyberattack,
intrusion detection,
anomaly detection,
support vector machine, and
deep learning. The cluster contains 29 keywords and accounts for the highest total link strength (TLS) among the four clusters, reflecting the dense co-citation relationships between security-focused studies. Within the 4156-document corpus, the most highly cited publications in this cluster include Ozay et al. [
71] with 596 citations, Ahmed et al. [
77] with 264 citations, Ahmed et al. [
80] with 95 citations, and Zhang et al. [
38] with 179 citations. These four papers collectively account for a substantial proportion of the cluster’s citation mass and serve as the intellectual anchors for the security sub-domain.
Synthesis of cluster themes: The dominant research question across Cluster 1 concerns the detection and classification of malicious intrusions into smart grid communication and measurement infrastructure, particularly false data injection attacks (FDIAs) and denial-of-service (DoS) attacks. The most-cited work in this cluster, Ozay et al. [
71], is the bibliometrically defining study, establishing that ML algorithms particularly SVM and feature-level fusion approaches outperform conventional state vector estimation techniques in detecting coordinated attacks on smart grid measurement systems. This finding has been replicated and extended across the cluster, confirming SVM’s dominance as the baseline detection model in the pre-deep-learning era of the literature. The cluster’s citation network shows a clear temporal transition: studies published before 2018 are predominantly SVM and tree-based, while post-2018 publications shift toward deep learning architectures, particularly autoencoders and CNN-based anomaly detection, consistent with the overlay visualization of keyword emergence shown in
Figure 11b.
Ahmed et al. [
77], the second most-cited paper in this cluster, advanced the field by proposing unsupervised ML using isolation forests for detecting covert data integrity attacks, moving beyond the supervised paradigm and addressing the critical challenge of operating without labeled attack data, a practical constraint in real-world grid deployments. This methodological contribution is reflected in the bibliometric co-occurrence map, where the keyword
isolation forest forms a secondary bridge node linking Cluster 1 to Cluster 3 (Advanced Energy Management), suggesting that anomaly detection techniques developed in the security domain have been transferred to energy management applications. Ahmed et al. [
80] further demonstrated that genetic algorithm-based feature selection substantially improves detection accuracy over standard ML pipelines, a finding that has been cited across multiple sub-clusters as evidence for the importance of dimensionality reduction in high-dimensional smart grid data. Zhang et al. [
38], a review paper with 179 citations, serves as the cluster’s most-cited synthesis work, mapping the landscape of ML-based FDIA detection and establishing the taxonomy of attack types and corresponding ML countermeasures that subsequent empirical studies have used as a reference framework.
The cluster also contains several studies examining adversarial robustness, a theme that represents the most recent growth frontier within SGS research. Guihai and Sikdar [
91] demonstrated that deep learning-based FDIA detectors are themselves vulnerable to adversarial ML attacks in distributed demand response settings; a finding that introduces a reflexive vulnerability into the ML security paradigm and explains the emergence of the keyword
adversarial machine learning as a high-growth term in the overlay visualization. More recently, Aziz et al. [
92] systematically compared six supervised-learning and hybrid feature-selection combinations for FDIA detection, reporting that ensemble and hybrid approaches consistently outperform single-classifier models, making it consistent with the broader bibliometric trend toward ensemble methods observed across all three clusters.
Cross-cluster linkage: Cluster 1 shares boundary keywords with Cluster 2 (neural networks, deep learning) and with Cluster 3 (smart meters, demand response), reflecting the fact that security-focused studies increasingly incorporate forecasting components and that electricity theft detection, a security problem, relies on the same consumption pattern modeling techniques used in load forecasting.
- (ii)
Cluster 2—Power Load Forecasting (PLF)
Bibliometric cluster profile: Cluster 2 is anchored by the keywords
load forecasting,
electricity demand,
renewable energy,
LSTM,
neural networks,
deep learning,
time series,
short-term forecasting, and
wind power. It contains 23 keywords and exhibits the highest average citation count per node among the three thematic clusters, reflecting the maturity and citation concentration of PLF research. The dominant papers within this cluster in the 4156-document corpus are Hafeez et al. [
75] with 840 citations which is the single most-cited paper in the entire MLSG corpus, Ahmad and Chen [
81] with 88 citations, and Syed et al. [
94] with moderate citation counts, along with the review by Ahmad et al. [
23] with 618 citations, which bridges Clusters 2 and 3. These papers define the intellectual center of gravity for the forecasting sub-domain.
Synthesis of cluster themes: The bibliometric structure of Cluster 2 reveals a clear methodological hierarchy: the most-cited work in the cluster, and in the entire corpus, is Hafeez et al. [
75], which proposed an optimized deep learning LSTM model for electric load forecasting using genetic algorithm-based feature selection. Its position as the most-cited paper across all 4156 documents confirms that LSTM-based architectures, particularly when combined with evolutionary feature selection, represent the current methodological consensus for PLF in the MLSG domain, a finding that is directly corroborated by the high co-occurrence frequency of the keywords
LSTM and
feature selection in Cluster 2 of
Figure 11a.
The second most-cited work bridging this cluster, Ahmad et al. [
23] with 618 citations, is a review of data-driven probabilistic ML in smart energy systems that serves as the theoretical anchor for the cluster’s shift from deterministic to probabilistic forecasting approaches. Its high citation count reflects the field’s recognition that capturing forecast uncertainty not merely point estimates is critical for operational grid management under increasing renewable penetration. This probabilistic turn is also reflected in the co-occurrence map, where keywords such as
uncertainty quantification,
probabilistic forecasting, and
Monte Carlo appear as mid-density nodes in the outer ring of Cluster 2.
Ahmad and Chen [
81] contributed a systematic comparison of three ML model families—ANN-NARX, multivariate linear regression, and adaptive boosting for district-level medium- and long-term energy demand forecasting in smart grids—demonstrating that ensemble methods incorporating feature extraction and outlier detection produce superior spatiotemporal forecasting accuracy. This paper is representative of a broader sub-theme within Cluster 2 that focuses on the scalability of ML forecasting from building-level to district-level granularity, a research direction reflected in the keyword co-occurrence of
smart city,
energy consumption, and
smart meter within the cluster boundary. Ungureanu et al. [
93] and Cebekhulu et al. [
97] both confirmed that ML-based industrial and system-level load forecasting substantially reduces balancing costs and network stress, particularly in settings with high industrial demand variability themes that link Cluster 2 to Cluster 3 through the shared keyword
demand response.
Syed et al. [
94] demonstrated that distributed ML using Apache Spark and Hadoop platforms can parallelize load forecasting at scale without sacrificing accuracy, addressing one of the practical deployment barriers identified in Cluster 2: the computational cost of deep learning models at grid-scale inference. This finding is consistent with the emergence of the keyword
edge computing in the high-growth zone of the overlay visualization (
Figure 11b), suggesting that scalable, edge-deployable forecasting architectures represent the next methodological frontier for this cluster.
Cross-cluster linkage: Cluster 2 is most strongly linked to Cluster 3 through the shared keyword demand response, reflecting the operational relationship between accurate load forecasts and energy management decisions. The keyword renewable energy also bridges Clusters 2 and 3, confirming that renewable integration is simultaneously a forecasting problem and an energy management optimization challenge in the MLSG literature.
- (iii)
Cluster 3—Advanced Energy Management (AEM)
Bibliometric cluster profile: Cluster 3 is anchored by the keywords
energy management,
demand response,
reinforcement learning,
microgrid,
smart meters,
optimization,
IoT,
electricity theft, and
energy storage. It contains 31 keywords, the largest cluster vocabulary, and represents the broadest thematic scope among the three identified hotspots, encompassing both consumer-side and grid-side energy optimization applications. The dominant papers within this cluster in the 4156-document corpus are Babar et al. [
78] with 159 citations, Ahmed et al. [
79] with 124 citations, Ji et al. [
73] with a high citation count on real-time DRL-based energy management, Kotsiopoulos et al. [
33] with 320 citations, and the review by Ahmad et al. [
36] with 461 citations. The latter two are the most-cited review papers in the cluster and serve as the principal reference frameworks for the AEM sub-domain.
Synthesis of cluster themes: The bibliometric structure of Cluster 3 reflects a field in active methodological transition. The most-cited review in the cluster, Ahmad et al. [
36] with 461 citations, provides the foundational taxonomy of ML and deep learning applications in smart manufacturing and smart grid systems, establishing the range of tasks from anomaly detection to energy scheduling that fall within the AEM domain. Its high citation count relative to empirical papers suggests that the field is still consolidating its conceptual framework, with review papers serving a disproportionate scaffolding function, a pattern consistent with a rapidly expanding research area that has not yet fully standardized its methodological vocabulary. The second most-cited work anchoring this cluster, Kotsiopoulos et al. [
33] with 320 citations, is a review examining ML and deep learning in smart manufacturing in the context of the smart grid paradigm, providing a cross-domain perspective that links industrial energy management to grid optimization. Its high citation count within Cluster 3 reflects the keyword co-occurrence of
manufacturing,
industry 4.0, and
IoT within the cluster boundary, confirming that AEM research is increasingly drawing on industrial data science methodologies and that the smart grid is conceptualized as part of a broader industrial digitization ecosystem rather than as an isolated power system component.
Among empirical papers, Babar et al. [
78] with 159 citations is the most-cited study in Cluster 3, proposing a secure ML-based demand-side management engine for IoT-enabled smart grids that simultaneously addresses energy optimization and intrusion prevention—a dual contribution that explains the paper’s bridging position between Clusters 1 and 3 in the co-occurrence map. The co-occurrence of
IoT and
demand response as cluster-defining keywords confirms that consumer-side energy management in connected environments is the primary application context within this cluster. Ahmed et al. [
79] advanced this direction by proposing an ML-based energy management model incorporating Gaussian process regression and adaptive service level agreements between energy consumers and smart grid operators, demonstrating that ML-driven AEM can achieve performance comparable to traditional optimization approaches such as genetic algorithms and PSO while offering substantially greater adaptability.
Ji et al. [
73], representing the reinforcement learning sub-theme within Cluster 3, demonstrated real-time DRL-based energy management in a microgrid environment using Deep Q-Networks, establishing RL as the most promising control paradigm for autonomous grid operation under uncertainty. This finding is directly corroborated by the high co-occurrence frequency of
reinforcement learning and
deep reinforcement learning as growth keywords in the overlay visualization, where they occupy the high-density recent emergence zone of
Figure 11b, confirming that RL-based AEM represents both the current frontier and the fastest-growing sub-theme within the cluster. Li et al. [
98] and Azad et al. [
99] further contextualized RL within Cluster 3, demonstrating its capacity to model consumer comfort prediction and to enable adaptive grid responses to demand volatility and renewable intermittency respectively, both themes reflected in the cluster’s keyword co-occurrence of
consumer behavior,
renewable energy, and
energy storage.
Cross-cluster linkage: Cluster 3’s breadth and its high number of bridge keywords to both Cluster 1 (smart meters, anomaly detection) and Cluster 2 (demand response, renewable energy) confirm its integrative role in the MLSG landscape. The cluster effectively constitutes the applied synthesis domain of the field where the security intelligence developed in Cluster 1 and the forecasting accuracy achieved in Cluster 2 are operationalized into real-time grid control, consumer management, and energy optimization systems.
3.11. Overview of ML Models, Application Areas, Challenges, and Current Trends in MLSG Research
- (i)
Machine learning models used in MLSG research
The bibliometric analysis reveals that a diverse range of ML models has been deployed across the MLSG research landscape, each offering distinct strengths suited to specific smart grid tasks. Based on the reviewed literature, the principal ML models applied in smart grid contexts can be broadly categorized into supervised learning, deep learning, reinforcement learning, and unsupervised approaches [
6,
108].
Supervised learning models continue to dominate the MLSG corpus. Support Vector Machines (SVMs) have been widely applied to intrusion detection, electricity theft classification, and fault identification tasks due to their strong generalization capability in high-dimensional feature spaces [
39]. Decision Trees and their ensemble variants, particularly random forests and Gradient Boosted Trees including XGBoost, have been extensively used for load classification, demand forecasting, and anomaly detection owing to their interpretability and robustness to noisy data [
65]. Logistic Regression has served as a reliable baseline classifier in binary attack detection and demand response prediction tasks across numerous benchmarking studies [
109].
Deep learning models have gained significant traction in the MLSG landscape over the past decade. Long short-term memory (LSTM) networks and Gated Recurrent Units (GRUs) have demonstrated strong performance in sequential time-series tasks, particularly short-term and medium-term load forecasting, where capturing temporal dependencies is critical [
69,
110]. Convolutional neural networks (CNNs) have been applied to power quality disturbance classification and smart meter data analysis, effectively extracting spatial and local features from signal data [
35,
111]. Hybrid CNN-LSTM architecture has emerged as a leading approach for combined spatial–temporal modeling in energy consumption prediction and grid stability monitoring [
34].
Reinforcement learning (RL), and particularly deep reinforcement learning (DRL), has become a prominent paradigm for real-time energy management and demand response optimization in microgrids and grid-connected storage systems. DRL agents, including Deep Q-Networks (DQN) and Proximal Policy Optimization (PPO), learn optimal control strategies through interaction with simulated grid environments, enabling adaptive decision-making under conditions of uncertainty [
73]. Unsupervised and semi-supervised models, including autoencoders, isolation forests, and clustering algorithms such as K-Means and DBSCAN, have been applied to anomaly detection, false data injection identification, and consumer behavior segmentation, particularly in settings where labeled training data is scarce or costly to obtain [
77]. Together, these model families reflect the breadth and methodological maturity of ML deployment across the MLSG research landscape.
Based on the systematic review of the MLSG literature, ten principal application areas have been identified, reflecting the full operational scope of smart grid systems. They are: (1) intrusion and cyberattack detection, where SVM and deep autoencoders identify false data injection and DoS attacks in real time; (2) electricity theft detection using supervised classifiers on smart meter consumption data; (3) short- and long-term load forecasting using LSTM and ensemble regressors; (4) renewable energy output forecasting to manage variability from solar and wind sources; (5) demand response and energy management via reinforcement learning agents; (6) fault detection and grid stability monitoring from voltage and frequency measurements; (7) power quality disturbance classification using CNN and wavelet-based models; (8) smart meter analytics and consumer behavior profiling through clustering algorithms; (9) predictive maintenance of transformers and other grid assets using sensor data; and (10) electric vehicle charging optimization and vehicle-to-grid scheduling as EV penetration continues to rise globally.
- (ii)
Challenges of applying ML in smart grids
Despite the significant progress documented in the MLSG literature, the application of ML in smart grid environments faces a number of persistent and interrelated challenges that continue to temper the translation of research advances into operational deployment.
Firstly, smart grid ML models depend heavily on large volumes of high-quality, labeled operational data. In practice, real-world grid datasets are frequently incomplete, imbalanced, noisy, or corrupted by measurement errors and sensor faults. Furthermore, smart meter and operational data are subject to stringent privacy regulations, including GDPR and equivalent national frameworks, which restrict data sharing between utilities, researchers, and technology developers, thereby limiting the size and diversity of available training datasets [
32].
Secondly, many critical smart grid ML tasks, including fault detection, cyberattack identification, and electricity theft classification, are characterized by severely imbalanced class distributions in which anomalous events represent a small minority of all observations. Standard ML models trained on imbalanced data tend to be biased toward majority classes, producing high overall accuracy but poor recall for the rare events of greatest operational importance. Addressing this challenge typically requires specialized resampling strategies, cost-sensitive learning, or purpose-built anomaly detection frameworks [
112].
Thirdly, the increasing adoption of complex deep learning architectures, while delivering state-of-the-art predictive performance, introduces significant challenges around model transparency and interpretability. Grid operators, regulators, and utility engineers require meaningful explanations for ML-driven decisions, particularly in safety-critical contexts such as fault protection and demand response. The opacity of black-box models represents a substantial barrier to regulatory approval and operational trust, driving growing interest in explainable AI (XAI) frameworks such as SHAP and LIME for smart grid applications [
113].
Lastly, smart grid systems operate at considerable scale, encompassing millions of smart meters, thousands of substations, and complex multi-layered communication networks, and they require ML inference at timescales ranging from milliseconds in protective relaying to seconds in real-time energy management. Many computationally intensive deep learning models cannot currently satisfy these latency and throughput requirements without significant hardware investment or model compression, which limits their applicability in resource-constrained edge computing environments [
29].
- (iii)
Current trends and emerging directions in MLSG research
Several converging trends are reshaping the MLSG research frontier. Federated learning has emerged as a practical response to privacy constraints, allowing utilities to train shared models on decentralized data without exposing raw consumption records [
37]. Transformer-based architectures, originally developed for language tasks, are demonstrating strong performance in load and renewable generation forecasting by capturing long-range temporal dependencies more effectively than LSTM models [
102]. Digital twin technology is being paired with ML to generate synthetic training data for rare fault scenarios and cyberattack simulations, addressing the persistent scarcity of labeled real-world data. Graph neural networks are gaining traction for topology-aware grid modeling, capturing the structural dependencies of bus-line networks that standard architectures cannot represent. Finally, the integration of ML with physics-based models including physics-informed neural networks that embed Kirchhoff’s laws as training constraints is improving prediction reliability and physical plausibility, while explainable AI tools such as SHAP and LIME are increasingly being embedded into MLSG pipelines as a prerequisite for responsible operational deployment [
92]. Additionally, the comparative analysis of ML approaches in smart grid applications is shown in
Table 8.
Recent advances published since 2022 have further refined the MLSG methodological frontier across all three thematic clusters. In Smart Grid Security, large-scale intrusion detection systems incorporating multi-head attention mechanisms and contrastive learning have demonstrated improved generalization across diverse attack types, including zero-day FDIA variants for which labeled training data does not yet exist. Ensemble-based anomaly detectors combining isolation forests with gradient boosted classifiers have shown statistically significant improvements over single-model baselines when evaluated on heterogeneous smart meter datasets from multiple grid operators, suggesting that model diversity is a more reliable predictor of out-of-distribution performance than raw model complexity. In Power Load Forecasting, temporal fusion transformers and patch-based time-series foundation models pre-trained on large energy corpora have achieved state-of-the-art accuracy on multiple benchmark datasets including GEFCOM and PecanStreet, outperforming LSTM baselines by margins of 8–14% on RMSE metrics while requiring substantially less task-specific fine-tuning data, a practically significant finding for utilities operating in data-scarce environments. In Advanced Energy Management, multi-agent deep reinforcement learning frameworks incorporating communication protocols between prosumer nodes have demonstrated robust demand-response coordination in simulated distribution networks with high renewable penetration, reducing peak load variance by up to 23% compared to centralized optimization baselines. However, these advances also bring into sharp relief several persistent gaps and challenges that constrain the translation of research results into operational deployment. First, the vast majority of studies in all three clusters continue to rely on simulated or semi-synthetic grid environments, with less than 12% of empirical papers in the 4156-document corpus reporting results from field trials or live grid deployments; this evaluation gap represents the most structurally significant barrier between MLSG research and practical adoption. Second, model interoperability and standardization remain unaddressed: there is currently no widely adopted benchmark dataset or evaluation protocol shared across MLSG sub-domains, making cross-study comparison methodologically unreliable and impeding reproducibility. Third, the computational overhead of state-of-the-art deep learning and RL architectures frequently exceeds the inference latency budgets of real-time grid control applications, and lightweight model compression techniques such as quantization-aware training and structured pruning have received disproportionately little attention in the MLSG literature relative to their practical importance. Fourth, adversarial robustness testing against adaptive and coordinated multi-point attacks, as opposed to single-point perturbations, remains an underdeveloped area despite its direct relevance to operational grid security. Addressing these gaps through dedicated benchmark development, real-world pilot studies, and methodologically rigorous adversarial evaluation protocols represents the most critical near-term research priority for the MLSG community.
The bibliometric findings presented in this study, while scoped to the smart grid domain, carry meaningful implications for the broader electrical power engineering field. The three thematic clusters identified in the keyword co-occurrence analysis, namely Smart Grid Security, Power Load Forecasting, and Advanced Energy Management, each correspond to a class of ML methodology whose underlying algorithmic logic is not intrinsically bound to the smart grid context. The discussion of transferability is therefore not a peripheral observation but a direct corollary of the bibliometric structure of the field. As MLSG research matures, its methodological outputs are progressively diffusing into adjacent electrical systems domains, a process already visible in the cross-domain citations of several high-impact papers within the corpus.
The ML classifiers that dominate Cluster 1, including SVM, random forest, and deep autoencoders developed for anomaly and attack detection, are methodologically equivalent to the classifiers applied in power system fault detection and protection. Studies operating outside the smart grid context have demonstrated that these same architectures achieve high accuracy in detecting and classifying faults in transmission lines, underground cables, and power transformers [
103], analyzing current, voltage, and impedance waveforms in real time to enable faster protective relay coordination compared to conventional threshold-based schemes [
104]. The intellectual transfer between MLSG security research and the broader fault detection literature is therefore methodologically direct, and the citation base of papers bridging both domains, such as Ozay et al. [
71] and Aziz et al. [
92], confirms that cross-pollination is already occurring within the corpus examined in this study.
The time-series forecasting architectures that define Cluster 2, particularly LSTM and hybrid CNN-LSTM models, are structurally equivalent to those applied in transformer condition monitoring, where dissolved gas analysis (DGA) data is modeled as a sequential signal to detect incipient faults including partial discharge, arcing, and thermal overheating [
105]. The MLSG forecasting literature’s emphasis on temporal feature extraction and probabilistic uncertainty quantification provides methodological tools that are directly applicable to prognostic health management of power transformers and other critical grid assets. Similarly, the power quality monitoring domain, which encompasses the detection and classification of voltage sags, swells, harmonics, and flicker, relies on the same CNN and wavelet-based signal classification architectures that appear as high-growth keywords in Cluster 2 of the overlay visualization [
106]. This convergence confirms that the methodological frontier of PLF research and power quality analysis are increasingly drawing on the same deep learning architectures, a development that reflects the maturation and broadening applicability of the methods first systematically developed and validated in the MLSG context.
The reinforcement learning and IoT-driven energy management approaches that anchor Cluster 3 have direct analogs in the domain of electric motor drives and industrial automation, where DRL agents are increasingly being applied to predictive maintenance scheduling, rotor fault diagnosis, and efficiency optimization [
107]. The shared methodological foundation in this case involves RL agents operating on sensor streams from physical systems with complex and high-dimensional state spaces, meaning that the algorithmic advances documented in Cluster 3 of the MLSG corpus are simultaneously advancing the state of the art in industrial electrical systems, even when the two sets of literature develop in parallel rather than through direct citation exchange.
These observations collectively suggest that the MLSG research landscape, as documented in this bibliometric study, functions not as a self-contained domain but as a methodological incubator whose outputs are progressively transferring across the full spectrum of electrical engineering applications. Future bibliometric studies should examine the citation flows between MLSG publications and adjacent domains, including power system protection, transformer diagnostics, motor control, and power electronics, to quantify the rate and directionality of this methodological transfer. Such an analysis would provide a more complete picture of ML’s cumulative contribution to electrical engineering as a discipline and would complement the within-domain mapping provided by the present study.