Next Article in Journal
SO-YOLO11-CDP: An Instance Segmentation-Based Approach for Cross-Depth-of-Field Positioning Micro Image Sensor Modules in Precision Assembly
Next Article in Special Issue
Artificial Intelligence-Enhanced Droop Control for Renewable Energy-Based Microgrids: A Comprehensive Review
Previous Article in Journal
Tracking Control of Quadrotor UAVs with Prescribed Performance and Prescribed-Time Convergence Under Arbitrary Initial Conditions
Previous Article in Special Issue
Resilience-Oriented Energy Management of Networked Microgrids: A Case Study from Lombok, Indonesia
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

PhysGraphIR: Adaptive Physics-Informed Graph Learning for Infrared Thermal Field Prediction in Meter Boxes with Residual Sampling and Knowledge Distillation

1
USC Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089, USA
2
Holley Technology Co., Ltd., Hangzhou 310012, China
3
School of Intelligent Engineering and Automation, Shahe Campus, Beijing University of Posts and Telecommunications, Beijing 100876, China
*
Author to whom correspondence should be addressed.
Electronics 2026, 15(2), 410; https://doi.org/10.3390/electronics15020410
Submission received: 11 December 2025 / Revised: 10 January 2026 / Accepted: 12 January 2026 / Published: 16 January 2026

Abstract

Infrared thermal field (ITF) prediction for meter boxes is crucial for the early warning of power system faults, yet this method faces three major challenges: data sparsity, complex geometry, and resource constraints in edge computing. Existing physics-informed neural network-graph neural network (PINN-GNN) approaches suffer from redundant physics residual calculations (over 70% of flat regions contain little information) and poor model generalization (requiring retraining for new box types), making them inefficient for deployment on edge devices. This paper proposes the PhysGraphIR framework, which employs an Adaptive Residual Sampling (ARS) mechanism to dynamically identify hotspot region nodes through a physics-aware gating network, calculating physics residuals only at critical nodes to reduce computational overhead by over 80%. In this study, a ‘hotspot region’ is explicitly defined as a localized area exhibiting significant temperature elevation relative to the background—typically concentrated around electrical connection terminals or wire entrances—which is critical for identifying potential thermal faults under sparse data conditions. Additionally, it utilizes a Physics Knowledge Distillation Graph Neural Network (Physics-KD GNN) to decouple physics learning from geometric learning, transferring universal heat conduction knowledge to specific meter box geometries through a teacher–student architecture. Experimental results demonstrate that on both synthetic and real-world meter box datasets, PhysGraphIR achieves a hotspot region mean absolute error (MAE) of 11.8 °C under 60% infrared data missing conditions, representing a 22% improvement over traditional PINN-GNN. The training speed is accelerated by 3.1 times, requiring only five infrared samples to adapt to new box types. The experiments prove that this method significantly enhances prediction accuracy and computational efficiency under sparse infrared data while maintaining physical consistency, providing a feasible solution for edge intelligence in power systems.

1. Introduction

Infrared thermal field prediction for meter boxes plays a critical role in the condition monitoring and early warning of faults in modern power systems, forming a core component of smart grid asset management [1,2]. Accurate identification of localized hotspot regions—typically concentrated around electrical connections—is paramount for preventing equipment failure and ensuring operational reliability [2,3,4]. To achieve physics-compliant predictions under complex geometric constraints, recent research has integrated Physics-Informed Neural Networks (PINNs) with Graph Neural Networks (GNNs), leveraging the latter’s ability to model irregular domains [5,6,7].
Despite their promise, existing PINN-GNN methods suffer from a fundamental inefficiency here termed the “Full-Graph Residual Redundancy Problem.” These approaches uniformly enforce physical constraints across the entire computational graph, ignoring the stark heterogeneity of thermal gradients on a meter box surface [8,9,10,11]. Consequently, over 70% of computational effort is wasted on flat, low-information regions, while only 5–10% of the surface (the critical hotspots and boundaries) may receive insufficient physical regularization [12,13,14]. This redundancy renders such models prohibitively expensive for training and deployment on resource-constrained edge devices [15,16,17]. Furthermore, these models lack geometric generalization; each new meter box type requires costly retraining with extensive data, an impractical demand for large-scale industrial deployment [18]. Therefore, a critical research gap exists: a lightweight, physics-informed framework capable of efficient, geometrically adaptive thermal field prediction under severe infrared data sparsity and edge-computing constraints [19,20].
To bridge this gap, we propose PhysGraphIR, a novel framework that synergistically combines an Adaptive Residual Sampling (ARS) mechanism with a Physics Knowledge Distillation Graph Neural Network (Physics-KD GNN). Our solution is driven by two core innovations designed to directly address the stated limitations: (1) The ARS mechanism dynamically identifies critical nodes (e.g., high-gradient regions and boundaries) via a physics-aware gating network, calculating PDE residuals only on this sparse subset (typically ∼10% of nodes), thereby reducing computational overhead by over 80%. (2) The Physics-KD architecture decouples universal physics learning from geometry-specific adaptation. A teacher PINN learns general heat conduction laws on a canonical domain, while a lightweight student GNN, guided by a distillation loss, learns to map these laws onto arbitrary meter box geometries. This decoupling enables rapid adaptation to new box types with minimal data.
The key contributions of this work are explicitly summarized as follows:
  • We formulate the “Full-Graph Residual Redundancy Problem” in PINN-GNNs for thermal prediction and propose the ARS mechanism to solve it, achieving a 3.1× training acceleration.
  • We design a novel Physics-KD GNN that separates physical prior learning from geometric representation, enabling few-shot adaptation (as few as five samples) to unseen meter box types.
  • We integrate these components into an edge-deployable framework that maintains high physical consistency under significant data occlusion, validated on both synthetic and real-world datasets.
Our framework is rigorously validated using a synthetic dataset (SynMeterBox) generated via finite element simulation and a real-world infrared dataset (RealMeterIR) collected from a single-phase metering box, as shown in Figure 1. Experiments demonstrate that under a 60% infrared data missing rate, PhysGraphIR achieves a hotspot region mean absolute error (MAE) of 15.6 °C, a 29.5% improvement over the best baseline. The model requires only five samples to adapt to a new box type and achieves an inference latency of under 10 ms on an M1 Mac, confirming its suitability for edge deployment.
The significance of this work is twofold. Practically, it provides a feasible, efficient solution for edge-based intelligent inspection in power systems, reducing reliance on costly data collection and cloud computation. Methodologically, it introduces a principled approach to inject physical priors into geometric deep learning models in a computationally efficient and generalizable manner, offering a new paradigm for physics-informed learning on irregular domains.
The remainder of this paper is organized as follows: Section 2 reviews related work. Section 3 establishes the physical model of meter box thermal conduction. Section 4 details the PhysGraphIR framework. Section 5 presents experimental results and analysis. Section 6 discusses findings and limitations, and Section 7 concludes the paper.

2. Related Work

This section reviews related research in the fields of infrared thermography fault diagnosis, physics-informed machine learning (PINNs), and knowledge distillation for scientific computing. Furthermore, it analyzes the limitations of existing methods regarding edge device deployment and adaptability to complex geometries.

2.1. Deep Learning for Infrared Fault Diagnosis

Traditionally, infrared thermography analysis of electrical equipment relied on image processing techniques and manual feature extraction [21,22,23]. With the development of deep learning, Convolutional Neural Network (CNN)-based methods have become mainstream, widely used for classifying and segmenting thermal anomaly regions [24,25,26]. For instance, ref. [27] proposed a Mask R-CNN-based method to automatically locate hotspots in substation components. However, these purely data-driven methods suffer from two main drawbacks: first, predictions are made through “black-box” mappings lacking physical interpretability, often leading to results that violate thermodynamic laws; second, they rely heavily on large volumes of expensive labeled data, whereas acquiring high-quality infrared data containing rare fault modes is extremely challenging in practical scenarios.

2.2. Physics-Informed Machine Learning for Heat Transfer

To address data scarcity and physical consistency issues, Physics-Informed Neural Networks (PINNs) proposed by Raissi et al. [28] introduce Partial Differential Equation (PDE) residuals into the loss function as regularization terms. In the field of heat transfer, PINNs have been successfully applied to solve forward and inverse heat conduction problems [29,30,31].
Despite the success of PINNs, standard PINNs face challenges when dealing with complex geometric structures. They typically require a large number of collocation points to cover the entire computational domain, leading to high computational costs. Although recent Graph Neural Network (GNN)-based physics solvers (such as MeshGraphNets [32] and Physics-GNN [33]) can better handle unstructured meshes, most existing methods perform uniform residual calculation on all nodes during training. This uniform sampling strategy ignores the spatial heterogeneity of the thermal field, wasting significant computational resources in low-gradient regions. In contrast, our PhysGraphIR introduces an Adaptive Residual Sampling (ARS) mechanism, which reduces residual calculations by over 80% and significantly lowers the single iteration time.

2.3. Knowledge Distillation in Scientific Computing

Knowledge Distillation (KD) was originally proposed for model compression, transferring knowledge from a large teacher model to a lightweight student model [34]. In recent years, KD has also been applied to accelerate PDE solving [35,36,37]. Existing PDE distillation methods typically focus on enabling the student model to mimic the teacher model’s ability to solve specific equations.
However, existing methods rarely address the issue of “geometric generalization.” Typically, both teacher and student models are bound to the same geometric domain. In this study, we propose a novel Physics-KD architecture that achieves the decoupling of physics and geometry. Addressing the generalization issue discussed in [11,38,39], our teacher model (General Heat Conduction PINN) learns general heat conduction physics ( 2 T = Q / k ) solely on a regular rectangular domain, while the student model (Lightweight GNN) focuses on mapping these physical laws onto the specific meter box geometry graph structure. This design enables the model to quickly adapt to unseen geometric structures without the need for retraining from scratch [40,41].

2.4. Summary of Research Gaps and Our Position

To provide a clearer overview of the landscape and precisely position our contribution, we summarize the key characteristics, methodologies, and limitations of the aforementioned state-of-the-art approaches in Table 1. As illustrated, pure data-driven methods lack physical consistency, while standard PINNs suffer from poor geometric adaptability and high computational cost. Although unified PINN-GNN methods integrate physics and geometry, they inherently suffer from the Full-Graph Residual Redundancy Problem and exhibit limited efficiency for edge deployment.
Our proposed PhysGraphIR framework is designed to address these identified gaps jointly, as shown in Figure 2. By introducing the Adaptive Residual Sampling (ARS) mechanism, we directly tackle the computational redundancy issue, enabling efficient training. Meanwhile, the Physics Knowledge Distillation (Physics-KD) strategy explicitly decouples physics from geometry, overcoming the generalization bottleneck. The synergistic integration of these two innovations results in a method that maintains high physical consistency and geometric adaptability while achieving superior computational efficiency and edge-deployment readiness, as validated in our experiments.

3. Meter Box Thermal Conduction Physical Model

In power systems, the temperature distribution of meter boxes (such as electricity meter boxes and distribution boxes) strictly follows the fundamental laws of thermodynamics. When current flows through internal components of a meter box, the Joule heating effect causes temperature elevation, while heat dissipates to the surrounding environment through conduction, convection, and radiation. This thermal behavior can be precisely described by the heat conduction equation, which is essentially the mathematical expression of the energy conservation law in heat transfer processes.
In this section, we establish the thermal conduction model that serves as the physics foundation for the PhysGraphIR framework. We adopt a quasi-steady-state assumption, aligning with the Teacher PINN’s training objective, where transient fluctuations are considered negligible over the sampling period. The mathematical formulations detailed below—specifically the temperature gradients at material interfaces and heat flux at convective boundaries which provide the critical scalar inputs that directly feed into the Adaptive Residual Sampling (ARS) gating mechanism to determine node importance.

3.1. Heat Conduction Governing Equations

Consider the steady-state heat conduction process on the meter box surface (infrared thermal imaging is typically captured under short-term steady-state conditions). The temperature field T ( x ) satisfies the following Poisson equation [42,43]:
2 T ( x ) = Q ( x ) k ( x ) , x Ω
  • Ω denotes the meter box surface domain; x = [ x , y ] represents the surface coordinates;
  • T ( x ) is the temperature field (scalar function);
  • Q ( x ) is the heat source term generated by Joule heating, Q ( x ) = c · I 2 · δ ( x source ) , where c is a material constant, I is the load current, and  x source represents the heat source locations;
  • k ( x ) is the thermal conductivity, which varies spatially (different materials in the meter box have different k values, e.g., metal k 200 W / ( m · K ) , plastic k 0.2 W / ( m · K ) );
  • 2 = 2 x 2 + 2 y 2 is the two-dimensional Laplacian operator.
In practical applications, when meter box operating conditions change dynamically, the transient heat conduction equation may be necessary:
ρ c p T t = · ( k T ) + Q ( x , t )
where ρ is the density, c p is the specific heat capacity, and t is time. However, in infrared thermal imaging analysis, it is typically assumed that the system is in quasi-steady-state during image capture, so Equation (1) is primarily used.
For complex meter box structures with multiple materials, the thermal conductivity k ( x ) exhibits piecewise continuity:
k ( x ) = m = 1 M k m · I ( x Ω m )
where M is the number of distinct materials, k m is the thermal conductivity of material m, Ω m is the region occupied by material m, and  I ( · ) is the indicator function.
At material interfaces, the temperature field must satisfy the interface condition [42]:
k 1 T 1 · n = k 2 T 2 · n , x Γ 12
where Γ 12 denotes the interface between materials 1 and 2, and  T 1 , T 2 are the temperature fields in the respective regions.

3.2. Boundary Conditions

The boundary conditions on the meter box surface have a decisive impact on the temperature field distribution, mainly including the following types:
(1) Convective Boundary Conditions (Surfaces Exposed to Air) [42]
k T · n = h ( T T ) , x Ω conv
where
  • n is the outward normal vector of the surface;
  • h is the convective heat transfer coefficient, which depends on wind speed v, modeled as h = a · v + b ( a , b are empirical constants);
  • T is the ambient temperature;
  • Ω conv represents the convective boundary.
For natural convection (no wind), h can be approximated as follows:
h natural = 1.32 T T L 0.25
where L is the characteristic length of the surface.
For forced convection (with wind), h can be calculated using the Nusselt number correlation:
h forced = k air L · Nu = k air L · C · Re m · Pr n
where k air is the thermal conductivity of air, Nu is the Nusselt number, Re is the Reynolds number, Pr is the Prandtl number, and C, m, n are empirical coefficients.
(2) Heat Source Boundary Conditions (Internal Component Contact Surfaces)
T = T source , x Ω source
where T source is determined by the load current I, indirectly related through Q I 2 .
For electrical connections, the heat generation can be more precisely modeled as follows:
Q connection = I 2 R contact = I 2 R 0 + α ( T T 0 )
where R contact is the contact resistance, which varies with temperature ( α is the temperature coefficient of resistance).
(3) Radiation Boundary Conditions (For High-Temperature Scenarios)
k T · n = ϵ σ ( T 4 T sur 4 ) , x Ω rad
where
  • ϵ is the surface emissivity;
  • σ = 5.67 × 10 8 W · m 2 · K 4 is the Stefan–Boltzmann constant;
  • T sur is the surrounding surface temperature;
  • Ω rad represents the radiative boundary.
For most meter box applications, radiation effects are negligible below 80 °C, but become significant at higher temperatures.

3.3. Infrared Thermal Imaging Observation Model

The radiance intensity captured by an infrared thermal imager is related to the surface temperature through the following observation model:
I IR ( x ) = ϵ ( x ) σ T 4 ( x ) + τ ( x ) I amb + n ( x )
where
  • I IR ( x ) is the radiance intensity measured by the infrared camera;
  • ϵ ( x ) is the surface emissivity (material-dependent, metal approximately 0.1–0.3, plastic approximately 0.8–0.95);
  • σ is the Stefan–Boltzmann constant;
  • τ ( x ) is the atmospheric transmittance;
  • I amb is the ambient radiation;
  • n ( x ) is the measurement noise.
In practice, the infrared camera converts radiance intensity to temperature readings through internal calibration, but due to occlusions, resolution limitations, and other factors, only partial regions have valid observations:
T obs ( x ) = T ( x ) + η ( x ) , if x O undefined , otherwise
where O represents the observable region and  η ( x ) is the measurement error.
The measurement error η ( x ) can be modeled as follows:
η ( x ) = η cal + η noise + η ref + η atm
where
  • η cal is the calibration error (typically 1–2% of reading);
  • η noise is the random noise (approximately Gaussian with σ noise 0.05 0 . 1 ° C );
  • η refl is the reflected apparent temperature error;
  • η atm is the atmospheric absorption error.
For meter boxes, the observable region O is often limited due to the following:
  • Physical obstructions (wires, mounting hardware);
  • Emissivity variations causing reflection artifacts;
  • Limited field of view of the infrared camera;
  • Safety constraints preventing close inspection.

3.4. Meter Box Thermal Field Parameterization

According to Equations (1)–(13), the meter box temperature field can be parameterized as follows:
T = T ( x ; θ ) θ = Ω , k ( x ) , Q ( x ) , h , T , I , v , ϵ ( x )
where θ includes all parameters affecting the temperature distribution:
  • Ω : Meter box geometric structure;
  • k ( x ) : Spatial thermal conductivity distribution;
  • Q ( x ) : Heat source distribution;
  • h , T : Environmental parameters;
  • I: Operational parameter (load current);
  • v: Wind speed;
  • ϵ ( x ) : Surface emissivity distribution.
Notably, the heat source locations x source are typically known as prior knowledge (obtained from meter box design diagrams), providing critical constraints for physics-informed learning. Additionally, the non-uniform material distribution on the meter box surface causes spatial variations in k ( x ) , making the heat conduction equation complex to solve on irregular geometric domains. This complexity is precisely why traditional numerical methods (such as finite element analysis) have high computational costs, which is the motivation for proposing the PhysGraphIR framework.
The thermal behavior of meter boxes exhibits several distinctive characteristics:
1.
Hotspot Localization: Temperature anomalies are typically concentrated in specific regions (5–10% of the surface), such as electrical connections and current-carrying components.
2.
Gradient Heterogeneity: Temperature gradients vary significantly across the surface, with high gradients near heat sources and low gradients in passive regions.
3.
Multi-Physics Coupling: Electrical current, thermal conduction, and fluid dynamics interact, though for infrared prediction, we primarily focus on the thermal aspect.
4.
Dynamic Boundary Conditions: Environmental parameters ( T , v) and operational parameters (I) change over time, requiring adaptive modeling.
This physical model strictly adheres to the First Law of Thermodynamics and provides a theoretical foundation for the subsequent design of physics-informed graph neural networks. In practical applications, environmental parameters ( T , v) and operational parameters (I) can be obtained in real-time through sensors, while the geometric structure Ω can be extracted from meter box CAD models. These inputs will be utilized by the PhysGraphIR framework to generate physically consistent temperature field predictions.
The parameterized model enables us to formulate the infrared thermal field prediction as a physics-constrained optimization problem:
min T λ data L data ( T ) + λ pde L pde ( T ) + λ bc L bc ( T )
where
  • L data ( T ) = 1 | O | x O T ( x ) T obs ( x ) 2 is the data loss on observed regions;
  • L pde ( T ) = 1 | Ω | Ω · ( k ( x ) T ( x ) ) + Q ( x ) 2 d x is the PDE residual loss;
  • L bc ( T ) = 1 | Ω | Ω BC ( T , x ) 2 d x is the boundary condition loss;
  • λ data , λ pde , λ bc are balancing coefficients.
This formulation serves as the mathematical basis for the PhysGraphIR framework, where the Adaptive Residual Sampling mechanism and Physics Knowledge Distillation techniques are designed to efficiently solve this optimization problem under resource constraints. In summary, the interaction of parameters within the teacher–student architecture is categorized as follows: Environmental variables such as ambient temperature T and load current I are treated as sensed inputs. Conversely, parameters like the convective heat transfer coefficient h and contact resistance R c o n t a c t are estimated variables implicitly optimized during training. These physical quantities directly constitute the PDE residual loss (consistent with the governing term 2 T ^ i + Q i / k i ), while the magnitude of boundary heat fluxes serves as the primary weight for calculating node importance scores.

4. Infrared Thermal Field Prediction Framework Based on Adaptive Physics Residual Sampling and Knowledge Distillation

In this section, we present the PhysGraphIR framework, a lightweight architecture designed to address the challenges of data sparsity and geometric complexity in meter box thermal field prediction by integrating physics-informed learning with graph neural networks. The methodology is driven by two core technologies: First, we describe the Adaptive Residual Sampling (ARS) mechanism, which utilizes a physics-aware gating network to dynamically identify critical nodes and calculate physics residuals only in high-information regions, thereby significantly reducing computational redundancy for efficient edge deployment. Second, we detail the Physics Knowledge Distillation (Physics-KD) strategy, which decouples physics learning from geometric representations to enable rapid adaptation across different meter box structures.

4.1. PhysGraphIR Framework Overview

The PhysGraphIR framework innovatively integrates physics-informed learning with graph neural networks, specifically designed to address the challenges of data sparsity and geometric complexity in meter box infrared thermal field prediction. Unlike traditional PINN-GNN methods, PhysGraphIR achieves efficient and accurate prediction through two core technologies: (1) Adaptive Residual Sampling (ARS) mechanism that dynamically identifies critical nodes and calculates physics residuals only at these nodes; (2) Physics Knowledge Distillation Graph Neural Network (Physics-KD GNN) that decouples physics learning from geometric learning, enabling rapid adaptation across different meter box geometries.
The framework workflow consists of three stages:
1.
Physics Prior Learning Stage: Pre-train a teacher model (general heat conduction PINN) to learn fundamental thermodynamic laws;
2.
Geometry Adaptation Stage: Transfer physics knowledge to specific meter box geometry through ARS mechanism and knowledge distillation;
3.
Edge Inference Stage: The use of a lightweight student model that directly outputs the complete temperature field, meeting real-time monitoring requirements.
Unlike RF Map construction in reference literature, PhysGraphIR is deeply optimized for the characteristics of meter box infrared tasks: (1) Considering the non-uniform temperature distribution on meter box surfaces, the ARS mechanism focuses on hotspot regions (only 5–10% of the surface); (2) Designed lightweight graph encoder adapts to different meter box geometries, requiring only five infrared samples to adapt to a new box type; (3) Achieves end-to-end training and deployment on Mac-class devices with inference latency < 10  ms. This design enables PhysGraphIR to maintain high prediction accuracy even under extreme conditions of 60% infrared data missing, significantly outperforming traditional methods.The detailed workflow of the PhysGraphIR framework, integrating the ARS mechanism and Physics-KD GNN, is summarized in Algorithm 1.

4.2. Adaptive Physics Residual Sampling (ARS) Mechanism

(1) Problem Motivation and Design Principle
Traditional PINN-GNN methods uniformly calculate physics residuals across all nodes on the meter box surface, resulting in significant computational resources wasted in flat temperature regions (accounting for over 70% of the surface), while critical hotspot regions (only 5–10%) fail to receive sufficient physical constraints. To address this “Full-Graph Residual Redundancy Problem,” we propose the ARS mechanism, whose core idea is to dynamically identify critical nodes through a physics-aware gating network and calculate physics residuals only at these nodes, as shown in Figure 3.
Specifically, critical nodes are defined as follows:
  • High-gradient regions: Areas with rapid temperature changes ( | T | > τ grad );
  • Boundary regions: Nodes on convective boundaries and heat source boundaries;
  • Missing observation regions: Areas with missing infrared data but high gradients in neighboring regions.
Where the threshold τ grad is dynamically set to the 75th percentile of the global gradient distribution, ensuring critical nodes account for approximately 10%.
Algorithm 1 PhysGraphIR: Temperature Field Prediction Algorithm with Adaptive Physics Residual Sampling and Knowledge Distillation
Require: 
Meter box geometry graph G = ( V , E ) ; Environmental parameters p env = [ T , v ] ; Operational parameters q op = [ I ] ; Partial infrared observations T obs (only available for some nodes).
Ensure: 
Complete temperature field prediction T ^ : V R .
1:
// Stage 1: Pre-train teacher model (only once)
2:
Load pre-trained teacher PINN model T teacher .
3:
for each training batch do
4:
   Sample coordinates X sample from rectangular domain.
5:
   Compute PDE residuals R pde .
6:
   Update teacher model parameters.
7:
end for
8:
// Stage 2: Geometry adaptation phase
9:
Initialize student GNN model T student .
10:
for each training epoch do
11:
   Forward propagation: H = GNN ( G ) .
12:
   Gating network: g = Gate ( H , T , c ) .
13:
   Dynamically sample critical nodes: K = Top - k ( g ) .
14:
   // ARS Physics Residual Calculation
15:
    L pde = 1 | K | i K R pde , i 2
16:
    L bc = 1 | Ω K | i Ω K R bc , i 2
17:
   // Knowledge Distillation
18:
    T teacher = PINN ( X V )
19:
    L kd = 1 | V | i = 1 | V | ( T student ( v i ) T teacher ( x i ) ) 2
20:
   // Data Loss
21:
    L data = 1 | O | i O ( T student ( v i ) T obs ( v i ) ) 2
22:
   // Total Loss and Update
23:
    L = λ data L data + λ pde L pde + λ bc L bc + λ kd L kd
24:
   Backpropagation to update student GNN and gating network parameters.
25:
end for
26:
// Stage 3: Edge Inference Phase
27:
Load trained student GNN model T student .
28:
Input real-time parameters p env , q op .
29:
Output complete temperature field T ^ = T student ( G ) .
30:
Hotspot detection:  Alert = { v i T ( v i ) > T threshold } .
(2) Physics-aware gating network design
The core of the ARS mechanism is the physics-aware gating network, whose structure is shown in Figure 3, including the following components:
  • Input Layer:
  • Node features h i ( l ) : Intermediate layer features of GNN (dimension d h );
  • Approximate gradient T i : Rapidly calculated through first-order neighborhood differencing.
T i 1 | N ( i ) | j N ( i ) ( T j T i ) d i j
where d i j is the directional vector from node i to j.
  • Global context c : Embedding of environmental parameters ( T , v ) and operational parameters (I).
  • Gating Network Architecture:
gate ( h i ) = σ W 2 · ReLU W 1 · [ h i ( l ) ; T i ; c ] + b 1 + b 2
where
  • [ ; ] denotes vector concatenation;
  • W 1 R 64 × ( d h + 2 + d c ) , W 2 R 1 × 64 are trainable weights;
  • σ is the sigmoid activation function, outputting node importance score g i [ 0 , 1 ] .
  • Dynamic Sampling Strategy:
K = { i g i > τ g } where τ g = Top - k quantile ( g )
k is dynamically set to k = max ( 5 , 0.1 · | V | ) , ensuring the number of critical nodes is not less than five (minimum scale of hotspot regions).
Sensitivity and Coverage Analysis: The selection of the top-k quantile is substantiated by our sensitivity analysis, which indicates that a sampling rate between 8% and 12% yields the optimal trade-off. Sampling rates below 8% are insufficient to capture the full extent of gradient variations T around material interfaces, leading to inadequate physics regularization. Conversely, rates exceeding 12% introduce computational redundancy by including low-information background nodes without improving the PDE residual norm. This range ensures adequate coverage of critical current paths and hotspot regions, aligning with the minimum scale constraint ( k 5 ) to guarantee physical consistency across diverse geometries.
As illustrated in Figure 4, the computational savings are achieved by introducing a conditional branch in the forward pass. The ARS gating network dynamically identifies “critical nodes” (defined as high-gradient or boundary regions) which constitute only about 10% of the total graph. By restricting the computationally expensive PDE residual calculation to this subset K, we avoid redundant computations on the flat “blue nodes”, effectively compressing the training iteration time.
(3) Residual Calculation Optimization Strategy
After identifying the critical node set K , the ARS mechanism calculates physics residuals only at these nodes:
  • PDE Residual:
    R pde , i = j N ( i ) w i j ( T j T i ) + Q ( x i ) k ( x i ) , i K 0 , otherwise
  • Boundary Residual:
R bc , i = k T i · n i h ( T i T ) , i K Ω conv T i T source , i K Ω source 0 , otherwise
where boundary nodes are always included in the critical node set ( Ω K ), ensuring strict satisfaction of boundary conditions.
  • Computational Efficiency Analysis:
  • Traditional method: Computational complexity O ( | V | ) ;
  • ARS mechanism: Computational complexity O ( 0.1 | V | ) , reducing residual calculations by over 80%;
  • Actual measurement: On a 200-node meter box, ARS reduces single iteration time from 32 ms to 9 ms (M1 Mac).

4.3. Physics Knowledge Distillation Graph Encoder (Physics-KD GNN)

(1) Teacher–Student Architecture Design
The core of Physics-KD GNN is decoupling physics learning from geometric learning through a teacher–student architecture:
  • Teacher Model (General Heat Conduction PINN):
  • Input: 2D coordinates x = [ x , y ] ;
  • Network Architecture: 4-layer fully connected network (input layer + 3 hidden layers + output layer), 50 neurons per layer;
  • Output: Temperature prediction T teacher ( x ) ;
  • Training Objective: Solve heat conduction equation on rectangular domain.
    2 T = Q k , with   T | Ω = T boundary
  • Characteristics: Learns general heat conduction physics, independent of specific geometry.
  • Student Model (Lightweight GNN):
  • Input: Meter box geometry graph G = ( V , E ) , node features f i = [ x i , y i , material i ] ;
  • Network Architecture: Two-layer Graph Attention Network (GAT).
    h i ( l + 1 ) = σ j N ( i ) α i j W ( l ) h j ( l )
    where α i j is the attention coefficient and W ( l ) is the trainable weight.
  • Output: Temperature prediction T student ( G ) ;
  • Characteristics: Designed specifically for a particular meter box geometry, <5k parameters.
  • Quantitative Analysis of Knowledge Transfer:
To further quantify the efficacy of physical consistency transfer, we analyzed the correlation between the Student model’s node importance scores (generated by the ARS gating network) and the Teacher model’s PDE residual magnitudes.
We calculated the Pearson Correlation Coefficient (PCC) across the validation set. The results show a strong positive correlation ( ρ = 0.87 ), indicating that the student model effectively mimics the teacher’s focus mechanism. Specifically, nodes with high PDE residuals—typically located at material interfaces and convective boundaries—receive significantly higher attention weights from the student model. This confirms that the physical constraints are not merely memorized but are actively guiding the student’s feature extraction process.
(2) Physics Knowledge Transfer Mechanism
Knowledge transfer is achieved through the following loss function:
  • Knowledge Distillation Loss:
    L kd = 1 | V | i = 1 | V | T student ( v i ) T teacher ( x i ) 2
  • Data Fitting Loss:
    L data = 1 | O | i O T student ( v i ) T obs ( v i ) 2
  • Total Loss Function:
L total = λ data L data + λ kd L kd + λ ars L ars
where L ars is the loss of the gating network in the ARS mechanism, and λ data = 1.0 , λ kd = 0.5 , and λ ars = 0.2 optimized through grid search.
  • Transfer Learning Advantages:
  • Teacher model provides physically smooth soft labels, suppressing infrared noise;
  • Student model adapts to new geometries with only a few samples;
  • Experiments show: With only five infrared samples, the student model achieves over 90% performance of the teacher model.
(3) Lightweight Student Model Architecture
To adapt to edge device deployment, the student model adopts the following lightweight design:
  • Graph Construction Optimization:
  • K-Nearest Neighbor Graph: K = 4 (balancing connectivity and sparsity);
  • Edge weights: w i j = exp ( x i x j 2 / σ 2 ) , σ set to average node spacing.
  • Network Architecture:
  • Input Layer: Three-dimensional node features (x, y, material);
  • GAT Layer 1: Eight attention heads; output dimension 8;
  • Activation Function: LeakyReLU ( α = 0.2 );
  • GAT: Layer 2: One attention head; output dimension 1 (temperature prediction);
  • Total Parameters: 4865 (only 1 / 10 of traditional GNN).
  • Edge Deployment Optimization:
  • Model Quantization: FP32 → INT8, model size reduced from 190 KB to 48 KB;
  • Graph Simplification: K-core compression, reducing node count by over 30% (retaining hotspot regions);
  • Inference Acceleration: Metal Performance Shaders optimization, inference latency < 10 ms on M1 Mac.

5. Simulation and Performance Evaluation

This section comprehensively evaluates the performance of the PhysGraphIR framework for meter box infrared thermal field prediction. We conduct experiments on both synthetic and real-world datasets to validate the effectiveness, robustness, and practicality of the proposed method. The experimental design considers three major challenges: infrared data sparsity, geometric complexity, and resource constraints in edge computing. We particularly focus on evaluating PhysGraphIR’s performance in hotspot prediction accuracy, computational efficiency, and edge deployment feasibility.

5.1. Experimental Setup

(1) Datasets
The proposed method is validated using both synthetic and real-world datasets to comprehensively evaluate its performance in generalization, adaptation to real noise, and handling of data sparsity.
(1)
SynMeterBox Synthetic Dataset
  • Generation Method: Finite element simulation using COMSOL Multiphysics 5.6a to simulate temperature fields under various operating conditions.The workflow of synthetic data preparation is shown in Figure 5 below.
  • Box Types: 50 different geometric structures of meter boxes (based on actual power equipment designs).
  • Conditions per Box: 100 operating conditions (current range: 0–200 A, ambient temperature: −10–40 °C, wind speed: 0–5 m/s).
  • Data Scale: 5000 samples ( 50 boxes × 100 conditions ).
  • Data Format:
    Geometry graph: Surface point cloud (200–500 nodes, depending on box complexity).
    Node features: [ x , y , material encoding ( 0 = metal , 1 = plastic ) ] .
    Environmental parameters: [ T , v ] .
    Operational parameters: [ I ] .
    Temperature field: True temperature value for each node.
  • Data Split:
    Training set: 40 boxes × 100 conditions = 4000 samples.
    Validation set: 5 boxes × 100 conditions = 500 samples.
    Test set: 5 boxes × 100 conditions = 500 samples.
Figure 5. Workflow of synthetic data preparation.
Figure 5. Workflow of synthetic data preparation.
Electronics 15 00410 g005
(2)
RealMeterIR Real-World Dataset
To ensure the reproducibility of the data acquisition process, the physical setup is visualized in Figure 6. The FLIR T1020 camera (FLIR Systems, Wilsonville, OR, USA) was mounted on a tripod at a fixed distance (approx. 1.5 m) to balance resolution requirements with safety constraints preventing close inspection.
  • Collection Method: FLIR T1020 infrared thermal imager (resolution 1024 × 768 , accuracy ±1 °C or ±1% of reading). The dataset collection spanned from January to October 2025, capturing environmental variations where ambient temperatures ranged from −10 °C to 40 °C.
  • Box Types: Three typical meter boxes (Type A: single-phase meter box, Type B: three-phase meter box, Type C: distribution box).
  • Data Scale: 217 valid samples (Type A: 85, Type B: 72, Type C: 60).
  • Data Content
    Infrared images: Each sample contains one infrared image and corresponding visible light image.
    Temperature annotations: Hotspot temperatures manually labeled through professional software.
    Environmental parameters: Recorded via temperature–humidity sensors and anemometers.
    Operational parameters: Load current recorded via clamp meters.
  • Data Characteristics
    Infrared data sparsity: Average 35 % area cannot obtain valid temperatures due to occlusion or reflection.
    Noise level: Signal-to-noise ratio (SNR) approximately 25–30 dB.
    Hotspot regions: Account for 5– 10 % of surface area but contain over 90 % of fault information.
(2) Baseline Methods
To comprehensively evaluate PhysGraphIR’s performance, we compare it with the following baseline methods, as shown in Table 2:
All methods are implemented and evaluated under the same environment to ensure a fair comparison. Hyperparameters for PhysGraphIR and baseline methods are optimized through grid search.
(3) Evaluation Metrics
To comprehensively assess prediction performance, we adopt the following metrics:
  • Overall MAE: Mean absolute error across all nodes
    MAE overall = 1 | V | i = 1 | V | | T ^ i T i true |
  • Hotspot MAE: Mean absolute error in hotspot regions ( T > 60 °C)
    MAE hotspot = 1 | H | i H | T ^ i T i true |
    where H = { i T i true > 60   ° C }
  • PDE Residual Norm: Physics consistency metric
    PDE Residual = 1 | V | i = 1 | V | 2 T ^ i + Q i k i 2
  • Training Speed: Training time per epoch (seconds);
  • Inference Latency: Prediction time per sample (milliseconds);
  • Memory Usage: Peak memory consumption during training/inference (MB);
  • Data Efficiency: Minimum number of training samples required to achieve specific accuracy.
Notably, hotspot MAE is a critical metric for power system fault detection, as prediction accuracy in hotspot regions directly determines the reliability of fault detection.
(4) Experimental Environment
The experiments were conducted on an Apple MacBook Pro (M1 Pro, 2021) with 16GB RAM as the primary device, supplemented by a Raspberry Pi 4 (4 GB RAM) for edge deployment testing; the software environment ran on macOS Monterey 12.3.1 with PyTorch 1.12.0 and PyTorch Geometric 2.2.0 as the deep learning framework, DeepXDE 1.0.0 for physics-informed learning, and utilized MPS (Metal Performance Shaders) backend for acceleration; training parameters included the Adam optimizer with learning rate 0.001 and weight decay 1 × 10 4 , batch sizes of 16 for the SynMeterBox dataset and 8 for the RealMeterIR dataset, 200 training epochs for SynMeterBox and 100 for RealMeterIR, with early stopping implemented to halt training if the validation loss did not decrease for 10 consecutive epochs.

5.2. Main Results

(1) Datasets Overall Performance Comparison
To quantitatively evaluate the effectiveness of the PhysGraphIR framework, we conducted a systematic comparison with various state-of-the-art baseline methods, as comprehensively summarized in Table 3. The comparison encompasses performance on the synthetic dataset (SynMeterBox) under both 50% and 60% infrared data missing rates, as well as generalization capability on the real-world dataset (RealMeterIR). Key metrics include the overall mean absolute error (MAE), the critical hotspot region MAE, and the PDE residual norm for assessing physical consistency.
The key findings are as follows:
  • Hotspot Prediction Accuracy: Under SynMeterBox 60% missing condition, PhysGraphIR achieves a hotspot MAE of 15.6 °C, representing a 29.5% improvement over the best baseline (PINN-GNN: 22.1 °C → 15.6 °C).
  • Robustness to Data Sparsity: When data missing rate increases from 50% to 60%, PhysGraphIR’s hotspot MAE increases only by 5.4%, while U-Net increases by 24.6%, demonstrating strong robustness to data missing.
  • Real-World Performance: On RealMeterIR, PhysGraphIR achieves 14.3 °C hotspot MAE, 23.6% lower than the second-best Physics-KD, proving its effectiveness in real-world scenarios.
  • Physics Consistency: PhysGraphIR’s PDE residual norm is 0.19, significantly lower than other methods (PINN-GNN: 0.28). This demonstrates that our predictions adhere more strictly to the thermodynamic laws governed by the optimization problem formulated in Equation (15).
In summary, compared to existing advanced methods, PhysGraphIR demonstrates marked advantages in prediction accuracy (particularly in critical hotspot regions), physical consistency, and robustness to data missing. These results robustly validate the efficacy of the proposed Adaptive Residual Sampling and Physics Knowledge Distillation mechanisms.
(2) Training and Inference Efficiency
Table 4 shows computational efficiency metrics of various methods, with special focus on edge device deployment feasibility.
The key findings are as follows:
  • Training Acceleration: PhysGraphIR’s training speed is 3.1 × faster than PINN-GNN, primarily due to ARS mechanism reducing over 80% of residual calculations.
  • Inference Latency: On M1 Mac, PhysGraphIR achieves 9.5 ms inference latency, meeting real-time monitoring requirements (<30 ms).
  • Memory Usage: PhysGraphIR uses only 480 MB memory, 61.6 % lower than PINN-GNN, suitable for resource-constrained devices.
  • Model Size: After INT8 quantization, PhysGraphIR’s model size is only 48 KB, deployable on embedded devices.
Figure 7 shows the training curves of PhysGraphIR and baseline methods. PhysGraphIR not only converges faster but also achieves lower loss on the validation set, demonstrating its optimization efficiency and generalization capability.
(3) Cross-Box Adaptation Capability
Table 5 evaluates PhysGraphIR’s cross-box adaptation capability, i.e., quickly adapting to new box types with minimal samples.
The key findings are as follows:
  • Data Efficiency: PhysGraphIR achieves 14.9 °C hotspot MAE with only 5 samples on a new box type, while training from scratch requires 60 samples.
  • Training Speed: Cross-box adaptation training takes only 28 s, 6.4 × faster than training from scratch.
  • Accuracy Advantage: 35.5 % lower hotspot MAE than transfer learning PINN-GNN (23.1 °C → 14.9 °C).
This demonstrates that the Physics-KD mechanism effectively decouples physics learning from geometric learning, enabling the model to quickly adapt to new box geometries and significantly reduce data collection costs.

5.3. Ablation Studies

(1) Component Contribution Analysis
Table 6 analyzes the contribution of each PhysGraphIR component through ablation studies, all conducted under SynMeterBox 60% missing condition.
The key findings are as follows:
  • ARS Mechanism: Using ARS alone reduces hotspot MAE by 12.7 % (22.1 °C → 19.3 °C) and accelerates training by 2.6 × .
  • Physics-KD Mechanism: Using Physics-KD alone reduces hotspot MAE by 10.4 % (22.1 °C → 19.8 °C), but slightly decreases training speed.
  • Combined Effect: ARS + Physics-KD reduces hotspot MAE by 16.3 % (22.1 °C → 18.5 °C) and accelerates training by 2.3 × .
  • Optimized Weights: By adjusting loss function weights ( λ data = 1.0 , λ kd = 0.5 , λ ars = 0.2 ), hotspot MAE is further reduced by 18.6 % , reaching 15.6 °C.
(2) ARS Mechanism Analysis
Figure 8 shows the critical node selection effect and performance impact of the ARS mechanism.
The key findings are as follows:
  • Critical node selection: As shown in Figure 8a, the ARS mechanism accurately focuses on current paths (red highlighted areas), highly consistent with true hotspot regions.
  • Performance impact of ARS Mechanism: Figure 8b shows that when sampling rate is between 8–12%, hotspot MAE reaches its minimum; below 8% leads to insufficient physics constraints, above 12% increases computational redundancy.
(3) Physics-KD Mechanism Analysis
Figure 9 shows the knowledge transfer effect and data efficiency of the Physics-KD mechanism.
The key findings are as follows:
  • Knowledge Transfer Effect: As shown in Figure 9a, student model predictions are highly consistent with teacher model, but better aligned with true temperature field (dashed line), proving effective physics knowledge transfer.
  • Data Efficiency: Figure 9b shows that PhysGraphIR achieves performance equivalent to traditional methods with 50 samples using only 10 samples, improving data efficiency by 5 × .
  • New Box Adaptation: Figure 9c demonstrates how PhysGraphIR quickly adapts to new boxes, converging to stable performance after only five samples.

5.4. Visualization Analysis

(1) Temperature Field Prediction Comparison
Figure 10 shows the temperature field prediction visualization comparison of different methods on the RealMeterIR dataset.
(2) ARS Mechanism Heatmap
Figure 11 shows the critical node identification effect of the ARS mechanism on a real meter box.
The key observations are as follows:
  • Infrared Image: As shown in Figure 11a, the infrared thermal image reveals significant temperature variations across the meter box surface, with clear hotspot regions concentrated around electrical connection terminals and wire entrances where temperature exceeds 36 °C; notably, approximately 35% of the surface area shows missing or unreliable data due to physical occlusions and reflection artifacts, highlighting the data sparsity challenge that motivates our PhysGraphIR framework’s design for robust prediction under partial observations.
  • Importance Scoren: As shown in Figure 11b, the ARS mechanism assigns high scores to true hotspot regions (connection terminals, wire entrances), highly consistent with manual annotations.
  • Physics Consistency: By focusing on critical regions, ARS ensures physics constraints are applied where most needed, improving prediction quality.
    (3) Worst-case Violation and Spatial Leakage Analysis
    While the average PDE residual norm decreases from 0.28 (PINN-GNN) to 0.19 (PhysGraphIR) as shown in Table 6, the average metric may mask local failures. To address this, we visualize the spatial distribution of the absolute PDE residuals | R p d e | in Figure 12.
    Spatial Leakage: As observed in Figure 12a, the baseline method suffers from significant “spatial leakage,” where high residual errors (>1.5) cluster densely around material interfaces and convective boundaries. This aligns with the definition of “high-gradient regions” ( | T | > τ g r a d ), where complex physics dynamics make optimization difficult.
    Worst-case Suppression: In contrast, Figure 12b demonstrates that PhysGraphIR effectively suppresses these worst-case violations. By dynamically allocating critical nodes to these high-frequency areas via the ARS mechanism, our method reduces the maximum local residual by approximately 40%. This confirms that the proposed method does not merely optimize the global average but strictly enforces physical consistency in the most challenging geometric regions.

5.5. Edge Deployment Testing

(1) Mac Device Performance
Table 7 shows the performance of PhysGraphIR on Mac devices, including the effects of different optimization strategies.
The key findings are as follows:
  • Quantization Effect: INT8 quantization reduces inference latency by 12.2 % and model size by 50 % .
  • Graph Simplification: K-core compression reduces memory usage by 33.3 % and inference latency by 14.6 % .
  • Combined Optimization: INT8 + K-core compression achieves optimal performance with inference latency < 10 ms, meeting real-time monitoring requirements.
(2) Edge Device Performance
Table 8 evaluates the deployment performance of PhysGraphIR on typical edge devices.
The key findings are as follows:
  • Wide Compatibility: PhysGraphIR runs efficiently on multiple edge devices, meeting requirements of different scenarios.
  • Power Efficiency: Power consumption < 1 W on mobile devices, suitable for battery-powered scenarios.
  • Real-time Capability: Inference latency < 30 ms on all devices, meeting real-time monitoring requirements.
(3) Long-Term Running Stability
We conducted a 72 h continuous running test on M1 Mac to evaluate the long-term stability of PhysGraphIR.
  • Inference Latency: Average latency of 9.7 ms over 72 h, standard deviation 0.3 ms, no significant performance degradation.
  • Memory Usage: Stable around 320 MB, no memory leaks.
  • Prediction Consistency: Prediction difference < 0.1 °C for the same input, proving numerical stability.
  • Power Consumption: Average 0.78 W, total energy consumption approximately 0.056 kWh over 72 h, suitable for long-term deployment.

5.6. Uncertainty and Reliability Analysis

To evaluate the reliability of model predictions, particularly under high data-missing conditions as highlighted in previous studies on uncertainty quantification, we conducted an uncertainty evaluation experiment. According to the observation model in Equation (12), the observable region O is often sparse due to occlusions. We simulated high data-missing rates (ranging from 30% to 70%) and introduced Gaussian measurement noise n ( x ) N ( 0 , 0.5 ) . We performed 50 stochastic forward passes for each missing rate to calculate the predictive standard deviation ( σ ). While the uncertainty ( σ ) naturally increases with the missing rate, PhysGraphIR maintains a significantly lower uncertainty bound compared to the baseline. Even at a 60% missing rate, the hotspot temperature variance remains within ±1.2 °C. This indicates that the physical constraints effectively regularize the solution space, preventing the model from generating thermodynamically inconsistent predictions in occluded regions, thereby ensuring high reliability.

6. Discussion

6.1. Performance and Mechanism Analysis

In this study, we proposed the PhysGraphIR framework to address the challenges of sparse data and geometric complexity in meter box thermal monitoring. By solving the physics-constrained optimization problem formulated in the governing equations, our method effectively integrates physical laws with data-driven learning.
  • Data Efficiency: The Physics Knowledge Distillation (Physics-KD) mechanism significantly improves data efficiency. Our experimental results indicate that our method achieves performance equivalent to traditional methods (which typically require 50 samples) using only 10 samples, representing a 5 × improvement in data efficiency.
  • Knowledge Transfer: Visualization of the thermal fields confirms that the student model not only remains consistent with the teacher model but also aligns better with the true temperature field, demonstrating effective physics knowledge transfer.

6.2. Limitations and Sensitivity Analysis

Despite the promising results, several limitations identified during the development of the Adaptive Residual Sampling (ARS) mechanism and deployment testing must be acknowledged, addressing the concerns regarding heuristic choices and computational constraints:
  • Heuristic Thresholds in ARS: The current ARS mechanism relies on static thresholds (e.g., top-k nodes). Our sensitivity analysis indicates that a sampling rate between 8 % and 12 % is optimal. A sampling rate below 8 % risks insufficient physics constraints, leading to potential violations of the governing equations, while a rate above 12 % increases computational redundancy without yielding significant performance gains.
  • New Box Adaptation Latency: While the decoupling of physics and geometry allows for rapid adaptation, the “New Box Adaptation” phase is not instantaneous. The model requires approximately 5 samples to converge to stable performance on a new geometric structure. This implies a brief calibration period is necessary when deploying to unseen meter box types to ensure the inference latency remains within acceptable limits.
  • Edge Computing Constraints: Although optimized for edge devices, the computational load of calculating the PDE residual loss (incorporating terms like 2 T ^ i ) still presents challenges under extreme resource constraints. This necessitates the strict node selection enforced by ARS to maintain training speed.

6.3. Generalizability and Future Work

  • Scalability to Other Components: The proposed methodology shows potential for extension to other electrical components such as transformers and switchgear. However, these applications will involve more complex internal geometries and varying material properties compared to the current meter box graph structure, which may require more sophisticated graph topology definitions.
  • Dynamic Sampling Strategy: To address the limitation of static thresholds, future work will focus on developing a dynamic sampling strategy for ARS. This would allow the sampling rate to adapt in real-time based on the variance of the observable region and the estimated PDE residual norm.
  • Enhanced Physics Modeling: We also aim to refine the boundary condition loss to better account for transient environmental factors, further reducing the dependency on the teacher model’s steady-state assumptions.

7. Conclusions

The PhysGraphIR framework proposed in this paper provides an efficient, accurate, and edge-friendly solution for infrared thermal field prediction in meter boxes. Through the Adaptive Physics Residual Sampling (ARS) mechanism and the lightweight Physics Knowledge Distillation graph encoder, PhysGraphIR successfully addresses three major challenges: infrared data sparsity, geometric complexity, and resource constraints in edge computing. Experimental results demonstrate that this framework achieves a mean absolute error (MAE) of 15.6 °C in hotspot regions under 60% infrared data missing condition, representing a 29.5% improvement over traditional methods: training speed is accelerated by 3.1×; only five infrared samples are required to adapt to new meter box types; and inference latency is less than 10 ms on M1 Mac, fully meeting the deployment requirements for edge intelligence in power systems.
PhysGraphIR not only achieves technological innovation breakthroughs but also possesses significant engineering practical value. It reduces infrared inspection costs, improves the reliability of fault early warning, realizes true edge intelligence, and provides a new paradigm for equipment condition monitoring in power systems. As the construction of smart grids advances, lightweight physics-informed learning frameworks like PhysGraphIR will play an increasingly important role in the intelligent monitoring of power equipment.
We look forward to collaborating with the power industry and academia to jointly promote the application and optimization of PhysGraphIR in broader scenarios, providing technical support for the safe and stable operation of power systems. In the future, we will continue to explore innovative applications of physics-informed learning in power equipment condition monitoring, driving the deep integration of AI technology and the power industry.

Author Contributions

H.L. proposed the main idea of the paper and designed the experiments; S.L. provided the data used in the paper’s experiments and participated in the revision and discussion of the idea; X.Y. put forward important suggestions on the paper’s idea and guided the writing of the paper; X.H. implemented the paper’s experiments and drafted the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data Availability Statement

Due to strict data confidentiality requirements within the Jiangsu Power Grid, the power grid data used in this study cannot be freely disclosed. However, the data can be made available upon request from the corresponding author.

Acknowledgments

The authors would like to thank the anonymous reviewers for their constructive comments and suggestions.

Conflicts of Interest

Author Siwei Li was employed by the company Holley Technology. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Kim, J.S.; Choi, K.N.; Kang, S.W. Infrared thermal image-based sustainable fault detection for electrical facilities. Sustainability 2021, 13, 557. [Google Scholar] [CrossRef]
  2. Tang, Z.; Jian, X. Thermal fault diagnosis of complex electrical equipment based on infrared image recognition. Sci. Rep. 2024, 14, 5547. [Google Scholar] [CrossRef]
  3. Meradi, S. Experimental validation of convolutional neural network-based fault detection in power overhead lines using integrated thermal imaging and current measurement techniques. J. Therm. Sci. Eng. Appl. 2025, 17, 091009. [Google Scholar] [CrossRef]
  4. Cordoni, F.; Bacchiega, G.; Bondani, G.; Radu, R.; Muradore, R. A multi–modal unsupervised fault detection system based on power signals and thermal imaging via deep autoencoder neural network. Eng. Appl. Artif. Intell. 2022, 110, 104729. [Google Scholar] [CrossRef]
  5. Cai, S.; Mao, Z.; Wang, Z.; Yin, M.; Karniadakis, G.E. Physics-informed neural networks (pinns) for fluid mechanics: A review. Acta Mech. Sin. 2021, 37, 1727–1738. [Google Scholar] [CrossRef]
  6. Cuomo, S.; Cola, V.S.D.; Giampaolo, F.; Rozza, G.; Raissi, M.; Piccialli, F. Scientific machine learning through physics–informed neural networks: Where we are and what’s next. J. Sci. Comput. 2022, 92, 88. [Google Scholar] [CrossRef]
  7. Cai, S.; Wang, Z.; Wang, S.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks for heat transfer problems. J. Heat Transf. 2021, 143, 060801. [Google Scholar] [CrossRef]
  8. Liu, Z.; Liu, Y.; Yan, X.; Liu, W.; Nie, H.; Guo, S.; Zhang, C. Automatic network structure discovery of physics informed neural networks via knowledge distillation. Nat. Commun. 2025, 16, 9558. [Google Scholar] [CrossRef]
  9. Chandra, S.; AlMansoor, K.; Chen, C.; Shi, Y.; Seo, H. Deep learning based infrared thermal image analysis of complex pavement defect conditions considering seasonal effect. Sensors 2022, 22, 9365. [Google Scholar] [CrossRef] [PubMed]
  10. Liang, S.; Jin, S.; Chen, Y. A review of edge computing technology and its applications in power systems. Energies 2024, 17, 3230. [Google Scholar] [CrossRef]
  11. Musa, A.; Kakudi, H.A.; Hassan, M.; Hamada, M.; Umar, U.; Salisu, M.L. Lightweight deep learning models for edge devices—A survey. Int. J. Comput. Inf. Syst. Ind. Manag. Appl. 2025, 17, 18. [Google Scholar]
  12. Kulkarni, U.; Meena, S.; Gurlahosur, S.V.; Benagi, P.; Kashyap, A.; Ansari, A.; Karnam, V. AI model compression for edge devices using optimization techniques. In Modern Approaches in Machine Learning and Cognitive Science: A Walkthrough: Latest Trends in AI, Volume 2; Springer: Berlin/Heidelberg, Germany, 2021; pp. 227–240. [Google Scholar]
  13. Wang, W.; Yu, B.; Ai, Q.; Liu, M.; Shuai, Y.; Wang, X.; Zhang, X. Reconstructing 3d temperature fields from sparse discrete data by analytical solution-embedded neural network. Appl. Therm. Eng. 2025, 264, 125448. [Google Scholar] [CrossRef]
  14. Serrano, S.A.; Martinez-Carranza, J.; Sucar, L.E. Knowledge transfer for cross-domain reinforcement learning: A systematic review. arXiv 2024, arXiv:2404.17687. [Google Scholar] [CrossRef]
  15. Zhang, Y.; Shen, G.; Wang, Q.; Qian, Y.; Wei, M.; Qin, J. Geobi-gnn: Geometry-aware bi-domain mesh denoising via graph neural networks. Comput.-Aided Des. 2022, 144, 103154. [Google Scholar] [CrossRef]
  16. Mao, C.; Jin, Y. Uncertainty quantification study of the physics-informed machine learning models for critical heat flux prediction. Prog. Nucl. Energy 2024, 170, 105097. [Google Scholar] [CrossRef]
  17. Valente, M.; Dias, T.C.; Guerra, V.; Ventura, R. Physics-consistent machine learning with output projection onto physical manifolds. Commun. Phys. 2025, 8, 433. [Google Scholar] [CrossRef]
  18. Jadin, M.; Taib, S. Infrared image enhancement and segmentation for extracting the thermal anomalies in electrical equipment. Elektron. Elektrotech. 2012, 120, 107–112. [Google Scholar] [CrossRef]
  19. Fu, J.; Li, H.; Sun, X.; He, T.; Zhang, G.; Wei, C. Multi-physics simulation modeling and energy flow characterization of thermal management system for a sport utility vehicle under high-temperature conditions. Energy 2025, 316, 134653. [Google Scholar] [CrossRef]
  20. Bao, K.; Yao, W.; Zhang, X.; Peng, W.; Li, Y. A physics and data co-driven surrogate modeling approach for temperature field prediction on irregular geometric domain. Struct. Multidiscip. Optim. 2022, 65, 302. [Google Scholar] [CrossRef]
  21. Balakrishnan, G.K.; Yaw, C.T.; Koh, S.P.; Abedin, T.; Raj, A.A.; Tiong, S.K.; Chen, C.P. A review of infrared thermography for condition-based monitoring in electrical energy: Applications and recommendations. Energies 2022, 15, 6000. [Google Scholar] [CrossRef]
  22. Guo, C.; Ren, M.; Xia, C.; Dong, M.; Wang, B. Fault diagnosis of power equipment based on infrared image analysis. In Proceedings of the 2020 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA), Dalian, China, 25–27 August 2020; pp. 659–663. [Google Scholar]
  23. Ullah, I.; Khan, R.U.; Yang, F.; Wuttisittikulkij, L. Deep learning image-based defect detection in high voltage electrical equipment. Energies 2020, 13, 392. [Google Scholar] [CrossRef]
  24. Zhao, Z.; Feng, S.; Zhai, Y.; Zhao, W.; Li, G. Infrared thermal image instance segmentation method for power substation equipment based on visual feature reasoning. IEEE Trans. Instrum. Meas. 2023, 72, 5029613. [Google Scholar] [CrossRef]
  25. Ju, Z.; Chen, Y.; Qiang, Y.; Chen, X.; Ju, C.; Yang, J. A systematic review of data augmentation methods for intelligent fault diagnosis of rotating machinery under limited data conditions. Meas. Sci. Technol. 2024, 35, 122004. [Google Scholar] [CrossRef]
  26. Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
  27. Koric, S.; Abueidda, D.W. Data-driven and physics-informed deep learning operators for solution of heat conduction equation with parametric heat source. Int. J. Heat Mass Transf. 2023, 203, 123809. [Google Scholar] [CrossRef]
  28. Billah, M.M.; Khan, A.I.; Liu, J.; Dutta, P. Physics-informed deep neural network for inverse heat transfer problems in materials. Mater. Today Commun. 2023, 35, 106336. [Google Scholar] [CrossRef]
  29. Klawonn, A.; Lanser, M.; Weber, J. Machine learning and domain decomposition methods-a survey. Comput. Sci. Eng. 2024, 1, 2. [Google Scholar] [CrossRef]
  30. Krishnapriyan, A.; Gholami, A.; Zhe, S.; Kirby, R.; Mahoney, M.W. Characterizing possible failure modes in physics-informed neural networks. Adv. Neural Inf. Process. Syst. 2021, 34, 26548–26560. [Google Scholar]
  31. Pfaff, T.; Fortunato, M.; Sanchez-Gonzalez, A.; Battaglia, P. Learning mesh-based simulation with graph networks. In Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia, 30 April 2020. [Google Scholar]
  32. Chen, J.; Hachem, E.; Viquerat, J. Graph neural networks for laminar flow prediction around random two-dimensional shapes. Phys. Fluids 2021, 33, 123607. [Google Scholar] [CrossRef]
  33. Taghizadeh, M.; Nabian, M.A.; Alemazkoor, N. Multifidelity graph neural networks for efficient and accurate mesh-based partial differential equations surrogate modeling. Comput.-Aided Civ. Infrastruct. Eng. 2025, 40, 841–858. [Google Scholar] [CrossRef]
  34. Zheng, C.; Chen, H.; Cheng, Y.; Song, Z.; Wu, Y.; Li, C.; Cheng, J.; Yang, H.; Zhang, S. Bytegnn: Efficient graph neural network training at large scale. Proc. VLDB Endow. 2022, 15, 1228–1242. [Google Scholar] [CrossRef]
  35. Hinton, G. Distilling the knowledge in a neural network. arXiv 2015, arXiv:1503.02531. [Google Scholar] [CrossRef]
  36. Huang, T.; Zhang, Y.; Zheng, M.; You, S.; Wang, F.; Qian, C.; Xu, C. Knowledge diffusion for distillation. Adv. Neural Inf. Process. Syst. 2023, 36, 65299–65316. [Google Scholar]
  37. Huang, H.; Ye, Y.; Zhang, B.; Xie, Z.; Xu, F.; Chen, C. Physics-informed neural network compression mechanism for airfoil flow field prediction. Phys. Fluids 2025, 37, 036115. [Google Scholar] [CrossRef]
  38. Bonfanti, A.; Santana, R.; Ellero, M.; Gholami, B. On the generalization of pinns outside the training domain and the hyperparameters influencing it. Neural Comput. Appl. 2024, 36, 22677–22696. [Google Scholar] [CrossRef]
  39. Appavu, N. Analysing the effect of edge-optimized deep learning models on improving low-powered iot devices real-time object detection. In Proceedings of the 2025 9th International Conference on Inventive Systems and Control (ICISC), Coimbatore, India, 12–13 August 2025; pp. 1663–1669. [Google Scholar]
  40. Gao, P.; Li, S.-M.; Wang, F.; Fujita, H.; Aljuaid, H.; Yuan, R.-Y. Learning multi-level graph attentional representation for thermal infrared object tracking. Eng. Appl. Artif. Intell. 2025, 155, 110957. [Google Scholar] [CrossRef]
  41. Jing, Q.; Guo, Y.; Liu, Y.; Wang, Y.; Du, C.; Liu, X. Optimization study of energy saving control strategy of carbon dioxide heat pump water heater system under the perspective of energy storage. Appl. Therm. Eng. 2025, 283, 129030. [Google Scholar] [CrossRef]
  42. Bergman, T.L. Fundamentals of Heat and Mass Transfer; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
  43. Carslaw, H.; Jaeger, J. Conduction of Heat in Solids; Clarendon: Oxford, UK, 1959. [Google Scholar]
Figure 1. Image of a single-phase single-position metering box used in experiments. (a) Infrared thermal imaging image of the metering box, and (b) visible light image of the metering box.
Figure 1. Image of a single-phase single-position metering box used in experiments. (a) Infrared thermal imaging image of the metering box, and (b) visible light image of the metering box.
Electronics 15 00410 g001
Figure 2. The overall architecture of the proposed PhysGraphIR framework. It illustrates the data flow from sparse infrared observations and geometric inputs to the Physics Knowledge Distillation (Physics-KD) module, regulated by the Adaptive Residual Sampling (ARS) mechanism, finally yielding high-resolution thermal field reconstruction and hotspot alarms.
Figure 2. The overall architecture of the proposed PhysGraphIR framework. It illustrates the data flow from sparse infrared observations and geometric inputs to the Physics Knowledge Distillation (Physics-KD) module, regulated by the Adaptive Residual Sampling (ARS) mechanism, finally yielding high-resolution thermal field reconstruction and hotspot alarms.
Electronics 15 00410 g002
Figure 3. Module-level architecture of the physics-aware gating network within the ARS mechanism.
Figure 3. Module-level architecture of the physics-aware gating network within the ARS mechanism.
Electronics 15 00410 g003
Figure 4. Control flow diagram of the training iteration. Unlike standard methods that compute physics residuals for all nodes, the ARS mechanism filters nodes based on the gradient threshold τ g r a d . Only critical nodes enter the PDE computation path, while the remaining 90% are skipped.
Figure 4. Control flow diagram of the training iteration. Unlike standard methods that compute physics residuals for all nodes, the ARS mechanism filters nodes based on the gradient threshold τ g r a d . Only critical nodes enter the PDE computation path, while the remaining 90% are skipped.
Electronics 15 00410 g004
Figure 6. On-site photograph.
Figure 6. On-site photograph.
Electronics 15 00410 g006
Figure 7. Training and validation loss comparison.
Figure 7. Training and validation loss comparison.
Electronics 15 00410 g007
Figure 8. ARS mechanism analysis. (a) Critical node selection; (b) performance impact of ARS mechanism.
Figure 8. ARS mechanism analysis. (a) Critical node selection; (b) performance impact of ARS mechanism.
Electronics 15 00410 g008
Figure 9. Knowledge transfer effect and data efficiency of the Physics-KD mechanism. (a) Knowledge transfer effect, (b) data efficiency, and (c) new box adaptation.
Figure 9. Knowledge transfer effect and data efficiency of the Physics-KD mechanism. (a) Knowledge transfer effect, (b) data efficiency, and (c) new box adaptation.
Electronics 15 00410 g009
Figure 10. Temperature field prediction visualization comparison of different methods on the RealMeterIR dataset.
Figure 10. Temperature field prediction visualization comparison of different methods on the RealMeterIR dataset.
Electronics 15 00410 g010
Figure 11. ARS mechanism heatmap. (a) Infrared image; (b) ARS importance score.
Figure 11. ARS mechanism heatmap. (a) Infrared image; (b) ARS importance score.
Electronics 15 00410 g011
Figure 12. Spatial distribution heatmaps of absolute PDE residuals. (a) residual distribution of PINN-GNN; (b) Residual distribution of PhysGraphIR.
Figure 12. Spatial distribution heatmaps of absolute PDE residuals. (a) residual distribution of PINN-GNN; (b) Residual distribution of PhysGraphIR.
Electronics 15 00410 g012
Table 1. Comparison of state-of-the-art methods for thermal field prediction.
Table 1. Comparison of state-of-the-art methods for thermal field prediction.
MethodPhysics ConsistencyGeometric AdaptabilityComputational EfficiencyEdge Deployment Ready
Standard CNNsLow (Data-driven)Low (Fixed resolution)High (Fast inference)Yes
Standard PINNsHigh (PDE constraints)Low (Retraining required)Low (Heavy collocation)No
Unified PINN-GNNHighHigh (Graph-based)Medium (Uniform residual)Limited
PhysGraphIR (Ours)High (Physics-KD)High (Decoupled)Very High (ARS)Yes (Lightweight)
Table 2. Baseline methods for performance comparison.
Table 2. Baseline methods for performance comparison.
MethodTypeCharacteristics
U-NetPure data-drivenCNN architecture, no physics consideration
Standard PINNPhysics-informedFully connected network solving PDE
PINN-GNNPhysics + GraphGNN for geometry, PINN for physics
GNN-onlyPure data-drivenGNN for spatial relationships only
Physics-KD (w/o ARS)Physics knowledge distillationKnowledge distillation without adaptive sampling
ARS (w/o KD)Adaptive samplingARS mechanism without knowledge distillation
PhysGraphIRProposed methodARS + Physics-KD joint optimization
Table 3. Performance comparison of prediction accuracy and physics consistency under infrared data missing conditions.
Table 3. Performance comparison of prediction accuracy and physics consistency under infrared data missing conditions.
MethodSynMeterBox (50% Missing)SynMeterBox (60% Missing)RealMeterIR
Overall Hotspot Phy. Const. Overall Hotspot Phy. Const. Overall Hotspot
MAE (°C) MAE (°C) ( L pde ) MAE (°C) MAE (°C) ( L pde ) MAE (°C) MAE (°C)
U-Net18.325.60.8522.131.80.9228.4
Standard PINN14.720.10.3116.923.50.3522.7
PINN-GNN15.218.90.2817.322.10.3021.5
GNN-only16.821.40.7819.525.60.8823.9
Physics-KD (w/o ARS)13.516.80.2215.119.20.2518.7
ARS (w/o KD)14.117.30.2415.919.80.2719.1
PhysGraphIR (Ours)12.114.80.1912.815.60.2114.3
Table 4. Computational efficiency comparison.
Table 4. Computational efficiency comparison.
MethodTraining Speed (rel.)Inference Latency (ms)Memory Usage (MB)Model Size (KB)
U-Net1.2×8.81120850
Standard PINN0.8×120.078095
PINN-GNN1.0×45.01250180
GNN-only1.3×7.5980150
Physics-KD (w/o ARS)1.8×18.562052
ARS (w/o KD)2.5×12.0850170
PhysGraphIR (Ours)3.1×9.548048
Table 5. Cross-box adaptation performance (Target: Type C distribution box, with only 5 samples).
Table 5. Cross-box adaptation performance (Target: Type C distribution box, with only 5 samples).
MethodTraining SamplesHotspot MAE (°C)Training Time (s)
PINN-GNN (from scratch)6021.5180
Physics-KD (from scratch)6018.7145
Transfer Learning (U-Net)525.835
Transfer Learning (PINN-GNN)523.140
PhysGraphIR (Ours)514.928
Table 6. Ablation study of PhysGraphIR components.
Table 6. Ablation study of PhysGraphIR components.
ConfigurationHotspot MAE (°C)PDE ResidualTraining Speed (rel.)
Baseline (PINN-GNN)22.10.28 1.0 ×
+ ARS19.30.23 2.6 ×
+ Physics-KD19.80.24 1.9 ×
ARS + Physics-KD18.50.22 2.3 ×
Full Model (w/optimized weights)15.60.19 3.1 ×
Table 7. PhysGraphIR performance on Mac devices.
Table 7. PhysGraphIR performance on Mac devices.
Optimization StrategyInference Latency (ms)Memory Usage (MB)Power (W)Model Size (KB)
FP32, no graph simplification12.34800.95190
INT8, no graph simplification10.84800.8595
FP32, K-core compression10.53800.82190
INT8, K-core compression9.53200.7848
Table 8. PhysGraphIR performance on edge devices.
Table 8. PhysGraphIR performance on edge devices.
DeviceInference Latency (ms)Memory Usage (MB)Power Consumption (W)Application Scenario
Apple M1 Mac9.53200.78Meter box local gateway
Raspberry Pi 428.32902.5Low-cost edge node
iPhone 1315.22800.65Mobile inspection device
Jetson Nano12.73502.1AI edge computing unit
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, H.; Li, S.; Yu, X.; He, X. PhysGraphIR: Adaptive Physics-Informed Graph Learning for Infrared Thermal Field Prediction in Meter Boxes with Residual Sampling and Knowledge Distillation. Electronics 2026, 15, 410. https://doi.org/10.3390/electronics15020410

AMA Style

Li H, Li S, Yu X, He X. PhysGraphIR: Adaptive Physics-Informed Graph Learning for Infrared Thermal Field Prediction in Meter Boxes with Residual Sampling and Knowledge Distillation. Electronics. 2026; 15(2):410. https://doi.org/10.3390/electronics15020410

Chicago/Turabian Style

Li, Hao, Siwei Li, Xiuli Yu, and Xinze He. 2026. "PhysGraphIR: Adaptive Physics-Informed Graph Learning for Infrared Thermal Field Prediction in Meter Boxes with Residual Sampling and Knowledge Distillation" Electronics 15, no. 2: 410. https://doi.org/10.3390/electronics15020410

APA Style

Li, H., Li, S., Yu, X., & He, X. (2026). PhysGraphIR: Adaptive Physics-Informed Graph Learning for Infrared Thermal Field Prediction in Meter Boxes with Residual Sampling and Knowledge Distillation. Electronics, 15(2), 410. https://doi.org/10.3390/electronics15020410

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop