Strengths and Contributions. The proposed framework delivers three complementary advantages:
(i) physics-guided augmentation (PIAM) increases sample efficiency and preserves physical plausibility;
(ii) self-supervised invariant representations (SSIRL) reduce label dependency while retaining defect-specific cues; and
(iii) adversarial alignment (AFA) maintains transferability across heterogeneous concretes.
4.1. Strengths of the Proposed Framework
Recent developments in artificial intelligence for structural health monitoring (SHM) have explored physics-informed, self-supervised, and few-shot learning methods, yet these paradigms have largely evolved in isolation or through partial pairwise combinations. The proposed framework distinguishes itself by integrating all three elements—physics-informed augmentation (PIAM), self-supervised invariant representation learning (SSIRL), and adversarial domain alignment (AFA)—into a single synergistic cycle. This combination allows the model to simultaneously address the dual challenges of data scarcity and domain variability, which have historically limited the deployment of intelligent SHM systems. Unlike previous approaches that rely solely on data-driven embeddings, the present framework ensures that feature representations remain both physically interpretable and generalizable across diverse structures [
41,
42].
The physics-guided foundation of the proposed approach represents a key methodological advancement. Physics-informed neural networks (PINNs) have recently shown their ability to capture deflection and bending-moment fields in structural components by embedding mechanical equations into their learning process [
41]. However, these models typically treat physics as a constraint within the loss function. In contrast, the PIAM module in this work employs physical modeling as a data-generation engine, producing synthetic ultrasonic signals through stochastic elastodynamic simulations that enrich the learning space. This physically grounded augmentation ensures that the model learns causally valid relationships between material parameters and signal responses, effectively mitigating the risk of overfitting to spurious correlations. Recent reviews confirm that machine learning in ultrasonics increasingly benefits from such hybrid physical-data strategies, which improve both interpretability and adaptability in non-destructive testing (NDT) systems [
43].
Parallel to this physics-guided foundation, the framework extends the capabilities of few-shot and self-supervised learning for data-efficient defect detection. Few-shot architectures such as prototypical networks have achieved substantial progress in guided-wave SHM applications [
40]. Similarly, recent studies combining self-supervised and contrastive learning for sensor-based fault detection have reported substantial performance improvements even with minimal labeled data [
44,
45,
46]. Building on these advances, the present framework unites both paradigms within a physics-consistent environment, where self-supervised pretraining operates on a mixture of real and PIAM-simulated signals. This dual-source training strategy yields embeddings that are both data-efficient and physically robust, ensuring stable performance even in extreme one-shot conditions.
The proposed Adversarial Feature Aligner (AFA) further strengthens cross-domain robustness by addressing discrepancies between laboratory and field datasets. Traditional SHM models often fail when faced with material or coupling variations not represented in their training data. Domain adaptation and transfer learning have recently emerged as effective methods to mitigate such distributional shifts. For instance, Bowler and Watson (2021) [
47] demonstrated that reflection-mode ultrasonic sensing combined with transfer learning can maintain predictive accuracy above 96% across different industrial processes without requiring labeled target data. Inspired by these findings, the AFA module employs a Wasserstein-based adversarial alignment to minimize inter-domain feature distances, enabling invariant feature extraction across heterogeneous concrete types and inspection setups [
42,
47]. This ensures that the model maintains high accuracy and reliability even under real-world variability.
Beyond its algorithmic structure, the framework also exemplifies methodological rigor and engineering readiness. In line with the growing emphasis on statistical robustness in AI-based SHM research, the evaluation protocol incorporates root-mean-square error (RMSE), coefficient of determination (R
2), and bootstrap-based confidence intervals (95% BCa CI), supplemented by Bonferroni-corrected t-tests to confirm significance. Such formal statistical testing aligns with the highest standards of reproducibility and transparency recently advocated in hybrid SHM–NDT studies [
39,
48]. Furthermore, the FPGA–GPU hybrid architecture transforms the system from a static inference device into an adaptive learning instrument. While existing embedded AI systems typically separate simulation and inference tasks, the present implementation performs real-time physics simulations on the FPGA (Zynq Ultrascale+) and concurrent inference on the Jetson AGX GPU. This hybridization bridges high-fidelity modeling with edge computing, allowing engineers to recalibrate the model on-site using a single sample—a paradigm shift from static to adaptive SHM systems [
43,
49].
In summary, the synergy of physics-informed augmentation, self-supervised representation learning, and adversarial domain alignment establishes a unified and field-ready SHM paradigm that bridges the gap between scientific innovation and real-world applicability. This integrated design not only achieves methodological excellence but also enables adaptive deployment in challenging inspection environments, fulfilling the dual goals of academic rigor and engineering impact sought by contemporary SHM research.
These strengths are empirically evidenced by the 12–15 pp gains over few-shot baselines (
Section 3.1), the graceful performance under domain shifts (
Section 3.2), and the ablation/sensitivity results confirming the mutually reinforcing cycle of PIAM–SSIRL–AFA (
Section 3.3,
Section 3.4 and
Section 3.5).
4.2. Limitations and Future Directions
While the proposed physics-guided self-supervised few-shot framework demonstrates strong performance in controlled experiments, several limitations emerge when transitioning toward real-world deployment scenarios. The most critical limitation lies in the variability of transducer coupling, which remains one of the most frequent practical challenges in ultrasonic inspections. Surface roughness, moisture, or debris often alter coupling efficiency and can introduce signal artifacts that the contrastive learning strategy may incorrectly associate with genuine defect signatures [
50]. As noted by [
47], such sensor-related inconsistencies pose a major barrier to domain generalization in ultrasonic sensing systems. To mitigate this, future research should incorporate stochastic coupling modeling into the PIAM module—introducing an attenuating interface layer that varies in impedance—to train the network toward coupling-invariant representations. Additionally, embedded self-calibration routines could be developed for field devices, allowing real-time correction of coupling effects during inspections.
A second key limitation concerns the material modeling assumptions within the physics-informed augmentation stage. The current PIAM formulation assumes isotropic elastic behavior, whereas actual reinforced concrete often exhibits anisotropy and heterogeneity due to embedded steel bars and localized cracking [
51]. This simplification may reduce the fidelity of synthetic data when representing complex structural geometries. Similar challenges have been reported in recent PINN-based SHM research, where Kirchhoff–Love plate formulations captured deflections effectively but showed reduced accuracy under anisotropic conditions [
41]. Future improvements should therefore include anisotropic parameterization of stiffness tensors and density distributions within the stochastic simulation process.
Moreover, the framework currently focuses on bulk-wave propagation, neglecting surface and guided waves that dominate in thin components such as bridge decks, pavements, and tunnel linings [
52]. As demonstrated in recent studies on guided-wave SHM [
42,
53], integrating multiple wave modes could substantially enhance sensitivity to near-surface delaminations and corrosion defects. Extending PIAM to incorporate surface wave physics thus represents an important step toward broader structural applicability.
From a methodological standpoint, the domain adaptation module (AFA) assumes the availability of representative target-domain samples during training. In practice, such samples may be scarce when inspecting new or unique structure types. To address this, domain adaptation could be reformulated as an unsupervised or transfer-based learning task—leveraging physics-generated synthetic data to bridge unseen domains without explicit target labels. The use of adversarial alignment has proven beneficial in other ultrasonic applications [
47], but incorporating cross-domain contrastive pretraining and few-shot meta-adaptation would further improve transferability.
Another limitation arises from the single-channel data acquisition setup. Commercial SHM and NDT systems increasingly rely on sensor arrays and phased-array transducers to enable spatial localization of defects [
54]. While the proposed model processes temporal ultrasonic signals effectively, future iterations could integrate multi-channel spatial encoding into the SSIRL framework. This would allow the system to capture spatio-temporal correlations, enabling not only classification but also localization and severity estimation of structural defects.
From the computational perspective, although the hybrid FPGA–GPU implementation achieves real-time performance, scalability challenges persist. The finite-difference time-domain (FDTD) solver scales cubically with simulation domain size, limiting high-resolution modeling of large structural components such as bridge piers or dam segments [
55]. Transformer-based encoders also exhibit quadratic memory growth with input sequence length, constraining the maximum detectable defect depth [
56]. Similar scalability issues have been noted in recent hybrid SHM–NDT implementations [
43,
48]. Addressing these challenges will require multi-scale simulation strategies, coupling coarse-grid domains with analytical or reduced-order submodels [
57]. Likewise, adopting hybrid CNN–Transformer architectures could maintain long-range temporal dependencies while reducing computational overhead [
58]. Additionally, incorporating curriculum learning strategies—where task difficulty is gradually increased during meta-training—could accelerate convergence and stabilize adversarial optimization [
59,
60].
From an operational standpoint, field deployment introduces further environmental and energy constraints. Although the prototype consumes only 1.2 W in active operation, prolonged use in remote inspections will necessitate improvements in power management and adaptive sampling based on model confidence [
61]. Outdoor use also exposes transducers to dust, humidity, and temperature fluctuations, potentially degrading sensitivity and necessitating adaptive feature alignment mechanisms for dynamic environmental compensation [
62]. As highlighted by [
48], such hybrid SHM–NDT configurations must balance real-time accuracy with long-term robustness under uncertain environmental dynamics.
Looking ahead, several future research directions emerge. First, integrating multi-scale physics simulations into PIAM would allow simultaneous modeling of large structural behavior and localized defect dynamics, improving both realism and efficiency. Second, expanding the wave physics spectrum—to include surface and guided wave propagation—would increase the versatility of the framework across material types and geometries. Third, exploring multi-sensor fusion and active learning strategies could reduce reliance on pre-labeled data, enabling autonomous model refinement in the field. Finally, advancing energy-efficient hardware co-design—combining FPGA acceleration with neuromorphic or low-power AI processors—could make real-time SHM feasible in remote or embedded applications [
47,
48]. Beyond concrete applications, similar physics-informed hybrid strategies could also be extended to asphalt pavements, where viscoelastic surface waves dominate the response [
63], and to metallic or composite structures, where the physics-based module could be reformulated using Maxwell’s equations or coupled electromechanical solvers for eddy-current-based defect detection [
64]. These cross-material extensions would broaden the framework’s applicability across diverse engineering domains while preserving its physics-consistent foundation.
Collectively, addressing these limitations and directions will enhance the framework’s reliability, adaptability, and sustainability, ensuring that the synergy between physics-based modeling and data-driven learning evolves into fully deployable, self-adaptive SHM technology for civil infrastructure.
Importantly, these limitations do not undermine the core findings reported in
Section 3; rather, they motivate future extensions (anisotropy, multi-sensor fusion, guided waves) that are likely to further strengthen cross-domain robustness.
4.4. Practical Implications
The experimental results and validation analyses presented in this study have direct implications for the practical deployment of AI-driven ultrasonic inspection systems in structural health monitoring (SHM) and non-destructive testing (NDT). The proposed physics-guided self-supervised few-shot learning framework demonstrates that combining physics-based modeling with adaptive learning mechanisms can substantially reduce the dependence on large labeled datasets—one of the main obstacles in field implementation of intelligent inspection tools.
4.4.1. Toward Adaptive, Field-Ready SHM Systems
Traditional SHM algorithms are trained offline and transferred to the field as static models that often fail under domain shifts caused by environmental or material variability. In contrast, the present FPGA–GPU hybrid implementation enables on-site adaptive learning: the FPGA executes physics-based FDTD simulations to generate physically consistent signal augmentations, while the GPU retrains the model in real time using these augmented samples. This configuration transforms the inspection unit from a passive inference device into an active learning system, capable of self-calibration and domain adaptation directly in the field [
47,
48].
Such adaptability is crucial for large-scale infrastructure applications—bridges, tunnels, and dams—where environmental conditions and coupling interfaces vary continuously. Similar hybrid SHM–NDT approaches have recently been proposed for metal components and aerospace structures [
48], confirming that real-time co-design of hardware and learning algorithms is a decisive step toward scalable deployment.
4.4.2. Engineering and Operational Benefits
From an engineering perspective, the demonstrated one-shot and few-shot capabilities have profound operational advantages. Achieving more than 60% accuracy with a single labeled sample means that inspectors can perform rapid pre-screening of large structures using minimal calibration data. When five labeled examples per defect type are available, accuracy surpasses 85%, making the system suitable for semi-automated assessment in maintenance workflows.
This performance profile supports a tiered inspection strategy:
Collect a small number of reference ultrasonic signals on-site;
Generate synthetic augmentations using the embedded PIAM module;
Fine-tune the model within minutes on portable GPU hardware;
Perform real-time scans with probabilistic defect classification outputs.
Such a workflow reduces inspection time and cost while maintaining high reliability and interpretability—key requirements emphasized in recent SHM digitalization frameworks [
43,
53]. In addition, the probability-based outputs (e.g., Crack: 94%, Void: 4%, Other: 2%) provide intuitive diagnostic feedback that engineers can integrate with traditional confirmatory tests such as coring, rebound-hammer, or ground-penetrating radar measurements.
4.4.3. Societal and Sustainability Impact
Beyond immediate engineering use, this framework contributes to broader goals of infrastructure sustainability and resilience. By enabling early-stage, non-invasive defect detection with minimal data, the approach reduces material waste, inspection frequency, and human exposure to hazardous environments. The hardware’s low-power consumption (≈1.2 W) and compatibility with portable, battery-operated inspection kits make it well-suited for remote monitoring in developing regions or post-disaster assessments [
48].
These qualities align with the recent paradigm shift in civil-infrastructure monitoring from reactive maintenance toward predictive, physics-aware digital twins [
41,
53]. Embedding the proposed framework within such digital-twin environments could allow continuous data assimilation and adaptive recalibration as new measurements become available.
4.4.4. Outlook
Overall, the presented system represents an important advancement toward AI-enabled, self-adapting SHM technology. The demonstrated synergy between physics-based simulation, self-supervised learning, and domain adaptation provides a blueprint for future intelligent inspection platforms that are accurate, explainable, and energy-efficient. As future research addresses the identified limitations—particularly anisotropic material modeling, multi-sensor fusion, and multi-scale simulation—the framework is expected to evolve into a fully deployable, autonomous inspection solution capable of enhancing the safety, efficiency, and sustainability of civil infrastructure systems.
Taken together with the real-time, low-power hardware profile (
Section 3.4), the framework provides a practical pathway from laboratory validation to field-ready SHM workflows under severe label scarcity.