1. Introduction
In a 2003 product lifecycle management course at the University of Michigan, Professor Michael Grieves first introduced a framework he initially termed the “Mirrored Space Model” [1], which was later renamed the Digital Twin in 2011. In 2012, the National Aeronautics and Space Administration (NASA) released a technology roadmap describing the Digital Twin as a virtual mirror of a physical product that integrates multidisciplinary, multiscale simulation processes to reflect its entire lifecycle, utilizing physical models, sensor data, and historical data [2]. Digital Twins have attracted increasing research interest from the academic community in recent years owing to advances in new-generation information technologies [3], with widespread practical applications in the industrial sector [4].
In the development of Digital Twin technology, constructing a Digital Twin model is a crucial prerequisite for its realization. Research on the construction of high-fidelity Digital Twin models has attracted significant interest from experts, scholars, and corporate institutions worldwide [5]. The initial conceptual model of the Digital Twin was a three-dimensional model comprising the physical entity, the virtual entity, and the connections between them [6]. Subsequently, Fei et al. extended this three-dimensional model into a five-dimensional Digital Twin model that adds twin data and services [7]. Theodor et al. [8] proposed a four-dimensional Digital Twin architecture comprising data acquisition and transmission, virtual twins, predictive twins, and decision twins; these four dimensions are categorized by function, helping operators understand the capabilities of the virtual entity model. Zheng et al. [9] proposed a Digital Twin product lifecycle management application framework based on the Physical Space, Information Processing Layer, and Virtual Space. These models provide standardized reference frameworks for the construction of Digital Twins.
Digital Twins have been extensively applied in various fields, including aerospace [10,11], manufacturing workshops [12,13], smart cities [14,15], healthcare [16,17], and smart water management [18,19]. In smart water management in particular, the continuous development of Digital Twin modeling and simulation has yielded remarkable progress in both facility-level and plant-wide optimization of water purification systems. For example, in settling technology, Plósz et al. extended and validated the Vesilind function for hindered settling, deriving a new exponential function that addresses compression settling velocity [20]. In pump technology, Nguyen et al. developed an adjustable tongue vane that controls the internal flow direction in the volute to improve the energy performance of a single-channel pump [21]. In whole-plant wastewater treatment modeling and optimization, Ekama integrated steady-state models of the activated sludge process and sludge digestion with stoichiometric conversions of biological processes, developing a whole-plant wastewater treatment model based on mass balances of carbon, hydrogen, oxygen, nitrogen, chemical oxygen demand, and charge to aid in the layout design of treatment plants [22]. In flow monitoring and hydraulic optimization, Matias et al. combined in situ experiments with computational fluid dynamics (CFD): Large-Scale Particle Image Velocimetry (LS-PIV) was used to obtain flow characteristic parameters such as cross-sectional velocity distribution and discharge rate, and a numerical model based on the Reynolds-averaged Navier–Stokes equations with the k-ε turbulence model was established, ultimately calibrating the rectangular weir discharge equation and providing technical support for ensuring reservoir water quality and public health [23].
The Digital Twin implementation process can be summarized into the following core stages: requirement analysis and goal definition, model construction and hierarchical mapping, and data integration and real-time connection.
Despite these advances, a unified Digital Twin methodology for cross-domain applications has not been realized. Significant challenges persist, including disparities in application requirements, the integration complexity of multidomain heterogeneous models, and a lack of interoperability standards. Although recent studies have proposed various domain-specific solutions [24,25,26,27] and modular approaches to improve reusability [28,29,30,31], a systematic and universal methodology for constructing complex system-level Digital Twins has yet to be established.
To bridge this gap, this study proposes a novel Digital Twin modeling method based on a hierarchical decoupling architecture and topological connection mechanism. The main contributions of this study are summarized below.
System complexity is reduced through hierarchical functional decoupling, establishing an architectural foundation for independent component development and reuse.
A method for constructing component-level Digital Twins based on standardized information sets is proposed.
A multi-dimensional topological connection mechanism is designed based on graph theory.
To address the diverse functionalities and variable scenarios of system-level Digital Twins, a progressive construction method (“hierarchical decoupling-topological connection”) is proposed (Figure 1). This method first decomposes the overall system top–down into multiple relatively independent subsystems based on core functions, key characteristics, and application objectives. Each subsystem is then further refined into several basic components following the same principles of functional, characteristic, and scenario division. To achieve system integration, the physical and logical connection relationships between components must be precisely described: components are abstracted as nodes and their interactions as edges, an adjacency matrix is generated, and a topological connection model is constructed based on graph theory. Using this as a structural blueprint, components are assembled bottom–up, layer by layer, into subsystem Digital Twins, which are ultimately assembled into a complete system-level Digital Twin, thereby supporting the gradual restoration of system functions and the emergence of holistic behaviors.
The novelty of this study lies in three key aspects. First, unlike existing methods that rely on untargeted system decomposition, its scenario-driven hierarchical decoupling balances modular independence with integration feasibility. Second, it improves compatibility in component modeling by introducing standardized information sets for component-level Digital Twins. Third, through a graph theory-based multidimensional topological mechanism, it quantifies both physical and logical interactions, surpassing traditional approaches that consider only physical connections. Collectively, these aspects establish a “decomposition–modeling–integration” loop, offering a new approach to the construction of complex system-level Digital Twins.
The remainder of this paper is structured as follows: Section 2 reviews the challenges, core characteristics, and current methodological gaps of system-level Digital Twins identified in the literature. Section 3 introduces the construction method for standardized information sets and the partitioning and connection methods for system-level Digital Twins. Section 4 validates the feasibility of the proposed method through a typical modeling case study of a water purification system. The discussion is presented in Section 5, the main conclusions in Section 6, and future research directions in Section 7.
3. System-Level Digital Twin Assembly Methodology
The hierarchical decomposition of system Digital Twins can be performed according to multi-dimensional criteria, including temporal, spatial, and operational states. Entities at different levels typically exhibit distinct functions. During the construction of system-level Digital Twins, a spatial scale should be adopted as the fundamental partitioning criterion to establish the hierarchical architecture, with entities categorized into corresponding contextual units based on functionality.
The hierarchical decoupling architecture for system-level Digital Twins can be implemented as follows: First, the system is partitioned into component, subsystem, and system layers based on functional, dimensional, and state differences (where state differences refer to distinctions in operational or lifecycle states, such as data update frequency or component lifespan). At the component layer, a standardized information set (SIS) is established to store basic attribute information, physical parameters, structural materials, and operational parameters, whereas standardized service interfaces enable a “data-as-generated” dynamic construction mechanism in which SIS data changes directly trigger twin reconstruction. In the subsystem and system layers, graph theory-based adjacency matrices are constructed according to the spatial topology and functional dependencies of the physical system to represent the connection relationships between components or subsystems. Component-level Digital Twins can be aggregated bottom–up based on the topological connection rules defined by the adjacency matrix, first integrated into subsystem-level Digital Twins, and ultimately coupled into a complete system-level Digital Twin. The hierarchical and coupling relationships of this system integration structure are shown in Figure 2.
3.1. System Partitioning Methodology
The overall architecture of system-level Digital Twins adopts a hierarchical decoupling approach, the core of which is the structural decomposition of complex physical systems. Specifically, the fundamental basis for hierarchical partitioning comes from analyzing the intrinsic correlations of the system. Two key factors are prioritized in this analysis: (1) the degree of functional goal aggregation among system units and (2) the tightness of coupling in their physical connections.
System Level: At the highest level, the system level represents the macro-level functional implementation of the entire physical object. System-level Digital Twins focus on the comprehensive performance metrics of the entire system and the degree of fulfillment of the overall operational objectives. When delineating the specific scope of the Digital Twin system, one must consider not only the physical boundaries in material form but also jointly define them through the key input/output interfaces where the system interacts with its external environment or related systems. Simultaneously, the top-level performance metric requirements corresponding to the core tasks of the system-level Digital Twin must be comprehensively considered.
Subsystem Level: The subsystem level occupies the intermediate layer of the entire hierarchical system, bearing the collaborative logic of specific functional modules with the primary objective of describing and implementing localized coordination processes with well-defined functional orientations within the system. Digital Twins at this level are aggregated from component-level entities based on physical topological relationships. When delineating subsystem boundaries, the core criterion is the relative independence of information flow, specifically manifested by significantly higher data exchange and coordination requirements among functional units within a subsystem compared with the interaction needs between subsystems.
Component Level: As the fundamental unit layer, the component level resides at the base of the entire hierarchical architecture, and its modeling objects are the smallest functional units that constitute the physical system. When determining the specific partitioning scale for this level, it is necessary to fully balance the relationship between modeling precision requirements and practical engineering technical conditions. Specifically, each physical unit designated as an independent component must be capable of deploying and configuring the sensing systems required to perceive, monitor, and control its own critical operational state parameters without relying on external support.
3.2. Construction of Component-Level Digital Twin Models Based on Information Sets
This section elaborates a construction method for component-level Digital Twins based on standardized information sets. The method employs “standardized information sets” as both the unified data source and the driving core for physical components within the Digital Twin space. It encapsulates four categories of data, namely basic attribute information, physical parameter information, structural material information, and business parameter information, within a unified structure and provides plug-and-play instantiation capability for twins through a set of universal interfaces. Digital Twins can be automatically generated or updated on demand solely by invoking the internal resources of the information set. Any update to data within the information set drives synchronous changes in the Digital Twin’s state through these interfaces, thereby maintaining dynamic consistency with the physical entity.
The descriptive method for information set data is crucial for its successful transition from a theoretical framework to technical implementation, requiring simultaneous satisfaction of both low-level data parsing and code development requirements, as well as the upper-level application needs of component-level Digital Twins. Therefore, the descriptive format of information sets should balance human interpretability with machine executability.
Based on the above requirements, standardized information sets must clearly define the following four categories of key information to comprehensively describe component entities.
Basic attribute information used for identifying the component’s identity and static characteristics, including a globally unique identifier for unambiguous retrieval across systems and platforms; data category and version number to support backward compatibility during information set evolution; geometric topology description containing 3D entities, assembly constraints, and interface coordinate systems; and a topological connection endpoint list recording physical or logical interfaces with other components and their associated device IDs.
Physical parameter information characterizing the dynamic features and performance boundaries during component operation, including real-time measurements, cumulative statistics, extreme operating condition ranges, environmental excitation conditions, failure thresholds, and health status indicators, providing inputs for multi-physics simulation, condition monitoring, and lifespan prediction.
Structural material information describing the component’s material composition and macroscopic physical properties, including material type, standard systems, typical parameters, process history, alternative material indices, and simulation-oriented simplified characterization methods, providing a data foundation for strength, thermal analysis, and reliability assessment.
Business parameter information encapsulating universal rules and strategy templates for component operation, control, management, and decision-making, including start-stop logic, anomaly criteria, alarm classification, maintenance strategies, safety constraints, permission models, and coordination interfaces with external systems, providing orchestratable semantic support for upper-level business systems.
To achieve systematic integration and executability of the four aforementioned information categories, this study uses a rigorous set-theoretic mathematical modeling language to describe the information set, ensuring that its content can be precisely expressed and consistently processed. The mathematical model of the information set is given below:

SIS = {A, P, M, B}

where SIS represents the standardized information set, A denotes basic attribute information, P represents physical parameter information, M indicates structural material information, and B denotes business parameter information.
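As a minimal sketch, the four-category structure of the standardized information set (categories A, P, M, and B) can be represented as a simple container; the class name, field layout, and sample values below are illustrative, not the paper's normative schema:

```python
from dataclasses import dataclass, field

# Minimal sketch of a standardized information set with its four categories.
# Field names and sample values are illustrative assumptions.

@dataclass
class StandardizedInformationSet:
    A: dict = field(default_factory=dict)  # basic attribute information
    P: dict = field(default_factory=dict)  # physical parameter information
    M: dict = field(default_factory=dict)  # structural material information
    B: dict = field(default_factory=dict)  # business parameter information

sis = StandardizedInformationSet(
    A={"ID": "WT_Plant_01::DS_01", "version": "v1"},
    P={"S": {"rated_flow_m3h": 120.0}},
)
print(sorted(vars(sis).keys()))  # the four category slots: ['A', 'B', 'M', 'P']
```

Keeping each category a plain mapping mirrors the set-theoretic definition while leaving the internal schema of each category free to evolve with the information set's version number.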
Within the standardized information set, the basic attribute model provides detailed and precise descriptions of component physical characteristics, accurately capturing and digitally representing fundamental attributes such as unique identifiers, version numbers, production dates, and geometric and topological descriptions. The mathematical description of the basic attribute model is as follows:

A = {ID, G, TD, TC}

where ID represents the identification information of the physical entity in the standardized information set; G denotes the set of physical entity dimension attributes, including geometric dimensions, scale ranges, and geometric contours; and TD(i, j) is a matrix describing topological dependencies between components, including upstream and downstream related components. Specifically, TD ∈ {0, 1}N×N, where N denotes the number of components; TD(i, j) = 1 indicates that component i is an upstream dependency of component j, and TD(i, j) = 0 otherwise. TC is the set of topological constraint relationships between physical entities, including maximum connection numbers and compatibility rules.
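As an illustration, the binary dependency matrix TD can be assembled from declared upstream/downstream pairs; the component names and edge list below are hypothetical:

```python
# Build the binary topological-dependency matrix TD in {0,1}^(N x N),
# where TD[i][j] = 1 means component i is an upstream dependency of
# component j. Component IDs and edges are illustrative.

components = ["RWP", "DW", "P1"]        # rainwater pool -> distribution chamber -> pipeline
index = {c: k for k, c in enumerate(components)}
edges = [("RWP", "DW"), ("DW", "P1")]   # (upstream, downstream) pairs

N = len(components)
TD = [[0] * N for _ in range(N)]
for up, down in edges:
    TD[index[up]][index[down]] = 1

print(TD)  # [[0, 1, 0], [0, 0, 1], [0, 0, 0]]
```

Because TD is binary and directed, a row sum gives a component's number of downstream dependents, which can be checked against the connection limits recorded in TC.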
2. Physical Parameter Modeling
Within the standardized information set, the physical parameter model characterizes three aspects of physical components: real-time status, historical evolution, and responses to external events. Physical parameter modeling must meet two requirements: (1) it must cover the physical laws and data-driven rules that components follow during operation; and (2) it must include continuous or discrete descriptions of how state variables evolve over time. The mathematical model for the physical parameters is therefore as follows:

P = {S, Dk(t), E}

where S represents the static parameters of the physical entity, including the rated power, rated voltage, and rated flow rate of the equipment; Dk(t) denotes real-time operational state data during the physical entity’s working process, including real-time pressure, temperature, and flow rate, and satisfies the dynamic update equation dk(t) = dk(t − ∆t) + ∆dk(t), where ∆t is the sampling period and ∆dk(t) is the increment from t − ∆t to t; and E indicates environmental data of the physical entity, including ambient temperature, humidity, and environmental interference.
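The dynamic update rule dk(t) = dk(t − ∆t) + ∆dk(t) amounts to accumulating per-period increments onto the previous state; the sample values below are illustrative:

```python
# Incremental state update: d_k(t) = d_k(t - dt) + delta_d_k(t).
# Each sample adds the measured increment for one sampling period dt.
# The starting value and increments below are illustrative.

def update_state(history, increment):
    """Append the next state given the latest per-period increment."""
    history.append(history[-1] + increment)
    return history

d = [100.0]                      # d_k(t0), e.g. a flow-rate reading in m^3/h
for delta in [2.5, -1.0, 0.5]:   # delta_d_k at t1, t2, t3
    update_state(d, delta)

print(d)  # [100.0, 102.5, 101.5, 102.0]
```

Retaining the whole history list rather than only the latest value also covers the model's second aspect, the historical evolution of the state variable.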
3. Structural Material Modeling
The structural material model consists of material properties, manufacturing process attributes, environmental interaction parameters, and failure characteristics. Material properties provide the constitutive relationships of components, thereby supporting high-fidelity simulations and lifespan assessments. Manufacturing process attributes characterize the relationships between processing techniques, microstructure, and performance. Environmental interaction parameters describe the evolution patterns of coupled effects such as corrosion, oxidation, and irradiation in working environments. Material failure characteristics include long-term load-bearing failure features and instantaneous ultimate failure characteristics. The mathematical model of the structural material model is as follows:

M = {MAT, MFG, ENV(t), LOS}

where MAT represents the material properties, including material type, density, and thermal conductivity; MFG denotes the material manufacturing processes, in which fk represents a process parameter (such as welding technique, surface roughness, or dimensional tolerance), fstd is the standard process parameter, and τk is the allowable deviation, satisfying the inequality |fk − fstd| ≤ τk; ENV(t) represents the time-varying environmental durability parameters, including hygrothermal coupling coefficients, corrosion rates, and photodegradation rates; and LOS denotes the material failure characteristics, including fatigue life, creep life, and fracture toughness.
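A process parameter can be checked against the tolerance constraint |fk − fstd| ≤ τk as follows; the parameter names and values are illustrative, not taken from the case study:

```python
# Check manufacturing process parameters against |f_k - f_std| <= tau_k.
# The parameter table below is an illustrative assumption.

def within_tolerance(f_k, f_std, tau_k):
    """True if the measured process parameter stays inside the allowed band."""
    return abs(f_k - f_std) <= tau_k

params = {
    # name: (measured f_k, standard f_std, allowable deviation tau_k)
    "surface_roughness_um": (1.65, 1.60, 0.10),
    "dimensional_tol_mm":   (0.42, 0.30, 0.10),
}

results = {name: within_tolerance(*vals) for name, vals in params.items()}
print(results)  # {'surface_roughness_um': True, 'dimensional_tol_mm': False}
```

A component whose MFG check fails would then be flagged through the business parameter model's anomaly criteria rather than silently instantiated as a twin.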
4. Business Parameter Modeling
Within the standardized information set, the business parameter model encapsulates the behavioral rules and decision logic of components in operation, control, and management scenarios, abstracting industry standards and engineering experience into computable and orchestratable semantic units. This enables precise mapping and dynamic response of Digital Twins to business processes, thereby supporting adaptive state adjustment under complex working conditions. The mathematical model of the business parameters is as follows:

B = {CTL, SAF, MAINT, POL, AUD}

where CTL represents physical entity control strategies, including start-stop sequences, PID parameters, and priority rules. SAF denotes safety constraints, including anomaly thresholds, alarm levels, and fault tolerance strategies; here, dmin,k, dwarn,k, and dmax,k denote the three-level thresholds (lower limit, warning, and upper limit, respectively), and L represents the alarm level (normal, level 2 alarm, level 1 alarm): a level 2 alarm (L2) is triggered when dk(t) ∈ [dwarn,k, dmax,k], and a level 1 alarm (L1) is triggered when dk(t) > dmax,k or dk(t) < dmin,k. MAINT represents maintenance policies, such as periodic inspection cycles, condition-based maintenance thresholds, and emergency repair thresholds; here, mai1, mai2, and mai3 represent regular maintenance, condition-based maintenance, and emergency maintenance, respectively, where Treg denotes the regular maintenance cycle and dmaint,k the condition-based maintenance threshold. POL denotes policy configurations, including operator instruction sets and permission role tables. AUD represents auditing and tracing, encompassing operation logs, responsible person identifiers, and event timestamps.
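The SAF alarm rules can be sketched as a threshold classifier. Here the out-of-range case is labeled the highest-severity alarm (written "L1", matching the stated level set of normal, level 2, and level 1), and the threshold values are illustrative:

```python
# Three-level alarm classification from the SAF thresholds:
# normal if d stays below the warning band; "L2" if d is in
# [d_warn, d_max]; "L1" (highest severity) if d > d_max or d < d_min.
# Threshold values below are illustrative assumptions.

def alarm_level(d, d_min, d_warn, d_max):
    if d > d_max or d < d_min:
        return "L1"       # outside the safe operating range
    if d_warn <= d <= d_max:
        return "L2"       # inside the warning band
    return "normal"

# Example: pipeline flow velocity (m/s) with d_min=0.3, d_warn=2.0, d_max=2.5
print([alarm_level(v, 0.3, 2.0, 2.5) for v in [1.2, 2.2, 3.0, 0.1]])
# ['normal', 'L2', 'L1', 'L1']
```

Evaluating the out-of-range case first guarantees that a reading beyond dmax,k is never misclassified as a mere warning, even though it also satisfies dk(t) ≥ dwarn,k.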
The overall framework of the standardized information set is shown in Figure 3.
3.3. JSON-LD-Based Standardized Description and Construction Method for Information Sets
To achieve scalability and semantic consistency of information sets across platforms, we used JSON-LD as the structured data carrier. JSON-LD is a semantic data representation format based on JSON syntax that makes data semantics explicit through its context mechanism, thereby facilitating system interoperability and integration [52]. The format offers excellent readability and developer-friendliness, supports flexible schema extension, and strictly adheres to linked data principles. It enables seamless compatibility with other systems that use JSON-LD, making it suitable for the standardized representation of information sets.
The process for constructing information sets based on JSON-LD is shown in Figure 4. First, the structured, semi-structured, and unstructured data generated during the design, manufacturing, and operation phases are uniformly collected and preprocessed to form a heterogeneous raw dataset. Branch processing is then performed based on whether reusable domain ontologies exist: if available, the existing domain ontologies are reused; otherwise, new domain vocabularies must be constructed. Next, a JSON-LD skeleton is generated, and the data are mapped to the four core information categories (basic attributes, physical parameters, structural materials, and business parameters) using entity alignment and attribute-value population techniques to complete semantic encapsulation. Finally, all concepts and relationships are integrated to form a structured information set that comprehensively describes the characteristics of physical entities.
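As a minimal sketch of the skeleton-generation and population steps, the helper below builds a JSON-LD document with the four category slots; the @context URL, helper name, and all field values are placeholders:

```python
import json

# Build a JSON-LD skeleton for a component information set and populate
# its four core categories. The @context URL, function name, and all
# values are illustrative placeholders.

def build_information_set(component_type, component_id, **categories):
    skeleton = {
        "@context": "https://example.org/dt-context/water-treatment/v1",
        "@type": component_type,
        "@id": component_id,
        "basicAttributes": {},
        "physicalParameters": {},
        "structuralMaterial": {},
        "businessParameters": {},
    }
    skeleton.update(categories)  # attribute-value population step
    return skeleton

info_set = build_information_set(
    "Pipe", "WT_Plant_01::Pipe_01",
    basicAttributes={"ID": {"name": "pipeline1"}, "TD": {"upstreamComponent": "DS_01"}},
)
print(json.dumps(info_set, indent=2)[:80])  # serializes as ordinary JSON
```

Because JSON-LD is syntactically plain JSON, the resulting document can be stored, versioned, and parsed with standard JSON tooling while the @context keeps its semantics explicit.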
By integrating component characteristics and behavioral logic, the standardized information set establishes a single data source-driven mechanism for generating and updating Digital Twins. This ensures dynamic consistency between physical entities and digital models, enhances the construction efficiency and maintainability of component twins, provides a modular foundation for system-level Digital Twin integration, and supports machine-parsable model computation.
3.4. Graph Theory-Based System-Level Digital Twin Integration Method
Subsystem- and system-level Digital Twins comprise multiple components and their connection relationships. To achieve systematic integration of component-level Digital Twins, graph theory methods can be employed to represent and process system-level Digital Twins.
In graph theory, a graph consists of two types of elements, nodes and edges:

G = (V, E)

where V and E represent the sets of nodes and edges, respectively. In the context of system-level Digital Twins, individual components in the system are represented as nodes in the graph, whereas physical connections, mechanical couplings, and logical relationships between components are represented as edges.
The integration of Digital Twins based on topological connection graphs achieves hierarchical construction through adjacency matrices with the following specific process.
The connection relationships between components within a specific subsystem are extracted from the information set based on component attributes and topological information to construct a corresponding homogeneous adjacency matrix for that subsystem. This matrix encapsulates the internal topological structure of the subsystem and its external interfaces, forming an independently representable subsystem-level Digital Twin. At the system level, the homogeneous adjacency matrices of multiple subsystems are superimposed based on their interface relationships to integrate a system-level heterogeneous adjacency matrix. This matrix completely describes the overall system topology and serves as a connection blueprint for the system-level Digital Twin, enabling a unified topological reconstruction from the components to the system through matrix parsing.
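The superposition step can be sketched as follows: a homogeneous edge list is kept per relation type, and the system-level heterogeneous matrix records, for each ordered component pair, the set of relation types connecting them. The component IDs, relation types, and edges below are illustrative:

```python
# Superimpose homogeneous adjacency matrices (one per relation type)
# into a single heterogeneous system-level matrix. Each cell holds the
# set of relation types linking component i to component j.
# Component IDs, relation types, and edges are illustrative.

def empty_matrix(n):
    return [[set() for _ in range(n)] for _ in range(n)]

components = ["RWP", "DW", "P1", "Dosing"]
idx = {c: k for k, c in enumerate(components)}

hydraulic = [("RWP", "DW"), ("DW", "P1")]  # physical water-flow links
control   = [("Dosing", "DW")]             # logical dosing-control link

system = empty_matrix(len(components))
for rel_type, edges in [("hydraulic", hydraulic), ("control", control)]:
    for up, down in edges:
        system[idx[up]][idx[down]].add(rel_type)  # superposition step

print(system[idx["Dosing"]][idx["DW"]])  # {'control'}
```

Parsing this matrix back out recovers both the per-subsystem homogeneous views (filter by one relation type) and the full system topology, which is what makes it usable as the connection blueprint for bottom-up assembly.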
Figure 5 illustrates the topological connection process of the component-level digital twin.
4. Construction of a Simple Digital Twin Application Scenario for Water Purification Systems
The technical support system and core resource allocation for this research, which form the foundation for subsequent experimental case modeling and verification, are as follows. At the development hardware level, a terminal equipped with an Intel Core i7-12700H CPU, 32 GB DDR5 RAM, and an NVIDIA GeForce RTX 3060 graphics card was used for front-end coding, 3D scene debugging, and functional module integration. Front-end development was carried out in Visual Studio Code 1.85.1 for HTML structure construction, CSS styling, and JavaScript interaction logic development. The 3D visualization functionality was implemented using the Three.js 0.132.2 WebGL library in combination with the GLTFLoader plugin. The 3D model of the water supply system was created in Blender 3.5.1, consisting of 12 types of components—including RWP (rainwater pool), DW (distribution chamber), and P1–P12 (pipelines)—and exported in lightweight GLB format after polygon simplification in MeshLab 2022.02 to optimize web client loading performance.
Component parameters and operational standards strictly follow industry specifications such as the Standard for Design of Outdoor Water Supply Engineering (GB 50013-2018, Ministry of Housing and Urban-Rural Development of the People’s Republic of China, Beijing, China, 2018), providing a compliance basis for pipeline flow velocity thresholds, distribution chamber liquid level design, and component maintenance cycles. Additionally, a set of JSON-LD information sets was constructed to test the data retrieval functionality of platform components. Regarding process integrity, component selection covered the entire chain of the water supply system (water intake–pretreatment–filtration–disinfection–clean water transport), ensuring that the Digital Twin fully represents the operational workflow of the physical system.
To validate the proposed method, we selected a typical surface water treatment plant as an experimental case study. A top–down decomposition strategy was adopted to divide the system into four subsystems, namely “System Interaction with External Environment”, “Main Water Flow Treatment”, “Wastewater Recirculation”, and “Chemical Dosing”, based on the core functions, key characteristics, and target application scenarios of the surface water treatment plant.
The System Interaction with External Environment subsystem includes raw water intake and treated water output, including components such as the external environment, rainwater collection basin, and clear water reservoir. The Main Water Flow Treatment subsystem represents the core process of rainwater purification, including components such as distribution chambers, sedimentation tanks, siphon filters, and corresponding pipelines. The Wastewater Recirculation subsystem handles the recirculation and retreatment of backwash water and sludge, including components such as pipelines, valves, and corresponding control units. The Chemical Dosing subsystem is responsible for adding chemical agents to the main water flow to improve water quality, including components such as dosing equipment and corresponding chemical feed pipelines.
In summary, the division of the water purification system into four functional subsystems clarifies the role of each component in different processes, providing a clear structural basis for the construction of component information sets and system modeling.
4.1. Construction of Component-Level Digital Twins and Information Set Description
Each Digital Twin component is defined as an independent entity, with the core structure of the information set as follows: @context defines the semantic context to ensure unambiguous interpretation across components, @id serves as the globally unique identifier, @type describes the component type, basicAttributes describes basic attribute information, physicalParameters describes physical parameter information, businessParameters describes business parameter information, and structuralMaterial describes structural material information. This section provides detailed descriptions using the distribution chamber and pipeline1 as examples.
Listings 1 and 2 show the simplified information set models for the distribution chamber and pipeline1 (for the complete JSON-LD models, see Appendix A). In basicAttributes, the ID field defines metadata such as equipment name, model, and version; the G field describes the component’s geometric form, including diameter, height, and volume; and the TD field specifies the component’s position in the topology, identifying its upstream and downstream connections. In physicalParameters, the S field specifies the design operating conditions, and the D field represents the component’s real-time operational status, including dynamic sensor data such as inlet/outlet flow rates, water level, and turbidity. structuralMaterial defines information such as the component’s material, density, and corrosion resistance grade. businessParameters define the operational parameter logic, enabling digital management throughout the entire lifecycle.
Listing 1. Model of the distribution chamber information set.

{
  "@context": "https://example.org/dt-context/water-treatment/v1",
  "@type": "DistributionChamber",
  "@id": "WT_Plant_01::DS_01",
  "basicAttributes": { "ID": { … }, "G": { … }, "TD": { … }, "TC": { … } },
  "physicalParameters": { "S": { … }, "D": { … }, "E": { … } },
  "structuralMaterial": { "MAT": { … }, "MFG": { … }, "ENV": { … }, "LOS": { … } },
  "businessParameters": { "CTL": { … }, "SAF": { … }, "MAINT": { … }, "AUD": { … } }
}
Listing 2. Model of the pipeline1 information set.
{
  "@context": "https://example.org/dt-context/water-treatment/v1",
  "@type": "Pipe",
  "@id": "WT_Plant_01::Pipe_01",
  "basicAttributes": { … },
  "physicalParameters": { … },
  "structuralMaterial": { … },
  "businessParameters": { … }
}
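A minimal sketch of how such an information set can be parsed in Python is given below. The top-level keys (@context, @id, @type, basicAttributes, and the TD topology field) follow Listings 1 and 2; the concrete field values and sub-field names (name, model, upstreamComponent, downstreamComponent) are illustrative assumptions, not values taken from the case study.

```python
import json

# Abridged pipeline1 information set, following the structure of Listing 2.
# All concrete values below are illustrative placeholders.
pipe_doc = """
{
  "@context": "https://example.org/dt-context/water-treatment/v1",
  "@type": "Pipe",
  "@id": "WT_Plant_01::Pipe_01",
  "basicAttributes": {
    "ID": {"name": "pipeline1", "model": "DN100", "version": "1.0"},
    "TD": {"upstreamComponent": "WT_Plant_01::DS_01",
           "downstreamComponent": "WT_Plant_01::SP_01"}
  },
  "physicalParameters": {},
  "structuralMaterial": {},
  "businessParameters": {}
}
"""

component = json.loads(pipe_doc)

# @id is the globally unique identifier; the TD field exposes the
# upstream/downstream links used later for topology construction.
cid = component["@id"]
td = component["basicAttributes"]["TD"]
print(cid, td["upstreamComponent"], td["downstreamComponent"])
```

Because the document is plain JSON-LD, any platform that can parse JSON can extract the identifier and topology fields without component-specific code, which is what makes the components machine-readable and reusable.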
Through the construction of a standardized information set, each component in the case study possesses a machine-readable and semantically clear digital description, providing a nodal attribute foundation for subsequent topological connections. Each component is encapsulated as a self-contained independent unit that can be identified, parsed, and reused across different subsystems, and even cross-industry Digital Twins, significantly enhancing the reusability of the model.
Figure 6 and
Figure 7 respectively present the encapsulated standardized information sets for the distribution chamber and pipeline1.
4.2. Topology-Based System Integration and Twin Assembly
After completing the standardized construction of component-level Digital Twins, they must be integrated layer by layer into subsystem and system-level Digital Twins to support practical applications. Based on the described method, the first step is to construct a system adjacency matrix according to the component information sets, using this matrix as a structured blueprint for system integration. Components are subsequently assembled bottom–up layer by layer based on the topological rules defined by the adjacency matrix, ultimately completing the encapsulation and integration of the system-level Digital Twin.
The first step in system integration is to read each component's unique identifier from basicAttributes.ID in its information set, parse the upstreamComponent and downstreamComponent fields in basicAttributes.TD, and set weights at the corresponding positions of the adjacency matrix according to the relationship types. Finally, the multiple homogeneous adjacency matrices, each describing one type of relationship, are superimposed to form a complete description of the system's topological relationships.
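The step above can be sketched as follows. The component records and the short node identifiers (RWP, DW, SP, SFP, CWP, matching the abbreviations introduced with Figure 9) are illustrative; the paper's full information sets carry these links in basicAttributes.TD.

```python
import numpy as np

# Hypothetical component records: only the identifier and the downstream
# topology links matter for matrix construction.
components = [
    {"@id": "RWP", "TD": {"downstream": ["DW"]}},
    {"@id": "DW",  "TD": {"downstream": ["SP"]}},
    {"@id": "SP",  "TD": {"downstream": ["SFP"]}},
    {"@id": "SFP", "TD": {"downstream": ["CWP"]}},
    {"@id": "CWP", "TD": {"downstream": []}},
]

ids = [c["@id"] for c in components]
index = {cid: i for i, cid in enumerate(ids)}

# One homogeneous adjacency matrix per relationship type; only the
# water-flow matrix is built here. A weight of 1 marks a directed edge.
A_water = np.zeros((len(ids), len(ids)), dtype=int)
for c in components:
    for dst in c["TD"]["downstream"]:
        A_water[index[c["@id"]], index[dst]] = 1

print(A_water)
```

The same loop, run once per relationship type, yields the set of homogeneous matrices that are later superimposed.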
Figure 8 shows the representation form within the adjacency matrix, with the example object system being the water purification system shown in
Figure 9. Here, ENV represents the external environment, RWP denotes the rainwater collection pool, DW represents the distribution chamber, SP denotes the sedimentation pool, SFP represents the siphon filter pool, CWP denotes the clear water pool, DE denotes the dosing equipment, P1 indicates pipeline1, P2 indicates pipeline2, P3 represents pipeline3, and P4 represents pipeline4.
The system contains four independent flow paths, where the green edges represent interactions between the system and the external environment, blue edges indicate the water flow path, red edges indicate wastewater recirculation, and yellow edges represent chemical dosing, which are denoted by G, B, R, and Y, respectively. Therefore, four types of edges exist in the adjacency matrix.
In the water flow circuit, rainwater from the collection tank passes through the distribution chamber, sedimentation tank, and siphon filter before finally entering the clear water tank. In the chemical dosing circuit, chemicals are delivered from the dosing device to the sedimentation tank via Pipeline 2. In the sewage return circuit, wastewater from the clear water tank and the sedimentation tank is conveyed to the dosing device via Pipeline 4 and Pipeline 2, respectively. Furthermore, due to water evaporation, the rainwater collection tank and the clear water tank also interact with the external environment.
The topological structure of the system can thus be decomposed into four mutually independent isomorphic subgraphs, each containing only a single type of edge and corresponding to one homogeneous adjacency matrix. By superimposing these four isomorphic subgraphs, a complete topological description of the water purification system is obtained.
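The superposition step can be sketched on a toy three-node system. Only two of the four edge types (B for water flow, R for recirculation) are shown; the representation of the superimposed result as a per-type edge dictionary is one possible encoding, not necessarily the one used in the paper's implementation.

```python
import numpy as np

n = 3  # toy system with nodes 0, 1, 2

# One homogeneous adjacency matrix per edge type (G, B, R, Y in the paper);
# two types suffice to illustrate the superposition.
A_B = np.zeros((n, n), dtype=int); A_B[0, 1] = 1; A_B[1, 2] = 1  # water flow
A_R = np.zeros((n, n), dtype=int); A_R[2, 0] = 1                 # recirculation

# Superposition: combine the homogeneous matrices into one labelled
# heterogeneous description while keeping the edge types distinguishable.
layers = {"B": A_B, "R": A_R}
hetero = {etype: [(int(i), int(j)) for i, j in zip(*np.nonzero(A))]
          for etype, A in layers.items()}
print(hetero)  # {'B': [(0, 1), (1, 2)], 'R': [(2, 0)]}
```

Keeping the edge types as separate labels (rather than summing the matrices numerically) preserves the semantics of each flow path after superposition.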
The integration engine uses the adjacency matrix as input and automatically identifies and assembles components by executing graph theory algorithms. The integration engine reads the adjacency matrix of the water flow links, automatically finds paths from the source node (rainwater collection pool) to the sink node (clear water pool), retrieves the corresponding Digital Twin instances from the component library based on the identified component list, automatically performs logical binding of interfaces according to the connection relationships, and finally encapsulates these components and their internal relationships into an independent, identifiable subsystem-level Digital Twin.
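The source-to-sink path search performed by the integration engine can be sketched with a breadth-first search over the water-flow adjacency matrix. The chain topology and node abbreviations are the toy version of Figure 9; the paper does not specify which graph algorithm its engine uses, so BFS is an assumption here.

```python
from collections import deque

import numpy as np

# Water-flow adjacency matrix for RWP -> DW -> SP -> SFP -> CWP.
ids = ["RWP", "DW", "SP", "SFP", "CWP"]
A = np.zeros((5, 5), dtype=int)
for i in range(4):
    A[i, i + 1] = 1

def find_path(A, src, dst):
    """Breadth-first search over the adjacency matrix; returns one
    source-to-sink node path as a list of indices, or None."""
    prev = {src: None}
    q = deque([src])
    while q:
        u = q.popleft()
        if u == dst:
            path = []
            while u is not None:
                path.append(u)
                u = prev[u]
            return path[::-1]
        for v in np.nonzero(A[u])[0]:
            v = int(v)
            if v not in prev:
                prev[v] = u
                q.append(v)
    return None

path = find_path(A, ids.index("RWP"), ids.index("CWP"))
print([ids[i] for i in path])  # ['RWP', 'DW', 'SP', 'SFP', 'CWP']
```

The returned component list is what the engine would use to retrieve the matching Digital Twin instances from the component library and bind their interfaces in order.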
The final system-level integration generates the complete system-level Digital Twin by reading the heterogeneous adjacency matrix formed through the superposition of the four isomorphic subgraphs.
The integrated system is shown in
Figure 10, with a functional menu area at the top of the page supporting the switching and viewing of different components. The left side of the interface is the 3D visualization area for the digital twin, displaying, from left to right, a clear water tank, a siphon filter, a sedimentation tank, a water distribution chamber, a chemical dosing device, and a rainwater collection tank. Blue pipelines represent the water flow lines, yellow pipelines represent the chemical dosing lines, and red pipelines represent the sewage return lines. The right side of the interface is a component information display area, which can present the corresponding dataset information for each component.
4.3. Model Reusability Analysis
The reusability of this method rests on the independence and completeness of the component-level Digital Twins. Pipeline 1, constructed in the preceding example, encapsulates the component's basic attribute, physical parameter, structural material, and business parameter information. Constructing a new pipeline only requires updating the corresponding fields of Pipeline 1: apart from main parameters such as pipeline length, inner diameter, and shape, most other attributes, such as the material MAT, the manufacturing process MFG, and the interface type TC.connectionCompatibility, can be reused directly without change.
The modified JSON-LD file is then imported into the new project's Digital Twin construction platform, a new adjacency matrix is generated to define the topological connections between this pipeline and the other components of the new system, the pipeline's interfaces are bound to the new components, and the Digital Twin encapsulation is completed.
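The reuse step can be sketched as a copy-and-update of the Pipeline 1 information set. The field values (length, diameter, material) are illustrative placeholders, not data from the case study; only the update pattern reflects the method described above.

```python
import copy
import json

# Pipeline 1's information set (abridged; all values are illustrative).
pipe1 = {
    "@context": "https://example.org/dt-context/water-treatment/v1",
    "@type": "Pipe",
    "@id": "WT_Plant_01::Pipe_01",
    "basicAttributes": {"G": {"length_m": 12.0, "innerDiameter_mm": 100}},
    "structuralMaterial": {"MAT": {"material": "ductile iron"}},
}

# Reuse: deep-copy the component, then update only the identifier and
# the main geometric parameters; MAT and the other encapsulated
# attributes carry over unchanged.
pipe_new = copy.deepcopy(pipe1)
pipe_new["@id"] = "WT_Plant_02::Pipe_01"
pipe_new["basicAttributes"]["G"].update({"length_m": 8.5,
                                         "innerDiameter_mm": 150})

print(json.dumps(pipe_new, indent=2))
```

The deep copy keeps the original Pipeline 1 model untouched in the component library, so both instances can coexist in different systems.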
The system-level digital twin constructed based on the aforementioned component reuse and system integration methods incorporates eight water flow pipelines, one clear water tank, one siphon filter, and one sedimentation tank. The integration results are shown in
Figure 11 and
Figure 12.
By reusing components with complete functionality and well-encapsulated data, integration complexity and error risk are significantly reduced, redundant modeling is avoided, and substantial time and labor costs are saved, supporting engineering optimization.
The reusability of this method is not limited to the water treatment industry. The siphon filter unit is essentially a gravity-based filtration and backwashing mechanism, whose core functionality holds a universal reference value for equipment with similar filtration-separation functions in industries such as chemical, petroleum, and food processing. By establishing higher-level cross-industry ontologies to map the information set’s @context, components constructed using this method can potentially find reuse scenarios across different domains, providing a viable technical pathway towards achieving a truly universal cross-industry modeling system.
4.4. Topology Connection Method Comparison
To validate the engineering practicality and efficiency advantages of adjacency matrix-based topology connections over those generated by traversing information set data, a comparative experiment was designed. The experiment simulated variations in the scale of digital twin system components, using digital twin generation time as the evaluation metric to quantitatively assess efficiency differences between the two methods and identify their respective applicable scenarios.
The experiment was conducted in a Python 3.10.8 environment, primarily using the random and numpy packages to simulate the generation of component information sets and to construct and manipulate adjacency matrices, respectively. First, a component information set containing attributes such as component ID, geometric parameters, and upstream/downstream topological relationships was generated. Subsequently, two topology connection generation methods were implemented: one directly traverses the component information set to establish connection relationships, while the other first constructs an adjacency matrix and then establishes connections based on the matrix. To ensure accurate measurement of generation efficiency, both methods used the time library to record timestamps at the start and end of function execution, with the time difference representing the actual time required for topology connection generation.
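The skeleton of this comparative experiment can be sketched as follows. The synthetic chain topology, function names, and repeat count are illustrative stand-ins for the paper's setup; only the two contrasted strategies (direct traversal versus matrix-then-connect) and the timing scheme mirror the description above.

```python
import time

import numpy as np

def make_components(n):
    """Generate n synthetic component records in a simple chain topology."""
    return [{"id": i, "down": [i + 1] if i < n - 1 else []} for i in range(n)]

def connect_direct(components):
    """Direct traversal: read each record and emit its connections."""
    return [(c["id"], d) for c in components for d in c["down"]]

def connect_matrix(components):
    """Adjacency-matrix route: build the matrix first, then read the
    connections off it."""
    n = len(components)
    A = np.zeros((n, n), dtype=np.int8)
    for c in components:
        for d in c["down"]:
            A[c["id"], d] = 1
    rows, cols = np.nonzero(A)
    return list(zip(rows.tolist(), cols.tolist()))

def mean_time(fn, components, repeats=100):
    """Mean wall-clock time over repeated runs, as in the experiment."""
    t0 = time.perf_counter()
    for _ in range(repeats):
        fn(components)
    return (time.perf_counter() - t0) / repeats

comps = make_components(50)
assert connect_direct(comps) == connect_matrix(comps)  # same edge set
print(mean_time(connect_direct, comps), mean_time(connect_matrix, comps))
```

time.perf_counter is used here instead of raw timestamps from the time library's lower-resolution clocks, since it is the standard choice for short-interval benchmarking; the averaging over repeats matches the paper's mitigation of memory-allocation jitter.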
In this experiment, the number of system components was gradually increased from 10 to 100 in increments of 1, simulating the progressive expansion of component scale in a water purification system and enabling comparison of the efficiency of the two topology connection generation methods. To minimize fluctuations caused by non-theoretical factors such as memory allocation delays, 100 repeated tests were conducted for each component scale, and the mean time consumption was used as the final result. The results showed that when the number of components ranged from 10 to 28, the direct information set traversal method was, on average, 26.63% faster than the adjacency matrix method. When the number of components increased to 29–47, the speeds of the two methods converged (
Figure 13). Beyond 47 components, the adjacency matrix-based method surpassed the direct traversal method in efficiency, averaging 18.61% faster. As the component scale continued to expand beyond 300 (
Figure 14), the speed advantage of the adjacency matrix-based method became increasingly pronounced, averaging 44.69% faster than the direct information set traversal method.
4.5. Assessment of Resource Utilization
To further validate the engineering efficiency of the adjacency matrix-based (AM) method versus the direct information set traversal (DIT) method for digital twin generation, we conducted a comparative experiment on resource utilization.
We selected three metrics: CPU utilization, memory consumption (excluding cache), and GPU utilization, and recorded them using the built-in Windows Task Manager.
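As a programmatic stand-in for the Task Manager readings, memory behavior can also be observed with the standard library's tracemalloc module, which modern NumPy registers its data allocations with. This is a stdlib-only sketch of the measurement idea, not the instrumentation used in the experiment.

```python
import tracemalloc

import numpy as np

def peak_memory_bytes(fn, *args):
    """Peak Python-level allocation while fn runs: a stdlib stand-in for
    the Task Manager memory readings used in the experiment."""
    tracemalloc.start()
    fn(*args)
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return peak

def build_matrix(n):
    # Dense n x n adjacency matrix, as in the AM method.
    return np.zeros((n, n), dtype=np.float64)

small = peak_memory_bytes(build_matrix, 10)
large = peak_memory_bytes(build_matrix, 300)
print(small, large)  # the dense-matrix footprint grows quadratically with n
```

This also makes the AM method's fixed cost visible: the dense matrix consumes memory proportional to n², which is why DIT is cheaper at small component counts.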
To ensure consistency, the experiment was conducted in the same hardware and software environment as
Section 4. For each scenario, 100 repeated tests were performed to eliminate random fluctuations, and the mean value was used as the final result. The resource utilization results across scenarios are summarized in
Table 2.
As shown in
Table 2, when the number of components is fewer than 28, the DIT method reduces CPU utilization by an average of 20.1% and memory consumption by an average of 15.8% compared to the AM method. This is because AM requires additional overhead for matrix initialization and storage, whereas DIT directly parses relationships without the need for intermediate data structures.
When the component count ranges from 29 to 47, the differences in CPU and memory usage between the two methods begin to diminish.
When the number of components exceeds 47, the AM method reduces CPU utilization by an average of 28.9% and memory consumption by an average of 11.3%. The primary reason is that the O(n²) complexity of DIT leads to a rapid, quadratic rise in both CPU occupancy and memory consumption as the component count grows.
Since the 3D visualization workload is minimal for both methods, their GPU utilization rates are nearly identical.
Therefore, it can be concluded that the AM method demonstrates superior resource efficiency in large-scale digital twin systems. This finding is consistent with the topology connection efficiency results presented in
Section 4.4, confirming the suitability of the AM method for practical engineering applications.
5. Discussion
This study proposed a novel Digital Twin construction approach to address the issues of low reusability and poor cross-industry generality in Digital Twin models. The proposed method first divides the system into subsystems and components based on the degree of functional aggregation and physical coupling relationships, achieves topological integration of component-level Digital Twins through constructing adjacency matrices, and ultimately generates system-level Digital Twins. The main contributions of this study are summarized below.
A three-level hierarchical architecture of system, subsystem, and component levels was constructed. Through stepwise refinement, cross-scale, highly coupled complex systems are transformed into multiple independently implementable simple Digital Twin components, while the integration and coordination of multilevel Digital Twins are achieved via unified interface specifications and information set models.
An information set model and graph theory methods were introduced to achieve topological integration and system encapsulation of component-level Digital Twins. The information set model integrates the component’s basic attribute information, structural material information, physical parameter information, and business parameter information, thereby achieving a unified representation and semantic integration of multi-source heterogeneous data. A system topological network was constructed with components as nodes and connection relationships as edges, generating subsystem and system-level Digital Twins based on adjacency matrices corresponding to isomorphic subgraphs and heterogeneous graphs, respectively, and achieving automated cross-level construction. Notably, the adjacency matrix-based method demonstrated high scalability and efficiency, generating topology connections 44.69% faster than the direct information set traversal method when the number of components exceeded 300.
Validation was performed by reading JSON-LD formatted information set data and adjacency matrices to achieve Digital Twin assembly from component to system level using a water purification system as a case study, and multiple scenario examples were constructed to verify the feasibility of the proposed method.
Integrating the contributions and case validations of this research, the study demonstrates that the proposed three-tier hierarchical architecture and adjacency matrix-based topology integration method effectively reduce the modeling complexity of intricate systems through component-based decomposition. By employing unified information sets and graph theory, the approach enables standardized digital twin construction while maintaining efficient topology generation even at large component scales, providing a viable technical pathway for the scalable application of digital twins.
However, this study has certain limitations. First, the proposed framework lacks a topology reconstruction mechanism for component failure or replacement; the adjacency matrix cannot automatically update connection relationships during faulty component replacement. This necessitates manual topology redefinition, which prolongs fault response cycles and fails to meet industrial real-time requirements. Second, the current framework does not address the need for real-time data updates or dynamic topology adjustments during digital twin operation. Although suitable for static modeling, its model iteration capability in dynamic scenarios requires further enhancement.
Despite these limitations, the study retains both theoretical and practical significance. Theoretically, the three-tier hierarchical architecture enriches the layered modeling theory of digital twins, and integrating information sets with graph theory provides a new paradigm for the semantic fusion of multi-source heterogeneous data. Practically, component-based reuse and standardized integration reduce modeling costs and timelines, deliver standardized solutions for process industries and energy systems, support large-scale deployment through efficient topology generation, and facilitate cross-industry scalable implementation of digital twin technology.
6. Conclusions
This study aims to address the long-standing limitations of Digital Twin (DT) technology, including the lack of a universal cross-industry modeling framework, low component reusability, and high development costs, which have hindered the application of system-level digital twins. To tackle these issues, a hierarchical decoupling and graph theory-based topological connection method for digital twin modeling is proposed and validated using a water purification system.
First, a system is decomposed top–down based on functional aggregation and physical coupling to establish a three-tier hierarchical architecture consisting of System, Subsystem, and Component. This architecture decomposes complex systems into independent, reusable digital twin components, reducing modeling complexity while enabling standardized integration through a unified interface specification.
Second, a component-level digital twin construction method based on a Standardized Information Set (SIS) is developed. The SIS encompasses basic attribute information, physical parameter information, structural material information, and business parameter information, and employs JSON-LD to achieve cross-platform semantic consistency, realizing a data-driven DT generation mechanism that significantly enhances component reusability.
Finally, a topological integration mechanism based on an adjacency matrix is proposed for the bottom–up assembly of system-level digital twins. Experimental results show that when the number of components exceeds 300, the topological connection generation speed is 44.69% faster than direct information set traversal; for systems with more than 47 components, the average CPU utilization and memory consumption are reduced by 28.9% and 11.3%, respectively. The water purification system case further validates that this method enables accurate and standardized system-level digital twin assembly, shortening the modeling cycle.
Theoretically, this study enriches digital twin modeling theory and provides a new paradigm for multi-source heterogeneous data fusion. Practically, it offers a cost-effective and scalable approach for digital twin and model construction in industrial systems.