Next Article in Journal
Long-Term Water Level Projections for Lake Balkhash Using Scenario-Based Water Balance Modeling Under Climate and Socioeconomic Uncertainties
Previous Article in Journal
Correlating Parameters Evaluating Sludge Dewaterability and Morphological Characteristics of Sludge Flocs by a Commercial Smartphone and Image Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research on Construction and Application of Water Processes Based on Knowledge Graph: Analysis of Dynamic Paths and Impact Factors

1
College of Computer Science and Software Engineering, Hohai University, Nanjing 211100, China
2
College of Hydrology and Water Resources, Hohai University, Nanjing 210098, China
*
Author to whom correspondence should be addressed.
Water 2025, 17(13), 2020; https://doi.org/10.3390/w17132020
Submission received: 21 May 2025 / Revised: 25 June 2025 / Accepted: 3 July 2025 / Published: 4 July 2025
(This article belongs to the Section Hydrology)

Abstract

The water process refers to the movement and changes in water on Earth, encompassing changes among its three states and its spatial movement. This process is vital for human society as it directly influences water resources, environmental sustainability, and climate regulation. Previous studies have used various related factors to analyze the water process but have not explained the rationale behind selecting these factors from the perspective of pathways. Based on this, the paper explores the construction and application of a top-down water process knowledge graph to clarify the changing process of water movement and the sources of influencing factors. Firstly, we define the concept of the water process and classify its entities based on the concept of water boundaries. Secondly, we identify key knowledge components of the water process, including water bodies, processes, and influencing factors. Finally, we construct and analyze a knowledge graph of the water process and its influencing factors. Results show that (1) the paths of water process help us understand the movement and change process of the water bodies; (2) the number of paths increases with the length of the connection between entities, reflecting the complexity of water process relationships; and (3) tracing these pathways can help identify their influencing factors, providing a data foundation for applying deep learning algorithms in water process research.

1. Introduction

Water is not only the foundation of survival and civilization but also the important support and basic guarantee of economic and social development. The water process is the movement and change process of the water body on the earth, which mainly includes the three-state change in the water body and its position movement. With the development of digital society and the rapid progress of artificial intelligence technology, more and more studies began to pay attention to the water process. This is because the study of the water process can not only more effectively manage water resources but also better protect the water environment and predict and prevent natural disasters such as floods and droughts.
At present, the water process research can be mainly divided into two types, including process-based models and data-driven models. The process-based models are considered the powerful tools to study the water process, with the capability of simulating both spatial and temporal distribution patterns based on coupled physical process and the interactions between different state variables [1]. However, they require large computational resources to compute the complex interactions between state variables in the different processes [2], and the previous assumptions no longer meet the current demand. By contrast, data-driven models only depend upon historical hydro-meteorological data without directly considering underlying physical processes and, thus, entailing much less input and parameter data [3] and have been widely used in the water process. These methods include multiple linear regression (MLR) [4], back propagation neural network (BPNN) [5], multilayer perceptron (MLP) [6], support vector regression (SVR) [7,8], and extreme learning machine (ELM) [9]. However, they belong to the “shallow” learning category, in which the instinct information represents insufficiently.
To overcome the problem, deep learning techniques, which can gain better forecasting performance owing to its “deeper” representations, are attracted much attention in hydrological prediction [3]. For example, Yue et al. [10] adopted the deep belief network (DBN)with partial least-squares regression (PLSR) and used key factors obtained by the partial mutual information (PMI) as its input for mid-to long-term runoff prediction. Gao et al. [11] applied three models including LSTM, Gated Recurrent Unit (GRU), and artificial neural network to simulate runoff in the Yutan station control catchment, and showed that LSTM and GRU have better performance. Lv et al. [12] proposed a long Short-Term memory cyclic model with mutual information for forecasting flood flow of Xixian Basin, China. Hu et al. [13] proposed a method based on a convolutional neural network-bidirectional long short-term memory-difference analysis (CNN-BiLSTM-DA) model for water level prediction analysis and flood warning. Even though, due to the features of deep learning, e.g., high accuracy, high efficiency, low cost, and low barriers, the above-mentioned models offer not only highly accurate results but orders-of-magnitude lower cost in terms of both model preparation/validation and run-time computation efforts; however, it is difficult to provide an easy-to-understand explanation for its results [14]. In addition, there is no physical water routing process being formulated in above-mentioned models, which leads to insufficient information describing the water process and affects the accuracy of the model [15,16]. Thus, to make the model explainable, this paper studies water process based on physical mechanism. On this basis, all the data describing the water process are given to obtain sufficient information.
Knowledge graph (KG) is a graph-based data structure used to describe concepts and their relationships in the physical world. It can mine entity relationships from vast, fragmented information and present them as structured semantic networks [17]. At present, it has been widely used in various fields, such as in the field of fault diagnosis, Tang et al. [18] proposed a new intelligent auxiliary diagnosis method based on knowledge graph which helps grass-roots maintenance engineers to quickly and accurately locate the fault unit of aircraft. In the field of biomedicine, Lin et al. [19] proposed a KG-based neural network framework to solve the problem of drug–drug interaction prediction. Zeng et al. [20] summarized knowledge graph-based works that implement drug repurposing and adverse drug reaction prediction for drug discovery. In addition, there are a few studies in the field of water conservancy. Feng et al. [21] proposed a knowledge graph construction method for water conservancy object data, aiming at the problem that it is difficult for water conservancy industry to use the relationship between objects for information retrieval based on keyword search technology. Duan et al. [22] put forward the description method of water conservancy comprehensive knowledge system, constructed the construction framework and key technology system of water conservancy knowledge graph, and realized the cross-domain query and retrieval of water conservancy knowledge query the relationship between discipline graph and water network graph, and mine the implicit relationship between different water conservancy entities. Rondon Diaz and Vilches-Blazquez [23] characterized water quality datasets through multi-dimensional knowledge graphs for bridging data silos. However, to our knowledge, there are no studies that apply knowledge graph to water motion change. Hence, to obtain sufficient information, it is necessary to carry out the research based on the knowledge graph of water process.
The important point for solving the above problems is to determine the water body and its relations that describe the water process. For this reason, based on the top-down scheme [24], this paper firstly gives the relevant definition of the effluent process and divides it according to the different boundaries of the carrying water body. Secondly, the knowledge of water process is extracted, including water body, process and influence factors. Then, the knowledge graph of water process and the knowledge graph of influence factors of water process are constructed. Finally, the path and influence factors of water process are analyzed. This can further provide new insights into the understanding of water processes.

2. Methods

2.1. Problem Formalization

For the convenience of the study, this paper makes some hypotheses and gives some definitions. The specific contents are as follows:
Suppose that the water body is enclosed in a space, and it can undergo three-state change by itself, but it will not disappear. In addition, the water process mentioned in this paper is assumed to be continuous.
Definition 1.
The water process is the movement and change process of the water body on the earth. Among them, motion refers to the position change of water body, such as water body from point a to point b; change refers to the three-state transformation of water body, such as the transformation of water body from solid to liquid.
Definition 2.
The water body is a water structure composed of unit water in a certain range, which is composed of gas–liquid-solid single state or multiple states coexisting unit water. In addition, due to the different boundaries of the carrying water bodies, various water bodies such as rivers and lakes have been formed. This will prepare for the subsequent division of the water body.
Definition 3.
The water process KG can be represented as WP = (P, E) where P is the set of water process entities, and E is the set of their relations.
Definition 4.
The water process path refers to the spatial trajectory left by the change of the position of the water body. A path M 1 , L defines a sequence of entity types and relationship types. Taking M 1 , L = A 1 R 1 A 2 R 2 R L 1 A L as an example, it indicates that there is a compound relationship ( R 1 ,   R 2 ,   ,   R L 1 ) between entity type A 1 and A L . Where A i   ( i = 1 ,   2 ,   ,   L ) is the entity and R j   ( j = 1 ,   2 ,   ,   L 1 ) is the relationship [25].
Definition 5.
The water process elements are the factors that affect the movement of water bodies.

2.2. Overall Structure

This paper elaborated the relevant definition of water process and the architecture of knowledge graph, as well as its application in water process. Figure 1 shows the overall structure and includes four compounds: water process knowledge, related definitions of water process and their relationships, water process knowledge extraction and fusion, and knowledge graph in water process and analysis.

2.2.1. Water Process Knowledge

For the construction of water process KG, this paper gives some definitions of water process (as shown in Section “Problem formalization”) and selects three categories of texts, including literature data, network data, and book data, which are collected. Considering the reliability and stability of knowledge, network data mainly includes some encyclopedia web pages whose contents are more reliable and stable; book data includes the authoritative books in the field of hydrology, such as Handbook of Hydrology [26]. All network data and literature data are chosen based on Chinese and English keywords about the water process; for example, they describe the three-state changes in water body, such as “evaporation”, “sublimation”, etc.; the movement of water body, such as “recharge”, etc.; and the factors that affect the water process, such as “evaporation factors”, etc.

2.2.2. Water Process Knowledge Extraction

Water process knowledge extraction is a process of identifying entity and relationship types from both structured and unstructured water process documents. In the construction of water process KG, we extract the carrier of water body as entities, such as rivers, lakes, etc., and extract the movement and change process of water bodies as relations, such as evaporation, infiltration, condensation, etc., and extract the factors that affect the movement of water bodies as attributes, such as temperature, air pressure and so on. In addition, according to the difference in water boundary and conceptual semantic relationship [27], this paper classifies the entities in the water process.

2.2.3. Knowledge Graph in Water Process and Analysis

As we are dealing with entities and their relationships, it only makes sense to store the results in a graph database. Graph databases are used as a warehouse to store KGs in terms of nodes, edges, and its properties of graphs [28]. In this paper, we use the Neo4j diagram database to describe the water process. The Neo4j diagram database is an opensource graph database with high performance, high reliability, and expandability. It contains two basic data types: nodes and edges, and both nodes and edges can contain various forms of attributes. Among them, the node is equivalent to the entity in the water process and is the vertex in the graph; the edge is equivalent to the relationship between the two entities in the water process and is the edge in the graph; Attributes are the characteristics of nodes and relationships, that is, elements that describe water processes, which are stored in neo4j in the form of key-value pairs. Moreover, we use the triplets of <entity-relation-entity> to form a semantic network to represent the water process.
In addition, based on graph theory search algorithm, Neo4j graph database provides a set of descriptive Cypher query language, which mainly includes three parts: MATCH sentence to realize graph matching, WHERE sentence to realize filter condition, and RETURN sentence to realize result return. Among them, path search refers to traversing along a specific edge type from a specified node and returns the terminating nodes and edges that can be reached. The query is translated into Cypher as “MATCH Paths = (p1: Entity {name: ‘Head Entity’})-[r*n]-> (p2: Entity {name: ‘Tail Entity’}) RETURN Paths”, where n represents the length between two entities, which will help us to study the water process path.
The application of knowledge graph in water process is mainly in two aspects: (1) water process path. The knowledge graph clearly shows the movement and change process of the water body and can provide the path between any two water bodies, which will help to find the source that affects the water quality. (2) water process element. Combined with the water process path, we will obtain all the influencing factors of the water process, which will provide a data basis for using artificial intelligence to study the water process.

3. Results

To achieve this work, this study employed graph database technologies using Neo4j (version 5.15) and Python 3.11 (with Scrapy 2.11 and Py2neo 2021.2.3) as the computational platform. The hardware environment consisted of a Windows 11 workstation equipped with an AMD Ryzen ™ 9 7945HX processor (5.4 GHz boost) and 32 GB RAM, Santa Clara, CA, USA. In addition, during the data-acquisition phase, the data collected in this study primarily draws from Handbook of Hydrology. Through systematic knowledge integration, we have constructed an extensible framework.

3.1. The Water Process KG

Figure 2 shows the water process KG, where arrows of different colors represent that two water bodies are connected in different states. For example, the purple arrow indicates that two water bodies are connected in a solid state. And Table 1 shows the results of the classification of water process entities according to conceptual semantic relations [27], which include “Water vapor”, “Water cloud”, “Mixed cloud”, “Halicloud”, “Rain”, “Snow” and other entities. Table 2 illustrates the definitions of corresponding entity types and relationships.

3.2. Water Process Path

Through the above analysis, we already know that the water process is very complicated. Therefore, to make clear the water source and its movement change process, we use the path to represent it. Due to the limited space, this paper takes the head entity “Water vapor”(id = 1) and the tail entity “River” (id = 8) as the research object and finds the path with length 3 based on the query language “match Paths= (p1: Entity {name: ‘Water vapor’})-[r*3]-> (p2: Entity {name: ‘River’}) return Paths”. The results are as follows:
M 1,8 1 = W a t e r   v a p o r C o n d e n s a t i o n W a t e r   c l o u d C o n d e n s a t i o n _ c o l l i s i o n _ g r o w t h _ L R a i n L a n d i n g _ L R i v e r ; M 1,8 2 = W a t e r   v a p o r C o n d e n s a t i o n W a t e r   c l o u d C o n d e n s a t i o n _ c o l l i s i o n _ g r o w t h _ L R a i n S u r f a c e r u n o f f R i v e r ; M 1,8 3 = W a t e r   v a p o r D e s u b l i m a t i o n M i x e d   c l o u d C o n d e n s a t i o n _ c o l l i s i o n _ g r o w t h _ S S n o w L a n d i n g _ S R i v e r ; M 1,8 4 = W a t e r   v a p o r D e s u b l i C o n d e n s a t i o n _ d e s u b l i m a t i o n H a l i c l o u d C o n d e n s a t i o n _ c o l l i s i o n _ g r o w t h _ S H a i l L a n d i n g _ S R i v e r .
Among them, M 1,8 i represents the i -th (here i = 4 ) path from “Water vapor” to “River”. The above examples of different paths (there are four.) between “Water vapor” and “River” show that it is feasible for scholars to construct input data from multiple perspectives for research. The KG of sub-water process connecting water vapor and river is shown in Figure 3. As can be seen from Figure 3, the water entering the river is in the form of “rain”, “snow”, and “hail”. If the problem of river water quality is studied, it can be seen from the above forms that the source is likely to be surface runoff. Therefore, it is particularly important to study the path of water process.
Additionally, to figure out what happens when the path length continues to increase, we increase the path (connecting “Water vapor” to the “River”) length to 5, and the result is shown in Figure 4. Comparing Figure 3 and Figure 4, we can see that with the increase in the length of the connection between “Water vapor” and “River”, the more entities and relationships appear, the more the number of paths. And, to increase the persuasion of the results, we increase the path length of connecting different head entities and tail entities from 1 to 9, and count the number of paths, as shown in Table 3 and Figure 5. As can be seen from Table 3 and Figure 5, the number of paths also increases as the path length between the two entities increases. In addition, when the path length exceeds 6, the number of paths from the head entity to the tail entity increases rapidly.

3.3. Water Process Elements

Due to the limited space, this section takes
M 8,8 = River E v a p o r a t i o n s s Water   vapor C o n d e n s a t i o n W a t e r   c l o u d C o n d e n s a t i o n _ c o l l i s i o n _ g r o w t h _ L Rain L a n d i n g _ L River
a path from the “River” to “River” as an example, to give the influencing factors of the effluent process. To show the relationship between elements and water process, this section defines water body, process and influence factors as entities, between entities and processes, between entities and influence factors, and between processes and influence factors as relationships. Thus, the path M 8,8 becomes M 8,8 , and the specific M 8,8 is
M 8,8 = River P r o c e s s E v a p o r a t i o n P r o c e s s Water   vapor P r o c e s s Condensation P r o c e s s W a t e r   c l o u d P r o c e s s Condensation _ collision _ growth _ L P r o c e s s Rain P r o c e s s Landing _ L P r o c e s s River
Figure 6 shows the KG of water process with influence factors, in which “blue” represents the influence factors of entity and process, “green” is liquid water body, “red” is gaseous water body, and “curry” color is the movement and change process of water body. From Figure 6, we can see that it takes 25 elements, 4 processes, and 4 different bodies of water to start from the river and then return to the river. This means that to calculate the amount of rainfall that falls into a river in the form of rain, it requires at least 25 input factors, four processes and four different carriers. This also indicates that the water process can be regarded as the propagation chain of the water body, and if one of the processes is interrupted or some influencing factor cannot reach the value of the change, it will lead to the whole transmission. In such cases, it is necessary to study every component of the communication chain. Therefore, here we assume that the process of water is continuous. In addition, we found that processes and entities have the same influence factors, such as the “water area” is not only the factor (attribute) of the river, but also the factor affecting evaporation. This is very critical for us to study the location of the input factors later.

4. Discussion

We have constructed a knowledge graph of water processes and analyzed the paths between any two water bodies and the elements driving the movement and change in water bodies. Several studies in the literature analyze related physical water processes using knowledge graphs [21,22,23]. Their core similarity is that they both study water by using the method of knowledge graphs, but differ in scope: Literatures [21,22] mainly study the construction of the knowledge graph of water conservancy information; Literature [23] primarily studies water quality through knowledge graphs, which is a relatively specific water process. This paper investigates the dynamic processes of water bodies across different states, including: (1) transition paths between any two water bodies; (2) the elements that can drive the movement and change in water bodies based on these paths.
In addition, to demonstrate the practical applicability of our framework to water processes, we apply the concept-instance relational principle from Reference [21]—which states that relations between conceptual entities imply analogous instance-level linkages. Focusing on the Qinghai Lake Basin as a specific watershed, we construct a knowledge graph for the Qinghai Lake Basin. The Qinghai Lake Basin, also known as the Qinghai Lake Basin, is an independent and complete physical geography unit. Its geographical coordinates range from 97°4′55.70″ E to 101°11′24.88″ E and 36°17′43.58″ N to 38°19′16.20″N, with elevations spanning 3195 to 5291 m [29].
Firstly, within the Qinghai Lake Basin, conceptual entities were mapped to instance entities. Due to space limitations, the instance entities for the Qinghai Lake Basin were primarily referenced from Literature [29], with the mapping results presented in Table 4. As shown in Table 4, marine entities (id = 12) are not involved here, as there are no oceans within the Qinghai Lake Basin.
Secondly, based on the conceptual entity relationships in Table 2 and the instance entities in Table 4, we constructed the Qinghai Lake Basin knowledge graph using Neo4j and Python. The visualization employs a color-coded schema where each conceptual entity category shares a unified hue. As shown in Figure 7: all lake instances (Qinghai Lake, Erhai Lake, and Gahai Lake) are rendered in #0000FF (blue), while wetland instances use #FF00FF (magenta). This approach enables intuitive recognition of the 22 entity types within the knowledge graph.

5. Conclusions

The water process plays an important role in obtaining influence factors and finding the source of water. Previous studies have used related factors to calculate and analyze water process but overlook the sources of factors affecting water process. Thus, we explored the construction and application of the top-down water process knowledge graph to clarify the changing process of water movement and the sources of influencing factors. The build process consists of four steps: (1) defining and classifying water bodies of water process; (2) identifying the knowledge of water process; (3) constructing the knowledge graph of water process and water process factors; (4) and analyzing the results.
The findings are summarized as follows: by comparing the entities and relationships involved in the water process, it is known that the water process is more complex. The path of water process and its influencing factors are extracted and analyzed. When the length of the connection between the two nodes is longer, the more paths of the water process, the more factors that affect the water process. Moreover, there are intersections between different paths, mainly because there are similarities in some parts of the water process. This may also be the basis for scholars to build model input from different angles. Furthermore, combined with the knowledge graph of influence factors of water process, we can not only clearly know the source of water process, but also know how many input factors are needed to calculate a certain amount. This will provide a data basis for scholars to adopt deep learning algorithms to study water processes.
We acknowledge that certain water processes were not explicitly considered in this study, such as those processes at different latitudes, basins, and a given basin. Moreover, quantification of those processes as well as scenarios (for example, how will the water process take place if the air temperature rises?) also were not taken into consideration. To address these limitations, future work will (1) establish composite entity concepts (e.g., watershed entities encapsulating vegetation, river, soil) and construct their knowledge graphs; (2) implement spatiotemporal graph neural networks (STGNNs) to quantify and analyze process dynamics based on spatial topologies and attribute evolution.

Author Contributions

Y.S.: Writing—Original draft, Methodology, Formal analysis. P.A.: Conceptualization, Writing—Reviewing, Funding acquisition. C.X.: Methodology and Data curation. J.L.: Resources, Data curation. S.G.: Writing—Review, Data curation. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by ‘the Key Research and Development Project of Jiangsu Province’ (Grant No. BE2020729).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Tong, X.; You, L.; Zhang, J.; Chen, H.; Nguyen, V.T.; He, Y.; Gin, K.Y.H. A comprehensive modelling approach to understanding the fate, transport and potential risks of emerging contaminants in a tropical reservoir. Water Res. 2021, 200, 117298. [Google Scholar] [CrossRef]
  2. García-Alba, J.; Bárcena, J.F.; Ugarteburu, C.; García, A. Artificial neural networks as emulators of process-based models to analyse bathing water quality in estuaries. Water Res. 2019, 150, 283–295. [Google Scholar] [CrossRef]
  3. Bai, Y.; Chen, Z.; Xie, J.; Li, C. Daily reservoir inflow forecasting using multiscale deep feature learning with hybrid models. J. Hydrol. 2016, 532, 193–206. [Google Scholar] [CrossRef]
  4. Salim, I.; Sajjad, R.U.; Paule-Mercado, M.C.; Memon, S.A.; Lee, B.-Y.; Sukhbaatar, C.; Lee, C.-H. Comparison of two receptor models PCA-MLR and PMF for source identification and apportionment of pollution carried by runoff from catchment and sub-watershed areas with mixed land cover in South Korea. Sci. Total. Environ. 2019, 663, 764–775. [Google Scholar] [CrossRef] [PubMed]
  5. Tan, Q.-F.; Lei, X.-H.; Wang, X.; Wang, H.; Wen, X.; Ji, Y.; Kang, A.-Q. An adaptive middle and long-term runoff forecast model using EEMD-ANN hybrid approach. J. Hydrol. 2018, 567, 767–780. [Google Scholar] [CrossRef]
  6. Boucher, M.; Quilty, J.; Adamowski, J. Data Assimilation for Streamflow Forecasting Using Extreme Learning Machines and Multilayer Perceptrons. Water Resour. Res. 2020, 56, e2019WR026226. [Google Scholar] [CrossRef]
  7. Sahoo, B.B.; Jha, R.; Singh, A.; Kumar, D. Application of Support Vector Regression for Modeling Low Flow Time Series. KSCE J. Civ. Eng. 2018, 23, 923–934. [Google Scholar] [CrossRef]
  8. Samantaray, S.; Das, S.S.; Sahoo, A.; Satapathy, D.P. Monthly runoff prediction at Baitarani river basin by support vector machine based on Salp swarm algorithm. Ain Shams Eng. J. 2022, 13, 101732. [Google Scholar] [CrossRef]
  9. Yaseen, Z.M.; Sulaiman, S.O.; Deo, R.C.; Chau, K.-W. An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. J. Hydrol. 2019, 569, 387–408. [Google Scholar] [CrossRef]
  10. Yue, Z.; Ai, P.; Xiong, C.; Hong, M.; Song, Y. Mid- to long-term runoff prediction by combining the deep belief network and partial least-squares regression. J. Hydroinform. 2020, 22, 1283–1305. [Google Scholar] [CrossRef]
  11. Gao, S.; Huang, Y.; Zhang, S.; Han, J.; Wang, G.; Zhang, M.; Lin, Q. Short-term runoff prediction with GRU and LSTM networks without requiring time step optimization during sample generation. J. Hydrol. 2020, 589, 125188. [Google Scholar] [CrossRef]
  12. Lv, N.; Liang, X.; Chen, C.; Zhou, Y.; Li, J.; Wei, H.; Wang, H. A long Short-Term memory cyclic model with mutual information for hydrology forecasting: A Case study in the xixian basin. Adv. Water Resour. 2020, 141, 103622. [Google Scholar] [CrossRef]
  13. Hu, C.; Zhou, L.; Gong, Y.; Li, Y.; Deng, S. Research on Water Level Anomaly Data Alarm Based on CNN-BiLSTM-DA Model. Water 2023, 15, 1659. [Google Scholar] [CrossRef]
  14. Camps-Valls, G.; Tuia, D.; Zhu, X.X.; Reichstein, M. Deep Learning for the Earth Sciences; Applications of Deep Learning in Hydrology; John Wiley & Sons: Hoboken, NJ, USA, 2021; pp. 285–297. [Google Scholar] [CrossRef]
  15. Kim, T.; Yang, T.; Gao, S.; Zhang, L.; Ding, Z.; Wen, X.; Gourley, J.J.; Hong, Y. Can artificial intelligence and data-driven machine learning models match or even replace process-driven hydrologic models for streamflow simulation?: A case study of four watersheds with different hydro-climatic regions across the CONUS. J. Hydrol. 2021, 598, 126423. [Google Scholar] [CrossRef]
  16. Nearing, G.S.; Kratzert, F.; Sampson, A.K.; Pelissier, C.S.; Klotz, D.; Frame, J.M.; Prieto, C.; Gupta, H.V. What Role Does Hydrological Science Play in the Age of Machine Learning? Water Resour. Res. 2021, 57, e2020WR028091. [Google Scholar] [CrossRef]
  17. Tang, X.; Chi, G.; Cui, L.; Ip, A.W.H.; Yung, K.L.; Xie, X. Exploring Research on the Construction and Application of Knowledge Graphs for Aircraft Fault Diagnosis. Sensors 2023, 23, 5295. [Google Scholar] [CrossRef]
  18. Tang, X.; Hu, B.; Wang, J.; Wu, C.; Noman, S.M. Intelligent Auxiliary Fault Diagnosis for Aircraft Using Knowledge Graph. In Advanced Intelligent Technologies for Industry: Proceedings of 2nd International Conference on Advanced Intelligent Technologies (ICAIT 2021); Springer: Singapore, 2021; Volume 285, pp. 283–288. [Google Scholar] [CrossRef]
  19. Lin, X.; Quan, Z.; Wang, Z.J.; Ma, T.F.; Zeng, X.X. KGNN: Knowledge Graph Neural Network for Drug-Drug Interaction Prediction. IJCAI 2021, 380, 2739–2745. [Google Scholar]
  20. Zeng, X.; Tu, X.; Liu, Y.; Fu, X.; Su, Y. Toward better drug discovery with knowledge graph. Curr. Opin. Struct. Biol. 2022, 72, 114–126. [Google Scholar] [CrossRef]
  21. Feng, J.; Xu, X.; Lu, J. Construction and Application of Water Conservancy Information Knowledge Graph. Comput. Mod. 2019, 9, 35–40. [Google Scholar]
  22. Duan, H.; Han, K.; Zhao, H.; Jiang, Y.; Li, H.; Mao, W. Research on water conservancy comprehensive knowledge graph construction. J. Hydraul. Eng. 2021, 52, 948–958. [Google Scholar] [CrossRef]
  23. Díaz, J.D.R.; Vilches-Blázquez, L.M. Characterizing water quality datasets through multi-dimensional knowledge graphs: A case study of the Bogota river basin. J. Hydroinform. 2022, 24, 295–314. [Google Scholar] [CrossRef]
  24. Guo, R.; Yang, Q.; Liu, S.; Li, W.; Yuan, X.; Huang, X. Construction and Application of Power Grid Fault Handing Knowledge Graph. Power Syst. Technol. 2021, 45, 2092–2100. [Google Scholar] [CrossRef]
  25. Jin, Y. Research on TCM Prescription Recommendation Based on Graph Neural Network. Ph.D. Thesis, East China Normal University, Shanghai, China, 2022. [Google Scholar] [CrossRef]
  26. Maidment, D. Handbook of Hydrology; McGraw-Hill Companies: New York, NY, USA, 1993. [Google Scholar]
  27. Ling, Z.; Li, R.; Wu, H.; Li, J.; Gui, Z. Semantic-driven construction of geographic entity association network and knowledge service. Acta Geod. Cartogr. Sin. 2023, 52, 478–489. [Google Scholar] [CrossRef]
  28. Hailemichael, M.T.; Muhammad, U.H.; Bekele, M.W.; Nigus, M.G.; Dechasa, K.Y.; Mekbibu, A.M. Design and Implementation of Satellite Knowledge Graph and Its Application. In Proceedings of the 3rd Asia-Pacific Conference on Communications Technology and Computer Science (ACCTCS), Shenyang, China, 25–27 February 2023; pp. 240–245. [Google Scholar] [CrossRef]
  29. Chen, Z. Assessment of Hydrological Processes and Ecological Effects in the Qinghai Lake River Basin. Ph.D. Thesis, Qinghai Normal University, Xining, China, 2022. [Google Scholar] [CrossRef]
Figure 1. The overall structure.
Figure 1. The overall structure.
Water 17 02020 g001
Figure 2. The schema of water process KG.
Figure 2. The schema of water process KG.
Water 17 02020 g002
Figure 3. The sub-water process KG of path length 3 connecting “Water vapor” to “River”.
Figure 3. The sub-water process KG of path length 3 connecting “Water vapor” to “River”.
Water 17 02020 g003
Figure 4. The sub-water process KG of length 5 connecting “Water vapor” to “River”.
Figure 4. The sub-water process KG of length 5 connecting “Water vapor” to “River”.
Water 17 02020 g004
Figure 5. The relationship between the path length and the number of paths.
Figure 5. The relationship between the path length and the number of paths.
Water 17 02020 g005
Figure 6. The KG of water process with influence factors.
Figure 6. The KG of water process with influence factors.
Water 17 02020 g006
Figure 7. The water processes KG of the Qinghai Lake Basin.
Figure 7. The water processes KG of the Qinghai Lake Basin.
Water 17 02020 g007
Table 1. Division results of water process entities.
Table 1. Division results of water process entities.
No.Concept IConcept AConcept BConcept Entity
1Water bodySkyWater vaporWater vapor
2CloudWater cloud
3Mixed cloud
4Halicloud
5Falling waterRain
6Snow
7Hail
8SurfaceSurface waterRiver
9Lake
10Reservoir
11Swamp
12Ocean
13Glacier
14Perpetual snow
15BiowaterGreen plants
16Human water useAgricultural water
17Industrial water
18Domestic water
19UndergroundUnsaturated zoneSoil water zone
20Intermediate hose
21Capillary zone
22Saturated zonePhreatic water
23Confined water
Table 2. Definition of the entity relation type.
Table 2. Definition of the entity relation type.
Relation TypeConnotationHead EntityTail Entity
Condensation_collision_growthIndicating that water clouds form rain through condensation, collision and growth.Water cloudRain
Condensation_collision_growthIndicating that mixed clouds form snow through condensation, collision and growth.Mixed cloudSnow
Condensation_collision_growthIndicating that halicloud forms hail through condensation, collision and growth.HalicloudHail
Landing_LIndicating that the rain falls from the sky to the river.RainRiver
Landing_SIndicating that the snow falls from the sky to the river.SnowRiver
Landing_SIndicating that the hail falls from the sky to the river.HailRiver
SurfacerunoffIndicating that rain enters the river in the form of surface runoff.RainRiver
SurfacerunoffIndicating that rain enters the lake in the form of surface runoff.RainLake
Landing_LIndicating that the rain falls from the sky to the lake.RainLake
Landing_SIndicating that the snow falls from the sky to the lake.SnowLake
Landing_SIndicating that the hail falls from the sky to the lake.HailLake
Landing_LIndicating that the rain falls from the sky to the reservoir.RainReservoir
Landing_SIndicating that the snow falls from the sky to the reservoir.SnowReservoir
Landing_SIndicating that the hail falls from the sky to the reservoir.HailReservoir
Landing_LIndicating that the rain falls from the sky to the swamp.RainSwamp
Landing_SIndicating that the snow falls from the sky to the swamp.SnowSwamp
Landing_SIndicating that the hail falls from the sky to the swamp.HailSwamp
SurfacerunoffIndicating that rain enters the swamp in the form of surface runoff.RainSwamp
SurfacerunoffIndicating that rain enters the ocean in the form of surface runoff.RainOcean
Landing_LIndicating that the rain falls from the sky to the ocean.RainOcean
Landing_SIndicating that the snow falls from the sky to the ocean.SnowOcean
Landing_SIndicating that the hail falls from the sky to the ocean.HailOcean
Landing_LIndicating that the rain falls from the sky to the glacier.RainGlacier
Landing_SIndicating that the snow falls from the sky to the glacier.SnowGlacier
Landing_SIndicating that the hail falls from the sky to the glacier.HailGlacier
Landing_LIndicating that the rain falls from the sky to the perpetual snow.RainPerpetual snow
Landing_SIndicating that the snow falls from the sky to the perpetual snow.SnowPerpetual snow
Landing_SIndicating that the hail falls from the sky to the perpetual snow.HailPerpetual snow
InterceptionIndicating that the parts of trees and plants cut off by the rain before it hits the ground.RainGreen plants
InterceptionIndicating that the parts of trees and plants cut off by the snow before it hits the ground.SnowGreen plants
Diversion_irrigationIndicating that agriculture is irrigated with river water.RiverAgricultural water
Diversion_irrigationIndicating that agriculture is irrigated with reservoir water.ReservoirAgricultural water
Diversion_irrigationIndicating that agriculture is irrigated with lake water.LakeAgricultural water
Irrigation_reosmosisIndicating that agricultural water is called soil water through irrigation.Agricultural waterSoil water zone
Water_diversionIndicating that the river water is diverted to domestic use.RiverDomestic water
Water_diversionIndicating that the lake water is diverted to domestic water.LakeDomestic water
Water_diversionIndicating that the reservoir water is diverted to domestic use.ReservoirDomestic water
Processing_dischargeIndicating that domestic water is processed and discharged into the river.Domestic waterRiver
Water_diversionIndicating that the river water is used for production.RiverIndustrial water
Water_diversionIndicating that the reservoir water is used for production.ReservoirIndustrial water
Water_diversionIndicating that the lake water is used for production.LakeIndustrial water
Water_diversionIndicating that the ocean water is used for production.OceanIndustrial water
Processing_dischargeIndicating that industrial water is processed and discharged into the river.Industrial waterRiver
InterceptIndicating that the river has been intercepted and entered the reservoir.RiverReservoir
Flood-reliefIndicating that the reservoir water is discharged into the river through flood discharge.ReservoirRiver
InflowIndicating that the river flows into the lake.RiverLake
InflowIndicating that river water flows into the ocean.RiverOcean
SubsurfaceflowIndicating that soil water enters the river in the form of soil flow.Soil water zoneRiver
Stream_runoffIndicating that glaciers melt into water and flow into rivers.GlacierRiver
Stream_runoffIndicating that the snow melts into water and flows into the river.Perpetual snowRiver
InfiltrationIndicating that soil water can be changed into intermediate hose by infiltration.Soil water zoneIntermediate hose
InfiltrationIndicating that intermediate hose water can be changed into capillary zone water by infiltration.Intermediate hoseCapillary zone
InfiltrationIndicating that capillary zone can be changed into phreatic water by infiltration.Capillary zonePhreatic water
InfiltrationIndicating that phreatic water can be changed into confined water by infiltration.Phreatic waterConfined water
Shallow_subsurface_runoffIndicating that diving enters the river in the form of shallow subsurface runoff.Phreatic waterRiver
Deep_subsurface_runoffIndicating that the confined water enters the river in the form of deep underground runoff.Confined waterRiver
EvaporationIndicating that the river water changes from liquid to water vapor by evaporation.RiverWater vapor
EvaporationIndicating that the lake water changes from liquid to water vapor by evaporation.LakeWater vapor
EvaporationIndicating that the reservoir water changes from liquid to water vapor by evaporation.ReservoirWater vapor
EvaporationIndicating that the ocean water changes from liquid to water vapor by evaporation.OceanWater vapor
EvaporationIndicating that the soil water changes from liquid to water vapor by evaporation.Soil water zoneWater vapor
EvaporationIndicating that the swamp water changes from liquid to water vapor by evaporation.SwampWater vapor
TranspirationIndicating that green plants turn water into water vapor by transpiration.Green plantsWater vapor
SublimationIndicating that glaciers change solid water into vapor in the form of sublimation.GlacierWater vapor
SublimationIndicating that snow changes solid water into vapor in the form of sublimation.Perpetual snowWater vapor
CondensationIndicating that water vapor changes into water and clouds through condensation.Water vaporWater cloud
DesublimationIndicating that water vapor changes into mixed clouds through desublimation.Water vaporMixed cloud
Condensation_desublimationIndicating that water vapor turns into hail clouds by sublimation_condensation.Water vaporHalicloud
Table 3. Definition of the tertiary relation type.
Table 3. Definition of the tertiary relation type.
Head Entity (id)Water Vapor (1)Water Vapor (1)River (8)
Tail Entity (id)River (8)Ocean (12)Ocean (12)
Paths by length1 → 81 → 128 → 12
1001
2000
3443
4649
530624
6463048
7165103151
8348205355
91036467894
Table 4. The Mapping Relationship between Conceptual Entities and Instance Entities within the Qinghai Lake Basin.
Table 4. The Mapping Relationship between Conceptual Entities and Instance Entities within the Qinghai Lake Basin.
No.Concept Entity Instance EntityNo.Concept EntityInstance Entity
1Water vaporWater vapor14Perpetual snowZanbaohuaixiu Mountain Snowpack
2Water cloudWater cloudRiyue Mountain Snowpack
3Mixed cloudMixed cloud15Green plantsSplendid Achnatherum
4HalicloudHalicloudLeymus
5RainRainShort-flower Needlegrass
6SnowSnowPurple Needlegrass
7HailHailDrooping Wildrye
8RiverBuha River16Agricultural waterAgricultural Irrigation
Shaliu RiverLivestock Watering
Quanji River17Industrial waterAgro-livestock Product Processing
Haergai RiverClean Energy Facility Maintenance
Ganzi River18Domestic waterResidential Water Use
Daotang RiverTourism Service Water Use
Heima RiverPastoral Settlement Water Supply
9LakeQinghai Lake19Soil water zoneGelic Leptosols
Erhai LakeHistic Cambisols
Gahai LakeMollic Planosols
10ReservoirDongwei ReservoirCalcic Kastanozems
Heima River Regulating ReservoirGleysols
Tanggema Embankment DamArenosols
11SwampDaotang River WetlandChernozems
Nymph Bay WetlandKastanozems
Sand Island WetlandSolonchaks
Spring Bay Wetland20Intermediate hoseIntermediate hose
Satchel Lake Wetland21Capillary zoneCapillary zone
Hada Beach Wetland22Phreatic waterOverflow Spring
13GlacierShule South Mountain Glacier23Confined waterConfined water
Datong Mountain Glacier
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Song, Y.; Ai, P.; Xiong, C.; Li, J.; Gong, S. Research on Construction and Application of Water Processes Based on Knowledge Graph: Analysis of Dynamic Paths and Impact Factors. Water 2025, 17, 2020. https://doi.org/10.3390/w17132020

AMA Style

Song Y, Ai P, Xiong C, Li J, Gong S. Research on Construction and Application of Water Processes Based on Knowledge Graph: Analysis of Dynamic Paths and Impact Factors. Water. 2025; 17(13):2020. https://doi.org/10.3390/w17132020

Chicago/Turabian Style

Song, Yanhong, Ping Ai, Chuansheng Xiong, Jintao Li, and Shicheng Gong. 2025. "Research on Construction and Application of Water Processes Based on Knowledge Graph: Analysis of Dynamic Paths and Impact Factors" Water 17, no. 13: 2020. https://doi.org/10.3390/w17132020

APA Style

Song, Y., Ai, P., Xiong, C., Li, J., & Gong, S. (2025). Research on Construction and Application of Water Processes Based on Knowledge Graph: Analysis of Dynamic Paths and Impact Factors. Water, 17(13), 2020. https://doi.org/10.3390/w17132020

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop