PF2RM: A Power Fault Retrieval and Recommendation Model Based on Knowledge Graph

Liang, Kun; Zhou, Baoxian; Zhang, Yiying; Li, Yiping; Zhang, Bo; Zhang, Xiankun

doi:10.3390/en15051810

Open AccessArticle

PF2RM: A Power Fault Retrieval and Recommendation Model Based on Knowledge Graph

¹

College of Artificial Intelligence, Tianjin University of Science & Technology, Tianjin 300457, China

²

State Grid Smart Grid Research Institute Co., Ltd., Nanjing 210003, China

^*

Author to whom correspondence should be addressed.

Energies 2022, 15(5), 1810; https://doi.org/10.3390/en15051810

Submission received: 12 January 2022 / Revised: 13 February 2022 / Accepted: 22 February 2022 / Published: 1 March 2022

Download

Browse Figures

Versions Notes

Abstract

:

Energy saving and emission reduction have become common concerns in countries around the world. In China, with the implementation of the new strategy of “carbon peak and neutrality” and the rapid development of the new smart grid infrastructure, the amount of data of actual power grid dispatching and fault analysis show exponential growth, which has led to phenomena such as poor supervision effectiveness and difficulty in handling faults in the process of grid operation and maintenance. Existing research on retrieval recommendation methods has had a lower accuracy rate at cold-start due to a small sample of user interactions. In addition, the cumulative learning of user personalization during general retrieval results in a poor perception of potential interest. By constructing a power knowledge graph, this paper presents a power fault retrieval and recommendation model (PF2RM) based on user-polymorphic perception. This model includes two methods: the power fault retrieval method (PFR) and the user-polymorphic retrieval recommendation method (UPRR). First, we take the power grid fault dispatching business as the core and reconstruct the ontology layer of the power knowledge graph. The PFR method is used to design the graph-neighbor fault entity cluster to enhance the polymerization degree of a fault implementation scenario. This method can solve the search cold-start recommendation problem. At the same time, the UPRR method aims to form user retrieval subgraphs of the past-state and current-state and make a feature matching for the graph-neighbor fault entity cluster, and then realize the accurate prediction of the user’s general search intention. The model is compared with other current classical models through the evaluation of multiple recommendation evaluation metrics, and the experimental results show that the model has a 3–8% improvement in the cold-start recommendation effect and 2–10% improvement in regular retrieval. The model has the best average recommendation performance in multiple metrics and has good results in fault analysis and retrieval recommendation. It plays a helpful role in intelligent operation and maintenance of the power grid and auxiliary decision-making, and effectively improves the reliability of the power grid.

Keywords:

knowledge graph; fault research and judgment; search recommendation; smart grid

1. Introduction

As countries around the world gradually focus on energy conservation and green development, China has also put forward the environmental protection strategies of “carbon neutrality” and “carbon peaking”. The current policy of building smart grids is steadily advancing, which poses new challenges for the automated operation of grids and the immediate maintenance of equipment. With the deployment of large-scale power grid terminal equipment and the expansion of power equipment maintenance scenes, the problem of how to realize the daily maintenance of power equipment and accurate fault analysis needs an efficient solution. With the development of artificial intelligence, the recommendation algorithm plays a crucial role in avoiding information overload and efficient access to information, and has been popularized in various industrial applications such as platform promotion and e-commerce.

Some existing, more typical recommendation methods mostly use collaborative filtering-based user-personalized acquisition matching [1,2], which generally includes an embedding learning module and an interactive modeling module. The embedding learning maps users and items to vectorized representations, and interactive modeling is based on vectorized representations to match user–item interaction features [3,4,5]. However, this method has limitations because it relies more on historical data viewed by users, so there is a problem of scarcity of data samples in the case of cold-start. In addition, encoding the correlation between users and items directly into the embedded representation learning makes it difficult to ensure the interpretability of information dissemination. The existing solution is to learn user and item representations in isolation. However, this leads to a loss of relevance between users and items, which makes it difficult to recommend items to users that have not been viewed but may be of interest. Recently, under the trend of systematization and structurization of knowledge, the recommendation method of incorporating a knowledge graph has become a research hotspot [4,6,7,8]. This new recommendation model improves the above-mentioned limitations to a certain degree.

According to the feature acquisition and prediction algorithm, the knowledge recommendation models incorporating graphs/paths can be roughly categorized into the following three methods: method based on semantic embedding, method based on graph paths, and hybrid method of combining semantic embedding and graph paths. The semantic embedding-based recommendation approach aims to achieve knowledge representation through semantic mapping, followed by learning models to represent users’ preferences sequentially, and finally calculating feature offsets between users and items to achieve a recommendation prognosis [6,7,8,9,10]. Although this method greatly enhances the correlation between entities and items, it focuses more on semantic relation modeling and is more suitable for knowledge inference and relation prediction. The graph path-based recommendation method fully incorporates the contextual structure of knowledge graph paths and makes an embedded representation of the path connection features between different items. They achieve search prediction by mining user interests through multi-source heterogeneous information [11,12,13]. This method relies on high-quality paths of entities within domain knowledge, and thus requires a large amount of manpower to optimize the knowledge graph. It is costly and the universality of its algorithm is poor [14]. The hybrid method uses the user–item interaction graph to realize the embedding-propagation of entity relation features and carries out preference ranking based on the extracted retrieval central entities [15]. Chen proposed a collaborative filtering method using knowledge graphs, which embeds both users and attributes into the knowledge graph and aggregates them using matrix modeling to achieve implicit recommendations. It effectively improves the cold-start retrieval [16]. However, existing hybrid methods mainly take features of interactive user embedding into the accumulation in the process of embedding-propagation, but do not take potential item affinity into the short-term [17,18]. Therefore, it suffers from a lack of timeliness of recommendation.

To solve the above limitations, we propose a Power Fault Retrieval and Recommendation Model (PF2RM) based on the knowledge graph. Combined with the power fault knowledge graph, this model designs a user-polymorphic recommendation method for both the cold-start mode and general retrieval mode. Specifically, we use the graph-neighbor cluster polymerization method to update the cold-start prediction sequence for the power knowledge graph. It forms the user retrieval subgraph based on the polymorphic interaction process between users and items, and then recommends the fault maintenance treatment scheme according to the implementation scenario.

Contributions:

We reconstruct the ontology layer of the power knowledge graph with grid dispatching business as the core.
We propose the graph-neighbor fault entity cluster to emphasize the correlation between neighboring project entities to solve the problem of search cold starts.
We design a user-polymorphic recommendation method to study the user’s search intention from the user’s past state and current state.

The rest of the paper is organized as follows. Section 2 focuses on reviewing relevant research. Section 3 describes the retrieval and recommendation method in the PF2RM. Section 4 shows the situational analysis of experimental simulations. Section 5 presents the summary and prospect for the next steps.

2. Related Work

As an important component of the intelligent interactive system, the recommendation model based on the knowledge graph has high research value in both industrial and academic fields, and is a hot topic in the field of artificial intelligence. In the above summary, we review the existing recommendation algorithms based on the knowledge graph. In the following, we focus on the research status of three types of recommendation algorithms and emphasize the similarities and differences between the existing algorithms and the algorithm in this paper.

In the field of research and development of the knowledge graph, Wang et al. [19] studied the typical application scenarios and cases of enterprise-level knowledge graphs. They analyzed the construction needs and problems faced by the enterprise-level knowledge graph platform. Tian et al. [20] described the knowledge representation and graph construction methods and processes of the knowledge graph platform. Hao et al. [21] presented a rapid construction method and application framework of the domain knowledge graph with universality, and proposed a solution for the rapid landing problem of the domain knowledge graph in knowledge quiz applications. Duan et al. [22] described the importance of the knowledge graph in recommendation quality improvement and made a feasibility validation comparison. The development of quiz and recommendation systems in various fields has effectively advanced the research progress of knowledge graphs and search recommendation technology. This paper focuses on the recommendation algorithm based on knowledge graphs.

The recommendation algorithm incorporating the knowledge graph was first proposed by Google [23], which represents knowledge as attribute-value pairs corresponding to entities. It expresses the relation between knowledge through the connection between entities. Huang et al. [3] proposed the neural graph personalized ranking (NGPR) to preserve the correlation between users and items by incorporating user–item interaction graphs into embedded learning. In the same way, Hu et al. [24] proposed an interactive rules-guided recommendation framework based on the knowledge graph in the user interaction field, which improves the recommendation results by constructing a connection mode between user portraits and perfect knowledge graphs through entity-relation paths and also improves the interpretability of the recommendation results. However, it needs to be optimized for optimal path selection. Chen et al. [11] proposed an automatic learning path generation method based on a knowledge graph, which serializes course objects based on entity attributes and relation paths to promote individualized learning. Li et al. [12] designed a forward search algorithm and some optimization techniques to solve the shortest path problem. The efficiency of the algorithm greatly exceeds that of traditional path methods. On the other hand, a new method based on semantic matching was proposed in reference [6]. It models the semantic concept of the input question by extracting the subgraphs of the knowledge base. Based on this, Jiao [25] proposed the concept of the multi-step relational path to discuss various semantic relations in the multi-relation path to improve recommendation performance. Wang et al. [7] made full use of the knowledge graph, combining multi-source interest knowledge and semantic correlations to reveal the internal principle and influencing factors of the knowledge graph representation model. Based on the concept of the knowledge graph, Yu et al. [4] proposed a graph attention recommendation model based on deep learning and realized the construction of a user portrait and two-end enhancement recommendation. Similarly, Xie et al. [8] proposed a potential meta-graph embedding for the item recommendation method. It made predictions by capturing users, items, and the low-dimensional and high-dimensional interactive information between their contexts based on the meta-graph. Based on the semantic embedding method, Sun et al. [1] proposed a collaborative filtering recommendation algorithm and the matrix decomposition method to realize semantic information acquisition and the knowledge token, and to alleviate the cold-start problem while generating an accurate recommendation list. Rendle et al. [5] used the series model to generate valid inference paths to infer the fundamental user–item interaction based on the knowledge graph and the preference information obtained from supplementary term information. Huang [26] and Tang [27] et al. fused existing grid multi-source data to construct a knowledge graph of power equipment for the integrated management of equipment in off-grid fault situations through intelligent search and visualization.

Although the existing recommendation methods have yielded good prediction results, some limitations remain: (1) Due to the strong complexity and centrality of the relation paths between entities in the fault knowledge graph, there is a lack of scientific path selection algorithms to measure the entity correlation degree. (2) The cold-start phase has a small sampling range and few user retrieval features, while the accuracy of the small sample recommendation in this case needs to be improved. (3) The diversity efficiency of the recommendation algorithm still needs to be improved, and the recommendation diversity should be considered in different stages of fault analysis to avoid the cocoon effect and to improve the time state perception of the search recommendation algorithm. Therefore, in this paper, we use PF2RM to make retrieval recommendations to users to solve the cold-start problem and the timeliness and diversity of the recommended content.

3. Methodology

Aiming at the characteristics of the strong professionalism of power data and high correlation degree of the power knowledge graph, this paper designs a power fault analysis system to realize the retrieval and troubleshooting of the power grid operation process. The architecture of the method consists of four layers: multi-source heterogeneous data layer, power graph construction layer, knowledge reasoning layer, and smart interaction layer. The intelligent analysis of the grid fault is realized through the data stream control transmission between layers, as shown in Figure 1.

The multi-source heterogeneous data layer serves as the database for the experiments throughout the study. It mainly consists of textual data and operational data. For the safety standards and scheduling regulations contained in the text data, the general approach of knowledge graph construction is used for knowledge extraction to obtain the equipment entities and fault relationships. The operation data are also combined with expert experience and a failure plan as the metadata for the construction of the knowledge graph. At the same time, the business logic of the ontology layer of the knowledge graph is constructed in combination with the grid business.

The power graph construction layer is responsible for knowledge integration of the data in the knowledge graph and building a multi-business-focused ontology layer design approach. We perform knowledge processing (for example, disambiguation of entities and knowledge inference between relationships) on the entities and relationships obtained by knowledge extraction. After that, we construct and adapt the business ontology layer of the power knowledge graph to the multi-business model of the grid. Finally, we reorganize the extracted event knowledge with the aid of a troubleshooting case library in terms of entity relations to form the power fault knowledge graph for the study of retrieval recommendation methods for subsequent knowledge.

The method proposed in this paper is mainly implemented in the knowledge reasoning layer to achieve retrieval location and analysis of power faults by designing intelligent fault retrieval methods. We deeply mine user search recommendations and retrieval intentions through the power fault retrieval method (PFR) in the cold-start state. We propose a user-polymorphic retrieval recommendation method (UPRR) to predict the retrieval behavior of regular state users and then generate user search recommendations for grid operation and fault handling.

The smart interaction layer is programmed to visualize fault retrieval and assisted decision making. Based on the storage structure of the knowledge graph and the intelligent fault inference method, the relevant retrieval results of the fault problem are displayed to the user to assist the user in overhauling and troubleshooting the grid. Through this visualization, it can help users to achieve intelligent search Q&A and assist in decision making for grid operation and maintenance scheduling issues.

3.1. Intelligent Fault Analysis Method

Combining the fault data of the existing power knowledge graph, we perform the fault retrieval, as shown in Figure 2. To the left of Figure 2 is the logical process of fault retrieval, and to the right is the actual process of the corresponding algorithm. It triggers the wildcard based on template matching (“✶” represents wildcard), extracts the retrieval keywords, and then creates the graph-neighbor fault entity cluster on the multi-business power knowledge graph. Finally, it realizes the retrieval and matching of fault entities through the power fault retrieval and recommendation model based on user-polymorphic perception, and assists the power grid fault retrieval and auxiliary decision-making.

3.2. Construction of Multi-Business Power Knowledge Graph

This paper is oriented to power fault analysis and retrieval auxiliary decision-making business, combining with the power grid topology to form a fault knowledge characterization. It takes the fault handling process as the entity attribute preset principle and uses user retrieval data and the fault preplan as the core event of the graph ontology layer construction. The preset method adopts the form of entity name → {relation} → attribute value.

The ontology structure of the multi-service power knowledge graph is shown in Table 1. In this paper, the ontology of the fault knowledge graph is divided into a three-layer structure. For each fault case in the mapping, relationships and attributes are extracted according to this structure. By redesigning the ontology layer, it is able to query more detailed solutions during grid equipment fault retrieval and can help the staff to analyze the cause of faults in the topology structure.

As shown in Figure 3, we present the ontology layer of the multi-service power knowledge graph with the help of the visualization tool of the knowledge graph. It differs from Table 1 in that the relationships between the layers are labeled and the key information available to the user during fault diagnosis is presented in a more intuitive way.

3.3. Graph-Neighbor Fault Entity Cluster

For the cold-start phenomenon, which is common in the current retrieval system, this paper designs a graph-neighbor fault entity correlation cluster method. It expands along the power entity relation path adjacent to the retrieval central entity. The graph entity feature is embedded in the recommendation model through extension and aggregation until beyond multi-hop to predict the user’s retrieval target. Its function is to make the associated entities form a retrieval domain and enhance the effectiveness and applicability of the recommendation result. At the same time, it can explore the potential retrieval trend based on user historical retrieval information, and can automatically generate an interpretable optimal target.

In order to describe the extension and relevance of relationships between entities in the knowledge graph G, this paper uses

N = {n | (r, k, t) \in G}

to define the information of neighboring cluster entities connected to entity N. N(k) is an entity node set directly connected to the entity at the relation distance of k hops, and t is used to store the hop count to the neighbor entity node. The k-hop neighbor cluster entity sampling information of entity N on the knowledge graph is obtained recursively:

N (k) = \forall_{n}^{k} \{n | (N, n) \exists r a n d t \leq k\} k = 1, 2, \dots, m

(1)

Among them, m is the predefined maximum relation hop count. The quantified form of the above relation shows the correlation degree between the neighbor entities of the knowledge graph.

At the same time, the neighbor cluster nodes of each entity node set N(k) are clustered according to the relation rules and entity update to obtain the neighbor cluster node set. Then, the graph-neighbor fault entity cluster nodes set matched by keywords is considered the user’s retrieval recommendation at retrieval cold-start, which reflects the extension of correlation in the retrieval recommendation of the knowledge graph. Each set sampling range is defined as:

W (n) = \{n | n \in N (k)\}, |W (n)| \leq K

(2)

Among them, K is the preset hop count of the neighbor cluster node. The N(k) in the original entity node will also be adjusted dynamically. While realizing the dynamic adjustment of the graph, it also automatically drives the improvement of the recommendation method. It is worth noting that the greater the number of neighbor nodes between entities, the closer the connection between the entities, that is, the richer the entity features in the sampling range. In order to solve the problem where the traditional method ignores the close relation between the sampling entity and the retrieval central entity when sampling, this paper proposes a graph-neighbor fault entity correlation cluster method based on the above features to realize the clustering of the graph entity relation.

As shown in Figure 4, the circles in the figure represent the set N(k) of entity nodes with strong association degree with the retrieved keywords in the two-level sampling range (when k = 2). To facilitate the display of the association degree, we use solid circles of different colors for the entity nodes in the figure to represent them. The darker the color of the entity circles, the larger the number, which means the stronger the association with the retrieved keywords and the higher the priority in the retrieved results.

Therefore, in order to define their priorities, we design the graph-neighbor fault entity cluster (Algorithm 1) to calculate the priority of each node. In the calculation process, the order of priority is indicated by numbers; the higher the number, the higher the priority and the darker the color of the entity in the graph. After the priority order of the entities is obtained by the relational calculation, we use the dark and filled entities (entity nodes marked as 3, 4, and 5) as the entities within the sampling set W(n) and display them to the user, and the light-colored and unfilled entities are ignored. The implementation steps are as follows:

Step 1: Retrieve the central graph-neighbor entity cluster. First, find the entity set N(k) directly connected to the retrieval central entity n, and make graph-neighbor entity cluster statistics for each neighbor node entity $N {(k)}_{i}$ stored in $N (k)$ ;
Step 2: Obtain the graph-neighbor entity cluster. Compare the actual hop count from the central entity in $N {(k)}_{i}$ with the value of the sample value K. Then, recursively determine whether the 1 to K layer neighbor entity satisfies the hop count k, and use the satisfying entities as the elements within the graph-neighbor entity cluster $W (k)$ ;
Step 3: Make statistics on the graph-neighbor cluster entity relations. Make traversal statistics for the element connections within the neighbor entity cluster $W (k_{i})$ according to the index list of each neighbor cluster node, and calculate the relation number of each entity $R (W (k_{i}))$ ;
Step 4: Merge and sort the elements in $R (N {(k)}_{K})$ . First, group $R (W (k_{i}))$ and sort by the relation number to obtain the entity index list with sequence length N/2. Record the sort result as r. Then, compare the r of the two sorted groups of N/2 entity elements from the beginning of the sequence, and place the entity with the larger value of r in the sampling list W(n). After traversal, delete the duplicate index entity in the sampling list. Finally, the entity corresponding to the index in the sampling list is the graph-neighbor entity cluster W(n) of the retrieval central entity.

Algorithm 1. Graph-Neighbor Fault Entity Cluster
1:	Input: Sampling entity cluster N(k)
2:	Output: Graph-neighbor entity cluster W(n)
3:	//Central graph-neighbor entity cluster retrieval
4:	while k < preset value do // Get collection of k-jump fault entities.
5:	Recurse (t); // By recursively computing the entities in N(k).
6:	W(k) ← Graph_neighbor(t, k);
7:	end while
8:	//Computation of each entity relation of graph-neighbor cluster
9:	for Entity in W(k) do
10:	for i = 1, 2, …, n do
11:	if n $i n W (k_{i})$ then
12:	Count ( $R (W (k_{i}))$ ) // Number of statistical samples.
13:	end if
14:	end for
15:	end for
16:	//Merge and sort the graph-neighbor cluster elements W(k).
13:	for Entity $i n$ W(k) do
14:	W(k) ← Merge( $R (W (k_{i}))$ ) // Subsuming sorting of element priorities.
15:	end for
16:	returnW(n)

3.4. User-Polymorphic Retrieval Recommendation

In order to realize the accurate retrieval recommendation to the user and ensure the actual efficiency of the power grid fault analysis, the user’s retrieval history is considered in the model prediction after the cold-start phase. Through sampling the retrieval information from the user past-state and current-state, the retrieval subgraph is formed, and the retrieval intention prediction is carried out by the benchmarking graph-neighbor fault entity cluster method. This enables long-term user retrieval behavior perception, as shown in Figure 5.

3.4.1. Polymorphic Feature Collection

The past-state feature collection aims to collect the long-term behavior features of the user. In order to save efficiency and reduce the burden of CPU operation, the collection method takes the implicit calculation of daemon, which triggers the collection behavior after the user retrieval behavior.

The current-state user feature collection process is as follows:

First, the user incremental browsing data are extracted, including the fault entities

D_{n} (e_{1}, e_{2}, \dots, e_{n})

in which the user makes the retrieval behavior click in a short time ∆t. Then, based on the click item, the K-hop range entity set D(k) is queried in the fault graph.

The past-state user feature collection process is as follows:

First, the user history browsing items

H_{m} (h_{1}, h_{2}, \dots, h_{m})

are collected, and then a query is made in the fault graph according to the click item to obtain the K-hop sampling range entity set H(k) of the item.

Finally, according to the neighbor cluster entity sampling information N(k) of the two kinds of items, the user current-state retrieval subgraph

U_{D}

and the past-state retrieval subgraph

U_{H}

are formed. During the general retrieval, the user retrieval knowledge graph is obtained based on these two subgraphs:

U (V, E) = \{\begin{matrix} U_{H} & I m p l i c i t \\ U_{D} ⨁ U_{H} & E x p l i c i t \end{matrix}

(3)

3.4.2. User Intention Prediction

The user retrieval knowledge graph

U (V, E)

is obtained through polymorphic feature collection. Then, the user retrieval knowledge graph is divided by spectral clustering to obtain high polymer user retrieval entity cluster S(k). Finally, the feature entity cluster corresponding to the retrieval central entity is sorted by the neighbor-graph fault entity cluster method, and the neighbor-graph fault entity cluster W’(n) is obtained. W’(n) is the user retrieval intention prediction result. The implementation steps are as follows.

Step 1: Define the adjacency matrix W and degree matrix D of the user retrieval knowledge graph. Use the Radial Basis Function (RBF) to define edge weights, where the edge weight between each entity v is represented as:

w_{i j} = e x p (- \frac{{||v_{i} - v_{j}||}_{2}^{2}}{2 σ^{2}}) v i, v j \in V

(4)

For any point in the graph, its degree is defined as:

d_{i} = \sum_{j = 1}^{n} w_{i j} w_{i j} \in E

(5)

Step 2: Calculate and construct the standardized Laplacian matrix L of the subgraph. The Laplacian matrix L = D − W. It is a symmetric matrix and satisfies the following formula for any vector f:

f^{T} L f = \frac{1}{2} \sum_{i, j = 1}^{n} w_{i j} {(f_{i} - f_{j})}^{2}

(6)

In order to make the points within the cutting subgraph

S (S_{1}, S_{2}, \dots, S_{k})

of the user retrieval knowledge graph weight high and between subgraphs weight low, the minimized

C u t (S)

needs to be calculated:

C u t (S_{1}, S_{2}, \dots, S_{k}) = \frac{1}{2} \sum_{i = 1}^{k} \frac{W (S_{i}, {\bar{S}}_{i})}{v o l (S_{i})}

(7)

Among them,

v o l (S_{i})

represents the sum of weights of all edges in the subset S. To achieve the optimal solution, introduce indicator vector

H (h_{1}, h_{2}, \dots, h_{j})

,

j = 1, 2, \dots, k

, then

H^{T} D H = I

. It is defined as:

h_{i j} = \{\begin{matrix} 0 & v_{i} \notin S_{j} \\ \frac{1}{v o l (S_{j})} & v_{i} \in S_{j} \end{matrix}

(8)

Based on the nature of the Laplacian matrix and the definition of the indicator vector, we derive

h_{i}^{T} L h h_{i}

as follows:

\begin{array}{l} h_{i}^{T} L h h_{i} = \frac{1}{2} \sum_{m = 1} \sum_{n = 1} w_{m n} {(h_{i m} - h_{i n})}^{2} \\ = \frac{1}{2} (\sum_{m \in S_{i}, n \notin S_{i}} w_{m n} {(\frac{1}{\sqrt{v o l (S_{i})}} - 0)}^{2} + \sum_{m \notin S_{i}, n \in S_{i}} w_{m n} {(0 - \frac{1}{\sqrt{v o l (S_{i})}})}^{2}) \\ = \frac{1}{2} \times \frac{1}{v o l (S_{i})} (c u t (S_{i}, {\bar{S}}_{i}) + c u t ({\bar{S}}_{i}, S_{i}) \\ = \frac{c u t (S_{i}, {\bar{S}}_{i})}{v o l (S_{i})} \end{array}

(9)

The optimization goal is:

\begin{array}{l} C u t (S_{1}, S_{2}, \dots, S_{k}) = \sum_{i = 1}^{k} h_{i}^{T} L h h_{i} \\ = \sum_{i = 1}^{k} {(H^{T} L H)}_{i i} \\ = t r (H^{T} L H) \end{array}

(10)

The final optimization goal is:

\begin{matrix} a r g m i n \\ H \end{matrix} t r (H^{T} L H) s . t . H^{T} L H = I

(11)

Order

H = D^{- \frac{1}{2}} F

, and substitute it into Equation (12):

\begin{array}{l} H^{T} L H = {(D^{- \frac{1}{2}} F)}^{T} L D^{- \frac{1}{2}} F \\ = F^{T} D^{- \frac{1}{2}} L D^{- \frac{1}{2}} F \end{array}

(12)

Step3: Calculate the feature vector F of the subgraph. Similarly, substitute D into:

\begin{matrix} H^{T} D H = {(D^{- \frac{1}{2}} F)}^{T} D D^{- \frac{1}{2}} F \\ = F^{T} D^{- \frac{1}{2}} D D^{- \frac{1}{2}} F \\ = F^{T} I F = I \end{matrix}

(13)

The optimization goal can be replaced by:

\begin{matrix} a r g m i n \\ H \end{matrix} t r (F^{T} D^{- \frac{1}{2}} L D^{- \frac{1}{2}} F) s . t . F^{T} F = I

(14)

Calculate the first

k_{1}

feature vectors f of

D^{- \frac{1}{2}} L D^{- \frac{1}{2}}

and standardize the matrix of each corresponding feature vector f by row. Then, form the feature matrix F of the

n \times k_{1}

dimension.

Step4: Obtain the subgraph division

(S_{1}, S_{2}, \dots, S_{k})

and predict the user intention. Use the rows of feature matrix F as samples to perform the K-Means clustering. The clustering dimension is k, and obtain the user retrieval subgraph

S (S_{1}, S_{2}, \dots, S_{k})

.

After that, perform the template matching based on user retrieval content to obtain the corresponding user retrieval subgraph S_n. Obtain the neighbor entities in the 1 to K layer that satisfy the k-hop from the retrieval subgraph S_n as elements within the graph-neighbor entity cluster W’(n). Obtain the graph-neighbor entity clusters W(n) of the retrieval central entity based on the graph-neighbor fault entity cluster method. W(n) is a list of predicted results, which is shown in Algorithm 2.

Algorithm 2. User Polymorphic Retrieval Recommendation Model
1:	Input: User retrieval subgraph $U_{D}, U_{H}$
2:	Output: User retrieval prediction result W(n)
3:	// Get user retrieval subgraphs $U (V, E)$
4:	if state = = Implicit then // Determine the search status
5:	$U (V, E) \to U_{H}$ ;
6:	else if state = = explicit then
7:	$U (V, E) \to U_{D} ⨁ U_{H}$ ;
8:	end if
9:	// User retrieval subgraph division
10:	W, D ← Define( $U (V, E)$ );
11:	L ← D-W;
12:	Calculate F;
13:	$S_{n}$ ← Retrieval_subgraphs $(S_{1}, S_{2}, \dots, S_{k})$ ; // Solve retrieval subgraph S_n.
14:	if Re_entity $i n$ Sn then// Use method PFR for retrieving subgraphs S_n
15:	for i = 0, 1, 2, …, n do
16:	W’_n ← Graph_neighbor(S_i);
17:	W_n ← re_Merge(W’_n); // Summarize and sort the retrieved subgraph S_n.
18:	end for
19:	end if
20:	returnW(n)

4. Experimental Results and Analysis

4.1. Power Fault Knowledge Graph

The power knowledge data are stored in the form of a knowledge graph, and the graph visual indication of the operational data is shown in Figure 6. The algorithm integrates the original power knowledge through ontology layer reconstruction. The fault event cases and dispatching specifications stored in the graph are connected by the hierarchical or the inclusive relation. The fault feature is evaluated and stored as a vector in the feature attribute unit corresponding to the entity.

4.2. Experimental Data and Model Comparison

This paper uses fault log data from grid operations in southern China, and it also includes power operation and maintenance manuals for equipment and expert cases. It consists of 5000 sentences, 255 entities, and 16 relations. After filtering, the number of training set entity pairs is 18,050, the number of relative facts is 30, the number of relations is 6, the number of testing set entity pairs is 3500, the number of relative facts is 20, and the number of relations is 6, representing 6 kinds of relation features.

The whole algorithm model is implemented in Python, and the experimental design of the PF2RM is compared with the four related recommendation algorithms mentioned in the paper:

RippleNet [13] is a hybrid recommendation model that propagates user preferences by introducing a knowledge graph, using the original features of users and items and the knowledge graph as input.
LibFM [5] is a model based on feature decomposition for click-through-rate prediction, using the original features of users and items as input.
DeepFM [28] is a deep learning recommendation model. The low-level feature and high-level feature are extracted by the neural network module and the factorization machine module, respectively, and the two extracted parts are input as features.
CKE [2] (Collaborative Knowledge Embedding) is a recommendation model based on the embedding method, which combines the collaborative filtering method to introduce the knowledge graph to accomplish the recommendation task.

There are two comparison modes for evaluating the recommendation effect. Both use the Top-50 [29] of the recommendation results as the evaluation data, and the evaluation indicators are compared by average.

4.3. Cold-Start Recommendation Effect Evaluation

In the cold-start search mode where the user feature information is scarce, the experiment uses NDCG (Normalized Discounted Cumulative Gain) and AUC (Area Under Curve) to evaluate the correlation of the recommendation effect in the cold-start situation.

As shown in Table 2, it is the average NDCG and AUC effect evaluation of the Top-50 of each recommendation model for the cold-start situation. The results show that the correlations of the recommendation result of RippleNet and CKE with the knowledge graph are much higher than those of LibFM and DeepFM, and the performance results in AUC are also better. In the cold-start situation, the high correlation retrieval result is more helpful for rapid fault analysis. The PF2RM recommendation algorithm proposed in this paper has an overall improvement on the current basis.

As shown in Figure 7, the experimental analysis evaluates the NDCG and AUC of the Top-50 retrieval recommendation results. Each of the ten recommendation results are calculated as a whole and then compared by drawing a line chart. The comparison results show that the knowledge graph-based recommendation models perform better in terms of NDCG and AUC. The retrieval results of the knowledge graph-based models within the Top-20 are similar. However, from the retrieval effect in the Top-30 to 50, it can be concluded that the PF2RM model proposed in this paper improves the NDCG and AUC by 3% to 8% compared with the best model. The reason for this is that the structure of the knowledge graph is more able to integrate the data with strong correlation, which meets the need of users to retrieve high correlation issues in the cold-start situation.

As shown in Figure 8, the experiment analyzes the sensitive influencing factors of the sampling hyper-parameter K to PF2RM in the cold-start situation and compares the results. With other parameters fixed, the K value is changed in turn to experiment. The experimental result shows that the NDCG and AUC effects are best when K is set to 1 or 2 in the cold-start situation, and the effects are worst when it is set to 5. This is because the neighbor feature information used to compute the central entity is too large, resulting in an increase in irrelevant entity features, which leads to a smoothing of the model.

4.4. User Click Recommendation Effect Evaluation

For the situation where users have long used the general search mode with rich feature information, the experiment uses accuracy P (Precision), F1 score, and D (Diversity) to evaluate the user click recommendation effect.

As shown in Table 3, the user click recommendation effect evaluation shows that LibFM, DeepFM, and CKE have similar recommendation effects in accuracy and F1 score. RippleNet uses path multi-hop correlation to mine the user’s personality, and its recommendation effect is better than the above two. The PF2RM recommendation model proposed in this paper implements polymorphic analysis on this basis. Compared with RippleNet, the accuracy is 2% higher, the F1 score is the same, and the overall effect is better. At the same time, D is used to evaluate the richness of search recommendation results. The PF2RM algorithm also has a high domain recommendation effect, which can effectively avoid the information cocoon effect.

As shown in Figure 9, the experiment analyzes and evaluates the precision, diversity, and F1 score of the Top-50 retrieval recommendation results for the five models. Similarly, each of the ten recommendation results are evaluated as a whole, and then compared by drawing a line graph. According to the comparison result, the top 10 to 20 results calculated by the recommendation algorithm are closest, and then the prediction effect declines rapidly after 20 results. The recommendation model based on the knowledge graph performs better in predicting results after 20. The reason is that the polymorphic recommendation algorithm is better able to recommend knowledge that is more consistent with the user’s personalization and meets the need for user click recommendation in the general retrieval situation. The experimental results show that the models before the Top-20 retrieve similar results, but the PF2RM model proposed in this paper consistently performs best in subsequent retrievals, with 10%, 6%, 8%, and 2% improvements over the best model in terms of precision, diversity, recall, and F1 score, respectively.

As shown in Figure 10, the experiment also analyzes the sensitive influencing factors of the sampling hyper-parameter K to PF2RM in the regular user search situation and compares the results. With the other parameters fixed, the K value is changed in turn to experiment. The last one in Figure 9 shows that the recommendation effect is best when K is set to 2 or 3 in the regular user search situation, and the effect is worst when it is set to 5. This is because as K increases, the excessive user feature sampling number causes the model to overfit.

5. Conclusions

Combined with the power fault knowledge, this paper presents a power fault retrieval and recommendation model based on a knowledge graph to perform deep mining of user search intention and finally realizes the accurate prediction and fault analysis of user search intention. It can make full use of the high-quality paths of the knowledge graph to realize the embedding learning of the neighbor retrieval entity relation, which greatly improves the user retrieval recommendation effect in the cold-start state. The polymorphic retrieval recommendation method collects the past-state and current-state features of user–item interaction retrieval through time domain awareness to realize short-term retrieval behavior learning of the general retrieval recommendation. It improves the diversity and personalization of recommendation results. Through the comparison experiments of four models, it can be concluded that our model has obvious advantages. From the comparative experiments of the four models, it can be concluded that our model has a clear advantage in terms of comprehensive evaluation. It is able to improve NDCG and AUC by 2–10% at cold-start, and by 10%, 6%, 8%, and 2% for precision, diversity, recall, and F1 scores in regular retrieval, respectively. This helps to assist the grid intelligent operation and auxiliary decision-making, and to improve the reliability of power grid.

Although the retrieval recommendation effect of this paper has been improved to a certain extent, the immediate efficiency and robustness of the model still remain to be tested in the face of sudden grid failures and massive data. Meanwhile, the structure of the knowledge representation and the method of autonomous learning still need to be studied, and the relational paths of entities in the knowledge graph have not been comprehensively considered. In addition, we will try to use graph neural networks to learn and represent the network structure information of the knowledge graph.

Author Contributions

All authors contributed to the writing and revisions; writing—review and editing, K.L.; writing—original draft, B.Z. (Baoxian Zhou); methodology, Y.Z.; data curation, Y.L.; project administration, B.Z. (Bo Zhang); supervision, X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 61807024.

Data Availability Statement

Data available on request due to restrictions of privacy or ethical.

Conflicts of Interest

The authors declare no conflict of interest.

References

Sun, W.; Chen, P.; Xiong, J.; Shen, J. A graph attention recommendation model based on bipartite knowledge graph. Comput. Eng. Appl. pp. 1–10. Available online: http://kns.cnki.net/kcms/detail/11.2127.TP.20210721.1555.008.html (accessed on 21 February 2022).
Thagard, P. Collaborative Knowledge. NOS 2010, 31, 242–261. [Google Scholar]
Huang, Q.; Wang, S.; Xu, Q.; Li, L.; Jiang, S. Image video-centric cross-media analysis and reasoning. J. Intell. Syst. 2021, 16, 835–848. [Google Scholar]
Yu, C.; Zhang, Z.; Kong, L. A comparative study of knowledge graph representation models for link prediction. Data Anal. Knowl. Discov. 2021, 5, 29–44. [Google Scholar]
Rendle, S. Factorization Machines with libFM. ACM Trans. Intell. Syst. Technol. 2012, 3, 1–22. [Google Scholar] [CrossRef]
Gao, J.; Liu, Q.; Huang, W. Research on automatic generation of learning paths based on knowledge mapping. Modern Educ. Technol. 2021, 31, 88–96. [Google Scholar]
Wang, T.; Shi, D.; Wang, Z.; Xu, S.; Xu, H. MRP2Rec: Exploring Multiple-Step Relation Path Semantics for Knowledge Graph-Based Recommendations. IEEE Access 2020, 8, 134817–134825. [Google Scholar] [CrossRef]
Xie, F.; Zheng, A.; Chen, L.; Zheng, Z. Attentive Meta-graph Embedding for item Recommendation in heterogeneous information networks—ScienceDirect. Knowl. Based Syst. 2020, 211, 106524. [Google Scholar] [CrossRef]
Liu, Q.; Qin, M. A collaborative matrix decomposition method based on knowledge representation learning. J. Beijing Univ. Technol. 2021, 41, 752–757. [Google Scholar]
Chou, D.; Wu, J.; Liu, Z.; Tu, Z.; Hu, X.; Li, C.; Su, H.; Xia, Y.; Gao, X.; Pan, Z.; et al. Research on key technologies and applications of intelligent medical and health care integration service platform. J. Intell. Syst. 2021, 16, 972–988. [Google Scholar]
Chen, J.; Yu, J.; Lu, W.; Qian, Y.; Li, P. IR-Rec: An interpretive rules-guided recommendation over knowledge graph. Inf. Sci. 2021, 563, 326–341. [Google Scholar] [CrossRef]
Li, Z.; Yang, Y.; Wang, X. Rule-based shortest path query algorithm. J. Softw. 2019, 30, 515–536. [Google Scholar]
Wang, H.; Zhang, F.; Wang, J.; Zhao, M.; Li, W.; Xie, X.; Guo, M. RippleNet. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, Turin, Italy, 22–26 October 2018; pp. 417–426. [Google Scholar]
Yu, Y.; Du, Q.; Yue, B.; Xiang, J.; Xu, G.; Leng, Y. A review of recommendation research based on reinforcement learning. Comput. Sci. 2021, 48, 1–18. [Google Scholar]
Tao, Y.; Yu, L.; Wu, Z. CoTransH: A translation model for semantic relationship prediction in the knowledge graph of scien-tific and technical documents. Intell. Theory Pract. 2021, 44, 187–196. [Google Scholar] [CrossRef]
Chen, Y.; Mensah, S.; Ma, F.; Wang, H.; Jiang, Z. Collaborative Filtering Grounded on Knowledge Graphs. Pattern Recognit. Lett. 2021, 107, 2411–2502. [Google Scholar] [CrossRef]
Zhang, Y.; Wang, W.; Liu, H.; Gu, R.; Hao, Y. Collaborative filtering recommendation algorithm based on knowledge graph embedding. Comput. Appl. Res. 2021, 38, 3590–3596. [Google Scholar] [CrossRef]
Wang, L.; Li, S.; Liu, H. A news recommendation method incorporating knowledge graph and differential privacy. Comput. Appl. 2021, 1–9. Available online: http://kns.cnki.net/kcms/detail/51.1307.TP.20210727.1429.006.html (accessed on 21 February 2022).
Wang, H.; Ding, J.; Hu, F.; Wang, X. A review of large-scale enterprise-level knowledge graph practices. Comput. Eng. 2020, 46, 1–13. [Google Scholar]
Tian, L.; Zhang, Z.; Zhang, J.; Zhou, W.; Zhou, X. A review of knowledge graphs - representation, construction, inference and knowledge hypergraph theory. Comput. Appl. 2021, 41, 2161–2186. [Google Scholar]
Hao, Y.; Zhang, J.; Wu, M.; Wu, X. A framework for rapid construction and application of domain knowledge graphs. J. Intell. Syst. 2021, 16, 871–884. [Google Scholar]
Duan, Y.; Shao, L.; Cui, L.; Gao, H. Knowledge Graph Based Personalized Test Recommendation in Cloud. J. Chin. Comput. Syst. 2018, 39, 2743–2747. [Google Scholar]
Google Knowledge Graph [EB/OL]. Available online: https://en.wikipedia.org/wiki/Knowledge_graph (accessed on 12 February 2022).
Hu, Z.; Wang, J.; Yan, Y.; Zhao, P.; Chen, J.; Huang, J. Neural graph personalized ranking for Top-N Recommendation. Knowl. Based Syst. 2020, 213, 106426. [Google Scholar] [CrossRef]
Jiao, J.; Wang, S.; Zhang, X.; Wang, L.; Feng, Z.; Wang, J. gMatch: Knowledge base question answering via semantic matching. Knowl. Based Syst. 2021, 228, 107270. [Google Scholar] [CrossRef]
Huang, H.; Chen, Y.; Lou, B.; Hongzhou, Z.; Wu, J.; Yan, K. Constructing Knowledge Graph from Big Data of Smart Grids. In Proceedings of the 2019 10th International Conference on Information Technology in Medicine and Education (ITME), Qingdao, China, 23–25 August 2019; pp. 637–641. [Google Scholar]
Tang, Y.; Liu, T.; Liu, G.; Li, J.; Dai, R.; Yuan, C. Enhancement of Power Equipment Management Using Knowledge Graph. In Proceedings of the 2019 IEEE Innovative Smart Grid Technologies—Asia (ISGT Asia), Chengdu, China, 21–24 May 2019; pp. 905–910. [Google Scholar]
Guo, H.; Tang, R.; Ye, Y.; Li, Z.; He, X. DeepFM: A Factorization-Machine based Neural Network for CTR Prediction. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, Melbourne, Australia, 19–25 August 2017. [Google Scholar]
Li, J.; Xu, Z.; Tang, Y.; Zhao, B.; Tian, H. Deep Hybrid Knowledge Graph Embedding for Top-N Recommendation. In Proceedings of the 17th International Conference on Hybrid Learning and Education, Guangzhou, China, 23–25 September 2020; pp. 59–70. [Google Scholar]

Figure 1. The architecture of our model: PF2RM.

Figure 2. Intelligent fault analysis process.

Figure 3. Multi-business power knowledge graph ontology layer visualization.

Figure 4. Solution process for graph-neighbor fault entity cluster.

Figure 5. User-polymorphic recommendation process.

Figure 6. Knowledge graph of power communication faults.

Figure 7. The evaluation charts of Top-50 recommendation results for cold-start.

Figure 8. Cold-start sampling parameter optimization chart.

Figure 9. Three evaluation results of Top-50 user click recommendations.

Figure 10. User click sampling parameter optimization chart.

Table 1. Multi-business power knowledge graph ontology layer.

Top Level	Middle Level	Bottom Level
Power Failures	Fault type	name, attribute, phenomenon, cause, processing task, expert experience
	Fault case	location, scenario, type, phenomenon, maintenance task, retrieval feature
	Fault record	time, location, type, phenomenon, cause, processing task, maintenance result
	Research	definition, application scenario, safety regulations, expert experience, analysis process
	Inspection tasks	maintenance content, application scenario, operation steps, expert experience
Topology	Power equipment	equipment name, equipment number, usage regulations, operating parameters, fault records, neighbor cluster node, production information
	Cross node	operating parameters, power equipment, load
	Line node	up-connect equipment, down-connect equipment, line attributes, operating parameters
	Responsibility	responsibility, location information, holder, contact information
	Department staff	name, department, position, contact information
	Department information	holder name, contact information, function, department staff
	Retrieval information	user ID, retrieval feature, retrieval history

Table 2. Evaluation table of cold-start recommendation model.

Model	NDCG	AUC
LibFM	0.38	76.3
DeepFM	0.32	85.4
RippleNet	0.63	88.2
CKE	0.57	86.6
PF2RM	0.60	89.1

Table 3. User click recommendation model evaluation table.

Model	P (Precision)	R (Recall)	F1 Score	D (Diversity)
LibFM	0.83	0.72	0.77	0.65
DeepFM	0.82	0.80	0.81	0.58
RippleNet	0.85	0.81	0.83	0.77
CKE	0.83	0.79	0.81	0.83
PF2RM	0.87	0.80	0.83	0.80

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liang, K.; Zhou, B.; Zhang, Y.; Li, Y.; Zhang, B.; Zhang, X. PF2RM: A Power Fault Retrieval and Recommendation Model Based on Knowledge Graph. Energies 2022, 15, 1810. https://doi.org/10.3390/en15051810

AMA Style

Liang K, Zhou B, Zhang Y, Li Y, Zhang B, Zhang X. PF2RM: A Power Fault Retrieval and Recommendation Model Based on Knowledge Graph. Energies. 2022; 15(5):1810. https://doi.org/10.3390/en15051810

Chicago/Turabian Style

Liang, Kun, Baoxian Zhou, Yiying Zhang, Yiping Li, Bo Zhang, and Xiankun Zhang. 2022. "PF2RM: A Power Fault Retrieval and Recommendation Model Based on Knowledge Graph" Energies 15, no. 5: 1810. https://doi.org/10.3390/en15051810

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

PF2RM: A Power Fault Retrieval and Recommendation Model Based on Knowledge Graph

Abstract

1. Introduction

2. Related Work

3. Methodology

3.1. Intelligent Fault Analysis Method

3.2. Construction of Multi-Business Power Knowledge Graph

3.3. Graph-Neighbor Fault Entity Cluster

3.4. User-Polymorphic Retrieval Recommendation

3.4.1. Polymorphic Feature Collection

3.4.2. User Intention Prediction

4. Experimental Results and Analysis

4.1. Power Fault Knowledge Graph

4.2. Experimental Data and Model Comparison

4.3. Cold-Start Recommendation Effect Evaluation

4.4. User Click Recommendation Effect Evaluation

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI