A Visualization Method of Knowledge Graphs for the Computation and Comprehension of Ultrasound Reports

Knowledge graph visualization in ultrasound reports is essential for enhancing medical decision making and the efficiency and accuracy of computer-aided analysis tools. This study aims to propose an intelligent method for analyzing ultrasound reports through knowledge graph visualization. Firstly, we provide a novel method for extracting key term networks from the narrative text in ultrasound reports with high accuracy, enabling the identification and annotation of clinical concepts within the report. Secondly, a knowledge representation framework based on ultrasound reports is proposed, which enables the structured and intuitive visualization of ultrasound report knowledge. Finally, we propose a knowledge graph completion model to address the lack of entities in physicians’ writing habits and improve the accuracy of visualizing ultrasound knowledge. In comparison to traditional methods, our proposed approach outperforms the extraction of knowledge from complex ultrasound reports, achieving a significantly higher extraction index (η) of 2.69, surpassing the general pattern-matching method (2.12). In comparison to other state-of-the-art methods, our approach achieves the highest P (0.85), R (0.89), and F1 (0.87) across three testing datasets. The proposed method can effectively utilize the knowledge embedded in ultrasound reports to obtain relevant clinical information and improve the accuracy of using ultrasound knowledge.


Introduction
Medical imaging techniques, such as ultrasound examination, X-ray, computed tomography, and magnetic resonance imaging, offer great value in clinical diagnosis [1,2].Medical experts typically document various types of examination data using free text in their natural language into a report to describe patients' symptoms, conditions, imaging findings, and even diagnosis results [3].These narrative reports based on an ultrasound imaging examination can help medical experts exchange information and understand the conditions of patients from clinical perspectives [4].The descriptive text recorded in ultrasound reports provides an additional source of clinical data to facilitate decision making during ultrasound examinations.However, the complexity of clinical data poses significant challenges to the practical application of computer-assisted analysis and modeling [5].Medical data are annotated by experts in the field of medicine, with predictions made manually.In the absence of domain experts or a shortage of professional healthcare personnel, the erroneous interpretation of medical data can pose serious consequences for patients [6].Thus, it is crucial to have a comprehensive understanding of novel knowledge extraction methods that are suitable for analyzing domain-specific texts to utilize them effectively [7,8].
Knowledge graphs use graph-based data models to capture knowledge in application scenarios that involve integrating, managing, and extracting value from various data sources [9].The conversion of the narrative text in ultrasound reports into a knowledge graph form facilitates the rapid modeling and reuse of existing clinical knowledge in these historical reports.Accurate health information management plays a vital role in collecting relevant information on diseases, which, in turn, supports effective clinical decision making [10,11].Knowledge graphs are valuable data structures in the field of information retrieval and representation, allowing for knowledge inference and prediction based on entity relationships.Advances in digital technology have opened up new possibilities for language processing, including natural language processing (NLP) methods [12,13] and tools capable of constructing free text and performing semantic analysis.Developing intelligent medical applications such as disease diagnosis and answering healthcare questions requires the critical task of representation learning for these graphs [14].Therefore, leveraging the proper modeling techniques for knowledge graphs with ultrasound text can greatly facilitate medical decision making based on ultrasound reports.
Research on existing knowledge graphs has extensively investigated analytic techniques for converting natural text into entities, relationships, and semantic structures [15].While many existing tools are proficient at analyzing general text [16], only a few are capable of extracting domain-specific knowledge in the medical field [17].In addition, most of the existing knowledge graphs in the medical field are based on data from various open databases, the quality of which cannot meet the needs of disease diagnosis or treatment [18,19].Most models are not easily transferable and reusable in non-English medical NLP models.Instead, retraining with new data in the target language is typically necessary [20].Accurately and meaningfully extracting information from raw ultrasound reports poses a significant challenge due to the complex and unstructured nature of the text.
Furthermore, existing approaches involving the extraction of semantic relationships and entity information in electronic medical records mainly rely on rules with manual features or machine-learning models [21][22][23].However, these methods have not fully exploited the hierarchical semantics of textual language morphology, including word dependencies and syntactic relationships [24], which can make it difficult to identify relevant clinical entities and terminology accurately.In addition, unstandardized physician writing habits can lead to limitations in knowledge graph scalability and accuracy.Methods with insufficient accuracy are difficult to model for decision making, especially when analyzing clinical texts [25].Medical entities frequently use shorthand, abbreviations, and aliases in real-world scenarios [26].The omission of subjects in the text presents a significant challenge when analyzing similar reports [27].
The objective of this study is to provide more accurate clinical knowledge for downstream knowledge inference tasks and facilitate computer-assisted clinical decision-making processes for ultrasound reports.The contributions of this study are as follows.

•
Firstly, we propose a novel key term network extraction method for ultrasound reports that achieves the highly accurate decomposition and annotation of the narrative text within reports.

•
Secondly, we introduce a knowledge representation framework based on ultrasound reports that provides a structured and intuitive visualization of ultrasound report knowledge.

•
Finally, we propose a knowledge graph completion model to address the lack of entities in physicians' writing habits and improve the accuracy of using ultrasound knowledge.
The Section 2 presents techniques and algorithms for extracting networks of key terms for ultrasound reports from natural language in the text.In addition, a framework for knowledge representation based on ultrasound reports is presented.The Section 3 presents experiments to evaluate the performance of the proposed method.The Section 4 discusses the application value and drawbacks of the proposed method.Finally, we summarize our research in the Section 5.

Materials and Methods
In this article, we present a comprehensive method for extracting and visualizing clinical knowledge from ultrasound reports.Our approach involves several stages of knowledge extraction to simplify the reports and improve their accuracy.In the first stage, we use a word segmentation strategy to structure the medical text and extract a term network (TN) from it.In the second stage, we apply specific syntax analysis rules to identify meaningful entities and the relationships between entities in the structured TN, generating a simple knowledge graph (SKG).In the third stage, we use the Word2Vec algorithm to extract multiple instances of the same entity and reduce confusion about grammar relations in the report to improve the accuracy of the structured results.In the fourth stage, we perform knowledge representation and visualization based on the simple knowledge graph extracted in the second stage.The constructed knowledge graphs can support knowledge discovery from ultrasound reports and facilitate computer-assisted decision making based on reports.Finally, we propose a fine-grained model called the knowledge graph completion model (KGCM) to supplement the knowledge graph by combining the importance of entities that are omitted.Overall, our proposed method offers a comprehensive approach to leverage knowledge from ultrasound reports and utilize knowledge graphs to facilitate computer-assisted decision making based on reports.The overview of the proposed approach is shown in Figure 1.
sents experiments to evaluate the performance of the proposed method.The Section discusses the application value and drawbacks of the proposed method.Finally, we sum marize our research in the Section 5.

Materials and Methods
In this article, we present a comprehensive method for extracting and visualizin clinical knowledge from ultrasound reports.Our approach involves several stages knowledge extraction to simplify the reports and improve their accuracy.In the first stag we use a word segmentation strategy to structure the medical text and extract a term ne work (TN) from it.In the second stage, we apply specific syntax analysis rules to identi meaningful entities and the relationships between entities in the structured TN, genera ing a simple knowledge graph (SKG).In the third stage, we use the Word2Vec algorith to extract multiple instances of the same entity and reduce confusion about grammar r lations in the report to improve the accuracy of the structured results.In the fourth stag we perform knowledge representation and visualization based on the simple knowled graph extracted in the second stage.The constructed knowledge graphs can suppo knowledge discovery from ultrasound reports and facilitate computer-assisted decisio making based on reports.Finally, we propose a fine-grained model called the knowled graph completion model (KGCM) to supplement the knowledge graph by combining t importance of entities that are omitted.Overall, our proposed method offers a compr hensive approach to leverage knowledge from ultrasound reports and utilize knowled graphs to facilitate computer-assisted decision making based on reports.The overview the proposed approach is shown in Figure 1.

Structuring Medical Text
Ultrasound reports often contain complex descriptions for each organ that is an e amination item, leading to numerous short sentences and relatively long overall senten

Structuring Medical Text
Ultrasound reports often contain complex descriptions for each organ that is an examination item, leading to numerous short sentences and relatively long overall sentence lengths.As the sentence length increases, the complexity of syntactic analysis also increases, resulting in more complicated grammar tree structures and a decline in the quality of the final syntactic analysis results.To address these challenges, we propose a method for dividing long sentences into multiple parts and processing them separately.By breaking down the sentences, we can extract the most effective information and improve the accuracy of our syntactic analysis.This approach allows us to handle the complexity of the ultrasound report's language and improve the quality of our results.Furthermore, the knowledge sharing of exact terminology between experts in any sector requires domain-specific vocabulary [28].This is particularly true in the medical field, where exact terminology is not present in general language dictionaries.To address this challenge, we aim to develop an algorithm that can represent basic concepts and their semantic relationships extracted from the text using TNs.Our algorithm preserves the order of sentences and words from the reports, allowing us to create network-based graphs that represent the report's content.By utilizing these graphs, we can perform further analysis and gain a deeper understanding of the text.This approach helps to standardize the language used in the medical field and facilitate knowledge sharing between experts.
Our proposed algorithm consists of several key components designed to accurately extract and represent essential concepts and their semantic relationships.First, we use a word segmentation method to segment text from the report, which is denoted by the function WSM (text).Second, we use a function called SNLP (w) to return the result of w with its part-of-speech (pos) and normalized word (n w ).SNLP, as a function, encapsulates various techniques such as syntactic analysis, pos labeling, and so on in order to extract the required information from w.The complete algorithm is illustrated as Algorithm 1.By using these functions together, we can accurately extract and represent the essential concepts and their semantic relationships.Let t ← a list of terms through the WSM (text) 2: Let s ← an empty set of concept relationships between two concepts {r: (c 1 , c 2 )}, where a concept is defined as a tuple c = (w, n w , pos).

Extracting Medical Terms
Using the structured results from the TN, each ultrasound report is transformed into a set of report-based semantic models that support the following information extraction techniques.In this stage, a grammatical tree among words is obtained from the syntax analysis of the text in a report using the TN.We can obtain attributes and their corresponding values for entities in the text by analyzing the syntactic relationships and the corresponding part of speech in the tree.Finally, the resulting data from the ultrasonicrelated text, described in natural language, is converted into an entity-attribute-value format and stored in a simple knowledge graph (SKG).The SKG is defined as a set of entity relations, denoted as {(entity, relation, entity)}.To achieve this, we defined rules to obtain named entities and implemented knowledge extraction methods to extract entity relations from complex texts.Table 1 presents the rules for entity relation extraction, along with the corresponding relationships for each rule.The complete algorithm is shown in Algorithm 2. By utilizing named entity recognition, predefined rules for information extraction, and transformation networks, we could extract relevant information from the text in a flexible and efficient manner.

Algorithm 2. Constructing simple knowledge graphs based on the term network
Input: a term network of a report (TN = {t 1 , t 2 , t 3 , . ..})Output: a simple knowledge graph (SKG = {r 1 , r 2 , r 3 , . ..}) 1: Let R ← a set of {rule: (tag 1 , tag 2 , . .., tag c } where c is the number of tags in a rule and tag ∈ {pos} ∪ {custom tag} 2: Let M ← a subset of TN where ∀term ∈ M are restricted to rules R.

Extracting Medical Terms
We then use the SKGs constructed in the previous steps to implement text synonym elimination based on textual similarities.To accomplish this, we utilize the Word2Vec algorithm [29] that is trained on high-frequency words in the ultrasound texts following word segmentation.We use cosine similarity to compare the semantic relevance among words, with a higher value indicating greater similarity.By comparing the cosine similarity between words, we can construct a synonym table that identifies and eliminates textual synonyms that could cause ambiguity in knowledge representation.Word2Vec is known for its ability to capture the meaning of words without relying on their syntactic structure and can identify semantically related words even if they are not grammatically similar.The Word2Vec algorithm is trained using a sliding window approach, which involves moving a window of fixed size over the text corpus and using the words within the window to predict the target word.After neural network training, we obtain a word vector space that contains both the location and semantic information.The distance between two vectors can be regarded as the semantic similarity between words and can be calculated using different formulas such as the Euclidean distance, Chebyshev distance, and Cosine similarity.Cosine similarity mainly emphasizes the variation in the direction of word vectors in the vector space, and this difference makes it more accurate in assessing the semantic relatedness between words compared to other methods.The closer the cosine value between two vectors is to 1, the closer the angle between them is to 0 in vector space, indicating that the words represented by these vectors are semantically similar.Our approach is highly efficient and accurate, leveraging the power of Word2Vec to represent the meaning of words and identify textual synonyms in the context of ultrasound-related knowledge representation.

Completing Knowledge Graph
The expression of ultrasound reports can sometimes be incomplete, with hidden information omitted and the grammatical structure appearing relatively concise.Generally, experts often describe the same entity when it first appears and then omit it in the next sentence to simplify the description.This can result in confusion in entity-pointing relationships and missing entities in the constructed knowledge graph.To address this issue, we propose a knowledge graph completion model (KGCM) based on the importance of entities, which embeds triplet information into low-dimensional vector space using the TransH model to achieve knowledge computability.The importance of entities is then combined to supplement the knowledge graph of omitted entities and avoid confusion in the knowledge representation and visualization process.
In the TransH model [30], the relation r is represented by a hyperplane w r and a relation vector d r , with the relation r in a different embedding space with e f and e t .In each triple (e f , r, e t ), e f and e t are projected on the hyperplane, corresponding to e f ⊥ , e t⊥ , which can be defined as follows: e t⊥ = e t − w r e t⊥ w r , where e f ⊥ and e t⊥ denote the mapping vectors that map e f and e t onto the hyperplane w r .
w r e f ⊥ w r denotes that e f is the projection on hyperplane w r .Additionally, it is important to note that the magnitude of the hyperplane vector w r is normalized to unity, as indicated by the following constraint: For a correct triple (e f , r, e t ), we consider that in the vector space e f ⊥ + d r ≈ e t⊥ .So the score function f r (e f , e t ) can be defined as follows: f r e f , e t = e f − w r e f ⊥ w r + d r − (e t − w r e t⊥ w r ) 2  2 . ( Equation ( 5) measures the squared distance between the sum of the projected embeddings and the corresponding relation vector.By minimizing this score function, the TransH model aims to capture meaningful relationships between entities in different embedding spaces.
TransH effectively embeds knowledge into low-dimensional vector space, enabling the computability of knowledge in knowledge graphs and supplementing e f in the knowledge graph triplet due to the lack of ultrasound report writing habits; however, it does not consider the importance of different entities e f to the same entity e t .To address this challenge, our KGCM incorporates a proximity function k(e i , e j ), which evaluates the importance of an entity based on the distance between entities.The proximity function is defined using a Gaussian kernel, which converts the dot product in the infinite-dimensional space into a Gaussian function of the distance between points in the data space.This approach is particularly suitable for evaluating the importance of an entity based on the distance between entities.When the distance between e i and e j is small, the value of k decreases slowly.However, when the distance value is in the middle range, it drops rapidly, and then when the distance value between two nodes is very large, it drops slowly again.More formally, the proximity function k(e i , e j ) can be defined as follows: where σ is the bandwidth parameter, which plays a crucial role in determining the radial range of action of the Gaussian kernel function.Specifically, σ controls the local range of the Gaussian kernel function, thereby influencing the accuracy of the proximity function k(e i , e j ) when evaluating the importance of an entity through the distance between entities.A smaller σ value restricts the range of the Gaussian kernel function to a smaller region, resulting in a higher weight for entities that are closer together.Therefore, selecting an appropriate value for σ is critical to ensure the accuracy and effectiveness of the KGCM model to complete knowledge graphs in ultrasound reports.
Based on Equations ( 5) and ( 6), the scoring function of the KGCM model is defined as follows: Equation ( 7) combines the considerations of the TransH model with the proximity function to capture the relationships and importance between entities in knowledge graphs.
The KGCM model provides a novel approach to complete knowledge graphs in ultrasound imaging by supplementing omitted entities and considering the importance of different entities.This approach has the potential to significantly enhance the accuracy of diagnoses and facilitate more effective communication between clinicians and researchers.

Results
For this study, we obtained a dataset of ultrasound reports from a cooperative hospital in China.The dataset included abdominal examination results from 4484 patients with cirrhosis and ascites and contained 142,834 entity relationships with 12 different relationship types.An abdominal ultrasound examination is a common hospital examination that mainly explores the ultrasound characteristics of the abdominal tissue, including five main areas of liver, gallbladder, bile duct, pancreas, and kidney.Table 2 provides a record of the ultrasound reports used in our experiments.

Observations of imaging
The condition of the organ tissues is observed and recorded by the doctor through operating the ultrasound device The liver is abnormal, and the right liver is reduced with an uneven surface, the substantial echo is thickened, uneven, and nodular, and the hepatic vascular structure is clear.The internal diameter of the main portal vein is 1.4 cm, and the blood flow is unobstructed.The intrahepatic bile duct does not expand.The size and shape of the pancreas are observed, the echo is homogeneous, the pancreatic duct is not dilated, and the posterior pancreatic vessels are clear.The splenic thickness and length are approximately 4.6 and 10.7 cm, the echo is homogeneous, and the color blood flow distribution is normal.The size and shape of the double kidneys are homogeneous, the echo is homogeneous, and the renal pelvis, calyx, and ureter do not expand.

Result of examination
The ultrasound examination conclusion provided by the doctor Liver cirrhosis, and portal hypertension are observed; the accessory umbilical vein is open; the spleen is slightly large.

Visualization of Ultrasound Reports
To evaluate the performance of the proposed method to facilitate the representation of knowledge for ultrasound reports, we conducted the experiments at different stages of the process.Specifically, we evaluated our approach to medical text structuring and knowledge representation techniques.Table 3 shows the results of the medical text structuring process, which involves segmenting long sentences based on syntactic and semantic features.The results are reported for TN, SKG, and SKG with the KGCM model.In addition, Figure 2 displays semantic models for analyzing the results of various organs from a report.The five domains presented in the figure depict the semantic relations between the components.
We used knowledge graph representation techniques to visualize the ultrasound reports, which resulted in high-quality graphs.Figure 3 illustrates a domain-based knowledge graph defined in the RDF format, where missing entities were completed using the KGCM to enhance the accuracy and completeness of the graph.

Evaluation in Knowledge Extraction
Based on a thorough investigation of the distribution of the number of extracted domains and entity relations from the ultrasound report dataset, we provide a comprehensive evaluation of the proposed ultrasound report text information extraction method.Figure 4 shows a distribution of the number of reports based on the number of extracted domains.The results using our method show that the number of extracted domains from the reports exceeded five, indicating that our proposed method automatically extracts appropriate domains based on the semantics of the reports rather than relying on a fixed and predetermined set of domains for analysis.This approach leads to a more accurate and comprehensive knowledge representation of ultrasound reports.Based on a thorough investigation of the distribution of the number of extracted domains and entity relations from the ultrasound report dataset, we provide a comprehensive evaluation of the proposed ultrasound report text information extraction method.Figure 4 shows a distribution of the number of reports based on the number of extracted domains.The results using our method show that the number of extracted domains from the reports exceeded five, indicating that our proposed method automatically extracts appropriate domains based on the semantics of the reports rather than relying on a fixed and predetermined set of domains for analysis.This approach leads to a more accurate and comprehensive knowledge representation of ultrasound reports.Table 4 shows the results of the commonly used relationship formulas extracted from all the datasets.In addition, to evaluate the performance of our entity-relation extraction method, we analyzed the distribution of the number of extracted entity relationships in the report dataset.The results are shown in Figure 5, demonstrating a high frequency of extracted entity relations that were relevant and significant in ultrasound reports.These relations can serve as templates to standardize physicians' writing habits and improve the quality of medical decision making.Table 4 shows the results of the commonly used relationship formulas extracted from all the datasets.In addition, to evaluate the performance of our entity-relation extraction method, we analyzed the distribution of the number of extracted entity relationships in the report dataset.The results are shown in Figure 5, demonstrating a high frequency of extracted entity relations that were relevant and significant in ultrasound reports.These relations can serve as templates to standardize physicians' writing habits and improve the quality of medical decision making.To assess the effectiveness of our proposed structured method to construct ultrasound reports, we conducted experiments to compare it with two commonly used methods in the scientific literature: the typical text-matching method and the pattern-matching method.The text-matching method relies on a fixed set of five organ-related domains to extract information from the report, which results in a static outcome.The pattern-matching method employs simple rules of regular expression to extract information, but it may miss important details that do not fit the predefined patterns.To evaluate the performance of our method, we used a set of widely recognized information retrieval metrics to assess our model's performance, including precision (P), recall (R), and the F1 score (F1).We To assess the effectiveness of our proposed structured method to construct ultrasound reports, we conducted experiments to compare it with two commonly used methods in the scientific literature: the typical text-matching method and the pattern-matching method.The text-matching method relies on a fixed set of five organ-related domains to extract information from the report, which results in a static outcome.The pattern-matching method employs simple rules of regular expression to extract information, but it may miss important details that do not fit the predefined patterns.To evaluate the performance of our method, we used a set of widely recognized information retrieval metrics to assess our model's performance, including precision (P), recall (R), and the F1 score (F1).We obtained accurate values for P, R, and F1 through manual annotations by medical experts.Furthermore, we defined an evaluation index (η) that considered the number of extracted domains (n), the number of valid domains (v), and the baseline (text-matching method).This metric allowed for the effective extraction of domains while measuring the performance of information changes.A larger value of η indicates more meaningful overall extracted domains and better performance in capturing relevant information from ultrasound reports.The formula for η is given as follows: As shown in Figure 6, our proposed method outperforms the text-matching method and pattern-matching method on the ultrasound report dataset, achieving the highest P, R, F1, and η scores.This indicates that our method can identify most medical entities in the unstructured text, thereby achieving the desired goal of automated extraction.The Entity (from) category was found to be recognized more accurately than the Entity (to) category due to the variation in sentence structures arising from different doctors' writing styles.While incomplete sentence structures posed a difficulty for the baseline method, our method overcame this through semantic rules and entity supplementation, identifying them as TN.These results show that our proposed method for the text structuring and information extraction of ultrasound reports can effectively extract relevant medical terms and their terminological relationships, leading to a reduction in the semantic complexity and improved analysis of ultrasound reports.

Evaluation in Knowledge Expression and Visualization
In this stage, we aim to implement the elimination of textual synonyms using the Word2Vec tool.To identify synonyms, we performed cosine similarity computation between pairs of words, such as "unsmooth" and "Rough", as presented in Table 5.It was

Discussion
We present a robust and comprehensive method for structuring text, information extraction, and knowledge representation in ultrasound reports.This method extracts entities, relationships, and values from the report and utilizes the entity importance-based KGCM model to supplement implicit information not explicitly stated in the sentences of the report.Our experimental results validate the effectiveness of this method in terms of knowledge extraction, representation, and visualization, offering a promising approach for leveraging the knowledge embedded in ultrasound reports to aid medical professionals in their decision making.The proposed method has significant potential for accelerating the development of computer-aided analysis tools, leading to more efficient and accurate medical decision making.
Our proposed method outperformed traditional methods when extracting knowledge from complex ultrasound reports with a significantly higher extraction index η of 2.69 compared to the general pattern matching method (2.12) and general text matching method (1.00), as well as achieving the highest P (0.85), R (0.89), and F1 (0.87) when evaluated on the ultrasound report.Additionally, our entity relation extraction method is highly effective from a clinical perspective.Our proposed method utilizes knowledge graph visualization to analyze ultrasound reports, and the results presented in Table 4 indicate that we extracted a significant amount of clinical knowledge from the ultrasound reports.For example, changes in size and form were found to be the most frequent entity relationships and were of particular concern to patients with hepatic ascites caused by liver cirrhosis.This demonstrates the meaningfulness of the high-frequency entity relationships we extracted in the context of the ultrasound reports.
To further improve the quality of our results, we employed a knowledge graph model called KGCM based on entity importance to supplement the implicit information omitted in the ultrasound report.The experimental results in Table 6 indicate that the supplementation of missing entities through knowledge graph embeddings yields better performance compared to the token-based recombination approach, as exemplified by EEM-CMR.Notably, the success of KGCM in outperforming other knowledge graph embedding methods, particularly in the ultrasound report dataset, underscores its potential for applications where accurate and comprehensive entity completion is paramount.Moreover, the relative improvements achieved by KGCM in F1 scores, when compared to the latest knowledge graph-based method RotatE, signify its robustness across different datasets.The notable enhancements in F1, specifically 3.7%, 2.6%, and 4.8% in the NCBI, JLNPBA, and ultrasound datasets, respectively, demonstrate the generalizability and effectiveness of KGCM in diverse biomedical contexts.
One potential limitation of our approach is the fact that the knowledge extraction rules we defined are focused on Chinese ultrasound reports and are characterized by a narrative style different from that of English reports.Therefore, to accommodate the unique semantic features of English reports, it may be necessary to make some modifications to our detailed text structuring techniques.Nevertheless, the text structuring methods, semantic models, text synonym elimination, and KGCM proposed for analyzing ultrasound reports can be applied to any language.Another limitation is that our method can only analyze textual data.As ultrasound report data sources include both text and images, the accuracy of our method with respect to data from other sources cannot be guaranteed.Further multimodal analysis incorporating image data has the potential to optimize the model and enhance its robustness and generalization capabilities.

Conclusions
Our study proposes a knowledge graph-based approach for constructing, extracting, and representing knowledge within ultrasound reports.We propose a network extraction method for the key terms of ultrasound reports to extract entities and relationships from ultrasound reports and obtain reliable clinical knowledge.Moreover, we developed a knowledge graph for ultrasound reports, addressing the issue of missing entities caused by irregular writing habits using the KGCM method.In comparison to other baseline methods, our approach achieves the highest P (0.85), R (0.89), and F1 (0.87) across three testing datasets, providing evidence for the effectiveness and practicality of the knowledge graph.
Future research directions for this study involve the integration of additional multimodal data, such as images, to enhance the reliability and completeness of the knowledge graph.Furthermore, there is a need for the thorough validation of the extracted knowledge graph in real clinical environments to ensure its accuracy and applicability in practical medical settings.If the proposed method proves capable of generating reliable knowledge graphs, there is the potential to establish new clinical decision support applications in diagnosis and treatment.

Figure 1 .
Figure 1.Overview of the analysis process in this study.

Figure 1 .
Figure 1.Overview of the analysis process in this study.

Algorithm 1 .
Structuring narrative text from ultrasound reports Input: Narrative text of a report (text) Output: Term network of the report (TN) 1:

Figure 2 .
Figure 2. Group of semantic models to represent multiple domain-based narrative texts clustered in a report using our method.

Figure 2 .
Figure 2. Group of semantic models to represent multiple domain-based narrative texts clustered in a report using our method.

Figure 2 .
Figure 2. Group of semantic models to represent multiple domain-based narrative texts clustered in a report using our method.

Figure 3 .
Figure 3.An example of knowledge graph representation for a report.

Figure 3 .
Figure 3.An example of knowledge graph representation for a report.

Figure 4 .
Figure 4. Distribution of the number of reports based on the number of extracted domains.

Figure 4 .
Figure 4. Distribution of the number of reports based on the number of extracted domains.

Figure 5 .
Figure 5. Number of extracted entity relations using our method.

Figure 5 .
Figure 5. Number of extracted entity relations using our method.

Figure 6 .
Figure 6.Comparison of model performance with different methods.

Table 1 .
Rules of entity relation extraction.

Table 2 .
Example of main items in ultrasound reports.

Table 3 .
The results of knowledge extraction from ultrasound reports at different stages.

Table 4 .
Entity relations extracted from the reports using our method.