Large Language Models for Knowledge Graph Embedding: A Survey
Abstract
1. Introduction
- We propose a unified methodology for KGE-LLM synergy, systematically analyzing their complementary enhancement mechanisms across diverse application scenarios (e.g., dynamic knowledge graphs and multimodal reasoning), thereby overcoming the single-perspective limitations of existing research.
- We introduce an innovative classification framework for LLM-KGE integration technologies across broader research domains and application scenarios, addressing the gaps in systematicity and scalability in current studies.
2. Preliminaries
2.1. Large Language Models (LLMs)
2.2. KGE-Related Tasks
Relationship Between KG and KGE
2.3. Classic Knowledge Graph (CKG)
2.3.1. Link Prediction
2.3.2. Entity Alignment
2.3.3. KG Canonicalization
3. Techniques of KGE with LLMs
3.1. Classic Knowledge Graph (CKG)
3.1.1. Methods That Use LLMs’ Prompts
3.1.2. Methods for Fine-Tuning LLMs
3.1.3. Methods of Pre-Trained LLMs
3.1.4. Methods to Use LLMs as Agents
3.2. Temporal Knowledge Graph (TKG)
3.3. Multi-Modal Knowledge Graph (MMKG)
3.4. Inductive Knowledge Graph (IKG)
3.5. Open Knowledge Graph (OKG)
3.5.1. Methods That Use LLMs’ Prompts
3.5.2. Methods for Fine-Tuning LLMs
3.6. Domain Knowledge Graph (DKG)
3.6.1. Methods That Use LLMs’ Prompts
3.6.2. Methods for Fine-Tuning LLMs
3.6.3. Methods for Pre-Training LLMs
3.7. Summary of Method Evaluation
3.7.1. Effectiveness
3.7.2. Scalability
3.7.3. Practical Impact
3.8. Analysis of LLMs in Tasks
3.8.1. Specific Improvements
3.8.2. Defects
4. Prospective Directions
4.1. Cross-Domain CKG Fusion
4.2. Few-Shot Learning for TKG
4.3. Application of MMKG Tasks for More Specific Domains
4.4. Generic Framework Design for IKG
4.5. OKG Canonicalization
4.6. Unified Representation Learning Framework for Multiple Types and Domains
4.7. Standard Framework and Criteria for Evaluating the Fusion Effect of LLM and KGE
4.8. Summary of the Advantages and Disadvantages of Existing Research Work
5. Datasets and Code Resources
5.1. Dataset
5.1.1. ICEWS 05-15
5.1.2. GDELT
5.1.3. YAGO
5.1.4. UMLS
5.1.5. MetaQA
5.1.6. ReVerb45K
5.1.7. FB15k-237
5.1.8. WN18RR
5.2. Code Resources
6. Conclusions
Funding
Data Availability Statement
Conflicts of Interest
References
- Vidal, M.; Chudasama, Y.; Huang, H.; Purohit, D.; Torrente, M. Poisoning scientific knowledge using large language models. J. Web Semant. 2025, 84. [Google Scholar]
- Gao, F.; He, Q.; Wang, X.; Qiu, L.; Huang, M. An Efficient Rumor Suppression Approach with Knowledge Graph Convolutional Network in Social Network. IEEE Trans. Comput. Soc. Syst. 2024, 11, 6254–6267. [Google Scholar] [CrossRef]
- Yang, Y.; Peng, C.; Cao, E.-Z.; Zou, W. Building Resilience in Supply Chains: A Knowledge Graph-Based Risk Management Framework. IEEE Trans. Comput. Soc. Syst. 2024, 11, 3873–3881. [Google Scholar] [CrossRef]
- Jiang, C.; Tang, M.; Jin, S.; Huang, W.; Liu, X. KGNMDA: A Knowledge Graph Neural Network Method for Predicting Microbe-Disease Associations. IEEE/ACM Trans. Comput. Biol. Bioinform. 2023, 20, 1147–1155. [Google Scholar] [CrossRef] [PubMed]
- Zhao, L.; Qi, X.; Chen, Y.; Qiao, Y.; Bu, D.; Wu, Y.; Luo, Y.; Wang, S.; Zhang, R.; Zhao, Y. Biological Knowledge Graph-Guided Investigation of Immune Therapy Response in Cancer with Graph Neural Network. Brief. Bioinform. 2023, 24, bbad023. [Google Scholar] [CrossRef]
- Huang, C.; Chen, D.; Fan, T.; Wu, B.; Yan, X. Incorporating Environmental Knowledge Embedding and Spatial-Temporal Graph Attention Networks for Inland Vessel Traffic Flow Prediction. Eng. Appl. Artif. Intell. 2024, 133, 108301. [Google Scholar] [CrossRef]
- Mlodzian, L.; Sun, Z.; Berkemeyer, H.; Monka, S.; Wang, Z.; Dietze, S.; Halilaj, L.; Luettin, J. nuScenes Knowledge Graph—A Comprehensive Semantic Representation of Traffic Scenes for Trajectory Prediction. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Paris, France, 2–6 October 2023; pp. 42–52. [Google Scholar] [CrossRef]
- Agrawal, G.; Pal, K.; Deng, Y.; Liu, H.; Chen, Y.-C. CyberQ: Generating Questions and Answers for Cybersecurity Education Using Knowledge Graph-Augmented LLMs. In Proceedings of the AAAI Conference on Artificial Intelligence 2024, Vancouver, BC, Canada, 20–27 February 2024; Volume 38, pp. 23164–23172. [Google Scholar] [CrossRef]
- Panda, P.; Agarwal, A.; Devaguptapu, C.; Kaul, M.; Ap, P. HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering Using LLMs. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Bangkok, Thailand, 11–16 August 2024; pp. 13263–13282. [Google Scholar]
- Sun, L.; Tao, Z.; Li, Y.; Arakawa, H. ODA: Observation-Driven Agent for Integrating LLMs and Knowledge Graphs. In Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, 11–16 August 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 7417–7431. [Google Scholar]
- Venkatakrishnan, R.; Tanyildizi, E.; Canbaz, M.A. Semantic interlinking of Immigration Data using LLMs for Knowledge Graph Construction. In Proceedings of the ACM Web Conference 2024 (WWW ’24), Singapore, 13–17 May 2024; Association for Computing Machinery: New York, NY, USA, 2024; pp. 605–608. [Google Scholar] [CrossRef]
- Sun, L.; Zhang, P.; Gao, F.; An, Y.; Li, Z.; Zhao, Y. SF-GPT: A training-free method to enhance capabilities for knowledge graph construction in LLMs. Neurocomputing 2025, 613, 128726. [Google Scholar] [CrossRef]
- Pan, S.; Luo, L.; Wang, Y.; Chen, C.; Wang, J.; Wu, X. Unifying Large Language Models and Knowledge Graphs: A Roadmap. IEEE Trans. Knowl. Data Eng. 2024, 36, 3580–3599. [Google Scholar] [CrossRef]
- Yang, L.; Chen, H.; Li, Z.; Ding, X.; Wu, X. Give us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-Aware Language Modeling. IEEE Trans. Knowl. Data Eng. 2024, 36, 3091–3110. [Google Scholar] [CrossRef]
- Liu, L.; Xu, P.; Fan, K.; Wang, M. Research on application of knowledge graph in industrial control system security situation awareness and decision-making: A survey. Neurocomputing 2025, 613, 128721. [Google Scholar] [CrossRef]
- Zhu, B.; Wang, R.; Wang, J.; Zhao, F.; Wang, K. A Survey: Knowledge Graph Entity Alignment Research Based on Graph Embedding. Artif. Intell. Rev. 2024, 57, 229. [Google Scholar] [CrossRef]
- Huang, X.; Ruan, W.; Huang, W.; Jin, G.; Dong, Y.; Wu, C.; Bensalem, S.; Mu, R.; Qi, Y.; Zhao, X.; et al. A survey of safety and trustworthiness of large language models through the lens of verification and validation. Artif. Intell. Rev. 2024, 57, 175. [Google Scholar] [CrossRef]
- Kumar, P. Large language models (LLMs): Survey, technical frameworks, and future challenges. Artif. Intell. Rev. 2024, 57, 260. [Google Scholar] [CrossRef]
- Zhao, S.; Sun, X. Enabling controllable table-to-text generation via prompting large language models with guided planning. Knowl.-Based Syst. 2024, 304, 112571. [Google Scholar] [CrossRef]
- Cui, M.; Du, J.; Zhu, S.; Xiong, D. Efficiently Exploring Large Language Models for Document-Level Machine Translation with In-context Learning. In Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, 11–16 August 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 10885–10897. [Google Scholar]
- Liu, C.; Wang, C.; Peng, Y.; Li, Z. ZVQAF: Zero-shot visual question answering with feedback from large language models. Neurocomputing 2024, 580, 127505. [Google Scholar] [CrossRef]
- Hellwig, N.C.; Fehle, J.; Wolff, C. Exploring large language models for the generation of synthetic training samples for aspect-based sentiment analysis in low resource settings. Expert Syst. Appl. 2025, 261, 125514. [Google Scholar] [CrossRef]
- Song, H.; Su, H.; Shalyminov, I.; Cai, J.; Mansour, S. FineSurE: Fine-grained Summarization Evaluation using LLMs. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Bangkok, Thailand, 11–16 August 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 906–922. [Google Scholar]
- Zhou, Z.; Elejalde, E. Unveiling the silent majority: Stance detection and characterization of passive users on social media using collaborative filtering and graph convolutional networks. EPJ Data Sci. 2024, 13, 28. [Google Scholar] [CrossRef]
- Cui, W.; Shang, M. KAGN: Knowledge-powered attention and graph convolutional networks for social media rumor detection. J. Big Data 2023, 10, 45. [Google Scholar] [CrossRef]
- Huang, Q.; Chen, J. Enhancing academic performance prediction with temporal graph networks for massive open online courses. J. Big Data 2024, 11, 52. [Google Scholar] [CrossRef]
- Teji, B.; Roy, S.; Dhami, D.S.; Bhandari, D.; Guzzi, P.H. Graph Embedding Techniques for Predicting Missing Links in Biological Networks: An Empirical Evaluation. IEEE Trans. Emerg. Top. Comput. 2024, 12, 190–201. [Google Scholar] [CrossRef]
- Ying, H.; Zhao, Z.; Zhao, Y.; Zeng, S.; Yu, S. CoRTEx: Contrastive learning for representing terms via explanations with applications on constructing biomedical knowledge graphs. J. Am. Med. Inform. Assoc. 2024, 31, 1912–1920. [Google Scholar] [CrossRef] [PubMed]
- Yang, P.; Wang, H.; Huang, Y.; Yang, S.; Zhang, Y.; Huang, L.; Zhang, Y.; Wang, G.; Yang, S.; He, L.; et al. LMKG: A large-scale and multi-source medical knowledge graph for intelligent medicine applications. Knowl.-Based Syst. 2024, 284, 111323. [Google Scholar] [CrossRef]
- Liu, B.; Hou, S.; Zhong, W.; Zhao, X.; Liu, Y.; Yang, Y. Enhancing Temporal Knowledge Graph Alignment in News Domain with Box Embedding. IEEE Trans. Comput. Soc. Syst. 2024, 11, 4909–4919. [Google Scholar] [CrossRef]
- Suchanek, F.M.; Kasneci, G.; Weikum, G. Yago: A core of semantic knowledge. In Proceedings of the 16th International Conference on World Wide Web, Banff, AB, Canada, 8–12 May 2007; pp. 697–706. [Google Scholar]
- Singhal, A. Introducing the Knowledge Graph: Things, Not Strings. Official Google Blog. 16 May 2012. Available online: https://blog.google/products/search/introducing-knowledge-graph-things-not/ (accessed on 2 January 2015).
- Young, M. The Technical Writer’s Handbook; University Science: Mill Valley, CA, USA, 1989. [Google Scholar]
- Jiang, C.; Jiang, Y.; Wu, W.; Zheng, Y.; Xie, P.; Tu, K. COMBO: A Complete Benchmark for Open KG Canonicalization. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, Dubrovnik, Croatia, 2–6 May 2023; Association for Computational Linguistics: Kerrville, TX, USA, 2023; pp. 340–357. [Google Scholar]
- Cao, Z.; Luo, C. Link prediction for knowledge graphs based on extended relational graph attention networks. Expert Syst. Appl. 2025, 259, 125260. [Google Scholar] [CrossRef]
- Wei, Y.; Huang, Q.; Zhang, Y.; Kwok, J. KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, 6–10 December 2023; Association for Computational Linguistics: Kerrville, TX, USA, 2023; pp. 8667–8683. [Google Scholar]
- Zhang, J.; Cai, S.; Jiang, Z.; Xiao, J.; Ming, Z. FireRobBrain: Planning for a Firefighting Robot Using Knowledge Graph and Large Language Model. In Proceedings of the 2024 10th IEEE International Conference on Intelligent Data and Security (IDS), New York, NY, USA, 6–9 May 2024; pp. 37–41. [Google Scholar] [CrossRef]
- Mou, X.; Li, Z.; Lyu, H.; Luo, J.; Wei, Z. Unifying Local and Global Knowledge: Empowering Large Language Models as Political Experts with Knowledge Graphs. In Proceedings of the ACM Web Conference 2024 (WWW ’24), Singapore, 13–17 May 2024; Association for Computing Machinery: New York, NY, USA, 2024; pp. 2603–2614. [Google Scholar] [CrossRef]
- Wang, Y.; Hu, M.; Huang, Z.; Li, D.; Yang, D.; Lu, X. KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torin, Italy, 20–25 May 2024; pp. 9668–9680. [Google Scholar]
- Yang, R.; Zhu, J.; Man, J.; Fang, L.; Zhou, Y. Enhancing text-based knowledge graph completion with zero-shot large language models: A focus on semantic enhancement. Knowl.-Based Syst. 2024, 300, 112155. [Google Scholar] [CrossRef]
- Hou, W.; Zhao, W.; Jia, N.; Liu, X. Low-resource knowledge graph completion based on knowledge distillation driven by large language models. Appl. Soft Comput. 2025, 169, 112622. [Google Scholar] [CrossRef]
- Liu, Y.; Tian, X.; Sun, Z.; Hu, W. Finetuning Generative Large Language Models with Discrimination Instructions for Knowledge Graph Completion. In Proceedings of the Semantic Web–ISWC 2024, Hanover, MD, USA, 11 November 2024; Demartini, G., Hose, K., Acosta, M., Palmonari, M., Cheng, G., Skaf-Molli, H., Ferranti, N., Hernández, D., Hogan, A., Eds.; Lecture Notes in Computer Science. Springer: Cham, Switzerland, 2025; Volume 15231. [Google Scholar]
- Wang, L.; Hu, H.; Sha, L.; Xu, C.; Jiang, D.; Wong, K.-F. RecInDial: A Unified Framework for Conversational Recommendation with Pretrained Language Models. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, Online, 21–23 November 2022; Association for Computational Linguistics: Kerrville, TX, USA, 2022; Volume 1, pp. 489–500. [Google Scholar]
- Shu, D.; Chen, T.; Jin, M.; Zhang, C.; Du, M.; Zhang, Y. Knowledge Graph Large Language Model (KG-LLM) for Link Prediction. arXiv 2024, arXiv:2403.07311. [Google Scholar]
- Alqaaidi, S.K.; Kochut, K. Relations Prediction for Knowledge Graph Completion Using Large Language Models. arXiv 2024, arXiv:2405.02738. [Google Scholar]
- Tian, S.; Luo, Y.; Xu, T.; Yuan, C.; Jiang, H.; Wei, C.; Wang, X. KG-Adapter: Enabling Knowledge Graph Integration in Large Language Models through Parameter-Efficient Fine-Tuning. In Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, 11–16 August 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 3813–3828. [Google Scholar]
- Zhang, Z.; Gu, J.; Zhou, J.; Pan, J.Z.; Zhang, W.; Chen, H. Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, FL, USA, 7–11 December 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 7813–7835. [Google Scholar]
- Bianchini, F.; Calamo, M.; De Luzi, F.; Macrì, M.; Mecella, M. Enhancing Complex Linguistic Tasks Resolution Through Fine-Tuning LLMs, RAG and Knowledge Graphs (Short Paper). In Proceedings of the Advanced Information Systems Engineering Workshops, CAiSE 2024, Limassol, Cyprus, 3–4 June 2024; Almeida, J.P.A., Di Ciccio, C., Kalloniatis, C., Eds.; Lecture Notes in Business Information Processing. Springer: Cham, Switzerland, 2024; Volume 521, pp. 176–181. [Google Scholar] [CrossRef]
- Hu, S.; Zou, G.; Yang, S.; Gan, Y.; Zhang, B.; Chen, Y. Large Language Model Meets Graph Neural Network in Knowledge Distillation. arXiv 2024, arXiv:2402.05894. [Google Scholar] [CrossRef]
- Zhang, M.; Ye, X.; Liu, Q.; Ren, P.; Wu, S.; Chen, Z. Knowledge Graph Enhanced Large Language Model Editing. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, Florida, USA, 7–11 December 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 22647–22662. [Google Scholar]
- Salnikov, M.; Le, H.; Rajput, P.; Nikishina, I.; Braslavski, P.; Malykh, V.; Panchenko, A. Large Language Models Meet Knowledge Graphs to Answer Factoid Questions. In Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation, Hong Kong, China, 12–14 December 2023; Association for Computational Linguistics: Kerrville, TX, USA, 2023; pp. 635–644. [Google Scholar]
- Geng, Y.; Chen, J.; Zeng, Y.; Chen, Z.; Zhang, W.; Pan, J.Z.; Wang, Y.; Xu, X. Prompting disentangled embeddings for knowledge graph completion with pre-trained language model. Expert Syst. Appl. 2025, 268, 126175. [Google Scholar] [CrossRef]
- Sun, J.; Xu, C.; Tang, L.; Wang, S.; Lin, C.; Gong, Y.; Ni, L.; Shum, H.-Y.; Guo, J. Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph. In Proceedings of the Twelfth International Conference on Learning Representations, Vienna, Austria, 7–11 May 2024. [Google Scholar]
- Gao, Y.; Qiao, L.; Kan, Z.; Wen, Z.; He, Y.; Li, D. Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models. In Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, 11–16 August 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 6719–6734. [Google Scholar]
- Ding, Z.; Cai, H.; Wu, J.; Ma, Y.; Liao, R.; Xiong, B.; Tresp, V. zrLLM: Zero-Shot Relational Learning on Temporal Knowledge Graphs with Large Language Models. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Mexico City, Mexico, 16–21 June 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 1877–1895. [Google Scholar]
- Xia, Y.; Wang, D.; Liu, Q.; Wang, L.; Wu, S.; Zhang, X. Chain-of-History Reasoning for Temporal Knowledge Graph Forecasting. arXiv 2024, arXiv:2402.14382. [Google Scholar]
- Liao, R.; Jia, X.; Li, Y.; Ma, Y.; Tresp, V. GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models. In Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, Mexico City, Mexico, 16–21 June 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 4303–4317. [Google Scholar]
- Li, Q.; Wu, G. Explainable Reasoning over Temporal Knowledge Graphs by Pre-trained Language Model. Inf. Process. Manag. 2025, 62, 103903. [Google Scholar] [CrossRef]
- Deng, Y.; Song, J.; Yang, Z.; Long, Y.; Zeng, L.; Zhou, L. Diachronic Semantic Encoding Based on Pre-trained Language Model for Temporal Knowledge Graph Reasoning. Knowl.-Based Syst. 2025, 318, 113479. [Google Scholar] [CrossRef]
- Xu, D.; Zhang, Z.; Lin, Z.; Wu, X.; Zhu, Z.; Xu, T.; Zhao, X.; Zheng, Y.; Chen, E. Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italy, 22–24 May 2024; pp. 11956–11968. [Google Scholar]
- Yu, Z.; Wang, S.; Zhu, Y.; Zou, Z. A Multimodal Ecological Civilization Pattern Recommendation Method Based on Large Language Models and Knowledge Graph. arXiv 2023, arXiv:2310.18951. [Google Scholar]
- Li, Y.; Huang, C.; Deng, S.; Lock, M.L.; Cao, T.; Oo, N.; Lim, H.W.; Hooi, B. KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection. In Proceedings of the 33rd USENIX Security Symposium (USENIX Security 24), Philadelphia, PA, USA, 14–16 August 2024; USENIX Association: Berkeley, CA, USA, 2024; pp. 793–810. [Google Scholar]
- Sun, Q.; Luo, Y.; Zhang, W.; Li, S.; Li, J.; Niu, K.; Kong, X.; Liu, W. Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models. arXiv 2024, arXiv:2406.02962. [Google Scholar]
- Xing, X.; Jia, B.; Huang, Z.; Chen, Y.; Wang, J.; Fan, A.; Chen, X.; Cao, L. A Fusion Inference Method for Large Language Models and Knowledge Graphs Based on Structured Injection and Causal Inference. In Proceedings of the 2023 5th International Conference on Internet of Things, Automation and Artificial Intelligence (IoTAAI 2023), Nanchang, China, 24–26 November 2023; ACM: New York, NY, USA, 2023; pp. 208–213. [Google Scholar] [CrossRef]
- Hamann, F.; Falk, M.; Walker, L. Expanding Knowledge Graphs through Text: Leveraging Large Language Models for Inductive Link Prediction. In Proceedings of the INFORMATIK 2024, Wiesbaden, Germany, 24–26 September 2024; Gesellschaft für Informatik e.V.: Bonn, Germany, 2024; pp. 1407–1417. [Google Scholar]
- Zhao, Q.; Qian, H.; Liu, Z.; Zhang, G.-D.; Gu, L. Breaking the Barrier: Utilizing Large Language Models for Industrial Recommendation Systems through an Inferential Knowledge Graph. In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024), Boise, ID, USA, 21–25 October 2024; ACM: New York, NY, USA, 2024; pp. 5086–5093. [Google Scholar] [CrossRef]
- Wang, K.; Xu, Y.; Wu, Z.; Luo, S. LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge Graphs. In Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, 11–16 August 2024; ACL: Kerrville, TX, USA, 2024; pp. 3742–3759. [Google Scholar]
- Liu, J.; Zhan, J. Constructing Knowledge Graph from Cyber Threat Intelligence Using Large Language Model. In Proceedings of the 2023 IEEE International Conference on Big Data (BigData 2023), Sorrento, Italy, 15–18 December 2023; pp. 516–521. [Google Scholar] [CrossRef]
- Park, C.; Lee, H.; Jeong, O.-R. Leveraging Medical Knowledge Graphs and Large Language Models for Enhanced Mental Disorder Information Extraction. Future Internet 2024, 16, 260. [Google Scholar] [CrossRef]
- Ning, Y.; Liu, H. UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction. arXiv 2024, arXiv:2402.06861. [Google Scholar]
- Arsenyan, V.; Bughdaryan, S.; Shaya, F.; Small, K.W.; Shahnazaryan, D. Large Language Models for Biomedical Knowledge Graph Construction: Information Extraction from EMR Notes. In Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, Bangkok, Thailand, 16 August 2024; ACL: Kerrville, TX, USA, 2024; pp. 295–317. [Google Scholar]
- Xu, T.; Gu, Y.; Xue, M.; Gu, R.; Li, B.; Gu, X. Knowledge Graph Construction for Heart Failure Using Large Language Models with Prompt Engineering. Front. Comput. Neurosci. 2024, 18, 1389475. [Google Scholar] [CrossRef]
- AlMahri, S.; Xu, L.; Brintrup, A. Enhancing Supply Chain Visibility with Knowledge Graphs and Large Language Models. arXiv 2024, arXiv:2408.07705. [Google Scholar]
- Datta, P.; Vitiugin, F.; Chizhikova, A.; Sawhney, N. Construction of Hyper-Relational Knowledge Graphs Using Pre-Trained Large Language Models. arXiv 2024, arXiv:2403.11786. [Google Scholar]
- Chen, H.; Méndez, S.J.R.; Omran, P.G. Open Local Knowledge Graph Construction from Academic Papers Using Generative Large Language Models. In Proceedings of the Companion Proceedings of the ACM on Web Conference 2025 (WWW ’25), Sydney, Australia, 28 April–2 May 2025; ACM: New York, NY, USA, 2025; pp. 2551–2559. [Google Scholar] [CrossRef]
- Hu, Y.; Zou, F.; Han, J.; Sun, X.; Wang, Y. LLM-TIKG: Threat Intelligence Knowledge Graph Construction Utilizing Large Language Model. Comput. Secur. 2024, 145, 103999. [Google Scholar] [CrossRef]
- Yang, R.; Liu, H.; Marrese-Taylor, E.; Zeng, Q.; Ke, Y.; Li, W.; Cheng, L.; Chen, Q.; Caverlee, J.; Matsuo, Y.; et al. KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques. In Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, Bangkok, Thailand, 16 August 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 155–166. [Google Scholar]
- Karim, R.; Molinas Comet, L.; Shajalal, M.; De Perthuis, P.; Rebholz-Schuhmann, D.; Decker, S. From Large Language Models to Knowledge Graphs for Biomarker Discovery in Cancer. In Proceedings of the 57th Hawaii International Conference on System Sciences, Honolulu, HI, USA, 3–6 January 2024; HICSS: Waikoloa, HI, USA, 2024; pp. 5576–5586. [Google Scholar]
- Du, J.; Yin, H. KLDP: A data profiling technique based on knowledge graph and large language modeling. In Proceedings of the 2023 IEEE 22nd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), Exeter, UK, 10–12 July 2023; pp. 2333–2340. [Google Scholar] [CrossRef]
- Wang, C.; Hua, M.; Song, J.; Tang, X.-S. Knowledge graphs enhanced large language model prompt for electric power question answering. In Proceedings of the 2023 7th International Conference on Electronic Information Technology and Computer Engineering (EITCE ’23), Zhuhai, China, 27–29 October 2023; Association for Computing Machinery: New York, NY, USA, 2023; pp. 24–29. [Google Scholar] [CrossRef]
- Yang, J.; Xu, H.; Mirzoyan, S.; Chen, T.; Liu, Z.; Ju, W.; Liu, L.; Zhang, M.; Wang, S. Integrating knowledge graphs with symbolic AI: The path to interpretable hybrid AI systems in medicine. Web Semant. 2025, 84, 100856. [Google Scholar] [CrossRef]
- Pei, A.; Yang, Z.; Zhu, S.; Cheng, R.; Jia, J.; Wang, L. KGPA: Robustness evaluation for large language models via cross-domain knowledge graphs. arXiv 2024, arXiv:2406.10802. [Google Scholar]
- Kumar, N.M.; Lisa, F.T.; Islam, S.R. Prompt chaining-assisted malware detection: A hybrid approach utilizing fine-tuned LLMs and domain knowledge-enriched cybersecurity knowledge graphs. In Proceedings of the 2024 IEEE International Conference on Big Data (BigData), Washington, DC, USA, 4–7 December 2024; pp. 1672–1677. [Google Scholar] [CrossRef]
- Dernbach, S.; Agarwal, K.; Zuniga, A.; Henry, M.; Choudhury, S. GLaM: Fine-tuning large language models for domain knowledge graph alignment via neighborhood partitioning and generative subgraph encoding. In Proceedings of the AAAI Symposium Series 2024, Stanford, CA, USA, 25–27 March 2024; Volume 3, pp. 82–89. [Google Scholar] [CrossRef]
- Li, Y.; He, Y.; Lian, R.; Guo, Q. Fault diagnosis and system maintenance based on large language models and knowledge graphs. In Proceedings of the 2023 5th International Conference on Robotics, Intelligent Control and Artificial Intelligence (RICAI), Hangzhou, China, 20–23 November 2023; pp. 589–592. [Google Scholar] [CrossRef]
- Liu, P.; Qian, L.; Zhao, X.; Tao, B. Joint knowledge graph and large language model for fault diagnosis and its application in aviation assembly. IEEE Trans. Ind. Informat. 2024, 20, 8160–8169. [Google Scholar] [CrossRef]
- Wawrzik, F.; Plaue, M.; Vekariya, S.; Grimm, C. Customized Information and Domain-centric Knowledge Graph Construction with Large Language Models. arXiv 2024, arXiv:2409.20010. [Google Scholar]
- Qi, Z.; Yu, Y.; Tu, M.; Tan, J.; Huang, Y. FoodGPT: A Large Language Model in the Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt. arXiv 2023, arXiv:2308.10173. [Google Scholar]
- Xiao, Y.; Zhang, S.; Zhou, H.; Li, M.; Yang, H.; Zhang, R. FuseLinker: Leveraging LLM’s pre-trained text embeddings and domain knowledge to enhance GNN-based link prediction on biomedical knowledge graphs. J. Biomed. Inform. 2024, 158, 104730. [Google Scholar] [CrossRef] [PubMed]
- Bodenreider, O.; Burgun, A. Aligning Knowledge Sources in the UMLS: Methods, Quantitative Results, and Applications. In MEDINFO 2004, Studies in Health Technology and Informatics; IOS Press: Amsterdam, The Netherlands, 2004; Volume 107, pp. 327–331. [Google Scholar] [CrossRef]
- Sun, Z.; Deng, Z.-H.; Nie, J.-Y.; Tang, J. RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space. In Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
- Toutanova, K.; Chen, D.; Pantel, P.; Poon, H.; Choudhury, P.; Gamon, M. Representing Text for Joint Embedding of Text and Knowledge Bases. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, 17–21 September 2015; Association for Computational Linguistics: Kerrville, TX, USA, 2015; pp. 1499–1509. [Google Scholar]
- Hua, M.; Zhang, Y.; Zhang, Q.; Tang, H.; Guo, L.; Lin, Y. KG-IBL: Knowledge Graph Driven Incremental Broad Learning for Few-Shot Specific Emitter Identification. IEEE Trans. Inf. Forensics Secur. 2024, 19, 10016–10028. [Google Scholar] [CrossRef]
- Xu, C.; Nayyeri, M.; Alkhoury, F.; Yazdi, H.; Lehmann, J. Temporal Knowledge Graph Completion Based on Time Series Gaussian Embedding. In Proceedings of the Semantic Web–ISWC 2020, Online, 2–6 November 2020; Lecture Notes in Computer Science. Springer: Cham, Switzerland, 2020; Volume 12506. [Google Scholar] [CrossRef]
- Consoli, S.; Pezzoli, L.T.; Tosetti, E. Using the GDELT Dataset to Analyse the Italian Sovereign Bond Market. In Proceedings of the Machine Learning, Optimization, and Data Science, LOD 2020, Siena, Italy, 19–23 July 2020; Lecture Notes in Computer Science. Springer: Cham, Switzerland, 2020; Volume 12565. [Google Scholar] [CrossRef]
- Zhang, Z.; Cai, J.; Zhang, Y.; Wang, J. Learning Hierarchy-Aware Knowledge Graph Embeddings for Link Prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 3065–3072. [Google Scholar] [CrossRef]
- Jiang, C.Y.; Jiang, Y.; Wu, W.Q.; Zheng, Y.T.; Xie, P.J.; Tu, K.W. COMBO: A Complete Benchmark for Open KG Canonicalization. arXiv 2023, arXiv:2302.03905. [Google Scholar]
- Lehmann, J.; Isele, R.; Jakob, M.; Jentzsch, A.; Kontokostas, D.; Mendes, P.N.; Hellmann, S.; Morsey, M.; van Kleef, P.; Auer, S.; et al. DBpedia–A large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web J. 2015, 6, 167–195. [Google Scholar] [CrossRef]
- Liu, R.; Wu, L.; Zhang, P. KG-TREAT: Pre-training for Treatment Effect Estimation by Synergizing Patient Data with Knowledge Graphs. In Proceedings of the AAAI Conference on Artificial Intelligence 2024, Vancouver, BC, Canada, 20–27 February 2024; Volume 38, pp. 8805–8814. [Google Scholar] [CrossRef]
- Jiang, Z.; Zhong, L.; Sun, M.; Xu, J.; Sun, R.; Cai, H.; Luo, S.; Zhang, Z. Efficient Knowledge Infusion via KG-LLM Alignment. In Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, Bangkok, Thailand, 11–16 August 2024; Association for Computational Linguistics: Kerrville, TX, USA, 2024; pp. 2986–2999. [Google Scholar]
- Wang, Y.; Xie, Q.; Tang, M.; Li, L.; Yuan, J.; Liu, Y. Amazon-KG: A Knowledge Graph Enhanced Cross-Domain Recommendation Dataset. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’24), Washington, DC, USA, 14–18 July 2024; Association for Computing Machinery: New York, NY, USA, 2024; pp. 123–130. [Google Scholar] [CrossRef]
- Gong, L.; Zhang, H. MR2-KG: A Multi-Relation Multi-Rationale Knowledge Graph for Modeling Software Engineering Knowledge on Stack Overflow. IEEE Trans. Softw. Eng. 2024, 50, 1867–1887. [Google Scholar] [CrossRef]
Symbol | Specific Meaning |
---|---|
h | Head entity |
r | Relationship |
t | Tail entity |
Positive sample set | |
Negative sample set | |
Interval parameters (margin) | |
Score of triplet | |
True label | |
Predicted tags | |
Test set | |
Ranking of triplet | |
TPR | True Positive Rate |
FPR | False Positive Rate |
Query collection | |
Average Precision of query q | |
∘ | Element-wise multiplication (Hadamard product) |
Take the real part of a complex number | |
Cosine similarity | |
Distance between vectors and | |
accuracy | Accuracy |
MRR | Mean Reciprocal Rank |
Hits@k | Hit rate of the first k hits |
Embedding vectors of head and tail entities | |
Embedding vectors of relationship and inverse relationship | |
Conv2D | 2D convolution operations |
ReLU | Rectified Linear Unit activation function |
Quaternion representation of head entity, relationship, and tail entity | |
⊗ | Quaternion multiplication |
J | Objective Function (loss function) |
i th cluster | |
Center of the i th cluster | |
Single linkage distance | |
Complete linkage distance | |
Average linkage distance | |
J | Objective function, representing the sum of squared distances from all data points to their respective cluster centers. |
n | Total number of data points in the dataset. |
K | Number of clusters. |
i th data point (). | |
Center of the k th cluster (centroid). | |
Indicator variable indicating whether data point belongs to cluster k. If belongs to cluster k, ; otherwise, . | |
P | Proportion of samples predicted as positive by the model that are actually positive. |
R | Proportion of samples that are actually positive classes that are correctly predicted by the model as positive classes. |
Knowledge Graph Type | Main Characteristics | Application Scenarios | Advantages | Disadvantages |
---|---|---|---|---|
Classic Knowledge Graph (CKG) | Structured semantic knowledge base based on graphs, storing data in triplet form | Social networks, bioinformatics networks, traffic flow prediction, etc. | Rich semantic information, capable of intuitively modeling heterogeneous relationships and supporting interpretable reasoning | Traditional methods face limitations in handling complex relationships, long-tail entities, and dynamic knowledge updates |
Temporal Knowledge Graph (TKG) | Dynamic knowledge graph containing facts that change over time | Drug efficacy analysis, event prediction, etc. | Capable of processing temporal information and supporting time-series tasks | Relies heavily on the accuracy and clarity of prompts; prompt design and selection are highly demanding |
Multi-Modal Knowledge Graph (MMKG) | Combines multiple data modalities (e.g., text, images, and videos) to represent facts | Medical knowledge graphs (combining text and medical images) | Enhances the diversity and accuracy of knowledge representation by integrating multi-modal data | May produce inaccurate results due to limitations of LLMs (e.g., hallucinations and insufficient attention to modal information) |
Inductive Knowledge Graph (IKG) | Contains a source KG and a target KG, aiming to transfer structural patterns from the source to the target | Recommendation systems, cross-domain knowledge transfer | Capable of handling new entities and relations, supporting inductive reasoning | May generate unreliable reasoning results due to LLMs’ hallucinations |
Open Knowledge Graph (OKG) | Derived from open datasets, user contributions, or internet information, emphasizing openness and scalability | Cyber threat response, mental health observation, geographic relation understanding, etc. | Lowers the threshold for knowledge access and enables rapid knowledge expansion | Data quality is inconsistent, and there are reliability concerns, which may lead to incorrect information generation by LLMs |
Domain Knowledge Graph (DKG) | Specific to a particular domain, recording domain-specific knowledge | Medicine, electric power, etc. | Provides targeted solutions for specific domains | Strong dependency on domain-specific data, with the risk of overfitting |
KGE Method Category | Working Procedure | Advantages | Disadvantages |
---|---|---|---|
Methods that use LLMs’ prompts | Extract structural and auxiliary information from CKG triplets and convert it into text descriptions to serve as prompts for LLMs. Leverage LLMs’ text generation and reasoning capabilities to complete tasks. | Effectively utilize the text generation and reasoning capabilities of LLMs to integrate CKG knowledge embeddings naturally. Useful for complex tasks like knowledge completion and uncovering hidden semantic relationships. | Potential information loss or semantic bias during the conversion of triplets to text. Linguistic ambiguity may reduce the accuracy of the original structural information. |
Methods for fine-tuning LLMs | Collect CKG triplet data, convert them into a format suitable for model training, and fine-tune the LLMs with specific loss functions to optimize performance for the task. | Allows the model to learn semantic relationships in CKG triplets, improving performance on CKG tasks and enhancing the model’s professionalism. | Requires large amounts of high-quality annotated data. Data collection and preprocessing are time-consuming. Risk of overfitting if data are insufficient or of low quality. |
Methods of pre-trained LLMs | Convert CKG triplets and background information into text format and use this data for pre-training tasks (e.g., MLM, NSP) to update model parameters. | Enhances the model’s understanding of CKG semantics and relationships through pre-training, enriching its knowledge reserves. | Conversion to text format may introduce noise or information loss, affecting the accuracy of learned prior knowledge. |
Methods to use LLMs as an agent | Use LLMs as an agent to interactively explore entities and relations on CKGs and perform reasoning based on retrieved knowledge. | Leverages pre-trained knowledge and reasoning patterns in LLMs, combined with CKG structure, for complex knowledge reasoning. | Currently less explored and requires further investigation. |
Datasets | Datasets Resource |
---|---|
ICEWS 05-15 [43] | https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/28075 (accessed on 15 May 2025) |
GDELT [44] | https://www.gdeltproject.org/data.html (accessed on 15 May 2025) |
YAGO [38] | https://yago-knowledge.org/ (accessed on 15 May 2025) |
UMLS [39] | https://www.nlm.nih.gov/research/umls/index.html (accessed on 15 May 2025) |
MetaQA [77] | https://github.com/yuyuz/MetaQA (accessed on 15 May 2025) |
ReVerb45K [78] | https://github.com/Gorov/REVERB (accessed on 15 May 2025) |
FB15k-237 [60] | https://github.com/TimDettmers/ConvE/blob/master/FB15k-237.tar.gz (accessed on 15 May 2025) |
WN18RR [70] | https://github.com/TimDettmers/ConvE/blob/master/WN18RR.tar.gz (accessed on 15 May 2025) |
Dataset Name | Data Source | Number of Entities | Number of Relations | Number of Triples | Application Scenarios | Features |
---|---|---|---|---|---|---|
ICEWS 05-15 | Global news media, government reports, sociological research | ∼50 k | - | ∼400 k | Time-series knowledge graphs, event prediction | Covers global political, social, and economic events from 2005 to 2015 |
GDELT | Global news media reports | ∼100 k | - | ∼3 million | Social science research (conflict analysis, political stability) | Contains global events, language, and sentiment information |
YAGO | Wikipedia, WordNet, GeoNames | ∼1.2 million | - | ∼350 k | Entity linking, relation extraction, knowledge graph construction | Provides rich semantic information |
UMLS | National Library of Medicine (USA) | ∼300 k | - | ∼1 million | Biomedical text processing, information retrieval, knowledge graph construction | Contains a large number of medical terms and concepts |
MetaQA | Movie database | ∼4.3 k | - | ∼300 k | Machine reading comprehension, question answering system development, multi-hop reasoning | Includes 1-hop, 2-hop, and 3-hop question answering pairs |
ReVerb45K | Web page text | ∼10 k | - | ∼45 k | Relation extraction, knowledge graph construction | Automatically extracted relation instances from web page text |
FB15k-237 | Subset of Freebase knowledge graph | 15,028 | 237 | ∼310 k | Knowledge graph completion, link prediction | Processed subset to reduce overlap between training and testing sets |
WN18RR | Subset of WordNet knowledge graph | 40,943 | 11 | ∼93 k | Knowledge graph completion, link prediction | Revised subset to increase task difficulty |
Reference | Code Resource |
---|---|
RecInDial: A Unified Framework for Conversational Recommendation with Pretrained Language Models [43] | https://github.com/Lingzhi-WANG/PLM-BasedCRS (accessed on 15 May 2025) |
Knowledge Graph Large Language Model (KG-LLM) for Link Prediction [44] | https://anonymous.4open.science/r/KG-LLM-FED0 (accessed on 15 May 2025) |
THINK-ON-GRAPH: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph [53] | https://github.com/IDEA-FinAI/ToG (accessed on 15 May 2025) |
Unifying Local and Global Knowledge: Empowering Large Language Models as Political Experts with Knowledge Graphs [38] | https://github.com/xymou/PEG (accessed on 15 May 2025) |
KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion [39] | https://github.com/wylResearch/KC-GenRe (accessed on 15 May 2025) |
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques [77] | https://anonymous.4open.science/r/KGQA-270F (accessed on 15 May 2025) |
From Large Language Models to Knowledge Graphs for Biomarker Discovery in Cancer [78] | https://github.com/rezacsedu/llms_to_kgs_for_cancer (accessed on 15 May 2025) |
Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models [60] | https://github.com/quqxui/MPIKGC (accessed on 15 May 2025) |
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction [70] | https://github.com/usail-hkust/UrbanKGent (accessed on 15 May 2025) |
Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models [63] | https://docs2kg.ai4wa.com (accessed on 15 May 2025) |
zrLLM: Zero-Shot Relational Learning on Temporal Knowledge Graphs with Large Language Models [55] | https://github.com/ZifengDing/zrLLM (accessed on 15 May 2025) |
GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models [57] | https://github.com/mayhugotong/GenTKG (accessed on 15 May 2025) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, B.; Fang, Y.; Xu, N.; Hou, S.; Li, X.; Li, Q. Large Language Models for Knowledge Graph Embedding: A Survey. Mathematics 2025, 13, 2244. https://doi.org/10.3390/math13142244
Liu B, Fang Y, Xu N, Hou S, Li X, Li Q. Large Language Models for Knowledge Graph Embedding: A Survey. Mathematics. 2025; 13(14):2244. https://doi.org/10.3390/math13142244
Chicago/Turabian StyleLiu, Bingchen, Yuanyuan Fang, Naixing Xu, Shihao Hou, Xin Li, and Qian Li. 2025. "Large Language Models for Knowledge Graph Embedding: A Survey" Mathematics 13, no. 14: 2244. https://doi.org/10.3390/math13142244
APA StyleLiu, B., Fang, Y., Xu, N., Hou, S., Li, X., & Li, Q. (2025). Large Language Models for Knowledge Graph Embedding: A Survey. Mathematics, 13(14), 2244. https://doi.org/10.3390/math13142244