Knowledgebra: An Algebraic Learning Framework for Knowledge Graph
Abstract
:1. Introduction
2. Knowledgebra: An Emergent Algebra in Knowledge Graph
2.1. A Categorical Language for Knowledge Graph
- A class of objects;
- A class of morphisms, or arrows, or maps between the objects;
- A domain, or source object class function ;
- A codomain, or target object class function ;
- For every three objects a, b, and c, a binary operation called composition of morphisms; the composition of , and is written as ;
- Associativity: if , and , then ;
- Identity: for every object x, there exists a morphism , called the identity morphism for x, such that every morphism satisfies , and every morphism satisfies .
2.2. Logic Construction versus Logic Extraction
- The hand-crafting effort of integrating logic rules becomes impractical when the number of rules gets large;
- The explicit construction could not accommodate any possible faults;
- The construction could only take into account rules known a priori, and could not observe new ones (with logic operators, higher-order rules could be composed; However, here we refer to an inductive process to obtain new elementary rules).
2.3. Algebraic Constraints in KGE
- Totality:, is also in ;
- Associativity:, ;
- Identity:;
- Invertibility:;
- Commutativity:.
3. A Semigroup Based Instantiation of Knowledge Graph Embedding
3.1. Model Design and Analysis
3.1.1. Embedding Spaces for Entities and Relations
3.1.2. Distance Function for Similarity Measure
3.1.3. Low Dimensional Relation Embedding
- shared blocks: instead of using n distinct matrices, we use identical copies of one matrix, i.e., . The number of parameters of embedding for one relation then reduces from to , which is a super low dimensional embedding, termed as SemE-s.
- shared blocks with shift: in the case where a single matrix is insufficient while low-dimensional efficiency is still demanded, we could break the symmetry among n subspaces by introducing a block-dependent shift . Precisely, the transformation in each subspace could be written as:The number of parameters is then . And we term the resulting model as SemE-s (importantly, this shift corresponds to a translation in each subspace, which, together with the matrix multiplication, still hold a semigroup structure. The resulting operation is quite similar to a Euclidean group but with non-invertible elements).
3.2. Experiments on Benchmark Datasets
3.2.1. Experimental Setup
3.2.2. Experiment Results
4. Integrating Human Knowledge into Knowledge Graph Embedding
4.1. A Regularization Method for Logic Rules
4.2. Kinship: A Case Study of Logic Integration
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
KG | Knowledge Graph |
KGE | Knowledge Graph Embedding |
References
- Guo, Q.; Zhuang, F.; Qin, C.; Zhu, H.; Xie, X.; Xiong, H.; He, Q. A Survey on Knowledge Graph-Based Recommender Systems. IEEE Trans. Knowl. Data Eng. 2020, 1, 5555. [Google Scholar] [CrossRef]
- Bordes, A.; Weston, J.; Usunier, N. Open question answering with weakly supervised embedding models. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Nancy, France, 15–19 September 2014; pp. 165–180. [Google Scholar]
- Bordes, A.; Chopra, S.; Weston, J. Question Answering with Subgraph Embeddings. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014; pp. 615–620. [Google Scholar] [CrossRef]
- Huang, X.; Zhang, J.; Li, D.; Li, P. Knowledge graph embedding based question answering. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, Melbourne, Australia, 11–15 February 2019; pp. 105–113. [Google Scholar]
- Hoffmann, R.; Zhang, C.; Ling, X.; Zettlemoyer, L.; Weld, D.S. Knowledge-based weak supervision for information extraction of overlapping relations. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, OR, USA, 19–24 June 2011; pp. 541–550. [Google Scholar]
- Daiber, J.; Jakob, M.; Hokamp, C.; Mendes, P.N. Improving efficiency and accuracy in multilingual entity extraction. In Proceedings of the 9th International Conference on Semantic Systems, Graz, Austria, 4–6 September 2013; pp. 121–124. [Google Scholar]
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
- Thakur, N.; Han, C.Y. A Study of Fall Detection in Assisted Living: Identifying and Improving the Optimal Machine Learning Method. J. Sens. Actuator Netw. 2021, 10, 39. [Google Scholar] [CrossRef]
- Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.D.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 2020, 33, 1877–1901. [Google Scholar]
- Sun, Z.; Deng, Z.H.; Nie, J.Y.; Tang, J. RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space. In Proceedings of the International Conference on Learning Representations, New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
- Chami, I.; Wolf, A.; Juan, D.C.; Sala, F.; Ravi, S.; Ré, C. Low-Dimensional Hyperbolic Knowledge Graph Embeddings. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 5–10 July 2020; pp. 6901–6914. [Google Scholar] [CrossRef]
- Bordes, A.; Usunier, N.; Garcia-Durán, A.; Weston, J.; Yakhnenko, O. Translating Embeddings for Modeling Multi-relational Data. In Proceedings of the 26th International Conference on Neural Information Processing Systems (NIPS’13), Mountain View, CA, USA, 6–12 December 2020; Volume 2, pp. 2787–2795. [Google Scholar]
- Fan, M.; Zhou, Q.; Chang, E.; Zheng, F. Transition-based knowledge graph embedding with relational mapping properties. In Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing, Phuket, Thailand, 12–14 December 2014; pp. 328–337. [Google Scholar]
- Xiao, H.; Huang, M.; Zhu, X. From One Point to a Manifold: Knowledge Graph Embedding for Precise Link Prediction. In Proceedings of the IJCAI’16, New York, NY, USA, 9–15 July 2016; pp. 1315–1321. [Google Scholar]
- Feng, J.; Huang, M.; Wang, M.; Zhou, M.; Hao, Y.; Zhu, X. Knowledge graph embedding by flexible translation. In Proceedings of the Fifteenth International Conference on the Principles of Knowledge Representation and Reasoning, Cape Town, South Africa, 25–29 April 2016. [Google Scholar]
- Xiao, H.; Huang, M.; Hao, Y.; Zhu, X. TransA: An adaptive approach for knowledge graph embedding. arXiv 2015, arXiv:1509.05490. [Google Scholar]
- Wang, Z.; Zhang, J.; Feng, J.; Chen, Z. Knowledge graph embedding by translating on hyperplanes. In Proceedings of the AAAI Conference on Artificial Intelligence, Quebec City, QC, Canada, 27–31 July 2014; Volume 28. [Google Scholar]
- Lin, Y.; Liu, Z.; Sun, M.; Liu, Y.; Zhu, X. Learning entity and relation embeddings for knowledge graph completion. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, USA, 25–30 January 2015. [Google Scholar]
- Ji, G.; He, S.; Xu, L.; Liu, K.; Zhao, J. Knowledge graph embedding via dynamic mapping matrix. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Beijing, China, 26–31 July 2015; pp. 687–696. [Google Scholar]
- Guo, S.; Wang, Q.; Wang, L.; Wang, B.; Guo, L. Jointly embedding knowledge graphs and logical rules. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, USA, 1–5 November 2016; pp. 192–202. [Google Scholar]
- Guo, S.; Wang, Q.; Wang, L.; Wang, B.; Guo, L. Knowledge graph embedding with iterative guidance from soft rules. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Volume 32. [Google Scholar]
- Cheng, K.; Yang, Z.; Zhang, M.; Sun, Y. UniKER: A Unified Framework for Combining Embedding and Definite Horn Rule Reasoning for Knowledge Graph Inference. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Punta Cana, Dominican Republic, 7–11 November 2021; pp. 9753–9771. [Google Scholar]
- Qu, M.; Tang, J. Probabilistic logic neural networks for reasoning. Adv. Neural Inf. Process. Syst. 2019, 32, 1–14. [Google Scholar]
- Harsha Vardhan, L.V.; Jia, G.; Kok, S. Probabilistic logic graph attention networks for reasoning. In Proceedings of the Companion Proceedings of the Web Conference 2020, Taipei, Taiwan, 20–24 April 2020; pp. 669–673. [Google Scholar]
- Zhang, Y.; Chen, X.; Yang, Y.; Ramamurthy, A.; Li, B.; Qi, Y.; Song, L. Can graph neural networks help logic reasoning? arXiv 2019, arXiv:1906.02111. [Google Scholar]
- Xu, C.; Li, R. Relation Embedding with Dihedral Group in Knowledge Graph. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July–2 August 2019; pp. 263–272. [Google Scholar]
- Yang, T.; Sha, L.; Hong, P. NagE: Non-Abelian Group Embedding for Knowledge Graphs. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, Online, 19–23 October 2020. [Google Scholar] [CrossRef]
- Barr, M.; Wells, C. Toposes, Triples, and Theories. 1985. Available online: https://books.google.com.hk/books?id=q_-EAAAAIAAJ (accessed on 15 January 2022).
- Wikipedia Contributors. Category (Mathematics)—Wikipedia. The Free Encyclopedia. 2021. Available online: https://en.wikipedia.org/w/index.php?title=Category_ (accessed on 1 March 2022).
- Choudhary, S.; Luthra, T.; Mittal, A.; Singh, R. A survey of knowledge graph embedding and their applications. arXiv 2021, arXiv:2107.07842. [Google Scholar]
- Schlichtkrull, M.; Kipf, T.N.; Bloem, P.; Berg, R.v.d.; Titov, I.; Welling, M. Modeling relational data with graph convolutional networks. In Proceedings of the European Semantic Web Conference, Monterey, CA, USA, 8–12 October 2018; pp. 593–607. [Google Scholar]
- Balazevic, I.; Allen, C.; Hospedales, T. TuckER: Tensor Factorization for Knowledge Graph Completion. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China, 3–7 November 2019; pp. 5185–5194. [Google Scholar] [CrossRef] [Green Version]
- Dettmers, T.; Minervini, P.; Stenetorp, P.; Riedel, S. Convolutional 2D knowledge graph embeddings. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
- Toutanova, K.; Chen, D. Observed versus latent features for knowledge base and text inference. In Proceedings of the 3rd Workshop on Continuous Vector Space Models and Their Compositionality, Beijing, China, 26–31 July 2015; pp. 57–66. [Google Scholar]
- Miller, G.A. WordNet: A lexical database for English. Commun. ACM 1995, 38, 39–41. [Google Scholar] [CrossRef]
- Bollacker, K.; Evans, C.; Paritosh, P.; Sturge, T.; Taylor, J. Freebase: A collaboratively created graph database for structuring human knowledge. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, Vancouver, BC, Canada, 10–12 June 2008; pp. 1247–1250. [Google Scholar]
- Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
- Trouillon, T.; Welbl, J.; Riedel, S.; Gaussier, É.; Bouchard, G. Complex embeddings for simple link prediction. In Proceedings of the International Conference on Machine Learning (PMLR), New York, NY, USA, 20–22 June 2016; pp. 2071–2080. [Google Scholar]
- Yang, B.; Yih, W.t.; He, X.; Gao, J.; Deng, L. Embedding entities and relations for learning and inference in knowledge bases. arXiv 2014, arXiv:1412.6575. [Google Scholar]
Totality | Associativity | Identity | Invertibility | Commutativity | |
---|---|---|---|---|---|
semigroupoid | - | ✓ | - | - | - |
small category | - | ✓ | ✓ | - | - |
groupoid | - | ✓ | ✓ | ✓ | - |
magma | ✓ | - | - | - | - |
unital magma | ✓ | - | ✓ | - | - |
loop | ✓ | - | ✓ | ✓ | - |
semigroup | ✓ | ✓ | - | - | - |
monoid | ✓ | ✓ | ✓ | - | - |
group | ✓ | ✓ | ✓ | ✓ | - |
abelian group | ✓ | ✓ | ✓ | ✓ | ✓ |
Model | WN18RR | FB15k-237 | ||||||
---|---|---|---|---|---|---|---|---|
MRR | H@1 | H@3 | H@10 | MRR | H@1 | H@3 | H@10 | |
TransE [12] | 0.226 | - | - | 0.501 | 0.294 | - | - | 0.465 |
ComplEx [38] | 0.440 | 0.410 | 0.460 | 0.510 | 0.247 | 0.158 | 0.275 | 0.428 |
DistMult [39] | 0.430 | 0.390 | 0.440 | 0.490 | 0.241 | 0.155 | 0.263 | 0.419 |
ConvE [33] | 0.430 | 0.400 | 0.440 | 0.520 | 0.325 | 0.237 | 0.356 | 0.501 |
MuRE [32] | 0.475 | 0.436 | 0.487 | 0.554 | 0.336 | 0.245 | 0.370 | 0.521 |
RotatE [10] | 0.476 | 0.428 | 0.492 | 0.571 | 0.338 | 0.241 | 0.375 | 0.533 |
NagE [27] | 0.477 | 0.432 | 0.493 | 0.574 | 0.340 | 0.244 | 0.378 | 0.530 |
SemE | 0.481 | 0.437 | 0.499 | 0.567 | 0.354 | 0.258 | 0.393 | 0.548 |
isWifeOf | isHusbandOf | Identity |
isHusbandOf | isMotherOf | isFatherOf |
isSonOf | isMotherOf | isBrotherOf |
isSonOf | isFatherOf | isBrotherOf |
isBrotherOf | isFatherOf | isUncleOf |
isBrotherOf | isMotherOf | isUncleOf |
isSisterOf | isFatherOf | isAuntOf |
isSisterOf | isMotherOf | isAuntOf |
isSonOf | isBrotherOf | isNieceOf |
Model | MR | MRR | |||
---|---|---|---|---|---|
SemE-s | 4.83 | 0.464 | 0.292 | 0.458 | 0.875 |
regularized SemE-s | 3.71 | 0.574 | 0.458 | 0.583 | 0.958 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, T.; Wang, Y.; Sha, L.; Engelbrecht, J.; Hong, P. Knowledgebra: An Algebraic Learning Framework for Knowledge Graph. Mach. Learn. Knowl. Extr. 2022, 4, 432-445. https://doi.org/10.3390/make4020019
Yang T, Wang Y, Sha L, Engelbrecht J, Hong P. Knowledgebra: An Algebraic Learning Framework for Knowledge Graph. Machine Learning and Knowledge Extraction. 2022; 4(2):432-445. https://doi.org/10.3390/make4020019
Chicago/Turabian StyleYang, Tong, Yifei Wang, Long Sha, Jan Engelbrecht, and Pengyu Hong. 2022. "Knowledgebra: An Algebraic Learning Framework for Knowledge Graph" Machine Learning and Knowledge Extraction 4, no. 2: 432-445. https://doi.org/10.3390/make4020019
APA StyleYang, T., Wang, Y., Sha, L., Engelbrecht, J., & Hong, P. (2022). Knowledgebra: An Algebraic Learning Framework for Knowledge Graph. Machine Learning and Knowledge Extraction, 4(2), 432-445. https://doi.org/10.3390/make4020019