Ontological Representation of the Structure and Vocabulary of Modern Greek on the Protégé Platform
Abstract
:1. Introduction
2. Methodology and Usefulness
3. The New Ontological Knowledge System of Modern Greek
- The Greek Language Ontology, which captures the structure of the Modern Greek language at its various levels of analysis;
- The Greek Ontology Dictionary, which includes in its entries the vocabulary of the Modern Greek language;
- The Supply Chain Ontology, which explores a vertical application of the language in the specific domain, as a proof-of-concept implementation.
3.1. The Ontology of Modern Greek Language
- which concepts of language should be approached for a complete analysis of the field,
- which constituents of the language should be considered primary or secondary,
- how these constituents should be structured to have a more complete structure of the language at all its levels,
- from which perspective natural language should be approached as a unified whole, etc.
- it achieves the mapping of the properties of the concepts of Modern Greek;
- it defines stable and well-defined semantic generalization and specialization relations between concepts;
- it creates a hierarchical relation between concepts and their semantic associations by linking them as nodes;
- the inheritance of properties, namely multiple and non-monotonic inheritance, is supported;
- and terminologies supported by valid reasoning are developed, through a systematic way of encoding associations, based on rules, axioms, functions, and constraints, while conclusions are drawn.
3.2. The Greek Ontology Dictionary
- the existence of phonetic transcription;
- the definition/interpretation;
- morphological and grammatical information;
- the connection of each lemma of a constituent word (noun, adjective, verb, etc.) with its declension paradigm;
- the recording of the largest possible number of expressions and phrases of Modern Greek, related to the entry;
- etymology;
- synonyms, antonyms, diminutives, magnifying nouns;
- English translation;
- textual/phonological features;
- examples of usage;
- spelling;
- and, occasionally, photographs related to each entry.
3.3. The Ontology of the Supply Chain
4. Designing the New Ontological Dictionary of Modern Greek
- the hierarchy of concepts, which contains the entities,
- and the hierarchy of conceptual relations, which contains the semantic relations of the entities.
4.1. Design Methodology of the New Ontological Dictionary
- the definition of classes in the ontology and their organization into a taxonomic hierarchy (superclass–subclass);
- the definition of the relations that link the classes to each other, i.e., the definition of object properties;
- the definition of the individuals/instances, their annotation properties, and the relations between the individuals, i.e., the definition of the data properties;
- the definition of the values of the object properties and of the data properties of the individuals;
- the axioms, or the rules defined based on the relations between classes and individuals.
4.1.1. Identifying the Scope of Definition
- What is the scope of definition that the ontology will cover?
- What will we use the ontology for?
- Who will use and maintain the ontology?
- For which types of questions should the information in the ontology provide answers?
- How will the four levels of linguistic analysis be linked? etc.
- and to the level of operation of the language at its various levels, e.g.:
- What is the definition-interpretation of a word?
- What are the morphosyntactic features of a word?
- Which words have only singular and which have only plural form?
- Which parts of speech can form the first or second compound in a compound word?
- How is the concept of polysemy defined in Modern Greek?
- How does the origin of words affect the style of speech?
- How can we be led in the automatic derivation of new words (derivative and compound)?
- Why can a grammatical type be formed in various ways in Modern Greek (e.g., the monosyllabic and periphrastic formation of perfect tenses)?
- When are the conjunctions introducing the same kind of subordinate clause used (e.g., when is the conjunction ’since’ and when is the conjunction ’while’ used in temporal clauses);
- Where (in what circumstances) is each of the words with similar meanings, e.g,. synonyms, used? etc.
4.1.2. Ontology from Scratch
4.1.3. Language Resources
4.1.4. Definition of Classes-Subclasses
4.1.5. Defining Relations
- relations are denoted by phrases that indicate logical naming conventions, which means that they obey a logical pattern (e.g., isA, aKindOf, agreesIn, consistsOf, derivedFrom, describes, expresses, forms, functionsSyntacticallyAs includes isAccompaniedBy, isDistinguisedIn, isDividedInto, isPartOf, refersTo, relatedTo, represents);
- each relation has its inverse, wherever this is possible (e.g., the relation ‘follows’ has its inverse ‘isFollowedBy’); and
- the types of relations follow a taxonomy and extend to all four levels of language analysis to ensure consistency of the language.
- the relation ‘isA’, and also the relations ‘canBe’, ‘mayBe’;
- the relation ‘consistsOf’, and also the relation ‘mayConsistOf’;
- the relation ‘expresses’, and also the relation ‘canExpress’;
- the relation ‘takesAsNegation’, and also ‘canTakeAsNegation’;
- the relation ‘isFormedBy’, and also ‘mayBeFormedBy’ and ‘canBeFormedBy’;
- the relations ‘isAccompaniedBy’ and ‘canBeAccompaniedBy’, etc.
- the relation ‘isAccompaniedBy’, but also the relation ‘isUsuallyAccompaniedBy’;
- the relation ‘forms’, but also the relation ‘usuallyForms’ or even the relation ‘canBeUsuallyFormedBy’;
- the relation ‘usuallyTakesAnObjectIn’, but also the relation ‘rarelyTakesAnObjectIn’.
4.1.6. Determining the Data Properties
- αδίστακτος/adistaktos/ and αδίσταχτος/adistachtos/ (unscrupulous),
- different forms of the masculine, e.g., two forms for the masculine noun upholsterer:
- ταπετσιέρης/tapetsieris/ and ταπετσέρης/tapetseris/
- or of the feminine, e.g., three forms for the feminine noun doctor:
- γιατρίνα/yatrina/, γιάτρισσα/yatrissa/, γιατρέσσα/yatressa/,
- the other gender in a word (e.g., ο διευθυντής—the director, η διευθύντρια—the directress), and the English translation (Appendix A—item 4) that a word-lemma may have, the characteristics of object properties, i.e., the data properties (facets of slots), were defined, as shown in Figure 2. This is information that, like the lemmas, is also automatically extracted from [18] via code in Python computer language.
4.1.7. Defining Instances
4.1.8. Definition of Rules and Axioms
4.2. The Classes and Subclasses of Modern Greek Ontology
4.2.1. The ‘Morphology’ Class
4.2.2. The ‘Syntax’ Class
- a sentence of desire, of estimation, interrogative, or exclamatory (the division criterion being its content);
- whether it is a simple, augmented, elliptical, or compound sentence (the division criterion being its basic terms);
- or even whether it is an affirmative or negative sentence (the division criterion being its quality).
4.2.3. The ‘Semantics’ Class
- the derivational endings of adjectives derived from adverbs, verbs, adjectives, or nouns;
- the derivational endings of adverbs;
- the derivational endings of nouns; and
- the derivational endings of verbs derived from names or non-inflected words.
4.2.4. The ‘Phonetics’ Class
5. Results and Evaluation
- the alphabetical listing (and coding) of lemmas, which allows for their easy searching;
- the search for information about the various meanings and uses of words, their correct spelling, inflection and pronunciation, their valid etymology, their conceptual connection with other words, their position in the text and, more generally, their comprehensive mapping, representation and description;
- the grouping of endings of nominal and verbal types, which makes it suitable for use by natural language automatic processing programs (taggers and lemmatizers);
- the connection of all four levels of the language, which enables the encoding of the semantic relations existing at the grammatical-syntactic level;
- the complete rendering of lexical meaning through the various correlations of lexical entities/lemmas, which can infer all the word forms and associations derived, as relations related, for example, to inflectional or derivational patterns that have been developed, e.g., συγγράφω > συγγράφεις (I write > you write), or συγγράφω > συγγραφέας (write > writer), respectively;
- the possibility of multiple inheritance;
- the possibility of dealing with cases of ambiguity;
- the existence of an English translation for each lemma-instance, so that the ontology can be understood and used internationally, and also so that the differences between the Greek language and English can be more easily seen.
5.1. Connecting the Levels of Linguistic Analysis
- the agreement of the verb with its subject (level of syntax) in person and number (level of morphology);
- the agreement of the predicative with its subject (level of syntax) in gender, case and number (level of morphology);
- the case (level of morphology) of uttering the direct, indirect and corresponding object (level of syntax);
- the syntactic functions (level of syntax) that the different parts of speech can have (level of morphology) as subject, object, predicative, nominal or adverbial attributes, etc.
- the inflection and tense (level of morphology) of the utterance of subordinate clauses, as well as the rendering of conditional clauses (level of syntax), so that the meaning of each sentence (semantics level), even of conditionals, can be ontologically captured;
- the manner of utterance (level of syntax) of the moods (level of morphology) in relation to the meaning of the moods (level of semantics);
- the cases (level of morphology) with which the prepositions are formed into (level of syntax) and the meaning expressed by the prepositional phrases (level of semantics) that are created.
5.2. Multiple Inheritance at All Levels of Linguistic Analysis
- combine features of the name and the verb;
- sometimes have features of inflected types and sometimes features of non-inflected types, e.g., the non-inflected participle in -οντας/-ontas/ or ώντας/-όntas/ and the inflected participle in -μένος/-menos/; and
- belong to the impersonal moods which are non-inflected, but also to the pendant parts of the verb, which is an inflected grammatical type.
5.3. Linguistic Ambiguity
5.4. Automatic Management and Processing of Information
6. Conclusions and Future Approaches
- it is readable and understandable by computers (machine-readable);
- it provides semantic description to the contents of the internet;
- it makes it possible with simple reasoning mechanisms to perform concept-based search instead of keyword-based search, thus enabling semantic focus of queries, questions and answers in terms of more than one, and the use of text transformation operators;
- it enables automated inference and reasoning services;
- it is a powerful tool for database integration and natural language understanding [22];
- it can participate in the use of different information sources in a variety of applications;
- it includes an electronic conceptual dictionary of Modern Greek that organizes its linguistic material ontologically, according to the standards of dictionaries at the international level; and
- it is the most appropriate way of representing linguistic knowledge, since it allows for the definition of relationships between words, which is not present in a standard dictionary [23].
- Linguistic and cross-linguistic research: Facilitates research in linguistics by offering a structured framework for analyzing and comparing languages.
- Educational applications: Supports language learning tools by analyzing grammar, syntax, and linguistic structures.
- Interconnection with other dictionaries and enrichment: Enables the integration and enhancement of existing dictionaries through semantic alignment and data sharing.
- Search Engine Optimization (SEO): Improves semantic analysis of queries and search results, enhancing search engine performance.
- Natural Language Generation (NLG): Assists in producing high-quality, context-sensitive content in various styles.
- Natural Language Understanding (NLU): Enhances the interpretation of texts and conversations for more accurate understanding.
- Interoperability and data exchange: Ensures seamless integration with other systems for improved data sharing and interaction.
- Semantic web integration: Contributes to better categorization and searchability of data on the semantic web.
- Decision-making systems: Supports decision-making processes in fields like healthcare, law, and administration by providing structured linguistic data.
- Sentiment analysis: Assists in identifying emotional and ideological content within texts, aiding in sentiment analysis applications.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
- The term “word” means any linguistic unit that contains meaning and grammatical determination (see [24]).
- The term “concept” means the totality of the main characteristics of a multitude of similar objects, concrete or abstract, as well as the permanent and definite representation formed in our minds of them (see [25]).
- The relationships that connect linguistic elements/linguistic units to a specific linguistic environment or other similar units, through contrast and substitution, are paradigmatic [26]. The relations that the linguistic unit enters into as a result of its occurrence with other similar units are constitutional.
- There were, of course, cases in which there was no correspondence between the Greek terms and the English ones, due to the lack of the corresponding grammar-syntactic phenomena in English. However, an attempt was made to reproduce these terms as closely as possible to their meaning in Greek. Thus, for example, the term ‘ειδική πρόταση’, which does not exist in English, was rendered with the term: ‘sentence_of_saying’.
- The annotation property: phrase lists stereotyped compounds whose meaning has been completely removed from the literal and ceases to have any literal relation to the words of which they are composed, referring directly to a metaphor, e.g., “the last hole of the flute”, a phrase referring to a person we consider inferior and do not consider (see [27]).
References
- Samaridi, N.; Karanikolas, N.; Papoutsidakis, M.; Papakitsos, E. A Survey on Ontological Organization of Data in the Semantic Web. In Proceedings of the 14th International Scientific Conference “eRA 2021” in the Field of “Industry 4.0”, Data Management and Educational Parameters of Industry 4.0, Athens, Greece, 19 October 2021; Available online: http://era-conference.teipir.gr/ (accessed on 12 June 2024).
- Samaridi, N.; Karanikolas, N.; Papakitsos, E. Lexicographic Environments in Natural Language Processing (NLP). In Proceedings of the 24th Pan-Hellenic Conference on Informatics (PCI 2020), Athens, Greece, 20–22 November 2020. [Google Scholar] [CrossRef]
- Samaridi, N.; Karanikolas, N.; Papakitsos, E.; Papoutsidakis, M. Designing a Greek Electronic Dictionary based on Ontology. In Proceedings of the 24th Pan-Hellenic Conference on Informatics (PCI 2020), Athens, Greece, 20–22 November 2020. [Google Scholar] [CrossRef]
- McGuinness, N.; Van Harmelen, F. OWL Web Ontology Language Overview. February 2004. Available online: www.w3.org/TR/owl-features/ (accessed on 27 July 2024).
- Sirin, E.; Parsia, B.; Grau, B.C.; Kalyanpur, A.; Katz, Y. Pellet: A practical OWL-DL reasoner. J. Web Semant. Sci. Serv. Agents World Wide Web 2007, 5, 51–53. [Google Scholar] [CrossRef]
- Shearer, R.; Motik, B.; Horrocks, I. HermiT: A Highly-Efficient OWL Reasoner. In Proceedings of the Fifth OWLED Workshop on OWL: Experiences and Directions, Collocated with the 7th International Semantic Web Conference (ISWC-2008), Karlsruhe, Germany, 26–27 October 2008. [Google Scholar]
- Grigoriadou, M.; Kornilakis, H.; Galiotou, E.; Stamou, S.; Papakitsos, E. The Software Infrastructure for the Development and Validation of the Greek Wordnet. J. Inf. Sci. Technol. 2004, 7, 89–105. [Google Scholar]
- Copeland, G.; Khoshafian, S. Identity and Versions for Complex Objects. In Proceedings of the 1986 International Workshop on Object-Oriented Database Systems, Pacific Grove, CA, USA, 23–26 September 1986; Available online: https://dl.acm.org/doi/10.5555/318826.318873 (accessed on 27 July 2024).
- Stavroulas, G. Object-Oriented Methodology and Tools for the Integration of Structured and Semi-Structured Information Using Ontologies [Aντικειµενοστραφής Μεθοδολογία και Εργαλεία για την Oλοκλήρωση ∆οµηµένης και Hµι-∆οµηµένης Πληροφορίας µε Χρήση Oντολογιών]. Ph.D. Thesis, National Technical University of Athens, Athens, Greece, 2016. (In Greek). [Google Scholar]
- Samaridi, N.E.; Karanikolas, N.N.; Papoutsidakis, M.; Papakitsos, E.C.; Papakitsos, C.E. A survey on supply chain ontologies. Int. J. Prod. Manag. Eng. 2023, 11, 89–101. [Google Scholar] [CrossRef]
- Guarino, N., Ed.; Formal Ontology in Information Systems. In Proceedings of the FOIS’98, Trento, Italy, 6–8 June 1998; Available online: https://www.researchgate.net/publication/272169039_Formal_Ontologies_and_Information_Systems (accessed on 12 June 2024).
- Samaridi, N.; Papakitsos, E.; Papoutsidakis, M.; Mouzala, M.; Karanikolas, N. Developing a Logistics Ontology for Natural Language Processing. WSEAS Trans. Inf. Sci. Appl. 2024, 21, 385–397. [Google Scholar] [CrossRef]
- Noy, N.; McGuinness, D.L. Ontology Development 101: A Guide to Creating Your First Ontology. Knowledge Systems Laboratory of Stanford University, Technical Report KSL-01-05 and Stanford Medical Informatics Technical Report SMI-2001-0880, March 2001. Available online: https://protege.stanford.edu/publications/ontology_development/ontology101.pdf (accessed on 12 June 2024).
- Grüninger, M.; Fox, M.S. Methodology for the Design and Evaluation of Ontologies. In Proceedings of the Workshop on Basic Ontological Issues in Knowledge Sharing, IJCAI-95, Montreal, QC, Canada, 19–21 August 1995. [Google Scholar]
- Zachman, J.A. A framework for information systems architecture. IBM Syst. J. 1987, 26, 276–292. [Google Scholar] [CrossRef]
- Triantaphyllidis, M. Modern Greek Grammar; Update of the Small Modern Greek Grammar; OEDB: Athens, Greece, 2007. (In Greek) [Google Scholar]
- Hatzisavvidis, S.; Hatzisavvidou, A. Grammar of the Modern Greek Language, A, B, C High School; ITYE “Diophantos”: Athens, Greece, 2011. (In Greek) [Google Scholar]
- Electronic Dictionary of Common Modern Greek by M. Triantafyllidis. Available online: https://www.greek-language.gr/greekLang/modern_greek/tools/lexica/triantafyllides/ (accessed on 27 July 2024). (In Greek).
- Supply Chain. Available online: https://www.supplychain.gr (accessed on 27 July 2024). (In Greek).
- Musen, M.A. Dimensions of knowledge sharing and reuse. Comput. Biomed. Res. 1992, 25, 435–467. [Google Scholar] [CrossRef] [PubMed]
- Gruber, T.R. A Translation Approach to Portable Ontology Specification. Acad. Press 1993, 2, 199–220. [Google Scholar] [CrossRef]
- Dahlgren, K. A Linguistic Ontology. Int. J. Hum.-Comput. Stud. 1995, 43, 809–818. [Google Scholar] [CrossRef]
- Markantonatou, S.; Fotopoulou, A. The tool “Ekfrasi”. In Proceeding of the 8th International Conference on Greek Linguistics, the Lexicography Workshop, Ioannina, Greece, 8–10 September 2007. [Google Scholar]
- Gate for the Greek language. Available online: https://www.greek-language.gr/greekLang/modern_greek/tools/lexica/triantafyllides/search.html?lq=%CE%BB%CE%AD%CE%BE%CE%B7&dq (accessed on 27 July 2024). (In Greek).
- Gate for the Greek Language. Available online: https://www.greek-language.gr/greekLang/modern_greek/tools/lexica/triantafyllides/search.html?lq=%22%CE%AD%CE%BD%CE%BD%CE%BF%CE%B9%CE%B1+1%22&dq (accessed on 27 July 2024). (In Greek).
- Babiniotis, G. Theoretical Linguistics, Introduction to Modern Linguistics [Θεωρητική Γλωσσολογία, Εισαγωγή στη Σύγχρονη Γλωσσολογία]; Babiniotis: Athens, Greece, 1988; p. 121. (In Greek) [Google Scholar]
- Gate for the Greek Language. Available online: https://www.greek-language.gr/greekLang/modern_greek/tools/lexica/triantafyllides/semasiology.html#toc006 (accessed on 27 July 2024). (In Greek).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Samaridi, N.; Papakitsos, E.; Karanikolas, N. Ontological Representation of the Structure and Vocabulary of Modern Greek on the Protégé Platform. Computation 2024, 12, 249. https://doi.org/10.3390/computation12120249
Samaridi N, Papakitsos E, Karanikolas N. Ontological Representation of the Structure and Vocabulary of Modern Greek on the Protégé Platform. Computation. 2024; 12(12):249. https://doi.org/10.3390/computation12120249
Chicago/Turabian StyleSamaridi, Nikoletta, Evangelos Papakitsos, and Nikitas Karanikolas. 2024. "Ontological Representation of the Structure and Vocabulary of Modern Greek on the Protégé Platform" Computation 12, no. 12: 249. https://doi.org/10.3390/computation12120249
APA StyleSamaridi, N., Papakitsos, E., & Karanikolas, N. (2024). Ontological Representation of the Structure and Vocabulary of Modern Greek on the Protégé Platform. Computation, 12(12), 249. https://doi.org/10.3390/computation12120249