An Ontology-Based Expert System for Rice Disease Identiﬁcation and Control Recommendation

: A great deal of information related to rice cultivation has been published on the web. Conventionally, this information is studied by end-users to identify pests, and to prevent production losses from rice diseases. Despite its beneﬁts, such information has not yet been encoded in a machine-processable form. This research closes the gap by modeling the knowledge-bases using ontologies and semantic technologies. Our modeled ontologies are externalized from existing reliable sources only, and offer axioms that describe abnormal appearances in rice diseases (and insects) and the corresponding controls. In addition, we developed an expert system called RiceMan, based on our ontologies, to support technical and non-technical users for diagnosing problems from observed abnormalities. We also introduce a composition procedure that aggregates users’ observation data with others for realizing spreadable diseases. This procedure, together with ontology reasoning, lies at the heart of our methodology. Finally, we evaluate our methodology practically with four groups of stakeholders in Thailand: senior agronomists, junior agronomists, agricultural students, and ontology specialists. Both ontologies and RiceMan are evaluated to verify their correctness, usefulness, and usability in various aspects. Our experimental results show that ontology reasoning is a promising approach for this domain problem.


Introduction
A critical challenge in rice cultivation is to handle rice diseases and paddy insects that affect the plants. Rice plants can be damaged by both abiotic factors (i.e., the environment and nutrient factors, such as drought and cold) and biotic factors (i.e., animals, bacteria, and viruses). When they are damaged, different kinds of observable abnormal characteristics can occur. In real-life situations, farmers generally handle these infections in manual and ad-hoc manners. For instance, they may either recall from their background knowledge or consult with experts (i.e., experienced farmers and agronomists). However, these practices are not efficient, since different experts may have diverse skills and familiarities. Indeed, knowledge should be refined and externalized to farmers so that they can recognize diseases from abnormal characteristics and pick-up treatments appropriately.
Information related to rice cultivation on diverse websites enables rice farmers to enhance their knowledge and adapt it to their circumstances. For instance, they can search and seek desirable information to identify why a disease occurs and how they should handle it. On the other hand, farmers need to consume vast and diverse sources to acquire the desired knowledge. Furthermore, such published information is usually not encoded in a machine-processable form, causing difficulties to expert systems for utilizing it properly. This problem motivated us to close the gap by constructing structured knowledge with well-defined semantics and develop an expert system for rice disease identification and control recommendation to support user decisions. At a high-level, our motivations amount to investigations of the design and modeling of rice disease ontologies and their adoption to practical applications. Therefore, our research questions (RQs) are set up as follows: • RQ1: how could a knowledge base for rice disease and paddy insects be modeled from the available unstructured resources?
• RQ2: How could a knowledge base for rice disease be applied practically?
• RQ3: How can a designed expert system be evaluated to ensure its appropriateness, from the viewpoints of stakeholders?
This research is an extended study of our proceeding research, which developed the first preliminary version of the ontologies "RiceDO" and "TreatO", covering 22 rice diseases and paddy insects, and introduced a semantic-based framework for supporting rice disease and control identification. In this paper, our novelties consist of two ontologies (RiceDO and TreatO version 2) which have been improved from RiceDO and TreatO to meet the accepted standards. The structures of RiceDO and TreatO version 2 are reorganized and are designed for retrieving possible rice pests and controls, respectively. Additionally, to practically utilize both ontologies, we extended and implemented a novel framework and expert system, called RiceMan, and then performed a thorough evaluation to ensure the correctness, usefulness, and usability. The contributions of this work are threefold: • Existing ontologies were investigated, remodeled, revised, and evaluated, practically, from the viewpoints of four groups of stakeholders; • An expert system, called RiceMan, was designed and developed for rice disease identification and control recommendation; • Results of applying a expert system in the rice domain were evaluated and reported.
Section 2 reviews the related works. Sections 3 and 4 present the ontologies and the RiceMan expert system, respectively. Section 5 explores the empirical evaluation. In Section 6, we draw our conclusions. Table 1 summarizes and compares existing ontologies and vocabularies in the agriculture domain, with respect to their main focus and covered knowledge.

Existing Ontologies and Vocabularies
AGROVOC [1] is a well-known vocabulary, covering many areas, such as food, nutrition, agriculture, fisheries, forestry, and the environment. Despite its huge size, the presented vocabulary is not applicable for rice disease identification and control recommendation due to the lack of specific rice plant axioms. Agriculture Activity Ontology (AAO) [2] is a part of the Common Agricultural Vocabulary (CAVOC). It was developed as a common vocabulary to support cooperation between different farm management systems. It defines farm activity expressions based on description logics, with eight kinds of essential attributes, e.g., purposes, actions, equipment, and crop names. The National Agricultural Library Thesaurus (NALT) (https://agclass.nal.usda.gov, accessed on 1 November 2021) is an agricultural vocabulary and glossary in English and Spanish. It covers 17 subject categories, such as food, animals, biological, farms, and insects. Phenotype and Trait Ontology (PATO) (https://github.com/pato-ontology/pato, accessed on 1 November 2021) [3] provides definitions of phenotypic qualities (properties and attributes) that can be used to define phenotypes across species' domains, such as color (e.g., red, yellow), abnormal positions, temperatures (e.g., high, low), and symptoms (e.g., swelling) [4].  [7][8][9], and others. TO provides nine broad categories of plant traits, in which each category consists of measurable characteristics of plants. Planteome develops PO to model plant-related knowledge. Thus, PO describes anatomical structures and growth stages in the plant bodies (e.g., roots, stems, leaves, fruits, and seeds) Several rice plant knowledge-bases were developed using the OWL, RDF, and OBO formats. For example, the Crop Ontology Rice Trait Ontology (CO_320) (https://cropontology. org/term/CO_320:ROOT, accessed on 1 November 2021) [10] is a part of Crop Ontology (CO) (https://cropontology.org/, accessed on 1 November 2021) [16,17], providing validated trait names as well as the terminology of phenotypic data of rice plants. Plant Disease Ontology (PDO) (https://github.com/Planteome/plant-disease-ontology, http://wiki.plantontology.org/index.php/Plant_Disease_Ontology_(PDO), accessed on 1 November 2021) [11] provides a list of maize, wheat, and rice diseases, structurally classified into three main categories, i.e., bacteria, fungi, and viruses. Plant Protection Ontology (PPOntology) (https://sites.google.com/site/ontoworks/ontologies, https: //sites.google.com/site/ppontology/home, accessed on 1 November 2021) [12,13] classifies barley's disorder into abiotic and biotic. The biotic disorder is also classified further into bacteria, fungi, and viruses. Rice Disease Ontology (RiceDO) and Treatment Ontology (TreatO) [14] initially model rice diseases and insects according to their abnormal characteristics of rice plants and defined relevant biological and chemical controls using ontological classes and relation expressions.
Regarding the engineering discipline of each ontology, CO_320 and PDO exploit annotations for modeling the pathogens and short descriptions of each disease. In contrast, PPOntology models disease names, appearances, and pathogens as individuals in the ontology. While these models can represent the domain knowledge, their choices cause difficulties in employing inference engines, such as reasoning for implicit diseases from a symptom expression. RiceDO and TreatO [14] address this issue by employing a different modeling approach using ontology class definitions. Despite their existence, they have not yet been evaluated and used empirically. These unaddressed points motivate us to justify and revise towards their practical implementation.  [19,20] provides pest diagnostic tools and fact sheets on plants via a website and a mobile application. Evidence in [19] shows that the combination of online fact sheets and plant doctors, to advise farmers, is helpful in practice. Toepfer et al. [21] reports that 86% of trained plant doctors in China indeed provide comprehensive pest management recommendations, leading to sustainable farming. CROPROTECT (https://croprotect.com, accessed on 1 November 2021) [22] is a UK-based website that aims to provide remote farmers with access to knowledge about crops, pests, weeds, and disease management. RiceXpert (https://play.google.com/store/apps/details?id=com.icar.ricexpert&hl=en&gl=US, accessed on 1 November 2021) [23] is a mobile application for disseminating rice plant knowledge, e.g., rice diseases, paddy insects, non-insect pests, weeds, nutrients, as well as trendy news, a fertilizer calculator, and online advisory services from experts via text, images, and voice. Finally, Plantix [24,25] is a pest and disease diagnosis tool that employs machine learning algorithms to recommend a customized treatment corresponding to an identified disease based on an input image. It can support the knowledge of pests and diseases of more than 46 plants; it also provides a fertilization calculator. Regarding image-based disease detection, ref. [31] reviews various modern feature extraction techniques for plant disease detection. In [32], the authors propose a C-GAN based deep learning approach to detect tomato diseases from leaf images. The residualbased CNN architecture is proposed in [33]. Still, there exist various expert systems for plant disease diagnosis. An android-based application in [34] is developed to identify fruit diseases from photos by using K-Means to segment each damaged area, using an artificial neuron network (ANN) to classify the diseases. A desktop application for diagnosing six papaya plant diseases [35] is developed as an IF-THEN rule-based system, using CLIPS and Delphi language. In [36], a fuzzy logic rule-based web application for diagnosing banana diseases and pests contains 37 symptoms for 5 pests and 4 diseases of banana plants. In [37], a web-based system for managing nine rice diseases and pests is introduced, covering 32 damages for a rice plant.

Farming Knowledge Systems and Expert Systems for Plant Disease Diagnosis
Several expert systems [14,[38][39][40] utilize ontologies and semantic web technologies for crop planning and diagnosis of plant diseases. An expert system in [38] employs an ontology and rules for recommending rice varieties and theirs corresponding crop calendar based on geographic and temporal information. Its knowledge-base and rules cover rice varieties, disease and pest resistance, soil types, and ecosystems. For disease diagnosis purposes, ref. [39] presents phytopathology ontology covering plants, diseases, causes, a comprehensive set of symptoms, recommendations, etc. Its knowledge base is modeled as individuals (ABox) with rules developed based on SWRL. On the other hand, refs. [14,40] RiceDO and TreatO ontologies model rice diseases, abnormalities, and controls as TBox axioms using the description logic-based formalism. This paper aims to improve RiceDO and TreatO to meet the accepted standards and utilizes both ontologies for identifying rice diseases and controls with the developed expert system called RiceMan.

Ontology Development for RiceMan
RiceMan exploits OWL for modeling knowledge about rice diseases and controls. Hence, it enables the disease identification and control recommendation to be functioned by an ontology reasoner equipped within the framework.

Ontology Requirements
We carefully elaborate competency questions (CQs) to clarify the ontology usage's scope. Ontologies do not merely store knowledge, but they also enable entailing implicitly valid knowledge to the users. To validate the entailment's soundness, CQs are also used during the system evaluation tasks. The CQs used in this research are shown below: CQ1: What are the recognizable diseases when a rice plant has an appearance A?

RiceDO and TreatO: Original Design
RiceDO and TreatO, initially introduced in [14], model terminological knowledge with classes and properties related to traits and phenotype of various rice diseases in Thailand. Their main objectives are to support retrieval of potential diseases that might occur on a rice plant, given an expression of abnormal characteristics with a DL query language. Supported queries could be either in the form of a single appearance (such as a gray spot) or multiple ones (such as a gray spot with a yellow oval spot). The ontologies cover 22 rice diseases and paddy insects in Thailand and coincide with the aforementioned CQs. However, the modeled knowledge is verbose and impractical for the disease identification and control recommendation. To overcome these issues, we further investigate and redesign the ontologies. Thus, incorrect and improper design issues are identified, including (1) unused classes; (2) classes and properties with ungrammatical semantics; (3) classes with semantic overlapping; (4) inappropriateness of class-subclass relationships; and (5) lack of restrictions in domains/ranges of properties.

RiceDO v2: Revised Design
RiceDO v2 is redesigned to address the aforementioned issues and is enhanced to support interoperability with related ontologies. Three reliable knowledge sources are used to formulate axioms in RiceDO: (1) Rice Knowledge Bank Thai version (Thai-RKB) [41] maintained by the Rice Department, Thailand; (2) Rice Knowledge Bank (RKB) from IRRI [26]; and (3) Rice Ontology (CO_320) [10]. Here, we consider Thai-RKB as the primary knowledge source for modeling our ontology, while the other two are used in absence of the information in Thai-RKB.
We adopt the design principle of TBox-based knowledge representation from a wellknown medical ontology called SNOMED CT (https://www.snomed.org/, accessed on 1 November 2021). Following this design principle, RiceDO v2 (http://purl.org/ricedo, accessed on 1 November 2021) [42] consists of nine top-level classes under the RiceDOTop and 11 object properties in the object property hierarchy, as shown in Figure 1a,b, respectively. RiceDO v2 uses the special-purpose property called abnormalityGroup to axiomatize a group of various characteristics that can occur together on a certain disease. We illustrate this in Figure 1c for the rice brown spot fungal disease, in which the property abnormalityGroup is used to group characteristics 'light yellow halo spot on leaf during the tillering growth stage'; that is, abnormalityGroup some (SpotOnLeaf and (hasColor some LightYellow) and (hasGrowthStage some Tillering) and (hasShape some Halo)). Another radical change in RiceDO v2 is to redefine Insect as a kind of Pest instead of RiceDisease as in the first version. In addition, the class PestDamage is redefined to characterize damages caused by pests. Furthermore, the class PlantDisease subsumes 18 diseases, which are categorized under four classes, namely PlantBacterialDisease, PlantFungalDisease, PlantPhytoplasmaDisease, and PlantViralDisease.

a. Class Hierarchy b. Object Property Hierarchy c. Axiom's Description
Moreover, RiceDO v2 complies with good modeling practices of W3C. Regarding the interoperability, RiceDO v2 reuses terms and handles potential mappings with existing ontologies: PDO, PO, and PATO. Rice disease names are imported from PDO. Besides, terms of plant parts and growth stages are mapped to existing equivalent classes in PO by using owl:equivalentClass. Similarly, terms of colors and shapes are mapped to the existing ones in PATO. Mapping terms to PO and PATO are used instead of importing in order to maintain the meaning and simplicity of RiceDO v2's class hierarchy.

TreatO V2: Revised Design
In [43], four approaches for handling rice diseases are discussed: following the recommended physical practices, using host resistance, applying biological controls, and applying chemical controls. Coincided with CQs, the original TreatO focuses on modeling the biological and chemical control knowledge of each disease and paddy insects based on three knowledge sources: RKB's fact sheets [26] and plant disease management knowledge [43,44].
The revised design of TreatO v2 contains corrected classes and properties to resolve ungrammatical semantics on the 'is-a' relation. In addition, it classifies control agents into bacteria, fungi, and viruses. Moreover, it has restrictions on all properties, and includes the class RiceDisease from RiceDO v2. Figure 2 shows the new TreatO v2 (http://purl.org/treato, acsessed on 1 November 2021) [45] with an example of a class description of the chemical control agent to its related disease.

RiceMan Application Development
This section designs and develops the RiceMan application corresponding to the previously defined CQs and the elicited requirements to support user decisions. Figure 3 depicts the use case diagram of RiceMan, covering three groups of target users: farmers, agronomists, and scholars. Firstly, farmers can use RiceMan to understand possible causes and suitable treatments for their plants. However, according to the discussion with agronomists, many traditional farmers in Thailand prefer to consult with agronomists. Thus, we consider agronomists who are responsible for supporting farmers to seek suitable treatments as the second target group. They need to understand the relationship between abnormalities and diseases and the relationship between diseases and treatments quite well. Without intelligent tools, they need to memorize all complex relationships to support farmers. Although the domain knowledge covers 18 diseases, its search space is complex since any disease can be indicated from various combinations of abnormalities: 28 colors, 11 shapes, 29 symptoms, 24 plant parts, and 5 growth stages. While RiceDO's disease axioms do not contain all combinations, a particular disease could contain up to 15 combinations. Hence, RiceMan advances agronomists' working methods by offering intelligent functionalities about diseases and treatments. RiceMan provides a more reliable approach than utilizing solely offline documentations. Lastly, scholars use RiceMan to support their activities. Since RiceMan identifies diseases based on observations and suggests treatments based on the identified diseases, then students and researchers can utilize this functionality to simulate specific scenarios for their study and research. The reasoning capability equipped with RiceMan is a perfect tool for this purpose. Important use cases of the system are explained as follows:

System Requirements
• Add observations: upon the finding of abnormal appearances on a rice plant, a user can use RiceMan to create observation data digitally that represents the observed abnormal appearances.
• Find possible diseases: to find possible diseases, both explicit and implicit diseases corresponding to an observation, RiceMan employs logical reasoning to consider the logical relationships defined in RiceDO for the identification.
• Compose observations (with respect to a distance and a duration): concerning the fact that some rice diseases and paddy insects are spreadable to nearby areas, a user may choose to include observations made within the specific distance and duration in his/her consideration of the disease diagnosis. This functionality is called the observation composition. It calculates the distance between different latitudes and longitudes and considers the duration in the unit of days.
• Find possible transmittable diseases: once users consider composing observations, the identified results can be classified into (1) possible diseases with respect to distance and duration; and (2) possible transmittable diseases, with respect to distance and duration.   Figure 4 depicts the RiceMan's architectural design, based on a client-server architecture, and implemented by Java, OWL API (http://semanticweb.org/wiki/OWL_API.html, accessed on 1 November 2021) [46], and MySQL (for storing observation data). The main reasoning technique used in RiceMan is subsumption [47], which enables to logically infer if a class expression subsumes one another or not. Thus, to enable the search of recognizable diseases and potential controls with subsumption reasoning, we convert each observation object as a class expression. Once the potential diseases are indicated, we also employ the subsumption reasoner to suggest corresponding controls.  The web server exposes five main APIs. First, createObservation provides a service to store observations into the database. Second, getSubclassOfObservation returns a list of disease names, which are sub-categories of an observation. This API can also be parameterized for the composition with other observations if a distance and a duration are satisfied. Internally, when this API is called, a corresponding OWL class expression will be determined and be subsequently used by the OWL reasoner to find their subclasses from the RiceDO ontology. The third and fourth APIs are getBiologicalTreatmentsOfDisease and getChemicalTreatmentsOfDisease, respectively. As indicated by their names, they return a list of (biological and chemical) treatments corresponding to a given disease. Lastly, API getAbnormalities returns a group of abnormality for a specified disease.

Query Construction and User Interface
Since RiceDO and TreatO ontologies are developed by the OWL language, then the appropriate query language is thus the DL query expressed in the Manchester OWL syntax. DL query uses a class expression as an input to logically determine implicit facts inside an ontology. This research uses this competency of DL query to search for certain classes using subsumption reasoning. Note that we employ subsumption in DL query because our prime objective is to classify whether an input observation expression belongs to any disease classes or not. For this purpose, a user's input is needed to transform into a DL query before it can be used to query for a certain pest. Therefore, we develop two procedures for creating DL queries from a user's input by complying with the design of our ontologies.
The first procedure is for converting an observation object in the database into an OWL class expression based on the following three steps: (1) we nest a set of symptoms, places, shapes, growth stages, transmission, and colors using existential quantification via properties hasSymptom, hasSymptomAt, hasShape, hasGrowthStage, hasTransmission, and hasColor, respectively; (2) we conjunct these nested expressions; and (3) we nest all using existential quantification via property abnormalityGroup. Figure 5a illustrates a constructed OWL class expression of an input observation object using this procedure.
abnormalityGroup some ((hasSymptom some Spot) and (hasSymptomAt some Leaf) and (hasColor some Yellow)) • Rice brown spot fungal disease • Rice narrow brown leaf spot fungal disease • Rice tungro viral disease isBiologicalControlAgentOf some Rice brown spot fungal disease isChemicalControlAgentOf some Rice brown spot fungal disease Query:

Result:
Finding diseases based on a given observation data Finding biological controls based on a given disease Finding chemical controls based on a given disease The second procedure is for treatment recommendation. That is, it nests a given disease name via property isBiologicalControlAgentOf (or isChemicalControlAgentOf depending upon a user's request) using existential quantification. Figure 5b,c illustrates an application of this procedure to recommend controls for a recognizable disease. Figure 6 shows a sequence of RiceMan's user interfaces that display the results of our queries as follows. From Steps 1-3, the user inputs the observed abnormalities as a brown oval spot on a leaf. Step 4 allows the user to define thresholds for a distance and a duration. In Step 5, RiceMan returns the identified disease(s). In Step 6, the user can click on the disease identified to get recommended treatments. Steps 7-8 show the groups of all possible abnormalities of a disease followed by the suggested controls.

Observation Data Composition
RiceMan implements the observation composition mechanism to identify diseases from multiple observations. If observations are located nearby each other, occur coincidentally, and are composable, then they are composed and are collectively treated as a single observation. Figure 7 illustrates an example in which each observation O 1 , O 2 , and O 3 occurs in different geolocations and dates as described in Table 3. Here, suppose that O 1 is collected first; after that, O 2 and O 3 are collected. RiceMan users can compose past observations with the current observation for disease identification, which is implemented as follows: 1.
Past observations that satisfy three conditions are retrieved: a distance threshold, a date threshold, and a composability boolean, from the database; 2.
An OWL class expression is created from a database object, representing a current observation manifested with certain characteristics and intersects it with the retrieved past OWL class expressions;

3.
The new observation is used for the subsequent disease identification based on the subsumption reasoning.
The observation composition mechanism helps RiceMan users delve into specific results when a single observation captures partial or broad abnormalities. Nevertheless, this mechanism may yield the same results as considering a single observation. That is when there are no observations that satisfy the three conditions. For instance, using the composition function with O 2 yields the same results as without using this function, i.e., it returns the rice brown spot fungal disease. This is because there are no nearby observations with O 2 . On the other hand, RiceMan yields the rice blast fungal disease, the rice sheath rot fungal disease, and the rice brown spot fungal disease for O 3 when the composition function is disabled (cf. the top part of Figure 8). In contrast, it yields the rice brown spot fungal disease for O 3 when this composition functionality is opted in (cf. the bottom part of Figure 8). When a single observation contains broad abnormal characteristics, diverse results can be obtained. Hence, this composition mechanism helps users narrow the disease identification from many possible disease candidates.    Figure 9 illustrates our evaluation design to validate important aspects of the developed ontologies and the RiceMan system. The evaluation comprises three important dimensions: RiceDO and TreatO ontology validation, ontology usage, and RiceMan application usability. The following subsections then elaborate each evaluation dimension in more details.

Ontology Validation
We carried out this part of the evaluation with five ontology experts, who were ontology engineer experts from NECTEC (National Electronics and Computer Technology Center) and AIT (Asian Institute of Technology) with experiences in modeling and applying ontologies in their work. Four important criteria were considered:

1.
Appropriateness: we concentrated on verifying whether each defined class, axiom, and the object property was appropriately modeled based on the domain knowledge and practical usage from rice plant experts' perspectives. Here, the participants investigated both ontologies by using Protégé and provided scores for the defined eight sub-criteria, shown in Table 4. That is, RiceDO and TreatO were modeled with 95.2% and 89.4% appropriateness, respectively.

2.
Consistency: we performed consistency checking in RiceDO and TreatO according to the six sub-criteria proposed in [48]: (1) no duplicated classes; (2) no class cycles in the class hierarchy; (3) no duplicated properties; (4) no property cycles; (5) no invalid ranges; and (6) no invalid domains. As a result, the participants found a few mistakes in some object properties' domains, and we had corrected them. Finally, all participants agreed that the revised ontologies contain no inconsistency according to the mentioned criteria.

3.
Query validation: to ensure the practicability of applying the ontologies with respect to the defined CQs, we carried out a task-based evaluation, in which we presented for each task the prepared queries corresponding to the defined CQs and their results. The participants verified whether the results were correct, and the queries were written efficiently. Table 5 shows that every query was formulated correctly and efficiently except for Query#4 (i.e., 92%) due to its semantics' incorrectness. Note that we skipped the evaluation of CQ 3 (i.e., finding the abnormalities of any disease) because it was natural to retrieve axioms of abnormalities modeled in RiceDO. Besides, we validated that our shortened axioms usage enabled us to write concise queries. Though this aspect was not part of CQs, it was helpful to realize the effectiveness towards writing practical queries.

4.
Ontology satisfaction: the participants were given a questionnaire to analyze their ontology satisfaction usage. In summary, the participants were satisfied with RiceDO in terms of identifying rice diseases based on either single or multiple abnormalities as well as its convenience in querying by means of the shortened axioms (score 100%). For TreatO, the participants were satisfied with its query facilities for searching of controls (score 88-92%). Moreover, the participants agreed that both ontologies could improve work efficiency (score 100%) and they were willing to use them (score 96%).
Overall, it shows the ontologies gain high satisfaction with an average score of 95.5%. For more details, see Table A1 in the Appendix A. The class hierarchy in RiceDO is well defined and accurate. 96 2 The object property hierarchy in RiceDO is well defined and accurate. 92 3 The abnormalityGroup is appropriate to explain a group of abnormal characteristics. 92 4 The axiom that represents each disease's characteristics is well defined and accurate by using the abnormalityGroup. 100 5 The SymptomCharacteristic and GrowthStageCharacteristic, which are used to shorten axioms, are well defined and correct. 96 Overall appropriate score of RiceDO 95.2% The class hierarchy in TreatO is well defined and accurate. 92 7 The object property hierarchy in TreatO is well defined and accurate. 88 8 The axiom representing diseases that can be treated by each biological control and chemical control is well defined and accurate. 88 Overall appropriate score of TreatO (89.4%) This evaluation was conducted to verify that our ontologies are valuable and can effectively facilitate our users with respect to the three important criteria: completeness, accuracy, and usefulness. Two groups of senior agronomists: two plant pathologists and two entomologists, from DRRD (The Division of Rice Research and Development under the Rice Department, Ministry of Agriculture and Cooperatives, Thailand), with 8-year experience on average, were involved.

1.
Evaluating the completeness of diseases and paddy insects: we evaluated the completeness in terms of the number of diseases and insects provided in RiceDO (cf. Though the number of insects' coverage seems insufficient, two of them frequently occur in rice fields, and three of them are carriers for some rice diseases (such as gall dwarf disease and rice tungro disease).

2.
Evaluating the accuracy of modeled knowledge: two aspects were considered: (1) the correctness of the modeled axioms; and (2) the appropriateness of our vocabulary. Precisely, the participants were asked to consider if the modeled axioms were correct or not. Similar steps were carried out to verify the appropriateness of the vocabulary. Table 7 shows the results of this evaluation in percentage. Note that 13 (out of 18) rice diseases, frequently found in Thailand rice fields, and four paddy insects were selected for this validation. For the error analysis, the correctness of rice disease axioms was lower than those of other classes due to two main reasons. The first reason was the inconsistency between knowledge sources and domain experts. Some participants voted the axioms as incorrectly defined because the axioms did not coincide with their experiences, although they were modeled properly according to the referenced knowledge sources (IRRI and Thai-RKB). Concerning the appropriateness of the used vocabulary, some participants argued that the terms referenced from the knowledge sources were not suitable. We found that the vocabulary's appropriateness relies on a proper term selection and a proper English-Thai translation.

3.
Evaluating the usefulness of inferred knowledge: to evaluate the practical applicability of the ontologies, we asked the participants to review the prepared inferred knowledge based on six scenarios. Specifically, scenarios #1-#2 identify rice diseases from single and multiple abnormality groups (usefulness score: 48% and 70%); Scenario #3 queries for all abnormality groups of disease (76%); Scenarios #4-#5 query for a disease's biological and chemical control agents (42% and 48%); Scenario #6 focuses on the disease classification into possible diseases and warning diseases with respect to nearby and recent observation data (90%) (cf. Table A2). From the results, Scenarios #2, #3, and #6 obtained prominently high scores, indicating that the proposed functions help the participants reduce the searching times compared to the manual search. The participants agreed that the multiple observation data and the composition mechanism, and ontology reasoning are promising for those scenarios.

RiceMan Application Usability Evaluation
Based on the proposed RiceMan system architectural design, the application was developed with the UI in both English and Thai languages. The usability evaluation was then conducted with two groups of participants: senior-year agricultural students and junior agronomists. The first group of participants were five senior-year undergraduate students (Chulalongkorn University, School of Agriculture Resources) who were knowledgeable in rice diseases and paddy insects from their degree program's coursework, but did not have direct experience in real-world practice. The students used RiceMan via their mobile phones to complete four provided tasks. For tasks #1 and #2, the participants were asked to identify rice diseases from a given single observation and multiple observations, respectively. For tasks #3 and #4, they were asked to identify biological and chemical controls from the returned diseases. After their completion, the participants provided their opinions by answering ten questions based on System Usability Scale (SUS) [49] (cf. Table A3 in the Appendix A for more information). The average SUS score among the students was 70.5. Since it was above 68, then RiceMan was acceptable from the agricultural students' viewpoints. The participants were interested in applying ontology reasoning techniques to identify diseases and treatments. The feedback shows that the application can support their study and improve the search in many real-life use cases. Since they had experience in farm work, they believed that the application can benefit the farmers, especially those interested in using new technologies to support their works. However, most users were facing difficulties with using the application due to some graphical interfaces.
The second group of participants were ten junior agronomists from Ayutthaya Rice Research Center, who worked on transferring technologies to farmers, focused on rice variety research and development, and advised farmers. Similarly, they performed the same four tasks on their mobile phones, and then answered a SUS questionnaire. The average SUS score obtained was 64.3, close to the standard average of SUS.
Although the score was less than the standard average, the participants agreed that the application could significantly help them identify diseases and treatments, compared with searching through manuals and websites. It shows that RiceMan is applicable for use from the participants' viewpoints since it offers helpful and sufficient functionalities for real-life situations even though the current version seems challenging for farmers to use. We summarize our lessons learned as follows: UI/UX design:

1.
The design of the observation data input form should be improved to make it more human-friendly for use and to support various kinds of abnormality appearances.

2.
Since RiceMan is built as a responsive web application, its input form varies among iOS and Android devices. Here, the participants who used an iOS device faced some difficulty as the input form was a picker instead of a drop-down box.

3.
The treatment names shown on RiceMan should be italicized for easy-looking.
The participants preferred to have photos of plant parts to support the observation data input forms and photos of diseases and insects on the result page.

2.
The control names should be translated into Thai and provided with the product/brand names.

3.
Some agronomists were specialists in insects. Therefore, they preferred RiceMan to contain more knowledge of paddy insects.

Conclusions
This study proposes the design and development of RiceMan, an ontology-based expert system for rice disease identification and control recommendation. As part of our development, we make further progress on improving the two ontologies for rice disease in Thailand: RiceDO and TreatO. Together with the composition mechanism, these two ontologies lie at the heart of RiceMan's functionalities. RiceMan facilitates users to identify diseases with respect to RiceDO's TBox axioms and suggests controls with respect to TreatO's TBox axioms. Though both ontologies are initially built for RiceMan, they can be potentially extended for other expert systems in the agriculture domain with some revisions to comply with the adapted problems.
To ensure its usability and practicability, we evaluate the developed ontologies and the expert system in comprehensive viewpoints with four user groups: (1) ontology specialists; (2) senior agronomists; (3) junior agronomists; and (4) scholars/agricultural students. From the ontology specialists' perspective, RiceDO and TreatO are modeled soundly, appropriately, and consistently; hence, they can support the users to write efficient queries. From the senior agronomists' perspectives, the knowledge related to rice diseases is modeled almost completely, but the knowledge regarding paddy insects may be insufficient. The correctness of the modeled axioms is acceptable, but requires some improvement in terms of the vocabulary. Regarding the usability evaluation, the participants found themselves pleasant with the Thai version, even though they faced some difficulties in the observation data input with the current user interface. These results demonstrate that RiceMan is a promising approach to the rice disease identification and control recommendation problem.
There are several future directions. Firstly, RiceDO and TreatO can be enhanced with modeling of axioms from various aspects of information and reliable knowledge sources. Secondly, we aim at collaborating more closely with agronomists at multiple stages of ontology development to clean up some inconsistency that may happen between the adopted knowledge sources and domain experts' tacit knowledge. Besides, we are interested in applying ontology learning techniques to (semi-)automatically model and enhance the constructed knowledge base. Finally, considering potential applications of our ontologies with machine learning for disease identification, we plan to investigate and study the development of latent representation from RiceDO and TreatO. Inspired by [50], each class and property can be trained based on self-supervised learning to find relevant vector representations from the ontologies. In [50], ontologies from BioPortal (https://bioportal.bioontology.org/, accessed on 1 November 2021) are used to develop semantic word embedding for applications of monitoring diabetes and blood pressure (BP) patients. In the near future, we plan to attempt similar directions based on our ontologies for monitoring diseases from a multi-modal context.   Searching for all abnormality groups of a disease. 76 4 Searching for biological controls of a disease. 42 5 Searching for chemical controls of a disease. 48 6 The classification of diseases into possible diseases and warning diseases with respect to considering nearby observations.