Comparison of Three Computational Approaches for Tree Crop Irrigation Decision Support

: This paper explores methodologies for developing intelligent automated decision systems for complex processes that contain uncertainties, thus requiring computational intelligence. Irrigation decision support systems (IDSS) promise to increase water e ﬃ ciency while sustaining crop yields. Here, we explored methodologies for developing intelligent IDSS that exploit statistical, measured, and simulated data. A simple and a fuzzy multicriteria approach as well as a Decision Tree based system were analyzed. The methodologies were applied in a sample of olive tree farms of Heraklion in the island of Crete, Greece, where water resources are scarce and crop management is generally empirical. The objective is to support decision for optimal ﬁnancial proﬁt through high yield while conserving water resources through optimal irrigation schemes under various (or uncertain) intrinsic and extrinsic conditions. Crop irrigation requirements are modelled using the FAO-56 equation. The results demonstrate that the decision support based on probabilistic and fuzzy approaches point to strategies with low amounts and careful distributed water irrigation strategies. The decision tree shows that decision can be optimized by examining coexisting factors. We conclude that irrigation-based decisions can be highly assisted by methods such as decision trees given the right choice of attributes while keeping focus on the ﬁnancial balance between cost and revenue.


Introduction
Globally, the agricultural sector consumes the large majority (70%) of water abstractions [1], nevertheless, the resulting water productivity ranges widely even for similar climates, locations and crops [2]. This extensive variability in crop water productivity under otherwise similar conditions, indicates that well planned agricultural management strategies [3,4] can improve sub-optimal yields. In the face of increasing demand [5] and climate change, both at the global [6] and the local scale [7], pressure on agricultural systems is expected to increase, and water scarcity will become a major limitation for sustainable development, especially in semi-arid regions [8].
Since their advent [9], irrigation decision support systems (IDSS) aim to facilitate "smart" water use that allows water users to decouple water consumption from yields, thus achieving economic growth with a lower environmental footprint [10]. Traditionally, these systems produce irrigation schedules based on deterministic models such as AquaCrop [11][12][13] or IRRINET [14] that eventually rely on the Food and Agriculture Organization (FAO) of the United Nations FAO-56 method [15] to estimate crop yield response to temperature and water availability. Despite the fact that numerous

System Definition
The starting point for the probabilistic decision method is the classification of neighboring olive orchards that share common extrinsic conditions E x such as weather related factors and externalities (e.g., water price, cost per irrigation event, agricultural input costs, yield market value), but may differ in some intrinsic farm-specific conditions I x (e.g., soil type) as well as individual agricultural management strategies S x (e.g., crop management, irrigation pattern). These farms can be classified based on profit P (the difference between revenue R from selling yield Y and costs C x incurred during cultivation) which is a function of the above:

Crop Yield
Under optimal water and nutrient availability, climatic conditions, and crop management practices, crop yield Υ and crop evapotranspiration reach a theoretical maximum. Reference [15] includes these Mathematics 2020, 8,717 3 of 26 parameters in a crop water production function and the respective relative decrease of yield to the relative reduction in evapotranspiration: where Y max [t/ha] and Y a [t/ha] are the maximum and actual yield, and ET c [mm] and ET a [mm] are the maximum and actual evapotranspiration. This is a practical and common approach to estimate crop yield on an operational basis [35] as when other agronomical parameters are considered optimal, ET a can be equated with water requirements covered by precipitation and irrigation, and ET c with the crop's irrigation requirements. ET c is a function of reference evapotranspiration ET 0 [mm] which represents the atmospheric water demand, a crop coefficient K c which accounts for agricultural management, crop type and phenology, and a water stress coefficient K s which accounts for root zone and soil water status [15]: where ET 0 [mm] is typically assessed using physically based methods [36] but under data-scarce conditions, empirical formulas can be employed, such as Blaney-Criddle [37]: where T [ • C] is the mean daily temperature and ξ is the mean daily percentage of annual daytime hours. The crop coefficient K c can be estimated experimentally [38][39][40][41][42][43][44] or found in crop yield response factor datasets which are verified in the field [45]. For example, for olive trees, evapotranspiration (or irrigation), requirements are considered from March to November (Day of year 90-330) and, depending on agricultural management or crop type and phenology, may take different values, as shown in Figure 1.
where [t/ha] and [t/ha] are the maximum and actual yield, and [mm] and [mm] are the maximum and actual evapotranspiration. This is a practical and common approach to estimate crop yield on an operational basis [35] as when other agronomical parameters are considered optimal, can be equated with water requirements covered by precipitation and irrigation, and with the crop's irrigation requirements.
is a function of reference evapotranspiration [mm] which represents the atmospheric water demand, a crop coefficient which accounts for agricultural management, crop type and phenology, and a water stress coefficient which accounts for root zone and soil water status [15]: where [mm] is typically assessed using physically based methods [36] but under data-scarce conditions, empirical formulas can be employed, such as Blaney-Criddle [37]: where T [°C] is the mean daily temperature and is the mean daily percentage of annual daytime hours. The crop coefficient can be estimated experimentally [38][39][40][41][42][43][44] or found in crop yield response factor datasets which are verified in the field [45]. For example, for olive trees, evapotranspiration (or irrigation), requirements are considered from March to November (Day of year 90-330) and, depending on agricultural management or crop type and phenology, may take different values, as shown in Figure 1.
where [m 3 /m 3 ] and [m 3 /m 3 ] are the field capacity and wilting point of the soil, respectively, and is the fraction of potentially available soil moisture that can be depleted from the root zone or depth before moisture stress occurs. Daily soil moisture ( ) can be estimated using a simple water balance model that neglects the terms of runoff and groundwater:  The value of water stress coefficient K s varies daily between 0 and 1 depending on soil moisture content θ [m 3 /m 3 ]: where θ f c [m 3 /m 3 ] and θ w [m 3 /m 3 ] are the field capacity and wilting point of the soil, respectively, and p is the fraction of potentially available soil moisture that can be depleted from the root zone or depth Z r before moisture stress occurs. Daily soil moisture θ(t) can be estimated using a simple water balance model that neglects the terms of runoff and groundwater: On any day, water content θ(t) above field capacity θ f c will drain below the root zone, whereas plant roots cannot extract quantities below wilting point θ w . It is due to this limitation that the farmer needs to consider irrigation scheduling for covering ET c requirements. For a more in-depth description, references [15,48,49] elaborated more on this approach of irrigation scheduling.

Strategies
To take advantage of intrinsic farm conditions and reduce the uncertainty incurred by extrinsic conditions, a farmer can follow various strategies. Here, we consider crop management that can lead to small but significant changes of the crop coefficient K c such as heavy pruning S PH which can effectively reduce the evapotranspiration of tree crops [50,51] versus light pruning S PL . Furthermore, we consider that irrigation scheduling strategies S I can vary. Considering the limitation of θ, the optimal scheduling strategy in terms of covering 100% of irrigation requirements is to take advantage of the soil's water buffering capacity and irrigate a quantity equal to exactly θ f c − θ w Z r when θ = θ w . The frequency of irrigation depends on the time required for θ to reach θ w due to ET c . Therefore, assuming zero precipitation, a farmer is required to schedule exactly ET c / θ f c − θ w Z r irrigation events. Nevertheless, as the farmer may also bear significant costs per water unit or/and per irrigation event, other strategic goals co-exist. Therefore, in an effort to increase profit, or merely achieve water conservation, other irrigation strategies may include reducing the amount of irrigated water per irrigation event by a fraction (i.e., S Ix% ) or reducing the amount irrigation events by n times (i.e., S IE−n ).

Simple Multicriteria Approach
The idea of the multi-criteria approach is to use data which indirectly relate to crop yield [52]. The future crop yield states of a farm consist of the labels to be assigned to each farm after the implementation of alternatives scenarios: For each scenario a j , we will subjectively assign a utility value u ji provided the resulting production yield after its implementation is S i (i = A,B,C,D,E). Utility values express the cost or advantage for each pair of alternative action-future state [53]. The factors defining each scenario's utility are normalized and combined with utility indexes for each potential state: The expected utility referred to alternative j is [23] where p(s i ) expresses the likelihood among farm observations for the specific state. The decision choice will be pointed to the alternative that gave the highest expected utility:

Multicriteria Approach with Posterior Information
We can collect more data regarding future states so we can calculate in a safer way the probability for those events in set S (7). Suppose the elements of the "new" information are grouped together in set C. These elements can be used as posterior information in the previous approach to enhance the marginal probabilities. With the help of conditional probabilities, we can express the probability of posterior information given that a future state is true. These are the conditional probabilities of the type: p(s i |c k ).
That "new" information is often referred to as "imperfect information" because it attempts to express certainty (through conditional probabilities) by approximation in the future states' occurrence [54]. Typically, in decision-making problems, the type of "new" information is chosen to indirectly relate to future states. The more relevant the new information is to the actions we examine, the more reliable the decision can be considered [55].

Multicriteria Fuzzy Approach
The concept of participation in a set is fundamental in the representation of elements in a universe A [18]. The membership of an object in a crisp set is 0 or 1, but in fuzzy sets, we can assign infinite mappings to objects (belonging in universe A) in the range [0, 1]. Fuzzy sets contain objects with ambiguous membership properties [23]. The fuzzy membership function embeds the mathematical formulation of the participation into a set and the representation is [25] The value of (11) is the degree of membership for an element x in the fuzzy set A. Assigning utility values in this manner like in the previous multicriteria subsection cannot be considered totally objective. We can introduce a more objective way to calculate utility for every irrigation scenario by taking into account the interaction among the three aforementioned utility factors as a combination value [56]. We will work with fuzzy logic on the simple multicriteria approach to see how it may alter those results.
First, a fuzzy set F is defined with elements a predefined number of fuzzy states, e.g., excellent, good, moderate, etc.
The challenging part is defining the membership function. It will assign each combination-transition of events a value that describes the degree between [0, 1] which belong to the fuzzy linguistic variables.
In analogy to (9), the expected utility of each alternative is where [23] Equation (13) expresses the concept of the "probability of a fuzzy event". If a fuzzy event, F is crisp: F the probability reduces to [23]

Decision Tree and the ID3 Algorithm
The concept in the multicriteria approach was to use data that are indirectly relevant to crop production. Decision trees are mechanisms for predictive modelling and are destined for supervised data mining that compute data in a multi-variate and tree-like fashion [57]. They result into object classification by splitting the branches of a tree. Each split corresponds to a test based on an attribute or a criterion. Those splits are called nodes and the first node is the root of the tree [58]. When the splitting procedure is terminated, it leaves at the end the terminal nodes or the "leaves" of the tree. The rules for object classification are provided by the branch sequences in the tree [59].
Decision trees give the ability to split a complex decision problem into a set of simpler decisions and classify objects with comprehensible representation [60] and challenging applications [61,62]. A formal definition of a typical classification problem can be described as follows. S is the set of objects in a training set in which each object is described by attributes [60]: The domain of each attribute Ai is represented by a set of discrete linguistic terms [60]: Each object in S is classified by a set of classes [60]: The set of classes can be numerical values, ordered or unordered factors. The construction of a decision tree involves the following decisions [60]: which is the next attribute to split, 2.
when splitting is terminated, and 3.
how to assign terminal nodes to a class.
The ID3 (Interactive Dichotomizer 3) algorithm is a quite popular decision tree induction algorithm [60]. It works with an entropy information measure in order to evaluate the discriminatory power of each attribute. It works well and fast with few attributes and small training data sets and has advantages such as understandable decision rules and an intuitive model [63]. Other famous algorithms like C4.5 and Classification-Regression Tree (CART) have been developed based on ID3 [58,63]. ID3 exploits information gain values for each attribute to build a tree. The attribute with the greatest information gain on the decision is primarily selected [58]. Entropy is calculated as follows. The greatest information gain will lead to the greatest decrease in entropy [60]: where E(S) represents the entropy and p(k) is the relative frequency of class k in set S, i.e., it is the ratio of the objects in class k to the whole set S: In all computations, 0 * log 2 0 is assumed as 0. If decisions for all objects are the same, then entropy will be zero. This means that it is not necessary to split the node on the corresponding decision level [60]. Every time a split is done, S is updated. If E(S) = 0 or no attribute remains to be split, then this branch is terminated. If all branches are terminated, then classification is complete. Let Ti be the set of subsets created from splitting set S by attribute Ai: Information gain for each attribute is computed as follows [60]: and entropy for subset t: At each iteration of ID3 algorithm, the attribute with the highest information gain is selected as the decision level.

Decision Variable Importance
Gini is one of the most widely applied impurity functions, providing a measure of the "goodness-of-split" for decision trees by promoting "splits that allocate a single pure node for the largest class and the rest for the remaining classes" [64]. The Gini index for a node t can be calculated as where c is the number of classes and p(i|t), p( j t) are the estimated probabilities of classes i, j at node t [65-67].

Study Area
The island of Crete, Greece, has a typical Mediterranean island environment, with about 53% of the annual precipitation occurring in the winter, 23% during autumn and 20% during spring, covering most irrigation requirements, while there is almost no rainfall during summer [68,69]. The average precipitation for a normal year is approximately 934 mm or 7697 Mm 3 [70], but the nonuniform distribution in space (a negative gradient of almost 300 mm from west to east and a strong orographic effect) and time (dry summers) makes dry season precipitation a very small but crucial portion of the total supply [71]. Figure 2 and Table 1 show dry season (March to November) weather parameters for three characteristic years-dry, normal and wet-with probabilities of occurrence equal to 20%, 60% and 20%.

Study Area
The island of Crete, Greece, has a typical Mediterranean island environment, with about 53% of the annual precipitation occurring in the winter, 23% during autumn and 20% during spring, covering most irrigation requirements, while there is almost no rainfall during summer [68,69]. The average precipitation for a normal year is approximately 934 mm or 7697 Mm 3 [70], but the nonuniform distribution in space (a negative gradient of almost 300 mm from west to east and a strong orographic effect) and time (dry summers) makes dry season precipitation a very small but crucial portion of the total supply [71]. Error! Reference source not found. and Error! Reference source not found. show dry season (March to November) weather parameters for three characteristic years-dry, normal and wet-with probabilities of occurrence equal to 20%, 60% and 20%.
On the other hand, agriculture is the major water consumer (81.2%), with olive trees (Olea europea, L. cv Koroneiki) being the most widespread crop, accounting for 64.2% of the agricultural land and 86% of all tree crops (35 million trees producing 120 kt of olive oil) [72]. Currently, olive tree irrigation is mainly empirical, often without scientific documentation and guidance. This is due to the resilient characteristic of the olive tree which demonstrates sustainable yield even under rainfall conditions while also showing minimal immediate impacts under over-irrigation [73], but also due to favorable irrigation water pricing. Therefore, even though only 65% of irrigated water covers crop transpiration requirements [74] and there is evidence that 34% can drain under the root zone [75], the majority of farmers do not apply optimal irrigation scheduling that could lead to significant conservation of water resources [76].   On the other hand, agriculture is the major water consumer (81.2%), with olive trees (Olea europea, L. cv Koroneiki) being the most widespread crop, accounting for 64.2% of the agricultural land and 86% of all tree crops (35 million trees producing 120 kt of olive oil) [72]. Currently, olive tree irrigation is mainly empirical, often without scientific documentation and guidance. This is due to the resilient characteristic of the olive tree which demonstrates sustainable yield even under rainfall conditions while also showing minimal immediate impacts under over-irrigation [73], but also due to favorable irrigation water pricing. Therefore, even though only 65% of irrigated water covers crop transpiration requirements [74] and there is evidence that 34% can drain under the root zone [75], the majority of farmers do not apply optimal irrigation scheduling that could lead to significant conservation of water resources [76].

Argicultural Input and Water Cost
For the cultivation of olive trees, agricultural input cost C a [€/ha] was classified into pruning, tilling, pesticide and fertilizer application, and crop picking, which are all functions of the cultivated area and amount to 3000 €/ha [77]. Agricultural water service cost recovery in Crete is among the highest in Greece, with 56.3% of the financial and environmental cost being recovered [78]. This means that in essence, agricultural water is among the most expensive in Greece and in most cases, prices range between 0.05 and 0.23 €/m 3 with an average (normalized per distributed m 3 ) of 0.13 €/m 3 [79] or 13 mm/ha irrigated, depending on water scarcity or quality. Here, we tested three low-to-moderate price scenarios of 0.05, 0.10, and 0.18 €/m 3 assuming that under high demand or climate change conditions, values will increase. Additionally, we considered that each irrigation event causes an additional expense of 50 €/event.

Crop Yield
Although olive trees can sustain adequate yield under low annual water supply, irrigation during the dry period is indispensable when aiming for high yields [80]. In a farm-scale study under various irrigation scenarios, the cv. Koroneiki cultivated in the study area had a fruit yield between 4.8 and 7.6 t/ha (considering a typical plating density of 200 trees/ha) [81]. Information from 35 municipal units of Heraklion (LAU 2 level) from the Hellenic Statistical Authority (HSA) [72] for the period 2002-2007 confirm this measurement, producing a slightly larger yield range ( Figure 3). As this sample was considered more representative and focuses on a relatively recent and financially stable period for Greece (i.e., after entering the eurozone and before the financial crisis), it was used to define the prior probability distribution of yield Y for the case study. Y was divided into five ( Table 2) and three classes (Table 4) for the simple and the fuzzy multicriteria approach, respectively. Moreover, in accordance with Equation (2), the results of [81] show a reduction of Y a by 40% for a rainfed treatment where about 50% of the total ET c requirements were covered. Above a certain threshold, Y a gains are no longer statistically significant. Here, we consider that wet season precipitation covers at least 50% of the annual ET c ; therefore, irrigation strategies refer to a fraction of the remaining irrigation requirements. The unit price of the yield is considered fixed at 2.5 €/ha, based on recent Extra Virgin Olive Oil prices [82].
the simple and the fuzzy multicriteria approach, respectively. Moreover, in accordance with Equation (2), the results of [81] show a reduction of by 40% for a rainfed treatment where about 50% of the total requirements were covered. Above a certain threshold, gains are no longer statistically significant. Here, we consider that wet season precipitation covers at least 50% of the annual ; therefore, irrigation strategies refer to a fraction of the remaining irrigation requirements. The unit price of the yield is considered fixed at 2.5 €/ha, based on recent Extra Virgin Olive Oil prices [82].

Alternative Conditions and Strategies
To incorporate a wider range of alternative scenarios and strategies, the cropping system was modeled for three soil types (Table 4), three climatic conditions (Table 1), two crop management practices for high and low K c , and nine irrigation strategies (Table 3). Regarding K c , values from the Hellenic Ministry of Agriculture [83] were used (two management practices, light pruning/heavy pruning). The irrigation strategies presented in Table 3 are a combination of reduced irrigation with respect to the optimal θ f c − θ w Z r , considering an average Z r of 1.2 m and p equal to 0.6 [49], and reduced irrigation events.   Table 3 shows the different irrigation scenarios comprised of combinations of irrigation amount and frequency relative to the amount prescribed by FAO-56.

Simple Multicriteria Approach
The factors defining each scenario's utility were chosen to be 1.
The relative amount (in percentage) of the disposed water used during the irrigation compared to the full-scale irrigation in order to reach the field capacity (100%, 75%, 50%).

2.
The reduction in frequency (number of irrigation times) during the cultivation season compared to the number recommended for maximum crop yield (recommended n -1, recommended n-2). 3.
The profit from the farm. This is calculated by subtracting the costs of irrigation from the revenue from selling the crop.
The amount and frequency recommendations are provided by Equation (3) and they are calculated based on soil and plant parameters described previously in order to achieve the best yield possible. For the first factor, the full irrigation (100%) is taken as reference. It is considered as the irrigation performed in full scale to the farm which is adequate to reach the field capacity. The frequency refers to how many times it is recommended to visit and irrigate the farm throughout the cultivation season with the proposed water amount. Optimally, we want to have the least amount of water consumed on the farm with the fewest visits for irrigation. The third factor conflicts with the first two and this makes the decision process more interesting. The more we irrigate, the more we spend on water. This may lead to a higher crop yield but also a higher cost, but the profit will reflect this tradeoff. The challenge in this approach was to build the utility values by exploiting relevant information and not pure crop measurements.
Initially, a utility index is attributed to each future state: The values for the three utility factors are normalized in the range [0, 1]. The sum of the three normalized values is multiplied by the corresponding utility index and thus, the utility value is determined [85]: In Equation (22), a non-successive numbering was arbitrarily chosen; thus, patterns with close values on water preservation and irrigation frequency can still hold the lead if they refer to higher profit.
The challenge examined in this paper is to enhance the utility with adjacent parameters which will affect the decision in an objective manner.
The expected utility referred to alternative j is [86] E u j = 5 i=1 u ji ×p(s i ), The decision will then point to the alternative with the highest expected utility: The likelihood of occurrence (p(s i )) for the various crop yields is given by data by the Hellenic Statistical Agency and address olive farms in the island of Crete. The data are presented in Table 5.

Multicriteria Approach with Posterior Information
In order to assist the decision on which irrigation patterns to propose for improving the crop yield, we chose, as "new" information, the climate conditions. Thus, we explore the likelihood of a farm achieving a certain crop state depending on whether it was a wet, normal or dry year. The statistical data for precipitation are presented in Table 6. The expected utility that refers to the alternative j under the condition of "new" information is then: And the maximum expected utility: To determine the maximum unconditional expected utility for each alternative in (26), the marginal probability is used: In Equation (28), r is the cardinality of the "new" information and equals to the number of weather conditions (here dry, normal, and wet, so equal to 3).

Multicriteria Fuzzy Approach
For the olive farms, a fuzzy set F is defined with elements the following three fuzzy states: • Excellent application (Fuzzy Element 1).

• Good application (Fuzzy Element 2). • Moderate application (Fuzzy Element 3).
The next step is to choose the factors which will form the meaning of the fuzzy states. Then, we can assign numerical values to the irrigation scenarios depending on how they perform in relation to the verbal fuzzy variables. The idea was to take into account the estimated profit, the relative irrigation and the trips reduction factors and build a scoring matrix.
Profit is divided into three groups based on the values calculated for all possible cases (low, medium, and high). Next, scoring is done by summing the scoring values for the profit, the relative irrigation water amount and the reduction in frequency of irrigations. The optimal case is that where we have a maximum increase in yield with the 50% preservation in water and a reduction by 2 on the optimal irrigation times while we achieve a high profit. The relative irrigation amount is grouped into three groups-50%, 75% and 100%-and the possible scorings span from 1 to 3, respectively. The same goes for the trips reduction times: n-0→1, n-1→2, n-2→3, and the profit: low→1, medium→2, high→3. Thus, a scenario which preserves 75% of water will add 2 to the total score. If it achieves that with a reduction of irrigation events by 2, another 3 is added to the total score. If it achieves low profit after applying the irrigation scenario, only 1 will be added. Note that in the absence of more information, scoring here is assigned arbitrarily but in a real circumstances, the water manager may assign asymmetric scores.
The above values represent the scoring of each case, which will shape the membership in each fuzzy variable. The lower and upper value for each fuzzy variable define the intervals which will form the boundaries among those states. Because scoring takes into account profit, relative irrigation amount, and reduction in irrigation frequency, the fuzzy variables' definition is represented by the combination of those three factors. The cardinality of occurrence among the scoring values as well as minimum and maximum scorings are important in defining the numeric participation ranges. Those will express the values where a combination is or is not a match for each verbal fuzzy state. From Table 7, we observe that most occurrences are in the scoring range between 5 and 7. Those scores will be considered as "good" performances. The rest of the scoring in the lower and upper space will reflect "moderate" and "excellent" scorings, respectively. The numerical ranges which represent exact matches for the linguistic fuzzy variables are given in Table 8. Table 7. Fuzzy Scorings.

Profit
Irrigation Strategy 50%/n-2 50%/n-1 50%/n 75%/n-2 75%/n-1 75%/n 100%/n-2 100%/n-1 100%/n  (7,9] The degree of membership of an element in each fuzzy variable will vary depending on the association of its score and the exact verbal match of the given fuzzy variable. For example, an application with a score of 5 is a match for a "good" application but in the fuzzy world, it will have a smaller degree of membership in the neighbor fuzzy variables. For this example, a small participation in the "moderate" fuzzy set is expected but a scoring of 5 is not expected to belong in the "excellent" set. The membership function we will choose must reflect this logic. Many popular membership functions are used in fuzzy decision making (triangular, trapezoidal, Gaussian) [87]. We chose the triangular membership function for its simplicity but also because we have a small non-continuous numerical set of scores. The triangular membership function will assign lower memberships in neighboring fuzzy variables for a given score and will also assign the highest membership to scores which are in the middle of the range for each fuzzy state [19]. After forming those intervals, we use the triangular membership function [19] to determine values for the scenarios in linguistic terms.
The assignment of utility values will now be made on the three fuzzy future states of the set F in a similar way as for the non-fuzzy states in (22): Equation (32) indicates that utility is doubled for "good" applications and is tripled for "excellent" applications. No special weighting was assigned to the fuzzy states in order to let the fuzzy scoring operate and result into utilities with a more objective manner than the simple multicriteria approach. Based on (12), the fuzzy utility matrix will be calculated.

Shortcomings of Probabilistic Approaches
In the previous sections, we looked into probabilistic approaches to the problem which mainly exploit statistical data. Conditional probabilities can help examine the utilities given the new facts. Multicriteria approaches while exploiting posterior information are used in diverse areas such as public health, energy conservation and economics [24,[88][89][90]. Suppose we are looking for the chances of achieving crop yields in various weather conditions. If the event of achieving a high yield is P(H), the event of applying medium irrigation is P(I) and the event of a wet period is β, then the combined conditional probabilities can be expressed as [55] P(H I, β) = P(H|C), We assume that so, and therefore, We will be needing the cases where all the events co-occur and also cases which show what irrigation plans are independently chosen in the occurrence of the possible rain conditions. This logic can be expanded to accommodate additional attributes. We will be needing accordingly the likelihood where those events co-occur. But when we look at a specific irrigation combined with precipitation, we understand that those events are not mutually exclusive, nor are they independent.
This means [54] that P(I ∩ β) ∅ and P(I | β) The likelihood of applying an irrigation scheme is affected by the likelihood of rain and the disadvantage is that the approach lacks the ability to present a correlation of the attributes. It is not necessarily about prioritizing and weighting the effect one attribute can have on the result but examining sequences that cause actions that depend on the occurrence of an attribute. Let us examine the method of decision trees on our problem which allows us to examine multiple attributes together [58] and also build paths that lead to classifying selling profit given the circumstances which can occur. The goal is to achieve a classification of examples that will point to the proper decision in a given combination of parameters [58].

Decision Tree and the ID3 Algorithm
For the irrigation decision problem, the following attributes will be examined: 1.
Soil type. Possible soil types are loamy sand, sandy loam, and clay.
Management practices. They are chosen to be: M1 heavy pruning and M2 light pruning. Tree pruning may bring down the total production, but it is a wise choice during a dry year (low in precipitation).

4.
The irrigation amount as a percentage of the recommended amount per irrigation event.

5.
The reduction in irrigation events related to the recommended.
The class on which classification is performed is profit rather than crop yield since, as discussed in Equation (1) and Sections 3.2 and 3.3, profit is also a function of water cost, yield price and other operational costs. Thus, crop yield alone is not representative of how well we did in our decision process. The idea is to study multiple attributes together and how their selection leads to a more refined decision compared to evaluating them independently [91].
Finally, to quantify the sensitivity of the decision tree to each variable, the tree was trained over 1000 iterations presented with only 70% of the samples each time. The Gini importance, quantifying the importance of each variable, was calculated for each resulting tree. To allow a comparison between iterations, for each run Gini, importance values were normalized so that their sum totaled 100. Table 9 shows the results for the simple multicriteria approach based on Equations (22)- (24), where the last strategy is the optimal. The reason this strategy has high utility is that it performs really well regarding the three multicriteria factors. It consumes the lowest amount of water combined with the fewer trips to the farm. Although it spends the least amount of water, it causes medium crop yield and this is reflected on the profit. Moreover, scenarios with a lower irrigation amount and irrigation event frequency values (5 and 6) perform very well. This shows that we can achieve satisfying profits without using high irrigation amounts and without frequent irrigation events. The three first scenarios share the same low utility, showing that regardless of the amount, not reducing the frequency is not favored in decision making.

Multicriteria Approach with Posterior Information
With the help of conditional probabilities (Table 10), we can express the relation of estimated crop yield and rainfall. The probability p(s i |c k ) expresses the probability of the farm yielding to a crop label when the specific precipitation degree is observed. In this case study, the estimates were produced using Equation (3).
The variation of results should be mathematically represented in every column by displaying a distribution of the probabilities values among more than one crop yield label. Those probabilities (which express estimates) should therefore be less than 1 and be allocated to the adjacent crop yield labels where the farm is about to move to [55]. The sum of each column's probabilities is equal to 1 [55]. Table 11 shows the results. We notice that the last alternative would be chosen again. Scenarios 5 and 6 still perform very well. However, we observe that scenarios with no water preservation in mind like scenarios 1-3 are left behind with a higher distance from the winners. This shows that the use of rainfall as posterior information helped into distinguishing in a more clear way the best from the worse scenarios.

Multicriteria Fuzzy Approach
In Table 12, the membership values for each one of the 27 irrigation per yield combinations are presented. Based on (13), the probabilities for the fuzzy events (Table 13) are calculated:

Profit
Irrigation Strategy 50%/n-2 50%/n-1 50%/n 75%/n-2 75%/n-1 75%/n 100%/n-2 100%/n-1 100%/n Low A sum of 1 for the three fuzzy probabilities verifies that calculations are correct [18]. The fuzzy expected utility values are calculated based on Equation (12) and are presented in Table 14, together with the expected utilities from simple multicriteria approach. Scenario 9 is the preferred application in both cases. Scenario 9 is the preferred scenario in both approaches. The ranking order is preserved in both cases. The distance, though, is denser among the fuzzy utilities group, showing that various irrigation scenarios are competing strongly with each other. Scenarios 5 and 6 still have high utility.

Decision Tree and the ID3 Algorithm
The logic presented in Section 2.7 along with the specific feature engineering selection [91] produced an exhaustive table of 486 rows (sample shown in Table 15) where the combinations of the above attributes are examined. In the table below, a sample of the training data is given. The entropy and information gain was calculated for each attribute in order to set the hierarchy and the splitting nodes of the decision tree. Figure 4 shows the decision tree obtained by ID3 algorithm for relative irrigation equal to 75% (Figure 4a) and 100% (Figure 4b) of the recommended. The results for 50% relative irrigation are omitted from Figure 4 as they only lead to low profit (L).  From the decision tree diagram, we obtain classifications for the route of actions we should follow when certain conditions occur. Those will determine the degree of profit we expect. The choice The Gini importance for each variable was estimated and normalized. The results are presented in Figure 5, ranked by overall importance. In the specific case study, irrigation-related decisions have the highest importance, with the amount of irrigation per event carrying 57.5% of the weight of the decision towards achieving a high profit. Water price, weather and soil texture follow with median values between 5.9% and 9.3%, whereas pruning has a negligible effect on the decision. The Gini importance for each variable was estimated and normalized. The results are presented in Figure 4, ranked by overall importance. In the specific case study, irrigation-related decisions have the highest importance, with the amount of irrigation per event carrying 57.5% of the weight of the decision towards achieving a high profit. Water price, weather and soil texture follow with median values between 5.9% and 9.3%, whereas pruning has a negligible effect on the decision.  From the decision tree diagram, we obtain classifications for the route of actions we should follow when certain conditions occur. Those will determine the degree of profit we expect. The choice of profit rather than yield as the decision goal contributed to challenging the decision process because it is shaped by the cost and the revenue. An example of classification is: "During a wet year in a farm with clay soil type, we are going to have a low profit if we adopt a 75% relative irrigation and reduce the recommended number of irrigation events by 2." These decisions determine the degree of profit we expect given certain conditions. If irrigation is kept at a low level (50%), farm states or management subsequent decisions do not affect the final outcome which is always a low profit. In a sense, this can be considered as a "doing as little as possible, as much as necessary" approach in which the farmer refrains from investing effort and capital but also avoids any risk failing. This option can be especially attractive when there is lack of information (e.g., little knowledge of soil properties, crop behavior) or high uncertainty (e.g., market volatility, weather extremes). For example, one can irrigate at 75% per trip and still get low yield if the wrong management decisions are made. As irrigation volume per trip approaches the optimum (75% and 100%), alternatives become more complicated.
Sometimes, key factors will make no difference to the anticipated profit. For instance, we notice that in a 100% irrigation amount which is divided into the lowest number of trips, we never achieve a high profit no matter the soil type, precipitation or the management practice applied. Other factors, e.g., soil type and management practice, make a difference under specific climate conditions, e.g., a sandy loam farm with heavy pruning during a dry year will give a high profit.
An alternative reading of the decision tree is bottom-up, as the most relevant question is: "What irrigation strategy do I need to apply in order to achieve the best profit?" Thus, we would expect "relative irrigation" and "irrigation events reduction" to be the lower level nodes in the decision tree. But the case is that we placed "profit" as the classification column (Equation (17)) in decision for our training set of data. This means that the profit is the endpoint in our decision process, but it should be considered as an outcome rather than as an action we can take or apply to the problem. The reason the data were hierarchized like this is the means we had in order to produce them (the knowledge base reflected on Equation (3), the irrigation strategies modelling and cost-revenue estimations). In a reverse approach, we could have observed resulting profits from a known group of farms after a complete season and would have examined which attributes were applied in each and every case. All the parameters analyzed in Section 2.3 contribute to the calculations in order to formulate and give, as a result, the expected profit. That is why the decision tree resulted this way after processing the training data. We could have achieved a different structure of the tree if we had our training obtained in a different way. If we had statistical measurements of farms with known profit numbers and irrigation strategies, we could train the tree accordingly. Nevertheless, the takeaway after studying the tree nodes and paths is that we should not try to interpret it using a top-down approach. This can lead to paradox assumptions like, for instance, "How is it possible to begin examining if we should adopt a specific relative irrigation and yet we don't know during the process what is the soil type or the climate?" The trick is to deal with the classifications as combinations presented in the tree. This way, we will know how to act given conditions we cannot control like the weather or the soil type of our farm. What we can control, depending on the profit we want to achieve, are the irrigation schemes (amount and number of trips) and the management practices. It is apparent that the two major role players are the relative irrigation and the irrigation trips and that is why they are the top nodes.
To further analyze the decision tree shape, we owe to explain that throughout the training data set, the amount of irrigation as well as the number of trips were those proposed by the calculations as "optimal", given the parameters regarding weather and soil type each time. This means that we are examining a sort of "normalized" samples of irrigation schemes among the different soil types and weather conditions. This strengthens the importance of examining a path in the tree as coexistence of conditions in order to evaluate our decision options and define our course of action.

Limitations
The above method can be easily applied to other crops that have been parametrized for the FAO-56 equation. While FAO-56 is robust, it also has inherent limitations in the estimation of actual crop evapotranspiration [92]. Here, apart from the FAO-56 equation parameters, we can consider pruning, water price and yield price as decision parameters. In reality, there is a wide range of factors that can be accommodated by the model and have been used by other authors, e.g., nutrient pollution [93], topography, soil degradation and tree density [94], remote sensing information [95,96], tree hydration status [97], tree trunk diameter [98] pests [99], and socioeconomic parameters [100]. Moreover, under current assumptions and implementation, the decision tree shows obvious routes. When different prices of water and crop interact with each other, the decision tree will reach a point at which will not give such obvious paths anymore and there may be paths which lead to loss instead of profit. For example, under fixed extreme weather and water price scenarios, other variables may emerge as important or sensitive factors.

Conclusions
The probabilistic simple and fuzzy multicriteria approaches gave a ranking of the given irrigation strategies and displayed that a high utility can be found in scenarios which dictate low water consumption with fewer irrigation events while leading to higher crop yield and profit. Competing factors in fuzzy scoring (total profit versus the cost of irrigation water) were important for producing a more objective fuzzy engine. The probabilistic methods also displayed the shortcoming of not being able to exploit multiple features relevant to our case in the decision process. The decision tree approach showed that we can get a more refined proposal set when taking into consideration relevant factors like the weather and the soil type. This approach is recommended over the others presented and as it embraces a broader set of attributes that remain relevant to the decision and can provide tools which can receive higher adoption than traditional IDSS [17,32,33]. The methods and strategies examined are considered as more suitable for Mediterranean islands or across the globe, where water scarcity is a limiting factor.
Challenges to be investigated in future work include focusing on the production cost which depends heavily on water price. Under current and foreseen water stress conditions, an increase of agricultural water price is a very realistic and will certainly affect the profit, rendering the decision process an even more useful tool in the farming economy. Additionally, the price of olive oil in a selected region like the Mediterranean is adjusted each year and depends, among others, on the production levels. The decision system can point to wise decisions given the circumstances which can help farmers to survive a crisis. The importance of the correct decision will have a bigger impact then because the sustainability of the farming economy and water preservation with regard to environmental caution will be at a high stake.
Finally, we will analyze the behavior of a fuzzy ID3 implementation [60,101] to obtain flexible thresholds to the attributes that change the path structure according to the degree of fuzziness applied.