Indicative Value of the Dominant Plant Species for a Rapid Evaluation of the Nutritional Value of Soils

A study was conducted on 14 grassland communities located in the south of the Iberian Peninsula and their edaphology, which is identified as specific plant associations. The edaphic study of each association allows a rapid evaluation of the nutrient content in the soil without the need for laboratory edaphic analysis. For each phytosociological relevé and soil, samplings were carried out. The field data were subjected to various statistical analysis—canonical correspondence analysis (CCA), Bayesian networks, and decision trees—to establish nutrient content. When the abundance value of the species is 9 in the Van der Maarel scale, there is an increase in the values of several soil parameters. In the case of Hordeum leporinum, when the Van der Maarel index is 9, the Kc (exchangeable potassium in cmol/kg) undergoes the greatest variation, to a value of up to 0.729 cmol/kg. The application of the decision tree to this species reveals that the soil attributes with the greatest influence in the classification are conductivity, %_si (silt texture), pH, and pF 15 atm (pressure at 15 atmospheres (water retention capacity) in %). Indeed, this interlaced edaphic and phytosociological study provides us with a high-value tool to obtain quick information on the content of nutrients in the soil.


Introduction
The great diversity of Mediterranean grazings and their use in livestock and agriculture has been analyzed in olive areas in Spain, Italy, and Portugal, with a focus on the Iberian Peninsula [1][2][3]. Pastures and grasslands are relevant as plant covers, because they prevent soil erosion [4]. Each plant association of pasture or grassland has few or more dominant species thanks to the type and quantity of soil nutrient [5]. Hence, the interest of knowing these plant communities relates to fast and correct agricultural management, enabling the avoidance or reduction of the economic cost of laboratory analysis as well as the use of fertilizers and herbicides. The great abundance of floristic and phytosociological studies [6][7][8][9][10][11][12][13][14][15], and the lack of studies that correlate plant associations with soils, allows proposing a new applied model, through which quick information on the nutritional status of soils is obtained. the network, and the values associated to other variables are recalculated using the Bayes theorem as the basic operator. This is what is called "evidence propagation". Thanks to the principles of conditional dependence and independence codified by the structure of the graph and the algorithms developed to operate on them, the calculations can be made much faster than if the whole set of variables in the model needed to be processed.
There are currently different software packages available to create models of Bayesian networks. In our case, we used the program Bayesia Lab 3.1 (http://www.bayesia.com). The classifier used to characterize each node was naive Bayes, as this assumes a strong independence between the characteristics in the classification process, which is the process of assigning a class mark to each instance (in our case plant species) described by a series of attributes (which in our model correspond to soil parameters).
In this work, we demonstrate how Bayesian networks can act as an exploratory tool to anticipate and recommend a formula for more effective future decisions for managing vegetation cover in sustainable agriculture [23].
Although several authors have worked on edaphic nutrients [35][36][37][38], there are only a few studies in which the phytosociological method for studying plant associations connected with the use of artificial intelligence: so, this study is really new. Consequently, knowledge of the ecology, structure, function, and floristic composition allows us to know the composition of the soil with high detection. The aim of this work is to demonstrate the value of plant associations as nutrient indicators so that they can be used in a sustainable ordination of crops by seeking the optimum nutrient values. Through pasture or grassland plant associations, we can know exactly whether or not there is a lack of any nutrient in the soil. This is important in agricultural management, so as not to abuse the application of fertilizers to the soil, which in most of the cases end up polluting lakes, rivers, swamps, and the agricultural product.

Materials and Methods
This work considered 14 plant-community types: AcH*: Anacyclo clavati-Hordeetum leporini; ArH*: Anacyclo radiati-Hordeetum leporini; BH*: Bromo scoparii-Hordeetum leporini; LR*: Linario spartei-Raphanetum raphanistri; PD*: Papaveri rhoeadis-Diplotaxietum virgatae; ArP*: Anacyclo radiati-Papaveretum rhoeadis; ArChr*: Anacyclo radiati-Chrysanthemetum coronari; RChr*: Resedo albae-Chrysanthemetum coronari; UM*: Urtico urentis-Malvetum neglectae; PA*: Plantago bellardi-Aegilopetum geniculatae; TT*: Trifolio cherleri-Taeniatheretum capitismedusae; TP*: Trifolio cherleri-Plantaginetum bellardii; FS*: Fedio cornucopiae-Simapietum mairei; and CS*: Carduo bourgeani-Silybetum mariani (* represents the dominant species in the association, according to [39]). Each community was selected to be both physiognomically and ecologically homogeneous, and each contains one or more dominant species. For sampling, ecologically homogeneous plots were selected in grazing areas and olive groves in Jaén (Spain) and Évora (Portugal). The sampling is decided by the homogeneousphysiognomic character of the community, ensuring that the samplings were distributed in the study area (see distribution map). Regarding the edaphic sampling, a soil sample of 1 kg was extracted, while the measurements of the root system of the dominant species were previously carried out. The plot selection was made at those points where the main species of the plant association was dominant; next, the minimum area was obtained to establish the size of the plot, which ranges between 0.5, 1, and 2 m 2 , depending on the type of plant association. The sampling was repeated 20 times in different areas for each plant association, all with the same dominant species, for a total of 280 botanical relevés following the Braun-Blanquet [39] abundance-dominance index, which was subsequently transformed into Van der Maarel indices [40]. The edaphic sampling was carried out taking into account the depth of the root system of the dominant species and therefore of the rhizosphere. These samples were analyzed in the agri-food laboratories of Córdoba and Granada, according to the protocol established by the Institute for Research, Agricultural and Fisheries Training (IFAPA). Twelve soil parameters were analyzed in the laboratory for each of the 20 samples per plant association, as follows: cation-exchange capacity (CEC; cmol/kg); carbonates (%); assimilable phosphorus (Pa; p.p.m.); exchangeable magnesium (Mgc; cmol/kg); total nitrogen (Nt; %); exchangeable potassium (K; cmol/kg); pF 15 atmospheres (%); clay texture (%_cl); silt texture (%_si); sand texture (%_sa); EC: conductivity in dS/m; pH. The data used in this work are partially coming also from [22,23]. These 12 parameters are the most representative, according to [16] (Tables 1 and 2). A total of 280 soil samples were analyzed: one for each botanical relevé. The plant species that were present only in the plant association were eliminated from the subsequent statistical analysis to avoid noise ( Figure 1).  relevés following the Braun-Blanquet [39] abundance-dominance index, which was subsequently transformed into Van der Maarel indices [40]. The edaphic sampling was carried out taking into account the depth of the root system of the dominant species and therefore of the rhizosphere. These samples were analyzed in the agri-food laboratories of Córdoba and Granada, according to the protocol established by the Institute for Research, Agricultural and Fisheries Training (IFAPA). Twelve soil parameters were analyzed in the laboratory for each of the 20 samples per plant association, as follows: cation-exchange capacity (CEC; cmol/kg); carbonates (%); assimilable phosphorus (Pa; p.p.m.); exchangeable magnesium (Mgc; cmol/kg); total nitrogen (Nt; %); exchangeable potassium (K; cmol/kg); pF 15 atmospheres (%); clay texture (%_cl); silt texture (%_si); sand texture (%_sa); EC: conductivity in dS/m; pH. The data used in this work are partially coming also from [22,23]. These 12 parameters are the most representative, according to [16] (Tables 1  and 2). A total of 280 soil samples were analyzed: one for each botanical relevé. The plant species that were present only in the plant association were eliminated from the subsequent statistical analysis to avoid noise ( Figure 1).

Statistical Analysis
To verify the influence of the independent variables (edaphic factors), which condition the presence, absence, and abundance in each of the dependents (species), we applied a canonical correspondence analysis (CCA) in order to establish the influence of the various parameters (12 parameters, see Table 2) in each species and site (communities). This was done using the XLSTAT 2009.3© program (B 429 102 767 RCS Paris-France).

Bayesian Statistics and Decision Trees
After obtaining the parameters with the greatest influence on each species, we selected the species to which to apply a classification analysis by expert systems (decision trees) according to Braun-Blanquet [39]. Decision trees are prediction models from the realm of artificial intelligence that are used to classify plant-soil relations. Programs that simulate human reasoning are known by the name of "expert systems".
To obtain the plant-soil relations, we also created decision trees for some of the most representative species, now using expert systems procedures. As mentioned in the Materials and Methods section, the information gain was first obtained for each one as an assessor of attributes before subsequently launching the classification algorithm (J48) to create the tree.
One of the techniques we followed to classify and represent plant-soil relations was to create decision trees. These trees were created by obtaining the Information Gain (IG) for each species, which is defined in [17] as the amount of information provided by a soil attribute for a particular plant species. Thus, we can deduce that the higher the value of IG, the more information is provided by the soil parameter to delimit the abundance of the species, and the greater the number of soil parameters with IG, the more demanding is the species in the choice of soil in which it grows [41].
A decision tree has certain inputs, which may be an object or situation described by means of a set of attributes. Based on these, it returns a response derived from the inputs that can be used as a decision criterion. The inputs and outputs can take either discrete or continuous values.
In our work, the decision trees were obtained using the data mining program Weka (http://www.cs.waikato.ac.nz/ml/weka/). This was done by first preparing a series of data matrices with all the relevés from a country, but for each matrix including only the Van der Maarel indices for a chosen species and all the soil attribute values. Then, the matrix was uploaded to Weka, and information gain was obtained as the evaluator of the attributes; the value of this parameter indicated the information provided by each soil attribute according to the presence of the species in question. Then, we launched the classifier (in our case, the algorithm C4.5, -J48 in the Weka software) to build the decision tree, but including only the relevant soil attributes; that is, those for which the information gain was different from zero, and which therefore provided certain information and avoided noise.
Thus, each of the decision trees built runs a customized test of all the leaves to reach the decision. The decision trees shown contain internal nodes, leaf nodes, and arcs. An internal node contains a test of some values of one of the properties (in our case, the range of soil attribute values); a leaf node represents the value that will be returned by the decision tree (Van der Maarel index for the species in question); and finally, the branches show the possible path available according to the decision made [17].   (Table 3). Table 3. Soil parameters of the herbaceous communities in the study. CEC = cation-exchange capacity in cmol/kg; Nt = total nitrogen in %; Pa = assimilable phosphorous in ppm; Mgc = exchangeable magnesium in cmol/kg; Kc = exchangeable potassium in cmol/kg; %_Carb = carbonates in %; %_cl = clay texture in %; %_sa = sand texture in %; %_si = silt texture in %; pF 15 atm = pressure at 15 atmospheres (water retention capacity) in %; EC: conductivity in ds/m; pH. The edaphic analysis obtained from the studied associations reveals a variability in the value of the different edaphic parameters. The values obtained in Table 3 represent mean values of the 20 edaphic samplings carried out for each association: the associations being dominated by Chrysanthemum coronarium, Hordeum leporinum, Malva neglecta, and Silybum marianum, which have high values of total nitrogen, assimilable phosphorus, assimilable potassium, and pH. However, the associations LR, TT, and TP that are located in siliceous substrates present low pH and very low assimilable phosphorus values.

Plant
In order to determine which parameters have a direct influence on each of the dominant species in each association, we established a prior canonical correspondence analysis among the dominant species in the 12 independent variables (soil factors). Obviously, the dominant species used are located in optimal edaphic environments. However, these same species can be companions in other associations due to other edaphic conditions. Figure 2 shows that texture, pH, EC (conductivity), and OOM are the parameters that have the highest loading on species presence and abundance. The plant associations TP* Trifolio cherleri-Plantaginetum bellardii (Spain), TT* Trifolio cherleri-Taeniatheretum caput-medusae (Spain), LR* Linario-Raphanetum raphanistri (Spain), and PA* Plantago bellardi-Aegilopetum geniculatae (Portugal) are at the right of Figure 2, and %_ar, together with acid/neutral pH, are the variables that condition their presence. In contrast conductivity, OOM, basic pH, carbonate etc., condition the rest of the associations (left side of Figure 2).  Table 2).

Bayesian Analysis
We created directed acyclic graphs and tables of probabilities for the species that give the name to each association, and for which they provide more information, considering that by selecting the highest Van der Maarel index for that species, the values for the soil parameters studied are located at clearly delimited intervals in over 80% of cases.
In the case of the association Anacyclo radiati-Hordeetum leporini, Hordeum leporinum initially has a Van der Maarel index of 9 in 68.25% of cases, 8 in 20.63%, and 7 in 11.11% of cases. With these species' abundances, the soil parameters move within specific ranges: conductivity has values of 0.620 mmhos/cm or less in 91.67% of cases; Nt has values of  Table 2).
The following plant associations are those that have served to establish the edaphic attributes (nutritional value of the soil): TP* belongs to the phytosociological alliance Tuberarion guttatae, which is described as pure grasslands developed in lithosols of acid pH, with low nitrogen content and in organic matter; TT* and PA* have been included in Taenatherio caput-medusae-Aegilopion geniculatae (to this phytosociological alliance belong associations with variable pH and with average values of nitrogen and organic matter); while LR* is included in the Alysso granatensis-Brassicion barrelieri phytosociological alliance, which is differentiated from the previous ones by needing soils with some sand content [42].
The plant associations AcH*, ArH*, BH*, PD*, FS*, and ArP* with abundance of the Hordeum leporinum species are included in the Hordeion leporini phytosociological alliance, which brings together grassland plant associations developed on acidic and basic pH soils with content in nitrogen and organic matter greater than those of Taeniathero-Aegilopion geniculatae. The phytosociological alliance Hordeion leporini dominates most rainfed crops, especially olive groves, as well as the UM* and RChr plant associations that belong to the Malvenion neglectae phytosociological alliance [25,26]. Finally, plant associations such as CS*, belonging to the Silybo-Urticion phytosociological alliance, appear in the crops with low frequency, which includes the plant associations that present the greatest amount of organic matter.

Bayesian Analysis
We created directed acyclic graphs and tables of probabilities for the species that give the name to each association, and for which they provide more information, considering that by selecting the highest Van der Maarel index for that species, the values for the soil parameters studied are located at clearly delimited intervals in over 80% of cases.
In the case of the association Anacyclo radiati-Hordeetum leporini, Hordeum leporinum initially has a Van der Maarel index of 9 in 68.25% of cases, 8 in 20.63%, and 7 in 11.11% of cases. With these species' abundances, the soil parameters move within specific ranges: conductivity has values of 0.620 mmhos/cm or less in 91.67% of cases; Nt has values of 0.130% or less in 44.05%; Pa has values of 13.575 p.p.m. or less in 58.33%; and Kc has values of 0.729 cmol/kg or less in 48.81% of cases.
When we establish the prior evidence that this species has a Van der Maarel index of 9, it can be seen that it is mainly the proportion of Kc that shows the widest variation, with 63.37% of cases with values of 0.729 cmol/kg.
In the association Carduo bourgaeani-Silybetum mariani, the species Silybum marianum has the following proportions with the following Van der Maarel abundance indices: a Van der Maarel index of 2 in 10.83% of cases; 3 in 5.83%, 5 in 20.83%, 7 in 5.83%, 8 in 10.83%, and 9 in 45.83%. In this case, the absolute of Silybum marianum with an abundance value of 9 brings us to 45.83%. For these species, abundance values and these proportions, the soil parameters studied move within certain ranges; it is significant that Pa values in 61.25% of cases are 30.750 p.p.m. or less, Kc values are 1.067 cmol/kg or less in 56.25%, Nt values are 0.162% in 86.25%, and OOM values are 2.435% or less in 82.25% of cases. After propagating the prior evidence that S. marianum has an abundance of 9 in 100% of cases, we can see how the percentages for the aforementioned soil attributes increase considerably for the same values.
With regard to the association Trifolio cherleri-Plantaginetum bellardii, the species Plantago bellardii initially has a Van der Maarel index of 3 in 36% of cases, 5 in 16%, 7 in 31%, 8 in 6%, and 9 in 11%. With the abundance distribution of this species, the soil parameters appear in specific ranges, so in 41.25% of cases, the value of Kc is 0.144 cmol/kg or less, Pa is 0.9500 p.p.m. or less in 86.25%, %_ar is over 86.255 in 73.448% of cases, and pF 15 atm is 5.285 or less in 86.25% of cases. If we select 100% of the cases in which the species has a Van der Maarel index of 9, then Pa, Nt, and OOM increase the percentage of the previously mentioned values, although in 47.73% of cases, Kc has values of 0.185 cmol/kg or less.

Decision Trees
We did not build a decision tree in cases where values for Information Gain (IG) equal to 0 were obtained for all the attributes in a species, only in cases in which IG was more than zero.
Based on the results of our previous analyses, we selected some species to make their decision trees: Plantago bellardii, Taeniatherum caput-medusae, Hordeum leporinum, and Glebionis discolor.

Plantago bellardii
The parameters with the greatest influence for the classification of the species P. bellardii are conductivity, %_sa, pF 15 atm, pH, carbonates, Kc, CEC, pF 13 atm, Mgc, %_si, %_cl, and Pa. The parameter with the most influence is conductivity. The species is not present on its tree for values of pF 15 atm of over 5.06%, pF 15 atm of 5.06% or less, conductivity of 0.07 mmhos/cm or less, Mgc of 0.694 cmol/kg or less, %_cl of 5.65% or less. P. bellardii has an abundance of 9 in the Van der Maarel indices.

Taeniatherum caput-medusae
In the case of Taeniatherum caput-medusae, the parameters that have the greatest influence on the classification of this species according to the information gain are carbonates, with a value of 0.239, followed by conductivity, pH, Pa, Kc, pF 15 atm, and %_cl. The decision tree shows that this species is not found when conductivity acquires values of over 0.06 mmhos/cm, Tcm = 0, but if it has values of 0.06 mmhos/cm or less and %_cl of 40.77% or less, the abundance value for this species rises to 8 in the Van der Maarel index; if %_cl is over 40.77%, Kc over 0.225 cmol/kg and Pa 7 p.p.m. or less, the species abundance is Tcm = 7.
The soil attributes ordered by information gain for the species T. caput-medusae are 0.239 carbonates; 0.218 EC (conductivity); 0.211 pH; 0.184 Pa; 0.142 Kc; 0.123 pF 15 atm; and 0.12 %_cl (Figure 4). Decision tree for Pb = Plantago bellardii All. The numbering inside the boxes represents the abundance-dominance index converted to Van der Maarel [40]. The other numbering corresponds to the range of values of the edaphic attributes.

Figure 4.
Decision tree for Tcm = Taeniatherum caput-medusae. The numbering inside the boxes represents the abundance-dominance index converted to Van der Maarel [40]. The other numbering corresponds to the range of values of the edaphic attributes.

Hordeum leporinum
The soil attributes that have the greatest influence on the classification of these species are conductivity, %_si, pH, and pF 15 atm, with conductivity exerting the greatest influence. The decision tree for this species shows that different paths can be taken to reach a Van der Maarel value of 9 for Hordeum leporinum, so that if we start with a %_si of over 72.17%, we reach an abundance of 9 for H. leporinum (Van der Maarel), and if %_si is over 53.81% and less than or equal to 72.17%, and conductivity is over 0.24 mmhos/cm, this species is not present. The other methods for obtaining the maximum abundance value for this species is the path of %_si < = 53.81.

Hordeum leporinum
The soil attributes that have the greatest influence on the classification of these species are conductivity, %_si, pH, and pF 15 atm, with conductivity exerting the greatest influence. The decision tree for this species shows that different paths can be taken to reach a Van der Maarel value of 9 for Hordeum leporinum, so that if we start with a %_si of over 72.17%, we reach an abundance of 9 for H. leporinum (Van der Maarel), and if %_si is over 53.81% and less than or equal to 72.17%, and conductivity is over 0.24 mmhos/cm, this species is not present. The other methods for obtaining the maximum abundance value for this species is the path of %_si < = 53.81.  According to the information gain for the soil attributes, conductivity has the greatest influence on the classification of Glebionis discolor. The participation of the rest of the soil parameters and their different abundance values in this species can be seen in its decision

Glebionis discolor (=Chrysanthemum coronarium var. discolor)
According to the information gain for the soil attributes, conductivity has the greatest influence on the classification of Glebionis discolor. The participation of the rest of the soil parameters and their different abundance values in this species can be seen in its decision tree.
The attributes ordered by information gain for the species G. discolor are 0.1881 EC (conductivity); 0.  Decision tree for Chco = Glebionis discolor (=Chrysanthemum coronarium var. discolor). The numbering inside the boxes represents the abundance-dominance index converted to Van der Maarel [40]. The other numbering corresponds to the range of values of the edaphic attributes.

Soil Attributes of the Other Species Studied
Sylibum marianum (Spain). This species is classified by nine soil parameters out of the 16 studied that have the greatest influence in this order: Kc, conductivity, carbonates, Mgc, pH, pF 15 atm, %_sa, %_cl, and CEC. The decision tree for this species shows the values for these soil parameters that lead to a range of species abundances. The attributes ordered by information gain for the species S. marianum are 0. Avena barbata (Italy). The soil parameters that provide information for the classification of the species A. barbata are P, Na, electrical conductivity, and pH. The most influential of all these is P. The corresponding decision tree shows that the species is not present for values of P of 11.2 p.p.m. or less, and that A. barbata is present for values of P over 11.

Discussion
Joint phytosociological and edaphic studies are not frequent, but there are many separate studies of both aspects. In this work, a clear relationship is established between the plant association and the soil on which it is found. The rapid identification of the dominant species and its abundance-dominance index presents the necessary information to The numbering inside the boxes represents the abundance-dominance index converted to Van der Maarel [40]. The other numbering corresponds to the range of values of the edaphic attributes.

Soil Attributes of the Other Species Studied
Sylibum marianum (Spain). This species is classified by nine soil parameters out of the 16 studied that have the greatest influence in this order: Kc, conductivity, carbonates, Mgc, pH, pF 15 atm, %_sa, %_cl, and CEC. The decision tree for this species shows the values for these soil parameters that lead to a range of species abundances. The attributes ordered by information gain for the species S. marianum are 0. Avena barbata (Italy). The soil parameters that provide information for the classification of the species A. barbata are P, Na, electrical conductivity, and pH. The most influential of all these is P. The corresponding decision tree shows that the species is not present for values of P of 11.2 p.p.m. or less, and that A. barbata is present for values of P over 11.

Discussion
Joint phytosociological and edaphic studies are not frequent, but there are many separate studies of both aspects. In this work, a clear relationship is established between the plant association and the soil on which it is found. The rapid identification of the dominant species and its abundance-dominance index presents the necessary information to know if there are nutritional deficiencies in the soil; this is fundamental for a sustainable and effective management of the crop, which allows favoring the use of vegetal covers and restricting the use of herbicides, less use of fertilizers and protection from the contamination of lakes, rivers, swamps, and crops. Many authors have worked on plant associations from the floristic point of view [4], others established relationships between plant associations and substrate [5,6,16,[22][23][24], or works are being done on the value of plant covers to mitigate climate change [40], and certain species are even used as indicators of heavy ion pollutants in the soil [43]. There are also more studies with a strictly edaphic character [37].
The plant-soil relationship has been studied by different authors [44], revealing the relationship between certain species and some edaphic parameters: although soil fertility is often established using edaphic analysis techniques [45,46], there are no studies on the influence of nutrients on the abundance of species. For these reasons, vegetation covers, breakages, soil erosion, and water resources are studied to evaluate the environmental impact, such as soil loss and increased evapotranspiration [47][48][49]. In all cases, the authors made a diagnosis without deep knowledge about the plant-soil relationship, which moreover did not result, as in other work where for example in a contaminated soil with heavy metal, a close relationship was observed [43]. However, this last point is extremely interesting in our study, since it allows us to establish the influence of the substrate on the abundance-dominance of the species and vice versa: in such a way that, knowing the abundance of the species (decision trees), we know the type and amount of nutrients present in the soil. It is a hybrid method between phytosociology and edaphology, which is of interest to environmental and agricultural agencies, because it enhances sustainable agricultural development [50], avoiding impacts [51] and mitigating climate change, which is causing an irregularity in production and food security. While Peter et al. [52] propose the use of Google Earth to select the best biotopes for cultivation, this tool is useful but not satisfactory; the use of bioclimatic maps and the phytosociological study of plant covers is necessary to obtain the amount and type of nutrients ground. Cover crops are necessary to maintain the richness of rhizobacteria, which intervene in the assimilation of nutrients and in plant growth [53,54]. The cover crop acts as a protective element of the soils, which reduces evapotranspiration, minimizing crop irrigation [55,56], and not being necessary the breakages that destroy the soil mycorrhizae that allow the nitrogen entry into nodular systems [57][58][59][60][61]. Nitrogen fixation by mycorrhizal fungi is essential for sustainable cultivation to be productive [62][63][64]: for this, it is essential to know certain edaphic parameters, since within the Fabaceae family, there are species indifferent edaphic to pH, being able to be acidophilic exclusively or basophilic. Jakubus and Graczyk [65] studied the variability of edaphic microelements in fields with Lupinus albus L., oscillating the soil pH between 6.5-7.0, soils in which some of the associations that we study growth (TP, TT, PA and LR): favoring this type of cover crop, it would not be necessary to carry out organic and inorganic fertilizations [66].
However, [17] established an in-depth study of certain pasture and grassland plant associations, correlating the abundance of a certain species with the types and amounts of nutrients in the soil, using various statistical techniques: among which we highlight Bayesian networks and decision trees. As we can see in the results obtained, the amount and type of nutrients condition the presence of a species with a higher or lower abundancedominance index. Consequently, knowing the value of the abundance-dominance index, it is possible to know what type and amount of nutrients exist in the soil. The greater or lesser abundance-dominance of a species is due to edaphic attributes. These are more influential than others, and this means that there are various routes that culminate in an abundance value of the species. As an example, in the specific case of Plantago bellardii, it is fundamentally conditioned by pF15 atm, conductivity, and pH. In the case of Hordeum leporinum, it is the silt texture that determines high or low values of this species.

Conclusions
This work demonstrates the importance of the soil-plant relation. The combination of phytosociological and soil analyses highlights the advances in the knowledge of the soil ecology of the plants. The study of the plant-soil relation using new methodologies that consider phytosociological, soil, and statistical aspects provides important information on the nutritional status of the soil. This is essential for improving crop yields, which can be achieved quickly by knowing the abundance of the dominant species in the plant association. Both the Bayesian analysis and the decision trees clearly establish the soil values for the abundance of a particular species. Therefore, if we know the abundance of a plant, we also know the amount of nutrients present in the soil. In this study, we have used several species, and we have observed that the soil attributes that most affect the presence or absence and abundance of a species, ordered according to the Information Gain (IG), are different for each species (decision trees). A species' abundance values, taking into account prior evidence, indicate the soil nutrients (Bayesian analysis). This is a new method, in which the knowledge of plant communities becomes very important. These plant communities are dominated by a certain species, which is a domain that is a consequence of the values acquired by different edaphic parameters. In the decision trees that we present for several species, dominant in a given association, edaphic nutrient values are established for a given abundance of the species. This fact acquires a high value in agriculture, since knowing the vegetal association and the abundance of the dominant species, we deduce the nutritional state of the soil.
We are sure that it is a work that improves the ability to predict changes in the plant community, in response to soil modification: this is very valuable to support the decision of environmental and agricultural agencies and decide interventions in the soil on the nutrient content.