Management Practices and Breeding History of Varieties Strongly Determine the Fine Genetic Structure of Crop Populations: A Case Study Based on European Wheat Populations

: As the effects of climate change begin to be felt on yield stability, it is becoming essential to promote the use of genetic diversity in farmers’ ﬁelds. The presence of genetic variability in variety could fulﬁl this purpose. Indeed, the level of intra-varietal genetic diversity inﬂuences the spatio-temporal stability of yields and the disease susceptibility of crop species. Breeding history of varieties and their management practices are two factors that should inﬂuence intra-varietal genetic diversity. This paper describes the genetic diversity of eight wheat samples covering a gradient from modern single varieties to on-farm mixtures of landraces. This gradient discriminates between landrace, historical and modern varieties, considering the breeding history of varieties, between single-varieties and mixtures of varieties, and between ex situ and in situ de facto strategy in terms of management practices. Genetic diversity of these samples was analyzed with the help of 41 single nucleotide polymorphism markers located in neutral regions, through computing genetic indices at three different levels: Allelic, haplotypic and genetic group level. Population structure and kinship were depicted using discriminant analysis and kinship network analysis. Results revealed an increase in the complexity of the genetic structure as we move on the gradient of variety types (from modern single variety to in situ on-farm mixtures of landraces). For the landraces, the highest levels of genetic diversity have been observed for a landrace (Solina d’Abruzzo) continuously grown on-farm in the region of Abruzzo, in Italy, for many decades. This landrace showed an excess of haplotypic diversity compared to landraces or the historical variety that were stored in genebanks (ex situ conservation). Genetic analyses of the mixtures revealed that, despite a very high selﬁng rate in wheat, growing in evolutionary mixtures promotes recombination between different genetic components of the mixture, a second way to increase the level of haplotype diversity. When management practices such as growing in mixture and on-farm management are combined, they substantially increase the different levels of genetic diversity of the populations (allelic, haplotypic, genetic group diversity), and consequently promote their adaptability. Our results conﬁrm the need to develop and manage evolving diversiﬁed large populations on-farm. These management strategies to the type of variety they wish to manage, because we have shown that their choices have a strong inﬂuence on the genetic composition of the crop populations.


Introduction
Since domestication and for thousands of years, crops have been grown as populations (i.e., landraces) and selected on-farm, therefore allowing among population (varietal) and within population (genetic) diversification. These two levels of diversity confer crop adaptability to contrasting environmental conditions and farmers' practices [1][2][3][4]. However, with the advancement of knowledge, especially in plant breeding and genetics (Mendelism, F1 hybrid development, pure line breeding, etc.), and modernization of agriculture from the middle of the nineteenth century in western countries, these genetically diverse landraces have been progressively replaced by genetically uniform modern varieties. This was particularly the case after the Second World War with the introduction of seed regulations such as distinctness, uniformity and stability (DUS) that a new variety had to meet to be registered and marketed [5]. The breeding history of varieties is thus really informative.
The increased use of genetically uniform crops since the Green Revolution has significantly improved yields and helped to alleviate the problem of hunger in the world. However, these modern varieties are highly dependent on inputs. The widespread use of these products in a context of climate change, water restrictions, pesticide resistance and rising prices for petroleum-derived chemicals is beginning to raise real concerns for society. One typical example of the impact of diseases on modern varieties is the corn leaf blight epidemics in America in 1970, which caused a 15% reduction in the estimated production [6]. Similar events have been described in other studies [7][8][9]. In the case of oat, the impact of the disease was found to be 25% less for landraces than for modern varieties [10]. The causal relationship between genetic diversity and disease tolerance has been demonstrated for rice in China [11] showing a lower impact of plant disease for mixtures of varieties compared to single varieties. In some cases, higher robustness and stress tolerance have been observed due to genetic diversity and higher level of heterozygosity [12,13]. Mobilizing among-variety genetic diversity through mixtures and within-variety genetic diversity by growing landraces or population varieties has become more important than ever before with global climate change and thus the increased risk of environmental variability in the near future [13,14].
These changes in the environmental conditions have made the conservation and use of genetic diversity crucially important. The scientific community has raised awareness about genetic erosion since the pioneer works accomplished by Vavilov or Harlan from the first part of the twentieth century [15]. Two types of conservation methods are generally employed for plant genetic resources: (i) Ex situ conservation and (ii) in situ conservation. Ex situ conservation is a static evolutionary strategy that involves the storage of samples (seeds, propagules and plants) outside their environment of origin such as in gene banks, botanical gardens, DNA banks, etc. [16][17][18][19]. In situ conservation is an evolutionary dynamic strategy in which genetic diversity is maintained in habitats where such diversity arose and continues to grow, i.e., for crop plants, in areas where that particular species has been domesticated and is continuously selected by farmers [20][21][22]. This in situ conservation is usually considered to be on-farm conservation (mainly landraces). This approach ensures the conservation of genetic diversity over time through the action of evolutionary forces (genetic drift, selection, mutation, migration) [1,23]. The adaptive response that a population shows when encountering a changing environment largely depends on the level and the structure of genetic diversity within the population [24,25].
Several studies were carried out in the recent past to understand the trends of crop genetic diversity in the last century. These studies generally neglected the within-variety (within-accession) diversity as they analysed very small sample sizes (1-5 individuals per landrace, accession or even species), and very often investigated either released varieties or ex situ conserved landraces [26][27][28]. Consequently, few studies have compared the effects of ex situ and in situ conservation on crop population genetic diversity. Sun et al. found higher genetic diversity and more alleles in landraces conserved in situ compared to the landraces conserved ex situ in rice [29]. Further, the evaluation of 600 landraces of rice conserved on-farm revealed that the genetic diversity was maintained even after 27 years strengthening the case of advantages of on-farm conservation [30]. In contrast, a decrease in genetic diversity due to genetic drift during regeneration with limited sample sizes in ex situ conservation has been shown in bean [31] and barley [32]. While 88% of total natural diversity was found in ex situ conserved Vatica guangxiensis [33], a better maintenance of genetic diversity of Parashorea chinensis was observed in in situ conservation than in ex situ conservation [34]. The effects of conservation of crop biodiversity by the two methods (in situ and ex situ) was studied in more detail by Negri and Tiranti [35], who identified reduced population size as the main factor causing the reduction of genetic variation, followed by ex situ multiplication, which increased subpopulation differentiation due to different environmental conditions compared to the adaptation area. In a recent study, the comparison of genetic diversity of rice landraces conserved ex situ in 1980 as well as conserved on-farm until 2014 showed that the on-farm conservation system maintained more alleles and higher genetic diversity compared to ex situ conserved landraces indicating that on-farm conservation provides the opportunity of evolution along with conservation [36]. Various other studies in crops species such as maize [1], rice [37], common bean [35], barley [32] and sorghum [38], have confirmed that in situ conservation, especially on-farm conservation, was highly capable of maintaining and enriching the within-population genetic diversity and heterozygosity. In these studies, diversity was found within each population cultivated under the name of a given landrace variety and to a lesser extent among those populations. Such heterogeneity is compatible with good productivity and quality under low input conditions, probably due to better local adaptation and more buffering capacity under stress conditions, as shown for instance in the case of an Italian landrace of celery [39].
Therefore, from a plant genetic resource management perspective, it is important to evaluate the effect of factors such as the mating system, the breeding history of the variety, the farmer's practices and the conservation strategies on the extent of the genetic population structure of varieties. We propose to describe the role of three of them: The breeding history of the variety, the conservation strategy and the population composition. Three modalities are considered for the breeding history of the varieties: Landraces, historical and modern varieties. Landraces are varieties developed by farmers before the 1850s. historical varieties were developed between the 1850s and the 1960s. Modern varieties are those developed since the 1960s. The conservation strategy discriminates ex situ conservation and in situ conservation. The composition of the population distinguishes between single varieties and mixtures.
In this study, using a set of eight populations of bread wheat (Triticum aestivum L., 2n = 6x = 42, AABBDD, self-pollinating species) with varying breeding histories and management practices (conservation and composition) provides an opportunity to understand the influence of all these three factors on population structure (Table 1). In such a set of populations, different genetic structures are expected to arise depending upon their history and farmer practices in the area. Based on our knowledge and the literature, we hypothesized that there was a gradient of increasing genetic diversity from modern varieties to mixture of landraces. This hypothesis was used to examine the diversifying processes in action in farmers' fields.
These eight populations (described in Table 1) were characterized using 41 neutral molecular markers. Genetic diversity was analysed at three levels: The allelic, haplotypes and genetic group level to search for a possible gradient within these eight populations. Additionally, the fine population structure of the different varieties has been revealed using a discriminant analysis and a kinship network analysis. The consequences of conservation and management methods on these different population structures are discussed in light of the different history of each variety and the genetic structure depicted in this study.

Context of the Study
The present study is based on the same sampling and experimental design described in Dawson et al. [40]. Seven farmer's varieties (three landraces, three mixtures, one historical variety) and one modern variety of bread wheat were selected for genotypic analysis (Table 1). Only one modern variety was sampled in this study as no genetic variation is expected for modern varieties since they are submitted to the distinction, uniformity and stability criteria to be marketed. The choice of these varieties was done by farmers on the basis of particular agronomic or quality traits, which they felt could be of interest for other organic farmers [41]. Seven organic farmers (four from France: JFB, FFM, HHF and VVC, two from the Netherlands: PVI and PVZ and one from Italy: GCX) were contacted through the partner organizations in the European Farm Seeds Opportunities project (http://cordis.europa.eu/search/index.cfm?fuseaction=proj.document&PJ_RCN=9643492) for this purpose. Mixture of two modern German varieties (Rektor and Bussard). Provided by the farmer who cultivated these two as mixture for more than ten years in the central region of the Netherlands.  The eight varieties, chosen for the present study, represent different management practices and types of farmer varieties presented in Table 1. Management practices distinguished: (a) Whether farmers have grown these varieties in their fields for at least ten years (in situ) or they requested from ex situ gene-bank or seed trader within the five past years (ex situ); (b) whether the varieties are grown on their own in a field (single variety) or are grown in a mixture. We distinguished three types of breeding history of varieties: Landraces, historical varieties, and modern varieties.

Ex situ conservation
Three varieties of this study were managed following ex situ conservation practices. Renan (RN) is a French modern variety currently the most widely cultivated variety in organic agriculture in France. Farmers requested Haute-Loire (HL) and Piave (PI), two landraces from France and Italy ex situ gene-bank collections, respectively.

In situ conservation
Five varieties were managed following in situ conservation practices. Rouge de Bordeaux (RB) is technically an historical variety from the late 1800s but was never selected for genetic homogeneity and the sample used here came from a farming community near the Bordeaux region. Solina d'Abruzzo (SO) is a landrace continuously cultivated in its region of origin in Italy. Zonnehoeve (ZH) is a mixture of two modern varieties from 1980 to 1990, planted and harvested together over ten years on an organic farm in The Netherlands. Redon (RD) and Touselles (TO) are landraces from ex situ collections, reconstituted by farmers as mixtures of several distinct accessions with the same name in the gene bank. TO is particular because it has three T. aestivum components and one T. durum component. Both RD and TO have been cultivated about ten years on-farm [4,40].

Experimental Design
In 2006, seeds needed for the experiment were sampled on the farms of origin (around 3 kg each). RN seed lot (3 kg) was obtained from INRA in 2006. An amount of 0.3 to 0.4 kg per variety per farmer depending on their seeding rate (between 250 to 500 seeds per m 2 ) and on the varieties' thousand kernel weight was sent to each of these seven farms, mostly those of the farmers who provided varieties for sowing in 2006. For three years, the farmers multiplied each variety in two 10 m 2 plot which were separated from other plots with a distance sufficient to prevent the sample mixing at harvest (see [4,40] for more details). After three years of multiplications at these farms, the seed samples were collected to be used for genotypic analysis. After three years, only 48 of the 56 populations (eight varieties x seven sites) were sampled. Unfortunately, eight seed lots could not be recovered due to experimental problems (five populations from JFB site and three populations from PVZ site). To improve the power of our analysis dealing with within-variety diversity, we added seed lots from the initial 2006 versions of each variety (eight initial varieties) for a total of 56 seed lots. Plants from each seed lot were sown in trays on the 16th of December 2010 at the rate of 30 individuals per population. The second leaf was sampled from each plant for DNA extraction.

Molecular Analysis
The DNA from each plant (30 plants per population and 56 populations) was separately extracted and genotyping of each sample (plant) was individually performed. For each plant, total DNA was extracted from 200 mg of the second leaf through DNA adsorption on Whatman Unifilter plates by following a protocol derived from the DNeasy 96 Plant kit (QIAGEN, Valencia, CA, USA). The quantification of the extracted DNA was done using Nanodrop R (Thermo Scientific) and PicoGreen TM followed by photospectrometry. Then the DNA concentration was normalized to 20 ng/µ for each sample. KASPAR method (Kbioscience) was used to genotype 1650 individuals of 56 populations (around 30 individuals per population) with a 96 SNP array. For this genotyping, 46 neutral markers (SNP) were used. These 45 neutral markers (out of a 90 markers) were selected from wheat 9 K iSelect assay (http://malt.pw.usda.gov/t3/sandbox/wheat/termsofuse.php) [42,43] after validation through genotyping of test individuals (preliminary genotyping with only 12 individuals) because they showed satisfying results, in terms of quality of genotyping, in the preliminary genotyping evaluation, had high diversity on the screening panel on the basis of Polymorphism Information Content value (PIC value), as well as evenly distributed across the genome (data not shown).

Data Analysis
Molecular analysis revealed that five neutral markers were of poor quality and therefore were discarded and 161 individuals were rejected as they had more than 15% of missing data. The genetic analysis of the samples was done based on 41 neutral markers using a panel of 1489 individuals.

Within-Variety Allelic Diversity
The mean observed heterozygosity (H o ) was estimated for each sample. A genetic diversity (H e ) index was estimated at three levels: (a) Allelic (H e (a)), (b) haplotypic (H e (h)) and (c) genetic group (H e (gg)). H o and H e (a), which is the unbiased Nei's genetic index, were estimated using the software GENETIX version 4.05.2 [44]. The haplotypic diversity index was estimated using the gene diversity formula in the software Arlequin [45] and the genetic group diversity index was estimated applying the Simpson index using Vegan [46], a package developed in R core [47].

Haplotype Inference
In order to study the fine genetic structure, imputation of missing data and haplotype phase detection was carried out with PHASE [48] by using the 1489 diploid individuals (corresponding to the pooled dataset) with the 41 neutral markers. Settings performed with PHASE were decided according to PHASE guideline recommendations [49] by using the MR4 algorithm and a numeric experiment based on a high quality subset of 300 individuals without any missing data. The quality of the imputation was controlled by simulating different rates of missing data within the high quality data subset. Based on this work, the analysis was performed with 100 burns-in periods before 100 iterations with 100 permutations per population and a recombination rate of 0.01.

Structure Analysis
Genetic relationship among the eight samples of bread wheat was analyzed computing the Roger's distance between all pairs of samples using Adegenet [50], a package developed in R core [47]. A neighbor Joining dendrogram was generated to represent the relationship among these eight populations based on Roger's distance and using the R package Ape [51]. Bootstrap support was obtained for each node of the dendrogram out of 10,000 replicates.
The pairwise-F ST were computed to estimate genetic differentiation among the same populations using the R package Hierfstat [52]. The confidence interval was estimated for each pairwise-F ST value based on 1000 bootstrap replicates.
The organization of genetic diversity was studied using two methods: Discriminant analysis of principal components (DAPC) and kinship network analysis (KNA). They were employed on the new data set constituted of 397 distinct haplotypes after PHASE missing data imputation for the 41 neutral markers. For technical reasons, each haploid haplotype was converted into homozygous diploid.
The DAPC was done using the Adegenet package. This analysis was done in two steps on diploid haplotypes. First, a k-means procedure was applied to detect the optimal partition of the pooled haplotypes. The K-means algorithm was run with a number of clusters ranging from 1 to 60, with 100,000 replicates for each value of k and 30 different seeds. The Bayesian information criterion (BIC) was used. The optimal number of clusters (k) is obtained for the smallest value of BIC. During a second step, a discriminant analysis was carried out on the dataset using the optimum number of clusters detected, providing a probability of assignation of each haplotype to each of the k groups. The genetic groups detected were named according to the corresponding variety when they were found to be representative of the considered farmer's varieties (FVs). All the haplotypes present in clusters others than their specific variety groups were considered as potential migrant or off-type haplotypes. Identification of potential migrants and off-types within populations was done on the basis of DAPC results. The haplotype of each of these potential migrants and off-types was compared one by one with the haplotypes of the same cluster and with the haplotypes of clusters assigned to their varieties, to decide their status as good genotypes or migrants. Only 142 (9.5%) individuals were detected as migrants and removed from further analysis. DAPC analysis was done a second time on the refined dataset to check for the stability of the clustering.
Kinship network analysis (KNA) at the haplotype level provided a representation of the relationship among all 448 distinct haplotypes detected in the pooled dataset after missing data imputation and phasing. KNA was based on the kinship matrix, obtained after counting the number of shared alleles between all pairs of the 448 distinct haplotypes present in the dataset. Thresholds were applied at two levels to simplify the network representation while retaining the maximum level of information. Only haplotypes occurring more than once are represented by nodes. Two nodes (haplotypes) are connected if they share a minimum of 87% of their alleles. The same approach was used in [13]. In order to combine the results of both DAPC and KNA and to discuss the consistency of the two approaches, haplotypes (individuals) were color coded by DAPC cluster assignment. Then, in KNA each haplotype was represented using the same color to see whether haplotypes belonging to the same DAPC cluster showed a tendency to be more clustered in the KNA analysis.

Off-Type Identification and Data Purification
The discriminant analysis of principal component (DAPC) performed on 1489 individuals (2978 haplotypes) after missing data imputation, revealed an optimal number of clusters (genetic group) equal to 25. Twenty-two genetic groups among the 25 corresponded mainly to one variety and were renamed according to this information, while three clusters shared individuals from two or more varieties and were therefore denoted as a Common Group (CG1, CG2, CG3) ( Figure S1). This first analysis allowed us to identify two populations (RN and TO both cultivated in the farm called PVZ) which were not classified in their respective varietal group. RNPVZ belonged to one specific group with individuals only found in this population. TOPVZ belong to one of the groups specific for RD. This was consistent with information from the person responsible for the experiment in FSO project saying that the farmer PVZ had difficulties in identifying some populations at harvest. For this reason, RNPVZ and TOPVZ were discarded from further analysis. In addition, 75 haplotypes were not assigned to their respective varietal groups, but instead were assigned to another genetic group. We also removed these individuals from the analysis, as we considered them as migrants from other varieties grown on the same farm. After this step of dataset cleaning, 310 distinct haplotypes were identified among 1325 individuals. DAPC was performed on this cleaned sub-dataset and the optimal number of genetic groups was k = 24 ( Figure S2).

Multi-Level Genetic Diversity and Population Structure of Samples
Heterogeneous levels of neutral genetic diversity were found among the eight samples corresponding to different variety types (modern variety, historical varieties, landraces and mixtures) and different management strategies (ex situ vs in situ conservation and mono-varietal vs mixtures). The following sections provide a description of each sample using a multi-level analysis of the genetic diversity: the unbiaised Nei's diversity at the allelic (H e (a)), haplotype (H e (h)) and genetic group (H e (gg)) levels ( Table 2).

Single Variety Conserved Ex Situ and Modern Variety
Genetic diversity analysis revealed similar characteristics for one of the two landraces, "Haute Loire" (HL), which is managed as a single variety and conserved ex situ, and the modern variety, Renan. The other landrace (Piave) showed a contrasted pattern. "Haute Loire" was the most homogeneous sample with a single genetic group (Figure 1) denoted as "HL1". Consistently HL had a very low genetic diversity at the genetic group level (H e (gg) = 0.000 (Table 2), but also at the allelic level (H e (a) = 0.0012) and haplotypic level (H e (h) = 0.0870). We noticed that the observed heterozygosity was also an order of magnitude lower in this sample than in the others (H o = 0.0003 compared to 0.03 < H o < 0.09 for the others).
Similar to HL, all the individuals from the modern variety Renan (RN) were also characterized to have a unique genetic group named "RN1" (Figures 1 and S2). Moreover, the very low genetic diversity at all the three levels (H e (a) = 0.0057, H e (h) = 0.2354 and H e (gg) = 0.000) indicated very low within-population genetic diversity.
Piave (PI) the second ex situ landrace clearly showed a different pattern of genetic diversity and ultimately different genetic structure compared to the two previous samples. PI showed higher genetic diversity values at the allelic level (H e (a) = 0.1046, haplotypic level H e (h) = 0.4334 and genetic group level H e (gg) = 0.2315 (Table 2) than the two other samples. PI was a composite variety with two specific groups denoted PI1 and PI2 (Figure 1).

Single Variety Conserved In Situ (HV and LR)
The Rouge de Bordeaux (RB) variety showed a composite structure with three genetic groups (Figure 1), i.e., two main specific groups (RB1 and RB2) and one shared group (CG1) with Redon. In addition, genetic diversity at allelic, haplotypic and genetic group levels were relatively high for RB (H e (a) = 0.3364, H e (h) = 0.6012 and H e (gg) = 0.5713, respectively, Table 2) compared to the previously discussed varieties.
Solina d'Abruzzo (SO) also had a composite population structure with four specific genetic groups (SO1, SO2, SO3, SO4) and one shared group (CG2) with Touselles ( Figure 1) indicating highly complex population structure. Interestingly, the probability that some individuals of this landrace fall in one specific group was relatively weak, but they always fall in groups specific to this landrace or in the CG2 group. A high level of genetic diversity at the haplotypic and genetic group levels was observed for SO (H e (h) = 0.9850 and H e (gg) = 0.7497, respectively, Table 2), but a relatively low allelic diversity (H e (a) = 0.1607) was observed compared to RB.

Mixtures In Situ Conserved (MIXMV and MIXLR)
Zonne Hoeve (ZH) was structured into two main fixed groups (ZH1 and ZH2) and one shared group (CG3) with Touselles. ZH2 represented almost 75% of the mixture, explaining the relatively low genetic group diversity H e (gg) = 0.4347 observed for a mixture. The allelic diversity and haplotypic diversity (H e (a) = 0.1642, H e (h) = 0.8685) were also intermediate ( Table 2).
The "Touselles" (TO) sample was composed of five main specific groups (TO1, TO2, TO3, TO4 and TO5) and two groups (CG2 and CG3) shared, respectively, with SO and ZH (Figure 1), consistent with the expectation that this mixture would have a complex population structure based on its history. A high level of genetic diversity was observed for the different genetic indices: H e (a) = 0.3025, H e (h) = 0.9340 and H e (gg) = 0.7283 indicating that the TO population was genetically diverse ( Table 2).
Similar to TO, Redon (RD) also presented a composite structure with four main specific groups (RD1, RD2, RD3 and RD4) and one group (CG1) shared with RB ( Figure 1). Genetic diversity indices were relatively high at the three levels: H e (a) = 0.3364, H e (h) = 0.9301 and H e (gg) = 0.7414 confirming the diverse nature of this population ( Table 2).
However, some samples like RB, ZH and SO were relatively far from the regression line ( Figure 2). For RB, an excess of allelic diversity, deficit of haplotypic diversity, and excess of genetic group diversity was observed compared to the other samples. ZH showed an excess of haplotype diversity and a deficit of allelic diversity and genetic group diversity compared to the other samples. SO showed an excess of haplotypic and genotype group diversity and a deficit of allelic diversity compared to the other samples.

Deciphering the Varietal Relatedness
Dendrogram confidently separated the eight populations into three clusters (bootstrap support of 100%, Figure 3). The first one is composed of ZH and PI which are themselves relatively different (D Rodger = 0.46, bootstrap support equal to 68%). The second cluster grouped RD and RB, two closed varieties (D Rodger = 0.17) which are distinguished with intermediate confidence (bootstrap support equal to 50%). The third group is composed of TO, SO, RN and HL in which varieties cannot be clearly distinguished (bootstrap support < 50%) even if their genetic distances ranged from 0.14 to 0.53. RN is the most distant population compared to the other populations, then it is ZH and PI. RN and PI are the two far populations (D Rodger = 1.00).  Table 1. Node labels represent boostrap support (>50%) out of 10,000 bootstrap replicates.
Pairwise-FST provided additional information (Table S1). Very strong differentiation was observed between the two varieties which are almost fixed: HL and RN, with a pairwise-FST close to 1. Strong genetic differentiation was observed for pairs involving one of these two varieties with pairwise-FST ranging from 0.74 to 0.92. Intermediate pairwise-FST values ranging from 0.73 to 0.64, were observed when populations with a limited amount of genetic diversity such as PI, SO and ZH were involved. The lowest differentiation values were observed when one or both populations were genetically diverse with a pairwise-FST ranging from 0.25 to 0.63.
The kinship network analysis performed on 245 haplotypes after filtering (KNA, Figure 4) revealed that: (1) Genetic groups from the same variety were not always strongly related, and (2) genetic groups from two different varieties could be related.
h156 and h40, the two main haplotypes from the modern variety RN were only related to each other. The main haplotype from PI2 genetic group, h22, was related to few haplotypes belonging to the same genetic group and not connected to the other haplotypes, indicating that this component of Piave variety was genetically very different from the rest. h362, the main haplotype of the PI1 genetic group was connected to haplotypes belonging to several TO genetic groups, revealing that the PI1 genetic group appeared to be more related to TO rather than to its second genetic group PI2. h1 was the main haplotype of the HL variety. It showed some relatedness with different haplotypes from several TO genetic groups and with only one SO haplotype (h305), respectively. The KNA analysis confirmed the composite characteristic of the mono-varietal in situ conserved FV, (RB and SO). h110 was the main haplotype of RB which was present in the genetic group RB1 and was connected to the other haplotypes present in this genetic group. h59 haplotype, the main haplotype of genetic group RB2, was not connected to other haplotypes, showing a clearly distinct genetic make-up compared to other haplotypes from RB, as well as other samples. h144 belonging to the CG1 genetic group was the same haplotype detected in RB and RD. This haplotype seemed very distinct from the other genetic groups. The landrace variety SO showed the most diverse population structure compared to other samples investigated in this experiment. Many haplotypes with mostly very low haplotypic frequencies were observed. Interestingly, most of these haplotypes were connected together, revealing a very strong relatedness among them. h336 was the main and the most central haplotype within SO.
As expected, mixtures showed a high haplotypic diversity. For ZH, h436, the main haplotype of genetic group ZH1 was not connected to the other haplotypes of its genetic group, appearing to be weakly related to the rest of ZH as well as of other FV. h451, the main haplotype of ZH2 was connected with haplotypes belonging the ZH1 genetic group. h432 is the main ZH haplotypes assigned to the CG3 genetic group. The other ones were more related to ZH1 and ZH2.
Among different genetic groups of the RD mixture, the main haplotypes assigned to RD1 and RD2 like h126, h127, h147 or h21 were strongly related as the make a cluster. They were weakly connected with SO haplotypes. By contrast, the main haplotypes from the genetic group RD3 and RD4 like h206 or h216 showed a strong genetic relatedness with haplotypes of SO. The KNA applied to the TO mixture revealed that TO1 was very distinct from rest of the genetic groups as h121, the main haplotype of this genetic group, was not connected to any other haplotype except h119. The genetic groups TO3 and TO4 were composed by only one haplotype (respectively, h114 and h113). These two haplotypes were related to each other, but were not connected to the rest. The different haplotypes belonging to genetic group TO2 were connected to each other, as well as to the h362 (the main haplotype of PI1 genetic group as described above). In addition, they were also connected with some other haplotypes belonging to TO5 genetic group indicating that TO2 was genetically more related to TO5 as compared to other genetic groups of TO.

Discussion
This investigation was designed to test the basic hypothesis that farmers' varieties are more genetically diverse than a modern commercial variety using bread wheat as case study. From this, the exploration of a range of intermediate situations has revealed that certain practices such as on-farm dynamic management of crop diversity or the cultivation of variety mixtures foster genetic diversification in the fields by allowing evolutionary mechanisms to occur.

Diversifying Practices Interacting with the History of Varieties
When grown as single varieties, the modern variety Renan (RN) and the two conserved ex situ landrace varieties (Piave, PI and Haute-Loire, HL) showed the lower levels of genetic diversity (allelic, haplotypic and for the genotypic groups), HL showing the lowest values for the three diversity indicators. These results are in line with previous studies that have identified that gene bank practices during regeneration may promote genetic drift [32] and selection [53], which induce a loss of genetic diversity. This is due to the constraints related to the large number of accessions that need to be regenerated each year and therefore the small amount of time and space that may be devoted to each accession and to the field conditions that may generate selective pressures.
By contrast, in situ on-farm management of crop diversity in farmers' fields by growing and harvesting a very large number of plants (usually 200-500 seeds/m 2 over more than 10,000 m 2 ) every year for a very long time fosters accumulation of genetic diversity in single varieties as observed for RB and SO. Indeed, these management practices should limit genetic drift and favor the emergence of new alleles by mutation and of new genotypes by recombination, even though outcrossing rate is low but not null in bread wheat [54]. Homozygous lineages are generally observed in such situation with a limited number of haplotypes that can reach a high frequency [55][56][57]. This interpretation is consistent with the historical variety (RB) which showed an excess of allelic diversity (0.33) but a deficit of haplotype diversity (0.60) indicating that only a few haplotypes are dominating. This phenomenon leading to a limited number of frequent haplotypes could be reinforced by a few bottlenecks in the past of this population. Such demographic event is consistent with available information about RB as it was selected in the 1850s from another historical variety called Noé before being widely grown in France at the end of the nineteenth century.
However, it was not quite the case for Solina (SO), a landrace continuously grown in the Abruzo area, Italy, which showed an excess of haplotype diversity (0.98) together with a moderate allelic diversity (0.16). While the allelic diversity of Solina seemed to be at the lower end of the range found in the literature for on-farm managed bread wheat landraces (for instance, H e (a) ranged from 0.17 to 0.34 in three bread wheat landraces collected in Oman in 2001 [58], haplotypic diversity clearly seemed extremely high. This makes it complicated to explain the particular pattern of genetic diversity structure found for SO. While natural selection due to the somewhat stressful conditions encountered in this region could drastically reduce genetic effective size as demonstrated in experimental wheat populations conducted under in station dynamic management [59,60] and therefore impact the allelic diversity, the haplotypic genetic diversity would also be much reduced, which is not the case for SO. In addition, the populations conducted in experimental dynamic management over 10 to 20 generations probably do not represent the evolutionary dynamics of a landrace such as Solina, that has been managed on-farm for several hundred years, under conditions that we may imagine stable, and therefore probably relatively adapted to its local environment. One hypothesis is that recombination played a non-negligible role in the evolution of SO population structure which is in line with the trend found by Jullien et al. [56] who investigated the effects of the selfing rate, population genetic effective size and migration rate using a population genetic simulation model. However, although decreasing selfing led to a greater number of multilocus genotypes (i.e., a larger H e (h)), the range of values does not seem compatible with Solina's diversity pattern. In-depth investigating of more contrasted situations with higher seed migration together with selection and recombination in this area is essential to further clarify the mechanisms and evolutionary forces at work.
The same level of allelic genetic diversity was observed in the field when farmers decided to create mixtures compared to the level of allelic genetic diversity estimated for on-farm single varieties (an average of 0.26 compared to an average of 0.24, respectively). However, more interestingly, the mixture of modern varieties, Zonne Hoeve (ZH), showed a slight excess of haplotype diversity compared to the other mixtures (Figure 2A). This finding can be interpreted as an increase in effective recombination for a mainly selfing species.
Year after year of cultivation including seed saving and seed replanting, a population may accumulate a few recombining plants between the two components of the mixture while the allelic diversity is still relatively limited because it is a mixture between two modern varieties genetically homogeneous.
Therefore, two practices can be considered as diversifying: (1) When farmers create mixtures of varieties, (2) when they grow a population for a long time in the same area such in the case of SO. The combination of the two practices, making mixture of landraces, leads to higher levels of allelic and haplotypic diversity, as we observed in TO and RD.

Genetic Relatedness between Different European Farmer's Varieties
Structure analysis of the eight varieties/mixtures of bread wheat showed strong genetic distance and pairwise-F ST between HL and RN which is consistent with the absence of genetic variability within each of these samples with a very limited number of shared alleles. By contrast, intermediate genetic distance and pairwise-F ST were observed between historical variety, landraces and landrace mixtures grown on-farm. This indicated that genetic variance is available and that a large part of alleles are shared among these heterogeneous varieties or mixture.
In previous studies, Thomas et al. [13,57] have shown that clustering methods usually used to analyse population structure are not fully adapted to detect the fine structure of crop populations with heterogeneous levels of genetic diversity. Results from a multivariate decomposition (DAPC) and a kinship network analysis, also known as haplotypic network, were combined to avoid the limitations of each analysis.
Our results confirmed the composite nature of variety mixtures such as ZH, TO or RD as they were composed of three to seven genetic groups. These groups should correspond to the different components of the mixtures. More surprisingly, the same pattern was observed in two landraces (PI and SO, Figure 1) and the historical variety (RB). This composite nature was previously reported in detail for RB [57] and TO [13] using SSR markers. This structural feature is thus independent of the type of markers used to analyse the genetic polymorphism of crop populations.
A focus on the genetic relationship among these composite populations reveals an unexpected situation: Two genetic groups belonging to two different varieties could be more related than two genetic groups from the same varieties. As already mentioned in a study describing TO [13], this reflects an overlapping genetic structure between certain varieties and could explain the lack of bootstrap support when genetic distance is computed among SO, TO, HL and RN, for instance, even if they are relatively distant and differentiated. It was also the case for Piave as its two genetic groups (PI1 and PI2) were not closely related, whereas PI1 and several genetic groups from TO were relatively closer ( Figure 4). In addition, some genetic groups of TO were also related to the unique genetic group of HL. The same trend was observed between RB and RD, and between RD and SO. These results suggest that the western Europe landraces studied here shared, to some extent, a common genetic background which is still partially present within on-farm landraces and evolutionary mixtures. This highlights the existing porosity between these European landraces due to their common long term history which was progressively erased by human activity inducing strong discontinuity. These remnants of common genetic background are thus still available and can be used by farmers and breeders and adapted to different environmental conditions to face recent threats induced by climate change and other stressful phenomenon.

Conclusions
In this paper, we considered different crop varieties and mixtures of bread wheat ex situ conserved or continuously grown in situ on-farm. Our results revealed an increase in the complexity of genetic structure as we move on the gradient of variety types (from modern single variety to in situ on-farm mixtures of landraces). We have shown that two on-farm management practices increased genetic diversity: Mixtures of varieties and in situ on-farm management of landraces. When these management practices are combined, they substantially increase the different levels of genetic diversity of the populations (allelic, haplotypic, genetic groups diversity), and consequently improve their adaptability.
Indeed, genetic analyses of the mixtures revealed that, despite a very high selfing rate in wheat, growing in evolutionary mixtures promotes recombination between different genetic components of the mixture, a second way to increase the level of haplotype diversity. More surprisingly, an excess of haplotype diversity compared to the level of allelic diversity was observed for Solina d'Abruzzo (SO) in Italy, one of the landraces continuously cultivated in situ on-farm. Different evolutionary mechanisms have already been proposed to describe distribution of genetic diversity in self-pollinated species, but none of them is sufficient to explain the distribution observed in SO. The most likely scenario to explain such an organization of genetic diversity is a very large effective size through collective management by local farmers that combines seed circulation with mixing and selection of their seeds in addition to natural selection. These practices promote recombination between different genotypes and the fixation of neutral and adaptive mutations. In-depth investigation by simulation and experimentation is needed to decipher precisely the evolutionary mechanisms in action which allowed such features to emerge.
All these findings confirm the need to develop and manage evolving diversified large populations on-farm. In an era of global climate shift, where modern varieties have shown low adaptive potential to changing environmental conditions, especially in organic and low input areas, the use of highly diverse landraces and historical varieties in mixtures could solve the problem to a certain extent because on-farm diversified populations have a higher potential to adapt to their local environment than uniform varieties. Such diversifying practices need to be generalized, in particular when promoting agroecological and sustainable agriculture. The new EU seed legislation on heterogeneous material in 2018 is opening new opportunities in this direction. However, the evolutionary characteristics of these populations are still poorly understood by researchers and need to be better considered in the future.
Our conclusions invite crop diversity managers such as genebank curators, community seed bank managers and farmers' organizations to adapt their management strategies to the type of variety they wish to manage, because we have shown that their choices have a strong influence on the genetic composition of the crop populations.
Author Contributions: A.R.K. conceptualized the study, devised the methodology, conducted the investigation, curated the data, performed formal analysis and drafted the original manuscript. I.G. conceptualized and supervised the study, devised and validated the methodology, acquired funding, administered the project, provided resources and, reviewed and edited the manuscript. M.T. supervised the study, devised and validated the methodology, visualized, reviewed and edited the manuscript.