Intercultivar Diversity of Sour Orange ( Citrus aurantium L.) Based on Genetic Markers, Phenotypic Characteristics, Aromatic Compounds and Sensorial Analysis

: Sour oranges ( Citrus aurantium L.) are well known in the processing and cosmetics indus-tries for the aromatic properties of their essential oils. Intercultivar genetic and aromatic diversity is not well documented. The objective of this study was to evaluate the impact of morphological selection and genetic mechanisms of varietal diversiﬁcation (mutation or hybridization) on the aromatic and odor variability of sour orange essential oils. Forty-ﬁve sour orange accessions from INRAE-CIRAD citrus Biological Resources Center (France) were assessed for ten simple sequence repeat (SSR) and 54 single nucleotide polymorphism (SNP) markers, nine morphochemical fruit traits and with the aromatic components of leaf and peel essential oils. Thirty-nine sour oranges displayed no intercultivar molecular polymorphism and six genotypes originated from interspe-ciﬁc hybridizations involving sour orange, citron, pummelo or mandarin. The peel essential oil (PEO) diversity was low, in accordance with the genetic diversity. The predominance of limonene (>90%) prevents any possible correlation to be made between the composition and the variation in sensory proﬁles detected by panelists. Few compounds in the leaf essential oil (LEO), such as linalool, linalyl acetate, α -terpineol and geraniol were signiﬁcantly different across sour oranges varieties. The morphological fruit attributes mainly used in varietal selection differed highly across the main genetically identical group of sour orange accessions. These results conﬁrm that mutation can generate variability in aromatic compounds and aromas and that their exploitation requires an improvement in characterization processes.


Introduction
The regular sour orange belongs the Rutaceae family. Its genetic origin is now commonly accepted to be direct hybridization between a pure male mandarin (C. reticulata Blanco) and a pummelo (C. maxima (Burm.) Merr.) [1,2]. Although its precise geographical origin is unknown, it probably originated between the southern China, northern Burma and northeastern India; then, it spread worldwide following traders and discoverers very early in recorded history. Sour orange was probably cultivated in India and China hundreds of years ago, before the Christian era [3]. Sour orange became a major rootstock at the end of the 19th century to fight against Phytophtora gummosis [4]. When used as a rootstock, sour orange also promotes high yield, high quality fruits and tolerance to various stresses, such as root rot, high pH, salinity, flooding, cold and blight. Unfortunately, its popularity has decreased with the emergence of citrus tristeza virus (CTV) [5]. Even if

Biological Materials
All the biological materials used in this study came from the orchard of the INRAE-CIRAD citrus collection at San Giuliano, France (latitude 42 • 17 N, longitude 9 • 32 E; Mediterranean climate, average rainfall of 840 mm per year and average temperature of 15.2 • C; soil derived from alluvial deposits and classified as fersiallitic, with a pH range of 5.0-5.6) [37]. Forty-five sour orange accessions grafted on the same rootstock (Carrizo citrange, [C. sinensis × Poncirus trifoliata]) and grown under identical conditions were used for the analysis of phenotypic and aromatic diversity. For the genetic analysis, one additional variety of each of the four ancestral species was included as a reference to help us determine the genetic origin of sour orange accessions (Table S1).

Phenotypic Description 2.2.1. Data Acquisition
To perform the phenotypic analysis five representative fruits from three different trees each of forty-three accessions ("Bizzaria" and "A fleurs grosses Carle" were not phenotyped) of sour orange were harvested during the first week of March 2019 corresponding to fully ripened fruits. In this study all the fruits were individually measured.
The polar and equatorial diameters (PD and ED) of each fruit were measured using a caliper model IP 67 (BLET, Rueil-Malmaison, France), and their ratio (PD/ED) yields the shape of the fruits: flattened for values below 1 and oblong for values higher than 1. The weight (W) of each fruit was measured using a balance model PIONNER (OHAUS, Parsippany, NJ, USA). The thickness of the peel (PT) was measured using a caliper IP 67 model (BLET, Rueil-Malmaison, France). The number of segments (NS) of each fruit were individually counted. The color of the flavedo fruit peel was measured using a colorimeter model Chroma meter CR-400 (Konica Minolta Sensing, Ramsey, NJ, USA) with the determination of a*, b*, and L color indices, where a* corresponds to the variation between green and red, b* corresponds to the variation between blue and yellow, L is the brightness variation between black and white, the combination of these variables permit to calculate the Citrus Color Index (CCI). These variables were then applied in the formula CCI = 1000 × a*/(L × b*), with variation between −20 (green) and +20 (orange), where zero (0) corresponded to yellow [38]. Each fruit was measured four times around the equatorial line.
Five fruits from each tree were separately hand-pressed and equal proportions were mixed in three lots per cultivar to perform the following juice analysis. The total soluble solid (TSS) content expressed in Brix was measured three times using an RFM710 refractometer (Bellingham+ Stanley ® , Weilheim in Oberbayern, Germany). The acidity (AC) expressed in percent (gram of citric acid per 100 g of juice) was measured three times using an 855 Robotic Titrosampler, (ΩMetrohm ® , Herisau, Switzerland).
The fruit base and the cross-section of the eight most representative cultivar in terms of diversity were photographed and graded based on the same scale.

Statistical Analysis
The distribution of each parameter (W, CCI, NS, PT, TSS, AC and PDED) was represented using a violin plot coupled with a boxplot using the R (v4.0.1) package ggplot2 [39]. Significant differences within cultivars were calculated for each parameter using one-way analysis of variance (ANOVA) and Tukey's test using the agricolae package in R [40].
To decipher the structure of our cultivars based on all the phenotypic parameters, we performed principal component analysis (PCA) using the following packages in R: ade4, FactoMineR and factoextra [41][42][43]. To perform the PEO extraction the fruits were harvested in January 2019 and handpeeled in order to obtain approximately 250 g of fresh material for the forty-four sour orange accessions. The fruits were picked randomly from three different trees.
The LEO extraction was conducted with 100 g of leaves at their maximum development point that were harvested in June 2019 from three different trees of the fourty-four cultivars (except "Bizzaria").
A second LEO extraction was conducted one year later (June 2020) on three different trees for each of six cultivars to confirm the reproducibility of the previous leaf analysis and to asses cultivar variability. The six cultivars were chosen because they were representative of the range of chemical diversity found in the first LEO experiment.

Hydro Distillation
Hydrodistillation, 250 g of fruit peel material was blended with distilled water oneminute using a blender model 1300W (Magimix, Vincennes, France). The leaf materiel was not blended.
The samples were reacted in a 2 L wide neck flask with a final volume of one liter (sample and distilled water) and heated for two and a half hours using a heating mantle model EM2000/CE (Eletrothermal ® , London, England) thermostat 7. The essential oil was collected using a classical Clevenger apparatus. The Clevenger apparatus was cooled using a refrigerated fluid (mix of glycol/water) cooled at 4 • C and moved by a minichiller model C20 (Huber ® , Offenburg, Germany).
Then the essential oils were stored in overfull 300 µL tainted vial and stored at −20 • C before further analysis.

Essential Oil Analysis Using Gas Chromatography and Gas Chomatrography-Mass Spectrometry
Gas chromatography analyses were performed on a Clarus 500 gas chromatograph (PerkinElmer, Waltham, MA, USA) equipped with flame ionization detector and equipped with two fused silica gel capillary columns (50 m, 22 mm id, film thickness 0.25 µm), BP-1 (polydimethylsiloxane) and BP-20 (polyethylene glycol). The oven temperature was programmed from 60 to 220 • C at 2 • C/min and then held isothermal at 220 • C for 20 min, with injector temperature 250 • C, detector temperature 250 • C, carrier gas hydrogen (1.0 mL/min), and split 1/60. The relative proportions of the oils constituents were expressed as percentages obtained by peak area normalization, without using correcting factors. Retention indices were determined relatives to the retention times of a series of n-alkanes (C7-C28) with linear interpolation ("Target Compounds" software of Perkin Elmer).
Gas chromatography coupled with mass spectrometry were conducted with a Tur-boMass quadrupole detector (Perkin Elmer, Waltham, MA, USA, directly coupled to an Autosystem XL (Perkin Elmer), equipped with a fused silica gel capillary column (50 m, 0.22 mm id, film thickness 0.25 µm), (BP-1 polydimethylsiloxane). Carrier gas, helium at 0.8 mL/min; split 1/75; injection volume 0.5 µL; injector temperature, 250 • C; energy ionization, 70 eV; electron ionization mass spectra were acquired over the mass range 40-400 Da. The identification of components was based: on comparison of their gas chromatography retention indices on polar and apolar columns, determined relative to the retention times of a series of n-alkanes with linear interpolation with those of authentic compounds and literature data; on computer matching against National Institute of Standards and Technology (NIST) commercial mass spectral library and by comparison of spectra with literature data. For further information were refer to the publication by Luro et al. [23].

Statistical Analysis
The structure of all the LEO cultivars was graphically represented using a heatmap. The analysis was performed using the package gplots in R [44]. The LEO diversity of six cultivars with three biological repetition was graphically represented using a cluster dendrogram with the hierarchical clustering method. The analysis was performed in R using the packages ggplot2, ade4, FactoMineR and factoextra [39,[41][42][43]. ANOVA followed by Tukey's test was conducted on the ten main compounds of the LEO to statistically assess of the observed variability using the agricolae package in R [40].

Tests, Panelists and Descriptors
Sensorial analysis was performed on the PEO of thirty-nine regular sour oranges (genotypically identical varieties). A check all that apply (CATA) questionnaire was used. Twenty-nine descriptors were selected during consensus training sessions of the Cointreau Expert panel, on a few randomly selected samples of bitter orange essential oils. These descriptors are terms that the Expert panel already uses for Cointreau distillates already identified during CG-O sessions on the distillate (Table S2). The panelists (ten women and eight men aged between 22 and 55) are people from the laboratory who control the quality of the liquids during the manufacturing process and participate in daily tastings.
The pure distilled PEO sample was presented in tainted vials to the panelists. Each PEO sample was identified by a code making it unidentifiable by panelists and given in a random order without prior information. The panelist had one minute to smell the sample using a test strip and tick the perceived aromatic characteristics in the provided chart. The question asked to the panelists is the following: Which of the twenty-nine descriptors characterize the aroma of the sample? Five distinct sessions were conducted for the analysis of the thirty-nine samples.

Statistical Analysis
The data from each judge and each session were gathered and transformed into a unique matrix with attributes coded as (1) detected by a panelist and (0) not detected by the panelist for each descriptor, panelist and cultivar. The previous matrix was transformed into a contingency table for further analysis.
Cochran's Q-test (p ≤ 0.05) was applied to the raw binary matrix using the RVAide-Memoire package in R to determine significant differences among cultivars for each sensory attribute (Table S2) [45].
Factorial correspondence analysis (FCA) was performed on the contingency table (with only the seven attributes that were significantly different between cultivars according to Cochran's Q-test) using R (v4.0.1) software to detect clustering and to establish relationships between cultivars and sensory attributes. Pearson's chi squared test (p ≤ 0.05) was performed on the contingency table (with only the seven attributes that were significantly different between cultivars according to Cochran's Q-test) using R software to determine the existence of a statistical relationship between sensory attributes and cultivars.
2.5. Genetic Diversity Analysis 2.5.1. DNA Extraction DNA was extracted from 50 mg of leaves of the forty-five sour orange cultivars using a DNeasy ® Plant Mini kit (Qiagen, Hilden, Germany) following the manufacturer's protocol. The DNA quality and concentration were controlled twice using a Nanodrop 2000 spectrophotometer (ThermoFischer Scientific, Waltham, MA, USA) and by electrophoresis on an agarose gel. In total forty-nine DNA samples were extracted from forty-five accessions of sour orange and four other accessions representing the ancestors of commonly cultivated Citrus species including: Poncire commum citron (C. medica), Cleopatra mandarin (C. reticulata), Chandler pummelo (C. maxima) and micrantha (C. micrantha).

SSR Genotyping
Polymerase chain reaction (PCR) was performed in 20 µL reaction containing 8 ng of DNA, 0.2 µM of each primer, DreamTaq™ Hot Start PCR Master Mix (ThermoFisher Scientific Waltham, MA, USA) containing Taq polymerase and all other reagents. PCR amplification was performed in a Primus 96+ thermocycler (MWG-Biotech ® , Luxembourg, Luxembourg), and amplicons were analyzed as described by Luro et al., with the ten pairs of primers (Table S3) [46].
The genetic structure of sour oranges based on allelic data was studied by principal component analysis (PCA) on allelic data using the factoextra package in R [43]. The PCA coordinates of the first three axes of all individuals were plotted on a 3D scatterplot using the package scatterplot3d in R [47].

SNP Genotyping
Fifty-four SNP markers were used and included eighteen SNP markers (two per chromosome) for detecting specific alleles for each ancestral species (Table S4). Genotyping was performed with the KasPar competitive, allele-specific dual Förster resonance energy transfer (FRET)-based assay according to the method described by Curk et al. [48]. The three ancestral species used as positive controls were C. maxima, C. reticulata and C. medica. For each cultivar, the total percentage of ancestral alleles was calculated based on raw KasPar data and plotted on a stacked bar graph using the package ggplot2 in R [38].
For each cultivar, the heterozygous and homozygous allele frequencies of the three basic taxa were plotted on a split bar chart.

Genetic Diversity
A main group composed of thirty-nine cultivars showed no genetic variability based on the ten SSR markers data (Table S3); these cultivars, represented the core group of sour oranges and were positioned close to the representative varieties of mandarin and pummelo on the first two PCA axes ( Figure 1). Six distinct genotypes with variable genetic distances from the main group emerged from the analysis. The "Fraser Seville" and "Australian" cultivars displayed relationships with pummelo and mandarin and were probably closer to mandarin as shown by the third axis. They differed slightly from each other by only one marker (MEST375), and they had no alleles in common with regular sour oranges for two markers (TAA41, MEST469). The "Gou Tou" and "Tosu" cultivars had the same genetic profile for three markers (MEST15, MEST88 and MEST488) and had no alleles in common with regular sour oranges. These two cultivars appeared to be less related to the representative varieties of mandarin and pummelo than to the regular sour orange group and were differentiated by the second axis. Finally, "Chinensis" and "Chinensis de semis" were the most distant genotypes from the sour orange main group and both were related to "Poncire commun" citron.
By using diagnostic SNP markers that can differentiate the ancestral species at each studied locus, four genotypes were observed among the sour orange set ( Figure 2a and Table S4 for data for each marker). In addition, the frequency of diagnostic alleles with homozygosity and heterozygosity allowed the identification of potential parents (Figure 2b).
The genome of the thirty-nine regular sour oranges previously characterized by the same SSR markers was composed of 41.7% C. maxima, 47.2% C. reticulata and 11.1% citrus of unknown origin. The frequencies of heterozygous markers for C. reticulata and C. maxima were high at 0.96 and 0.61 respectively, suggesting a direct hybrid origin between mandarin and pummelo. of unknown origin. The frequencies of heterozygous markers for C. reticulata and C. maxima were high at 0.96 and 0.61 respectively, suggesting a direct hybrid origin between mandarin and pummelo. "Australian" (or "Smooth Flat Seville") and "Fraser Seville" had the same constitution, with equal proportions of C. maxima and C. reticulata alleles (52.8 and 47.2%, respectively). The frequencies heterozygous markers for of C. reticulata and C. maxima were high at 0.94 and 0.72, respectively, suggesting that those two cultivars were most likely direct hybrids between mandarin and pummelo. "Gou Tou" and "Tosu" also presented an identical genotype characterized by 33.3% C. maxima, 50.0% C. reticulata and 16.7% citrus of unknown origin. This repartitioning of the allelic parentage differed from the phylogenomic profile of the regular sour orange due to the lower proportion of C. maxima and the higher proportion of alleles of unknown origin. The frequencies of heterozygous markers C. reticulata and C. maxima were high (1 and 0.56, respectively), suggesting a direct hybrid origin between mandarin and pummelo.
The proportions of ancestral species alleles in the "Chinensis" (8.3% C. maxima, 27.8% C. reticulate and 58.3% C. medica) and "Chinensis de semis" (2.8% C. maxima, 8.3% C. reticulate and 66.67% C. medica) genotypes clearly identified citron as one of the parents of those cultivars. The second parent remains unidentified. The homozygous and heterozygous markers frequencies were 0.22/0.72 and 0.50/0.33 respectively, suggesting more than one fecundation event with citron or possibly one fecundation followed by self-fecundation. Pummelo markers were found only in heterozygous state in both cultivars with frequencies of 0.17 and 0.06 for "Chinensis" and "Chinensis de semis", respectively. The frequency of heterozygous mandarin markers were higher for "Chinensis" (0.44) and lower for "Chinensis de semis" (0.17). Interestingly "Chinensis" had a homozygous mandarin markers frequency of 0.06, whereas "Chinensis de semis" had no homozygous markers.
In conclusion, and for the rest of the study, the thirty-nine cultivars with an identical profile were considered C. × aurantium var. aurantium according to the new phylogenomic classification of regular sour oranges [16].

Phenotypic Description of the Fruit
The phenotypes of the fruits were highly variable between cultivars in terms of size, pericarp color, aspect and internal structure (Figure 3). This diversity in appearance was characterized by the measurements of five morphological characters, one skin character "Australian" (or "Smooth Flat Seville") and "Fraser Seville" had the same constitution, with equal proportions of C. maxima and C. reticulata alleles (52.8 and 47.2%, respectively). The frequencies heterozygous markers for of C. reticulata and C. maxima were high at 0.94 and 0.72, respectively, suggesting that those two cultivars were most likely direct hybrids between mandarin and pummelo. "Gou Tou" and "Tosu" also presented an identical genotype characterized by 33.3% C. maxima, 50.0% C. reticulata and 16.7% citrus of unknown origin. This repartitioning of the allelic parentage differed from the phylogenomic profile of the regular sour orange due to the lower proportion of C. maxima and the higher proportion of alleles of unknown origin. The frequencies of heterozygous markers C. reticulata and C. maxima were high (1 and 0.56, respectively), suggesting a direct hybrid origin between mandarin and pummelo.
The proportions of ancestral species alleles in the "Chinensis" (8.3% C. maxima, 27.8% C. reticulate and 58.3% C. medica) and "Chinensis de semis" (2.8% C. maxima, 8.3% C. reticulate and 66.67% C. medica) genotypes clearly identified citron as one of the parents of those cultivars. The second parent remains unidentified. The homozygous and heterozygous markers frequencies were 0.22/0.72 and 0.50/0.33 respectively, suggesting more than one fecundation event with citron or possibly one fecundation followed by self-fecundation. Pummelo markers were found only in heterozygous state in both cultivars with frequencies of 0.17 and 0.06 for "Chinensis" and "Chinensis de semis", respectively. The frequency of heterozygous mandarin markers were higher for "Chinensis" (0.44) and lower for "Chinensis de semis" (0.17). Interestingly "Chinensis" had a homozygous mandarin markers frequency of 0.06, whereas "Chinensis de semis" had no homozygous markers.
In conclusion, and for the rest of the study, the thirty-nine cultivars with an identical profile were considered C. × aurantium var. aurantium according to the new phylogenomic classification of regular sour oranges [16].

Phenotypic Description of the Fruit
The phenotypes of the fruits were highly variable between cultivars in terms of size, pericarp color, aspect and internal structure ( Figure 3). This diversity in appearance was characterized by the measurements of five morphological characters, one skin character and two chemical attributes of juice ( Figure 4 and Table S5). Most of the time the observable phenotypic diversity was in accordance with the genotypic origin. However, the regular sour orange group displayed important phenotypic diversity despite its lack of genetic diversity.
The thirty-seven regular sour orange cultivars ("Bizzaria" and "A fleurs grosses Carle" were removed from the initial set due to a lack of fruits) visually displayed an important phenotypic diversity in terms of size, color and shape. The weight (W) varied between 23 and 260 g. "Chinotto" had the lowest weight and "Bouquetier" had the highest weight. The citrus color index (CCI) varied between 2.2 and 13.4, with a mean value of 8.7, corresponding to a reddish orange color. The cultivar "Variegated" produced a yellow fruit with orange stripes (Figure 3). The two autotetraploid cultivars "N • 10_4X_SG" and "N • 11_4X_SG" presented an altered phenotype with the highest CCI values (13.37 and 12.22). The mean number of segments (NS) varied between 6.8 and 18.2. The distribution was not homogenous because the three outgroups corresponded to "Bouquetier" cultivars, which significantly differed from the others, which had almost twice the number of endocarp sections. The mean peel thickness (PT) varied between "Chinotto" (2.7 mm) and the tetraploid "N • 10_4X_SG" (10.5 mm). The mean shape value corresponding to the ratio PD/ED (polar diameter/equatorial diameter) varied between flattened fruits (0.63-0.80) and round fruits (close to 1). The distribution was homogenous around the median except for the "Bouquetier" cultivars, which were significantly more flattened than the others. The total soluble solids content (TSS) varied between 6.8 and 10.7 • Brix, and the acidity (AC) varied between 0.1 and 4.6%, with a mean value of 2.3%. Low acidity levels (<0.1%) characterized "Small leaves Chinotto", "Sweet", "Curacao", "Alibert hyb 12" and "Bitter sweet". The TSS was the character with the largest variation or dispersed distribution The thirty-seven regular sour orange cultivars ("Bizzaria" and "A fleurs grosses Carle" were removed from the initial set due to a lack of fruits) visually displayed an important phenotypic diversity in terms of size, color and shape. The weight (W) varied between 23 and 260 g. "Chinotto" had the lowest weight and "Bouquetier" had the highest weight. The citrus color index (CCI) varied between 2.2 and 13.4, with a mean value of 8.7, corresponding to a reddish orange color. The cultivar "Variegated" produced a yellow fruit with orange stripes (Figure 3). The two autotetraploid cultivars "N°10_4X_SG" and "N°11_4X_SG" presented an altered phenotype with the highest CCI values (13.37 and 12.22). The mean number of segments (NS) varied between 6.8 and 18.2. The distribution was not homogenous because the three outgroups corresponded to "Bouquetier" cultivars, which significantly differed from the others, which had almost twice the number of endocarp sections. The mean peel thickness (PT) varied between "Chinotto" (2.7 mm) and the tetraploid "N°10_4X_SG" (10.5 mm). The mean shape value corresponding to the ratio PD/ED (polar diameter / equatorial diameter) varied between flattened fruits (0.63-0.80) and round fruits (close to 1). The distribution was homogenous around the median except for the "Bouquetier" cultivars, which were significantly more flattened than the others. The total soluble solids content (TSS) varied between 6.8 and 10.7°Brix, and the acidity (AC) varied between 0.1 and 4.6%, with a mean value of 2.3%. Low acidity levels (<0.1%) characterized "Small leaves Chinotto", "Sweet", "Curacao", "Alibert hyb 12" and "Bitter sweet". The TSS was the character with the largest variation or dispersed distribution among the currently observed characteristics. In contrast to TSS and other characteristics, acidity had a normal distribution, with 50% of the cultivars close to the mean value. The other characteristics demonstrated heterogeneous distributions.
The comparative variation of the two juice characteristics (acidity and sugar content) presents a scattered point cloud of varietal diversity, i.e., a representation of numerous The comparative variation of the two juice characteristics (acidity and sugar content) presents a scattered point cloud of varietal diversity, i.e., a representation of numerous combinations of phenotypic profiles ( Figure 5). The varietal diversity of the sour oranges was segmented into five chemotypes including four groups and one single variety (Figure 4). Group I included six varieties with low acidity (<1%); group II included eight varieties with intermediate acidity (2.2-3.5%) and high TSS (>9.5 • Brix). Group III included varieties with intermediate acidity (1.5-3.5%) but with low TSS (<9 • Brix); and group IV included six varieties, including three sour oranges with a unique genetic profile and three hybrids with very high acidity (>4%) and TSS (>8.5 • Brix). "Chinensis" is the only variety with very high acidity (>4%) and low TSS (<7 • Brix). The dispersion of this diversity suggests an independence of these two traits. 4). Group I included six varieties with low acidity (<1%); group II included eight varieties with intermediate acidity (2.2-3.5%) and high TSS (>9.5 °Brix). Group III included varieties with intermediate acidity (1.5-3.5%) but with low TSS (<9 °Brix); and group IV included six varieties, including three sour oranges with a unique genetic profile and three hybrids with very high acidity (>4%) and TSS (>8.5 °Brix). "Chinensis" is the only variety with very high acidity (>4%) and low TSS (<7 °Brix). The dispersion of this diversity suggests an independence of these two traits. "Gou Tou" and "Tosu" were extremely similar to most regular sour oranges in many aspects (W, NS, PT, TSS and PD/ED) but differed significantly due to its yellowish flavedo and higher juice acidity (AC). "Australian" and "Fraser Seville" were similar with to regular sour orange in PT, CCI and AC characteristics but had larger fruits (W), a higher number of segments (NS), a slightly more flattened shape (PD/ED) and a relatively higher TSS content. "Chinensis" and "Chinensis de semis" were extremely different from the regular sour oranges. Indeed, the fruit were significantly smaller, ellipsoid, and more acidic with a bright yellow flavedo. There are characteristics that were not measured in this study, that are common between these two cultivars and the "Chinotto" type, such as a dwarf of tree shape, the small size and shape of the leaves (myrtle aspect), and the small distance separating leaves on the branch. "Gou Tou" and "Tosu" were extremely similar to most regular sour oranges in many aspects (W, NS, PT, TSS and PD/ED) but differed significantly due to its yellowish flavedo and higher juice acidity (AC). "Australian" and "Fraser Seville" were similar with to regular sour orange in PT, CCI and AC characteristics but had larger fruits (W), a higher number of segments (NS), a slightly more flattened shape (PD/ED) and a relatively higher TSS content. "Chinensis" and "Chinensis de semis" were extremely different from the regular sour oranges. Indeed, the fruit were significantly smaller, ellipsoid, and more acidic with a bright yellow flavedo. There are characteristics that were not measured in this study, that are common between these two cultivars and the "Chinotto" type, such as a dwarf of tree shape, the small size and shape of the leaves (myrtle aspect), and the small distance separating leaves on the branch.
The first two axes of the multivariate analysis represented 57% of the overall diversity, suggesting an average level of structure ( Figure 6). Agronomy 2021, 11, x FOR PEER REVIEW 11 of 20 The first two axes of the multivariate analysis represented 57% of the overall diversity, suggesting an average level of structure ( Figure 6). Weight, skin thickness and fruit shape are the main characteristics determining this diversity. The related bitter orange cultivars (different from the basic genetic profile), had several phenotypic aspects that, also differed from those of the regular cultivars. However, some regular varieties also stand out from the lot such as "Bouquetier" (by mass and The first two axes of the multivariate analysis represented 57% of the overall diversity, suggesting an average level of structure ( Figure 6). Weight, skin thickness and fruit shape are the main characteristics determining this diversity. The related bitter orange cultivars (different from the basic genetic profile), had several phenotypic aspects that, also differed from those of the regular cultivars. However, some regular varieties also stand out from the lot such as "Bouquetier" (by mass and Weight, skin thickness and fruit shape are the main characteristics determining this diversity. The related bitter orange cultivars (different from the basic genetic profile), had several phenotypic aspects that, also differed from those of the regular cultivars. However, some regular varieties also stand out from the lot such as "Bouquetier" (by mass and number of segments), "Chinotto" and "Variegated" (by TSS, AC and weight) and the tetraploid variety "N • 10_4X_SG" (essentially by its peel thickness).

Peel Essential Oil Sensorial Analysis
Cochran's Q-test (p ≤ 0.05) was significant only for seven of the twenty-nine sensory attributes studied (fat, musky, white blossom, cooked, lemon, pomelo and fresh), suggesting that all cultivars are statistically equivalent for all other sensory attributes. The expert panel detected sensory differences between samples but this diversity was relatively low because factorial correspondence analysis (FCA) explained only 55.39% (Figure 7) of the observed variability across the first two axes. For instance, "Bitter sweet" and "N10 4XSG" stand out a slightly from the group by their 'musky' and 'pomelo' attributes, while "Apepu", "Algerie" and "Alibert" do so based on the 'white blossom' attribute.
Pearson's chi squared test performed on the contingency table of the seven attributes showed significant differences according to the Cochran Q-test and determine the existence of a statistical relationship between sensory attributes and cultivars with a p-value equal to 0.05. Unfortunately, we were unable to establish correlation between sensory attributes and chemical composition.

Leaf Essential Oil
A total of fifty-three compounds were identified in the LEO of the forty-four cultivars (except "Bizzaria") accounting for 95.35 to 99.99% of the total composition (Table S7).
The LEO composition of the thirty-eight regular (except "Bizzaria") sour oranges was almost exclusively composed of terpenes (96.71-99.77%) (Figure 8). Compared to PEO, LEO was richer and mainly composed of oxygenated compounds (88.74-94.27%). This abundance of diverse compounds was more appropriate to perform diversity analysis and to discriminate cultivars.

Leaf Essential Oil
A total of fifty-three compounds were identified in the LEO of the forty-four cultivars (except "Bizzaria") accounting for 95.35 to 99.99% of the total composition (Table S7).
The LEO composition of the thirty-eight regular (except "Bizzaria") sour oranges was almost exclusively composed of terpenes (96.71-99.77%) (Figure 8). Compared to PEO, LEO was richer and mainly composed of oxygenated compounds (88.74-94.27%). This abundance of diverse compounds was more appropriate to perform diversity analysis and to discriminate cultivars.
It should be noted that the three replicates of each variety were most often grouped together, i.e., the intravarietal variation was smaller than the intervarietal variation. Significant differences between cultivars were identified for the following compounds: βpinene, (E)-β-ocimene, geranyl acetate, linalool, linalyl acetate and neryl acetate, suggesting three cluster specific chemotypes (Figure 9 and Table S8). The main differences between these three chemotypes are mainly based on the proportions of linalool, linalyl acetate and β-pinene.  It should be noted that the three replicates of each variety were most often grouped together, i.e., the intravarietal variation was smaller than the intervarietal variation. Significant differences between cultivars were identified for the following compounds: βpinene, (E)-β-ocimene, geranyl acetate, linalool, linalyl acetate and neryl acetate, suggesting three cluster specific chemotypes (Figure 9 and Table S8). The main differences between these three chemotypes are mainly based on the proportions of linalool, linalyl acetate and β-pinene.

Discussion
The microsatellites are known to present relatively high variability. However, it was recently demonstrated in citrus that the mutation of SSR markers is a rare event between two clonally propagated cultivars [26]. This conclusion concerning extremely low/absent genetic diversity is in accordance with previous work [20 -26,49]. Studies using multilocus genotyping with dominant markers found higher diversity than studies using SSR markers [22,24]. Nevertheless, multilocus dominant markers suffer from a lack of repeatability and tend to overestimate the variability [50].
The relatively low diversity observed in the sour orange gene pool could be explained by the spread of a unique sour orange cultivar worldwide from China. In addition, other factors enabled the vegetative propagation of this unique clone, such as apomixes (by somatic embryogenesis) and grafting implying the conservation of a desired genotype.
Regular sour oranges have diversified by mutations or epigenetic modifications. These mechanisms have generated very important phenotypic variations in fruit traits. Fruit characteristics are directly submitted to disruptive selection for new fruit types by humans, which could better explain the greater diversity of fruit morphology than the composition of essential oils. We also observed important variability in the sweetness and acidity of the fruit pulp. This AC/TSS variability in the sour orange group seems to be as high as that observed by Luro et al. 2011 [19] in the mandarin group, where sexual crossing has been the main driver of genetic diversification [51]. The tetraploid cultivars presented a modified phenotype (highest TSS, PT, CCI) but a similar essential oil composition, in accordance with previous studies [52,53].
Concerning the aromatic characteristics, the PEO and LEO compositions of regular sour oranges were slightly influenced by the cultivar, consistent with the observed lack of genetic diversity and inconsistent with the high pomological diversity. This result agrees with those of previous studies [8,17]. However, the panel of sensory analysts detected variations in the fragrance of fruit EOs from different cultivars without allowing correlations to be established between aromatic compounds and odor profiles. Nonetheless, we only identified major compounds with a relative composition higher than 0.01% and may have missed odor active compounds at trace concentrations with lower odor thresholds [46]. For example, we were unable to detect compounds with a high sensory impact that were identified in previous studies such as (E,Z)-2,4-decadienal, cedrol, trans-dec-2-enal and β-elemol [35,36]. In addition, citrus flavor is known to be mainly influenced by minor compounds, so it may be valuable to perform GC-olfactometry of sour orange cultivars to distinguish them based on their odor profile and possibly identify compounds specific to certain cultivars [54]. Indeed, a point mutation has been observed to significantly alter some terpene synthase activities and could modify aromatic profiles [55,56].
The analysis of the LEO composition of a few cultivars, with biological repetitions (different trees) proved the existence of a variation, however small, of the profiles between cultivars.
If we consider that the appellation 'sour orange' only concerns the forms derived from the initial genotype resulting from a pummelo x mandarin cross (C. × aurantium var. aurantium), then the diversity of SSR markers is none. On the other hand, if the term 'sour orange' includes morphotypes close to that of sour orange, then we consider that the sexual crossing also contributed to diversifying this group of citrus fruits. In this second option what would be the limits of belonging to the group of sour oranges? Can similar crosses (pummelo × mandarin) with different parental genotypes lead to hybrids that would be named 'sour orange'? Moreover, we have shown that there is not one regular sour orange morphotype but several such as "Chinotto" cultivars, which have very small fruits, resulting from mutations. "Gou Tou" and "Tosu" cultivars are morphologically, chemically and genetically identical and they probably came from the same cross a clonal selection. The complete and high heterozygosity of diagnostic SNPs in mandarins and pummelos respectively, suggest that these two cultivars also resulted from direct hybridization between a pummelo and a mandarin. Moreover, both cultivars have no alleles in common with regular sour oranges for three SSR markers, revealing a different origin than regular sour oranges. Considering these new elements, we propose to classify these two cultivars as C. × aurantium var. neoaurantium according to the new phylogenomic classification and species name from the classification of Tanaka [16].
The cultivars "Australian" (or "Smooth flat Seville") and "Fraser Seville" are both extremely similar on many aspects. Genetically they differ only by one marker. As previously discussed, those two cultivars probably originated from the same cross but from a different fecundation event. It was previously hypothesized that these two cultivars were not related to sour orange and that "Australian" originated from a sweet orange and grapefruit cross [22,24,25], however the SNP analysis again suggests a direct hybrid origin between a mandarin and a pummelo. Considering these elements, we propose classifying these two cultivars according to the new phylogenomic classification [16] as C. × aurantium var. superaurantium due to their large fruit size.
"Chinensis" and "Chinensis de semis" cultivars are the most morphologically, chemically and genetically distant from regular sour oranges. Based on molecular markers (SSRs and SNPs), their exact origin remains undetermined. Their PEO profile (limonene/γterpinene) and LEO profile limonene/neral/geranial confirm the genetic data and the tight relationship with C. medica [57]. The fruit and tree shapes of these cultivars present the same dwarf aspect as "Chinotto" suggesting "Chinotto" as a distant parent with additional C. medica properties (fruit shape, odor, color). These cultivars, being pummelo, mandarin and citron admixture, may be classified as C. × limon var. chinensis in the new phylogenomic classification.

Conclusions
Among sour oranges, three main genetic groups of C. × aurantium have been identified: a regular sour orange group represented by 39 varieties without intercultivar molecular diversity, a "Tosu" and "Gou Tou" pair, and "Australian" and "Fraser Seville" pair. The molecular marker analysis also revealed a strong contribution of C. medica to the "Chinensis" group discarding it from C. × aurantium and repositioning it with C. × limon. The morphological diversity among the regular sour oranges based only on mutation or epigenetics suggested the intervention of human selection on a variety of phenotypic traits. The main contributor to the chemical diversity was shared between the four mentioned groups. However, among the regular sour oranges we identified a significantly different LEO composition and significant different PEO aromatic profiles, suggesting the existence of chemical diversity, even in clonally propagated species. This result indicates that cultivar collections could be a valuable source of small variations in sour orange aroma, even for clonal selection. However, it seems that the mutation/selection process did not diversify the aromatic composition as much as it did the phenotype of the fruit and the chemical composition of its juice.