Dataset of Targeted Metabolite Analysis for Five Taxanes of Hellenic Taxus baccata L. Populations

: Novel primary sources of one of the world’s leading anticancer agent, paclitaxel, as well as of other antineoplastic taxanes such as 10-deacetylbaccatin-III, are needed to meet an increasing demand. Among the Taxus species the promise of Taxus baccata L. (European or English yew) has been documented. In this study, the metabolite analysis of two marginal T. baccata populations in Greece (Mt. Cholomon and Mt. Olympus), located at the southeastern edge of the species natural distribution, are being explored. A targeted liquid chromatography – mass spectrometry (LC-MS/MS) analysis was used to determine the content of 10-deacetylbaccatin III, baccatin III, 10-deacetyltaxol, paclitaxel and cephalomannine in the needles of each of the populations from three sampling periods (spring, summer and winter). This is the first survey to generate a taxane targeted metabolite data set, since it derives from Hellenic natural populations that have not been explored before. Furthermore, it has used an extensive sample design in order to evaluate chemodiversity at the population level. The analysis revealed significant levels of chemodiversity within and among the investigated populations and significant seasonal variation that could be exploited for the selection of superior germplasm native to Greece, for yew plantations and further exploitation which is necessary for the production of important taxanes.


Summary
Taxus L. species (yew), a slow-growing and shade-tolerant dioecious conifer tree, are the source of one of the world's leading anticancer agent, paclitaxel [1,2]. It was first isolated from the bark of Taxus brevifolia Nutt [3] and it is now used for the treatment of several forms of breast, lung, blood and gynecological cancers [4]. Nowadays, different plant parts of various Taxus species, such as T. baccata needles, are being used for the collection of this diterpenoid alkaloid. While the demands of paclitaxel are high, the yield is low (0.001-0.05%). Additionally, for the treatment of one cancer patient, almost eight 60-year-old yew trees are needed [5]. Considering the above, it is of a high interest to explore new sources of taxanes.
There is a high inter-and intraspecific variability in the taxane content [7,10] and it has been shown that this variability can depend on the season and taxoid type. The concentration of 10-DAB of firstand second-year needles of T. baccata, grown in a Polish garden was highest in January, and lowest in April [11], where in a study in Tehran, Iran [6], the highest concentration of yew needles was obtained in August. The bark of T. baccata grown in a Czech Republic Forest showed that the highest concentration of PACL and three other taxanes was in October and the lowest in January. [12] found that 17 Taxus x media cultivars had large variation in the content of 10-DAB, BAC, CEPH and PACL.
Due to the fact that T. baccata populations in Greece are located at the southeastern edge of the species natural distribution we anticipate that there is a considerable possibility that their genetic diversity presents unique characteristics, in line to the rear edge theory expectations [13,14]. Until now, no other studies exist that embody quantitative data from Greek T. baccata populations with variation of taxane compounds. This first dataset will: (1) provide an assessment of the variation on taxane compounds which is associated with genotype and seasonal effect, (2) serve as a baseline for the selection of superior germplasm, native to Greece and 3) assist future breeding strategies towards high taxane production.
In this study, two natural yew populations were chosen, both located in Northern Greece, Mt Cholomon and Mt Olympus, in order to estimate the levels of five important taxanes; 10-DAB (Chemical Abstracts Service "CAS": 32981-86-5), 10-deacetyltaxol (10-DTAX; CAS: 78432-77-6), BAC (CAS: 27548-93-2), CEPH (CAS: 71610-00-9) and PACL (CAS: 33069-62-4). An Ultra Performance Liquid Chromatography Mass Spectrometry (UPLC-MS/MS) analysis was adapted from [15] using needle samples that were collected from 27 trees of each population, from north facing and well shaded branches. The sampling was repeated three times: (1) at the beginning of May after the end of the flowering period, (2) in late August at the end of the growing season and (3) in early December at the beginning of winter in order to estimate population form one hand and seasonal variation of taxanes compounds.
Analysis of the current dataset revealed significant levels of chemodiversity within and among the selected populations, and significant seasonal variation.

Data Description
Data were organized in tabular form in an MS Excel® spreadsheet, which is available as Supplementary Table 1. Each line in the dataset presents information from the LC -MS/MS needle analysis of one tree per population and for each season.
The first column (A), with the heading "Sample name" displays the entry identification number of each sample into the mass spectrometry. Column "B" (population), indicates the population that each sample belongs to (either Mt Cholomon or Mt Olympus). The third column (C), with the heading "Season" presents the season in which the samples were collected. Columns D, E, F, G, and H with the headings "10-deacetylbaccatin III_Q1", "10-deacetyltaxol_Q1", "baccatin III_Q1", "cephalomannine_Q1" and "paclitaxel_Q1" respectively, display the quantitative data of each of the five taxanes. The concentration of BAC in column (F), in a number of samples was too low (indicated as "N/A"), i.e., lower than the detection or even quantification limit for the LC-MS/MS analysis followed. Statistically this is usually considered as 'zero' and in the Supplementary Table 1 is appropriately presented as 'not identified' -"N/A". Table 1 shows the multiple reactions monitoring (MRM) parameters (RT: retention time, R2: Rsquared, ES: electrospray ion mode, electrospray molecular ion, qualifier and quantifier ions) of the five taxanes. Furthermore, Figure 1 shows the calibration curves of the five targeted compounds that were created with the use of external standards. On each of the calibration curves information of the regression model and the coefficient of determination (R-squared) are given. Ten blanks and 10 standard mix (std_mix) were inserted every 17 samples, in order to optimize the quality and precision of the API 5500 triple-quadrupole mass spectrometer.

Experimental Sites and Collection
Needle clippings were collected from 27 healthy trees of two Hellenic natural T. baccata L. populations: Mt Cholomon (Taxiarchis; 40° 26.299' N, 23° 32.374 E; altitude: 804m asl) and Mt Olympus (Litochoro; 40° 5.624' N, 22° 25.797' E, altitude: 819m asl). Selected trees were spatially distributed in order to avoid sampling related individuals, while isolated trees were not sampled. Samples were collected from three sampling periods: (1) at the beginning of May after the end of the flowering period, (2) in late August at the end of the growing season and (3) in early December at the beginning of winter.
The climate of Mt Cholomon is classified as subhumid Mediterranean. The mean annual air temperature is 11.1 °C and the annual rainfall is 767mm [16]. The main rock formation of the area is mica-and talk-schist [17].
In the case for Mt Olympus the climate is characterized as sub-Mediterranean, with a mean annual air temperature of 8.9 °C and an annual rainfall of 688.7mm [18]. Lastly, it is mainly formed of dolomitic limestone and marbles [19].

Chemicals
General laboratory reagents were purchased from Sigma-Aldrich and Fisher Scientific. Deionized water was purified in loco with an Arium® purification system (Sartorius AG, Goettingen, Germany). Authentic standards of taxanes, namely 10-DAB, 10-DTAX, BAC, CEPH and PACL were purchased from TransMIT PlantMetaChem (Gießen, Germany). Stock solutions of taxanes were prepared by dissolving the standards in methanol (LC-MS grade).

Taxane Extraction
The extraction of taxanes was performed according to the method of [20] with modifications. Fresh needles collected from plants were freeze-dried (Freeze-dryer Alpha 1-2 LD plus, Christ, Germany; at −24 °C). The dried needles were milled into homogenous powders. Samples (0.5 g) were mixed with 4 mL of 60% acetone, stirred on an orbital shaker for 20 mins, and centrifuged (5 min; 1800× g; 4 °C). This procedure was repeated three times. The supernatants were pooled, and the acetone was evaporated under reduced pressure at T ≤38 °C. The remaining aqueous phase was mixed with 4 mL of dichloromethane. The mixture was vortexed for about 1 min and centrifuged (2 min; 900× g; 4 °C). This procedure was repeated twice. The organic layers were collected and dried under reduced pressure at T ≤38 °C. The dry residue was dissolved in 2 mL of LC-MS methanol, filtered through PTFE 0.22 μm membranes into glass vials, and stored at −80 °C until further analysis.
The LC-MS/MS analysis was performed on an API 5500 triple-quadrupole mass spectrometer (Applied Biosystems/MDS Sciex, TOR, Canada). The instrument was operated using an electrospray source in positive ion mode. The electrospray ionization (ESI) parameters were as follows: the spray voltage was set at 5500 V for positive mode, the source temperature was set at 250 °C, the nebulizer gas (Gas 1) and heater gas (Gas 2) at 40 and 20 psi respectively. UHP nitrogen (99.999%) was used as both curtain and collision gas (CAD) at 20 and 9 psi respectively.) Analyst™ software version 1.6.1 (Applera Corporation, Norwalk, CT, USA) was used for instrument control and data acquisition. Compounds were identified by comparing the retention time and the spectral characteristic of the peaks with those of authentic standards. Multiple reactions monitoring (MRM) was used for quantification based on the peak area of the samples ( Table 1).
The data of Table S1 were calculated based on the calibration curves presented in Figure 1. The Linear range for the taxanes studied (expressed in mg/L) was: 10-DAB, 0.03 to 57.8; BAC, 0.03 to 57.4; 10-DTAX, 0.03 to 7.23; PACL, 0.06 to 29.45; CEPH, 0.06 to 28.9. As internal standard, cinnamic acid was used and the recovery was 91.4%, with standard deviation (STDV) 2.50%. The reproducibility was estimated and expressed with a value of RSD = 1.72, as relative standard deviation. Samples were diluted ten times and analyzed for 10-DAB and 10-DTAX in order to retrieve intensities of absorbance within the linear range of the calibration curves. Concentrations are expressed as mg L-1.   Funding: This research is co-financed by Greece and the European Union (European Social Fund-ESF) through the Operational Program «Human Resources Development, Education and Lifelong Learning 2014-2020» in the context of the project "Genetics, epigenetics and metabolomic analysis of European yew (Taxus baccata) in order to select plant material for the production of taxol and other antineoplastic taxanes in Greece" (MIS 5004922).