Ancient Mitogenomes Suggest Stable Mitochondrial Clades of the Siberian Roe Deer

The roe deer (Capreolus spp.) has been present in China since the early Pleistocene. Despite abundant fossils available for detailed morphological analyses, little is known about the phylogenetic relationships of the fossil individuals to contemporary roe deer. We generated near-complete mitochondrial genomes for four roe deer remains from Northeastern China to explore the genetic connection of the ancient roe deer to the extant populations and to investigate the evolutionary history and population dynamics of this species. Phylogenetic analyses indicated the four ancient samples fall into three out of four different haplogroups of the Siberian roe deer. Haplogroup C, distributed throughout Eurasia, have existed in Northeastern China since at least the Late Pleistocene, while haplogroup A and D, found in the east of Lake Baikal, emerged in Northeastern China after the Mid Holocene. The Bayesian estimation suggested that the first split within the Siberian roe deer occurred approximately 0.34 million years ago (Ma). Moreover, Bayesian skyline plot analyses suggested that the Siberian roe deer had a population increase between 325 and 225 thousand years ago (Kya) and suffered a transient decline between 50 and 18 Kya. This study provides novel insights into the evolutionary history and population dynamics of the roe deer.


Introduction
Roe deer (Capreolus spp.) are the only embryonic diapause species in even-toed ungulates. Based on the fossil record, they first appeared about 3.3-2.4 million years ago (Ma) in Udunga, eastern Russia [1], while molecular data suggests an origin around 4-2 Ma [2]. Capreolus contains two extant species: the European roe deer (C. capreolus) that inhabits most regions of Europe [3], and the Siberian roe deer (C. pygargus) that is distributed in the Palaearctic of Asia and parts of Eastern Europe [4].
In a molecular sense, the Siberian roe deer is less studied than the European roe deer and its intraspecific taxonomy is still debated. To date, based on cytogenetic characteristics and morphological parameters, six subspecies of the Siberian roe deer are defined: C. p. pygargus, C. p. tianschanicus, C. p. ochracea, C. p. bedfordi, C. p. mantschuricus, and C. p. melanotis [5,6], the later three of which being somewhat controversial [3,[7][8][9][10]. Nevertheless, molecular analyses based on different markers or specimens from different regions Figure 1. Distribution ranges of two roe deer species (C. capreolus and C. pygargus) [5]. The sampling site in this study is indicated by a red triangle. Other roe deer fossils are represented by dots in four different colors [17][18][19][20][21][22].

DNA Extraction
Ancient DNA extractions were performed in a dedicated ancient DNA laboratory at the China University of Geosciences (Wuhan), physically separated from post-PCR facilities. Approximately 200 mg of bone or tooth powder of each sample was digested in extraction buffer which contained 4.5 mL of EDTA (0.5 M, pH = 8) and 0.06 mL of Proteinase K (20 mg/mL), followed by incubation for 16 hours in a rotating hybridization oven at 37 °C. After centrifugation at 7000 rpm for 10 min, the supernatant was transferred into an Figure 1. Distribution ranges of two roe deer species (C. capreolus and C. pygargus) [5]. The sampling site in this study is indicated by a red triangle. Other roe deer fossils are represented by dots in four different colors [17][18][19][20][21][22].

DNA Extraction
Ancient DNA extractions were performed in a dedicated ancient DNA laboratory at the China University of Geosciences (Wuhan), physically separated from post-PCR facilities. Approximately 200 mg of bone or tooth powder of each sample was digested in extraction buffer which contained 4.5 mL of EDTA (0.5 M, pH = 8) and 0.06 mL of Proteinase K (20 mg/mL), followed by incubation for 16 h in a rotating hybridization oven at 37 • C. After centrifugation at 7000 rpm for 10 min, the supernatant was transferred into an ultrafiltration tube (Millipore, Darmstadt, Germany) and condensed to 100 µL. Finally, DNA was purified and eluted in 80 µL EB buffer using the MinElute PCR Purification Kit (Qiagen, Hilden, Germany) following the manufacturer's instructions.

Library Construction
Multiple double-stranded Illumina libraries were constructed for each sample using 20 µL DNA extract and the protocol from Meyer and Kircher [23]. We used NEB buffer 2 (New England Biolabs, Ipswich, UK), ATP (New England Biolabs, Ipswich, UK), BSA (New England Biolabs, Ipswich, UK), T4 DNA polymerase (New England Biolabs, Ipswich, UK), T4 polynucleotide kinase (New England Biolabs, Ipswich, UK), and dNTP (Tiangen, Beijing, China) in a blunt-end repair reaction system. In the Adapter Ligation Step, we ligated a 1:20 adapter dilution (Sangon Biotech, Shanghai, China), diluted using Quick Ligase buffer (New England Biolabs, Ipswich, UK) to the ends of DNA fragments. Additionally, we performed an adaptor fill-in reaction using Isothermal buffer (New England Biolabs, Ipswich, UK). Indexing PCR amplifications were prepared with Q5 Hot start High-Fidelity 2× Mater Mix (New England Biolabs, Ipswich, UK) using the following cycling protocol: 98 • C for 30 s and 17 cycles of: 98 • C for 10 s, 60 • C for 75 s, and 60 • C for 6 min. All purification procedures were performed using the MinElute PCR Purification Kit (Qiagen, Hilden, Germany) following the manufacturer's protocol. Both extraction and library blanks were performed to monitor for potential contamination. Library quality and concentration were measured using Qubit 4.0 (Invitrogen, Carlsbad, CA, USA) and TapeStation 4150 (Agilent, Santa Clara, CA, USA). Finally, indexed libraries were pooled into a single pool in equimolar ratios and sequenced on an Illumina HiSeq×10 platform at Personalbio Inc, Shanghai, China.

Data Processing and Data Sets
Raw reads were processed to trim Illumina adapter sequences using Cutadapt v1.4.2 [24], discarding reads shorter than 30 bp. Flash v. 1.2.11 [25] was used to merge overlapping read pairs. To avoid loss of reads near the break point, the trimmed reads were then mapped against the complete mitochondrial genome of a modern Siberian roe deer (GenBank accession number: NC_025271) twice with different break points using the "aln" and "samse" algorithms in Burrows-Wheeler Aligner v. 0.6.2 [26] with default parameters. Next, sequences below a mapping quality of 30 were filtered out using "view" and the alignment was sorted on the reference genome by 5 mapping position using "sort" in SAMtools v. 0.1.19 [27]. In addition, potential PCR duplicates were removed using "rmdup", and a single bam format file of each library was obtained using "merge" in SAMtools. Finally, mitochondrial consensus sequences were constructed using GENEIOUS v.10 (http: //www.geneious.com/, accessed on 25 December 2021) based on the final bam files and considering only bases with a sequencing depth of ≥2×. At every position, a nucleotide was called only if it was observed in ≥ 75% of the reads covering that site [28,29]. Read coverages across the reference were calculated using Qualimap v. 2.2.1 [30] and the evaluation of base damage for ancient DNA was performed using mapDamage2 with default parameters [31].
The four newly retrieved mitochondrial genomes were deposited in GenBank (Accession numbers: OK041025-27 and OK323229). Here, four data sets were defined to initiate different analyses.
Data set 1 comprised 22 complete mitochondrial genomes (Table S1): 11 Siberian roe deer, nine European roe deer, one Chinese water deer (Hydropotes inermis), and one moose (Alces alces cameloides) [32][33][34][35][36][37] plus the four newly obtained ancient mitochondrial genomes of our samples. To improve the accuracy of the divergence time estimation, Data set 1 retained the most complete sequences. This data set was used for phylogenetic analyses and molecular dating among roe deer populations.
Data set 2 contained 146 combined sequences of cyt b and the D-loop (Table S2) from samples with detailed geographic information: 141 modern Siberian roe deer (covering Korea, Mongolia, Kazakhstan, Siberia, and Poland) [10,13,33,[35][36][37], the four newly obtained fossil Chinese roe deer from this study, and one European roe deer which served as an outgroup. This data set, which had a high number of individuals sampled across most of the geographical range of Siberian roe deer, was used to investigate the phylogeographical structure of Siberian roe deer.
Data set 3 comprised 11 modern complete mitochondrial genomes and our four ancient near-complete mitochondrial genomes of the Siberian roe deer. Data set 4 comprised nine complete mitochondrial genomes of modern European roe deer. Data sets 3 and 4 (Table S1) were used to reconstruct the maternal demographic histories for the Siberian roe deer and the European roe deer, respectively.

Bioinformatics Analyses
Sequence alignments were carried out using MAFFT [38] on the CIPRES portal [39], and the alignments were checked manually.
We first performed maximum-likelihood (ML) phylogenetic analysis on Data set 1 with RAxML-HPC v. 8 [40] on the CIPRES portal to address the phylogenetic relationships among roe deer. As the D-loop was characterized by high interspecific variation and poorly aligned regions, it was manually eliminated before subsequent analyses. H. inermis and A. alces served as outgroups to root the tree. Considering the heterogeneity of evolutionary rate among partitions, the best fitting partitioning scheme and substitution models were determined by PartitionFinder2 [41] with the greedy search algorithm and linked branch lengths along with Bayesian Information Criterion, resulting in five partitions with the GTR+I nucleotide substitution model for most of the partitions (Table S3). For our ML analysis, node support was assessed using 500 bootstrap replicates.
We then performed Bayesian analysis on Data set 1 using BEAST 1.8.4 [42] to investigate the most recent common ancestor (TMRCA) of each lineage of roe deer. The substitution model GTR + I + G was selected through comparison of Bayesian Information Criterion scores in jModelTest v2.1 [43]. In order to calibrate our BEAST analysis, we constrained the age of two nodes based on a previous molecular dating study [44]: the TMRCA of Capreolus and A. alces with a mean of 13.9 Ma (Normal distribution, 95% interval of 15.89-11.99 Ma) and the divergence between Capreolus and H. inermis with a mean of 9.48 Ma (Normal distribution, 95% interval of 11.31-7.76 Ma). Meanwhile, we specified a strict molecular clock, and a constant population size in parameter settings. Markov Chain Monte Carlo (MCMC) sampling was carried out using 70 million iterations with sampling every 1000 steps. Runs convergence and the effective sample size (ESS > 200) were checked using Tracer v1.7 [45], discarding the first 10% as burn-in. The maximum clade credibility (MCC) tree was generated using TreeAnnotator v1.8.4 [46] and visualized in FigTree v1.4.3 (http://tree.bio.ed.ac.uk/software/figtree, accessed on 25 December 2021).
Given the limited number of complete mitochondrial genomes, we used Data set 2 (146 combined sequences of cyt b and the D-loop with detailed geographic information) to investigate the phylogeographical structure of Siberian roe deer. We grouped the locations of Data set 2 into eight regions (Table S2) [47] on the CIPRES portal to define haplogroups of different geographic populations. The parameters were the same as previous analysis, except for the substitution model, for which we used HKY+I+G as indicated by jModelTest v2.1 [43].
To construct the maternal demographic histories of the Siberian roe deer and the European roe deer, we performed Bayesian skyline plots (BSPs) on Data sets 3 and 4 using BEAST v. 1.8.4 [42]. The skyline plot was time-calibrated using a substitution clock rate with a mean of 8.91 × 10 −9 substitutions per site per year (normal distribution, 95% HPD: 10.01 × 10 −9 -7.71 × 10 −9 ) suggested by our previous Bayesian analysis on Data set 1. Optimal substitution models for two data sets were assigned by jModelTest v2.1 [43], resulting in HKY + I for Data set 3, and HKY + G for Data set 4. Additionally, the age of our ancient samples was set as the mean of the calBP in tip dates. Other parameters were set the same as previously.

Results
We successfully retrieved four near-complete mitochondrial genomes of the roe deer with mean depths of 12.7, 9.0, 14.5, and 5.9 folds, respectively (Table S4 and Figure S3). The results of mapDamage2 indicated advanced DNA fragmentation and high levels of cytosine deamination at the terminal DNA fragment ends ( Figures S4 and S5), authenticating that our sequences were not derived from modern contamination.
The phylogenetic trees computed using either RAxML-HPC or BEAST v. 1.8.4 yielded a consistent topology (Figures 2 and S6), which indicated two main clades of roe deer (the European roe deer vs. the Siberian roe deer) as previously suggested [2,33,48,49]. The Siberian roe deer clade is further divided into three distinct subclades, and subclade P3 occupies a slightly earlier diverging position, while subclades P1 and P2 are sister to each other. Interestingly, our four ancient samples fell into two different subclades of the Siberian roe deer, with CADG573 and CADG626 in subclade P2, while CADG580 and CADG842 fell within subclade P3.
The phylogenetic analysis of Data set 2 (combined cyt b and D-loop) divided the Siberian roe deer into four haplogroups (A, B, C, and D) (Figure 3a). The number of haplogroups is different from previous analysis using complete mitochondrial genomes, which indicates only three subclades (Figures 2 and S6). When comparing phylogenetic trees among analyses of Data sets 1 and 2 (Figures 2 and 3a), we observed that the phylogenetic positions of overlapping samples were consistent. We saw that haplogroup B corresponded to subclade P1, haplogroups A and D corresponded to subclade P2, and haplogroup C corresponded to subclade P3. While considering that too few complete mitochondrial genomes may lead to incomplete details of branches, and that the rapid evolutionary rate of D-loop makes it more effective to distinguish individuals with close genetic relationships, the former two results of Data sets 1 and 2 are not contradictory.
(95% HPD: 2.63-1.87 Ma), which is consistent with previous molecular results (4-2 Ma) [2,48]. The TMRCA for the European roe deer clades was estimated at 0.68 Ma (95% HPD: 0.81-0.55 Ma), while TMRCA for the Siberian roe deer clades was calculated at 0.34 Ma (95% HPD: 0.41-0.27 Ma). The subclades P1, P2 and P3 diverged at 0.10 Ma (95% HPD: 0.14-0.07 Ma), 0.17 Ma (95% HPD: 0.23-0.14 Ma), and 0.30 Ma (95% HPD: 0.37-0.24 Ma), respectively.  (Figure 3a). The number of haplogroups is different from previous analysis using complete mitochondrial genomes, which indicates only three subclades (Figures 2 and S6). When comparing phylogenetic trees among analyses of Data sets 1 and 2 (Figures 2 and 3a), we observed that the phylogenetic positions of overlapping samples were consistent. We saw that haplogroup B corresponded to subclade P1, haplogroups A and D corresponded to subclade P2, and haplogroup C corresponded to subclade P3. While considering that too few complete mitochondrial genomes may lead to incomplete details of branches, and that the rapid evolutionary rate of D-loop makes it more effective to distinguish individuals with close genetic relationships, the former two results of Data sets 1 and 2 are not contradictory.
To better visualize the phylogeographical structure of the Siberian roe deer, we mapped the geographical distribution of haplogroups (Figure 3b). Haplogroups B and C are geographically extensive, ranging from Central Europe to East Asia (across both sides To better visualize the phylogeographical structure of the Siberian roe deer, we mapped the geographical distribution of haplogroups (Figure 3b). Haplogroups B and C are geographically extensive, ranging from Central Europe to East Asia (across both sides of Lake Baikal), while haplogroups A and D were only found in the east of Lake Baikal. Populations on both sides of Lake Baikal have different haplogroup compositions. Specifically, except for the isolated population SKJ, most of the individuals in the western population belong to haplogroup B, while all haplogroups exist in the eastern population. Only one haplogroup (haplogroup B) was detected in Jeju Island, and this may be due to geographical isolation resulting in less gene flow between populations in a short period of time or processes such as founder effect and genetic drift as evidenced by reduced genetic diversity of the animal group on the island [50]. Our ancient samples CADG573, CADG580, CADG626, and CADG842 fell into haplogroup D, haplogroup C, haplogroup A, and haplogroup C, respectively (Figure 3a).
The BSPs (Figure 4) revealed changes in maternal effective population size of Siberian and European roe deer over time. The maternal effective population size of European roe deer had a major expansion during~100 to~50 thousand years ago (Kya), followed by a relatively stable population size to the present. In contrast, the Siberian roe deer experienced a population increase during~325 to~225 Kya and had a transient decline during~50 to~18 Kya.
ically, except for the isolated population SKJ, most of the individuals in the western population belong to haplogroup B, while all haplogroups exist in the eastern population. Only one haplogroup (haplogroup B) was detected in Jeju Island, and this may be due to geographical isolation resulting in less gene flow between populations in a short period of time or processes such as founder effect and genetic drift as evidenced by reduced genetic diversity of the animal group on the island [50]. Our ancient samples CADG573, CADG580, CADG626, and CADG842 fell into haplogroup D, haplogroup C, haplogroup A, and haplogroup C, respectively (Figure 3a).

Phylogeny and Phylogeography of the Roe Deer in Northeastern China
Previous molecular identifications have suggested extant Northeastern China roe deer as Siberian roe deer [48,51,52]. However, there is also debate as to whether it is a subspecies or not. Morphologically, it has been assigned to different subspecies of Siberian roe deer, i.e., C. p. bedfordi [53], C. p. tianschanicus [4], or C. p. manchuricus [48]. Further morphological evidence suggested that the subspecies C. p. bedfordi and C. p. manchuricus are synonyms of C. p. tianschanicus, which is distributed in Mongolia, Trans-Baikalia, the Russian Far East, and China [54,55]. Our maximum-likelihood and Bayesian results (Figures 2 and S6) strongly support that the sub-fossil roe deer in Northeastern China belong to the Siberian roe deer, which is in line with previous studies based on D-loop [48,51,52]. Moreover, we observed similar genetic composition in all populations to the east of Lake Baikal (Figure 3b), including Northeastern China. Therefore, it seems reasonable to define Northeast Chinese roe deer as C. p. tianschanicus.
In the phylogenetic trees inferred from Data set 1 (Figures 2 and S6) our four ancient samples, dated to between 6816-11,270 calBP, fell into two different subclades of modern Siberia roe deer, i.e., subclades P2 and P3, indicating genetic continuity for this species in Northeastern China, at least since the end of the Late Pleistocene to the present. Furthermore, we observed that subclade P2 contained both ancient (CADG573: 6816 calBP and CADG626: 9478 calBP) and modern Northeastern China roe deer samples (MN485773), and we thus speculate that subclade P2 has existed in Northeastern China throughout the early Holocene into the present. Our phylogenetic analyses using Data set 2 ( Figure 3) revealed that our four ancient samples, grouped with haplotypes from RPRA, distribute in three different haplogroups, i.e., haplogroup A, haplogroup C, and haplogroup D. These results demonstrate that haplogroup C, distributed throughout Eurasia today, was in Northeastern China since at least the Late Pleistocene, while haplogroup A and haplogroup D, only distributed in the east of Lake Baikal today, were distributed in Northeastern China since the Middle Holocene. Moreover, both phylogenetic analyses (Figures 2, 3a and S6) support an initial divergence of haplogroup D/subclade P3, which indicates that haplogroup D/subclade P3 may represent the oldest genetic lineage.
Similar to previous studies [10,[12][13][14], our geographical distribution of haplogroups (Figure 3b) showed that Siberian roe deer from Central Europe to East Asia are highly heterogeneous without clear geographical separation of maternal lineages. This feature might be derived from substantial gene flow due to the high mobility of this species [9,56], as well as high levels of connectivity among populations during the Ice Age [33]. Furthermore, we found that taking Lake Baikal as the boundary, except the isolated population SKJ, the western populations mainly belong to haplogroups B and C, while all haplogroups are found in the eastern populations, suggesting a different haplogroup composition of populations on both sides of Lake Baikal. Consistent with previous studies [10,50,57], this result supports the existence of the subspecies C. p. pygargus (to the west of Lake Baikal) and C. p. tianschanicus (to the east of Lake Baikal) and emphasizes the role of Lake Baikal as a geographic barrier to gene flow. In contrast to Lee et al. [10], we detected haplotypes of haplogroup B in SKM, probably because our data set contains more samples that make new haplogroup discovery possible. Moreover, we observed that the SKM, RSMG, and RPRA populations were distributed in all four haplogroups, suggesting high mitochondrial genetic diversity in these regions. It may indicate either a long term large effective population size or East Asia is the center of Siberian roe deer diversification. In East Asia, several vertebrate species have also been reported to possess high levels of mitochondrial DNA variations compared with individuals from surrounding areas [58]. The co-existence of cold open steppes and forests in East Asia could provide favorable conditions for a large population and thus the diversification of genetic lineages [50,59].

Divergence among Capreolus
Currently, the oldest record of Capreolus is from the Late Pliocene (MN16, 3.3-2.4 Ma), discovered in Udunga, eastern Russia [1]. Previous molecular studies based on partial mitochondrial genes have estimated the separation time of the European roe deer and the Siberian roe deer at around 4-2 Ma [2,48]. A recent study using the nuclear genomes of modern individuals of Siberian roe deer in Northeastern China and European roe deer in Germany estimated that these two species diverged 1.35-0.9 Ma [60]. Our estimation (2.25 Ma; 95% HPD: 2.63-1.87 Ma) is compatible with previous studies based on partial mitochondrial genes but is, however, earlier than the estimation using nuclear genes. We proposethat the discordance may be caused by nuclear gene flow. Moreover, according to the fossil record [61], the European roe deer was already present in Europe at least 0.6 Ma. Our estimated coalescence of the European roe deer (0.68 Ma; 95% HPD: 0.81-0.55 Ma) conforms to the fossil record. In addition, using D-loop data, Randi et al. [2] showed that the divergence among haplotypes of the Siberian roe deer might have arisen 0.37-0.19 Ma and we obtained similar coalesce at 0.34 Ma (95% HPD: 0.41-0.27 Ma). Among the Siberian roe deer, we found the coalescence events of subclades P1 and P2 occurred close in time (~0.10 Ma vs.~0.18 Ma). This comparatively rapid divergence might be explained by glacial cycles, which would result in range shifts and geographical isolation in refugia of the Siberian roe deer and further induce inter-specific diversification.

Maternal Demographic History of the Roe Deer
Using nuclear genes, de Jong et al. [60], suggested that the Siberian roe deer experienced two increases in effective population size from~2000 to~200 Kya and~140 tõ 50 Kya, and suffered a sharp decrease from~30 to~18 Kya. In our BSPs (Figure 4 With regard to the European roe deer, Randi et al. [62] identified two waves of population expansion from~244 to~122 Kya and~156 to~78 Kya, respectively, using the mismatch distributions of the D-loop. In contrast, our BSP analysis (Figure 4) based on a near-complete mitochondrial genome indicated one apparent population expansion from 100 to~50 Kya. This inconsistency may have resulted from both different datasets with different evolution rates, and different sampling schemes in which we excluded samples from locations where near complete mitochondrial sequences are not available.
For the increase in the maternal population size, both a demographic expansion and a population subdivision have been proposed as possible reasons [63]. The increase in the maternal population size of the Siberian roe deer occurred around 0.325 Ma, which is consistent with the divergence of subclade P3 (0.30 Ma; 95% HPD: 0.37-0.24 Ma) ( Figure 2 and Table S5); the increase in the maternal population size of the European roe deer appeared around 0.10 Ma, which also corresponds to the emergence of subclade C3 of the European roe deer (0.13 Ma; 95% HPD: 0.18-0.08 Ma). Therefore, we hypothesise that the increase in the maternal effective population size of both roe deer species may be the consequences of population subdivisions rather than population expansions. Moreover, global climate change and human activities likely played a key role in the observed population size shrink of the Siberian roe deer. The identified period of bottleneck (from~50 Kya to~18 Kya) coincides with the transition from the climatically unstable MIS 3 to the more stable cold period of MIS 2 around 29 Kya. Similar to other species from temperate climates, the Siberian roe deer expanded during interglacials and contracted into cryptic and isolated refugia during glacials [62,64]. Numerous archaeology discoveries indicated that this period contained frequent human activities in Eurasia [65]; thus, human activities may have had a negative impact on the population size of the Siberian roe deer.

Conclusions
In summary, our study provides real-time molecular records for tracing the divergence and demographic history within roe deer populations. We confirmed that the ancient roe deer shares a close genetic background with modern roe deer, which implies that this species maintains stable mitochondrial clades since the Late Pleistocene even though it has experienced multiple dynamics in its population size. In addition, our results also revealed the connection between climate change and the evolutionary events of roe deer. The identified period of bottleneck coincides with the transition of unstable climatic stage to more stable cold period, which suggests that the cooling periods might have resulted in extensive decrease of population size of roe deer.  Figure S3: Coverage plots of the mitochondrial genomes obtained in this study; Figure S4: Mitochondrial read length distribution of four roe deer sub-fossil specimens; Figure  S5: Mitochondrial DNA damage plots for four roe deer sub-fossil specimens. X axis represents position from 5 (left) and 3 (right) read ends. Red line corresponds to C to T substitutions and blue line corresponds to G to A substitutions; Figure S6: Maximum-likelihood phylogenetic tree of Capreolus based on complete mitochondrial genomes. A. alces was selected as outgroup. Names of four Chinese samples are shown in red. Node label represents the bootstrap value; Table S1: Complete mitochondrial genomes used in this study; Table S2: Siberian roe deer sample information for each location per region; Table S3: Substitution models for different mitochondrial DNA partitions determined using Partitionfinder 2; Table S4: Information of the specimens and mitochondrial genomes obtained in this study; Table S5: Average node ages from BEAST (Ma); Table S6: Mapping values of four Northeastern China roe deer.