Complete Genome Sequence, Molecular Characterization and Phylogenetic Relationships of a Temminck’s Stint Calicivirus: Evidence for a New Genus within Caliciviridae Family

Caliciviridae is a family of viral pathogens that naturally infects vertebrates, including humans, and causes a range of highly contagious infectious diseases. Caliciviruses are not well studied because of the lack of a universal approach to their cultivation; however, the development of molecular genetics and bioinformatics methods can shed light on their genetic architecture and evolutionary relationships. Here, we present and characterize the complete genome sequence of calicivirus isolated from a sandpiper—Temminck’s stint (Calidris temminckii), preliminarily named Temminck’s stint calicivirus (TsCV). Its genome is a linear, non-segmented, single-stranded (+sense) RNA with genome organization typical of avian caliciviruses. Comparative studies have shown significant divergence of the nucleotide sequence of the TsCV genome, as well as the amino acid sequence of the major capsid protein from all publicly available genomic and protein sequences, with the highest genome sequence similarity to unclassified Ruddy turnstone calicivirus A (43.68%) and the lowest pairwise divergence of the major capsid protein with unclassified goose calicivirus (57.44%). Phylogenetic analysis, as well as a comparative analysis of the homologous proteins, showed evidence of another separate genus within the Caliciviridae family—previously proposed, but not yet accepted by International Committee on Taxonomy of Viruses (ICTV)—the Sanovirus genus, which combines seven previously unclassified genomic sequences of avian caliciviruses, including the newly discovered TsCV, which we propose to consider as a separate species.


Introduction
Caliciviruses are small non-enveloped pathogens with a single-stranded RNA genome varying from 6.4 to 8.5 kb in length belonging to the Caliciviridae family [1]. These viruses are known to infect mammals, birds and fish [2,3]. According to the current International Committee on Taxonomy of Viruses (ICTV) report, 11 genera are currently accepted within the Caliciviridae family-Lagovirus, Norovirus, Nebovirus, Recovirus, Sapovirus, Valovirus, Vesivirus, Bavovirus, Nacovirus, Minovirus and Salovirus [4], with each genus including one to two species [5]. The taxonomic classification of Caliciviridae is based on the protein sequence of the major capsid protein (VP1) with isolates with less than 60% sequence identity being assigned to different genera [4]. However, a number of calicivirus isolates currently remain unclassified and several new genera are proposed, including Sanovirus [6] and Secalivirus [7].
The genomic RNA of caliciviruses is organized in one of two ways. In the genomes of representatives of Norovirus, Recovirus and Vesivirus genera, three open reading frames

Sampling
The sample under the study was collected in 2017 on the banks of the Yenisei River in Krasnoyarsk Region (Russia, Siberia) [30] and belongs to the Temminck's stint (Calidris temminckii), a small-sized shorebird of the Sandpipers family (Scolopacidae). The biological sample was collected without direct contact with animals; no invasive interventions on animals were carried out. The birds have been observed and identified by qualified zoologists at close distance from a camouflaged hideout. Bird droppings were collected immediately after discharge, taking only the surface part of the fecal pellet to avoid contamination.
To ensure the preservation of the nucleic acids of viral pathogens, fecal samples were placed in sterile tubes containing the transport medium (reagent for transportation and storage of clinical material, Amplisens, Moscow, Russia). After transportation to the laboratory, the sample was stored in a low-temperature refrigerator.

Sample Preparation and Sequencing
For the extraction of nucleic acids, an Allprep DNA/RNA mini kit (Qiagen, Hilden, Germany) was used. All manipulations were carried out in accordance with the manufacturer's instructions. Preliminary screening for avian viral pathogens was carried out as described earlier [30].
Library preparation for high-throughput sequencing was performed with a NEBNext Ultra II RNA Library Prep Kit (New England Biolabs, Ipswich, MA, USA) following the manufacturer's recommendations for partially degraded samples. No additional steps involving depletion and any kind of enrichment were used. Sequencing with MiSeq Reagent Kit v2 (500-cycles) (Illumina, San Diego, CA, USA) on the Illumina MiSeq platform (Illumina, San Diego, CA, USA) resulted in 1.276 M paired-end reads per the sample under the study.

Assembly and Genome Annotation
SPAdes software v.3.15.3 [31] (CAB SPbU, St. Petersburg, Russia) was used for de novo metagenomic assembly. The main script with standard parameters was used, except for the activation of the "careful" option. The resulting contigs were used for taxonomic classification by nucleotide and translated protein sequences using the BLAST algorithm [32] with Nr/Nt and the NCBI Taxonomy databases [33]. The taxonomic classification of the host was further confirmed by the analysis of contigs related to eukaryotes (Table A1). Contigs attributed to the Caliciviridae family were used to obtain draft whole-genome assembly using the SeqMan NGen program (DNASTAR, Madison, WI, USA).
Additionally, the original raw reads were mapped to the draft whole-genome assembly to perform errors correction. BWA v.0.7.17 [34] was used for used for mapping and Samtools package v.1.10 [35] for operations with sam/bam files. Assembly check and correction was performed using Tablet program v. 1.19.09.03 [36]. The quality and integrity of the 3 end of the assembly was assessed manually.
An NCBI open reading frame finder [37] was used to annotate the open reading frames (ORFs). The following search parameters were used: minimal ORF length 150 amino acids, genetic code 1 and start codon "ATG only". Protein-coding genes were identified by analyzing homologous protein sequences using the BLAST algorithm [32] with translated nucleotide search. The domain enhanced look-up time accelerated BLAST (DELTA-BLAST) algorithm [38] was used to detect highly distant protein homologues in the absence of significant hits in the standard blast search. Visualization of the annotated sequence and search for characteristic conserved protein motifs were carried out using the SnapGen Viewer software [39] (Dotmatics, Boston, MA, USA).

Comparative Analysis
The annotated genome assemblies used in the comparative analyses were retrieved from the GenBank database [40] [date of access: 15 May 2022].
The pairwise alignments of whole-genome sequences, and amino acid sequences of annotated major capsid proteins of representative genomes of each Caliciviridae genus (Table A2), unclassified Caliciviridae representatives (Table A3) and TsCV were constructed using MAFFT software [41] for every possible pair of genomes and protein sequences. Pairwise identity for each nucleotide alignment was calculated using DistanceCalculator from Bio.Phylo.TreeConstruction module of BioPython [42] and 'identity' model for calculation of nucleotide divergence. Evolutionary divergence between VP1 amino acid sequences was estimated using MEGA11 software [43] with a frequencies model with a gamma distribution of variation including invariant sites, as described in [4].

Phylogenetic Analysis
To build a phylogenetic tree of representatives of the Caliciviridae family, including TsCV, we used the amino acid sequences of annotated major capsid proteins of the above set of genomes (Tables A2 and A3). The multiple sequences alignment was performed using MAFFT [41]. The maximum likelihood unrooted tree was generated using RAxML-NG v.1.0.2 according to the recommendations of the ICTV [4] with the only improvement being the choice of an evolutionary model, which was determined using PartitionFinder v.2.1.1 [44] under the corrected Akaike (AICc) and the Bayesian (BIC) information criteria.
To build a high-resolution phylogenetic tree of TsCV and its closest relatives, we used the nucleotide sequences of the complete genomes of TsCV and a set of the most closely related members of the Caliciviridae family according to the criterion of percent sequence identity (Table A3). This list includes representatives of the genera Bavovirus and Nacovirus, as well as unclassified caliciviruses. The multiple sequences alignment was performed using MAFFT [45]. To eliminate poorly aligned and diverged regions, Gblocks v.0.91b [46] was used with the default parameters. The analysis of possible recombination events was performed using the GARD program implemented in the HyPhy software v.2.5.40 [47]. The maximum likelihood unrooted tree was generated using RAxML-NG v.1.0.2 [48] with GTR + I + G as the most parameter-rich model [49,50]. Bootstrapping converged after 300 replicates. The trees were visualized and rooted in midpoint using iTOL v.6 [51].

Species Demarcation
We used three approaches that propose de novo species partitions to confirm the species status of the TsCV virus as described previously in [52]: the GMYC [53], bPTP [54] and ASAP [55] methods. Calculations have been carried out for the ICTV set of representative genomes of the Caliciviridae family (Table A2), supplemented with genome sequences of unclassified caliciviruses that are most closely related to TsCV (Table A3). We used the single-threshold version of the GMYC method, since the multiple-threshold version tends to overestimate the number of species partitions [53]. The ultrametric timetree as input tree for GMYC analysis was obtained by applying the RelTime method [56,57] implemented in the MEGA-11 software v.11.0.11 [58] using the appropriate evolutionary model. bPTP analysis was run with default parameters using 500,000 MCMC generations. A matrix of patristic distances as input matrix for ASAP analysis was obtained using the cophenetic.phylo function implemented in the R package ape [59].

Protein 3D Structure Prediction
To predict 3D structures of TsCV proteins we used a machine learning approach, AlphaFold2, which is able to predict protein structures with an accuracy close to experimental [60]. To build a multiple alignment, we searched for homologues in the following databases: Uniref90 [61], Mgnify [62], BFD [63], UniClust30 [64] and pdb70 [65]. We obtained five relaxed models and five unrelaxed models for each protein ranging by perresidue confidence score (pLDDT). For each protein, the model with the best pLDDT score was chosen for subsequent analysis. Visualization for all individual proteins and structures was performed using UCSF Chimera [66]. Comparison of 3D structures was carried out using the "match maker" function of the UCSF Chimera [66].

Annotation of TsCV Genome and Comparative Analysis
The genome of Temminck's stint calicivirus (TsCV) is a linear, non-segmented, singlestranded positive-sense RNA, comprised of 8575 bases with an average G + C content of 51.73% (Figure 1). The TsCV genome showed significant divergence from all publicly available genomic sequences with the highest similarity to unclassified Ruddy turnstone calicivirus A (MH453861.1, 43.68%) [67] ( Table 1). The G + C proportion is also most similar to the nucleotide composition of the Ruddy turnstone calicivirus A genome. Open reading frames (ORFs) prediction showed genome structure typical of avian caliciviruses-TsCV coding regions are organized into two ORFs: ORF1 of 6849 bases and short ORF2 of 711 bases separated by 1 nucleotide frameshift ( Figure 1). However, the relative position of open reading frames differs from its closest relatives-in the TsCv genome, the first and second open reading frames do not overlap, while for the genomes of its closest relatives the overlap is from 18 to 74 nucleotides, according to their annotation (Table 1). In addition, the length of the nucleotide sequence of the ORF1 is noticeably shorter, whereas the non-transcribed regions and the ORF2 sequence show the average lengths.  (Figure 2). The prediction of cleavage sites was based on the alignment of the amino acid sequence of the polyprotein of TsCV and goose calicivirus (KY399947, [6])-the one fully annotated sequence of the closest relatives to date, caliciviral 3C-like protease cleavage sites preferences [69] and average weights of mature proteins. The cleavage sites of the polyprotein were predicted to be: E 370 /G, Q 811 /N, Q 971 /G, E 1047 /G and E 1736 /S. Based on the indicated cleavage sites, the molecular weights of mature proteins were predicted to be 40.7 kDa for Nterm protein (370 aa), 48.5 kDa for NTPase (441 aa), 18 kDa for NS3 protein (160 aa), 8.3 kDa for VPg protein (76 aa), 75.8 kDa for Pro-Pol (689 aa) and 58.2 kDa for major capsid protein (546 aa) ( Figure 2).

Taxonomic Classification of TsCV by ICTV Criteria
According to the International Committee on Taxonomy of Viruses (ICTV), in the Caliciviridae family, the amino acid sequence of the major capsid protein (VP1) is used for taxonomic classification wherein the criterion for species demarcation is the divergence of the VP1 amino acid sequence of more than 60% [4].
We calculated the divergence of the VP1 amino acid sequence of TsCV and the VP1 sequences of representatives of each of the accepted genera. A set of representative sequences provided by ICTV was used for calculations (Table A2). Thus, according to the accepted criterion, TsCV cannot be assigned to any of the accepted genera of the Caliciviridae family, since all the given divergence values, with their standard deviations considered, are more than 60% ( Table 2). The VP1 sequence of the virus is closest to the representatives of the Nacovirus genus with an average value of 63%. The next in order of increasing degree of divergence is the Bavovirus genus (71%). Both of these genera include avian caliciviruses. To establish the taxonomic relationship of TsCV with currently known caliciviruses with publicly available major capsid protein sequences, we performed a search for homologous proteins using the BLASTp algorithm and nr database [date of access: 15 May 2022]. List of top BLAST hits is shown in Table A3. This set consisted of unclassified caliciviruses, as well as representatives of the genera Nacovirus and Bavovirus according to the specified taxonomy. From each polyprotein sequence, a VP1 protein region was isolated based on sequence annotation and/or alignment with annotated members of the Nacovirus and Bavovirus genera (listed in the Table A2). Then, identical sequences were filtered out and representative sequences of Bavovirus and Nacovirus genera were added to the analyzed set. After that, the matrix of pairwise divergence of VP1 sequences was calculated, which is presented as a heat map (Figure 3). Sequences belonging to unclassified caliciviruses are divided into two main clusters: a major one, which also includes representative sequences of Nacovirus genus, and a minor one, which consists entirely of unclassified caliciviruses, but includes the characterized goose calicivirus (KY399947.1, [6]). With this paper, the authors carried out a comparative and phylogenetic study showing that the discovered calicivirus did not belong to any of the accepted genera and proposed a new genus Sanovirus. Our results also support this assumption. Based on the given divergence values, considering the accepted criterion, it can be concluded that:

Phylogenetic Analysis
A phylogenetic analysis was performed to determine the evolutionary relationship between TsCV and other Caliciviridae members. The tree was built on the basis of the amino acid sequences of major capsid proteins in accordance with the ICTV recommendations [4], since this sequence is the gold standard for identifying the Caliciviridae family.
The topology of external nodes of the obtained phylogenetic tree (Figure 4) was strongly supported by bootstrap values. The topology of the tree was consistent with the ICTV phylogeny that is traditionally used to characterize the Caliciviridae family [4]. Genome TsCV was located within the clade containing many unclassified members of the family, as well as members of the genera Bavovirus and Nacovirus. Since all of the characterized members of the clade have been isolated from birds, the clade appears to represent a group of related bird-infecting caliciviruses. Since the topology in the clade of interest was not supported by high bootstrap values (the bootstrap value of the TsCV branch was 58), which can often be expected when constructing a phylogenetic tree of highly divergent sequences, we constructed a more accurate phylogenetic tree with a higher resolution on the basis of the whole genome nucleotide sequences of the TsCV and its closest relatives ( Figure 5). The tree, regardless of the slight differences, is mostly consistent with the topology based on the amino acid sequences of the major capsid protein. Strong bootstrap support for TsCV clustering with other proposed Sanovirus sequences supports our hypothesis.

Species Demarcation
Various single-locus approaches based on the amino-acid sequence of the major capsid protein have been used to distinguish species. To create species partitions using paired patristic genetic distances, we employed the assemble species by automatic partitioning (ASAP) approach. The partition with the best ASAP score was selected. As a result of applying this method, the studied set of 96 Caliciviridae representatives was partitioned into 69 groups corresponding to different species. In addition, we applied the bPTP web interface, which uses the phylogenetic species concept to delimit species. Using both maximum likelihood and Bayesian approaches, 73 and 68 species partitions were identified, respectively. In both cases, the TsCV formed an independent operational taxonomic unit. Finally, using the GMYC method, 73 species groups with a single-threshold approach were identified (Table A4 contains complete data for the family Caliciviridae).
The result shows that the TsCV genome is not partitioned with other genomes when using any of the listed models. Thus, we have shown the TsCV does not belong to a previously sequenced species of the Caliciviridae family. We propose assigning the TsCV virus to a new, previously undescribed species, preliminarily named Temminck's stint calicivirus.
All three methods we used unanimously attributed the following groups of caliciviruses to common species: In addition, according to the species delimitation criteria used in this study, the following pairs of viruses should also be considered as the one species, since their major capsid protein sequences were exactly identical:

Protein 3D-Structure Prediction
We predicted the 3D structure of the following TsCV proteins: the VP1 protein, the proteinase-polymerase precursor protein (Pro-Pol) and the core domain of the VPg protein. All predictions were made using the machine learning approach AlphaFold 2 [60] based on the primary sequence of each protein. Of the five models generated for each protein, the one with the highest per-residue confidence score (pLDDT) was selected ( Figures 6A, 7A and 8A).
The predicted VP1 protein has a structure typical of the major capsid protein of caliciviruses and consists of short N-terminal arm and two main domains, shell (S) and protruding (P), linked by a flexible hinge ( Figure 6A). P domains divided into two subdomains, P1 and P2. S domain of TsCV VP1 has a classical for viral capsids structure of eight-stranded anti-parallel β sandwich with two well-defined α-helices and shows high structural similarity to other caliciviruses (on the example of known crystal structures of VP1 of feline calicivirus [17] and Norwalk virus [70]). The P1 domain, which shows a high correspondence of the spatial arrangement of structural elements with the feline calicivirus, differs from it by the presence of an additional short alpha helix and two extra β strands. The P2 domain, which includes the host-specific receptor binding site and major immunodominant epitopes [17], folds into β-barrel-like structure, but in comparison with Norwalk virus has two extra C-terminal β-strands. The predicted structure of the Pro-Pol complex contains proteinase and polymerase pro-domains ( Figure 7A). Domain I of TsCV structurally resembles Norwalk virus [71], but misses two α-helices: one at the C-terminus end and in the proximal part of the domain. Domain II folds into a β-barrel-like structure, similar to Norwalk virus, but misses a β-strand and a short α-helix in the proximal part of the domain. The polymerase pro-domain shows high structural similarity to that of both rabbit hemorrhagic disease virus (RHDV) [72] and Norwalk virus [73], typical for three-dimensional structures of most other polynucleotide polymerases. The N-terminal region of TsCV contains two additional short β-strands compared to Norwalk virus (or one, compared to RHDV) and misses a short α-helix present in RHDV. Several additional short β-strands compared to both RHDV and Norwalk virus are also found in the fingers domain. An additional α-helix in the C-terminal part of the thumb domain was predicted in the TsCV polymerase pro-domain.   The core domain of the predicted 3D structure of VPg protein adopts a helical structure with N-terminus and C-terminus regions at two separate ends of the domain. The structure of the TsCV VPg core domain is highly similar to feline calicivirus [74] and porcine sapovirus [75] VPg.
For the VP2 protein of TsCV, we were unable to obtain a 3D structure with acceptable pLDDT scores (the highest pLDDT score was 43.09); however, according to all the models obtained, the core part of the VP2 protein of TsCV consists of three consecutive alphahelices-one long and two short. The predicted order of the secondary structures was in complete agreement with the structure of the VP2 protein of the feline calicivirus obtained with cryo-electron microscopy [17].

Discussion
With the development of metagenomics, the inability to obtain a culture of microorganisms has ceased to be a problem that limits the ability to characterize the species diversity of an ecological niche and study the genomic features of its representatives. Here, we identified and characterized the complete genome of a novel Temminck's stint calicivirus (TsCV), isolated from the wild bird Calidris temminckii captured in Russia, Siberia. Using metagenomic data, we were able to obtain the complete genome sequence of TsCV, annotate its CDS, describe its proteins and model their 3D structure, and carry out taxonomic classification and phylogenetic study.
We showed that the structure of the TsCV genome corresponds to that of the accepted genera Lagovirus, Nacovirus, Nebovirus, Sapovirus and Valovirus, for which the coding part is organized into two open reading frames encoding the polyprotein and VP1. The TsCV polyprotein has all the expected proteins, arranged in an order conservative among all caliciviruses [76]. However, TsCV is genetically distant from all known caliciviruses and, in addition, has a different relative arrangement of open reading frames, indicating that this virus apparently does not use a termination-re-initiation mechanism during VP2 translation [77,78], as suggested for all caliciviruses [4].
The spatial similarity of the predicted 3D structures of TsCV proteins and the crystal structures of their homologues shows that machine-learning approaches can be successfully applied to model the caliciviral proteins and additionally confirms the CDS annotation. The ability to obtain the 3D structure of the caliciviral VP1 protein is a task of particular importance, since in caliciviruses, capsid-related functions, such as antigenicity and host specificity, are predominantly determined by its primary sequence and spatial configuration [79].
Metagenomic sequencing has provided researchers with a multitude of genomes, many of which require de novo classification. Development and improvement of bioinformatics methods and robust classification of metagenomic sequences allows a significant expansion of the formal taxonomy of viruses in the way of future studies of virus diversity [80,81]. However, it is obvious that by using this approach, most of the recommended species and genus classification criteria are inaccessible to the researcher. However, on the other hand, most of the traditional species-defining traits [82] have not been fully characterized for most caliciviruses since all of them require viral cultures [1]. Therefore, species-delimitation methods based mainly on genomic information are gaining popularity. Such methods include ANI (average nucleotide identity) and ANI-like approaches based on pairwise distances between genome nucleotide sequences [83], as well as single-locus distances methods. In the case of caliciviruses, a recognized criterion for the delimitation of genera is the percentage identity of the amino acid sequence of the major capsid protein [4], while there is no single criterion for the division of species at all. Thus, such methods, although widely used, exhibit disadvantages, among which is the need to a priori establish a threshold for taxonomic delimitation, which in some cases cannot be correctly established. The taxonomic classification of TsCV according to the criteria established by the international committee does not allow it to be placed in one of the accepted genera of the Caliciviridae family. Then we applied the ICTV genus delimitation criterion to the unclassified viruses belonging to the Caliciviridae family and showed that TsCV can be assigned to the proposed Sanovirus genus [6] together with goose calicivirus, (KY399947 [6] and MN068022), duck calicivirus (MH453811, [67]) and Caliciviridae sp. with accession numbers MT138017, MT138020 and MT138018.
The general definition developed by the Caliciviridae Study Group (CSG) for a caliciviral species was as follows: "A calicivirus species will be defined as a cluster of viruses that constitutes a major phylogenetic branch within a genus and is also distinguishable from other branch(es) by one or more of the following biologic properties: natural host range, natural cell and tissue tropism, and antigenicity" [83]. As can be seen from the obtained phylogenetic trees (Figures 4 and 5), the clade containing TsCV, which we propose to consider as a separate genus, is characterized by long branches (except for goose caliciviruses KY399947 [6] and MN068022). According to this criterion, there is no reason to believe that the TsCV virus forms a species together with other closely related published caliciviruses.
We tested this assumption using several other single-locus methods for species delimitation. In this study, to classify species, we used GMYC and bPTP coalescent-based methods that combine population genetic and phylogenetic theory to provide an objective means for the delimitation of evolutionarily significant units of diversity [53,54]. In addition to the methodologies mentioned above, the ASAP method was applied in this research.
Compared to GMYC and PTP, ASAP utilizes a phenetic approach where similar sequences are clustered in the same group/species [55]. Since the tools use different approaches, combining them improves the accuracy of the analysis. The most reliable can be considered species partitions, confirmed by several different methods [45]. The approaches we apply have the benefit of proposing de novo species divisions and requiring no a priori-defined intraspecific genetic distances. All the methods we used classified the virus TsCV as a separate novel species. All the methods also confirmed that the remaining members of this proposed genus also belong to separate species, except for goose caliciviruses KY399947 [6] and MN068022, which appear to be strains of the same species. In addition, these methods have revealed groups of caliciviruses that are very closely related and could potentially be considered as strains of the same species.
Despite the ICTV approach being traditional and widely used, we applied an additional method to obtain a more accurate phylogenetic tree with a higher resolution. For this, we applied the approach described in [52] with the only difference being that we used whole genome sequences to construct an alignment. Since the genome of the TsCV is extremely divergent from all other known representatives of the family, we used sequences belonging only to the closest relatives of the TsCV virus. This allowed us to obtain a reliable phylogenetic tree containing the representatives of Bavovirus and Nacovirus genera, as well as unclassified caliciviruses, among which we localized the new TsCV virus. The tree demonstrates some minor differences in topology, e.g., the localization of branches MK204392.1 and MT138020.1. It should be noted that the nucleotide-based topologies are based on more phylogenetic information than amino acid-based topologies. In addition, it was shown that the use of alignment-editing methods allows the obtaining of a more correct topology, although sometimes with less robust supports [46].
Several viral species known to generate major disease burdens in people and animals, such as influenza viruses, West Nile virus, and Newcastle disease virus, have natural reservoirs in birds. Migratory birds, as a result, play a crucial role in the development and spread of dangerous viruses [58]. Extensive metagenomic investigations have significantly increased our understanding of the viromes of various ecosystems in recent years, including the identification of new viruses in domestic and wild bird species [84]. Caliciviruses have previously been detected in migratory birds such as ruddy turnstones (Arenaria interpres) [18]. To our knowledge, Temminck's stint has not previously been described as a host for caliciviruses.
Temminck's stint breeds in the north of Eurasia, mainly from Scandinavia to the east to Chukotka, Anadyr and Kamchatka, with more than 93% of the population occurring in Russia [85,86]. A typical migratory bird, it winters in the tropical climates of southern Europe, Africa and South and Southeast Asia. Infections transmitted by migrating birds potentially have the ability to travel long distances. Therefore, the Temminck's stint can be a source of viral spreading in the Russia and other countries. Migratory bird virome characterization can help monitor potential infectious disease outbreaks in poultry and other animals, including humans.  Institutional Review Board Statement: Ethical review and approval were waived for this study due to the biological sample was collected without direct contact with animals; no invasive interventions on animals were carried out.

Informed Consent Statement: Not applicable.
Data Availability Statement: Temminck's stint calicivirus complete genome and annotation available in the GenBank database, accession number ON815296.

Acknowledgments:
The authors express their gratitude to Kamil Khafizov for the opportunity to work with archival material and for the supervision and administrative support during the collection of specimens and pre-screening study.

Conflicts of Interest:
The authors declare no conflict of interest.
Appendix A Table A1. List of contigs assigned to eukaryotes. Statistics are given for contigs with a length of more than 600 nucleotides and taxonomic groups with a total length of contigs of more than 2000 nucleotides.

Taxa
Number