The phylogenetic clustering of 95 SARS-CoV-2 sequences from the first 3 months of the pandemic reveals insights into the early evolution of the virus and gives first indications of how the variants are globally distributed. Variants might become a challenge in terms of diagnostics, immunology, and effectiveness of drugs. All available whole genome sequence data from the NCBI database (March 16, 2020) were phylogenetically analyzed, and gene prediction as well as analysis of selected variants were performed. Antigenic regions and the secondary protein structure were predicted for selected variants. While some clusters are presenting the same variant with 100% identical bases, other SARS-CoV-2 lineages show a beginning diversification and phylogenetic clustering due to base substitutions and deletions in the genomes. First molecular epidemiological investigations are possible with the results by adding metadata as travelling history to the presented data. The advantage of variants in source tracing can be a challenge in terms of virulence, immune response, and immunological memory. Variants of viruses often show differences in virulence or antigenicity. This must also be considered in decisions like herd immunity. Diagnostic methods might not work if the variations or deletions are in target regions for the detection of the pathogen. One base substitution was detected in a primer binding site.
This is an open access article distributed under the Creative Commons Attribution License
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited