Next Article in Journal
Machine Learning for Rupture Risk Prediction of Intracranial Aneurysms: Challenging the PHASES Score in Geographically Constrained Areas
Next Article in Special Issue
Trajectories Derived from Periodic Orbits around the Lagrangian Point L1 and Lunar Swing-Bys: Application in Transfers to Near-Earth Asteroids
Previous Article in Journal
Evolutionary Dynamics of Division of Labor Games for Underwater Searching Tasks
Previous Article in Special Issue
Spectral Decomposition of Mappings of Molecular Genetic Information in the System Basis of Single Nucleotide Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Comparative Analysis of Different Strains of Coronavirus Based on Genometric Mappings

by
Ivan V. Stepanyan
1,2,* and
Michail Y. Lednev
1
1
Mechanical Engineering Research Institute of the Russian Academy of Sciences (IMASH RAN), M. Kharitonyevskiy Pereulok, 101990 Moscow, Russia
2
Peoples’ Friendship University of Russia (RUDN University), 6 Miklukho-Maklaya Street, 117198 Moscow, Russia
*
Author to whom correspondence should be addressed.
Symmetry 2022, 14(5), 942; https://doi.org/10.3390/sym14050942
Submission received: 8 February 2022 / Revised: 29 March 2022 / Accepted: 27 April 2022 / Published: 5 May 2022
(This article belongs to the Special Issue Advances in Mechanics and Control)

Abstract

:
Coronaviruses are viruses that infect the respiratory system of humans. Besides high mortality rates among the population, they brought about several economic crises on a global scale. Methods. To study and identify features in the genetic composition of the nucleotide sequences of various coronaviruses, we applied copyright algorithms and visualization, which allowed us to compare the biochemical parameters of diverse RNA coronaviruses in a visual form. Results. The article provides examples of different approaches to imaging coronaviruses. We have provided examples of coronavirus RNA structure visualization in various parametric spaces (1-D and 2-D). We employed various visualization types, including structural, integral, and frequency. The research discussed methods of visualization. Our team developed visualization and comparative analysis of coronavirus serotypes and visualization of SARS-CoV-2 coronavirus datasets. Discussion followed on the visualization results. The presented techniques and the results allowed for displaying the structure of RNA sequences of coronaviruses in spaces of various dimensions. Conclusions. According to our findings, the proposed method contributes to the visualization of the genetic coding of coronaviruses. We discussed the issues of machine learning and neural network technology concerning the analysis of coronaviruses based on the presented approach. The described line of research is essential for the study and control of complex quantum mechanical systems, such as RNA or DNA.

1. Introduction

Bioinformatics development made it possible to obtain new tools for studying genetic sequences. In particular, the algorithms for visualization of the nucleotide composition of DNA molecules and RNA are developing. Research in this area has become increasingly relevant in light of the latest challenges, such as the Coronavirus pandemic.
HCVs (human coronaviruses) are large RNA viruses that infect the lung tissue of humans. The authors of the article [1] developed models and morphological cultures of the respiratory tract for the study of coronaviruses, the identification of antiviral drugs, and the assessment of viral inhibition. With tens of thousands of verified cases worldwide, SARS-CoV-2 has caused a substantial number of deaths, creating a grave threat to public health [2]. Various medical, social, and bioengineering methods have been proposed to combat SARS-CoV-2, including detection and prognosis [3,4] highlighting the significance of a chest surface plasmon resonance (SPR)-based biosensor and AI-assisted COVID-19 diagnosis. Publications from various companies around the world discuss the financial consequences of the Coronavirus/COVID-19 crisis [5,6]. According to the statistics, each additional month of the crisis costs 2.5–3 percent of the global GDP on average.
The article [7] presents the results of a phylogenetic analysis of viral genome sequences from India. In SARS-CoV-2 strains, a nonsynonymous mutation 4809C> T (S1515F) has been discovered in the NSP3 gene. The review [8] aims to analyze the genetic similarity and diversity of coronavirus strains, which could reveal SARS-sensitive CoV2′s antigenic targets. According to the research results [9], the 2019-nCoV appears to be a hybrid virus between the bat coronavirus and an undiscovered coronavirus. In terms of virulence, the immunological response of the body, and vaccine efficiency, each strain is different [10]. The focus of the research [11] is to examine epidemiological, clinical, and laboratory characteristics of COVID-19 and MERS-COV populations. SARS-CoV-2 is an RNA virus that can produce a range of strains due to mutation. The next generation of diagnostic assays should be able to detect numerous coronaviruses simultaneously, multiplexed, and with high throughput in a single response [12]. As a result, several medical diagnostic laboratories employ deep learning techniques in molecular genetics [13].
The authors [14] have proposed a machine learning-based coronavirus prediction technique, where 1000 RNA sequences of SARS-CoV-1, MERS, SARS-CoV-2, and other viruses are used to train a naive Bayes classifier. In their work [15], the authors report the current predictions from structure prediction using a machine learning-based method applying molecular dynamics simulation-based refinement.
The cited articles discuss topical issues of the diversity of coronaviruses and individual prevention measures associated with it. That is since viruses tend to mutate, and their changes require monitoring to ensure the biological protection of public health. The paper discusses a method for tracking changes in mutation dynamics and a comparative analysis of viruses.

2. Methods: An Algorithm for Genometric Visualization of Genetic Sequences

Each nitrogenous base of the genetic information has three binary representations. S.V. Petoukhov binary sub-alphabets represent a type of presentation variant. The binary sub-alphabets in the set of nitrogenous bases differ according to the kinds of boolean properties: C = T “pyrimidines”/A = G “purines”; A = C “amino”/G = T “keto”; C = G has 3 hydrogen bonds and A = T has two hydrogen bonds. Based on the boolean properties method for scale-parametric analysis of large nucleotide sequences described in [16,17,18,19,20] is as follows:
(1)
A sequence of characters encoding nitrogen bases from the set A, G, C, and T or A, G, C, and U decomposes into N equal-length segments, where N is a method parameter. N-plets refer to the associated equal-length elements.
(2)
Three binary sequences comprising 0 and 1 can describe the pattern of nitrogen bases. The method (considering 0 or 1) affects the orientations and other symmetrical changes in the resulting display.
(3)
The resulting three records of the elements are encoded in three decimal representations or other uniquely identifying functions.
There have been three basic types of genometric visualizations devised and documented hitherto, each displaying different features of nucleotide sequences [17]. The first type represents the integral type of visual representation (every point on the graph relates to the total count of ones or zeros in the binary format of the N-plet). The second one is the structural form of visual representation (each point relates to the numeric interpretation of the binary format of the N-plet). Ultimately, the third type is the frequency form of imaging (each point on the graph relating to the count of ones or zeros in the binary format of the N-plet or to the frequency of occurrence of the N-plet binary code in the entire sequence).
We can depict three binary sub-alphabets in the form of a matrix, shown in Figure 1. It is possible to divide the nucleotide sequence into equal parts in various ways: with overlap (step = 1) and without overlap (step = N). For the best quality and resolution of visualizations, we used step = 1. With this approach, the number of points in visualizations becomes N times larger, which allows for obtaining more accurate results.
Each row and each column of the Hadamard matrix is a Walsh function. Walsh’s functions are a complete set of orthogonal functions that can be used to represent any discrete function in a manner similar to using trigonometric functions in Fourier analysis. In this study, we relied on the objectively existing system of orthogonal Walsh functions, which is formed by the physicochemical properties of nucleotides (and not on the IUPAC system). These functions have been considered in detail by us in previous publications and are based on a trio of binary sub-alphabets. Any changes in the binary encoding while observing the basic Walsh functions lead to invariant transformations of the resulting mappings—rotations and reflections.
We have proposed an objective measure to reveal the most distinct fractal structure. In the formula, N is the visualization scale parameter for nucleotide sequences of length L:
N = [ log 2 ( L ) ] .
Square brackets are the operation of taking the integer part of a number. Width and height of two-dimensional square dot images:
K = 2 N .
Next, we will consider these types of imaging using the example of the SARS-CoV-2 coronavirus strain imaging in one-dimensional and two-dimensional spaces. Figure 2 illustrates the structural visualization type, where two-dimensional mapping has a symmetrical fractal structure. Figure 3 shows the integral visualization type, and Figure 4 depicts a frequency type of visualization. We employed these three types of visualization in the present work since they are fundamental and the most informative. Different imaging algorithms can produce different results depending on the analyzed genetic sequence, namely its composition and length. We set out to compare these three types of coronavirus imaging.
The biology behind the imaging is to use a three-channel view of essential traits: purine/pyrimidine, keto/amino, two/three hydrogen bonds. Each of these features forms a channel that has common two-dimensional mappings with other channels. We derived all numerical parameters in the resolution at which the most distinct fractal structure appeared.
Visualization methods based on the genetic functions of Walsh allow evaluation of the qualitative features and differences between nucleotide sequences, which become visible in the structure of the visualizations. For short viruses, these differences are less than between the DNA of different species of living organisms. As can be seen from Figure 3, for coronavirus, there is a correlation between its one-dimensional representations in purine/pyrimidine and keto/amino sub-alphabets. This correlation is a feature of this coronavirus. The detection of this feature became possible due to the application of the method proposed in this paper. Apparently, such features can be the subject of further research by biologists and virologists.
Some visualizations are shown in 1D and others in 2D to demonstrate the capabilities of the method, as well as to demonstrate various options for visualization and comparison of the nucleotide composition of viruses. Thus, our work is not only research but also methodological.
Figure 2, Figure 3 and Figure 4 show the use of different visualization methods for the same genetic sequence of a coronavirus. Simultaneously, for a clear construction, diverse methods required diverse scaling parameters N. Please note that the fragment size of N nucleotides is only one display point. There are as many such points as there are parts of N nucleotides in length in the sequence. If N is too small, then the number of combinations of N nucleotides will also be small to build a sufficiently explicit display. If N is extremely large, few points will be present, which will not lend clarity, either. A rule based on combinatorics can direct us to acceptable clarity. According to this rule, N represents the integer part of the base two logarithms of the square root of the length of the nucleotide sequence. Furthermore, to improve the clarity of the mappings, we used fragments of N nucleotides in length as intersecting fragments with the minimum possible overlap of one nucleotide. We practiced these methods to build possibly the most explicit mappings of such short sequences as the RNA of coronaviruses.

3. Results

3.1. Visualization and Comparative Analysis of Coronavirus Serotypes

To carry out genometric visualization, the authors used the genetic data presented in [6]. Table 1 shows the seven most common strains of the human coronavirus [1].
The results presented in Figure 5 are well distinguishable for each data instance. As seen in Figure 5, the most qualitative type of visualization for the visual distinction of nucleotide sequences is one-dimensional integral. The black lines highlight the COVID-19 (SARS-CoV-2) coronavirus strain, and the gray lines indicate the remaining strains from Table 1.
A 2D structural imaging characterizes more generalized differences in nucleotide sequences. Figure 6 illustrates the results of this type of visualization. Based on the comparative analysis of the structural type of visualization, we can conclude that the results can be divided into two groups:
(1)
HCoV-229E, HCoV-NL63, HCoV-HKU1, HCoV-OC43
(2)
MERS-CoV, SARS-CoV, SARS-CoV-2.
Figure 6 reveals certain differences in the structure of coronaviruses. In this instance, it is possible to distinguish three main groups. Viruses HcoV-NL63, HcoV-HKU1, HcoV-OC43 have a similar structure, almost invisible to the eye in two-dimensional representations, forming the first group among themselves. MERS-CoV, SARS-CoV, and SARS-CoV-2 viruses have a more pronounced fractal structuring, which is similar to all. Here, a higher concentration of dots (polynucleotides 1024 nucleotides long) is observed. The HcoV-229E virus is an intermediate variant between these two groups, closer to the first group.
The secondary structure of RNA is determined by the formation of a double helix. Unlike DNA, these helices are relatively short, typically shorter than 8–10 base pairs in length, and are interrupted by single-stranded nucleotides and branches. The method we used does not allow visualization of such short and branching structures since it is intended for visualization of sufficiently long nucleotide sequences. Therefore, we did not consider secondary structures in this study.
In this study, the task was to build visual representations of RNA chains of various coronaviruses for their visual representation and comparative analysis. The method used by the authors made it possible to construct linear mappings along the nucleotide chain, where features and heterogeneity in the RNA nucleotide composition are visible. Two-dimensional mappings were also obtained, in which RNA’s are represented as square matrices with fractal properties and symmetries. The fractal properties of these matrices are a consequence of the mathematics behind the Walsh functions, responsible for the noise immunity of genetic information. This article demonstrated the possibility of applying new methods to the analysis of coronavirus RNA.

3.2. A Visualization of SARS-CoV-2 Coronavirus Datasets

To perform genometric visualization of the SARS-CoV-2 coronavirus datasets, the authors used the US NCBI dataset published at https://www.ncbi.nlm.nih.gov/datasets/coronavirus/genomes/ (accessed on 20 January 2022). This dataset contains about half a million (401,330) SARS-CoV-2 coronavirus samples.
Imaging did not involve any genetic sequence manipulation (alignment, etc.). In Figure 7, each of the three plots corresponding to a set of nitrogenous bases with three forms of binary oppositional features shows all the instances represented in the dataset. In Figure 7, we can see that despite the absence of any preparatory steps before genometric imaging, all coronavirus specimens presented in the dataset possess a similar structure in all (three) types of boolean properties. Figure 8 shows the result of frequency imaging of the same dataset. Accordingly, we can conclude that this type of visualization has a considerable overlap of results. Here, we compare the SARS-CoV-2 with other familiar strains.
In general, the genometric approach as a variant of visualizing the nucleotide composition of DNA/RNA has proved its effectiveness in solving various practical problems for visualizing the nucleotide composition of the coronavirus RNA in various parameter spaces. The RNA length of coronaviruses was found to be sufficient for the manifestation of the fractal structure in two-dimensional visualizations (Figure 6). It is worth noting that such a property as the manifestation of a fractal structure on two-dimensional genometrics is uncharacteristic of all viruses and depends on the length of the RNA of the virus. As a rule, the longer the RNA of the virus is, the more clearly we trace the structure in the parametric visualization space.
Summarizing the above, as a result of the calculated experiments, we can derive several conclusions. To divide the results into groups, it is better to use structural visualization. For a more qualitative difference in the results, it is more efficient to use integral visualization. On structural and integral visualizations, neural networks can be trained for fast recognition. Frequency imaging is unsuitable for comparative analysis on large datasets.

4. Conclusions

The article examined various types of imaging and carried out a comparative analysis of various strains of the coronavirus. Various variants of one of the coronavirus strains were also examined and visualized. The research results allow us to conclude that some visualization options have shown their clarity and effectiveness in displaying the nucleotide composition of coronaviruses.
The application of a new method for visualization and comparative analysis of coronaviruses RNA is shown. The scaling parameter was chosen based on a mathematical formula that allows choosing the optimal display scale.
The critical biological concept of the study is that any molecular genetic sequence of nucleotides, including the RNA of viruses, has three binary-opposite features. Simultaneously, one-to-one functional-geometric manifestations exhibit in various spaces. In this paper, we considered solely one- and two-dimensional images.
The scientific value of the proposed results lies in the visual representation of the molecular structure of such a complex and complex object as coronavirus RNA. This allows us to visualize and compare diverse types of coronaviruses in their easy-to-perceive parametric space. Our earlier articles [16,17,18,19,20] describe in detail all methods of constructing visualizations.
At any rate, the problem of coronaviruses remains relevant. The number of severe complications and deaths from coronaviruses is on the rise. Moreover, the virus mutates. It has already caused serious economic and social difficulties, showing some systemic issues and imperfections of the health care system in several cases. In this regard, the relevance of studies to identify and compare the internal structure of various strains of coronaviruses is beyond doubt.
In this respect, the authors draw attention to the use of genometric imaging methods for research in the field of virology. The results obtained are apparent and demonstrate the possibility of using parametric visualization algorithms. There is a need to note that the presented visualization algorithms and the results of their application to identify the parametric structure of coronaviruses seem to be very promising, not only for human perception but also for machine learning methods and neural network algorithms. This is due to the ergonomics of the perception of genetic information, which receives a more structured look to improve the quality of the work of neural network algorithms and other analytical methods.
It should be noted that the results of this study correlate with the results of work [21] devoted to visualization and analysis of coronaviruses based on machine learning technology. However, our research is focused on algorithms for the visual processing of genetic information for the comparative analysis of viruses, bypassing traditional machine learning methods.
The possibility and adequacy of new visualization tools in bioinformatics shown in this work will contribute to an in-depth study of the topic of pandemics and the involvement of various specialists in the analysis of genetic data, which, in turn, will contribute to solving urgent problems of bioinformatics and healthcare.
Compliance with Ethical Standards: this article does not include human or animal research by any of the authors. Ivan V. Stepanyan declares that he does not have any conflicts of interest. Michail Y. Lednev declares that he does not have any conflicts of interest.

Author Contributions

I.V.S. and M.Y.L. have contributed equally to the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This paper has been supported by the RUDN University Strategic Academic Leadership Program.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jónsdóttir, H.R.; Dijkman, R. Coronaviruses and the human airway: A universal system for virus-host interaction studies. Virol. J. 2016, 13, 24. [Google Scholar] [CrossRef] [Green Version]
  2. Li, H.; Liu, S.M.; Yu, X.H.; Tang, S.L.; Tang, C.K. Coronavirus disease 2019 (COVID-19): Current status and future perspectives. Int. J. Antimicrob. Agents 2020, 55, 105951. [Google Scholar] [CrossRef] [PubMed]
  3. Rahmani, A.M.; Mirmahaleh, S.Y.H. Coronavirus disease (COVID-19) prevention and treatment methods and effective parameters: A systematic literature review. Sustain. Cities Soc. 2020, 64, 102568. [Google Scholar] [CrossRef] [PubMed]
  4. Cui, F.; Zhou, H.S. Diagnostic methods and potential portable biosensors for coronavirus disease 2019. Biosens. Bioelectron. 2020, 165, 112349. [Google Scholar] [CrossRef] [PubMed]
  5. Fernandes, N. Economic Effects of Coronavirus Outbreak (COVID-19) on the World Economy. Available online: https://ssrn.com/abstract=3557504 (accessed on 20 January 2022).
  6. Brodeur, A.; Gray, D.; Islam, A.; Bhuiyan, S. A literature review of the economics of COVID-19. J. Econ. Surv. 2021, 35, 1007–1044. [Google Scholar] [CrossRef] [PubMed]
  7. Joshi, A.; Paul, S. Phylogenetic analysis of the novel coronavirus reveals important variants in Indian strains. BioRxiv 2020. [Google Scholar] [CrossRef] [Green Version]
  8. Kaur, N.; Singh, R.; Dar, Z.; Bijarnia, R.K.; Dhingra, N.; Kaur, T. Genetic comparison among various coronavirus strains for the identification of potential vaccine targets of SARS-CoV-2. Infect. Genet. Evol. 2021, 89, 104490. [Google Scholar] [CrossRef] [PubMed]
  9. Ji, W.; Wang, W.; Zhao, X.; Zai, J.; Li, X. Cross-species transmission of the newly identified coronavirus 2019-nCoV. J. Med. Virol. 2020, 92, 433–440. [Google Scholar] [CrossRef] [PubMed]
  10. Roy, B.; Dhillon, J.K.; Habib, N.; Pugazhandhi, B. Global variants of COVID-19: Current understanding. J. Biomed. Sci. 2021, 8, 8–11. [Google Scholar] [CrossRef]
  11. Xu, P.; Sun, G.D.; Li, Z.Z. Clinical characteristics of two human to human transmitted coronaviruses: Corona virus disease 2019 versus middle east respiratory syndrome coronavirus. MedRxiv 2020. [Google Scholar] [CrossRef] [Green Version]
  12. Hemida, M.G. The next-generation coronavirus diagnostic techniques with particular emphasis on the SARS-CoV-2. J. Med. Virol. 2021, 93, 4219. [Google Scholar] [CrossRef] [PubMed]
  13. Duś-Ilnicka, I.; Szymczak, A.; Małodobra-Mazur, M.; Tokarski, M. Role of laboratory medicine in SARS-CoV-2 diagnostics. Lessons learned from a pandemic. Healthcare 2021, 9, 915. [Google Scholar] [CrossRef] [PubMed]
  14. Sarkar, J.P.; Saha, I.; Seal, A.; Maity, D. COVID-predictor: RNA sequence based prediction of coronavirus. Researchsquare 2021, 9, 708224. [Google Scholar]
  15. Heo, L.; Feig, M. Modeling of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) proteins by machine learning and physics-based refinement. BioRxiv 2020. [Google Scholar] [CrossRef] [Green Version]
  16. Stepanyan, I.V.; Petoukhov, S.V. The matrix method of representation, analysis and classification of long genetic sequences. Information 2017, 8, 12. [Google Scholar] [CrossRef] [Green Version]
  17. Stepanyan, I.V.; Lednev, M.Y. Overview of possibilities of genometric information systems. IOP Conf. Ser. Mater. Sci. Eng. 2021, 1129, 012047. [Google Scholar] [CrossRef]
  18. Stepanyan, I.V. The genometrical concept. Symmetry Cult. Sci. 2021, 32, 269–272. [Google Scholar] [CrossRef]
  19. Stepanyan, I.V. DNA clustering algorithms. Autom. Doc. Math. Linguist. 2021, 55, 1–7. [Google Scholar] [CrossRef]
  20. Stepanyan, I.V. A multiscale model of nucleic acid imaging. Sci. Vis. 2020, 12, 61–78. [Google Scholar] [CrossRef]
  21. Hassan, A.H.M.; Qasem, A.A.M.; Abdalla, W.F.M.; Elhassan, O.H. Visualization & prediction of COVID-19 future outbreak by using machine learning. Int. J. Inf. Technol. Comput. Sci. 2021, 13, 16–32. [Google Scholar]
Figure 1. (Left): system of three sub-alphabets as rows with white (true) and black (false) cells in a matrix (a). (Right): two ways of applying parameter N with steps equal to 1 and to N.
Figure 1. (Left): system of three sub-alphabets as rows with white (true) and black (false) cells in a matrix (a). (Right): two ways of applying parameter N with steps equal to 1 and to N.
Symmetry 14 00942 g001
Figure 2. One-dimensional (left) and two-dimensional (right) structural mappings at N = 16 of the nucleotide composition of SARS-CoV-2. 1D sets (left top and bottom): y–from 0 to 216 decimal values; x–from 1 to 29,888 N-plets. The 2D set (right): x and y are from 0 to 216 decimal values.
Figure 2. One-dimensional (left) and two-dimensional (right) structural mappings at N = 16 of the nucleotide composition of SARS-CoV-2. 1D sets (left top and bottom): y–from 0 to 216 decimal values; x–from 1 to 29,888 N-plets. The 2D set (right): x and y are from 0 to 216 decimal values.
Symmetry 14 00942 g002
Figure 3. One- and two-dimensional integral imaging of the nucleotide composition of SARS-CoV-2 N = 512. 1D sets: x—from 1 to 29,888 N-plets, (left top) y—from 207 to 300 values of integral characteristic, (left bottom) y—from 185 to 306 values of the integral characteristic; 2D set (right): x—from 207 to 300 N-plets, y—from 185 to 306 N-plets.
Figure 3. One- and two-dimensional integral imaging of the nucleotide composition of SARS-CoV-2 N = 512. 1D sets: x—from 1 to 29,888 N-plets, (left top) y—from 207 to 300 values of integral characteristic, (left bottom) y—from 185 to 306 values of the integral characteristic; 2D set (right): x—from 207 to 300 N-plets, y—from 185 to 306 N-plets.
Symmetry 14 00942 g003
Figure 4. One- and two-dimensional frequency imaging at N = 10 of the nucleotide composition of SARS-CoV-2. 1D sets (left top and bottom): x–from 0 to 29894 N-plets; y–from 12 to 62 times (top), and from 6 to 94 times (bottom). The 2D set (right): x is from 12 to 62 times, and y is from 6 to 94 times.
Figure 4. One- and two-dimensional frequency imaging at N = 10 of the nucleotide composition of SARS-CoV-2. 1D sets (left top and bottom): x–from 0 to 29894 N-plets; y–from 12 to 62 times (top), and from 6 to 94 times (bottom). The 2D set (right): x is from 12 to 62 times, and y is from 6 to 94 times.
Symmetry 14 00942 g004
Figure 5. One-dimensional integral mappings of the nucleotide structure of coronaviruses genomics data (N = 1024). For each graph, x–from 0 to 30,741 N-plets, y–from 0 to 1024 values.
Figure 5. One-dimensional integral mappings of the nucleotide structure of coronaviruses genomics data (N = 1024). For each graph, x–from 0 to 30,741 N-plets, y–from 0 to 1024 values.
Symmetry 14 00942 g005
Figure 6. Two-dimensional imaging of the nucleotide structure of coronaviruses genomics data (N = 8). X and Y have 28 = 256 pixels, the header includes parameter values for each coordinate.
Figure 6. Two-dimensional imaging of the nucleotide structure of coronaviruses genomics data (N = 8). X and Y have 28 = 256 pixels, the header includes parameter values for each coordinate.
Symmetry 14 00942 g006
Figure 7. One-dimensional integral mappings of the nucleotide structure of coronaviruses genomics data (N = 1024; 401,330 samples of SARS-CoV-2 from different people); y–from 1 to 1024 values, x–from 1 to 31,500 N-plets.
Figure 7. One-dimensional integral mappings of the nucleotide structure of coronaviruses genomics data (N = 1024; 401,330 samples of SARS-CoV-2 from different people); y–from 1 to 1024 values, x–from 1 to 31,500 N-plets.
Symmetry 14 00942 g007
Figure 8. One-dimensional frequency mappings of the nucleotide structure of coronaviruses genomics data (N = 9; 401,330 samples of SARS-CoV-2 from different people); y–from 1 to 31,500 N-plets, x–from 1 to 250 times.
Figure 8. One-dimensional frequency mappings of the nucleotide structure of coronaviruses genomics data (N = 9; 401,330 samples of SARS-CoV-2 from different people); y–from 1 to 31,500 N-plets, x–from 1 to 250 times.
Symmetry 14 00942 g008
Table 1. Common strains of the human coronavirus.
Table 1. Common strains of the human coronavirus.
IndexAbbreviationCoronavirus GenomeLength
NC_006577HCoV-HKU1Human (HKU1)29,926
NC_005831HCoV-NL63Human (NL63)27,553
NC_002645HCoV-229EHuman (229E)27,317
NC_006213HCoV-OC43Human (OC43)30,741
NC_004718SARS-CoVSevere acute respiratory syndrome29,751
NC_045512SARS-CoV-2Severe acute respiratory syndrome-229,903
NC_019843MERS-CoVMiddle East respiratory syndrome30,119
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Stepanyan, I.V.; Lednev, M.Y. A Comparative Analysis of Different Strains of Coronavirus Based on Genometric Mappings. Symmetry 2022, 14, 942. https://doi.org/10.3390/sym14050942

AMA Style

Stepanyan IV, Lednev MY. A Comparative Analysis of Different Strains of Coronavirus Based on Genometric Mappings. Symmetry. 2022; 14(5):942. https://doi.org/10.3390/sym14050942

Chicago/Turabian Style

Stepanyan, Ivan V., and Michail Y. Lednev. 2022. "A Comparative Analysis of Different Strains of Coronavirus Based on Genometric Mappings" Symmetry 14, no. 5: 942. https://doi.org/10.3390/sym14050942

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop