Abstract
Aminoacyl-tRNA synthetases (aaRSs) originated from an ancestral bidirectional gene (mirror symmetry), and through the evolution of the genetic code, the twenty aaRSs exhibit a symmetrical distribution in a 6-dimensional hypercube of the Standard Genetic Code. In this work, we assume a primeval RNY code and the Extended Genetic RNA code type II, which includes codons of the types YNY, YNR, and RNR. Each of the four subsets of codons can be represented in a 4-dimensional hypercube. Altogether, these 4 subcodes constitute the 6-dimensional representation of the SGC. We identify the aaRSs symmetry groups in each of these hypercubes. We show that each of the four hypercubes contains the following sets of symmetries for the two known Classes of synthetases: RNY: dihedral group of order 4; YNY: binary group; YNR: amplified octahedral group; and RNR: binary group. We demonstrate that for each hypercube, the group of symmetries in Class 1 is the same as the group of symmetries in Class 2. The biological implications of these findings are discussed.
1. Introduction
Aminoacyl-tRNA synthetases (aaRSs) are key players in the genetic code of all living beings. AaRS attaches an amino acid to the cognate tRNA, and the aminoacyl-tRNA is then used for translation upon binding to mRNA according to the codon-anticodon interaction on the ribosome. The Standard Genetic Code (SGC) is the mapping of 61 codons or triplets to 20 canonical amino acids. There are 20 aaRSs, one for each of the 20 standard amino acids. AaRSs are divided into two mutually exclusive Classes, 1 and 2, based on their structural, functional, and evolutionary relatedness [1,2,3,4,5,6,7,8]. Each aaRS falls into either Class 1 or Class 2, except for lysyl-tRNA synthetase (LysRS), which has a representative in both classes. The correspondence between amino acids and synthetases is one-to-one; that is, it is a bijective, non-degenerate code. For every triplet, or codon, there is a synthetase associated with the amino acid it specifies. In a previous work [9], we outlined the group of symmetries of both Classes of synthetases in each of the 4-dimensional hypercubes of the so-called Extended Code of type II [10,11]. Herein, we rigorously determine the symmetries of aaRSs in the 4 subcodes RNY, YNY, YNR, and RNR. The article is organized as follows: First, a succinct description of the Table of the genetic code, highlighting the subcodes and the Classes of aaRSs, Next, the arithmetization of the SGC is introduced. Second, we provide the basic mathematical concepts needed to understand the derivation of the symmetries of the aaRSs in the 4 subcodes and in the whole SGC. In Appendix E, we present formal mathematical definitions and concepts. Third, we determine the symmetries in each of the four subcodes, where the mathematical derivations are explained in detail in a series of four Appendixes. We prove that the group of symmetries of both classes of synthetases in the hypercube is the dihedral group the group of symmetries of both Classes of synthetases in the hypercube YNY is the binary group. The group of symmetries of both classes of synthetases in the hypercube YNR is the extended octahedral group and the group of symmetries of the Classes of synthetases in the hypercube RNR is the group isomorphic to the Abelian group where the reflection is the only symmetry of both Classes 1 and 2, and I is the identity matrix.
Notably, we show that the group of symmetries in Class 1 is the same as the group of symmetries in Class 2 for each of the 4 subcodes. The 4 subcodes constitute the whole SGC, which displays a mirror symmetry of aaRSs. We also analyzed the 5-dimensional hypercubes NNY (all triplets that end in pyrimidine) and NNR (all triplets that end in purine) as obtained by the union of RNY with YNY and YNR with RNR, respectively. We remark that the latter step has been simply ignored in the evolution of the SGC. Finally, we discuss the biological implications of the results, in terms of the evolution of the SGC and in how the code went from managing the information by quadruples of 0’s and 1’s to sextuples of 0’1 and 1’s, where the distributions of aaRSs in the subcodes obeyed symmetrical groups.
1.1. Basic Mathematics of the SGC
The cartesian product being the set of the four RNA-nucleotides, C = Cytosine, U = Uracil, A = Adenine, and G = Guanine, is the set of the 64 triplets where The standard genetic code (SGC) may be seen as a surjective function being the set of the 20 known amino acids, and the stop signal, which means the instruction of finalizing the process of synthesis of a protein.
For every preimage set is the set of triplets, also called codons, that encode the amino acid, or stop signal, Since the decipherment of the genetic code [12], it is known that the number of coding triplets for any is one of the numbers 1, 2, 3, 4, or 6.
In Table 1, we show the table of the genetic code, highlighting the 4 subcodes RNY, YNY, YNR, and RNR, as well as the 2 Classes of synthetases. The table also indicates the subcodes NNY and NNR.
Table 1.
Genetic code: Triplets and amino acids in each of the 4-, 5-, and 6-dimensional hypercubes. aaRS Class 1, one red asterisk *; aaRS Class 2, two blue asterisks **. RNY triplets in gold; YNY triplets in black; YNR triplets in blue; RNR triplets in green.
1.2. Arithmetization of the Genetic Code
The bijective correspondence between the set and the set being induces a bijective correspondence between the set of the 64 triplets and the set of all the sextuples of zeros and ones.
Binary operations: In the binary set the two binary operations the so-called addition and product, module 2, are defined. Their Cayley Tables (Table 2) are:
Table 2.
The Cayley tables of sum and multiplication of the binary set .
They define in the set the algebraic structure of a field, or commutative division ring. This is the smallest possible field, with only two elements: the neutral of the addition and the neutral of the product, respectively.
The addition is extended, component-wise, to the set which, with the natural definition of the product of any scalar , by a sextuple becomes a 6-dimensional vector space over the binary field This vector space is the so-called 6-dimensional binary hypercube.
The bijection between and the set of the 64 triplets, induces in it the algebraic structure of a 6-dimensional vector space, whose canonical basis is the system: (ACC, UCC, CAC, CUC, CCA, CCU). The addition of nucleotides is shown in the following Table 3:
Table 3.
Sum module 2 of nucleotide bases.
It is seen that the nucleotide C, cytosine, is the neutral element of the group. This group is isomorphic to the de group, which is known as the Four Klein Group. It is an Abelian group of order 4, where each element has its own inverse. It is, in the Felix Klein list of finite groups, the fourth and first that is not cyclic.
Mutations in the triplets of the genetic code: A mutation in a triplet is the substitution of any of its components by a nucleotide. For example, GUC→GAC, where U is replaced by A, Obviously, a mutation in a triplet may produce a change in the amino acid it encodes. In our example, valine, which is encoded by GUC, is converted into aspartic acid, encoded by GAC. Algebraically, a mutation in a triplet is performed by the addition of triplets to the canonical basis. In our example, the mutation GUC→GAC is obtained by the addition of the triplet CGC = CAC + CUC.
Classification of the nucleotides
There is a partition of the set being pyrimidines and purines. The condition of being a pyrimidine or a purine is called the chemical type of a nucleotide.
Transitions and transversion: A mutation is called a transition if it does not change the chemical type, and it is called a transversion if the chemical type is changed. It is easy to notice that transitions are produced by the addition of pyrimidines and transversions by the addition of purines.
The primaeval genetic code. It is assumed that there was, in the RNA world, a primitive, or primaeval, genetic code of only sixteen triplets and eight amino acids: the set where is a purine, is any of them, and is a pyrimidine. It is a 4-dimensional hypercube, a translation of the 4-dimensional subspace derived from the transversion in the first nucleotide. By transversions in the first or third component, the hypercubes and are derived from the primaeval These four sets are pairwise disjoint, and they cover the whole set. Hence, the set of sets is a partition of The hypercube determines, with the addition a subgroup of the additive group , the other and its cosets. With the addition and the product of scalars by vectors, a 4-dimensional vector subspace is determined, as are its affine subspaces, that is, translations of it, namely:
1.3. The Classes of Synthetases
Synthetases are enzymes that regulate the selection of amino acids that will be charged to tRNA molecules. A little more than 20 aaRSs are found in modern organisms. They are classified into two groups, Class 1 and Class 2, each having three subclasses (a, b, and c) based on similarity in sequences and structures [13,14]. The classification is shown in Table 4:
Table 4.
Classes and subclasses of aminoacyl tRNA synthetases.
In general, aaRS consists of a catalytic domain, an anticodon-binding domain, and often an editing domain. Each class harbors class-specific characteristic motifs and structural topology in its catalytic domains [3].
According to the RO model [15,16,17,18] the table of the genetic code can be divided into the sub-codes NAN, NGN, NUN, and NCN. We have also shown that there exists an automorphism F of the cube defined piecewise, which transforms that division into the sub-codes RNR, YNR, RNY, and YNY, respectively, which is precisely our algebraic model [19].
2. Results
2.1. Group of Symmetries of the Classes of Synthetases in the Hypercube
The members of Class 2 are in the vertices of a square, contained in the convex closure of the set taken by a suitable change of coordinates, rotations, and translations of axis as a representation of the hypercube See Figure 1:
Figure 1.
aaRSs distribution in codes . Class 1 (red, 2 amino acids) and Class 2 (blue, 6 amino acids). Graphic representation of the subsets and The first 4-dimensional hypercube of the code: .
The quadruplets that belong to Class 2 of synthetases (colored black) are in the set of vertexes of a square on one of the 24 faces of the hypercube. is a subset of the set of the sixteen vertices of the hypercube The set of vertexes of that square is:
where
v1 = (1,−1,−1,−1), v2 = (1,−1,−1,1), v3 = (1,1,−1,1), v4 = (1,1,−1,1), black colored.
The center of this square is the point .
The members of Class 1 are in the complementary set of vertexes that is, the other 12 vertexes, red colored, out of the four vertexes of the square
v1 = (u1,u2,u3,u4,u5,u6,u7,u8,u9,u10,u11,u12) where
u1 = (−1,−1,−1,−1), u2 = (−1,−1,−1,1), u3 = (−1,1,−1,1), u4 = (−1,1,−1,−1)
u5 = (−1,−1,−1,−1), u6 = (−1,−1,1,1), u7 = (−1,1,1,1), u8 = (−1,1,1,−1)
u9 = (1,−1,1,−1), u10 = (1,−1,1,1), u11 = (1,1,1,1), u12 = (1,1,1,−1)
The main result of this Section 2.1 is: We have proved that the group of symmetries of both classes of synthetases, in the hypercube is the dihedral group Details are given in Appendix A.
Now we can state the following:
Theorem 1.
The group of symmetries of Class 1 are the same as that of Class 2.
Proof.
As the symmetry of Class 1 is a bijective isometrical function from onto itself, it preserves the set It also preserves the square that contains the quadruples of Class 2, that is, the vertices of the square It means that both classes have the same group of symmetries. That is so because the binary set of sets is a partition of the finite set of the 16 vertices of the hypercube
Observation: A similar theorem takes place in any of the other hypercubes of Extended code type II and in the whole 6-dimensional hypercubes. In the case of the 4-dimensional YNR, the 5-dimensional NNR, and the whole 6-dimensional, it is valid if, for methodological reasons, we assume the stop signal as it would be an amino acid of Class 1.
2.2. Group of Symmetries of the Classes of Synthetases in the Hypercube
In this case, the members of each class are in eight vertexes, out of the sixteen vertexes of the hypercube (see Figure 2). They are represented by the sixteen quadruples of the set of ones and minus one, being the hypercube the convex closure of the set The set is taken, by a suitable change of coordinates, rotations, and translations of axis, as a representation of the hypercube The center of this hypercube is the origin of coordinates
Figure 2.
aaRSs distribution in the code. Class 1 (red, 4 amino acids) and Class 2 (blue, 4 amino acids). Graphic representation of the subsets and Second 4-dimensional hypercube of the Extended RNA-code type 2: .
The set of vertexes of the Class 1, colored in red, is the set:
The members of Class 2 are in the complementary set of vertexes, that is, the other eight vertexes, colored black, out of the eight of Class 1.
This is the set:
Let us consider the affine transformation: that converts each −1 into 0 and 1 into itself. It represents a change of coordinates, that converts the hypercube into its isometric whose center is the point (1/2, 1/2, 1/2, 1/2). The sets and are now:
Then, the group of symmetries of both Classes of synthetases in the hypercube YNY is the binary group Details can be found in Appendix B.
2.3. Group of Symmetries of the Synthetases in the Hypercube
In this case there are 3 amino acids of Class 1 and 3 amino acids of Class 2 (Figure 3).
Figure 3.
aaRSs distribution in the code. Class 1 (red, 3 amino acids) and Class 2 (blue, 3 amino acids and yellow, the stop signals). Graphical representation of the subsets and Third 4-dimensional hypercube of the Extended code type II: .
For methodological reasons, we have assumed the three stop codons are members of Class 1. In this case, the members of each Class are on the eight vertices of a 3-dimensional cube. Both Classes coincide with the cubes and The vertices are represented by the sixteen quadruples of the set of ones and minus ones, being the hypercube the convex closure of the set The set is taken, by a suitable change of coordinates, rotation, and translations of axis, as a representation of the hypercube The center of this hypercube is the origin of the coordinates.
The set of vertexes of the cube, associated to Class I, colored in red, is the set:
The center of this cube is the point and the set of vertexes of the Class 2, colored in black, is the set:
whose center is the point
Then, we have proved that the group of symmetries of both classes of synthetases in the hypercube is the extended octahedral group See Appendix C.
2.4. Group of Symmetries of the Synthetases in the Hypercube
In this case the vertexes of Class 1, colored in red, are the set of six vectors (Figure 4):
which is the union of the square of vertexes with the edge of vertexes
Figure 4.
aaRSs distribution in the code. Class I (red, 3 amino acids) and Class 2 (blue, 5 amino acids). Graphic representation of the subsets and Fourth 4-dimensional hypercube of the Extended RNA-code type II: .
And the set of vertexes of Class 2, colored in black, are the set of the ten vertexes (Figure 4):
which is he union of the squares of vertexes and with the edge of vertexes
The group of symmetries of the Classes of synthetases in the hypercube RNR is isomorphic to the Abelian group where the reflection is the only symmetry of both Classes 1 and 2, and is the identity matrix. A detailed derivation is given in Appendix D.
2.5. Conclusions
It has been proven that the groups of symmetries of both classes of synthetases in each of the four hypercubes are the following (Table 5):
Table 5.
Summary of symmetry groups of aaRSs for each subcode.
3. The Genetic Code in 5 and 6 Dimensions
3.1. The Genetic 5-Dimensional Hypercube
The 5-dimensional hypercube is the disjoint union of the 4-dimensional hypercubes and is the subspace generated by the four unitary vectors and is its affine subspace Here, the sextuples of zeros and ones have been replaced by triplets of the letters with the following equivalence: is the binary set is the also binary and is the set
The four triplets are associated with the canonical unitary vectors In Figure 5, they are vectors initiated at the point colored red that generate the hypercube The triplet associated with the unitary vector also initiated and colored blue, completes the canonical basis of the 5-dimensional hypercube Obviously, the translation associated with converting into its affine subspace is colored blue, and the union of both with is the 5-dimensional hypercube The vertexes are the triplets or codons that end in a pyrimidine.
Figure 5.
The five-dimensional hypercube of triplets which is the union of the 4-dimensional hypercubes and is obtained from the by means of transversion of each nucleotide at the left of the triplet. is the subspace generated by the four unitary vectors CAC, CUC, CCU, UCC, and is its affine subspace The translation that represents these transversions is situated at an angle of 22.5 degrees with respect to the horizontal axis generated by the canonical vector At this angle, none of the vertices are overlapped.
It can be seen that the translation associated with the unitary vector represented by the triplet converts the 5-dimensional subspace into its 5-dimensional affine subspace whose triplets are those that end in a purine (Figure A1). It is the disjoint union of the 4-dimensional hypercubes and It is clear that the 6-dimensional hypercube is the union of the 5-dimensional hypercube with its affine subspace, also a 5-dimensional hypercube, Finally, we have, that is the disjoint union of the four 4-dimensional hypercubes, derived from the primaeval by transversions in the first or third nucleotide.
Note: The subspace (hypercube) is the solution set of the homogeneous system of two liner equations with the six unknowns Its affine subspace (hypercube) is the solution set of the non-homogeneous system also of rank 2, with the same six unknowns.
The 5D hypercube (see Figure A1) is obtained by the translation that converts every triplet that ends in pyrimidine into another that ends in purine, obtaining the hypercube It represents the disjoint union of the subcodes and Note that this hypercube contains the stop codons and The disjoint union of them, which is equal to the 6D hypercube that represents the entire SGC. For the 6D genetic code, each triplet is mapped into sextuples of o’s and 1's. This hypercube is the set isomorphic to which is the binary vector space over the binary field
3.2. The Hypercube of Dimension 6
It is the disjoint union of the 5-dimensional space and is the subspace generated by the five unitary vectors CAC, CUC, CCU, UCC, ACC, and its affine subspace The vector space is generated by the six unitary vectors CAC, CUC, CCU, UCC, ACC, and CCA. They conform to the canonical basis of The conversion of the hypercube in other whose components are ones and minus ones. Multiplying by 2 all the vectors of the space, the set of the vertexes is converted into the set of vertexes of the hypercube whose edges have length 2. Next, performing the substraction of the n-tuple the hypercube is converted into which is another hypercube, also with edges of length 2, whose set of vertexes is the set of the n-tuples of ones and minus ones.
Note: The n-tuple E is the opposite vertex, in the hypercube of the null vector The distance between them is the real number which is the diameter of that hypercube. The diameter of a hypercube is the diameter of the hypersphere that circumscribes it-Actually, the hypercube is the image of under the affine transformation being the identical automorphism. The new hypercube has its center at the origin of coordinates, and its radius is (Figure 6). The affine transformation converts every 0 into −1 and the number 1 into itself. The linear transformation is a homothetic transformation of ratio 2, which duplicates the size of every n-dimensional Figure. Its only fixed point is the origin O of the coordinates. The affine transformation is also a homothety of ratio 2, whose only fixed point is the point
Figure 6.
The 6D representation of the Standard Genetic Code.
4. Discussion
The emergence of the first encoded processing system at the molecular level triggered the phenomenon of life. Coding systems are an inherent property of biological systems. When analyzing the origin and evolution of codes, it becomes clear that the overlap of new codes over earlier/older ones increased the complexity of the previous codes, operating to better adjust and fine-tune them. These interrelated codes started to interact to form a macrocode composed of multiple, overlapping coding systems. Therefore, an information system should be understood as any system capable of implementing the (encoded) connection between natural entities. Life should be understood as an information system that is dependent on a code (open or closed) for its processing but operates independently of other macrocode structures [20].
The distribution of aaRSs in the subcodes is not random. From the putative ancestral symmetric gene to the whole SGC, there is always a symmetrical distribution along the evolution of the genetic code.
Class aaRS I (Class aaRS II) can be converted into Class aaRS II (Class aaRS I) by means of isometric algebraic functions [19]. The fact that enzymes belonging to the two synthetase classes are grossly mirror images of each other (e.g., they approach opposite sides on tRNAs) motivated a phylogenetic analysis that indicated that these proteins were originally coded for by opposite strands of the same gene [15,16,17] in the later stages of the RNA world. This scenario was strengthened experimentally [18]. When the symmetry groups have a structure-preserving one-to-one correspondence, they are considered isomorphic [21].
All synthetases of Class II can be found in the first two 4D-hypercubes (RNY and YNY). Moreover note that Class I and II of aaRSs existed for all 20 amino acids in the Extended RNA codes 1 and 2, before the completion of the SGC. According to the symmetries found in the last step to arrive at the SGC, duplications of Lys, Arg, Glu, Gly, Pro, Leu, Ser, and Phe for the Extended RNA code type 1 and duplications of Gln, Arg, Stop, Trp, Pro, Leu, and Ser for the Extended RNA code type 2 were necessary, resulting in a simpler algebraic structure of the SGC (see Table 1).
The hypercubes consider mono-codonic, di-codonic, three-codonic, tetra-codonic, and hexa-codonic amino acids. In each code, we can observe symmetrical structures in the distribution of aaRSs. We observed that the symmetrical properties of the aaRSs distribution in the SGC are simpler than the ones observed for both Extended RNA codes. In short, there are only 20 aaRSs, one for each amino acid and, respectively, for isoacceptor tRNAs; hence, the aaRS link to the 20 coded amino acids is non-degenerate [3].
Our group-theoretical approach does not explicitly consider how the allocation of aaRSs during the evolution of the genetic code was constrained by the structural and functional properties of the interaction of aaRS, amino acids, and tRNA. Yet it permits us to determine the types of symmetries of aaRSs during the evolution of the SGC. The symmetry groups found in the RNA codes highlight intricacies in the evolution of aaRSs in conjunction with the evolution of the genetic code itself.
We have determined the type of symmetries of aaRSs in each of the 4 subcodes of the Extended RNA code type II. We used the 4D representation of each subcode. The RNY subcode has the dihedral group, with 8 symmetries, 4 rotations, and 4 reflections. The YNY subcode has symmetry with 2 symmetries; the YNR subcode exhibits octahedral amplified symmetry with 48 elements, 24 rotations, and 24 roto-reflections (the octahedral classic group contains only 24 rotations); and the RNR subcode displays the symmetry of the binary group In each subcode, symmetrical distributions of both Classes of aaRSs were found. Indeed, we proved that for each hypercube, the group of symmetries of Class 1 is the same as the group of symmetries of Class 2. This theorem holds for both 5D hypercubes and for the whole 6D representation of the SGC. Note that the distribution of aaRSs in both Extended RNA code type I and II and the SGC is symmetrical, which is consistent with the notion that the evolution of the two aaRS classes evolved in parallel [21].
The consideration of stop codons in the determination of the type of symmetry of the subcode is not fortuitous. Mitochondrial codes present variations principally in the codons for the stop signals and unicoded amino acids. The mitochondrial genetic codes of yeast, invertebrates, and vertebrates present variations principally in the codons for the stop signals and unicoded amino acids https://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi?chapter=tgencodes#SG24 (accessed on 31 July 2023). The stop codons are tricodonic in SGC, tetracodonic in vertebrate mitochondria, and dicodonic in invertebrate and yeast mitochondria.
In computer science, a byte is an octet of 0’s and 1’s, where each bit represents 0 or 1. Hence, in the genetic code, a byte would correspond to a sextuple of 0’s and 1’s, where each of them represents a triplet or codon of the nucleotides The presence of stop codons converts the genetic code into an algorithm that carries out protein synthesis. This means that the whole process of protein synthesis is carried out by a Turing machine, i.e., by a recursive function. Unlike the Turing machine, the genetic code has the additional property of heritability. In a forthcoming work, we will develop these concepts.
Our work can be regarded as a possible pathway for the distribution of the 2 Classes of aaRSs during the formation of the SGC. According to the model of Rodin-Ohno [15,16,17,18], there was a single gene encoding for two synthetases. They proposed a single anti-parallel complementary strand of a single base-paired nucleic acid molecule. The Rodin-Ohno model divides the table of the genetic code into two classes of aminoacyl-tRNA synthetases (Classes 1 and 2), with recognition from the minor or major groove sides of the tRNA acceptor stem [15,16,17,18]. According to the table of the genetic code, the RO model is almost symmetric. It turns out that the RO model is symmetric in a six-dimensional (6D) hypercube (José et al. 2017). Conversely, using the same automorphisms, the RO model can lead to the SGC. Class aaRS 1 (Class aaRS 2) can be converted into Class aaRS II (Class aaRS I) by means of isometric algebraic functions [19]. Notably, the 6D algebraic model is compatible with both the SGC (based upon the primeval RNY code) and the RO model [19]. Our results have implications in areas such as creating synthetic codes, astrobiology, and computer science. In astrobiology, it provides new insights into the quest for life in the Universe. In computer science, it provides a guideline for the establishment of decision criteria to define what should be considered an artificial life.
Author Contributions
Conceptualization, M.V.J. and E.R.M.; methodology, M.V.J. and E.R.M.; formal analysis, E.R.M. and M.V.J.; investigation, M.V.J. and E.R.M.; writing—original draft preparation, E.R.M. and M.V.J.; writing—review and editing, M.V.J. and E.R.M.; visualization, J.R.B.; Figures, J.R.B. All authors have read and agreed to the published version of the manuscript.
Funding
M.V.J. was funded by Dirección General de Asuntos del Personal Académico (DGAPA), the Universidad Nacional Autónoma de México, and UNAM (PAPIIT-IN200823).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Let us consider the orthogonal matrix which represents a rotation of angle in the plane It operates over the set of vertexes of the square S in the following way:
This means that is an isometry of the square
Let us now consider the orthogonal matrix which represents a reflection through the 3-dimensional hyperplane that reverses the fourth axis It operates over the set of vertexes of the square in the following way:
This means that is an isometry of the square It is not difficult to verify the following equalities, that occur between the matrixes and
They are the defining relations of the dihedral group the group of all the symmetries of a square.
Now, we will see which is the action of the dihedral group generated by the matrixes and over the set of the 12 quadruplets that represent the synthetases of Class 1.
The action of over the set is,
It is seen that is an isometry of the set of synthetases of Class 1.
The action of over the set is:
Note that is an isometry of the set of synthetases of Class 1.
We see that this is a symmetry of the set vertexes of Class 1 synthetases. As and are symmetries of the Class 1 of synthetases, this result confirms the assertion of Theorem 1, where it was proved that both classes have the same group of symmetries.
Appendix B
The group of symmetries of both classes of synthetases is the binary group
Let us consider the orthogonal matrix which represents a rotation of angle around the vectorial plane generated by the vectors and which fixes the point changes the sign the vector and interchanges the vectors and
Let us now see what the action of the rotation A over the set is:
It is seen that the rotation is a symmetry of the set
Next, we will see the action of over the set of vertexes
Notice that rotation is a symmetry of the set
The rotation of an angle is an element of order two. Then, the group of symmetries of the classes of synthetases in the hypercube contains the group generated by the rotation which is isomorphic to the binary group
The isometric transformation is the only symmetry of the sets, and in the hypercube Then, the group of symmetries of both classes of synthetases is the binary group
Appendix C
The group of symmetries of classes of synthetases in the hypercube is the extended octahedral group
Let us consider the translation associated to the point Applying it to the set we obtain:
which is the set of vertexes of another cube, with center in the origin of coordinates.
Analogously, applying to the set the translation we obtain:
which is the same set of vertexes of a cube with center in the origin of coordinates.
Let us now consider the orthogonal matrices:
which represent rotations, and of angles radians and of angle radians. They satisfy the relations:
that are the defining relations of the octahedral group, which is the group of the rotational symmetries of a cube or regular hexahedron. It is also the group of rotational symmetries of a regular octahedron, which is the dual figure of the regular hexahedron. Adding to the set of rotational matrixes and the matrix
which represents a reflection through the three-dimensional subspace generated by the unitary vectors that reverses the fourth axis the amplified octahedral group is obtained, where
We invite the lector to verify that these rotations, and the reflection are symmetries of the sets and of the vertexes of Class 1 and Class 2, respectively.
From this, it follows that the group of symmetries of classes of synthetases in the hypercube is the extended octahedral group
Appendix D
The reflection is the only symmetry of both Classes 1 and 2.
Now, consider the matrix which represents the same reflection of Section 2.4.
Through the hyperplane that reverses the axis This reflection changes the sign of the fourth component of each quadruple, and it is easy to notice that it produces the following interchange of vectors in the Class I of synthetases:
Hence, is a symmetry in the Class I of synthetases in the hypercube
By Theorem 1, it should also be a symmetry of the set of vertices in Class 2. However, it is straightforward to verify that it produces the interchange of vectors:
Then, is also a symmetry of Class 2.
It is not difficult to notice that the reflection is the only symmetry in both Classes 1 and 2. As it is an element of order two, the group isomorphic to the Abelian group is, as in the case of Part II, the group of symmetries of the classes of synthetases in the hypercube
Appendix E
Mathematical background
The Euclidean n-dimensional space, being the field of real numbers, has a basis the so-called canonical basis, where, for each with 1 is in the place and 0 is in the remainders.
A remarkable observation: The n-dimensional vector space is a generalization of the ordinary 3-dimensional space, where every triplet represents a point of the space, endowed with a system of axis and pairwise orthogonal or perpendicular, intercepted
in a common point denoted as O and called the origin of coordinates, represented by the triplet (0,0,0). Usually, every point P is identified with the triplet that represents it, as if they were the same object. So, the equality is admitted. The axes are right lines defined as , , The addition of two points and is defined as This operation, therefore defined, is associative, has a neutral element, and for every point its additive inverse exists, such that Then, the ordered pair is a group. As the addition + is also commutative, it is an Abelian group. The product of a real number by a point is defined as This external operation of a real number by a point has the following algebraic properties:
- for (Mixed associativity).
- for every (Existence of an external neutral element).
- for all (Distributivity of the product with respect to the addition of points, also called distributivity at the left).
- for all (Distributivity of the product with respect to the addition of numbers, also called distributivity at the right.)
The condition of the Abelian group for the addition and the four properties of the external operation completes the structure of a vector space over the field of the real numbers, in the set of all the points of the space.
The concept of vector: Given two different points we call the vector of origin and extreme to the oriented segment of the right line that joins with It is denoted as We define the addition of two vectors and with common origin P as the vector such that If the points and are not collinear, the four points are the vertices of a plane parallelogram. It means that the lines and are parallel. If the null point the function: where for is defined as a vector whose initial and final points coincide, is a bijective function between the set of all the triplets or points of the space, and the set of all the vectors with origin This correspondence induces over the set the operations and of addition and product of numbers by vectors and is endowed the structure of a vector space over the field of the real numbers.
Euclidean norm
The Euclidean norm, module, or length, of a vector is defined as the non-negative square root of the sum of the squares of its components.
Main properties of the Euclidean norm
(N1) For every and it is =0 if, and only if, the null vector , (Positivity of the norms for all the non-null vectors)
(N2) For and (The norm module, or absolute value, of the real number is the positive of the couple and for
From N2 we have that for every
(N3) For all (Triangle inequality).
The Euclidean Distance between vectors
For the Euclidean distance between their extremes, is defined as
Main properties of the Euclidean distance
(D1) For every being the null vector (Positivity of the distance to the origin for all the final points of the non-null vectors).
(D2) For all (Symmetry).
(D3) For all (Triangle inequality).
With this distance the set of vectors, is a metric space and the vector space is called the n-dimensional Euclidean Vector Space.
Linear transformations
A function is called a linear transformation, or linear endomorphism, if it has the following property:
For all and the equality holds.
Note: The latter condition is equivalent to the two conditions:
(L1) for all (This means that is a group homomorphism of the additive group onto itself).
(L2) for all and
If the function is bijective, the linear transformation is a linear isomorphism, and it is called a linear automorphism of the vector space.
Matrix representation of a linear transformation
For every endomorphism there is a unique square matrix such that for every j,
being the column of Then, is a matrix whose columns are the column matrixes associated with the vector images of the canonical unitary vectors. It is called the matrix of with respect to the canonical basis. It will be denoted as In the case of an automorphism, the matrix is a non-singular or invertible matrix; hence, its determinant is different from 0.
For every vector the column matrix is taken as the matrix representation of It is well known that the matrix representation of the image vector is the matrix product Conversely, every square matrix defines an endomorphism whose matrix representation is
Translations
For any vector the translation defined by is the function such that, for all For every is a bijective function that preserves the norm of each vector and the distance between different vectors of the space. Hence, it is an isometry of the Euclidean vector space. For two translations and the composition is equal to the translations
The correspondence between and the set of all the translations, is a bijective function, that induces over the structure of a n-dimensional vector space, isomorphic to the Euclidean n-space
The concept of affine transformation
An affine transformation is a function that is a composition of a linear endomorphism followed by a translation It acts on every vector as The affine transformation is bijective if, and only if, its linear part f is an automorphism. The set of all the affine transformations of the vector space will be denoted as The composition of two affine transformations and acts over a vector in the form:
Then, the composition is the affine transformation Hence, we have equality
We see that the composition of two affine transformations is an affine transformation. Then, the composition is an inner operation in the set As the operation is associative and has a neutral element, the identity function it is the affine transformation the ordered pair is a monoid or semigroup with a neutral element.
Matrix representation
The matrix representation of the affine transformation is given by the composition being the matrix of which acts over the matrix of a vector as
The conjugated of a translation by a linear automorphism
The conjugated acts over any vector as Then The latter means that the group of translations is a normal subgroup of the group being the set of all the invertible affine transformations, and the group of all the linear automorphisms of the space The group that is called the affine group of the space is a subgroup of the group of all the bijective transformation of the set It contains as a normal subgroup, the group of all the translations. In fact, it is the semidirect product of the group with the group of all the linear automorphisms. (A group is semidirect product of its subgroups and if and is a normal subgroup of The semidirect product will be denoted as Then, The affine group is the group of invertible elements of the monoid
The n-dimensional binary hypercube
The binary vector space vector space over the binary field is also called the n-dimensional binary hypercube. Its set of vertexes coincides with the set of the n-tuples of zeros and ones. This set is a subset of the set of all the n-tuples of the whole numbers. The set determines the Abelian group subgroup of the additive group of the n-dimensional Euclidean n-space
The n-dimensional hypercube, whose vertexes are the elements of the set is defined as the convex closure of the set Here, denotes the closed interval of the real numbers between 0 and 1. The convex closure of a set is the minimal convex set that contains it.
A convex set is a set that, for two different elements and of the segment that joins them is entirely contained in
The center of the hypercube is the point obtained by the sum of the vertexes divided by That is so because, for every place i of the n-tuples, of them, that is, the half, have 1, and the other have 0. Then, for each place i, the quotient is
A metric in the hypercube
The Hamming Norm and the Hamming Distance defined in
The Hamming Norm: The Hamming Norm of a sextuple is defined as the sum in of its components It is equal to the number of ones it has. The only unitary sextuples, that is, of norm equal to 1, are those of the canonical basis.
The Hamming distance. The Hamming distance between two sextuples and is defined as the norm of their module 2 addition. It is equal to the number of places where they differ. It is not difficult to prove that the distance so defined is actually a distance, in the mathematical sense. It means that it has the following properties:
(D1) For all (positivity of the distance between two different sextuples and ), and nullity of the distance of a sextuple to itself.
(D2) For all (Symmetry)
(D3) For (Triangle inequality)
The edges of the hypercube
Two vertexes are said to be neighbors if the distance between them is equal to 1. It means that they differ in only one of their n components. An edge of the hypercube is the segment that joins two neighbor vertexes. This segment has a length equal to 1. For every vertex there are neighbors. As there are vertexes, the product would count the total number of edges. However, as every edge has two extremes, it would be counted twice. Then, the actual number of edges is equal to . For example, in the ordinary cube, 3-dimensional, the number of edges is and, in the 4-dimensional, this number is
The faces of the hypercube
Any face of the hypercube is a plane square determined by four vertexes and four edges in such a way that two edges with a common vertex are orthogonal. If we fix components with values zeros and ones and leave free the other two, the four obtained vertexes are the vertexes of a plane square, then of the face of the hypercube. It may be conducted in ways. Hence, the total number of different faces is precisely (The combinatorial number is the number of subsets of k elements of a set of n elements, being ). For example, for n = 3 the number of faces is and for n = 4 it is 6 × 4 = 24.
Figure A1.
Five-dimensional hypercube NNR. The five-dimensional hypercube of triplets which is the union of the 4-dimensional hypercubes and .
References
- Ibba, M.; Söll, D. Aminoacyl-tRNA synthesis. Annu. Rev. Biochem. 2000, 69, 617–650. [Google Scholar] [CrossRef] [PubMed]
- Cusack, S.; Berthet-Colominas, C.; Hartlein, M.; Nassar, N.; Leberman, R. A second class of synthetase structure revealed by X-ray analysis of Escherichia coli seryl-tRNA synthetase at 2.5 A. Nature 1990, 347, 249–255. [Google Scholar] [CrossRef] [PubMed]
- Eriani, G.; Delarue, M.; Poch, O.; Gangloff, J.; Moras, D. Partition of aminoacyl-tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature 1990, 347, 203–206. [Google Scholar] [CrossRef] [PubMed]
- Burbaum, J.J.; Schimmel, P. Structural relationships and the classification of aminoacyl-tRNA synthetases. J. Biol. Chem. 1991, 266, 16965–16968. [Google Scholar] [CrossRef] [PubMed]
- Ribas de Pouplana, L.; Schimmel, P. Two classes of tRNA synthetases suggested by sterically compatible dockings on tRNA acceptor stem. Cell 2001, 104, 191–193. [Google Scholar] [CrossRef] [PubMed]
- Ribas De Pouplana, L.; Schimmel, P. Operational RNA code for amino acids in relation to genetic code in evolution. J. Biol. Chem. 2001, 276, 6881–6884. [Google Scholar] [CrossRef] [PubMed]
- Ribas de Pouplana, L. The evolution of aminoacyl-tRNA synthetases: From dawn to LUCA. In Enzymes; Academic Press: Cambridge, MA, USA, 2020; Volume 48, pp. 11–37. [Google Scholar]
- Schimmel, P.; Giege, R.; Moras, D.; Yokoyama, S. An operational RNA code for amino acids and possible relation to genetic code. Proc. Nat. Acad. Sci. USA 1993, 90, 8763–8768. [Google Scholar] [CrossRef] [PubMed]
- José, M.V.; Bobadilla, J.R.; Zamudio, G.S.; Farías, S.T. Symmetrical Distributions of Aminoacyl-tRNA Synthetases during the evolution of the Genetic Code. Theory Biosci. 2023, 142, 211–219. [Google Scholar] [CrossRef]
- José, M.V.; Morgado, E.R.; Govezensky, T. An extended RNA code and its relationship to the standard genetic code: An algebraic and geometrical approach. Bull. Math. Biol. 2007, 69, 215–243. [Google Scholar] [CrossRef] [PubMed]
- José, M.V.; Morgado, E.R.; Govezensky, T. Genetic Hotels for the Standard Genetic Code: Evolutionary Analysis based upon Novel Three-dimensional Algebraic Models. Bull. Math. Biol. 2011, 73, 1443–1476. [Google Scholar] [CrossRef] [PubMed]
- Nirenberg, M.W.; Matthaei, J.H. The dependance of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides. Proc. Natl. Acad. Sci. USA 1961, 47, 1588–1602. [Google Scholar] [CrossRef] [PubMed]
- Nagel, G.M.; Doolittle, R.F. Phylogenetic analysis of the aminoacyl-tRNA synthetases. J. Mol. Evol. 1995, 40, 487–498. [Google Scholar] [CrossRef] [PubMed]
- Woese, C.R.; Olsen, G.J.; Ibba, M.; Söll, D. Aminoacyl-tRNA synthetases, the genetic code, and the evolutionary process. Microbiol. Mol. Biol. Rev. 2000, 64, 202–236. [Google Scholar] [CrossRef] [PubMed]
- Rodin, S.N.; Ohno, S. Two types of aminoacyl-tRNA synthetases originally encoded by complementary strands of the same nucleic acid. Orig. Life Evol. Biosph. 1995, 25, 565–589. [Google Scholar] [CrossRef] [PubMed]
- Rodin, S.N.; Rodin, A.S. On the origin of the genetic code: Signatures of its primordial complementarity in tRNAs and Aminoacyl tRNA synthetases. Heredity 2008, 100, 341–355. [Google Scholar] [CrossRef] [PubMed]
- Rodin, A.S.; Rodin, S.N.; Carter, C.W. On primordial sense–antisense coding. J. Mol. Evol. 2009, 69, 555–567. [Google Scholar] [CrossRef] [PubMed]
- Carter, C.W., Jr.; Li, L.; Weinreb, V.; Collier, M.; Gonzalez-Rivera, K.; Jimenez-Rodriguez, M.; Erdogan, O.; Kuhlman, B.; Ambroggio, X.; Williams, T.; et al. The Rodin-Ohno hypothesis that two enzyme superfamilies descended from one ancestral gene: An unlikely scenario for the origins of translation that will not be dismissed. Biol. Direct. 2014, 9, 11. [Google Scholar] [CrossRef] [PubMed]
- José, M.V.; Zamudio, G.S.; Morgado, E.R. A unified model of the standard genetic code. R Soc. Open Sci. 2017, 4, 160908. [Google Scholar] [CrossRef] [PubMed]
- de Farias, S.T.; Prosdocimi, F.; Caponi, G. Organic Codes: A Unifying Concept for Life. Acta Biotheor. 2021, 69, 769–782. [Google Scholar] [CrossRef] [PubMed]
- Sethuraman, B.A. Rings, Fields, and Vector Spaces; Springer: New York, NY, USA, 1997. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).