Recognition of Timestamps and Reconstruction of the Line of Organism Development

Kasperski, Andrzej

doi:10.3390/pr11051316

Open AccessFeature PaperArticle

Recognition of Timestamps and Reconstruction of the Line of Organism Development

by

Andrzej Kasperski

Laboratory of Bioinformatics and Control of Bioprocesses, Institute of Biological Sciences, Department of Biotechnology, University of Zielona Góra, ul. Szafrana 1, 65-516 Zielona Góra, Poland

Processes 2023, 11(5), 1316; https://doi.org/10.3390/pr11051316

Submission received: 20 March 2023 / Revised: 4 April 2023 / Accepted: 13 April 2023 / Published: 24 April 2023

(This article belongs to the Special Issue Computational Biology Approaches to Genome and Protein Analyzes)

Download

Browse Figures

Versions Notes

Abstract

In this work, an artificial neural network is used to recognize timestamps of evolution. Timestamps are associated with outliers determined during the recognition of the genome attractors of organisms. The aim of this work is to present a new method of penetrating deep into evolution using the recognized timestamps. To achieve this aim, the neural networks of different number of layers were implemented in order to check the influence of the number of layers on the visibility of the timestamps. Moreover, the teaching process was repeated 10 times for each implemented neural network. The recognition of each organism evolution was also repeated 10 times for each taught neural network to increase the reliability of the results. It is presented, among other findings, that during the recognition of the timestamps of evolution not only the number of homologous comparisons and the lengths of compared sequences are important but also the distribution of similarities between sequences. It is also presented that the recognized timestamps allow for travel between genome attractors and reconstruct the line of organism development from the most advanced to the most primitive organisms. The results were validated by determining timestamps for exemplary sets of organisms and also in relation to semihomology approach and by phylogenetic tree generation.

Keywords:

line of development; neural network; outliers; timestamps; unified cell bioenergetics

1. Introduction

The genetic history of organism development is stored in the cell genome [1,2]. This phenomenon is called phylogenetic memory [1,2]. The existence of a phylogenetic memory can indicate that there are genes in the genome that represent the evolutionary genetic history of organisms. These genes should also allow for the recognition of the timestamps of organism development. An example of the existence of phylogenetic memory is cancer development. The transformed cell loses control over this historic potential stored in the genome, and as a result, the atavistic code is activated uncontrollably [3,4]. This article is an extension of previously published works that present unified cell bioenergetics (UCB) and propose methods that can be used to study organism evolution [4,5,6,7,8,9,10]. It has been pointed out that UCB allows for the interpretation of several cell bioenergetic effects, certain diseases and as well as some causes of evolution [10,11,12,13,14,15]. As it was presented, the evolution of organisms can be considered as a discontinuous process that occurs mainly in genome attractors, where organisms undergo random mutations and natural selection [5,16,17]. In a general sense, the term “attractor” denotes a configuration to which the system tends over time. After attaining the attractor, the system gains sufficient stability to return to its original state after the disappearance of emerging disturbances [18]. In this sense, genome attractors allow for the stabilization of configurations of features that are typical for given organisms [5]. This approach is consistent with the researchers’ current interest in ‘organisms as attractors in phase space’ [19].

As a highly conserved and omnipresent protein, cytochrome c is additionally characterized by linear changes in amino-acid differences between different lineages over time, which makes it suitable for the study of cladistics [20,21,22]. In this work, cytochrome c is used as a representative that allows not only for the recognition of genome attractors, but also keeps historical information about organism evolution. This historical information can be considered as specific pictures that have been written in DNA during organism evolution. In this work, is presented what is visible during the analysis of these evolutionary pictures and how deep we can penetrate into evolution using these pictures. As it is presented, these pictures can be recognized by artificial neural networks and, based on this recognition, the timestamps of evolution can be identified. As a result, in this work a new method of penetrating deep into evolution using the recognized timestamps has been established. It is presented that, using this method, traveling between genome attractors and the reconstruction of the line of organism development from the most advanced to the most primitive organisms is possible.

Modern methods of determining phylogeny are usually based on calculations. Phylogenetic trees have remained a central metaphor in evolutionary biology since Charles Darwin sketched his first evolutionary tree in 1837 [23]. Evolutionary trees have now become the subject of detailed, rigorous study to reconstruct the branching patterns that led to the diversity of life evolution [23,24,25]. The fundamental problem in striving to determine the real course of evolution by generating phylogenetic trees is the very large number of trees (even for a relatively small number of taxa) that should be evaluated in order to select the tree that can be considered the best in the light of the assumed criteria [26]. For this reason, the determination of the best tree must very often be made after evaluating a small number of possible trees. In this case, along with determining the best tree, it is necessary to determine the reliabilities of internal nodes in the tree. One of the most commonly used methods to determine the reliability of a generated tree is the bootstrap method, which uses the bootstrap resampling technique [27,28]. The bootstrap method has been implemented, for example, in the MEGA program [29]. A small number of evaluated trees, in relation to the total number of trees that can be generated, can be considered as an important factor of the low reliability of tree nodes. Especially in the case when nodes of low reliability are located close to the root of the tree, the reliability of the entire generated tree can also be considered low. This disadvantage of using phylogenetic trees to reconstruct the real phylogenesis makes it necessary to develop and implement new algorithms dedicated, among others, to reconstruct the line of organism development.

Artificial neural networks (ANNs) are a method of artificial intelligence that has been applied in various fields [30,31,32,33,34]. ANNs are most suitable for solving complex, highly non-linear, ill-defined problems with many different variables, which include, for example, establishing protein secondary structure, classification of cancers, gene prediction and drug design [30,35,36]. In this work, neural networks are used to recognize patterns drawn by evolution (and stored in cytochromes c), but contrary to previous works, genome attractors were not exclusively detected. The aim of this work is to show how to reconstruct the line of organism development using the timestamps of evolution recognized by artificial neural networks.

This article is organized as follows: First, the methods and theoretical bases are presented, including descriptions of the implemented neural networks along with teaching and recognition procedures and a semihomologous approach. Secondly, a new method of penetrating deep into evolution using the recognized timestamps is presented for the exemplary organisms along with the validation of the results. Then, a general algorithm of the method of penetrating deep into evolution, with the aim to establish a line of organism development, is proposed. Finally, the conclusions are presented.

2. Materials and Methods

At the beginning of this section, the web resources and programs used in this study are discussed, followed by information related to the artificial neural networks (i.e., implementation, teaching and recognition), as well as the semihomologous approach.

2.1. Web Resources

The cytochrome c amino-acid sequences selected for this study were downloaded from the NCBI and UniProt databases (http://staff.uz.zgora.pl/akaspers/Pro/sequences.txt (accessed on 20 March 2023)).

2.2. Computer Programs

The calculations presented in this work were made using the EvolutionXXI, dotPicker and MEGA X programs. The EvolutionXXI and dotPicker programs were written by the author. The EvolutionXXI contains an implemented neural network and was used in this work to recognize timestamps. The dotPicker program contains an implemented multidimensional semihomologous Dot-Matrix method and was used in this work to determine similarities between sequences at the codon level. The MEGA X program [29] was used to generate phylogenetic trees.

2.3. Implementation and Teaching of the Artificial Neural Networks

Neural networks with different numbers of layers were implemented, (i.e., neural networks with 2, 3, 4, 5, 6, 7 and 8 layers) in order to check the impact of the number of layers on the teaching process and results. In the first step of the research, each of the implemented neural networks was taught 10 times using a set of 15 organisms (see Table 1). The recognition of each organism was also repeated 10 times and the final result of recognition was calculated as an average value in order to increase the reliability of the results. A standard deviation was calculated for each final recognition result. A low standard deviation means that there are data clusters around the mean. A high standard deviation indicates that the data are widely spread out.

Full-synapse ANNs with the sigmoid transfer function y = 1/(1 + exp(−x)) were implemented and taught [37,38,39]. The amino-acid sequences used to teach and recognize organism evolution were converted to binary form by changing each character in the sequences to a five-positional binary number (i.e., ‘A’ was converted to “00001”, ‘B’ was converted to “00010” and ‘C’ was converted to “00011”). This way of conversion offered good results when cytochrome b and cytochrome c sequences were used to recognize evolution [5,6]. All sequence lengths were adjusted to 105 (since the number of amino acids in cytochrome c sequences is approximately 105) by cutting the sequence or adding “−” characters encoded by “00000” when the sequence was longer or shorter, respectively [5]. After alignment, the length binary form of sequences was equal to 5 × 105, which determines the number of neurons in the ANN input layer (i.e., nbrInp = 525) [5]. The number of neurons in the ANN output layer was equal to the number of organisms. The number of neurons in the hidden layers was established by the geometric pyramid rule [40]. In accordance with the geometric pyramid rule, the number of neurons in the hidden layers is equal to:

–: For one hidden layer:

$nbrHid = {(nbrInp \times nbrOut)}^{\frac{1}{2}}$

nbrHid—the number of neurons in the hidden layer;

nbrInp—the number of neurons in the input layer;

nbrOut—the number of neurons in the output layer.

–: For two hidden layers:

$r = {(nbrInp / nbrOut)}^{\frac{1}{3}}$

$nbrHid 1 = nbrOut \times r^{2} - the number of neurons in the first hidden layer;$

$nbrHid 2 = nbrOut \times r - the number of neurons in the \sec ond hidden layer .$

–: For three hidden layers:

$r = {(nbrInp / nbrOut)}^{\frac{1}{4}}$

$nbrHid 1 = nbrOut \times r^{3} - the number of neurons in the first hidden layer;$

$nbrHid 2 = nbrOut \times r^{2} - the number of neurons in the \sec ond hidden layer;$

$nbrHid 3 = nbrOut \times r - the number of neurons in the third hidden layer .$

and so on.

The number of neurons in the layers of the implemented neural networks for the 15 organisms used for teaching, calculated in accordance with the geometric pyramid rule, is presented in Table 2. The number of neurons in each layer, for example, for 5-layer neural network, is equal to: nbrInp = 525, nbrHid1 = 216, nbrHid2 = 89, nbrHid3 = 36, nbrOut = 15.

Before teaching, the 15 organisms (Table 1) were set in random order. A teaching pattern for the first organism was equal to “000000000000001”, for the second organism “000000000000010”, for the third organism “000000000000100”, and for the 15th organism “100000000000000”. ANNs were taught using the online backpropagation algorithm (learning rate equal to 0.3 and momentum equal to 0.1), using established patterns in this way. The teaching processes were carried out until the root-mean-squared Error (RMSE) was below 0.002. This way of teaching the ANN indicates that the recognition of organism similarities is in the range between 0 and 1, where 0 means the smallest similarity and 1 means the highest similarity. Calculations were made on a computer with the following characteristics: IBM HS23 (2CPU Xenon E5–2650 × 4 Core, 2.00 GHz, RAM = 12 GB, HDD = 50 GB.

2.4. Semihomologous Approach

The semihomologous approach allows for a better interpretation of the results, by allowing amino-acid comparison at the codon level [41,42,43]. If the compared amino-acids are different, the level of considerations is changed and the amino-acids are compared at the codon level. As a result, the result of the comparison of two amino-acid sequences can contain positions of the following types:

“R”—Homologous position, if the compared amino-acids are the same;
“#”—Transition-type semihomologous position, if there is a one-point mutation of the transition type in the codons of the compared amino-acids;
“$”—Transversion-type semihomologous position, if there is a one-point mutation of the transversion type in the codons of the compared amino-acids;
“-”—Another position, if there is two or three point mutation in the codons of compared amino acids.

3. Results and Discussion

First, the neural networks were taught and validated and timestamps were recognized for sets of monkeys and fish. It transpired that, during the recognition, not only similarity to the closest evolutionary organism can be recognized but also similarities to other, evolutionarily more distant organisms, as similarities of the lower values visible in the “background”. In this article, a set of recognized outliers constitutes the timestamp of evolution. Background similarities usually have relatively small values, for this reason, it was decided to teach each neural network more than once (i.e., 10 times) to increase the reliability of outlier recognition. Each examined organism was also recognized 10 times by each taught neural network and the similarities presented in the article were calculated as the arithmetic means of 10 recognitions. The standard deviations were calculated and presented along with the average similarities.

3.1. Teaching of the Neural Networks

In general, the teaching process involves updating the weights of neurons [37]. In this work, the weights in the artificial neural networks (ANNs) were adjusted using the error (root-mean-squared Error (RMSE)) between the predicted and correct values in order to minimize RMSE (i.e., lower RMSE to a value below 0.002). It was concluded that it is impossible to teach an 8-layer neural network using the set of 15 organisms, i.e., after several days of teaching, the RMSE was higher than 0.5. Moreover, for a 7-layer neural network, only about 30% of the teaching processes conducted were successful (i.e., RMSE dropped below 0.002). For 2-, 3-, 4-, 5- and 6-layer neural networks, the teaching process was repeated 10 times. Figure 1 shows the average number of teaching cycles and standard deviations for processes of teaching the 2-, 3-, 4-, 5- and 6-layer neural networks.

From Figure 1, it can be observed that the 2-layer neural network needs the largest number of teaching cycles, but the standard deviation (of the number of teaching cycles) calculated for 10 teaching processes is the smallest. For a 3-layer ANN, the number of teaching cycles is the smallest. When the number of layers increases from 3 to 6, the number of teaching cycles and standard deviation increase; however, the standard deviation remains relatively small for a 5-layer ANN.

3.2. Validation of the Neural Networks and Recognition of Timestamps for a Set of Monkeys

First, the taught neural networks were validated using a set of monkeys. As shown in Table A1, Table A2, Table A3, Table A4 and Table A5 (see Appendix A), timestamps that contain three outliers (Homo sapiens, Horse and Crow (listed in the order of the decreasing average value of recognized similarity)) were recognized during this validation. This means that, when recognizing similarity, monkeys are similar to Homo sapiens, but also to Horse to a lesser extent and to Crow the least. That means that monkeys show, mainly, a similarity to Homo sapiens but that similarities to Horse and to Crow are also visible in “the background”. It is also evident that similarities to Homo sapiens recognized by 2-, 3-, 4- and 5-layer neural networks were greater than similarities recognized to Horse and any standard deviation was small (see Table A1, Table A2, Table A3 and Table A4 in Appendix A). Moreover, an increase in the number of layers from 2 to 5 caused an increase in similarity to Homo sapiens. For the 6-layer ANN, the similarity of New World Monkeys (i.e., monkeys with numbers from 12 to 18 (see Table A5 in Appendix A)) to Homo sapiens suddenly decreased, the similarity to Horse increased significantly and the standard deviation increased, compared to the 5-layer ANN. For this reason, the subsequent recognitions presented in this work were made using 2-, 3-, 4- and 5-layer ANNs.

3.3. Validation of the Neural Networks and Recognition of Timestamps for a Set of Fish

The neural networks were also validated using a set of fish. As shown in Table A6 and Table A9, the timestamps that contain five outliers (Horse, Crow, Frog, Goldfish and Worm) were recognized during this validation. The greatest similarities of each fish from the set of fish used for validation (i.e., Epinephelus moara, Epinephelus fuscoguttatus, Brienomyrus brachyistius, Hypomesus transpacificus and Nothobranchius furzeri) were to Goldfish by each neural network, i.e., by 2-, 3-, 4- and 5-layer ANNs (see Table A6 and Table A9 in Appendix A), which is in accordance with our expectations.

An especially interesting result was that for Nothobranchius furzeri. The recognized similarities of Nothobranchius furzeri to Goldfish by the 2-, 3-, 4- and 5-layer ANNs were equal to 0.83611, 0.57080, 0.60185 and 0.88947, respectively, and were greater than the similarities recognized to Frog, Crow, Horse and Worm (i.e., the other outliers) (see Table A6 and Table A9 in Appendix A). The number of homologous comparisons (i.e., the number of “R” positions) between Nothobranchius furzeri and Goldfish is equal to 91 (see Table A6 and Table A9 in Appendix A). The numbers of homologous comparisons between Nothobranchius furzeri and Frog, Crow and Horse were equal to 92, 98 and 95, respectively (see Table A6 and Table A9 in Appendix A). Therefore, considering only the number of homologous comparisons, it is impossible to detect that the similarity of Nothobranchius furzeri to Goldfish is the greatest. It means that, during the detection of evolutionary similarities, not only the number of homologous comparisons and the lengths of compared sequences are important, but also the distribution of similarities between the compared sequences.

In addition, this validation also showed that the greatest similarities to the closest evolutionary organism was recognized by the 5-layer ANN, i.e., the recognized similarities of each fish from the set of fish used for validation were the greatest for the 5-layer ANN (see Table A6 in Appendix A). For this reason, the 5-layer ANN was used to demonstrate the method of penetrating deep into evolution and reconstructing the line of organism development.

3.4. Method of Penetrating Deep into Evolution and Reconstruction of the Line of Organism Development

The recognized timestamps for a set of monkeys (see the Validation of the Neural Networks and Recognition of Timestamps for a Set of Monkeys Section) can indicate that the temporary line of development from the most advanced to the most primitive organisms begins with Homo sapiens and leads to two less developed organisms, i.e., to Horse and then Crow (see Table A1, Table A2, Table A3, Table A4 and Table A5 in Appendix A). Because the recognized timestamps for a set of monkeys contain only three outliers (Homo sapiens, Horse and Crow (see Table A1, Table A2, Table A3, Table A4 and Table A5 in Appendix A)), it is impossible to determine which is the next organism “hidden in the depths” of evolution.

The steps of the proposed method, which reveal the line of organism development from the most advanced to the most primitive organisms, are as follows:

(a): In the first step of the proposed method, Homo sapiens, as the most developed organism, is recognized by the 5-layer neural network. The results of the recognition of Homo sapiens are presented in Table 3.
(b): In the next step, the organism with the greatest recognized similarity (i.e., Homo sapiens) is removed from the set of organisms and the 5-layer neural network is taught using the remaining (fourteen) organisms. After re-teaching, the removed organism (i.e., Homo sapiens) is recognized once again. The new results of the recognition of Homo sapiens are presented in Table 4.

The recognized timestamp (a set of outliers) indicates that the removal of Homo sapiens caused the level of considerations to be shifted to less developed organisms, i.e., from the Homo sapiens genome attractor to the Horse genome attractor (with Crow and Frog genome attractors being visible in the background). It should be noted that the largest outlier (Horse) was recognized reliable, as evidenced by the relatively small value of the standard deviation (0.05320). The recognized timestamp indicates a temporary line of development from the most advanced to the most primitive organisms, i.e., Homo sapiens, Horse, Crow and Frog. The procedure was repeated in the next steps as follows. The organism with the greatest recognized similarity is removed from the set of organisms; the 5-layer neural network is taught using the remaining organisms; and, after re-teaching, the removed organism is recognized. The recognized timestamps indicate the temporary line of development from the most advanced to the most primitive organisms. It can be seen that, in general, the method is based on the principle that “it is necessary to approach the bend in order to see what is around the bend”.

(c): In the next step, the organism with the greatest recognized similarity (i.e., Horse) is removed from the set of organisms and the 5-layer neural network is taught using the remaining (thirteen) organisms. After re-teaching, the removed organism (i.e., Horse) is recognized. The results of the recognition of Horse are presented in Table 5.
(d): In the next step, the organism with the greatest recognized similarity (i.e., Crow) is removed from the set of organisms and the 5-layer neural network is taught using the remaining (twelve) organisms. After re-teaching, the removed organism (i.e., Crow) is recognized. The results of the recognition of Crow are presented in Table 6. In this case, the standard deviation associated with the recognized similarity Crow to Frog (i.e., the biggest outlier) is relatively large (recognized similarity equal to 0.57232 and standard deviation equal to 0.30661), which means that this similarity was not very clearly visible by the neural network.
(e): In the next step, the organism with the greatest recognized similarity (i.e., Frog) is removed from the set of organisms and the 5-layer neural network is taught using the remaining (eleven) organisms. After re-teaching, the removed organism (i.e., Frog) is recognized. The results of the recognition of Frog are presented in Table 7.
(f): The organism with the greatest recognized similarity (i.e., Goldfish) is removed from the set of organisms and the 5-layer neural network is taught using the remaining (ten) organisms. After re-teaching, the removed organism (i.e., Goldfish) is recognized. The results of the recognition of Goldfish are presented in Table 8.
(g): In the next step, the organism with the greatest recognized similarity (i.e., Worm) is removed from the set of organisms and the 5-layer neural network is taught using the remaining (nine) organisms. After re-teaching, the removed organism (i.e., Worm) is recognized. The results of the recognition of Worm are presented in Table 9.

It is clear that, after this step, the line of organism development leads to Bacteria, which may indicate that the remaining organisms are located in other side branches (i.e., “side tracks” [23]) of the evolutionary tree having a small similarity to Worm. The standard deviation associated with the recognized similarity to Bacteria (and also associated with the recognition of the other outliers) is relatively large (Table 9), which may also indicate that the ANN is unable to unambiguously recognize this similarity. These conclusions were checked by phylogenetic tree generation (see the Validation of the Reconstructed Line of Organism Development by Phylogenetic Tree Generation Section) and also using the semihomology approach (Table 10).

From Table 10, it is evident that evolutionary distances between Worm and the organisms used to teach ANN in the last step of the method are very large, i.e., there are a very small number of “R” positions (i.e., positions with homologous comparisons) and large number of “-” positions (positions with two or three point mutations between codons of the compared amino acids), which confirms that the Worm recognition was correctly taken as the last step of the method.

Finally, the proposed method allowed for reconstructing the line of organism development from the most advanced to the most primitive organisms, as follows: Homo sapiens, horses, birds, amphibians, fish, worms and bacteria. This line of development is in accordance with the latest scientific findings (see, for example, [44]).

3.5. Structure of the Neural Networks in the Subsequent Steps of the Method

The number of neurons in the layers of the implemented 5-layer neural networks (calculated according to the geometric pyramid rule) for the 15, 14, 13, 12, 11, 10 and 9 organisms used for teaching the ANNs in the subsequent steps of the method is presented in Table 11. The number of neurons in the layers, for example, for 12 organisms, is equal to: nbrInp = 525, nbrHid1 = 204, nbrHid2 = 79, nbrHid3 = 31, nbrOut = 12.

The average number of teaching cycles and standard deviations of the number of teaching cycles for processes of teaching the 5-layer neural networks are presented in Figure 2.

Figure 2 shows that the removal of consecutive organisms from a set of organisms used to teach the ANNs does not have a significant effect on the number of teaching cycles and standard deviations, although after removing the 6th organism, a slightly greater increase in these two values is visible, which can also indicate (and confirm) that this step (i.e., Worm recognition) should be the last step of the method.

3.6. Validation of the Method of Penetrating Deep into Evolution

In this section, is presented how the result of the method of penetrating deep into evolution can be validated. The reconstructed line of organism development was validated by 2-, 3-, 4- and 5-layer ANNs taught using 15 organisms and by phylogenetic tree generation. A set of birds (see Table A7 and Table A10 in Appendix A) and a set of amphibians and reptiles (see Table A8 and Table A11 in Appendix A) were used for the validation.

3.6.1. Validation of the Reconstructed Line of Organism Development by Determining Timestamps for a Set of Birds

At the beginning, the reconstructed line of organism development was validated using a set of birds (see Table A7 and Table A10 in Appendix A). In Figure 3, the average similarities (calculated as the arithmetic means of similarities recognized by the 2-, 3-, 4- and 5-layer ANNs) for the three recognized largest outliers (i.e., Horse, Crow and Frog) are presented. The other outliers presented in Table A7 and Table A10 (see Appendix A) (i.e., Goldfish, Worm and Fly) are much smaller compared to Horse, Crow and Frog and, therefore, are not shown in Figure 3 for readability. As it can be seen, the similarities of each bird (from the set of birds used for validation) are the most obvious to Crow (see Table A7 in Appendix A). The background similarities to Horse and Frog are also visible (see Table A7 in Appendix A), which confirms the correctness of this part of the reconstructed line of organism development, i.e., birds are evolutionarily between Horse and Frog.

3.6.2. Validation of the Reconstructed Line of Organism Development by Determining Timestamps for a Set of Amphibians and Reptiles

Next, the recognized line of organism evolution was validated using a set of amphibians (Bufo gargarizans, Bufo bufo and Lithobates catesbeianus) and reptiles (Pelodiscus sinensis and Mauremys reevesii) (see Table A8 and Table A11 in Appendix A). In Figure 4, the average similarities (calculated as the arithmetic means of similarities recognized by the 2-, 3-, 4- and 5-layer ANNs) for the four recognized largest outliers (i.e., Horse, Crow, Frog and Goldfish) are presented. The four largest outliers (out of the five outliers presented in Table A8 and Table A11 in Appendix A) are shown in Figure 4 to ensure readability.

As it can be seen, the similarities of each amphibian (from the set of amphibians used for validation) are the greatest to Frog (see Table A8 in Appendix A). The background similarities to Horse, Crow and Goldfish are also visible (see Table A8 in Appendix A), which confirms the correctness of this part of the reconstructed line of organism development, i.e., amphibians are evolutionarily between birds and fish (this conclusion is also supported by other works [44]). Moreover, it can be observed that reptiles (Pelodiscus sinensis and Mauremys reevesii) are evolutionarily between birds (Crow) and amphibians (Frog) (closer to Crow), which is in accordance with other authors [44].

3.6.3. Validation of the Reconstructed Line of Organism Development by Phylogenetic Tree Generation

The reconstructed line of organism development was validated by phylogenetic tree generation. The trees were generated using the MEGA X program [29]. The phylogenetic trees were generated for 15 organisms used for ANN teaching using the Maximum Likelihood (ML) method with Poisson correction [45,46]. This method was used because it is considered one of the most accurate and widely used methods for reconstructing phylogenetic trees [46,47]. The ML method approaches a phylogenetic tree from a probabilistic point of view and looks for a tree that maximizes the probability of observing a given set of sequences on the tree leaves [46]. In this work, the bootstrap method with 1000 replications (Figure 5) and 10,000 replications (i.e., maximum number of replications in the MEGA X program) (Figure 6) was used to determine the reliability of the generated tree nodes. Trees were generated for different values of replications to check the effect of the number of replications on the reliability of the tree nodes.

Figure 5 and Figure 6 show that the increase in the number of replications has not significantly affected the reliability of the tree nodes. Although the reliability of the tree nodes (for both 1000 and 10,000 replications) can hardly be considered satisfactory (see the Introduction for a possible cause), it is apparent that the most and the least developed organisms are Homo sapiens and Bacteria, respectively. The closest organism to Homo sapiens is Horse (from the organisms used for ANN teaching), which confirms the results obtained using the method of penetrating deep into evolution. Moreover, the closest organisms to Worm are Octopus and Bacteria, which is in accordance with the last step of the method of penetrating deep into evolution. Additionally, evolutionary relationships between Worm and Octopus can be confirmed by the works of other authors [44,48]. In accordance with the generated tree, Goldfish is the closest to Crow, but the reliability of common node between these two organisms has a reliability of only 26% (for 1000 replications) and 29% (for 10,000 replications), so it is difficult to trust this result. The method of penetrating deep into evolution allowed the determination of a result that fits better with the modern theory of evolution, i.e., Frog (as a representative of amphibians) is the closest to Crow (as a representative of birds) considering the organisms used to teach the ANNs, which is in accordance with the results of other authors (for example, [23,44,49]).

The presented considerations show that the proposed method makes it possible to narrow down the area of considerations to the timestamp (i.e., set of recognized outliers) that was recognized in the each step of the algorithm (see the algorithm presented in the General Algorithm of Penetrating Deep Into Evolution Method and Reconstructing the Line of Organism Development Section). Travelling between genome attractors and narrowing down the area of considerations to the recognized timestamp (that contains only a small number of outliers comparing to the entire number of organisms) makes possible to obtain more reliable results compared to the reliability of the results obtained by generating phylogenetic trees. In the algorithms of phylogenetic tree generation (including the algorithms of the Neighbor Joining, Maximum Parsimony and Maximum Likelihood methods), the generated trees are evaluated on the basis of the evaluation of the trees as the whole, i.e., the generated trees are evaluated taking into account the entire sets of the nodes and branches [50]. This means that, although the generated tree will meet the assumed criteria as a whole, locally, it may contain internal nodes of lower reliabilities (and this is exactly what was obtained when generating the tree using the Maximum Likelihood method in this work; see Figure 5 and Figure 6). In accordance with the information presented in the Introduction, an important factor related to lower reliability of the nodes is the impossibility of evaluating all possible trees for more organisms. The number of possible rooted trees (calculated for n organisms as (2n − 3)!/(2ⁿ⁻²(s − 2)!)) for n = 15 (i.e., for the number of organisms considered in this article)) is equal to 213,458,046,676,875 [51]. The number of trees grows very quickly with the increase in the number of organisms and, for n = 50, the number of possible rooted trees is greater than the number of atoms in the universe [26]. This may indicate and also justify the need to narrow down the area of considerations in order to obtain a greater reliability when determining the course of evolution and reconstructing the line of organism development.

3.7. General Algorithm of the Method of Penetrating Deep into Evolution and Reconstructing the Line of Organism Development

In the light of the presented idea, the general algorithm of the method of penetrating deep into evolution and reconstructing the line of organism development is presented in the following steps:

Step 1.: Set the organisms in random order.
Step 2.: Teach the artificial neural network (ANN) using a full set of organisms.
Step 3.: Recognize the evolutionary timestamp (i.e., recognize a set of outliers) by recognizing the similarity to Homo sapiens using the ANN.
Step 4.: Add the organism represented by the biggest outlier to the list of organisms.
Step 5.: If the maximal number of iterations is archived or a sufficiently primitive organism to finish the algorithm is added to the list, stop the algorithm.
Step 6.: Remove the organism represented by the biggest outlier from the set of organisms.
Step 7.: Teach the ANN using the reduced set of organisms.
Step 8.: Recognize the evolutionary timestamp by recognizing the similarity to the removed organism using the ANN.
Step 9.: Go back to Step 4.

As a result of executing this algorithm for a set of organisms, the created list will contain organisms written in the order from the most developed to the most primitive organisms.

4. Conclusions

This work showed that one of the features of evolutionary timestamps (i.e., a set of outliers recognized by an artificial neural network) is their transparency, which allows not only to determine the closest evolutionary organism but also to determine more distant organisms. In this article, it is presented that this feature allows for penetration deep into evolution. As a result, a general algorithm of penetrating deep into evolution was established, which allows to reconstruct the line of organism development from the most advanced to the most primitive organisms. It was also shown the way in which this line can be validated using, among others, recognized timestamps. Five-layer ANNs were used to demonstrate the application of the method of penetrating deep into evolution, for which the recognized similarities to the closest evolutionary organism were the greatest, as was demonstrated during validation. The work also showed that not only the number of homologous comparisons and length of identity fragments are important in determining the evolutionary relationships between organisms, but also the distribution of similarities between sequences (i.e., distribution of amino acids in the compared sequences). It is possible that lines reconstructed using the proposed method can be considered as main lines of development (i.e., lines that omit “side tracks” of development); however, this conclusion needs to be carefully corroborated in following works. Finding a way to reconstruct the line of organism development can be considered a great scientific challenge, so further continuation of this work is planned to confirm the effectiveness of the proposed method.

Funding

This research received no external funding.

Data Availability Statement

The cytochrome c amino-acid sequences selected for this study are available at http://staff.uz.zgora.pl/akaspers/Pro/sequences.txt (accessed on 20 March 2023). The EvolutionXXI and dotPicker programs were written by the author and are freely available at http://staff.uz.zgora.pl/akaspers/Pro/pgms.zip (accessed on 20 March 2023). EvolutionXXI (written in Java using the Joone framework) contains an implemented neural network and was used in this work to recognize timestamps. The dotPicker program (written in C#) contains an implemented multidimensional semihomologous Dot-Matrix method and was used in this work to determine similarities between sequences at the codon level.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Table A1. Outliers recognized by the 2-layer ANN during the recognition of selected monkeys.

1. Pan troglodytes				2. Symphalangus syndactylus
Outliers recognized by ANN	Homo sapiens	Horse	Crow	Outliers recognized by ANN	Homo sapiens	Horse	Crow
Recognized similarity	0.99790	0.00161	0.00129	Recognized similarity	0.99583	0.00315	0.00170
Standard deviation	0.00001	0.00003	0.00001	Standard deviation	0.00016	0.00010	0.00003
3. Papio anubis				4. Theropithecus gelada
Recognized similarity	0.98306	0.00466	0.00176	Recognized similarity	0.94251	0.01858	0.00292
Standard deviation	0.00146	0.00042	0.00016	Standard deviation	0.00908	0.00209	0.00034
5. Piliocolobus tephrosceles				6. Rhinopithecus bieti
Recognized similarity	0.96990	0.00267	0.00084	Recognized similarity	0.98042	0.00189	0.00404
Standard deviation	0.00343	0.00023	0.00008	Standard deviation	0.00146	0.00016	0.00031
7. Macaca nemestrina				8. Cercocebus atys
Recognized similarity	0.98128	0.00290	0.00258	Recognized similarity	0.98352	0.00485	0.00161
Standard deviation	0.00165	0.00024	0.00011	Standard deviation	0.00171	0.00044	0.00012
9. Trachypithecus cristatus				10. Chlorocebus aethiops
Recognized similarity	0.98600	0.00503	0.00178	Recognized similarity	0.98717	0.00518	0.00183
Standard deviation	0.00068	0.00026	0.00011	Standard deviation	0.00063	0.00020	0.00010
11. Papio hamadryas				12. Brachyteles arachnoides
Recognized similarity	0.98176	0.00291	0.00065	Recognized similarity	0.76313	0.01674	0.00455
Standard deviation	0.00176	0.00020	0.00005	Standard deviation	0.01445	0.00189	0.00050
13. Aotus azarai				14. Saimiri boliviensis boliviensis
Recognized similarity	0.74336	0.01611	0.00435	Recognized similarity	0.76302	0.01065	0.00729
Standard deviation	0.01439	0.00190	0.00045	Standard deviation	0.01720	0.00099	0.00095
15. Alouatta belzebul				16. Alouatta seniculus
Recognized similarity	0.75863	0.01676	0.00450	Recognized similarity	0.75075	0.01660	0.00442
Standard deviation	0.01438	0.00170	0.00056	Standard deviation	0.01348	0.00150	0.00050
17. Sapajus apella				18. Leontopithecus chrysomelas
Recognized similarity	0.73079	0.01544	0.00428	Recognized similarity	0.72457	0.00894	0.00587
Standard deviation	0.02057	0.00185	0.00056	Standard deviation	0.02405	0.00110	0.00093

Table A2. Outliers recognized by the 3-layer ANN during the recognition of selected monkeys.

1. Pan troglodytes				2. Symphalangus syndactylus
Outliers recognized by ANN	Homo sapiens	Horse	Crow	Outliers recognized by ANN	Homo sapiens	Horse	Crow
Recognized similarity	0.99806	0.00125	0.00073	Recognized similarity	0.99660	0.00254	0.00098
Standard deviation	0.00005	0.00009	0.00013	Standard deviation	0.00022	0.00029	0.00018
3. Papio anubis				4. Theropithecus gelada
Recognized similarity	0.98987	0.00359	0.00091	Recognized similarity	0.96500	0.01674	0.00202
Standard deviation	0.00126	0.00043	0.00020	Standard deviation	0.00659	0.00374	0.00048
5. Piliocolobus tephrosceles				6. Rhinopithecus bieti
Recognized similarity	0.98411	0.00200	0.00044	Recognized similarity	0.98744	0.00153	0.00203
Standard deviation	0.00290	0.00042	0.00016	Standard deviation	0.00189	0.00020	0.00045
7. Macaca nemestrina				8. Cercocebus atys
Recognized similarity	0.98971	0.00211	0.00135	Recognized similarity	0.99040	0.00327	0.00108
Standard deviation	0.00156	0.00039	0.00017	Standard deviation	0.00110	0.00039	0.00013
9. Trachypithecus cristatus				10. Chlorocebus aethiops
Recognized similarity	0.99141	0.00374	0.00102	Recognized similarity	0.99157	0.00371	0.00105
Standard deviation	0.00051	0.00034	0.00022	Standard deviation	0.00068	0.00020	0.00016
11. Papio hamadryas				12. Brachyteles arachnoides
Recognized similarity	0.98788	0.00221	0.00042	Recognized similarity	0.87210	0.01230	0.00248
Standard deviation	0.00142	0.00025	0.00006	Standard deviation	0.01416	0.00232	0.00057
13. Aotus azarai				14. Saimiri boliviensis boliviensis
Recognized similarity	0.86602	0.01208	0.00250	Recognized similarity	0.85886	0.00863	0.00400
Standard deviation	0.01544	0.00250	0.00062	Standard deviation	0.02089	0.00188	0.00102
15. Alouatta belzebul				16. Alouatta seniculus
Recognized similarity	0.86950	0.01212	0.00248	Recognized similarity	0.87093	0.01212	0.00243
Standard deviation	0.01479	0.00244	0.00061	Standard deviation	0.01665	0.00182	0.00054
17. Sapajus apella				18. Leontopithecus chrysomelas
Recognized similarity	0.86163	0.01149	0.00249	Recognized similarity	0.83629	0.00782	0.00347
Standard deviation	0.01442	0.00213	0.00068	Standard deviation	0.02395	0.00168	0.00112

Table A3. Outliers recognized by the 4-layer ANN during the recognition of selected monkeys.

1. Pan troglodytes				2. Symphalangus syndactylus
Outliers recognized by ANN	Homo sapiens	Horse	Crow	Outliers recognized by ANN	Homo sapiens	Horse	Crow
Recognized similarity	0.99807	0.00138	0.00039	Recognized similarity	0.99735	0.00211	0.00042
Standard deviation	0.00009	0.00014	0.00039	Standard deviation	0.00016	0.00020	0.00044
3. Papio anubis				4. Theropithecus gelada
Recognized similarity	0.99576	0.00273	0.00044	Recognized similarity	0.98734	0.00896	0.00061
Standard deviation	0.00034	0.00055	0.00044	Standard deviation	0.00374	0.00294	0.00061
5. Piliocolobus tephrosceles				6. Rhinopithecus bieti
Recognized similarity	0.99490	0.00177	0.00034	Recognized similarity	0.99466	0.00224	0.00061
Standard deviation	0.00142	0.00071	0.00033	Standard deviation	0.00045	0.00058	0.00063
7. Macaca nemestrina				8. Cercocebus atys
Recognized similarity	0.99517	0.00275	0.00063	Recognized similarity	0.99596	0.00246	0.00056
Standard deviation	0.00041	0.00077	0.00066	Standard deviation	0.00044	0.00046	0.00053
9. Trachypithecus cristatus				10. Chlorocebus aethiops
Recognized similarity	0.99599	0.00272	0.00049	Recognized similarity	0.99601	0.00290	0.00050
Standard deviation	0.00050	0.00047	0.00046	Standard deviation	0.00039	0.00042	0.00047
11. Papio hamadryas				12. Brachyteles arachnoides
Recognized similarity	0.99498	0.00321	0.00034	Recognized similarity	0.95753	0.01878	0.00136
Standard deviation	0.00093	0.00060	0.00032	Standard deviation	0.01426	0.00658	0.00130
13. Aotus azarai				14. Saimiri boliviensis boliviensis
Recognized similarity	0.95690	0.01727	0.00131	Recognized similarity	0.94905	0.01781	0.00180
Standard deviation	0.01514	0.00584	0.00118	Standard deviation	0.02173	0.00821	0.00175
15. Alouatta belzebul				16. Alouatta seniculus
Recognized similarity	0.95752	0.01768	0.00137	Recognized similarity	0.95583	0.01846	0.00134
Standard deviation	0.01594	0.00664	0.00133	Standard deviation	0.01561	0.00690	0.00135
17. Sapajus apella				18. Leontopithecus chrysomelas
Recognized similarity	0.95283	0.01967	0.00135	Recognized similarity	0.94452	0.01706	0.00139
Standard deviation	0.01709	0.00787	0.00125	Standard deviation	0.03348	0.01078	0.00128

Table A4. Outliers recognized by the 5-layer ANN during the recognition of selected monkeys.

1. Pan troglodytes				2. Symphalangus syndactylus
Outliers recognized by ANN	Homo sapiens	Horse	Crow	Outliers recognized by ANN	Homo sapiens	Horse	Crow
Recognized similarity	0.99808	0.00125	0.00064	Recognized similarity	0.99782	0.00175	0.00058
Standard deviation	0.00010	0.00019	0.00055	Standard deviation	0.00016	0.00032	0.00048
3. Papio anubis				4. Theropithecus gelada
Recognized similarity	0.99767	0.00175	0.00074	Recognized similarity	0.99435	0.00597	0.00060
Standard deviation	0.00020	0.00029	0.00069	Standard deviation	0.00236	0.00318	0.00052
5. Piliocolobus tephrosceles				6. Rhinopithecus bieti
Recognized similarity	0.99738	0.00145	0.00109	Recognized similarity	0.99706	0.00209	0.00094
Standard deviation	0.00065	0.00061	0.00121	Standard deviation	0.00085	0.00087	0.00087
7. Macaca nemestrina				8. Cercocebus atys
Recognized similarity	0.99723	0.00207	0.00111	Recognized similarity	0.99767	0.00167	0.00079
Standard deviation	0.00058	0.00088	0.00124	Standard deviation	0.00018	0.00029	0.00074
9. Trachypithecus cristatus				10. Chlorocebus aethiops
Recognized similarity	0.99759	0.00189	0.00076	Recognized similarity	0.99765	0.00185	0.00076
Standard deviation	0.00026	0.00039	0.00069	Standard deviation	0.00022	0.00034	0.00069
11. Papio hamadryas				12. Brachyteles arachnoides
Recognized similarity	0.99744	0.00190	0.00091	Recognized similarity	0.98498	0.01427	0.00136
Standard deviation	0.00032	0.00073	0.00093	Standard deviation	0.00954	0.01031	0.00136
13. Aotus azarai				14. Saimiri boliviensis boliviensis
Recognized similarity	0.98617	0.01302	0.00129	Recognized similarity	0.97092	0.02990	0.00175
Standard deviation	0.00841	0.00887	0.00124	Standard deviation	0.02618	0.02937	0.00175
15. Alouatta belzebul				16. Alouatta seniculus
Recognized similarity	0.98658	0.01281	0.00135	Recognized similarity	0.98501	0.01331	0.00139
Standard deviation	0.00784	0.00957	0.00136	Standard deviation	0.01028	0.00963	0.00139
17. Sapajus apella				18. Leontopithecus chrysomelas
Recognized similarity	0.98382	0.01448	0.00137	Recognized similarity	0.95909	0.04033	0.00173
Standard deviation	0.01060	0.00946	0.00137	Standard deviation	0.04169	0.04078	0.00170

Table A5. Outliers recognized by the 6-layer ANN during the recognition of selected monkeys.

1. Pan troglodytes				2. Symphalangus syndactylus
Outliers recognized by ANN	Homo sapiens	Horse	Crow	Outliers recognized by ANN	Homo sapiens	Horse	Crow
Recognized similarity	0.99787	0.00137	0.00030	Recognized similarity	0.99777	0.0017	0.00035
Standard deviation	0.00019	0.00041	0.00044	Standard deviation	0.00027	0.00061	0.00055
3. Papio anubis				4. Theropithecus gelada
Recognized similarity	0.99771	0.00165	0.0003	Recognized similarity	0.99413	0.00592	0.00074
Standard deviation	0.0003	0.00052	0.00045	Standard deviation	0.00545	0.00758	0.0014
5. Piliocolobus tephrosceles				6. Rhinopithecus bieti
Recognized similarity	0.99619	0.00197	0.00018	Recognized similarity	0.99688	0.00243	0.00041
Standard deviation	0.0022	0.00169	0.00023	Standard deviation	0.00166	0.00185	0.00069
7. Macaca nemestrina				8. Cercocebus atys
Recognized similarity	0.99693	0.00247	0.00033	Recognized similarity	0.99761	0.00181	0.00027
Standard deviation	0.00116	0.00141	0.0005	Standard deviation	0.00037	0.0007	0.00038
9. Trachypithecus cristatus				10. Chlorocebus aethiops
Recognized similarity	0.99755	0.00193	0.00029	Recognized similarity	0.99759	0.00193	0.00029
Standard deviation	0.00054	0.00081	0.00042	Standard deviation	0.00046	0.00077	0.00042
11. Papio hamadryas				12. Brachyteles arachnoides
Recognized similarity	0.99719	0.00224	0.00028	Recognized similarity	0.87177	0.1496	0.00053
Standard deviation	0.00105	0.00131	0.00041	Standard deviation	0.23444	0.28675	0.0008
13. Aotus azarai				14. Saimiri boliviensis boliviensis
Recognized similarity	0.91012	0.11182	0.00048	Recognized similarity	0.82709	0.18639	0.00079
Standard deviation	0.16654	0.23995	0.00074	Standard deviation	0.30316	0.33623	0.00127
15. Alouatta belzebul				16. Alouatta seniculus
Recognized similarity	0.89811	0.12439	0.0005	Recognized similarity	0.89114	0.12502	0.00051
Standard deviation	0.1881	0.25193	0.00075	Standard deviation	0.21398	0.27196	0.00078
17. Sapajus apella				18. Leontopithecus chrysomelas
Recognized similarity	0.82968	0.16525	0.00055	Recognized similarity	0.77037	0.22613	0.00067
Standard deviation	0.31204	0.31296	0.0008	Standard deviation	0.33252	0.33591	0.00102

Table A6. Outliers recognized by the ANNs during the recognition of selected fish.

		Checked Organisms
Outliers Recognized by ANNs	ANN Type and Semihomology	Epinephelus moara	Epinephelus fuscoguttatus	Brienomyrus brachyistius	Hypomesus transpacificus	Nothobranchius furzeri
Horse	2-layer	0.00074	0.00034	0.00042	0.00021	0.00019
	3-layer	0.00100	0.00037	0.00028	0.00019	0.00013
	4-layer	0.04404	0.01481	0.00666	0.00707	0.00227
	5-layer	0.01765	0.00719	0.01303	0.00764	0.01524
	[R/#/$/-]	[88/4/3/10]	[88/4/4/9]	[91/4/3/7]	[85/7/5/8]	[95/3/4/3]
Crow	2-layer	0.00071	0.00096	0.00139	0.00241	0.00038
	3-layer	0.00086	0.00094	0.00112	0.00253	0.00029
	4-layer	0.01466	0.01306	0.01028	0.02347	0.00505
	5-layer	0.06987	0.03182	0.04955	0.10126	0.02373
	[R/#/$/-]	[89/6/4/6]	[90/5/4/6]	[93/4/4/4]	[87/5/5/8]	[98/4/3/0]
Frog	2-layer	0.03535	0.03959	0.01089	0.01508	0.04843
	3-layer	0.00270	0.00245	0.00057	0.00084	0.00211
	4-layer	0.00054	0.00088	0.00039	0.00022	0.00201
	5-layer	0.00053	0.00118	0.00089	0.00054	0.00458
	[R/#/$/-]	[88/3/7/7]	[88/3/6/8]	[91/2/5/7]	[85/4/8/8]	[92/2/3/8]
Goldfish	2-layer	0.66574	0.69711	0.76756	0.64289	0.83611
	3-layer	0.33739	0.41824	0.59990	0.37540	0.57080
	4-layer	0.26669	0.37647	0.67701	0.36406	0.60185
	5-layer	0.87796	0.92357	0.94176	0.89873	0.88947
	[R/#/$/-]	[96/4/2/2]	[97/3/2/2]	[97/3/4/0]	[91/4/4/5]	[91/6/3/5]
Worm	2-layer	0.00016	0.00016	0.00066	0.00026	0.00059
	3-layer	0.00026	0.00033	0.00133	0.00044	0.00146
	4-layer	0.00009	0.00014	0.00052	0.00021	0.00107
	5-layer	0.00128	0.00141	0.00151	0.00151	0.00260
	[R/#/$/-]	[72/5/15/13]	[72/5/14/14]	[75/3/14/13]	[70/7/15/13]	[78/5/11/11]

Table A7. Outliers recognized by the ANNs during the recognition of selected birds.

		Checked Organisms
Outliers Recognized by ANNs	ANN Type and Semihomology	Tympanuchus pallidicinctus	Apus apus	Myiozetetes cayanensis	Accipiter gentilis	Lagopus muta
Horse	2-layer	0.03577	0.01200	0.09890	0.00595	0.01448
	3-layer	0.03513	0.00945	0.09047	0.00522	0.01283
	4-layer	0.02018	0.00493	0.04083	0.00374	0.00860
	5-layer	0.07852	0.00945	0.15818	0.01248	0.03070
	[R/#/$/-]	[95/3/5/2]	[95/2/4/4]	[97/2/3/3]	[93/4/5/3]	[95/3/4/3]
Crow	2-layer	0.39697	0.94328	0.57873	0.83560	0.63272
	3-layer	0.53419	0.96596	0.67207	0.91917	0.75749
	4-layer	0.61147	0.97766	0.64598	0.95270	0.84513
	5-layer	0.82311	0.98738	0.75629	0.95001	0.92133
	[R/#/$/-]	[97/4/4/0]	[101/2/1/1]	[99/2/2/2]	[98/5/2/0]	[98/4/3/0]
Frog	2-layer	0.03494	0.00542	0.00850	0.01513	0.03036
	3-layer	0.06480	0.00925	0.02044	0.02315	0.05228
	4-layer	0.15925	0.01477	0.06331	0.04420	0.10086
	5-layer	0.13622	0.01046	0.07760	0.06945	0.12024
	[R/#/$/-]	[92/2/2/9]	[91/0/5/9]	[91/1/4/9]	[91/2/4/8]	[92/2/3/8]
Goldfish	2-layer	0.00019	0.00028	0.00008	0.00048	0.00030
	3-layer	0.00003	0.00012	0.00002	0.00010	0.00006
	4-layer	0.00002	0.00008	0.00001	0.00005	0.00003
	5-layer	0.00454	0.00054	0.00035	0.00614	0.00630
	[R/#/$/-]	[90/6/4/5]	[91/4/4/6]	[90/4/4/7]	[90/7/3/5]	[91/6/3/5]
Worm	2-layer	0.00101	0.00084	0.00149	0.00083	0.00095
	3-layer	0.00057	0.00058	0.00072	0.00058	0.00061
	4-layer	0.00012	0.00022	0.00011	0.00026	0.00019
	5-layer	0.00142	0.00066	0.00372	0.00086	0.00095
	[R/#/$/-]	[78/5/10/12]	[78/4/12/11]	[78/5/10/12]	[78/4/12/11]	[78/5/11/11]
Fly	2-layer	0.00001	0.00001	0.00001	0.00002	0.00001
	3-layer	0.00094	0.00069	0.00058	0.00203	0.00101
	4-layer	0.00020	0.00015	0.00019	0.00027	0.00017
	5-layer	0.00037	0.00029	0.00058	0.00029	0.00028
	[R/#/$/-]	[10/15/22/58]	[10/14/23/58]	[10/14/24/57]	[10/15/22/58]	[10/15/22/58]

Table A8. Outliers recognized by the ANNs during the recognition of selected amphibians and reptiles.

		Checked Organisms
Outliers Recognized by ANNs	ANN Type and Semihomology	Bufo gargarizans	Bufo bufo	Lithobates catesbeianus	Pelodiscus sinensis	Mauremys reevesii
Horse	2-layer	0.00736	0.00445	0.00577	0.00465	0.03108
	3-layer	0.00945	0.00461	0.00654	0.00432	0.02970
	4-layer	0.00696	0.00187	0.00433	0.00795	0.01185
	5-layer	0.00839	0.00404	0.00121	0.02150	0.03386
	[R/#/$/-]	[86/2/7/10]	[85/3/7/10]	[91/3/5/6]	[93/5/6/1]	[95/3/4/3]
Crow	2-layer	0.00067	0.00030	0.00101	0.18984	0.22497
	3-layer	0.00060	0.00024	0.00092	0.24976	0.32642
	4-layer	0.00858	0.00195	0.00202	0.37538	0.68809
	5-layer	0.24355	0.03220	0.01087	0.86080	0.79926
	[R/#/$/-]	[86/1/7/11]	[85/2/7/11]	[92/1/6/6]	[96/3/6/0]	[96/2/6/1]
Frog	2-layer	0.64879	0.89223	0.98701	0.11096	0.02163
	3-layer	0.61177	0.88936	0.98849	0.12391	0.02753
	4-layer	0.69284	0.92809	0.99240	0.30360	0.05976
	5-layer	0.79344	0.98635	0.99123	0.16504	0.13571
	[R/#/$/-]	[91/1/6/7]	[92/0/6/7]	[103/1/1/0]	[93/1/4/7]	[90/0/6/9]
Goldfish	2-layer	0.00549	0.00257	0.00065	0.00040	0.00091
	3-layer	0.00051	0.00016	0.00002	0.00005	0.00034
	4-layer	0.00192	0.00100	0.00015	0.00003	0.00036
	5-layer	0.03373	0.00448	0.00166	0.01761	0.06565
	[R/#/$/-]	[86/1/4/14]	[85/2/4/14]	[91/3/5/6]	[90/5/5/5]	[92/4/3/6]
Worm	2-layer	0.00081	0.00167	0.00042	0.00071	0.00160
	3-layer	0.00122	0.00258	0.00044	0.00054	0.00135
	4-layer	0.00148	0.00427	0.00037	0.00014	0.00061
	5-layer	0.00097	0.00507	0.00052	0.00119	0.00081
	[R/#/$/-]	[74/3/16/12]	[75/2/16/12]	[76/5/12/12]	[76/5/12/12]	[79/3/11/12]

Table A9. Standard deviation associated with the recognition of selected fish.

		Checked Organisms
Outliers Recognized by ANNs	ANN Type	Epinephelus moara	Epinephelus fuscoguttatus	Brienomyrus brachyistius	Hypomesus transpacificus	Nothobranchius furzeri
Horse	2-layer	0.0001	0.00005	0.00005	0.00003	0.00002
	3-layer	0.00026	0.00009	0.00004	0.00005	0.00003
	4-layer	0.03244	0.01387	0.00383	0.00701	0.00173
	5-layer	0.04133	0.01213	0.02362	0.01608	0.04559
Crow	2-layer	0.00008	0.00011	0.00012	0.00033	0.00005
	3-layer	0.00031	0.00027	0.00031	0.00088	0.00006
	4-layer	0.01493	0.01116	0.01027	0.01655	0.00912
	5-layer	0.11407	0.03447	0.0665	0.17458	0.03659
Frog	2-layer	0.00377	0.00367	0.00122	0.00172	0.00557
	3-layer	0.00275	0.00249	0.00057	0.00073	0.00207
	4-layer	0.00097	0.00213	0.00099	0.00031	0.00564
	5-layer	0.00098	0.00273	0.00164	0.00098	0.00761
Goldfish	2-layer	0.0164	0.0159	0.01639	0.03249	0.01223
	3-layer	0.06539	0.06957	0.06881	0.06166	0.06064
	4-layer	0.23176	0.20522	0.16185	0.21031	0.17467
	5-layer	0.13077	0.09576	0.06107	0.16356	0.17743
Worm	2-layer	0.00003	0.00003	0.00011	0.00006	0.0001
	3-layer	0.00008	0.00012	0.00034	0.00011	0.00033
	4-layer	0.00008	0.00011	0.00032	0.00013	0.00051
	5-layer	0.00381	0.00386	0.0039	0.00439	0.0048

Table A10. Standard deviation associated with the recognition of selected birds.

		Checked Organisms
Outliers Recognized by ANNs	ANN Type	Tympanuchus pallidicinctus	Apus apus	Myiozetetes cayanensis	Accipiter gentilis	Lagopus muta
Horse	2-layer	0.00299	0.00079	0.00841	0.00063	0.00124
	3-layer	0.00754	0.00199	0.02025	0.00168	0.00288
	4-layer	0.00829	0.00169	0.01335	0.00265	0.00422
	5-layer	0.14506	0.01827	0.26626	0.02423	0.06237
Crow	2-layer	0.01908	0.00303	0.01978	0.01282	0.01667
	3-layer	0.06004	0.00495	0.05482	0.01872	0.03813
	4-layer	0.16209	0.01011	0.18976	0.03346	0.06475
	5-layer	0.23561	0.01082	0.27244	0.04677	0.10369
Frog	2-layer	0.00263	0.00043	0.00065	0.00125	0.00260
	3-layer	0.01315	0.00230	0.00608	0.00463	0.00924
	4-layer	0.08109	0.00757	0.04027	0.03290	0.04887
	5-layer	0.28608	0.01226	0.21376	0.08621	0.23742
Goldfish	2-layer	0.00001	0.00002	0.00001	0.00005	0.00002
	3-layer	0.00001	0.00002	0.00001	0.00003	0.00002
	4-layer	0.00003	0.00006	0.00001	0.00008	0.00007
	5-layer	0.01344	0.00093	0.00071	0.01842	0.01890
Worm	2-layer	0.00017	0.00009	0.00021	0.00012	0.00014
	3-layer	0.00021	0.00025	0.00027	0.00026	0.00023
	4-layer	0.00009	0.00011	0.00013	0.00018	0.00012
	5-layer	0.00244	0.00114	0.0079	0.00191	0.00161
Fly	2-layer	0.00000	0.00000	0.00000	0.00001	0.00000
	3-layer	0.00036	0.00019	0.00019	0.00071	0.00038
	4-layer	0.00028	0.00019	0.00032	0.00028	0.00019
	5-layer	0.00116	0.00091	0.00183	0.00092	0.00088

Table A11. Standard deviation associated with the recognition of selected amphibians and reptiles.

		Checked Organisms
Outliers Recognized by ANNs	ANN Type	Bufo gargarizans	Bufo bufo	Lithobates catesbeianus	Pelodiscus sinensis	Mauremys reevesii
Horse	2-layer	0.00115	0.00053	0.00038	0.00064	0.00387
	3-layer	0.00425	0.00199	0.00038	0.00106	0.00739
	4-layer	0.00499	0.00130	0.00333	0.00536	0.0063
	5-layer	0.01695	0.00851	0.00206	0.04038	0.07599
Crow	2-layer	0.00012	0.00005	0.00005	0.01737	0.01585
	3-layer	0.00031	0.00010	0.00015	0.04986	0.04298
	4-layer	0.02121	0.00454	0.00156	0.14771	0.09737
	5-layer	0.36694	0.05180	0.01471	0.18195	0.25729
Frog	2-layer	0.03722	0.01525	0.0009	0.00766	0.00167
	3-layer	0.08595	0.03121	0.00161	0.021	0.00703
	4-layer	0.15468	0.03852	0.0019	0.1572	0.03855
	5-layer	0.18075	0.01	0.00802	0.28488	0.18174
Goldfish	2-layer	0.00059	0.00027	0.0001	0.00005	0.00009
	3-layer	0.00042	0.00014	0.00002	0.00002	0.00011
	4-layer	0.00558	0.00304	0.00047	0.00009	0.00066
	5-layer	0.0671	0.0091	0.00301	0.05482	0.2018
Worm	2-layer	0.0001	0.00019	0.00003	0.00011	0.00018
	3-layer	0.00028	0.0006	0.00004	0.00018	0.00038
	4-layer	0.00051	0.00149	0.00016	0.00013	0.00036
	5-layer	0.00095	0.00653	0.00018	0.00142	0.00108

References

Mazzocca, A. The Systemic–Evolutionary Theory of the Origin of Cancer (SETOC): A New Interpretative Model of Cancer as a Complex Biological System. Int. J. Mol. Sci. 2019, 20, 4885. [Google Scholar] [CrossRef] [PubMed]
Kováč, L. Lamarck and Darwin revisited. EMBO Rep. 2019, 20, 47922. [Google Scholar] [CrossRef] [PubMed]
Kasperski, A. Life Entrapped in a Network of Atavistic Attractors: How to Find a Rescue. Int. J. Mol. Sci. 2022, 23, 4017. [Google Scholar] [CrossRef] [PubMed]
Kasperski, A.; Kasperska, R. Study on attractors during organism evolution. Sci. Rep. 2021, 11, 9637. [Google Scholar] [CrossRef]
Kasperski, A. Genome Attractors as Places of Evolution and Oases of Life. Processes 2021, 9, 1646. [Google Scholar] [CrossRef]
Kasperski, A.; Kasperska, R. A new approach to the automatic identification of organism evolution using neural networks. Biosystems 2016, 142–143, 32–42. [Google Scholar] [CrossRef]
Kasperski, A.; Kasperska, R. Application of n-dimensional dot-matrix to analysis of plant genetic diversity. Biul. IHAR 2015, 276, 69–83. [Google Scholar] [CrossRef]
Kasperski, A.; Kasperska, R. Identifcation of protein family representatives. Curr. Bioinform. 2014, 9, 414–425. [Google Scholar] [CrossRef]
Kasperski, A.; Kasperska, R. A novel method of sequence similarity evaluation in n-dimensional sequence space. Curr. Bioinform. 2012, 7, 295–303. [Google Scholar] [CrossRef]
Kasperski, A. Modelling of Cells Bioenergetics. Acta Biotheor. 2008, 56, 233–247. [Google Scholar] [CrossRef]
Kasperski, A.; Kasperska, R. Bioenergetics of life, disease and death phenomena. Theory Biosci. 2018, 137, 155–168. [Google Scholar] [CrossRef]
Kasperski, A.; Kasperska, R. Selected disease fundamentals based on the unified cell bioenergetics. J. Investig. Biochem. 2013, 2, 93–100. [Google Scholar] [CrossRef]
Kasperski, A.; Miśkiewicz, T. Optimization of pulsed feeding in a Baker’s yeast process with dissolved oxygen concentration as a control parameter. Biochem. Eng. J. 2008, 40, 321–327. [Google Scholar] [CrossRef]
Kasperski, A.; Miśkiewicz, T. An adaptive fuzzy logic controller using the respiratory quotient as an indicator of overdosage in the baker’s yeast process. Biotechnol. Lett. 2002, 24, 17–21. [Google Scholar] [CrossRef]
Miśkiewicz, T.; Kasperski, A. A fuzzy logic controller to control nutrient dosage in a fed-batch baker’s yeast process. Biotechnol. Lett. 2000, 22, 1685–1691. [Google Scholar] [CrossRef]
Damasco, A.; Giuliani, A. A resonance based model of biological evolution. Phys. A Stat. Mech. Its Appl. 2017, 471, 750–756. [Google Scholar] [CrossRef]
Harvey, P.H.; Pagel, M.D. The Comparative Method in Evolutionary Biology; Oxford University Press: Oxford, UK, 1991; Volume 239. [Google Scholar]
Lewin, R. Complexity: Life at the Edge of Chaos; Collier Books: Springfield, OH, USA, 1993. [Google Scholar]
Giuliani, A. Review of Thomas McCabe (ed.) 2021, Descente and Logic in Biosystematics. Juneau: Perseverant Publishing. Org. J. Biol. Sci. 2021, 5, 87–88. [Google Scholar]
Keya, K.; Priya, S. A Study of Phylogenetic Relationships and Homology of Cytochrome C Using Bioinformatics. Int. Res. J. Sci. Eng. 2016, 4, 65–75. [Google Scholar]
Kumar, S. Molecular clocks: Four decades of evolution. Nat. Rev. Genet. 2005, 6, 654–662. [Google Scholar] [CrossRef]
Margoliash, E. Primary Structure and Evolution of Cytochrome C. Proc. Natl. Acad. Sci. USA 1963, 50, 672–679. [Google Scholar] [CrossRef]
Gregory, T.R. Understanding Evolutionary Trees. Evo. Edu. Outreach. 2008, 1, 121–137. [Google Scholar] [CrossRef]
Hodkinson, T.R.; Parnell, J.A.N. (Eds.) Reconstructing the Tree of Life: Taxonomy and Systematics of Species Rich Taxa; CRC Press: Boca Raton, FL, USA, 2019. [Google Scholar]
Lecointre, G.; Le Guyader, H. The Tree of Life: A Phylogenetic Classification; Belknap: Cambridge, MA, USA, 2007. [Google Scholar]
Hall, B.G. Phylogenetic Trees Made Easy; Oxford University Press Inc.: New York, NY, USA, 2017. [Google Scholar]
Felsenstein, J. Confidence limits on phylogenies: An approach using the bootstrap. Evolution 1985, 39, 783–791. [Google Scholar] [CrossRef] [PubMed]
Efron, B. The Jackknife, the Bootstrap and Other Resampling Plans; CBMS-NSF Regional Conference Series in Applied Mathematics, Monograph 38; SIAM: Philadelphia, PA, USA, 1982. [Google Scholar]
Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef] [PubMed]
Karthikeyan, A.; Priyakumar, U.D. Artificial intelligence: Machine learning for chemical sciences. J. Chem. Sci. 2022, 134, 2. [Google Scholar] [CrossRef] [PubMed]
O’connell, G.C.; Walsh, K.B.; Smothers, C.G.; Ruksakulpiwat, S.; Armentrout, B.L.; Winkelman, C.; Milling, T.J.; Warach, S.J.; Barr, T.L. Use of deep artificial neural networks to identify stroke during triage via subtle changes in circulating cell counts. BMC Neurol. 2022, 22, 206. [Google Scholar] [CrossRef]
Niazkar, H.R.; Niazkar, M. Application of artificial neural networks to predict the COVID-19 outbreak. Glob. Health Res. Policy 2020, 5, 50. [Google Scholar] [CrossRef]
Topczewska, J.; Kwater, T. Forecasting the Utility Value of Hucul Horses by Means of Artificial Intelligence. Sustainability 2020, 12, 7989. [Google Scholar] [CrossRef]
Qian, N.; Sejnowski, T.J. Predicting the secondary structure of globular proteins using neural network models. J. Mol. Biol. 1988, 202, 865–884. [Google Scholar] [CrossRef]
Graupe, D. Principles of Artificial Neural Networks, 3rd ed.; World Scientific: Singapore, 2013. [Google Scholar]
Krogh, A. What are artificial neural networks? Nat. Biotechnol. 2008, 26, 195–197. [Google Scholar] [CrossRef]
Han, S.-H.; Kim, K.W.; Kim, S.; Youn, Y.C. Artificial Neural Network: Understanding the Basic Concepts without Mathematics. Dement. Neurocognitive Disord. 2018, 17, 83–89. [Google Scholar] [CrossRef]
Rashid, T.; Huang, B.Q.; Kechadi, M.T. A new simple recurrent network with real-time recurrent learning process. In Proceedings of the 14th Irish Conference on Artificial Intelligence & Cognitive Science, AICS 2003, Dublin, Ireland, 17–19 September 2003; pp. 169–174. [Google Scholar]
Menon, A.; Mehrotra, K.; Mohan, C.K.; Ranka, S. Characterization of a Class of Sigmoid Functions with Applications to Neural Networks. Neural Netw. 1996, 9, 819–835. [Google Scholar] [CrossRef] [PubMed]
Masters, T. Practical Neural Network Recipes in C++; Academic Press: San Diego, CA, USA, 1993. [Google Scholar]
Leluk, J. A non-statistical approach to protein mutational variability. Biosystems 2000, 56, 83–93. [Google Scholar] [CrossRef]
Leluk, J. Regularities in mutational variability in selected protein families and the Markovian model of amino acid replacement. Comput. Chem. 2000, 24, 659–672. [Google Scholar] [CrossRef] [PubMed]
Leluk, J.; Konieczny, L.; Roterman, I. Search for structural similarity in proteins. Bioinformatics 2003, 19, 117–124. [Google Scholar] [CrossRef] [PubMed]
Schnell, A.K.; Amodio, P.; Boeckle, M.; Clayton, N.S. How intelligent is a cephalopod? Lessons from comparative cognition. Biol. Rev. 2021, 96, 162–178. [Google Scholar] [CrossRef]
Dhar, A.; Minin, V.N. Maximum Likelihood Phylogenetic Inference. In Encyclopedia of Evolutionary Biology; Kliman, R.M., Ed.; Academic Press: Cambridge, MA, USA, 2016; pp. 499–506. [Google Scholar]
Jin, G.; Nakhleh, L.; Snir, S.; Tuller, T. Maximum likelihood of phylogenetic networks. Bioinformatics 2006, 22, 2604–2611. [Google Scholar] [CrossRef]
Felsenstein, J. Evolutionary trees from DNA sequences: A maximum likelihood approach. J. Mol. Evol. 1981, 17, 368–376. [Google Scholar] [CrossRef]
Vinogradov, A.E.; Anatskaya, O.V. Gradistics: An underappreciated dimension in evolutionary space. Biosystems 2023, 224, 104844. [Google Scholar] [CrossRef]
Nassar, L.R.; Barber, G.P.; Benet-Pagès, A.; Casper, J.; Clawson, H.; Diekhans, M.; Fischer, C.; Gonzalez, J.N.; Hinrichs, A.S.; Lee, B.T.; et al. The UCSC Genome Browser database: 2023 update. Nucleic Acids Res. 2022, 51, D1188–D1195. [Google Scholar] [CrossRef]
Goldstein, D. Indo-European phylogenetics with R. Indo-Eur. Linguist. 2020, 8, 110–180. [Google Scholar] [CrossRef]
Felsenstein, J. The Number of Evolutionary Trees. Syst. Zool. 1978, 27, 27–33. [Google Scholar] [CrossRef]

Figure 1. The average number of teaching cycles and standard deviations for processes of teaching the neural networks.

Figure 2. The average number of teaching cycles and standard deviations for processes of teaching the 5-layer neural networks.

Figure 3. Validation of the reconstructed line of organism development using a set of birds. The recognized timestamps (i.e., sets of recognized outliers) indicate that birds are evolutionarily between Horse and Frog.

Figure 4. Validation of the reconstructed line of organism development using a set of amphibians (Bufo gargarizans, Bufo bufo and Lithobates catesbeianus) and reptiles (Pelodiscus sinensis and Mauremys reevesii). The recognized timestamps (i.e., sets of recognized outliers) indicate that amphibians are evolutionarily between Crow and Goldfish.

Figure 5. The generated bootstrap consensus tree with 1000 replications for the organisms used for ANN teaching.

Figure 6. The generated bootstrap consensus tree with 10,000 replications for the organisms used for ANN teaching.

Table 1. The set of organisms, listed in alphabetical order, used to teach the ANNs.

No.	Organism	Description of Sequence (First Line of the FASTA Format)
1	Bacteria	>PEK86573.1 cytochrome C [Bacillus thuringiensis]
2	Butterfly	>XP_032519089.1 cytochrome c [Danaus plexippus plexippus]
3	Coconut palm	>KAG1363645.1 Cytochrome c [Cocos nucifera]
4	Crow	>XP_039405800.1 cytochrome c [Corvus cornix cornix]
5	Fly	>AAA28437.1 cytochrome C [Drosophila melanogaster]
6	Frog	>XP_040208820.1 cytochrome c [Rana temporaria]
7	Goldfish	>XP_026069975.1 cytochrome c [Carassius auratus]
8	Homo sapiens	>NP_061820.1 cytochrome c [Homo sapiens]
9	Horse	>NP_001157486.1 cytochrome c [Equus caballus]
10	Octopus	>XP_029642027.1 cytochrome c [Octopus sinensis]
11	Spider	>GIY91737.1 cytochrome c [Caerostris extrusa]
12	Sunflower	>AAR30955.1 cytochrome c [Helianthus annuus]
13	Wasp	>XP_043491202.1 cytochrome c [Polistes fuscatus]
14	Worm	>AKI85307.1 cytochrome c [Cerebratulus lacteus]
15	Yeast	>BAP71068.1 cytochrome c [Kluyveromyces marxianus]

Table 2. The number of neurons in the layers of the implemented neural networks for the 15 organisms used for teaching.

ANN Types	Number of Neurons in the Layers
2-layers (i.e., 0 hidden layers)	525, 15
3-layers (i.e., 1 hidden layers)	525, 89, 15
4-layers (i.e., 2 hidden layers)	525, 160, 49, 15
5-layers (i.e., 3 hidden layers)	525, 216, 89, 36, 15
6-layers (i.e., 4 hidden layers)	525, 258, 127, 62, 31, 15
7-layers (i.e., 5 hidden layers)	525, 290, 160, 89, 49, 27, 15
8-layers (i.e., 6 hidden layers)	525, 316, 190, 114, 69, 41, 25, 15

Table 3. Timestamp (i.e., a set of recognized outliers) recognized by ANN during the recognition of Homo sapiens.

Outliers Recognized by ANN	Homo Sapiens
Recognized similarity	0.99808
Standard deviation	0.00010

Table 4. Timestamp (i.e., a set of recognized outliers) recognized by ANN during the recognition of Homo sapiens after removing Homo sapiens from the set of organisms and re-teaching the network using the remaining organisms.

Outliers Recognized by ANN	Horse	Crow	Frog
Recognized similarity	0.97202	0.02693	0.00619
Standard deviation	0.05320	0.04586	0.01771

Table 5. Timestamp recognized by ANN during the recognition of Horse after removing Homo sapiens and Horse from the set of organisms and re-teaching the network using the remaining organisms.

Outliers Recognized by ANN	Crow	Frog	Goldfish
Recognized similarity	0.90936	0.01322	0.04507
Standard deviation	0.07337	0.01483	0.04704

Table 6. Timestamp recognized by ANN during the recognition of Crow after removing Homo sapiens, Horse and Crow from the set of organisms and re-teaching the network using the remaining organisms.

Outliers Recognized by ANN	Frog	Goldfish
Recognized similarity	0.57232	0.49903
Standard deviation	0.30661	0.31334

Table 7. Timestamp recognized by ANN during the recognition of Frog after removing Homo sapiens, Horse, Crow and Frog from the set of organisms and re-teaching the network using the remaining organisms.

Outliers Recognized by ANN	Goldfish	Worm
Recognized similarity	0.93424	0.04564
Standard deviation	0.11108	0.05963

Table 8. Timestamp recognized by ANN during the recognition of Goldfish after removing Homo sapiens, Horse, Crow, Frog and Goldfish from the set of organisms and re-teaching the network using the remaining organisms.

Outliers Recognized by ANN	Worm
Recognized similarity	0.99733
Standard deviation	0.00038

Table 9. Timestamp recognized by ANN during the recognition of Worm after removing Homo sapiens, Horse, Crow, Frog, Goldfish and Worm from the set of organisms and re-teaching the network using the remaining organisms.

Outliers Recognized by ANN	Butterfly	Sunflower	Octopus	Bacteria
Recognized similarity	0.00239	0.06156	0.10406	0.52457
Standard deviation	0.00438	0.13616	0.27336	0.42770

Table 10. Results of using semohomology approach to check the evolutionary distances between Worm and the organisms used to teach ANN in the last step of the method. The organisms in the table are listed in alphabetical order.

Organisms	[R/#/$/-]
Bacteria	[10/10/27/58]
Butterfly	[11/11/25/59]
Coconut	[4/13/31/58]
Fly	[10/14/26/56]
Octopus	[10/8/28/60]
Spider	[8/11/31/55]
Sunflower	[6/10/33/57]
Wasp	[9/14/24/59]
Yeast	[9/11/31/55]

Table 11. The number of neurons in the layers of the implemented 5-layer neural networks for the 15, 14, 13, 12, 11, 10 and 9 organisms used for teaching.

Number of Organisms	Used for Teach	Number of Neurons in the Layers
15	all organisms from the initial set of organisms	525, 216, 89, 36, 15
14	without Homo sapiens	525, 212, 86, 35, 14
13	without Homo sapiens and Horse	525, 208, 83, 33, 13
12	without Homo sapiens, Horse and Crow	525, 204, 79, 31, 12
11	without Homo sapiens, Horse, Crow and Frog	525, 200, 76, 29, 11
10	without Homo sapiens, Horse, Crow, Frog and Goldfish	525, 195, 72, 27, 10
9	without Homo sapiens, Horse, Crow, Frog, Goldfish and Worm	525, 190, 69, 25, 9

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kasperski, A. Recognition of Timestamps and Reconstruction of the Line of Organism Development. Processes 2023, 11, 1316. https://doi.org/10.3390/pr11051316

AMA Style

Kasperski A. Recognition of Timestamps and Reconstruction of the Line of Organism Development. Processes. 2023; 11(5):1316. https://doi.org/10.3390/pr11051316

Chicago/Turabian Style

Kasperski, Andrzej. 2023. "Recognition of Timestamps and Reconstruction of the Line of Organism Development" Processes 11, no. 5: 1316. https://doi.org/10.3390/pr11051316

APA Style

Kasperski, A. (2023). Recognition of Timestamps and Reconstruction of the Line of Organism Development. Processes, 11(5), 1316. https://doi.org/10.3390/pr11051316

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Recognition of Timestamps and Reconstruction of the Line of Organism Development

Abstract

1. Introduction

2. Materials and Methods

2.1. Web Resources

2.2. Computer Programs

2.3. Implementation and Teaching of the Artificial Neural Networks

2.4. Semihomologous Approach

3. Results and Discussion

3.1. Teaching of the Neural Networks

3.2. Validation of the Neural Networks and Recognition of Timestamps for a Set of Monkeys

3.3. Validation of the Neural Networks and Recognition of Timestamps for a Set of Fish

3.4. Method of Penetrating Deep into Evolution and Reconstruction of the Line of Organism Development

3.5. Structure of the Neural Networks in the Subsequent Steps of the Method

3.6. Validation of the Method of Penetrating Deep into Evolution

3.6.1. Validation of the Reconstructed Line of Organism Development by Determining Timestamps for a Set of Birds

3.6.2. Validation of the Reconstructed Line of Organism Development by Determining Timestamps for a Set of Amphibians and Reptiles

3.6.3. Validation of the Reconstructed Line of Organism Development by Phylogenetic Tree Generation

3.7. General Algorithm of the Method of Penetrating Deep into Evolution and Reconstructing the Line of Organism Development

4. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI