A non-linear analogy procedure for gene repair

Interactions between sub-systems and their environment are everywhere. In this paper, logistic map is applied, which felicitously stimulates the chaotic behavior of gene, to make analogy of a health DNA sequence and a mutation DNA sequence. By successfully repairing mutational part of a DNA sequence segment with help of neighbor-compressed method, we propose that compressing the neighbor healthy part to replay the mutational part is an effective way for gene repair. Since information ecosystem also possesses chaotic characteristics, the concept that extracting information from the neighboring environment can amend or complete information of corresponding elements also apply to information ecosystems.


Introduction
Universe is a nonlinear system [1] and chaos is everywhere [2]. With no doubt, DNA sequences also possess chaotic characteristics [3][4][5]. So it makes sense that using a chaotic model to simulate some biological process or status.
The logistic map is a polynomial mapping of degree 2, which is often cited as an archetypal example of how complex, chaotic behavior can arise from very simple nonlinear dynamical equations [6]. Taking advantage of logistic map for making analogies to living matters may help with further exploration about mysteries of life. In order to study the bioinformatic feature of DNA sequence and conduct an analogy procedure of repairing mutational part in DNA sequence, logistic map was used to generate a health DNA sequence and a mutation DNA sequence.
After conducting an effective gene repair, we found that compressing the neighbor healthy part to replay the mutational part is an effective way for gene repair. As DNA sequence and information ecosystems [7,8] have chaotic characteristics, as well as behaviors inside body are similar to information ecosystems' interactions, it can extend to information ecosystems that extracting information from the neighboring environment can amend or complete information of subsystem.

Logistic map for DNA sequence
 Two different chaotic statuses were set, which respectively were health status with logistic map's coefficient R=3.9, and mutation status and with logistic map's coefficient R=3.6;  Two number sets were generated by logistic map. one number set of 500 numbers all in health status was generated with n=500, R=3.9, and =0.5. Another number set of 500 numbers was also generated with n=500 and =0.5, but four fifth of this set was in health status with R=3.9 and one fifth of it was in mutation status with R=3.7.
 The two number sets were transformed to be nucleotide sequences, health DNA sequence and mutation DNA sequence. The rule transforming number set into 'A', 'G','C','T' sequence is presented in table 1.
 Because length of the healthy part in mutation DNA sequence is 3 times longer than the length of the mutational part, the healthy part should be compressed to be 1/4 of the original length so that the new sequence we get from compression could take place the mutational part. The rule that every four letters in sequences should be taken as a whole, which independently corresponds to one single number. Following is the fourth mapping relationship between all kinds of tetrad-letter and numbers (from 1 to 256) in 1/4 compression method:  After compressing the healthy part of the mutation DNA sequence and obtaining the new sequence, we displaced the mutational part by the new sequence. So the new built-up sequence was the repaired DNA sequence.

Result
 Function 'seqdotplot' of bioinformatics toolbox in matlab was used to create dot plots of four pairs, visualizing the matches between every pair, which made one sequence the X-axis and another sequence the Y-axis. , it is evident that repaired DNA sequence is much more similar to healthy DNA sequence than mutation DNA sequence. (d) It is the dot plot of repaired DNA sequence and mutation DNA sequence, repaired DNA sequence as X-axis and mutation DNA sequence as Yaxis. Comparing group (b) and group (d), it is obvious that these two plots is extremely familiar, which indicates that repaired DNA sequence and healthy DA sequence share great similarity.
 Rescaled range analysis (R/S) was used to obtain the Hurst exponent for quantitative analysis of similarities among health DNA sequence, mutation DNA sequence, and repaired DNA sequence. That indicates health DNA sequence and repaired DNA sequence share high similarity in nonlinearity. On the contrary, mutation DNA sequence differs from them in nonlinearity.
As a result, visually or quantitatively, health DNA sequence and repaired DNA sequence proved to be much more similar to each other than mutation DNA sequence. So it shows this analogy procedure for gene repair is effective.

Discussion
 This kind of repair method is essentially to extract health information from mutational part's healthy neighbor, and to amend the mutational part by the health information. It exhibits that extracting information from the neighboring environment can amend or complete information of sub-system. This concept may also be appropriate for other systems possessing chaotic characteristics, such as information ecosystems.
 Organism can be regarded as a huge information ecosystem, where organs, tissues, cells or even DNA sequences exchange information to their neighboring environment. In the general information ecosystem, regardless of the actual boundary between an element and the environment, the former passes information, matter and energy through its internal structure. The actions and interactions of system components eventually create outflow from the system into the environment, so that the environment contain information and feature of element.
 That is to say, in the information ecosystem, environment can not only make influence on elements, further, it can reserve elements' information. Thus, if there is any element that is amiss or lost, its environment may provide help for amending or completing.