Quasi-Periodic and Fractal Polymers: Energy Structure and Carrier Transfer

We study the energy structure and the coherent transfer of an extra electron or hole along aperiodic polymers made of N monomers, with fixed boundaries, using B-DNA as our prototype system. We use a Tight-Binding wire model, where a site is a monomer (e.g., in DNA, a base pair). We consider quasi-periodic (Fibonacci, Thue–Morse, Double-Period, Rudin–Shapiro) and fractal (Cantor Set, Asymmetric Cantor Set) polymers made of the same monomer (I polymers) or made of different monomers (D polymers). For all types of such polymers, we calculate the highest occupied molecular orbital (HOMO) eigenspectrum and the lowest unoccupied molecular orbital (LUMO) eigenspectrum, the HOMO–LUMO gap and the density of states. We examine the mean over time probability to find the carrier at each monomer, the frequency content of carrier transfer (Fourier spectra, weighted mean frequency of each monomer, total weighted mean frequency of the polymer), and the pure mean transfer rate k. Our results reveal that there is a correspondence between the degree of structural complexity and the transfer properties. I polymers are more favorable for charge transfer than D polymers. We compare k(N) of quasi-periodic and fractal sequences with that of periodic sequences (including homopolymers) as well as with randomly shuffled sequences. Finally, we discuss aspects of experimental results on charge transfer rates in DNA with respect to our coherent pure mean transfer rates.


Introduction
Today, the electronic structure of biological molecules (e.g., proteins, enzymes, peptides and nucleic acids (DNA, RNA)) and their charge transfer and transport properties attract considerable interest among the physical, chemical, biological and medical communities, as well as a broad spectrum of interdisciplinary scientists and engineers [1][2][3][4][5][6]. DNA plays a fundamental role in genetics and molecular biology since its sequence of bases, adenine (A), guanine (G), cytosine (C), and thymine (T) contains the genetic code of living organisms. The base-pair stack of the DNA double helix creates a nearly one-dimensional π-pathway that favors charge transfer and transport. Although it is not rare to see or hear the terms transport and transfer indiscriminately, if we want to be more precise, transport means that the system under investigation is held between electrodes and that a voltage is applied between these electrodes, while the term transfer means that a carrier, created (e.g., by oxidation or reduction) or injected at a specific place, moves to a more favorable location, without the application of external voltage. Charge transfer through DNA plays a central role in DNA damage and repair [2,7,8], so it may be a critical issue in carcinogenesis and mutagenesis [9,10]. For example, the rapid hole migration from other bases to guanine is connected to the fact that direct strand breaks occur preferentially at guanines [9]. Furthermore, it might be an indicator of discrimination between pathogenic and non-pathogenic mutations at an early stage [11].
There are several works devoted to the study of transfer and transport in specific DNA structures using variants of the Tight-Binding method [12,15,16,50,51,63,[66][67][68][69]. Here, we employ a TB wire model, where the base pairs (or monomers) are the sites of the chain, to study the spectral and charge transfer properties of deterministic aperiodic (Thue-Morse (TM), Fibonacci (F), Double-Period (DP), Rudin-Shapiro (RS), Cantor Set (CS), Asymmetric Cantor Set (ACS)) DNA segments. Th relevant parameters are the on-site energies of base pairs and the hopping integrals between successive base pairs. We have to solve a system of N coupled equations for the time-independent problem, and a system of N coupled first order differential equations for the time-dependent problem. We study HOMO and LUMO eigenspectra, HOMO-LUMO gaps and the relevant density of states (DOS) as well as the mean over time probabilities to find the carrier at each site. We are also interested in the frequency content of carrier movement; hence, we analyze the Fourier spectra of the time-dependent probability to find the carrier at each site, the weighted mean frequency of each monomer and the total weighted mean frequency of the polymer. Finally, we study the pure mean transfer rate from a certain site to another, which describes the easiness of charge transfer; it gives us a measure of how much of the carrier is transferred and also of how fast this process is. Recently, charge transport (i.e., the movement of a carrier under external bias) in aperiodic polymers has been studied in Ref. [70]. Specifically, Ref. [70] was devoted to autocorrelation functions, Lyapunov eexponents, transmission coefficients, and current-voltage curves. Here, on the contrary, we study charge transfer (i.e., the movement of a carrier created via oxidation (hole) or reduction (electron), without external bias) along aperiodic polymers.
The rest of the paper is organized as follows: in Section 2, we provide some details on the studied deterministic aperiodic sequences and we outline our notation. In Section 3, we delineate the basic theory behind the time-independent (Section 3.1) and the time-dependent (Section 3.2) problem. In Section 4, we discuss our results for polymers made of the same monomer and polymers made of different monomers. Here, for DNA, a monomer is a base pair. Finally, in Section 5, we state our conclusions. The analysis of our results is done with the equations included in Section 3. Extensive analysis of our fitting methods can be found in Refs. [14][15][16] and in the Supplemental Material of Ref. [18].

Sequences and Notation
In our prototype system, B-DNA, we mention only the base sequence of the 5 -3 strand. For example, we denote two successive monomers by YX, meaning that the base pair X-X compl is separated and twisted by 3.4 Å and 36 • , respectively, relatively to the base pair Y-Y compl , around the B-DNA growth axis. X compl (Y compl ) is the complementary base of X (Y). The deterministic aperiodic sequences considered in this work are either quasi-periodic or fractal. Such structures are generally known as binary substitutional sequences, i.e., based on a binary alphabet, like {0, 1} and generated using appropriate substitution rules.

Fibonacci
The Fibonacci (F) sequence is named after the Italian mathematician Leonardo Pisano (Fibonacci) who introduced it to Western European mathematics in his 1202 book Liber Abaci, in a study of the population growth of rabbits [71]. However, this sequence appears many centuries before in Indian mathematics [72]. Fibonacci considers the growth of an idealized rabbit population, assuming that a single newly born pair of rabbits (N) are put in a field, and rabbits are able to mate at the age of one month so that, at the end of its second month, a mature pair (M) can produce another pair of rabbits. Rabbits never die and a mating pair always produces one new pair every month from the second month on. The puzzle that Fibonacci posed was: how many pairs will exist in one year? The collection of every month's population is: F 0 = N, F 1 = M, F 2 = MN, F 3 = MNM, F 4 = MNMMN, etc. Using e.g., the two-letter alphabet {G, A}, we can define the Fibonacci generation F g by the substitution rules A → G, G → GA, starting with F 0 = A. Hence, F 0 = A, F 1 = G, F 2 = GA, F 3 = GAG, F 4 = GAGGA, etc. If N g is the Fibonacci number of generation g, and we set N 0 = N 1 = 1, the recurrence relation N g = N g−1 + N g−2 produces the number sequence 1, 1, 2, 3, 5, 8, 13, 21, 34, ...

Thue-Morse
The Thue-Morse (TM) or Prouhet-Thue-Morse sequence was first studied by Eugène Prouhet in 1851, who applied it to number theory [73]. The systematic study was left to Axel Thue who, in 1906, applied it on his study of words combinatorics [74]. The most important contribution to the sequence was made in 1921 by Marston Morse in the context of differential geometry and topological dynamics [75], which brought the sequence to worldwide attention. In its simplest form, the TM sequence can be defined by the recursive relations S n = {S n−1 S + n−1 } and S + n = {S + n−1 S n−1 } (for n ≥ 1), with S 0 = 0 and S + 0 = 1 [76]. Using e.g., the two-letter alphabet {G, A} we can build up the sequence using the substitution rules G→GA and A→AG. Hence, T M 0 = G, T M 1 = GA, T M 2 = GAAG, T M 3 = GAAGAGGA, etc.

Double-Period
The double-period (DP) sequence has its origin in the study of system dynamics and laser applications to nonlinear optical fibers [77]. It is closely connected with the TM sequence: the n-th stage is S n = {S n−1 S + n−1 } and S + n = {S n−1 S n−1 } (for n ≥ 1), with S 0 = 0 and S + 0 = 1. Using e.g., the two-letter alphabet {G, A}, we can define the n-th generation by the substitution rules G→GA, A→GG. Hence, starting with DP 0 = G, then DP 1 = GA, DP 2 = GAGG, DP 3 = GAGGGAGA, etc.

Cantor Set
The Cantor Set (CS), introduced by mathematician Georg Cantor, is one of the most well-known deterministic fractals [81]. It is built by splitting a straight line segment in three, removing the middle third, then removing the middle third of each of the two new straight line segments and the process is repeated ad infinitum. Using, e.g., the two-letter alphabet {G, A} and the substitution rules G→GAG, A→AAA, we can define the n-th generation (n = 0, 1, 2, ...) as follows: CS 0 = G, CS 1 = GAG, CS 2 = GAGAAAGAG, etc.

Asymmetric Cantor Set
The Asymmetric Cantor Set (ACS) is built by splitting a straight line segment in four, removing the second quarter, then removing the second quarter of each of the three new straight line segments and the process is repeated ad infinitum. Using, e.g., the two-letter alphabet {G, A} and the substitution rules G→GAGG, A→AAAA, we can define the n-th generation (n = 0, 1, 2, ...) as follows: ACS 0 = G, ACS 1 = GAGG, ACS 2 = GAGGAAAAGAGGGAGG, etc.
One could think of many types of aperiodic polymers, some of which are shown synoptically in Table 1. We just give an example of each type, e.g., for Fibonacci I sequences, we give the example  G, C, CG, CGC, CGCCG, CGCCGCGC, ..., but there are obviously other similar sequences e.g., C, G,  GC, GCG, GCGGC, GCGGCGCG, ..., A, T, TA, TAT, TATTA, TATTATAT, ..., T, A, AT, ATA, ATAAT, ATAATATA, ... Table 1. Examples of the types of polymers studied in this work. I (D) denotes polymers made of identical (different) monomers. We only mention the 5 -3 base sequence along one of the two strands. X(Y) means that the zeroth generation (the "seed") of the binary sequence contains solely X and the other letter in the sequence is Y.

Type
Sequence Example Notation

Theory
In this article, we use a simple wire model, where the site is a monomer (e.g., in DNA, a base pair). We call µ the monomer index, µ = 1, 2, . . . , N. We assume that the state or movement of an extra hole or electron can be expressed through the monomer HOMOs or LUMOs, respectively, cf. Equations (2) and (8) below.

Stationary States-Time-Independent Problem
The TB wire model Hamiltonian can be written aŝ E µ is the on-site energy of the µ-th monomer, and t µ,λ = t * λ,µ is the hopping integral between monomers µ and λ. The state of a polymer can be expressed as Substituting Equations (1) and (2) to the time-independent Schrödinger equation we arrive to a system of N coupled equations which is equivalent to the eigenvalue-eigenvector problem where H is the Hamiltonian matrix of order N, composed of the TB parameters E µ and t µ,λ , and v is the vector matrix composed of the coefficients v µ (which can be chosen to be real). The diagonalization of H leads to the determination of the eigenenergy spectrum (eigenspectrum), {E k }, k = 1, 2, . . . , N, for which we suppose that E 1 < E 2 < · · · < E N , as well as to the determination of the occupation probabilities for each eigenstate, v µk 2 , where v µk is the µ-th component of the k-th eigenvector. {v µk } are normalized, and their linear independence is checked in all cases.
Having determined the eigenspectrum, we can compute the density of states (DOS), generally given by Changing the view of a polymer from one (e.g., top) to the other (e.g., bottom) side of the growth axis, reflects the Hamiltonian matrix H of the polymer on its main antidiagonal. This reflected Hamiltonian, H equiv , describes the equivalent polymer [16]. H and H equiv are connected by the similarity transformation H equiv = L −1 H L, where L(= L −1 ) is the unit antidiagonal matrix of order N. Therefore, H and H equiv have identical eigenspectra (hence the equivalent polymers' DOS is identical) and their eigenvectors are connected by v µk = v equiv (N−µ+1)k . Generally, equiv(YX. . . Z) = Z compl . . . Y compl X compl .

Time-Dependent Problem
To describe the spatiotemporal evolution of an extra carrier (hole/electron), inserted or created (e.g., by oxidation/reduction) at a particular monomer of the polymer, we consider the state of the polymer as where C µ (t) 2 is the probability to find the carrier at the µ-th monomer at time t. Substituting Equations (1) and (8) in the time-dependent Schrödinger equation we arrive at a system of N coupled differential equations Equation (10) is equivalent to a 1st order matrix differential equation of the forṁ where C(t) is a vector matrix composed of the coefficients C µ (t), µ = 1, 2, . . . , N. Equation (11) can be solved with the eigenvalue method, i.e., by looking for solutions of the form Hence, Equation (11) leads to the eigenvalue problem of Equation (5), that is, H v = E v. Having determined the eigenvalues and eigenvectors of H, the general solution of Equation (11) is In other words, the coefficients C µ (t), µ = 1, 2, . . . , N are given by a superposition of the time evolution of the stationary states with time-independent coefficients c k . Hence, this is a coherent phenomenon. The coefficients c k are determined from the initial conditions. In particular, if we define the N × N eigenvector matrix V , with elements v µk , then it can be shown that the vector matrix c, composed of the coefficients c k , k = 1, 2, . . . , N, is given by the expression Suppose that initially the extra carrier is placed at the λ-th monomer, i.e., C λ (0) = 1, C µ (0) = 0, ∀µ = λ. Then, In other words, the coefficients c k are given by the row of the eigenvector matrix which corresponds to the monomer the carrier is initially placed at. In this work, we choose λ = 1, i.e., we initially place the carrier at the first monomer. From Equation (12), it follows that the probability to find the extra carrier at the µ-th monomer is are the frequencies ( f kk ) or periods (T kk ) involved in charge transfer. If m is the number of discrete eigenenergies, then the number of different f kk or T kk involved in carrier transfer is If there are no degenerate eigenenergies (which holds for all cases studied here, but e.g., does not hold for cyclic homopolymers [15]), then m = N. If eigenenergies are symmetric relative to some central value, then S decreases (there exist degenerate f kk or T kk ). Specifically, in that case, S = m 2 4 , for even m and S = m 2 −1 4 for odd m. From Equation (15), in the absence of degeneracy and for real c k , v µk , it follows that the mean over time probability to find the extra carrier at the µ-th monomer is Furthermore, from Equation (15), it can be shown that the one-sided Fourier amplitude spectrum that corresponds to the probability C µ (t) 2 is given by Hence, the Fourier amplitude of frequency f kk is 2|c k v µk c k v µk |. We can further define the weighted mean frequency (WMF) of monomer µ as WMF expresses the mean frequency content of the extra carrier oscillation at monomer µ. Having determined the WMF for all monomers, we can now obtain a measure of the overall frequency content of carrier oscillations in the polymer: Since f µ W M is the weighted mean frequency of monomer µ and |C µ (t)| 2 is the mean probability of finding the extra carrier at monomer µ, we define the total weighted mean frequency (TWMF) as A quantity that evaluates simultaneously the magnitude of coherent charge transfer and the time scale of the phenomenon is the pure mean transfer rate [14] k t λµ is the mean transfer time, i.e., having placed the carrier initially at monomer λ, the time it takes for the probability to find the extra carrier at monomer µ, C µ (t) 2 , to become equal to its mean value, C µ (t) 2 , for the first time. For the pure mean transfer rates, where the superscript "equiv" refers to the equivalent polymer in the sense of Equation (7).

Results
In this article, the TB parameters for B-DNA are taken from Ref. [19]. The HOMO/LUMO base-pair on-site energies are [19] E G-C = −8.0/−4.5 eV, E A-T = −8.3/−4.9 eV. The hopping integrals are given in Table 2.
At this point, we mention that any sign alteration of the hopping integrals does not affect the results presented below, since the Hamiltonian matrices we deal with in the Wire Model are irreducible, symmetric and tridiagonal. This can be shown as follows: let us suppose a N × N irreducible tridiagonal Hermitian matrix T, with diagonal elements T k = a k and non-diagonal elements Then, D is unitary, and the similarity transformationT = D −1 T D leads to the matrixT with diagonal elements T k = a k and non-diagonal elementsT (k,k+1) = r k+1 . Hence, the tridiagonal Hermitian matrix T has the same eigenvalues with the tridiagonal real symmetric matrixT, which has positive non-diagonal entries [82]. Let us further suppose that T is real. Then, θ k = 0 or θ k = π depending on whether T (k,k+1) > 0 or T (k,k+1) < 0. The elements of D will be d k = d k−1 (±1), ∀k = 2, . . . , n. Hence, the matrixT, which has positive entries, has the same eigenvalues with T, which differs byT in that its off-diagonal elements have negative signs in arbitrary positions. Finally, if v is an eigenvector of T, then D −1 v is an eigenvector ofT.

Eigenspectra, Density of States, Energy Gaps
In Figures 1 and 2, we present the HOMO and LUMO eigenspectra, for increasing N, of I and D polymers, respectively, and in Figures 3 and 4 the corresponding DOS. In Figures 1 and 2, different colors correspond to different generations, e.g., in the Fibonacci sequence, the number of monomers in the polymer, N, takes the following values from generation to generation (1, 1,) 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, ... For both I and D polymers, we notice that in quasi-periodic polymers the DOS has rather acute subbands, while in fractal polymers the DOS is fragmented and spiky. In Figures 3 and 4, for illustration purposes, the DOS has been calculated for polymers made of a very large number of monomers N. This value is shown in each panel. Of course, the persistence length of DNA is around 50 nm or 150 base pairs [32]. On the other hand, if we stretch and join the DNA of all chromosomes of a single cell that would give us a length of the order of a meter and would consist of billions of base pairs.
For I polymers, i.e., polymers made of identical monomers (cf. Figures 1 and 3), we observe that all eigenvalues are symmetric relative to the monomer's on-site energy (this, obviously, also holds for the DOS). This observation can be mathematically proven as follows: for N even, the Hamiltonian matrix of a generic I polymer is H = E µ I + T GK , where E µ is the (constant) on-site energy, I is the identity matrix and T GK is the Golub-Kahan matrix, containing only the non-diagonal elements of H, i.e., the HOMO or LUMO hopping integrals t µ,λ . It can easily be shown that T GK = P T BP, where P is the perfect shuffle matrix and By performing the Singular Value Decomposition of the upper bidiagonal matrix A, i.e., by writing it as A = USW T , we obtain Thus, finally, Hence, the eigenvalues of T GK are given by the positive and negative values of the diagonal matrix S, i.e., they are symmetric around zero [83]. Hence, since, H = E µ I + T GK , the eigenvalues of H are symmetric around E µ . For N odd, we can add a zero row and a zero column to T GK so that it is again of even order and follow the aforementioned procedure. Then, two degenerate trivial eigenvalues will appear apart from the symmetric ones [84]. Thus, the eigenvalues of H occur by omitting the zero row and column, hence they are symmetric around E µ , which is also an eigenvalue.  The horizontal axis shows the number of monomers in the polymer N. Different colors correspond to different generations. The notation of polymers is given in Table 1. The horizontal axis shows the number of monomers in the polymer N. Different colors correspond to different generations. The notation of polymers is given in Table 1. The notation of polymers is given in Table 1. The notation of polymers is given in Table 1.
For D polymers, i.e., polymers made of different monomers (cf. Figures 2 and 4), the eigenenergies and the DOS gather around the two monomer's on-site energies.
The energy gap of a monomer is the difference between its LUMO and HOMO levels. The energy gap of a polymer is the difference between the lowest level of the LUMO regime and the highest level of the HOMO regime because we assume that the orbitals-one per site-which contribute to the HOMO (LUMO) band are occupied (empty), since in both possible monomers there is an even number of p z electrons contributing to the π stack. In Figure 5, we present the energy gaps (calculated for large N; cf. Figures 3 and 4) and the HOMO and LUMO band limits of all aperiodic polymers examined in this work. The G-C (A-T) monomer gap is always greater than the gaps of I polymers made of G and C or A and T. D polymers have smaller HOMO-LUMO gaps than I polymers (cf. upper panel of Figure 5). Furthermore, the lower HOMO (LUMO) band limit of D polymers is always between the lower and upper HOMO (LUMO) band limit of I polymers consisted of A and T, while the upper HOMO (LUMO) band limit of D polymers is always between the lower and upper HOMO (LUMO) band limit of I polymers consisted of G and C (cf. lower panel of Figure 5).

T(A) TM A(T) TM T(A) DP A(T) DP T(A) RS A(T) RS T(A) CS A(T) CS T(A) ACS A(T) ACS T(A)
--  Table 1.

Mean over Time Probabilities
The main aspects of our results for the mean over time probabilities for I and D polymers are summarized in Figures 6 and 7 (where we show only two consecutive generations) and in Figures A1 and A2 in Appendix A (where we show many consecutive generations), for some example cases. We suppose that the extra carrier is initially placed at the first monomer. Usually, these probabilities are distributed to monomers close to the one the carrier was initially placed.
The mean over time probabilities of finding the extra carrier at each monomer of a polymer depends on the sequence on-site energies and magnitude of hopping parameters between successive monomers. This can more easily be seen in I polymers (cf. Figure 6), where only the hopping integrals affect the energy structure. For the Thue-Morse G(C) polymers, the probabilities are palindromic for odd generation numbers. This is due to the fact that the Hamiltonian matrices of these polymers are palindromic, i.e., reading them from top left to bottom right and vice versa gives the same result [18]. This property stems directly from the sequence structure. For Cantor Set A(T) polymers, the mean over time probability for an extra hole is almost totally distributed at the four (or three for generation 1) starting monomers, regardless of N, while for an extra electron the probabilities are almost semi-palindromic, i.e., C µ (t) 2 = C N−µ+1 (t) 2 , µ = 2, 4, ..., N − 1. In this case, even if the sequence structure is the same for HOMO and LUMO, the magnitude of hopping integrals has a stronger effect on the results. Another example is the Rudin-Shapiro A(T) sequence where the mean over time probability for an extra electron is almost totally distributed at the four starting monomers, regardless of N, while for holes it is basically distributed at monomers 1, 2, 3 and 6. Regarding the extra hole in Asymmetric Cantor C(G) polymers, the probability is much higher for monomers 1, 2, 9, 10 of every 32-monomer period. Generally, for I polymers, the mean over time probabilities are significant only rather close to the first monomer, although in some cases we observe non-negligible probabilities at more distant monomers.  Table 1.  Table 1.
Generally, for D polymers, the mean over time probabilities are almost negligible further than the first monomer. An exception is the Rudin-Shapiro A(G) sequence where the probabilities for both HOMO and LUMO are almost totally distributed at the three starting monomers of each polymer, regardless its length. Likewise, the mean over time probability for the extra electron in Cantor Set A(G) polymers is almost totally distributed at the first and third monomer of each polymer, regardless its length. An extra electron in Double-Period A(G) reaches somehow more distant monomers.
A general picture is that in the aperiodic polymers studied in this work the mean over time probabilities generally decline a little away from the site where the carrier was initially placed. This situation is different from the general picture in periodic polymers studied e.g., in Refs. [15,16,18], where, as a rule, non-negligible probabilities exist at distant-from the initial-sites.

Frequency Content
The frequencies involved in charge transfer are given by Equation (16). Hence, the maximum frequency is determined by the maximum difference of eigenenergies, i.e., by the upper and lower limits of the HOMO or LUMO band (calculated for large N; cf. Figures 3 and 4). These maximum frequencies for all studied polymers are shown in Figure 8.  Table 1.
The Fourier spectra of the time-dependent probability to find an extra electron or hole at each monomer are generally in the THz regime, mainly in the FIR (far infrared) and MIR (middle infrared) part of the electromagnetic spectrum. When the dominant frequencies, i.e., those with greater Fourier amplitudes, are smaller (larger), the carrier transfer is slower (faster). Extensive examples of the Fourier spectra of the probability to find an extra carrier at the first and at the last monomer, having placed it initially at the first monomer, for I and D aperiodic polymers, for the HOMO and the LUMO regime, can be found in Refs. [85,86]. Recently, we have also analyzed [18,87] the frequency content of periodic polymers, using the TB wire model or the TB extended ladder model, including the Fourier spectra, the WMFs and the TWMF as a function of N, with details in Refs. [88,89].
In Figure 9, we depict the TWMF as a function of N for the various types of aperiodic polymers. We notice that the TWMF generally stabilizes as the generation number increases. In all cases of aperiodic polymers studied in this work, the TWMFs are in the region ≈ 10 −2 -10 2 THz. In various cases of periodic I and D polymers studied in Ref. [18], the TWMFs were found in the region ≈ 10 0 -10 2 THz.  Table 1.

Pure Mean Transfer Rates
Next, we study the pure mean transfer rates from the first to the last monomer, k 1,N , or from now on, just k. We depict k(N) either for HOMO or for LUMO, for I and D polymers in Figure 10. In all cases, k(N) is a decreasing function. Generally, the degree of coherent transfer difficulty is greater for D polymers. Overall, our results suggest that I polymers, which are simpler cases in terms of energy intricacy, are more efficient regarding coherent hole and electron transfer. The notation of polymers is given in Table 1.
We include in each panel of Figure 10, k(N) of homopolymers (e.g., A...) which are the "champions" among periodic polymers in terms of efficiency of coherent carrier transfer [18], i.e., in terms of magnitude of k and of slower decrease of k(N). It seems that k(N) of homopolymers is an unreachable limit for aperiodic polymers. Comparing the periodic polymers studied in Refs. [15,16,18] with the aperiodic polymers studied in this work, in terms of k(N), we realize that although generally periodic polymers are more efficient, specific aperiodic polymers can be better than specific periodic ones. However, the general picture is that charge transfer in aperiodic polymers is orders of magnitude worse than in periodic polymers.
In each panel of Figure 10, we also take the best of aperiodic polymers in terms of k(N) and shuffle randomly the sequence of its monomers. In all cases, except for Cantor Set HOMO, this random shuffle deteriorates severely k(N). For Cantor Set, A(T) and T(A) have identical k(N) because the Cantor Set rules for A(T) and T(A) produce equivalent polymers, cf. Equation (7). For equivalent polymers, k(N) from the first to the last monomer are identical, cf. Equation (22). For example, TAT ≡ ATA, TATAAATAT ≡ ATATTTATA, TATAAATATAAAAAAAAATATAAATAT ≡ ATATTTATATTTTTTTTTATATTTATA and so on. Similarly, the Cantor Set rules for G(C) and C(G) produce equivalent polymers, which have identical k(N). In Cantor Set HOMO, the best sequences in terms of k(N) are A(T) and T(A), where the hopping integrals involved are t AA = t TT = − 8 meV, t AT = 20 meV, t TA = 47 meV, and we have just one on-site energy, that of A-T. From these hopping integrals, t AA has the smallest absolute value. Given the structure of the Cantor Set sequences, making the random shuffle, the number of t AA decreases, while the numbers of the larger hopping integrals, t AT and t TA increase. For this reason, in Cantor Set HOMO, the random shuffle increases k(N). In Cantor Set LUMO, this argument is inverted because now the best sequences in terms of k(N) are G(C) and C(G), where the hopping integrals involved are t GG = t CC = 20 meV, t GC = −10 meV, t CG = −8 meV, and we have just one on-site energy, that of G-C. In this case, the random shuffle decreases the number of the larger hopping integrals t GG = t CC and increases the number of the smaller hopping integrals t GC and t CG . However, apart from the exception of the Cantor set HOMO, generally speaking, the conclusion is that deterministic aperiodic polymers possess some kind of order, i.e., a well-defined construction rule that makes them more efficient than random polymers in terms of k(N); therefore, when this rule is destroyed, the transfer efficiency diminishes.

Transfer Rates in Experiments
Comparison of the coherent pure mean transfer rates k of our prototype system, B-DNA, with experimentally obtained transfer rates is a rather complicated issue. In the past, the experimental transfer rates in donor-bridge (DNA)-acceptor systems were obtained using the concentrations of different products generated e.g., when a hole is (PY) or is not (PN) transferred. The concentrations of PY and PN were indirectly measured by methods like polyacrylamide gel electrophoresis and piperidine treatment [90,91]. Although these methods revealed some aspects of hole transfer like the sequence dependence and the ability of transfer, they do not provide the kinetics of hole transfer in DNA [92]. Although, generally, greater concentration of PY implies greater charge transfer, there is no proof that the concentrations of PN and PY are proportional to the degree of transfer.
Quantum mechanically, only a fraction of the carrier reaches the acceptor through the bridge. For the same reason, the definition of transfer time is problematic. The transfer rate should depend both on the amount and the speed of transfer. However, the concentration of PY is not strictly proportional to the amount of carrier transfer and not strictly inversely proportional to the time of transfer. A more direct experimental approach is time-resolved spectroscopy, e.g., transient absorption, to observe the products of charge transfer [92][93][94].
Our point of view is different, since the quantity we use, the pure mean transfer rate [14], given by Equation (21), uses simultaneously the magnitude of coherent charge transfer and the time scale of the phenomenon. However, our method applies to coherent transfer only and cannot cover incoherent mechanisms like thermal hopping.
It is a common assertion in the literature that when the fall of the transfer rate with respect to the length of a given DNA segment is described by an exponential fit, the mechanism of transfer is superexchange, whereas when it is described by a power law fit, the mechanism of transfer is multi-step hopping. However, we stress that the fitted parameters produced this way should be treated with care, especially when it comes to attributing them to specific mechanisms. For example, in Ref. [92], where the hole transfer kinetics of various short DNA segments were experimentally investigated with time-resolved spectroscopy, the authors present an exponential decay length β = 1.6 Å −1 by fitting the experimental hole transfer rates of G(A) n G DNA oligomers (n = 0, 1, 2) to the exponential law K = K 0 e −βd , where d is the charge transfer distance, i.e., d = 3.4 × (N − 1) Å. Using the transfer rate values of Ref. [92], we observed that, although β, determined as the slope of the linear fit ln(K) = ln(K 0 ) − βd is indeed ∼ = 1.6 Å −1 , a direct exponential fit gives β ∼ = 1.3 Å −1 , suggesting that the law of decay is not exactly exponential. On the contrary, the fits of our theoretically obtained pure mean transfer rates, k, for the same system, give β ∼ = 1.84 Å −1 for β determined as the slope of the linear fit ln(k) = ln(k 0 ) − βd, and β ∼ = 1.79 Å −1 for a direct exponential fit k = k 0 e −βd , suggesting closer convergence to an exponential decay. Similarly, in Ref. [95], the authors experimentally study, with time-resolved spectroscopy, hole transfer through (GA) n and (GT) n sequences, where n = 2-12 is the number of repetition units. The authors fitted the obtained transfer rates to the power law K = K 0 N −η , where N is the number of hopping steps between guanines (in our notation, N = N 2 − 1), reported the same exponent for both sequences, i.e., η = 2, and suggested that this value provides evidence that the long-distance hole transfer occurs by multi-step hopping between guanines. From the rate values provided in Table I of Ref. [95], we observed that, although η as a slope of the linear fit ln(K) = ln K 0 − η ln(N ) is indeed 2 for both sequences, a direct power law fit yields η ∼ = 1.4 for (GA) n and η ∼ = 1.3 for (GT) n , suggesting that the rate decay does not follow exactly a power law. On the contrary, the fits of our theoretically obtained pure mean transfer rates, k, for (GA) n , give η ∼ = 1.40 for η determined as the slope of the linear fit ln(k) = ln k 0 − η ln(N ), and η ∼ = 1.56 Å −1 for a direct power law fit k = k 0 N −η . The respective values for (GT) n are η ∼ = 2 for both fits. Hence, our theoretical results suggest that the fall of k, as the length of the bridge increases, convergences to a power law and that the fall of the transfer rate is less steep when purines are on the same strand compared to the case when purines are crosswise.
DNA is a dynamical structure, i.e., the geometry is not fixed. Large variations of the TB parameters are expected in real situations and also large variations of the TB parameters have been obtained by different theoretical methods by different authors, cf. e.g., Ref. [14] and references therein. Hence, the parameters any TB model uses have to be utilized with care. In Ref. [96], the authors report experimentally deduced (by transient absorption spectroscopy) charge separation rates, in capped A n (n = 1-7) and A 3 G n (n = 1-19) DNA hairpins with a stilbenedicarboxamide hole donor and a stilbenediether hole acceptor. We computed our theoretical coherent pure mean transfer rates, k, for the same systems with a modified parametrization: t AA → 1.6t AA , t AG → 2.1t AG , t GG → 2.25t AG (cf. Table 2). In order to mimic the donor and the acceptor, we added two sites at the ends of the TB chain, with on-site energies E don = E A−T − 0.1 eV, E ac = E G−C + 0.1 eV. We used for the hopping integral from the donor (last base pair) to the first base pair (acceptor) 100 meV (250 meV). Our results, along with the experimental ones, are depicted in Figure 11. Apart from the A 1 and A 2 systems, for which we find much larger rates, the pure mean transfer rates k are of the same order of magnitude, in good quantitative agreement with the experimental transfer rates K. Actually, the same sequences A n (n = 1-7) and A 3 G n (n = [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19] analyzed in Ref. [96] had also been analyzed by the same group in Ref. [97]. In Ref. [97], the authors mention a time resolution of ca. 180 fs. Hence, roughly, only transfer rates K < (1/180) PHz ≈ (1/200) PHz = 5 × 10 −3 PHz can be detected by this technique.  Figure 11. Comparison of experimental hole transfer rates K for A n and A 3 G n segments [96] (full circles) with our theoretical coherent pure mean transfer rates k (empty circles), as a function of the number of monomers N in the polymer. The Tight-Binding parametrization is described in the main text.

Conclusions
We systematically studied the energy structure and the coherent transfer of an extra carrier, electron or hole, along various categories of binary quasi-periodic (Fibonacci, Thue-Morse, Double-Period, Rudin-Shapiro) and fractal (Cantor Set, Asymmetric Cantor Set) polymers consisting of either the same monomer (I polymers) or different monomers (D polymers), using the TB wire model and B-DNA as a prototype system.
Regarding the energy structure of the polymers, we calculated the HOMO and LUMO eigenspectra and the density of states, as well as the HOMO-LUMO gap. The eigenenergies lie around the monomers' on-site energies. We demonstrated numerically and proved analytically that for I polymers, the eigenenergies are always symmetric relative to the (constant) monomer on-site energy. For both I and D polymers, in quasi-periodic cases, the DOS has rather acute subbands, while in fractal cases, it is fragmented and spiky. D polymers pose smaller HOMO-LUMO gaps than I polymers.
The G-C (A-T) monomer gap is always greater than the gaps of I polymers made of G and C or A and T. The lower HOMO (LUMO) band limit of D polymers is always between the lower and upper HOMO (LUMO) band limit of I polymers consisted of A and T, while the upper HOMO (LUMO) band limit of D polymers is always between the lower and upper HOMO (LUMO) band limit of I polymers consisted of G and C.
Next, we studied the mean over time probabilities to find an extra hole or electron at each monomer of the polymer, having it initially placed at the first monomer. For I polymers, the mean over time probabilities are significant only rather close to the first monomer, although in some cases we observe non-negligible probabilities at more distant monomers. For D polymers, the mean over time probabilities are generally negligible further than the first monomer. This situation in aperiodic polymers, where the mean over time probabilities generally decline a little away from the site where the carrier was initially placed, is in contrast to the situation in periodic polymers studied e.g., in Refs. [15,16,18], where, generally, non-negligible probabilities exist at distant-from the initial-sites.
Furthermore, we determined the frequency content of coherent extra carrier transfer via the total weighted mean frequency of the polymer, using the weighted mean frequencies of the Fourier spectra that correspond to the probabilities to find the carrier at each monomer. The TWMF generally stabilizes after a few generations. We showed that, in all cases of aperiodic polymers studied in this work, the TWMF lies in the regime ≈ 10 −2 − 10 2 THz. This is different from various cases of periodic I and D polymers studied in Ref. [18], where the TWMFs were found in the region ≈ 10 0 − 10 2 THz.
The study of the pure mean transfer rates, k(N), shows that I polymers, which are simpler cases in terms of energy intricacy, are more efficient than D polymers regarding coherent hole and electron transfer. Comparing the periodic polymers studied in Refs. [15,16,18] with the aperiodic polymers studied in this work, in terms of k(N), we realize that, although generally periodic polymers are more efficient, specific aperiodic polymers can be better than specific periodic ones. However, the general picture is that charge transfer in aperiodic polymers is orders of magnitude worse than in periodic polymers. The structurally simplest periodic polymers, i.e., the homopolymers [18], represent an unreachable limit for all aperiodic polymers. Furthermore, a random shuffle of a quasi-periodic or fractal monomer sequence destroys the deterministic character of its construction rules, thus leading to vanishing transfer rates. As far as comparison with experiments is concerned, large variations of the TB parameters are expected in real situations, hence modifications are necessary. Using a modified parametrization, we were able to find hole pure mean transfer rates k of similar magnitude with experimental transfer rates K obtained by time-resolved spectroscopy.

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The following abbreviations are used in this manuscript:  The notation of polymers is given in Table 1. The notation of polymers is given in Table 1.