



# Article Quantum LFSR Structure for Random Number Generation Using QCA Multilayered Shift Register for Cryptographic Purposes

Hyun-Il Kim<sup>1</sup> and Jun-Cheol Jeon<sup>2,\*</sup>

- <sup>1</sup> Department of Robotics Engineering, Daegu Gyeongbuk Institute of Science & Technology, Dalseong-gun, Daegu 42988, Korea; hyunil89@dgist.ac.kr
- <sup>2</sup> Department of Convergence Science, Kongju National University, Gongju 32588, Korea
- \* Correspondence: jcjeon@kongju.ac.kr; Tel.: +82-41-850-8563

Abstract: A random number generator (RNG), a cryptographic technology that plays an important role in security and sensor networks, can be designed using a linear feedback shift register (LFSR). This cryptographic transformation is currently done through CMOS. It has been developed by reducing the size of the gate and increasing the degree of integration, but it has reached the limit of integration due to the quantum tunneling phenomenon. Quantum-dot cellular automata (QCA), one of the quantum circuit design technologies to replace this, has superior performance compared to CMOS in most performance areas, such as space, speed, and power. Most of the LFSRs in QCA are designed as shift registers (SR), and most of the SR circuits proposed based on the existing QCA have a planar structure, so the cell area is large and the signal is unstable when a plane intersection is implemented. Therefore, in this paper, we propose a multilayered 2-to-1 QCA multiplexer and a D-latch, and we make blocks based on D-latch and connect these blocks to make SR. In addition, the LFSR structure is designed by adding an XOR operation to it, and we additionally propose an LFSR capable of dual-edge triggering. The proposed structures were completed with a very meticulous design technique to minimize area and latency using cell interaction, and they achieve high performance compared to many existing circuits. For the proposed structures, the cost and energy dissipation are calculated through simulation using QCADesigner and QCADesigner-E, and their efficiency is verified.

**Keywords:** cryptography; random number generator; linear feedback shift register; quantum-dot cellular automata; cell interaction

# 1. Introduction

Current CMOS technology is a basic technology used in the digital and analog electronics industry. CMOS continues to be developed by reducing the size of the gate and increasing integration, but it is approaching the limit of integration due to the quantum tunneling phenomenon [1]. To solve this problem, nano-circuit design technologies such as quantum-dot cellular automata (QCA) are being developed to replace CMOS. QCA circuit technology is a next-generation digital nano circuit design technology with many advantages such as low power consumption and fast switching speed [2]. In QCA, circuits made based on digital logic can be redesigned and used based on QCA, and many digital circuits in CMOS such as logical and arithmetic operators have been proposed using QCA [3–10].

A multiplexer (Mux) is a combination circuit that is essential in most digital circuit designs, and it determines an output line according to an input selection value. Mux can be used effectively to create storage space such as D-latch or D flip-flop (F/F), and can be implemented easily. Circuits can be implemented using basic logical operators such as AND, OR, and NOT, and various circuits designed using fault tolerance and cell interaction



Citation: Kim, H.-I.; Jeon, J.-C. Quantum LFSR Structure for Random Number Generation Using QCA Multilayered Shift Register for Cryptographic Purposes. *Sensors* 2022, 22, 3541. https://doi.org/ 10.3390/s22093541

Academic Editors: France Le Bihan and Olivier Bonnaud

Received: 30 March 2022 Accepted: 4 May 2022 Published: 6 May 2022

**Publisher's Note:** MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.



**Copyright:** © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/). are emerging. Efforts to minimize latency and area have continued by designing not only coplanar structures, but also multilayered structures [11–19].

Using D-latch or D-F/F, the shift register (SR) can be designed and developed into the linear feedback shift register (LFSR) structure we want to make. A latch simply stores and outputs a value, whereas F/F has a structure in which the output value is determined by a change in the clock. There is also a negative-edge-triggered structure that generates an output when the clock changes to 0 and a positive-edge-triggered structure that produces an output when the clock changes to 1, and there is a dual-edge-triggered structure that enables both. In addition, D-F/F with a reset function has been proposed to increase efficiency, but it has resulted in the degradation of delay time and space efficiency [20-26].

As SRs are widely used in digital circuits such as memory circuits, computer output and input ports, and counter configurations, many studies have been conducted based on QCA [27–33]. SRs have been generally proposed by connecting blocks made based on D-latch or D-F/F. However, there is a problem of delay of the clock signal transmitted to each block. Therefore, improved circuits were suggested so that the clock signal can be transmitted equally to each block. Additionally, a multilayered structure was used to solve the spatial complexity problem. Conventional SR circuits have various problems such as structural parts, signal noise, time and space complexity, and clock synchronization. In this paper, we propose a 2-to-1 Mux and a D-F/F with a multilayer structure with cell interaction. The proposed structures solve various existing problems and constitute an efficient *n*-bit SR.

In addition, we designed the LFSR structure, which plays an important role in the random number generator [34–36]. Various LFSR structures have been proposed in the past, but a large amount of wasted area was used for wiring, or due to an effort to reduce such space, various problems of latency and signal transmission or energy dissipation increased [36–39]. The proposed structures not only solve the area and latency problems mentioned above, but also minimize energy dissipation, which is very important in designing a large quantum circuit. The contributions of this work can be itemized as follows.

- A multilayered 2-to-1 Mux using cell interaction is proposed. Additionally, an optimized D-latch is proposed using the Mux.
- By connecting the proposed D-latch, a 4-bit SR with modularity and scalability is proposed using a multilayered structure.
- A three-input XOR gate is connected to the proposed SR to complete the 4-bit LFSR structure, and a dual-edge trigged LFSR structure is additionally proposed.
- The proposed structures and the structures of existing papers were compared, the accuracy of design and operation was checked and compared using QCADesigner [40], the latency and required area were checked, and the cost was calculated.
- Finally, the proposed LFSR structure was compared with the best existing structures by additionally calculating energy dissipation using QCADesigner-E [41].

In this paper, we propose an LFSR structure for random number generation. This paper is structured as follows. Section 2 describes the basic knowledge of QCA and previously proposed 2-to-1 Muxes, D-latches, SRs, and LFSRs. Section 3 presents a multilayered 2-to-1 Mux and extends it to implement D-latch. In addition, the SR structure is made by connecting D-latch to several blocks, the three-input XOR gate is connected, and the LFSR structure is proposed. Additionally, an LFSR structure capable of dual-edge triggering is also proposed. Section 4 compares and analyzes the performance of the proposed circuits and the existing circuits in area, latency, and energy dissipation. Finally, we conclude in Section 5.

### 2. Related Works

### 2.1. Background of QCA

A quantum cell, a basic component of QCA, has four quantum dots and two electrons. Each electron is positioned diagonally to each other due to Coulomb repulsion [2]. These electrons have two types of arrangements according to the input signal, and each type has a polarization of +1 (binary logic 1) or -1 (binary logic 0) as shown in Figure 1a. Additionally, QCA cells can be placed adjacent to each other and used as QCA wiring. If the polarization state of one cell is specified, the adjacent cells change to the same polarization to lower the energy between cells, as shown in Figure 1b, and the signal will be transmitted [3–7].



Figure 1. QCA basic concept: (a) two possible polarizations; (b) wiring.

Figure 2 shows a representative inverter gate among the basic gates composed of QCA cells. In the inverter gate, when a value is input from the input cell IN, the output cell OUT changes to the opposite polarization and outputs it. The inverter in QCA can be implemented in the form of the weak inverter in Figure 2a, which transmits a weak signal although the required area is small, and the robust inverter in Figure 2b, which takes up a relatively large area although the signal is transmitted strongly [8–10].



Figure 2. Inverters: (a) week inverter; (b) robust inverter.

## 2.2. Multilayer Structure

The QCA circuit is classified into a planar structure using only one layer and a multilayer structure using multiple layers. Unlike the planar structure, the multilayer structure has an interlayer interaction. As shown in Figure 3a, when cells are connected diagonally, the signal is transmitted as it is in the existing planar structure, but as shown in Figure 3b the cells are connected vertically. It has the property of an inverter in which the signal of the cell is inverted [11]. Cells connected vertically have higher signal strength than cells connected with diagonal lines, and efficient circuit design can be achieved using this property.

The multilayer structure is closer to the electrons than the planar structure, so the signal strength is strong and it only requires a small area. As shown in Table 1, when the conventional planar structure inverter is changed to a multilayered inverter as shown in Figure 3, the weak inverter of Figure 3b has higher signal strength and a much smaller area than the robust inverter of Figure 2b. Using these characteristics to design a multilayer structure can have several advantages over designing a planar structure. However, due



to the complexity of the design and the difficulty of implementation, very delicate work is required.

Figure 3. Multi-layer structures: (a) diagonal line; (b) vertical line; and (c) multi-layer robust inverter.

| Inverters | Cell Count | Area (nm <sup>2</sup> ) | Signal Strength (10 <sup>-1</sup> J) | Structure   |
|-----------|------------|-------------------------|--------------------------------------|-------------|
| Figure 2a | 2          | 1444                    | 5.62                                 | Coplanar    |
| Figure 2b | 7          | 4758                    | 7.75                                 | Coplanar    |
| Figure 3b | 2          | 324                     | 9.69                                 | Multi-layer |
| Figure 3c | 7          | 1404                    | 8.42                                 | Multi-layer |

Table 1. Performance comparison of coplanar inverter and multi-layer inverter.

### 2.3. Previous QCA Multiplexers and D-Latch

A Mux is a circuit that selects one of several input signals and delivers the selected input to an output line. This is used in many circuits such as D-latch, registers, and RAM cells, and a D-latch can be designed by creating a loop section using a Mux. Table 2 is the truth table of the 2-to-1 Mux, which has three input signals, S, A, and B, and one output, OUT. The input value S is a selection signal, and depending on whether it is 0 or 1, the input value A or B is outputted, respectively.

S В OUT Α 

 Table 2. Truth table of 2-to-1 Multiplexer.

Figure 4a is a logic diagram of a 2-to-1 Mux. A circuit is built using one NOT gate, two AND gates, and one OR gate. In Figure 4, S denotes selection lines, A (or I0) and B (or I1) denote input lines, OUT (or F) denotes output lines, and orange cells denote fixed cells with -1 or +1. Figure 4b through Figure 4h show previously proposed QCA 2-to-1 Mux circuits [12–18]. In Figure 4b, a weak inverter is used for NOT operation and three majority gates are used for AND and OR operations according to the format of the basic logic diagram in Figure 4a. Figure 4d,e was designed using a majority gate, and each was designed using a rotated cell and a multilayer structure, respectively. The remaining circuits were designed to minimize complexity by using cell interaction instead of implementing logical operations.



**Figure 4.** Previous 2-to-1 Muxes: (**a**) a logic diagram of 2-to-1 multiplexer; (**b**) B. Sen et al.'s [12]; (**c**) M. N. Asfestani et al.'s [13]; (**d**) D. Ajitha et al.'s [14]; (**e**) M. Mosleh's [15]; (**f**) A. H. Majeed et al.'s [16]; (**g**) A. H. Majeed's [17]; and (**h**) S. S. Ahmadpour's [18].

The principle of Figure 4c is that when the value of S is +1, the signal of fixed cell (-1) is weakened and the signal of fixed cell (+1) is strengthened, so that the signal of B, which is close to fixed cell (+1), is output. The first two circuits designed for cell interaction are a good way to reduce the number and area of cells, but due to the limitation of the planar structure there is a disadvantage in that the area increases when designing a large circuit using this Mux as an element. Figure 4e is a previously proposed multilayer Mux circuit. Although it can be made smaller than Figure 4b, it is operated on the first and third layers and used as a simple connection line that transmits only signals on the second layer so that it does not have spatial superiority compared to Figure 4f through Figure 4h.

Table 3 is the truth table of D-latch, which is a circuit that can store and maintain 1-bit information and is a basic element of a sequential circuit. The D-latch has an input signal D and a clock input CLK, and one output. D-Latch gives the result value of the Mux as an input value again. When CLK is 1, the result value is changed, and when CLK is 0, the previous value is output.

| CLK | D      | OUT                      |
|-----|--------|--------------------------|
| 0   | 0<br>1 | OUT(t - 1)<br>OUT(t - 1) |
| 1   | 0<br>1 | 0<br>1                   |

Table 3. Truth table of D-latch.

Figure 5a shows a logic diagram of D-latch. Figure 5b,e show that wiring is added to the Mux circuit based on the majority vote, while Figure 5c,f,g confirm that the wiring is connected to the Mux by cell interaction. Figure 5d uses a rotated majority gate, and Figure 5h shows the completed D-latch using a multilayered structure. As shown in Figure 5, D-latch is completed by adding wiring to the existing Mux circuit. Therefore, it can be seen that the performance of the D-latch circuit is determined by how efficiently and well the Mux circuit is designed.



**Figure 5.** Previous D-latches: (**a**) a logic diagram of D-latch; (**b**) M. M. Abutaleb's [20]; (**c**) M. G. Roshan et al.'s [21]; (**d**) T. N, Sasamal et al.'s [22]; (**e**) J. C. Jeon's [23]; (**f**) A. H. Majeed et al.'s [24]; (**g**) Z. Song et al.'s [25]; and (**h**) D. K. Seo et al.'s [26].

An SR is a circuit that temporarily stores binary information and transfers information left or right. The SR consists of a number of D-latches. The output of each latch is connected to the input of the next latch. According to the first clock input, one bit of binary information is input to the SR, and the previously stored information is moved along with the next clock input. Q0 to Q3 can check the output value of each D-latch of every clock. A logic diagram of the *n*-bit shift register is shown in Figure 6.





Figure 7 shows the previously proposed QCA 3- or 4-bit SR circuits. In papers [28,30], a 3-bit serial SR was proposed by arranging three D-latch blocks, but there was a problem that the clock was sequentially delivered to each block, resulting in a long delay. To solve this problem in papers [29,32,33], the clocks were synchronized. In paper [31], an effort was made to reduce the area of the SR by using a multilayer structure, but there was a problem in that the implementation complexity was surpassed by using five layers and the development cost increased. In addition, as each layer has a different structure, it lacks modularity and scalability. In papers [28–31], the circuits had a SISO structure with serial input and output, but the circuit in [32] developed a SIPO structure that allowed simultaneous output, and the circuit in [33] developed a PIPO structure that allowed simultaneous output as well as simultaneous input.

LFSR can be used for a random number generator (RNG), a cryptographic technology that plays an important role in security, and is a binary stream generator applied to various stream ciphers [34–36]. The LFSR is designed to have high periodicity and good statistical properties, and Figure 8 shows a 4-bit logic diagram of the LFSR structure. The initial values of the D-latch are shifted to the right every clock, and Q2 and Q3 are input to D0 by performing an XOR operation every clock. For example, if the initial value is 1010, the following values are calculated in the order of 1101, 1110, 1111, 0011, 0001, 1000, 0100, 0010, 1001, 1100, 0110, 1011, and 0101. With the exception of 0000, it has a maximum periodicity to obtain 15 combinations of all 4-bit random numbers.

Figure 9 shows typical LFSR structures. The circuit in paper [36] transfers the outputs of the second and fifth blocks in 5-bit SR to the input of the first block after XOR operation. The proposed circuit uses a robust inverter based on the majority vote to increase the signal strength, but the size of the XOR operation is large and there are many wasted areas due to long wiring, which reduces the overall circuit performance. Although the size of each block and XOR has been reduced in the circuit of paper [38], it has similar characteristics to the circuit in paper [36]. The circuit in paper [37] is a 3-bit LFSR structure with a majority vote, and was designed to synchronize the clocks and enable simultaneous output, and strives to minimize the wasted area. Each block of the LFSR structure in paper [39] was designed based on a rotated majority gate, and the required area was minimized by using an XOR gate using cell interaction. In addition, clock synchronization and simultaneous output are possible, and it was designed to enable dual-edge triggering.



**Figure 7.** Typical 3~4 bit SRs: (**a**) J. C. Das's [28]; (**b**) M. N. Divshali et al.'s [29]; (**c**) M. Abdullah-Al-Shafi et al.'s [30]; (**d**) T. Li et al.'s [31]; (**e**) M. G. Roshan et al.'s [32]; and (**f**) S. Fan et al.'s [33].



Figure 8. A logic diagram of 4-bit LFSR.



**Figure 9.** Typical LFSR structures: (a) A. Rezaei et al.'s [36]; (b) H. Mohammadi et al.'s [37]; (c) M. Kaviya et al.'s [38]; and (d) Z. Amirzadeh et al.'s [39].

#### 3. The Proposed Structures

#### 3.1. The Proposed 2-to-1 Multiplier and D-Latch

In this paper, we designed a new multilayer 2-to-1 Mux as shown in Figure 10 using cell interaction and a multilayer structure. This structure consists of three layers, and the selection input, S, is placed in the middle layer, and fixed cells of 1 and 0 are placed vertically above and below the selection cell. The input values, A and B, are placed opposite to each other on the third and first layer, respectively, and the output cell, OUT, is placed on the third layer to minimize the required area.



Figure 10. Proposed 2-to-1 Multiplexer: (a) full circuit; (b) layer 1; (c) layer 2; and (d) layer 3.

When the S value is -1 (binary logic 0), a signal of +1 is given above and below the S value due to the characteristic of the multilayer structure in which the signal is inverted due to the vertically connected fixed cells, and the output of the fixed cell +1 of the third layer becomes stronger, and the output of the fixed cell -1 on the first floor becomes weaker. Therefore, the output of the A value is stronger than the B value, and the A value is output. Conversely, if the S value is +1 (binary logic 1), the B value is output because the output of the B value is stronger than the A value. In other words, when A<sub>OUT</sub> and B<sub>OUT</sub> are determined, this value is set as an output and the function of the Mux is performed normally.

The proposed D-latch is shown in Figure 11. By inputting the output of the proposed 2-to-1 Mux back to the center of the circuit, it is designed to maintain the output value in the circuit. The proposed D-latch was designed to set CLK as the value of the selected cell and maintain the previous output value when the CLK is 0. As shown in Figure 11a, the value from the output cell, OUT, is input to the center cell of the circuit every clock along the arrow. Therefore, depending on the selection of CLK, the input value, D, or the previous output value, OUT(t - 1) is determined as the next output value.

#### 3.2. The Proposed 4-Bit SR and LFSR Structure

Figure 12 shows an SR designed by connecting D-latches. The proposed 4-bit SR has four D-latches, and the input values D and CLK are placed on the leftmost side, and the value of the D-latch can be shifted to the right according to the input value of CLK. Two input values are input to the second and third floors, respectively, so that they can proceed



in parallel, and the output is placed on the first and third floors in consideration of the characteristics of the multilayer structure, thereby maintaining the modularity of the circuit.



Figure 11. Proposed D-latch: (a) full circuit; (b) layer 1; (c) layer 2; and (d) layer 3.



Figure 12. Proposed 4-bit SR: (a) Top view; (b) layer 1; (c) layer 2; and (d) layer 3.

Figure 13 shows the proposed 4-bit LFSR structure. It was designed based on the logic diagram introduced in Figure 8 using the 4-bit SR proposed in Figure 12 and the XOR gate proposed in Ref. [10]. A level to edge converter was additionally designed in Figure 14 so that the D-latch can operate as a D flip-flop. This allows the circuit to decide which flip-flop can be either negative or positive edge triggered.



**Figure 13.** Proposed 4-bit LFSR structure: (a) Top view; (b) layer 1; (c) layer 2; and (d) layer 3.



(a)

Figure 14. Cont.





**Figure 14.** Proposed 4-bit level triggered LFSR structure: (**a**) Top view; (**b**) layer 1; (**c**) layer 2; and (**d**) layer 3.

Unpredictable random values created through the LFSR structure can be easily converted into ciphertext through plaintext and XOR operation. It can also be returned to plaintext through the same reverse operation. It is used as the simplest and most secure stream cipher.

# 4. Simulation Results and Analyses

In this section, we present the superiority of the proposed structures by analyzing the simulation results of these structures and comparing them with the latest excellent structures.

#### 4.1. Structural Analysis

The proposed structures were designed and simulated using QCADesigner version 2.0.3. For the simulation, bistable approximation and a coherence vector simulation engine were used, and the parameters used are shown in Table 4.

#### Table 4. Simulation parameters.

| Parameters             | <b>Bistable Approximation</b> | <b>Coherence Vector</b>   |  |  |
|------------------------|-------------------------------|---------------------------|--|--|
| Cell size              | 18 nm                         | 18 nm                     |  |  |
| Dot diameter           | 5 nm                          | 5 nm                      |  |  |
| Cell separation        | 2 nm                          | 2 nm                      |  |  |
| Layer separation       | 11.5 nm                       | 11.5 nm                   |  |  |
| Clock high             | $9.8	imes10^{-22}~{ m J}$     | $9.8	imes10^{-22}~{ m J}$ |  |  |
| Clock low              | $3.8	imes10^{-23}~{ m J}$     | $3.8	imes10^{-23}~{ m J}$ |  |  |
| Clock shift            | 0                             | 0                         |  |  |
| Clock amplitude factor | 2.0                           | 2.0                       |  |  |
| Relative permittivity  | 12.9                          | 12.9                      |  |  |
| Radius of effect       | 65 nm                         | 80 nm                     |  |  |

Figure 15 shows the simulation results of the 2-to-1 Mux. The proposed Mux outputs either an input value A to OUT through  $A_{out}$  or an input value B to OUT through  $B_{out}$ , depending on whether the selection input value, S, is 0 or 1. Figure 16 shows the simulation results of the proposed D-latch. As shown in the truth table in Table 3, when CLK = 1, if

the input value D = 1, the output value OUT = 1, and when CLK = 0, the output value maintains the previous output value OUT = 1, regardless of the input value.

|                 |    |    | - |            |    |             |    | ·   |                                   |
|-----------------|----|----|---|------------|----|-------------|----|-----|-----------------------------------|
| max: 1.00e+000  |    |    |   |            | 1  | 1           | 1  | 1   |                                   |
| S               | 0  | 0  | 0 | 0          |    |             |    |     |                                   |
| min: -1.00e+000 |    |    |   |            |    |             |    |     | Δ                                 |
| max: 1.00e+000  | F  |    | 1 | 1          |    |             | 1  | 1   | out                               |
| Α               |    |    |   |            |    |             |    |     |                                   |
| min: -1.00e+000 | 0  | 0  |   |            | 0  | 0           |    |     |                                   |
|                 |    |    |   |            | _  |             |    |     |                                   |
| max: 1.00e+000  |    | 1  |   | 1          |    | 1           |    | 1   |                                   |
| В               |    |    | ~ |            |    |             |    |     |                                   |
| min: -1.00e+000 | 0  |    | 0 | L          | 0  |             | 0  |     |                                   |
|                 |    | -  |   |            |    | -           | 77 | П   |                                   |
| max: 9.94e-001  | 1. |    | 1 | 1          |    | 1           | 1  |     |                                   |
| A out           | 50 |    | F |            | 50 |             | <  |     | <u>└──</u> ─── <u>└</u> + <u></u> |
| min: -9.69e-001 | 0  | 0  |   |            | 9  |             |    |     |                                   |
| max: 9.68e-001  |    |    |   |            |    | 4           |    | 1   |                                   |
| D and           |    |    |   | <b>7</b> l |    | <u>_+</u> [ |    | յել |                                   |
| Bout            | 1  |    | 0 |            |    |             | പ  |     | ולווזר זר זר                      |
| min: -9.94e-001 |    | 10 | M | ~          | Ц  |             | 4  |     |                                   |
| max: 9.53e-001  | F  |    | 1 | 1          |    | 1           |    | 1   |                                   |
| OUT             | IL |    |   |            |    |             |    |     | ┟┑┯┯┼┹┶┹╗┯┶┹┯┍┸┹╴                 |
| min: -9.52e-001 | 0  | 0  |   |            | 0  |             | 0  |     |                                   |

Simulation Results

Figure 15. Simulation result of the proposed 2-to-1 Mux.



Figure 16. Simulation result of the proposed D-latch.

Figure 17 shows the simulation results of the SR. A 4-bit SR looks like four D-latches joined together. An input signal D and a clock signal CLK exist, and there are four outputs. These four output signals proceed by passing the previous signal in turn. It can be seen that the values in the blue box are sequentially transferred to the next output value. Figure 18 shows the simulation results of the LFSR structure. When CLK changes from 0 to 1, the value changes, and after 15 clocks have passed, it can be seen that it returns to the initial

output value in red box. Therefore, it can be confirmed that the proposed LFSR structure has the maximum period that 4 bits can have.



Simulation Results

Figure 17. Simulation result of the proposed 4-bit SR.



Figure 18. Simulation result of the proposed 4-bit LFSR structure.

#### 4.2. Performance Comparison

Table 5 compares the performance of the QCA 2-to-1 Mux proposed in this paper with the most recent excellent structures. The criteria for comparison are cell count, circuit area, delay time, and the cost calculated as Area  $\times$  Latency. In addition, it was described whether the structure was single-layered or multilayered.

| Circuit   | Cell Count | Area<br>(nm <sup>2</sup> ) | Latency<br>(Clock Cycle) | Cost<br>(Area × Latency) | Structure   |
|-----------|------------|----------------------------|--------------------------|--------------------------|-------------|
| [12]      | 23         | 24,964                     | 0.50                     | 12,482                   | coplanar    |
| [13]      | 12         | 9604                       | 0.25                     | 2401                     | coplanar    |
| [14]      | 15         | 16,284                     | 0.50                     | 8142                     | coplanar    |
| [15]      | 21         | 9604                       | 0.75                     | 7203                     | multi-layer |
| [16]      | 9          | 7644                       | 0.25                     | 1911                     | coplanar    |
| [17]      | 9          | 5684                       | 0.25                     | 1421                     | coplanar    |
| [18]      | 10         | 4524                       | 0.50                     | 2262                     | coplanar    |
| Figure 10 | 13         | 5684                       | 0.25                     | 1421                     | multi-layer |

Table 5. Performance comparison of QCA 2-to-1 Mux.

Among the latest studies, the single-layered Mux proposed in paper [12] has the highest cost at 12,482, and the single-layered circuit using the cell interaction proposed in paper [17] has the lowest cost of 1421. The proposed multilayered Mux has the same cost as the existing lowest-cost circuit, and shows up to 8.8 times better performance compared to the existing circuit. Additionally, it has excellent circuit scalability due to its symmetrical structure.

As shown in Table 6, the cost of D-latch, which has the best performance, is 6903 in paper [24] among the latest studies. The cost of the proposed structures showed the best performance as 3822, and a performance improvement of about 45% compared to the best typical structure, and up to 4.9 times better performance compared to the existing circuit.

| Circuit   | Cell Count | Area<br>(nm <sup>2</sup> ) | Latency<br>(Clock Cycle) | $\begin{array}{l} \textbf{Cost} \\ \textbf{(Area} \times \textbf{Latency)} \end{array}$ | Structure  |
|-----------|------------|----------------------------|--------------------------|-----------------------------------------------------------------------------------------|------------|
| [20]      | 28         | 24,964                     | 0.50                     | 12,482                                                                                  | coplanar   |
| [21]      | 19         | 16,284                     | 0.75                     | 12,213                                                                                  | coplanar   |
| [22]      | 23         | 21,804                     | 0.75                     | 16,353                                                                                  | coplanar   |
| [23]      | 24         | 18,644                     | 1.00                     | 18,644                                                                                  | coplanar   |
| [24]      | 13         | 9204                       | 0.75                     | 6903                                                                                    | coplanar   |
| [25]      | 18         | 9204                       | 1.00                     | 9204                                                                                    | coplanar   |
| [26]      | 27         | 13,924                     | 1.00                     | 13,924                                                                                  | multilayer |
| Figure 11 | 17         | 7644                       | 0.50                     | 3822                                                                                    | multilayer |

Table 6. Performance comparison of QCA D-latch.

Tables 7 and 8 compare the performance of the proposed QCA SR and LFSR with the latest and best studies. The cost of the entire circuit can be calculated as Area  $\times$  Latency, but as the number of bits is different for each circuit, the cost for each bit is calculated again for more objective comparison. In this case, the values after the decimal point are rounded off. It also indicates whether the input and output are in series or parallel.

| Table 7. Performance comparison of QCA | SI | R |
|----------------------------------------|----|---|
|----------------------------------------|----|---|

| Circuit   | Cell Count | Area<br>(nm <sup>2</sup> ) | Latency<br>(Clock Cycle) | Cost<br>(Area × Latency) | Bits | Cost/bit | Туре | Structure  |
|-----------|------------|----------------------------|--------------------------|--------------------------|------|----------|------|------------|
| [28]      | 102        | 81,844                     | 3.00                     | 245,532                  | 3    | 81,844   | SISO | coplanar   |
| [29]      | 127        | 108,564                    | 3.00                     | 325,692                  | 3    | 108,564  | SISO | coplanar   |
| [30]      | 105        | 134,524                    | 2.75                     | 369,944                  | 3    | 123,315  | SISO | coplanar   |
| [31]      | 120        | 28,124                     | 3.00                     | 84,372                   | 3    | 28,124   | SISO | multilayer |
| [32]      | 92         | 68,724                     | 3.75                     | 257,715                  | 4    | 64,429   | SIPO | coplanar   |
| [33]      | 177        | 149,124                    | 2.00                     | 298,248                  | 3    | 99,416   | PIPO | coplanar   |
| Figure 12 | 80         | 33,124                     | 0.75                     | 24,843                   | 4    | 6210     | SIPO | multilayer |

| 17 | of | 19 |
|----|----|----|
|    |    |    |

| Circuit   | Cell Count | Area<br>(nm <sup>2</sup> ) | Latency<br>(Clock Cycle) | $\begin{array}{l} \textbf{Cost} \\ \textbf{(Area} \times \textbf{Latency)} \end{array}$ | Bits | Cost/bit | Туре  | Structure  |
|-----------|------------|----------------------------|--------------------------|-----------------------------------------------------------------------------------------|------|----------|-------|------------|
| [36]      | 440        | 958,324                    | 1.25                     | 1,197,905                                                                               | 5    | 239,581  | Latch | multilayer |
| [37]      | 191        | 230,044                    | 1.25                     | 287,555                                                                                 | 3    | 95,852   | Latch | coplanar   |
| [38]      | 472        | 918,924                    | 0.75                     | 689,193                                                                                 | 8    | 86,149   | Latch | coplanar   |
| [39]      | 226        | 275,044                    | 2.00                     | 550,088                                                                                 | 4    | 137,522  | F/F   | coplanar   |
| Figure 13 | 120        | 87,204                     | 0.75                     | 65,403                                                                                  | 4    | 16,351   | Latch | multilayer |
| Figure 14 | 136        | 121,004                    | 1.50                     | 181,506                                                                                 | 4    | 45,377   | F/F   | multilayer |

Table 8. Performance comparison of QCA LFSR structure.

Among the existing studies, it was confirmed that the multilayered SR of paper [31] had the best performance. It was confirmed that the proposed SR showed superior performance, primarily by 4.5 times, and rising up to 20 times, compared to the recent excellent studies. We have proposed two LFSRs using D-latch and D-F/F, respectively. Compared to the structure using the D-latch, the performance of our D-latch-based structure was 5.3 times to 14.7 times superior, and the D-F/F based structure showed a more than three times improvement in performance compared to the existing best structure.

# 4.3. Energy Dissipation Analysis

The QCADesigner-E [41] tool was used to estimate energy dissipation. QCADesigner-E is an extension of QCADesigner (version 2.0.3) and was developed by the University of Bremen [42]. This application for estimating the power dissipation of QCA circuits is based on the previous studies of Timer and Lent [43,44]. Another tool for estimating power dissipation is QCAPro [45], but it does not calculate the energy dissipation of multilayer structures. Therefore, in this study, QCADesigner-E, which makes it easy to calculate the energy loss of all circuits, was used.

In total energy dissipation and average energy dissipation per cycle, the D-latch-based LFSR structure was 2.2 to 2.4 times superior, and the D-F/F-based LFSR structure showed a performance improvement of more than 20%. As the bit sizes of the existing papers are different, all circuits are increased or decreased at the same rate based on 4 bits for objective comparison. The values in parentheses in Table 9 represent the energy dissipation of the original circuit.

| Energy<br>Dissipation                   | [36]           | [37]           | [38]           | [39] | Figure 13 | Figure 14 |
|-----------------------------------------|----------------|----------------|----------------|------|-----------|-----------|
| Total $(e^{-2} \text{ eV})$             | 7.61<br>(9.51) | 7.92<br>(5.94) | 7.20<br>(14.4) | 4.35 | 3.33      | 3.62      |
| Average per cycle $(e^{-3} \text{ eV})$ | 6.91<br>(8.64) | 7.20<br>(5.40) | 6.55<br>(13.1) | 3.95 | 3.03      | 3.29      |

 Table 9. Energy dissipation comparison of QCA 4-bit LFSR structure.

# 5. Conclusions

In this paper, a 2-to-1 Mux and D-latch with wiring were designed. In addition, we proposed a 4-bit SR with D-latch as each block, and a 4-bit LFSR structure by adding an XOR operation. Additionally, by adding an edge trigger, it was possible to extract the output value according to the change of the clock. The study was designed with a multilayer structure and cell interaction, and the performance was analyzed using QCADesigner, and energy dissipation was obtained using QCADesigner-E. The proposed LFSR structure based on D-latch improved performance by about 5.3 times and reduced energy dissipation by 2.2 times compared to the existing best structure. The proposed LFSR structure based on D-F/F showed that performance was improved more than three times and energy dissipation was reduced by more than 20%. This study proposed a QCA LFSR structure that can be used effectively for quantum random number generation (QRNG). QRNG

is essential for cryptographic purposes, and can be used directly in stream ciphers and generating quantum random values in cryptographic protocols. In addition to this, QRNG is required in various fields in quantum computing environments.

Author Contributions: Conceptualization, H.-I.K. and J.-C.J.; methodology, J.-C.J.; software, H.-I.K.; validation, J.-C.J.; formal analysis, H.-I.K.; investigation, H.-I.K.; resources, H.-I.K.; data curation, J.-C.J.; writing—original draft preparation, H.-I.K. and J.-C.J.; writing—review and editing, J.-C.J.; visualization, H.-I.K.; supervision, J.-C.J.; project administration, J.-C.J.; and funding acquisition, J.-C.J. All authors have read and agreed to the published version of the manuscript.

**Funding:** This work was supported by the Institute of Information and Communications Technology Planning and Evaluation (IITP) Grant by the Korean Government through MSIT (Research on AI-based Cryptanalysis and Security Evaluation) under Grant 2020-00126.

Institutional Review Board Statement: Not applicable.

Informed Consent Statement: Not applicable.

Data Availability Statement: The data presented in this study are available on request from the authors.

Conflicts of Interest: The authors declare no conflict of interest.

# References

- 1. Lent, C.S.; Tougaw, P.D.; Porod, W.; Bernstein, G.H. Quantum cellular automata. Nanotechnology 1993, 4, 49–57. [CrossRef]
- Lent, C.S.; Tougaw, P.D.; Porod, W. Quantum cellular automata: The physics of computing with arrays of quantum dot molecules. In Proceedings of the Workshop on Physics and Computation, PhysComp '94, Dallas, TX, USA, 17–20 November 1994.
- Safoev, N.; Jeon, J.C. Design of high-performance QCA incrementer/decrementer circuit based on adder/subtractor meth-odology. Microprocess. Microsyst. 2020, 72, 102927. [CrossRef]
- 4. Seyedi, S.; Navimpipour, N.J. Designing a three-level full-adder based on nano-scale quantum dot cellular automata. *Photonic Netw. Commun.* **2021**, *42*, 184–193. [CrossRef]
- 5. Erniyazov, S.; Jeon, J.C. Carry save adder and carry look ahead adder using inverter chain based coplanar QCA full adder for low energy dissipation. *Microelectron. Eng.* **2019**, *211*, 37–43. [CrossRef]
- Seyedi, S.; Navimpipour, N.J. Ultra-efficient adders and even parity generators in nano scale. *Comput. Electr. Eng.* 2021, *96*, 107548.
   Safoev, N.; Jeon, J.C. Design and Evaluation of Cell Interaction Based Vedic Multiplier Using Quantum-Dot Cellular Automata. *Electronics* 2020, *9*, 1036. [CrossRef]
- 8. Almatrood, A.F.; Singh, H. QCA circuit design of *n*-bit non-restoring binary array divider. J. Eng. 2018, 2018, 348–353. [CrossRef]
- 9. Kim, H.I.; Jeon, J.C. Non-Restoring Array Divider Using Optimized CAS Cells Based on Quantum-Dot Cellular Automata with Minimized Latency and Power Dissipation for Quantum Computing. *Nanomaterials* **2022**, *12*, 540. [CrossRef]
- Safoev, N.; Jeon, J.C. A novel controllable inverter and adder/subtractor in quantum-dot cellular automata using cell inter-action based XOR gate. *Microelectron. Eng.* 2020, 222, 111197. [CrossRef]
- 11. Jeon, J.C. Designing nanotechnology QCA–multiplexer using majority function-based NAND for quantum computing. *J. Supercomput.* **2021**, 77, 1562–1578. [CrossRef]
- 12. Sen, B.; Goswami, M.; Mazumdar, S.; Sikdar, B.K. Towards modular design of reliable quantum-dot cellular automata logic circuit using multiplexers. *Comput. Electr. Eng.* 2015, 45, 42–54. [CrossRef]
- 13. Asfestani, M.N.; Heikalabad, S.R. A unique structure for the multiplexer in quantum-dot cellular automata to create a revolution in design of nanostructures. *Phys. B Condens. Matter* **2017**, *512*, 91–99. [CrossRef]
- 14. Ajitha, D.; VijayaLakshmi, K.N.V.S.; BhagyaLakshmi, K.; Mehetaj, M. 2:1 MUX Implementation Using NMV-Gate: Non Majority Gate in QCA. In *Emerging Trends in Electrical, Communications, and Information Technologies*; Springer: Singapore, 2020; pp. 557–563.
- 15. Mosleh, M. A novel design of multiplexer based on nano-scale quantum-dot cellular automata. *Concurr. Comput. Pract. Exp.* **2019**, 31, e5070. [CrossRef]
- Majeed, A.H.; Alkaldy, E.; Zainal, M.S.; Navi, K.; Nor, D. Optimal design of RAM cell using novel 2:1 multiplexer in QCA technology. *Circuit World* 2019, 46, 147–158. [CrossRef]
- 17. Majeed, A.H. An ultra-low complexity of 2: 1 multiplexer block in QCA technology. *Indones. J. Electr. Eng. Comput. Sci.* 2021, 21, 1341–1346. [CrossRef]
- 18. Ahmadpour, S.S.; Mohammad, M.; Heikalabad, S.R. Efficient designs of quantum-dot cellular automata multiplexer and RAM with physical proof along with power analysis. *J. Supercomput.* **2022**, *78*, 1672–1695. [CrossRef]
- 19. Seyedi, S.; Navimpipour, N.J. An efficient structure for designing a nano-scale fault-tolerant 2:1 multiplexer based on quantum-dot cellular automata. *Optik* **2022**, 251, 168409. [CrossRef]
- 20. Abutaleb, M.M. Robust and efficient quantum-dot cellular automata synchronous counters. *Microelectron. J.* **2017**, *61*, 6–14. [CrossRef]

- 21. Roshan, M.G.; Gholami, M. Novel D Latches and D Flip-Flops with Set and Reset Ability in QCA Nanotechnology Using Minimum Cells and Area. *Int. J. Theor. Phys.* 2018, 57, 3223–3241. [CrossRef]
- Sasamal, T.N.; Singh, A.K.; Ghanekar, U. Design of QCA-Based D Flip Flop and Memory Cell Using Rotated Majority Gate, Smart Innovations in Communication and Computational Sciences. *Adv. Intell. Syst. Comput.* 2019, 670, 233–247.
- 23. Jeon, J.C. Area Efficient Code Converters Based on Quantum-Dot Cellular Automata. Int. J. Civ. Eng. Technol. 2019, 10, 690–701.
- 24. Majeed, A.H.; Alkaldy, E.; Zainal, M.S.; Nor, D. Novel Memory Structures in QCA Nano Technology. arXiv 2020, arXiv:2007.01954.
- 25. Song, Z.; Xie, G.; Cheng, X.; Wang, L.; Zhang, Y. An Ultra Low Cost Multilayer RAM in Quantum-Dot Cellular Automata. *IEEE Trans. Circuits Syst. II Express Briefs* **2020**, *67*, 3397–3401. [CrossRef]
- Seo, D.K.; Jeon, J.C. Loop-Based QCA RAM Cell Design Using Multilayer-Based D Latch. J. Korean Inst. Inf. Technol. 2020, 18, 25–31. [CrossRef]
- Jeon, J.C. Low Complexity QCA Universal Shift Register Design Using Multiplexer and D Flip-Flop Based on Electronic Correlations. J. Supercomput. 2019, 76, 6438–6452. [CrossRef]
- Das, J.C.; De, D. Operational efficiency of novel SISO shift register under thermal randomness in quantum-dot cellular automata design. *Microsyst. Technol.* 2017, 23, 4155–4168. [CrossRef]
- Divshali, M.N.; Rezai, A.; Karimi, A. Towards multilayer QCA SISO shift register based on efficient D-FF circuits. *Int. J. Theor. Phys.* 2018, 57, 3326–3339. [CrossRef]
- Abdullah-Al-Shafi, M.; Ziaur, R. Analysis and modeling of sequential circuits in QCA nano computing: RAM and SISO register study. Solid State Electron. Lett. 2019, 1, 73–83. [CrossRef]
- Li, T.; Kornovich, R. An Optimized Design of Serial-Input-Serial-Output (SISO) and Parallel-Input-Parallel-Output (PIPO) Shift Registers Based on Quantum Dot Cellular Automata Nanotechnology. Int. J. Theor. Phys. 2019, 58, 3684–3693. [CrossRef]
- 32. Roshan, M.G.; Gholami, M. 4-Bit serial shift register with reset ability and 4-bit LFSR in QCA technology using minimum number of cells and delay. *Comput. Electr. Eng.* 2019, *78*, 449–462. [CrossRef]
- Fan, S.; Khamesinia, M.S. An Efficient Design of Parallel and Serial Shift Registers Based on Quantum-Dot Cellular Automata. *Int. J. Theor. Phys.* 2021, 60, 2400–2411. [CrossRef]
- 34. Purkayastha, T.; De, D.; Das, K. A novel pseudo random number generator based cryptographic architecture using quantum-dot cellular automata. *Microprocess. Microsyst.* **2016**, 45, 32–44. [CrossRef]
- 35. Senthilnathan, S.; Kumaravel, S. Power-efficient implementation of pseudo-random number generator using quantum dot cellular automata-based D flip flop. *Comput. Electr. Eng.* 2020, *85*, 106658. [CrossRef]
- 36. Rezaei, A.; Saharkhiz, H. Design of low power random number generators for quantum-dot cellular automata. *Int. J. Nano Dimens.* **2016**, *7*, 308–320.
- Mohammadi, H.; Navi, K. Energy-Efficient Single-Layer QCA Logical Circuits Based on a Novel XOR Gate. J. Circuits Syst. Comput. 2018, 27, 1850216. [CrossRef]
- Kaviya, M.; Bavithra, S.; Soorya, M.; Sowndarya, S.; Senthilnathan, S. Design of Linear Feedback Shift Register in Quantum Dot Cellular Automata. Int. J. Inf. Comput. Sci. 2019, 6, 2019.
- Amirzadeh, Z.; Gholami, M. Analysis and Design of the Pseudo-Random Bit Generator in the Technology of Quantum-Dot Cellular Automata. Int. J. Theor. Phys. 2020, 59, 29–48. [CrossRef]
- Walus, K.; Dysart, T.J.; Jullien, G.A.; Budiman, R.A. QCADesigner: A rapid design and simulation tool for quantum-dot cel-lular automata. *IEEE Trans. Nanotechnol.* 2004, 3, 26–31. [CrossRef]
- 41. Qcadesigner-e. Available online: https://github.com/FSillT/QCADesigner-E (accessed on 24 March 2022).
- 42. Torres, F.S.; Wille, R.; Niemann, P.; Drechsler, R. An energy-aware model for the logic synthesis of quantum-dot cellular automata. *IEEE Trans. CAD Integr. Circuits Syst.* **2018**, *3*, 3031–3041. [CrossRef]
- 43. Timler, J.; Lent, C.S. Power gain and dissipation in quantum-dot cellular automata. J. Appl. Phys. 2002, 91, 823–831. [CrossRef]
- 44. Timler, J.; Lent, C.S. Maxwell's demon and quantum-dot cellular automata. J. Appl. Phys. 2003, 94, 1050–1060. [CrossRef]
- Srivastava, S.; Asthana, A.; Bhanja, S.; Sarkar, S. QCAPro-an error power estimation tool for QCA circuit design. In Proceedings
  of the IEEE International Symposium Circuits System 2011, Rio de Janeiro, Brazil, 15–18 May 2011.