



# Design and Simulation Analysis of a 3TnC MLC FeRAM Using a Nondestructive Readout and Offset-Canceled Sense Amplifier for High-Density Storage Applications

Bo Peng<sup>1</sup>, Donglin Zhang<sup>2</sup>, Zhongqiang Wang<sup>1,\*</sup> and Jianguo Yang<sup>3,4,\*</sup>

- <sup>1</sup> Key Laboratory of UV Light-Emitting Materials and Technology of Ministry of Education, Northeast Normal University, Changchun 130024, China; pengb806@nenu.edu.cn
- <sup>2</sup> School of Microelectronics, University of Science and Technology of China, Hefei 230026, China; zhangdonglin20@mails.ucas.ac.cn
- <sup>3</sup> Key Laboratory of Microelectronic Devices Integrated Technology, Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, China
- <sup>4</sup> Research Center for Intelligent Computing Hardware, Zhejiang Lab, Hangzhou 311121, China
- \* Correspondence: wangzq752@nenu.edu.cn (Z.W.); yangjianguo@ime.ac.cn (J.Y.); Tel.: +86-0431-85099767 (Z.W.); +86-10-82995585 (J.Y.)

Abstract:  $Hf_{0.5}Zr_{0.5}O_2$ -based multi-level cell (MLC) ferroelectric random-access memory (FeRAM) has great potential for high-density storage applications. However, it is usually limited by the issues of a small operation margin and a large input offset. The study of circuit design and optimization for MLC FeRAM is necessary to solve these problems. In this work, we propose and simulate a configuration for a  $Hf_{0.5}Zr_{0.5}O_2$ -based 3TnC MLC FeRAM macro circuit, which also presents a high area efficiency of  $12F^2$  for each bit. Eight polarization states can be distinguished in a single fabricated  $Hf_{0.5}Zr_{0.5}O_2$ -based memory device for potential MLC application, which is also simulated by a SPICE model for the subsequent circuit design. Therein, a nondestructive readout approach is adopted to expand the reading margin to 450 mV between adjacent storage levels, while a capacitorless offset-canceled sense amplifier (SA) is designed to reduce the offset voltage to 20 mV, which improves the readout reliability of multi-level states. Finally, a 4 Mb MLC FeRAM macro is simulated and verified using a GSMC 130 nm CMOS process. This study provides the foundation of circuit design for the practical fabrication of a  $Hf_{0.5}Zr_{0.5}O_2$ -based MLC FeRAM chip in the future, which also suggests its potential for high-density storage applications.

Keywords: FeRAM; nondestructive readout; offset-canceled sense amplifier

## 1. Introduction

Ferroelectric random-access memory (FeRAM) based on $Hf_{0.5}Zr_{0.5}O_2$ film has attracted great attention because of its potential advantages of fast programming speeds [1,2], low operating power [3,4], and good CMOS compatibility [5,6]. Therefore, $Hf_{0.5}Zr_{0.5}O_2$-based FeRAM is usually considered one of the promising candidates for next-generation nonvolatile memory. In principle, $Hf_{0.5}Zr_{0.5}O_2$-based FeRAM benefits from its small film thickness and its compatibility with advanced CMOS process nodes [7], so it can meet the requirements of today's integrated circuits for high-density storage applications. However, the scaling of FeRAM capacitors is still limited compared to that of transistors, leading to low area efficiency. For instance, the ferroelectric capacitor (FeCAP) area was $40 \times 10^3$ nm<sup>2</sup> for the 28 nm node in the work of Slesazeck et al. [8], and the FeRAM area was 0.49 μm<sup>2</sup> for the 130 nm node in the work of Zhao et al. [9]. For these reasons, multi-level cell (MLC) FeRAM for high-density storage applications has been continuously explored in recent studies. For instance, Asari et al. used a three-pulse accessing scheme to achieve multi-level technology for FeRAM-embedded reconfigurable hardware [10]. Ni et al. demonstrated one type of MLC non-volatile memory by fabricating three ferroelectric-insulator layer-based structures [11]. However, some problems still need to be solved before practical application, such as the small operation margin of MLC FeRAM and the large input offset of the readout sense amplifier (SA). These issues usually lead to the misreading of memory states, thus restricting the application of MLC FeRAM to high-density storage. Circuit design is usually considered a critical step connecting the study of a single device to the practical fabrication of microchips. It can help to solve some problems that cannot be overcome in device studies, and it allows trial-and-error exploration before chip fabrication, which saves cost and time. Therefore, it is necessary to address the small operation margin and the large input offset of MLC FeRAM through circuit design and optimization.

Citation: Peng, B.; Zhang, D.; Wang, Z.; Yang, J. Design and Simulation Analysis of a 3TnC MLC FeRAM Using a Nondestructive Readout and Offset-Canceled Sense Amplifier for High-Density Storage Applications. *Micromachines* **2023**, *14*, 1572. https://doi.org/10.3390/mi14081572

Academic Editor: Piero Malcovati

Received: 30 May 2023; Revised: 7 August 2023; Accepted: 8 August 2023; Published: 9 August 2023

**Copyright:** © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

In this work, we propose a configuration of $Hf_{0.5}Zr_{0.5}O_2$-based 3TnC MLC FeRAM with good area efficiency. A nondestructive readout approach is used, and a capacitorless offset-canceled SA is designed to solve the abovementioned issues, which leads to a wide operation margin and improved read reliability. The experimental electrical characteristics and a SPICE model of a $Hf_{0.5}Zr_{0.5}O_2$-based FeRAM device, which presents eight polarization states for MLC operation, are introduced first. Subsequently, the circuit structure and the operation of a 3TnC MLC FeRAM macro are presented. Then, the capacitorless offset-canceled SA is proposed to minimize the mismatch of the readout transistor and the readout circuit. Finally, the layout of the 4 Mb 3TnC MLC FeRAM is provided with high area efficiency.

## 2. FeRAM Device Characteristics and SPICE Model

Figure 1a shows that the FeCAP cells are integrated between the M5 and M6 metal layers in the GSMC 130 nm logic process. After the front-end-of-line (FEOL) process, the FeCAP device was fabricated using the back-end-of-line (BEOL) process [12,13], as shown in the bottom right inset of Figure 1a. First, a TiN film was deposited as the bottom electrode (BE) by radio frequency (RF) reactive sputtering. Subsequently, a $Hf_{0.5}Zr_{0.5}O_2$ film with a thickness of 10 nm was deposited on the BE via atomic layer deposition (ALD), with a Hf:Zr stoichiometric ratio of 1:1. Finally, TiN was deposited as the top electrode (TE) via RF reactive sputtering, followed by rapid thermal annealing. Through these steps, we fabricated the $Hf_{0.5}Zr_{0.5}O_2$-based FeCAP devices, each with a size of $0.7 \ \mu m \times 0.7 \ \mu m$. The upper right inset of Figure 1a shows the transmission electron microscopy (TEM) image of a single $Hf_{0.5}Zr_{0.5}O_2$-based FeCAP device, which shows its metal-ferroelectric-metal structure and confirms the 10 nm thickness of the $Hf_{0.5}Zr_{0.5}O_2$ film. Figure 1b shows the experimentally measured P–V hysteresis curves and the curves simulated using a SPICE model, in which different sweep voltages of $\pm 1.5$ V, $\pm 2.0$ V, $\pm 2.5$ V, and $\pm 3.0$ V were used to obtain the multiple polarization states. The P–V hysteresis curves were measured using a ferroelectric tester (Precision Premier II, Radiant Technologies, Inc., Albuquerque, NM, USA). Taking the sweep voltage of $\pm 1.5$ V as an example, the remanent polarization (Pr) was estimated to be 13 $\mu$C/cm<sup>2</sup>. The remanent polarization became larger with increasing applied voltage, and these different remanent polarization states indicate the potential for multi-level storage. Figure 1b summarizes the eight positive and negative polarization states measured using the different applied voltages, which can be defined as the states from "111" to "000". Thus, the eight polarization states correspond to three bits in a single MLC FeRAM device.



**Figure 1.** (a) Cross-sectional TEM image of the single  $Hf_{0.5}Zr_{0.5}O_2$ -based FeCAP device; the inset shows the BEOL process of the FeCAP device; (b) measured and simulated polarization–voltage curve with different voltages of  $\pm 1.5$  V,  $\pm 2$  V,  $\pm 2.5$  V, and  $\pm 3$  V; (c) SPICE model of the FeRAM.

To support the subsequent circuit design of the MLC FeRAM, simulation was necessary to ensure a good fit with the experimental P–V curve, so that the simulation results are consistent with the performance of real devices. To simulate the electrical characteristics of our MLC FeRAM, we utilized the physics-based circuit-compatible SPICE model based on the single-domain approximation reported by Aziz et al. [14], as shown in Figure 1c. This model specifically targets the efficient design and analysis of FeFET-based circuits. It is described using the time-dependent Landau–Khalatnikov equation [15] as follows:

$$E - \frac{\rho dP}{dt} = \alpha P + \beta P^3 + \gamma P^5 \tag{1}$$

where $\rho$ is the kinetic coefficient; $\alpha$, $\beta$, and $\gamma$ are the static parameters of the ferroelectric layer; *P* is the polarization of the ferroelectric layer; and *E* is the electric field applied across the FeRAM device. Further, by setting $Q_P$, $T_{FE}$, and $A_{FE}$ as the polarization charge stored in the FeRAM, the thickness, and the area of the FeRAM device, respectively, the time-dependent Landau–Khalatnikov equation can be described as follows:

$$V_{FE} = V_{R_{LK}} + V_{C_{LK}} = \left(\rho \frac{T_{FE}}{A_{FE}} \frac{dQ_P}{dt}\right) + \left[T_{FE} \left(\frac{\alpha Q_P}{A_{FE}} + \frac{\beta Q_P^3}{A_{FE}^3} + \frac{\gamma Q_P^5}{A_{FE}^5}\right)\right] \tag{2}$$
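As a brief consistency check, Equation (2) can be recovered from Equation (1) if one assumes a uniform field and a single domain, so that $E = V_{FE}/T_{FE}$ and $P = Q_P/A_{FE}$ (this substitution is our reading of the definitions above, not an additional result). Multiplying Equation (1) by $T_{FE}$ gives

$$V_{FE} = T_{FE} E = \rho T_{FE} \frac{dP}{dt} + T_{FE}\left(\alpha P + \beta P^3 + \gamma P^5\right), \qquad P = \frac{Q_P}{A_{FE}}$$

Because $dQ_P/dt$ is the current through the device, the first term has the form of a resistive voltage drop with resistance $\rho T_{FE}/A_{FE}$, and the second term is the voltage across the nonlinear capacitor.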

FeRAM is modeled as a nonlinear capacitor ($C_{LK}$) connected in series with a resistor ($R_{LK} = \rho \times T_{FE}/A_{FE}$), in which the nonlinear capacitor is implemented as a polynomial voltage-controlled voltage source (PVCVS). The current flowing through $R_{LK}$ and the PVCVS is sensed by a current-controlled current source (CCCS). The CCCS current then charges a capacitor $C_i$ of 1 F, so that the voltage across the CCCS is numerically equal to the charge $Q_P$ in the FeRAM. Therefore, the dashed block in the diagram implements the expression $T_{FE} \times (\alpha Q_P / A_{FE} + \beta Q^3_P / A^3_{FE} + \gamma Q^5_P / A^5_{FE})$. Finally, the voltage drop across the FeRAM is equal to the sum of the voltage drops across the nonlinear capacitor $C_{LK}$ and the resistor $R_{LK}$, which implements the Landau–Khalatnikov equation. The P–V curve can be simulated by calculating the polarization $P = Q_P / A_{FE}$ and monitoring the applied voltage $V_{FE}$ on the FeRAM. Table 1 summarizes the parameters used in this model for MLC FeRAM, in which $C_{FE}$ is the parasitic capacitance of the FeRAM. By adjusting the PVCVS coefficients ($\alpha$, $\beta$, $\gamma$) and the parasitic parameter, the *P*–*V* hysteresis curves of the MLC FeRAM device were simulated under sweep voltages of $\pm 1.5$ V, $\pm 2.0$ V, $\pm 2.5$ V, and $\pm 3.0$ V. As shown in Figure 1b, the simulated P–V curves fit the experimentally measured data of the FeRAM device well, which supports the use of this model in the subsequent circuit design. This model is used for the design and simulation of the 3TnC MLC FeRAM macro circuit discussed in later sections.

Table 1. The SPICE model parameters of MLC FeRAM.

| Model Parameter        | $\alpha$ (m/F)         | $\beta$ (m<sup>5</sup>/F/C<sup>2</sup>) | $\gamma$ (m<sup>9</sup>/F/C<sup>4</sup>) | $R_{LK}$ ($\Omega$) | $C_{FE}$ (F) |
|------------------------|------------------------|-----------------------------------------|------------------------------------------|---------------------|--------------|
| $\pm 1.5$ V simulation | $-2.25 \times 10^{13}$ | $3.06 \times 10^{39}$                   | $2.3 \times 10^{65}$                     | 1.0 M               | 1.0 f        |
| $\pm 2$ V simulation   | $-2.25 \times 10^{13}$ | $2.06 \times 10^{39}$                   | $8.2 \times 10^{64}$                     | 0.9 M               | 1.0 f        |
| $\pm 2.5$ V simulation | $-2.24 \times 10^{13}$ | $1.62 \times 10^{39}$                   | $2.6 \times 10^{64}$                     | 0.8 M               | 0.9 f        |
| $\pm 3$ V simulation   | $-2.23 \times 10^{13}$ | $1.02 \times 10^{39}$                   | $1.9 \times 10^{64}$                     | 0.75 M              | 0.85 f       |
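To illustrate how the Landau–Khalatnikov equation in Equation (1) produces a hysteresis loop, the following minimal sketch integrates it with a forward-Euler step under a triangular field sweep. It is not the SPICE model used in this work: the function name `lk_loop` is hypothetical, and the coefficients are dimensionless illustrative values chosen for numerical convenience rather than the fitted values of Table 1.

```python
def lk_loop(alpha=-1.0, beta=1.0, gamma=0.0, rho=0.05, dt=1e-3, steps=1000):
    """Forward-Euler integration of rho*dP/dt = E - (alpha*P + beta*P**3 + gamma*P**5).

    All quantities are dimensionless illustrative values (assumption), not the
    Table 1 fit. Returns a list of (E, P) samples for the sweep 0 -> +1 -> -1 -> +1.
    """
    # Triangular field sweep with unit amplitude, one sample per time step.
    sweep = [i / steps for i in range(steps)]                  # 0 -> +1
    sweep += [1.0 - i / steps for i in range(2 * steps)]       # +1 -> -1
    sweep += [-1.0 + i / steps for i in range(2 * steps + 1)]  # -1 -> +1

    p = -1.0  # start in the negative remanent state (sqrt(-alpha/beta) for gamma = 0)
    samples = []
    for e in sweep:
        samples.append((e, p))
        # Explicit Euler update of the polarization.
        p += dt * (e - (alpha * p + beta * p**3 + gamma * p**5)) / rho
    return samples


STEPS = 1000
samples = lk_loop(steps=STEPS)
p_falling = samples[2 * STEPS][1]  # polarization at E = 0 on the falling branch
p_rising = samples[4 * STEPS][1]   # polarization at E = 0 on the rising branch
print(f"P(E=0, falling) = {p_falling:.3f}, P(E=0, rising) = {p_rising:.3f}")
```

The two polarization values at zero field have opposite signs, which is the remanent-state behavior that the P–V loops in Figure 1b exhibit. A larger sweep amplitude or different coefficients would change the loop shape, which is the degree of freedom the fitted parameter sets exploit.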

## 3. Circuit Structure and Operation of 3TnC MLC FeRAM Macro

Figure 2 shows the circuit structure of our 4 Mb 3TnC MLC FeRAM macro, which comprises one 4 Mb bank and the peripheral circuit. The 4 Mb bank consists of eight 512 Kb split banks, and each split bank contains 256 word-lines (WLs) or plate-lines (PLs) and 2048 bit-lines (BLs). One split bank includes four 128 Kb segments, where each segment contains 256 WLs or PLs and 512 BLs. Further, one segment includes sixteen 8 Kb blocks, where each block contains 256 WLs or PLs, 32 BLs, and 32 3TnC arrays. The term 3TnC means that a minimum memory unit involves one pass transistor $Q_{PA}$, one reset transistor $Q_R$, one readout transistor $Q_G$, and one MLC FeCAP. The reset transistor $Q_R$ and the readout transistor $Q_G$ form a common read/write circuit that is shared by the 256 memory units in one array, while the pass transistor $Q_{PA}$ only acts as a switch. Therefore, 3TnC also means that there are three types of transistors ($Q_R$, $Q_G$, and $Q_{PA}$) and 256 FeCAPs in one memory array. In the peripheral circuit, one 1/32 column mux corresponds to one block, so one split bank corresponds to $16 \times 4$ column muxes. Similarly, one split bank includes $16 \times 4$ sense amplifiers. The row driver and decoder are used for addressing and decoding. The local timing control circuit drives the pulse sequences of the write operation and the nondestructive readout. Finally, by selecting one of the eight split banks and one of its four segments, a 16-bit output is obtained from the MLC FeRAM macro.
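The capacity figures of this hierarchy can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the constant names are ours, and the reading that each block contributes one sense amplifier (so that a selected segment yields 16 output bits) is an assumption inferred from the $16 \times 4$ sense amplifiers per split bank.

```python
KB = 1024  # capacities in the text are given in Kb and Mb

WORDLINES = 256              # WLs (or PLs) per block, segment, and split bank
BITLINES_PER_BLOCK = 32
BLOCKS_PER_SEGMENT = 16
SEGMENTS_PER_SPLIT_BANK = 4
SPLIT_BANKS = 8

block_cells = WORDLINES * BITLINES_PER_BLOCK
segment_cells = block_cells * BLOCKS_PER_SEGMENT
split_bank_cells = segment_cells * SEGMENTS_PER_SPLIT_BANK
bank_cells = split_bank_cells * SPLIT_BANKS

bitlines_per_segment = BITLINES_PER_BLOCK * BLOCKS_PER_SEGMENT
bitlines_per_split_bank = bitlines_per_segment * SEGMENTS_PER_SPLIT_BANK

# Assumption: one 1/32 column mux and one sense amplifier per block.
sense_amps_per_split_bank = BLOCKS_PER_SEGMENT * SEGMENTS_PER_SPLIT_BANK
output_bits = BLOCKS_PER_SEGMENT  # one selected segment is read at a time

print(block_cells // KB, "Kb per block")
print(segment_cells // KB, "Kb per segment")
print(split_bank_cells // KB, "Kb per split bank")
print(bank_cells // (KB * KB), "Mb per bank")
```

Under these assumptions the numbers reproduce the 8 Kb, 128 Kb, 512 Kb, and 4 Mb capacities, the 512 and 2048 bit-lines, the 64 sense amplifiers per split bank, and the 16-bit output stated in the text.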



Figure 2. The 3TnC MLC array and peripheral circuit.

To expand the reading margin between adjacent storage levels in MLC FeRAM, we used a nondestructive readout scheme. In the traditional 1T1C array, the destructive readout scheme usually reads out with the power supply voltage $V_{DD}$ and then writes the data back. In comparison, our nondestructive readout scheme uses a read voltage $V_{RD}$ below the coercive voltage, which does not disturb the remanent polarization of the MLC FeRAM levels and thereby avoids misreading the stored data between adjacent levels. This scheme is beneficial for improving the read reliability of MLC FeRAM [16].

The operation sequence of the nondestructive readout scheme is shown in Figure 3. First, in the writing phase, the pass transistor $Q_{PA}$ and the reset transistor $Q_R$ are turned on, which corresponds to asserting the WL and the reset line (RL). Then, either the write pulse $V_{WR}$ is applied to the PL for the data "111", or the write pulse is applied to the source line for the data "000". Second, during the reset phase, the pass transistor $Q_{PA}$ is turned off and the reset transistor $Q_R$ is turned on, which corresponds to deasserting the WL and asserting the RL. This step removes the residual charge on the floating gate of $Q_G$. Finally, in the readout phase, the pass transistor $Q_{PA}$ is turned on and the reset transistor $Q_R$ is turned off, which means the WL is asserted and the RL is deasserted. By applying the voltage $V_{RD}$ (below the coercive voltage) to the PL, the data stored in the FeRAM are read out to the BL through the readout transistor $Q_G$. In the last step, since the reading margin of FeRAM [17]. Therein, the sense margin can reach approximately 450 mV between two adjacent storage levels. This large sense margin meets the requirement for distinguishing the eight different states from "000" to "111" in the MLC FeRAM.
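The three phases can be summarized as a small table of control states. The sketch below is only a bookkeeping aid with hypothetical names (`Phase`, `SEQUENCE`, `describe`); it records the WL, RL, and PL conditions stated in the text and leaves the PL drive as `None` where the text does not specify it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Phase:
    name: str
    wl_on: bool              # WL asserted, so the pass transistor Q_PA conducts
    rl_on: bool              # RL asserted, so the reset transistor Q_R conducts
    pl_drive: Optional[str]  # voltage applied to the PL; None if not stated in the text


SEQUENCE = (
    # For data "111" the PL receives V_WR; for "000" the pulse goes to the source line.
    Phase("write", wl_on=True, rl_on=True, pl_drive="V_WR"),
    # Reset clears the residual charge on the floating gate of Q_G.
    Phase("reset", wl_on=False, rl_on=True, pl_drive=None),
    # Read uses V_RD, which stays below the coercive voltage.
    Phase("read", wl_on=True, rl_on=False, pl_drive="V_RD"),
)


def describe(phase: Phase) -> str:
    """Return a one-line summary of the control state in a phase."""
    wl = "on" if phase.wl_on else "off"
    rl = "on" if phase.rl_on else "off"
    pl = phase.pl_drive if phase.pl_drive is not None else "unspecified"
    return f"{phase.name}: WL {wl}, RL {rl}, PL {pl}"


for phase in SEQUENCE:
    print(describe(phase))
```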

Figure 4 shows the overall pulse sequence diagram of the write–verify scheme. Because the different remanent polarization states of the MLC FeRAM are obtained by applying different voltages, the pulse sequence mode should be 2'b01 or 2'b11. To ensure the correctness of the written data, a verify operation, that is, a readout operation, is added after the write operation. If the read data are the same as the expected data, the verification passes and the pulse sequence continues to write the next adjacent storage level of the MLC FeRAM. If the verification fails, the pulse sequence mode enters 2'b00 or 2'b10 until the verification passes. It should be emphasized that the polarization reversal of the ferroelectric domains is a relaxation phenomenon. Thus, pulse sequences with different pulse widths are required to ensure that the data are written effectively.
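The control flow of a write–verify loop can be sketched in software as follows. This is a generic illustration, not the timing controller of this work: `write_with_verify`, `ToyCell`, and the rule that each pulse moves the toy cell one level toward the target are all hypothetical, and the 2-bit mode codes of Figure 4 are deliberately not modeled.

```python
from typing import Callable


def write_with_verify(
    target_level: int,
    apply_write_pulse: Callable[[int, int], None],
    read_level: Callable[[], int],
    max_attempts: int = 8,
) -> int:
    """Write, then read back; repeat until the read level matches the target.

    Returns the number of write pulses used and raises if verification never passes.
    """
    for attempt in range(1, max_attempts + 1):
        apply_write_pulse(target_level, attempt)
        if read_level() == target_level:  # verify step: a readout after the write
            return attempt
    raise RuntimeError(f"level {target_level} not verified after {max_attempts} pulses")


class ToyCell:
    """Hypothetical cell whose level moves one step toward the target per pulse."""

    def __init__(self, level: int = 0) -> None:
        self.level = level

    def pulse(self, target: int, attempt: int) -> None:
        if self.level < target:
            self.level += 1
        elif self.level > target:
            self.level -= 1

    def read(self) -> int:
        return self.level


cell = ToyCell(level=0)
pulses_used = write_with_verify(3, cell.pulse, cell.read)
print(pulses_used, cell.level)
```

In a real controller the `attempt` argument could select a different pulse width on each retry, which is the adjustment the relaxation behavior of the ferroelectric domains calls for.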



Figure 3. The principle of the nondestructive readout scheme and driving pulse sequence waveform.



Figure 4. The overall pulse sequence of the write-verify scheme.

Owing to the ideal electrical characteristics of the SPICE model of FeRAM, we adopt the 2'b01 mode to simulate the readout voltage distribution for each storage level of the MLC FeRAM. After 10 k Monte Carlo simulations in the 16 Kb MLC array, each storage level can be effectively distinguished without any overlap of the readout voltage distributions, as shown in Figure 5. At the same time, it can be seen that there is a reading margin of nearly 450 mV between adjacent storage levels of the MLC FeRAM.
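To make the role of the 450 mV spacing concrete, the sketch below decodes a readout voltage into a 3-bit state by choosing the nearest level. It rests on several assumptions that are not taken from the simulation results: the lowest level is placed at a hypothetical 0 mV, higher voltages are assumed to map to higher binary codes, and decision thresholds are assumed to sit midway between adjacent levels. The names `decode_level` and `worst_case_margin_mv` are ours.

```python
STEP_MV = 450    # spacing between adjacent levels reported in the text
NUM_LEVELS = 8   # eight polarization states, i.e., three bits
BASE_MV = 0      # hypothetical readout voltage of the lowest level (assumption)


def decode_level(v_read_mv: int) -> str:
    """Map a BL readout voltage in mV to the nearest of the eight levels."""
    index = (v_read_mv - BASE_MV + STEP_MV // 2) // STEP_MV
    index = max(0, min(NUM_LEVELS - 1, index))  # clamp to the valid range
    return format(index, "03b")


def worst_case_margin_mv(sa_offset_mv: int = 20) -> int:
    """Distance from a nominal level to the nearest midpoint threshold, minus the SA offset."""
    return STEP_MV // 2 - sa_offset_mv


print(decode_level(3 * STEP_MV + 200))  # still closest to level 3
print(decode_level(3 * STEP_MV + 230))  # now closest to level 4
print(worst_case_margin_mv())
```

With midpoint thresholds, a level tolerates a shift of up to half the spacing before it is misread, so the residual SA offset directly reduces the usable margin.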



**Figure 5.** The readout voltage distribution diagram for each storage cell level based on the 16 Kb MLC array simulation.

## 4. Capacitorless Offset-Canceled Sense Amplifier

Due to CMOS process variation, there is usually a mismatch in the readout transistor $Q_G$ of the 3TnC cell array and in the readout SA, resulting in a large input offset. To improve the reliability of reading out data stored in adjacent storage levels of the MLC FeRAM, we propose a capacitorless offset-canceled SA to minimize the mismatch of the SA and the readout transistor. Moreover, compared with the single-capacitor offset-canceled SA [18], the capacitorless offset-canceled SA uses the parasitic capacitance of a transistor to replace the single metal/insulator/metal (MIM) capacitor, thus saving chip area.

The mismatch-minimization principle of the capacitorless offset-canceled SA is explained below. First, in the offset cancellation phase, the outputs of the inverters are connected to their inputs, as shown in Figure 6a, which corresponds to closing the switches "pset\_n", "nset", and "S1" in Figure 6b. The parasitic capacitor of transistor Q samples the trip voltages of the inverters, so that the two voltages $V_L$ and $V_R$ appear at the two sides of transistor Q. Second, in the precharge phase, one side of the parasitic capacitor of transistor Q is connected to ground, which corresponds to closing the switches "S2R" and "S1B" while keeping the other switches open. Therefore, the other side of the parasitic capacitor obtains the voltage $V_R - V_L$, which is the difference between the two trip voltages. Third, in the BL sampling phase, the switches "S3R" and "S1B" are closed, while the other switches are open. The reference voltage $V_{ref}$, which differs for the different storage levels of the MLC FeRAM, is added to the voltage $V_R - V_L$, which compensates for the mismatch of the two inverters and thus cancels the offset of the SA. Finally, in the evaluation phase, the switches "pset\_n", "nset", and "S1B" are closed, while the other switches are open. Because the offset has been canceled, the SA can resolve quickly. Figure 6b shows the simulated output waveforms of "BL<0>" and "BL<1>" in the SA under the TT process corner at 3.3 V and 25 °C. Note that the offset cancellation and precharge phases can run concurrently with the reset operation of the 3TnC array, thus avoiding any timing penalty for the proposed method.
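The effect of adding $V_R - V_L$ to the reference can be seen in an idealized behavioral model. The sketch below is not the transistor-level circuit of Figure 6: it assumes that each side of the latch effectively compares its input against its own inverter trip voltage, and the function names and numeric voltages are hypothetical.

```python
def conventional_decision(v_bl, v_ref, v_trip_l, v_trip_r):
    """Idealized latch: each input is measured relative to its own trip voltage."""
    return (v_bl - v_trip_l) > (v_ref - v_trip_r)


def offset_canceled_decision(v_bl, v_ref, v_trip_l, v_trip_r):
    """Same latch, with the reference side shifted by the stored V_R - V_L."""
    stored = v_trip_r - v_trip_l  # difference held on the parasitic capacitor
    return (v_bl - v_trip_l) > (v_ref + stored - v_trip_r)


# Hypothetical mismatch of 50 mV between the two inverter trip voltages.
V_L, V_R, V_REF = 1.60, 1.65, 1.00

# A BL voltage 20 mV below the reference should read as "low" (False).
print(conventional_decision(0.98, V_REF, V_L, V_R))     # mismatch flips the decision
print(offset_canceled_decision(0.98, V_REF, V_L, V_R))  # decision follows v_bl > v_ref
```

In this model the stored difference makes the comparison reduce algebraically to `v_bl > v_ref`, independent of the two trip voltages, which is the sense in which the offset is canceled.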



**Figure 6.** (a) The operation principle of the capacitorless offset-canceled SA; (b) the simulated waveform of the proposed SA.

Figure 7 shows the relationship between the input offset voltage and transistor size for both the proposed SA (capacitorless SA) and the conventional SA (conv. SA). Generally, the mismatch of the transistors in the SA decreases as their size increases, so the input offset of the SA decreases accordingly. Importantly, after 10 k Monte Carlo simulations, the standard deviation of the input offset of the proposed SA is on average nearly 45% lower than that of the conv. SA, owing to its mismatch-minimization principle. Meanwhile, compared to the single MIM-capacitor SA with the same offset voltage and CMOS process, the area of the capacitorless SA is 35% smaller.



Figure 7. The comparison of the offset voltage of capacitorless SA and conventional SA.

## 5. The Layout of 4 Mb 3TnC MLC FeRAM and a Comparison with Other Memory Works

Figure 8 shows the layout of the 4 Mb 3TnC MLC FeRAM, with an area of $3052 \times 4306 \,\mu\text{m}^2$, consisting of the 3TnC cell array, the capacitorless SA, and the other peripheral circuits. The inset shows the layout of a single 3T1C cell array, where "AA" and "GATE" denote the active area and the gate electrode of the transistor, respectively. Table 2 compares the performance of our work with other memory works. The proposed 3TnC MLC FeRAM macro has the advantages of a high area efficiency of $12F^2$ per bit, a large sense margin of 450 mV between adjacent storage levels, and a low offset of 20 mV, which are all beneficial for high-density storage applications. Both the read and write times of the cell are 100 ns, while the maximum power consumption is $48.4 \,\mu\text{W}$ for a read and a write operation. It should be noted that the influence of temperature on FeRAM is relatively small, as reported in [19]; the variation in the readout voltage of FeRAM is about 50 mV, whereas the readout margin between adjacent polarization states of our FeRAM with the 3TnC architecture is 450 mV. Therefore, our 3TnC MLC FeRAM chip has good stability.



Figure 8. The layout of 4 Mb 3TnC MLC FeRAM; the inset shows the layout of a 3T1C cell array.

Table 2. Performance comparison with other memory works.

|                            | This Work | Ref. [8] | Ref. [9] | Ref. [19] |
|----------------------------|-----------|----------|----------|-----------|
| Cell structure             | 3TnC      | 2TnC     | 1T1C     | 1T1C      |
| Technology                 | 130 nm    | 130 nm   | 130 nm   | 130 nm    |
| Multi-level cell           | Yes       | Yes      | No       | No        |
| Area (F<sup>2</sup>/bit)   | 12        | 51       | 36       | 36        |
| SA offset                  | 20 mV     | N/A      | 45 mV    | 18.1 mV   |
| Max sense margin           | 450 mV    | 300 mV   | 270 mV   | 200 mV    |
| Read time                  | 100 ns    | 15 μs    | 150 ns   | 5 ns      |
| Write time                 | 100 ns    | 15 μs    | 150 ns   | 7 ns      |
| Power consumption          | 48.4 μW   | 18 μW    | N/A      | N/A       |
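The area figures in Table 2 are expressed in units of $F^2$ per bit, where $F$ is the feature size. As a small worked example, the sketch below converts the $12F^2$ figure of this work into an absolute area at $F = 130$ nm and compares the $F^2$ figures of the table as ratios. The helper name `area_per_bit_um2` is ours, and the ratios are taken in $F^2$ units only, so they do not depend on the process node of each reference.

```python
def area_per_bit_um2(f2_per_bit: float, feature_nm: float) -> float:
    """Convert an area efficiency in F^2 per bit into square micrometers per bit."""
    feature_um = feature_nm / 1000.0
    return f2_per_bit * feature_um * feature_um


THIS_WORK_F2 = 12
OTHER_WORKS_F2 = {"Ref. [8]": 51, "Ref. [9]": 36, "Ref. [19]": 36}

this_work_area = area_per_bit_um2(THIS_WORK_F2, feature_nm=130)
ratios = {name: f2 / THIS_WORK_F2 for name, f2 in OTHER_WORKS_F2.items()}

print(f"{this_work_area:.4f} um^2 per bit at F = 130 nm")
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x the F^2 per bit of this work")
```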

## 6. Conclusions

In this work, a novel 3TnC MLC  $Hf_{0.5}Zr_{0.5}O_2$ -based FeRAM with a high area efficiency of  $12F^2$  for each bit is proposed for high-density storage application. Eight polarization states (three bits) can be obtained in one MLC FeRAM. The corresponding timing operation using a nondestructive readout is verified via simulation based on the GSMC 130 nm CMOS process. Meanwhile, the readout circuit SA has a low offset of 20 mV and a large sense margin of 450 mV to improve the reliability of the reading of the stored data between each level of the MLC FeRAM. These advantages of 3TnC MLC FeRAM using nondestructive readout and capacitorless SA ensure its potential for future high-density storage applications.

**Author Contributions:** Data curation, B.P. and D.Z.; visualization, B.P.; writing—original draft preparation, B.P.; software, D.Z.; writing—review and editing, B.P., Z.W. and J.Y.; methodology, B.P. and J.Y.; formal analysis, B.P., Z.W. and J.Y., investigation, B.P. and J.Y.; validation, B.P. and D.Z.; funding acquisition, J.Y.; supervision, Z.W. and J.Y. All authors have read and agreed to the published version of the manuscript.

**Funding:** This work was supported in part by the National Natural Science Foundation of China under Grants 92164204, 62222119, and 62025406, and in part by the Major Scientific Research Project of Zhejiang Lab (No. 2022PF0AC01), and the Strategic Priority Research Program of the Chinese Academy of Sciences under Grant No. XDB44000000.

**Data Availability Statement:** The data that support the findings of this study are available from the corresponding author upon request.

**Conflicts of Interest:** The authors declare no conflict of interest.

## References

1. Chang, S.-C.; Haratipour, N.; Shivaraman, S.; Brown-Heft, T.-L.; Peck, J.; Lin, C.-C.; Tung, I.-C.; Merrill, D.R.; Liu, H.; Lin, C.-Y. Anti-ferroelectric Hf<sub>x</sub>Zr<sub>1-x</sub>O<sub>2</sub> Capacitors for High-density 3-D Embedded-DRAM. In Proceedings of the 2020 IEEE International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 12–18 December 2020.
2. Zhao, Y.; Yu, J.; Zhang, D.; Hu, Q.; Liu, X.; Jiang, H.; Ding, Q.; Han, Z.; Cheng, J.; Zhang, W.; et al. A 0.02% Accuracy Loss Voltage-Mode Parallel Sensing Scheme for RRAM-Based XNOR-Net Application. *IEEE Trans. Circuits Syst. II Express Briefs* **2022**, *69*, 2697–2701.
3. Jiao, L.; Zhou, Z.; Zheng, Z.; Kang, Y.; Sun, C.; Kong, Q.; Wang, X.; Zhang, D.; Liu, G.; Liu, L.; et al. BEOL-compatible Ta/HZO/W Ferroelectric Tunnel Junction with Low Operating Voltage Targeting for Low Power Application. In Proceedings of the 2022 International Conference on IC Design and Technology (ICICDT), Hanoi, Vietnam, 21–23 September 2022.
4. Kaur Kingra, S.; Parmar, V.; Verma, D.; Bricalli, A.; Piccolboni, G.; Molas, G.; Regev, A.; Suri, M. Fully Binarized, Parallel, RRAM-Based Computing Primitive for In-Memory Similarity Search. *IEEE Trans. Circuits Syst. II Express Briefs* **2023**, *70*, 46–50.
5. Liu, C.; Yang, J.; Jiang, P.; Wang, Q.; Zhang, D.; Gong, T.; Ding, Q.; Zhao, Y.; Luo, Q.; Xue, X.; et al. A Low Power 4T2C nvSRAM With Dynamic Current Compensation Operation Scheme. *IEEE Trans. Very Large Scale Integr. (VLSI) Syst.* **2020**, *28*, 2469–2473.
6. Chen, D.; Guo, Z.; Fang, J.; Zhao, C.; Jiang, J.; Zhou, K.; Tian, H.; Xiong, X.; Xue, X.; Zeng, X. A 1T2R1C ReRAM CIM Accelerator With Energy-Efficient Voltage Division and Capacitive Coupling for CNN Acceleration in AI Edge Applications. *IEEE Trans. Circuits Syst. II Express Briefs* **2023**, *70*, 276–280.
7. Salahuddin, S. Ultrathin Ferroelectricity and Its Application in Advanced Logic and Memory Devices. In Proceedings of the 2021 IEEE International Reliability Physics Symposium (IRPS), Monterey, CA, USA, 21–25 March 2021.
8. Slesazeck, S.; Ravsher, T.; Havel, V.; Breyer, E.T.; Mulaosmanovic, H.; Mikolajick, T. A 2TnC ferroelectric memory gain cell suitable for compute-in-memory and neuromorphic application. In Proceedings of the 2019 IEEE International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 7–11 December 2019.
9. Zhao, Y.; Wang, Y.; Zhang, D.; Han, Z.; Hu, Q.; Liu, X.; Ding, Q.; Cheng, J.; Zhang, W.; Cao, Y.; et al. A Hf<sub>0.5</sub>Zr<sub>0.5</sub>O<sub>2</sub> ferroelectric capacitor-based half-destructive read scheme for computing-in-memory. *Sci. China Inf. Sci.* **2022**, *66*, 159402.
10. Asari, K.; Mitsuyama, Y.; Onoye, T.; Shirakawa, I.; Hirano, H.; Honda, T.; Otsuki, T.; Baba, T.; Meng, T. Multi-Mode and Multi-Level Technologies for FeRAM Embedded Reconfigurable Hardware. In Proceedings of the 1999 IEEE International Solid-State Circuits Conference, San Francisco, CA, USA, 17 February 1999.
11. Ni, K.; Smith, J.; Ye, H.; Grisafe, B.; Rayner, G.B.; Kummel, A.; Datta, S. A Novel Ferroelectric Superlattice Based Multi-Level Cell Non-Volatile Memory. In Proceedings of the 2019 IEEE International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 7–11 December 2019.
12. Xu, Y.; Yang, Y.; Zhao, S.; Gong, T.; Jiang, P.; Lv, S.; Yu, H.; Yuan, P.; Dang, Z.; Ding, Y.; et al. Robust Breakdown Reliability and Improved Endurance in Hf<sub>0.5</sub>Zr<sub>0.5</sub>O<sub>2</sub> Ferroelectric Using Grain Boundary Interruption. *IEEE Trans. Electron Devices* **2022**, *69*, 430–433.
13. Dang, Z.; Lv, S.; Gao, Z.; Chen, M.; Xu, Y.; Jiang, P.; Ding, Y.; Yuan, P.; Wang, Y.; Chen, Y.; et al. Improved Endurance of Hf<sub>0.5</sub>Zr<sub>0.5</sub>O<sub>2</sub>-Based Ferroelectric Capacitor Through Optimizing the Ti–N Ratio in TiN Electrode. *IEEE Electron Device Lett.* **2022**, *43*, 561–564.
14. Aziz, A.; Ghosh, S.; Datta, S.; Gupta, S.K. Physics-Based Circuit-Compatible SPICE Model for Ferroelectric Transistors. *IEEE Electron Device Lett.* **2016**, *37*, 805–808.
15. Song, T.K. Landau-Khalatnikov simulations for ferroelectric switching in ferroelectric random access memory application. *J. Korean Phys. Soc.* **2005**, *46*, 5–9.
16. Kato, Y.; Yamada, T.; Shimada, Y. 0.18-μm Nondestructive Readout FeRAM Using Charge Compensation Technique. *IEEE Trans. Electron Devices* **2005**, *52*, 2616–2621.
17. Hur, J.; Luo, Y.-C.; Wang, Z.; Shim, W.; Khan, A.I.; Yu, S. A Technology Path for Scaling Embedded FeRAM to 28 nm with 2T1C Structure. In Proceedings of the 2021 IEEE International Memory Workshop (IMW), Dresden, Germany, 16–19 May 2021.
18. Dong, Q.; Wang, Z.; Lim, J.; Zhang, Y.; Shih, Y.-C.; Chih, Y.-D.; Chang, J.; Blaauw, D.; Sylvester, D. A 1Mb 28 nm STT-MRAM with 2.8 ns Read Access Time at 1.2V VDD Using Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination. In Proceedings of the 2018 IEEE International Solid-State Circuits Conference, San Francisco, CA, USA, 11–15 February 2018.
19. Yang, J.; Luo, Q.; Xue, X.; Jiang, H.; Wu, Q.; Han, Z.; Cao, Y.; Han, Y.; Dou, C.; Lv, H.; et al. A 9Mb HZO-Based Embedded FeRAM with 10<sup>12</sup>-Cycle Endurance and 5/7 ns Read/Write using ECC-Assisted Data Refresh and Offset-Canceled Sense Amplifier. In Proceedings of the 2023 IEEE International Solid-State Circuits Conference, San Francisco, CA, USA, 19–23 February 2023.
