Slew-Rate Enhancement Techniques for Switched-Capacitors Fast-Settling Amplifiers: A Review

: This review is aimed at the integrated circuit design community and it explores slew-rate enhancement techniques for switched-capacitor amplifiers, with a primary focus on optimizing settling time within power constraints. Key challenges are addressed, including the selection between single-stage and two-stage amplifiers, along with the utilization of advanced circuit-level techniques for slew-rate enhancement. Presently, there exists a gap in comprehensive discussion, with reliance primarily on two Figures of Merit aimed at assessing power efficiency under specific capacitive loads. However, these metrics fail to adequately assess the performance of the existing slew-rate enhancer solutions at different values of capacitive loads. As a consequence, the designer lacks clear guidelines in practical situations. This review provides a state-of-the art mapping under a figure of merit dedicated to assess the whole settling delay, and also introduces a novel performance metric which highlights the role of the circuital architectures, regardless of external operating conditions. By offering a thorough examination, this review seeks to steer future research in switched-capacitor amplifier design, thereby facilitating informed decision-making and fostering innovation in the field


Introduction
The Switched-Capacitor (SC) technique has gained widespread attention since the late 1970s [1] to implement fundamental analog functions in integrated circuits.Its enduring popularity is rooted in the precise frequency responses and gain accuracy it offers when implementing analog filters.As of today, SC circuits are regularly employed into various domains beyond traditional analog filtering applications.For example, we find plenty of literature regarding SC design of sample-and-hold circuits, track-and-hold [2], analog-todigital (ADC), and digital-to-analog (DAC) converters, with special emphasis on Delta-Sigma modulators (∆ΣMs) [3,4] and sensor interfaces [5].
Important factors that catalyzed the adoption of SC techniques are their optimal match with low-cost CMOS technology: 1.
High-linearity capacitors, ranging from few tens of femto-farads to hundreds of picofarads, can be reliably realized in CMOS technologies, either as metal-insulator-metal (MIM) or metal-oxide-metal (MOM) structures.

2.
Versatile switches are realized with MOS transistors.

3.
The amplifiers involved in the SC circuits are loaded capacitively, hence simple Operational Transconductance Amplifier (OTA) structures are employed with respect to general-purpose Operational Amplifier (OpAmp) circuits.4.
MOS devices at the input of the amplifiers do not draw DC bias currents, and hence charge transfer is precisely controlled over a wide range of clock frequencies.

5.
In contrast to traditional time-continuous operation, the SC approach offers discretetime signal processing.The impact of non-linearity effects on the precision of the system is minimal, considering that system precision is evaluated at the end of discrete time phases, during which the amplifier output should be able to settle.The settling performance is often evaluated at the end of each phase, in terms of relative error with respect to an ideal response, determined by a capacitance ratio.Conversely, in continuous-time systems, non-linearity effects must be minimized throughout the entire transient duration.This distinction is crucial in making design choices for SC circuits, where non-linear circuit schemes are often adopted.
Despite the mentioned advantages, the design of SC circuits comes with various challenges: 1.
Maximize both the DC-gain and unity-gain bandwidth (UGB) of the OTA to meet specific precision specifications [6,7].2.
Reduce charge injections resulting from transitions in the control signals that command the switches [8]. 3.
Minimize noise introduced during signal processing in the charge domain [9].At the same time, maximize the maximum input signal to improve the signal-to-distortion and noise ratio of the system.4.
Settling speed is often traded with power consumption: numerous advanced circuital techniques have been proposed in the literature to obtain more beneficial balance.
Extensive studies on the first two challenges exist in numerous scientific works including the already cited references, and for brevity, they will not be addressed further.This review focuses on the latter two challenges.The objective of this review is to provide an initial tool for navigating the intricate landscape of optimizing the settling time of amplifiers for SC circuits, offering an update until 2024.Additionally, the review critically revisits techniques introduced in recent years concerning settling time enhancement and explores the associated trade-offs with power consumption.The target audience for this review encompasses junior IC designers seeking a succinct exploration of slew-rate enhancement techniques for SC circuits.It also extends to expert professionals who may benefit from a critical discussion on this topic.
Within the myriad of circuit designs incorporating SCs and an OTA, the parasiticinsensitive integrator stands out as one of the most commonly employed.Figure 1 illustrates a standard parasitic-insensitive fully-differential SC integrator, which will serve as a standard case study for analyzing the behavior and merit parameters of various OTA topologies explored in this review.The field of application of an SC integrator is vast: it can be employed in very-low frequency applications such as in temperature smart sensors, up to baseband signal processing in communication systems, operating at hundreds of megahertz.Indeed, meeting high-speed requirements aligns with the maximum transition frequency permitted by the technology at hand: higher bandwidth corresponds to more advanced CMOS technological nodes.However, technological nodes below 20 nm introduce significant phenomena like self-heating and aging, particularly noticeable during low-voltage operation.Although these effects are not addressed in this review, it is anticipated that they will become prominent in the coming years, prompting the development of new design methodologies and circuits.The general approach outlined here, concerning the capacitor network formed by C 1 , C 2 , and C 3 , facilitates the extension of the discussion to vastly different fields of applications.For instance, in typical ADC settings, C 3 is typically in the same order of magnitude as C 1 , C 2 capacitors , whereas in Liquid Crystal Display (LCD) driver applications, C 3 ≫ C 1 , C 2 .
Capacitors C 3p and C 3n represent the capacitive loads applied to the integrator.Phase ϕ reset is used to establish the initial conditions of the integrator state variable.The differential input of the integrator is represented by V ip − V in , while the differential output is represented by V op − V on .For simplicity, the output and the input common mode voltages are identical and equal to V cm .In this configuration, the output is valid at the end of ϕ 2 .

Settling Time and Power Optimization
Figure 2 illustrates the settling process at the differential output of the SC integrator of Figure 1.In this plot, the integration phase, ϕ 2 , starts at t = 0 and its duration is equal to t S .Two scenarios are depicted: (i) a small output voltage upset, denoted as V  [from (7)] for a small input step and for a large input.
In scenario (i), where the SC integrator experiences relatively minor variations across its nodes, the system is typically analyzed through its equivalent linearized circuit.Here, t lin , coinciding with t S , denotes the analytical approach used to determine V Conversely, in scenario (ii), the transient waveform exhibits two distinct regions: 1.
In the initial region, denoted by t sr , the output voltage (V o (t)) shows its maximum slope ( ), which in this phase is practically independent of the input signal.

2.
In the subsequent region, as V o (t) approaches its final value V o (+∞), the behavior resembles that of scenario (i).
The complex behavior observed during large output upsets stems from the maximum available current at the OTA output, denoted as I omax .Throughout t sr , the OTA supplies I omax to the output, driving the equivalent capacitive load, C LE , observed at the differential outputs of the OTA: Here, β FB denotes the feedback factor due to C 1 and C 2 .It is noteworthy that, for t > t sr , the slope of the output voltage is consistently smaller than the initial slope, indicating that the output current provided by the OTA is less than I omax .
The values of both V o (0 + ) and V o (+∞) in Figure 2 depend on several factors: (i) on the charge present within C 2 before the onset of ϕ 2 (resulting in an output voltage of V o (0 − )) and (ii) the charge sampled within C 1 , equal to C 1 V id , with V id = V ip − V in , at the end of ϕ 1 : In these equations, we considered the OTA gain high enough to allow for finite-gain effects to be neglected (Finite-gain effects are not considered in this analysis.The interested reader who wants to delve into this subject can refer to essential books for SC design such as [1,5], as well as more application-specific literature: [3,4] for ADCs, [6,7,10] for SC filters) and the OTA input capacitance to be negligible with respect to C 1 [10,11] (This latter condition will be removed in the analysis developed in Section 2.1).We can observe that the asymptotic output upset, ∆V o (+∞) = V o (+∞) − V o (0 + ), is proportional to V id through a coefficient that depends only on the capacitor network around the OTA: where: Ideally, the integrator output step would be dependent only on the C 1 /C 2 ratio, such as the following: The relationship between the actual and the ideal step is easily found elaborating (3) and ( 5): The coefficient α C is non-zero positive and less than one.It solely describes the ratio between the ideal integrator coefficient and the actual coefficient, which is slightly larger due to the presence of C 3 .
At this point, we focus on the error affecting the output voltage when a finite time interval t S is considered.To this purpose, the output relative error ϵ o (t) is defined as follows: which is schematically illustrated in Figure 2. We are interested in ϵ S , corresponding to ϵ o evaluated at t S : Next, we consider the case of a small output upset, as in case (1) of Figure 2. If the OTA is a dominant-pole system, a single time constant, τ, is needed to closely approximate the output waveform: The time constant τ is related to the gain-bandwidth product of the loop gain (For one-pole systems, characterized by an open-loop pole f p and DC gain where GBW is the gain-bandwidth product: GBW = A 0 f p .In Section 2.1, a design-oriented expression of τ is found for a simple single-stage OTA architecture.Very high-frequency designs (t S ≪ 1 ns) are often characterized by scarce adherence to dominant-pole models: in this case, specialized models need to be developed.This aspect, however, falls outside the scope of this review).The corresponding error can be calculated applying (8): Interestingly, this result indicates that, as long as the system does not enter slew-rate conditions, the relative error remains constant and is determined only by the ratio t S /τ.This outcome was expected since the circuit model we adopted is linear.An important consequence of this assumption is that the integrator does not introduce distortion as long as V id remains small enough.By elaborating Equations ( 1)-( 3), (9), and imposing dV o /dt < I omax /C LE , it can be found that this condition holds as long as the following inequality is verified: The inequality (11) provides a constraint on the maximum allowable value of V id , determined by the characteristics of the capacitive feedback network (k C , C LE ) and the OTA (I omax and τ), without entering slew-rate conditions.
In contrast, case (2) of Figure 2 is distinguished by an initial slew-rate period, t sr , and the respective relative error can be expressed as (The expression can be easily found from the relation in (9) by applying a time-shift of t where: o (+∞) In (12), two major terms need to be considered: rs term reflects the residual voltage interval to be covered by the OTA after it ends the slewing phase.In this term, I omax t (2) sr /C LE corresponds to the portion of the voltage upset related to the slewing phase, ∆V o,sr , also indicated in Figure 2. Intuitively, α (2) sr results in a decreasing function of V id , since the residual voltage tends to very small values as ∆V o (+∞) is increased.However, α (2) sr will never be zero, since a certain amount of linear settling is always due.Notably, t sr results are inversely proportional to I omax , and hence α (2) sr does not depend on the design choices related to I omax .

•
The most important term in (12) is represented by the exponential grow determined by the t sr /τ ratio, causing distortion to rapidly grow with V id , regardless of the partial compensation due to the α sr term.As seen from (11), the V id limit can be increased either by increasing I omax or τ.The latter choice is in contrast with maximum settling time requirements.On the other hand, increasing I omax only during t sr represents a viable solution.Slew-rate enhancing techniques are devoted to reduce t sr /τ in order to maintain acceptable distortion even in the presence of large V id values.The following section delves into analytical aspects of OTAs in order to identify a power-linearity trade-off in the SC integrator.

Simplified Settling Model
Optimizing the settling time of the SC integrator depicted in Figure 1 within a predetermined power budget is a common challenge for designers.In this pursuit, designers frequently encounter constraints related to the dimensions of capacitors C 1 , C 2 , and C 3 .These constraints stem from precision considerations in relation to the kT/C noise limit and the available supply voltage, V DD .This optimization challenge has been extensively studied in the literature, with contributions from various researchers [12][13][14][15][16][17][18].For clarity in the subsequent discussion, a brief overview of the elementary model proposed by these authors is presented here.To streamline the analysis, the circuit in Figure 1 is simplified to the configuration shown in Figure 3.In this simplified representation, the evolution of the integrator output, V o , is determined by the equivalent step at the input, V id .The idealized OTA model involves a few key parameters: G m , I omax , and V dmax .The transconductance G m characterizes the OTA behavior when operating for |V i | < V dmax .Here, V dmax defines the maximum value of the input differential voltage where the OTA output currents remain sensitive to V i .Outside this range, the maximum available current at the output is I omax .Importantly, G m is associated with the small-signal settling behavior of the OTA, while I omax is related to the OTA slew-rate behavior.In an ideal scenario: The parameters G m , V dmax , and I omax are contingent on the specific OTA topology and the corresponding design choices.The subsequent discussion will explore the relationship of these parameters with specific OTA topologies.The static supply current drawn by the OTA is denoted as I sup .Any discontinuities, arising when I omax > G m V dmax , are also incorporated into the I o (V i ) characteristics to account for the settling behavior of OTAs with highly non-linear circuits as the Parallel-type Slew Rate Enhancer (PSRE) circuits, as discussed in Section 3.6.
The analysis of the settling transient is complex due to the non-coincidence of the OTA differential voltage input, V i , with the input step V id .We assume a perfect virtual short-circuit between the OTA inputs at the steady state (V i = 0).However, an instant after the step is applied, V i jumps to the value c 1 V id , where c 1 is a voltage divider coefficient determined by the capacitor network formed by C 1 , C 2 , C 3 , C P [15]: The value of V i (0 + ) establishes the starting point of the settling transient, and the OTA finds itself operating either in the linear region if |V i (0 + )| < V dmax or in the slew-rate region if |V i (0 + )| ≥ V dmax .The coefficient c 1 is close to unity in the ideal conditions of an unloaded OTA (C 3 = 0) and zero input parasitic capacitance (C P = 0).In practice, 0 < c 1 < 1.
Figure 3. Schematic diagram of a fully-differential SC integrator during the settling phase.The capacitor C P is associated to the OTA input device parasitic capacitance.The OTA has the idealized I o (V i ) characteristic shown in the inset plot.I o represent the differential-mode current at the output of the OTA while V i represent the differential voltage at the input of the OTA.This idealized characteristic is fully described by the set of the following three parameters: G m , I omax , V dmax .I sup indicates the current drawn from the supply voltage.
For a single-stage OTA, the expression of the settling time, t set , given a settling precision specification, ϵ S , can be expressed as the sum of the slew-rate period, t sr , and a linear-settling period, t lin : The expressions of various terms in the equation (c 1 , c 2 , C IE ) are detailed in Table 1.Clearly, the C IE /G m term represents the time-constant τ found in (9).Straightforward algebraic manipulations lead to the following: This equation explicitly indicates the role of the feedback factor (With respect to the expression of β FB found in (1), C P has not been neglected.It is important to remark that β ⋆ FB and β FB assume the same conceptual role, and are here distinguished only to highlight the eventual influence of C P .In the following discussion, we will employ β FB , keeping in mind that β ⋆ FB should be used instead in all cases where C P cannot be neglected with respect to C 1 ).β ⋆ FB applies in determining the τ (for a single-stage OTA).
Table 1.Parameters of the simplified settling model for single-stage OTAs.

Parameter Expression Meaning
The model described by Equation ( 16) serves as a qualitative, high-level design tool and cannot substitute transistor-level simulations.Notably, the model exhibits significant inaccuracies in both large-signal and small-signal aspects: 1.
Abrupt transitions between the slew-rate and linear regions.

2.
Oversimplified modeling of G m and V dmax .

3.
Inability to capture the effects of non-dominant singularities associated with OTA internal nodes (The presence of non-dominant singularities is typically addressed by introducing the phase margin parameter.In an ideal one-pole system, the phase margin assumes a value of 90 degrees.In practical designs, a phase margin degradation of up to 20 degrees is often tolerable without significantly affecting settling time.This aspect is discussed in Section 2.2 when two-stage architectures are introduced).
Despite these evident shortcomings, Equation ( 16) remains useful for illustrating specific phenomena emerging during the settling process.This includes the dependence of t S on input amplitude and the influence of G m and I omax .
Equation ( 16) clearly indicates that the slew phenomenon occurs when |V id | > V dmax /c 1 .Consequently, SC integrators dealing with small input voltage steps do not derive significant benefits from the adoption of slew-rate enhancement techniques.In practical applications, this corresponds to scenarios where ∆Σ Modulators employ multi-bit quantisers or Finite Impulse Response (FIR) filters in the feedback paths [19].More commonly, straightforward and simpler implementations with single-bit quantisers are employed.This choice implies higher values of |V id | to be considered.As |V id | starts to increase, the contribution of t sr may dominate over t lin in the overall transient process.
This phenomenon is illustrated in Figure 4a, where Equation ( 16) has been numerically evaluated for three distinct design cases, A, B, and C. The numerical values in this example mimic a sub-unit integrator coefficient (C 1 /C 2 = 0.125), which is common in single-bit ∆ΣMs.In case A, I omax = G m V dmax , while in case B, I omax is four times the value of the previous case while maintaining G m .The comparison between A and B exemplifies the contrast between a standard OTA and an advanced OTA featuring slew-rate enhancement techniques.As will be clarified later, these techniques result in higher I omax availability without ideally increasing static power consumption.Hence, in this idealized experiment, both OTAs in cases A and B have the same I sup .In contrast, OTA in case C has the same standard topology as OTA A, with G m and I omax scaled by a factor of two.Consequently, OTA C doubles the power consumption of OTAs A and B. (a) As anticipated, OTA B outperforms OTA A in terms of settling time for |V id | > V dmax /c 1 , effectively expanding the range of input signals where the settling is acceptable (see Figure 4b).Moreover, OTA B surpasses OTA C, especially for |V id | > 0.9 V. Notably, this achievement is realized with a moderate slew-rate enhancement ratio, K SRE = I omax /(G m V dmax ), set at 4.
As already mentioned, the model has been developed for single-stage OTAs.The most simple example of such an OTA is shown in Figure 5a.For this configuration, V dmax coincides with the theoretical input differential voltage that completely imbalances the differential pair in strong inversion (M1-M2) biased at I 0 .It can be derived from the EKV model equations [20], as follows: Here, W/L is the aspect ratio of the input PMOS devices (M1, M2), µ p is their mobility, and C ox is the gate oxide capacitance per unit area.On the other hand, M1 and M2 are characterized also by g m,in , producing small-signal differential currents conveyed to the output nodes through current mirrors formed by M3-M5 and M4-M6.If those mirrors have a current amplification of k m , the resulting G m and I omax are easily found: The circuit in Figure 5b is a classical Miller-compensated two-stage OTA.Its analysis will be developed in the following section, together with considerations regarding the power efficiency of two-stage designs.

Considerations on Single-Stage and Two-Stage OTA Architectures
For OTAs designed to be used in place of operational amplifiers, the number of stages has a direct consequence on the maximum DC gain that can be attained, especially in the case of a resistive load or, equivalently, a resistive feedback network that has to be driven.In low voltage applications, where the voltage headroom is insufficient for cascode stages, multi-stage OTAs are a mandatory choice to maintain a sufficient DC gain even in the case of purely capacitive loads.Conversely, the effect of the number of stages on the slew-rate performance is not straightforwardly clear, and a comprehensive analysis is required to demystify some common misconceptions.In this section, we will limit the analysis to the comparison between single and two-stage OTAs, focusing on the trade-off between slew-rate and quiescent power consumption (Multi-stage architectures can be further considered based on the considerations that will be discussed for the two-stage OTAs.The interested reader may refer to dedicated works regarding the analysis of settling transients of multi-stage OTAs [17,[21][22][23][24]).
To this aim, it is convenient to express the maximum output current of a single-stage OTA, I omax , and its overall transconductance, G m , as a function of the quiescent supply current, I sup , by the following relationships: where k I strongly depends on the OTA topology, while voltage V TE plays the role of the inverse of the g m /I D parameter in single mosfets.For a single-stage OTA, the slew rate can be simply related to the maximum output current by Equation ( 21) suggests that the current vs. slew-rate trade-off can be well represented by the s r C LE /I sup efficiency factor.For a single-stage OTA, this factor coincides with k I .A two-stage amplifier can be modeled by the simple single-ended circuit of Figure 6, which is representative also of differential/fully-differential architectures when only the differential mode is considered.Components in red in Figure 6 indicate the capacitive feedback network, which is equivalent to that of Figure 3.The slew rate is the minimum between the two values s r1 and s r2 , related to the input and output stage, respectively, given by We will focus on cases where s r1 is smaller or equal to s r2 , so that the slew rate coincides with s r1 (dominance of s r1 ).From (22), the following condition on C C can be derived: The advantage of making s r1 dominating stands in its expression: notice that the load capacitance is not present.Properly choosing C C ≪ C LE , it should be possible to obtain relatively large slew rates vs. supply current efficiencies even in the presence of a large load capacitance.Equation (23) suggests that a C C value much smaller than C LE can be obtained, with no penalty in terms of I sup2 , using a class-AB second stage, resulting in a very large k I2 coefficient, while the first stage is a conventional class-A configuration.Very effective class-AB output stages are based on the popular Monticelli's solution [25], but alternatives compatible with very low supply voltages are present in the literature [26,27].In practice, stability issues prevent increasing s r1 without also increasing the static current absorption of the second stage, limiting the real advantage that can be obtained with small C C values.
Well-known considerations on the open-loop frequency response impose the following: where β FB is the feedback factor previously defined in (1) while k ϕ is a parameter that sets the phase margin.The formal definition of k ϕ is [28]: where f p2 is the entity of the second pole while GBW is the gain-bandwidth product of the OTA.A typical choice is k ϕ = 3 to obtain a phase margin of nearly 70 degrees, which guarantees that no peaking is present in the frequency response when the OTA is configured as a unity-gain buffer.At the same time, the choice of k ϕ = 3 also corresponds to a damping factor of 0.87, which avoids overshoots in response to step-like input stimuli [29].Although a single time-constant behavior (as in (9)) is not rigorous, it can be considered a valid first-order approximation for the remainder of the discussion of this review.Using (20) to express G m1 and deriving G m2 from (24), it is possible to obtain a relationship between the supply currents of the first and second stages: Finally, considering that the total current absorption of the two-stage amplifier is I sup1 + I sup2 and expressing I sup1 as a function of s r1 using (22), the total current absorption I tot can be calculated as a function of the slew-rate: Note that the first factor in (27), namely s r C LE /k I1 , is the power consumption of a single-stage OTA with output current efficiency k I1 that directly drives the load capacitance.The terms in round brackets represent the effect of having a two-stage architecture.With the arguments exposed above and taking into account condition (23), it is possible to make the ratio C C /C LE much smaller than one.As far as the remaining term in the round brackets, to obtain a real advantage from a two-stage architecture, we need to make Satisfying (28) means making the first stage widely suboptimal regarding its capability of converting the supply current into its effective transconductance.This can be detrimental when there are strict thermal noise specifications.It is worth observing that for single-stage amplifiers commonly used (used as stand-alone OTAs or to compose two-stage amplifiers), voltages V TE are proportional to the overdrive voltages of the input devices (for mosfets operating in strong-inversion).Considering the constraints that apply to overdrive voltages, it is clear that in many cases of interest the factor in round brackets in ( 27) is close to one or even greater than one.We can conclude this analysis by stating that, taking into account constraints that frequently occur in OTA design, a two-stage architecture does not offer important advantages with respect to single-stage OTAs in terms of slew-rate vs. current consumption trade-off.An important exception is represented by the case of OTAs that have to be used with a very low feedback factor (i.e., with β FB ≪ 1).This is the case, for example, of high-gain switched-capacitors amplifiers.Inspection of (27) reveals that in such cases it is possible to obtain a significant advantage (i.e., a smaller supply current) over a single-stage architecture.Unfortunately, in switched-capacitors integrators used in ∆ΣMs, it is much more likely to have β FB factors close to one.
Clearly, class-AB solutions that were mentioned as a requirement for the second stage to make s r1 dominant and then validate the subsequent analysis can be used also for single-stage OTAs to boost k I in (21) and, consequently, the efficiency.
The analysis conducted in Section 2.1 can be partially adapted to describe the settling behavior of the two-stage OTA of Figure 5b.The second stage, implemented through transistors M9, M10, M11, and M12, as well as the C C compensation capacitors, also introduces additional degrees of freedom in the design.In the case of a dominant-pole design, relations in ( 16) can be adjusted as follows: Notably, the most important design parameters V dmax1 , G m1 and I omax1 are referred to the first stage, which is now loaded by C C .The derivation of (29), however, neglects a number of important effects due to the non-dominant singularities.In [21], analytical aspects are fused with behavioral modeling in order to provide guidelines for rational design.
Section 3.1 is dedicated to review the most noticeable techniques used to implement differential-type Class-AB stages that are suitable to be employed both as single-stage OTAs or as the first stage in multistage OTA architectures.

Figures of Merit (
where C LE , the gain-bandwidth product GBW and the slew rate s r are confronted against P = V DD • I sup which represents the total static power consumption of the OTA.FOMS primarily describes the small-signal power efficiency, whereas FOML primarily characterizes the large-signal power efficiency.In simpler terms, FOMS and FOML aim to gauge the efficiency of achieving a specific gain-bandwidth product and slew rate, respectively, within defined power and load capacitance constraints.However, these FoMs have limitations as they separately address either small-signal or large-signal transients.In reality, both types of transients influence the settling time, as elaborated in Section 2. A straightforward classification of the state-of-the-art, based solely on FOMS and FOML, would result in the tabular output as shown in Table 2 (Reference pool: [15,26,).This approach, however, is overly simplistic and does not ensure that a solution optimal for a specific loading condition or maximum input step maintains the same power efficiency (either in terms of small signal or large signal) to meet varying requirements.
A more contextualized comparison is offered when different loading conditions are distinguished: in this regard the state-of-the-art designs have been plotted on the FOMS-C LE and FOML-C LE planes in Figure 7.Each point on the plots represents a specific design operating under particular conditions.In cases where the same design is reported with different operating conditions, a line has been drawn to group such instances.Additionally, specific graphic markers have been employed to indicate whether the design utilizes a single-stage, two-stage, or three-stage architecture.
The designs depicted in the plots incorporate advanced slew-rate enhancement techniques, which will be elaborated upon at the circuit level in Section 3.However, the apparent scattered distribution of the designs in Figure 7 does not facilitate the identification of the effectiveness of the various circuit techniques.66.67 [64] 130.19 [38] 66.67 [65] 109.39 [40] 45.71 [63] 106.35 [33] 20.72 [44] 80.23 [42] 17.71 [40] 22.97 [85] 15.38 [26] 16.61 [26] 9.75 [33] 14.25 [30] 6.07 [75] 13.52 [41] 5.72 [36] 11.79 [72] 5.21 The situation changes if a more comprehensive FoM is introduced, specifically designed to highlight the whole settling behavior: resulting in the panorama of Figure 8.To the best of the authors' knowledge, this FoM was introduced for the first time in [85] in the field of drivers for LCD applications.A trend line is also depicted in the plot, representing the state-of-the-art frontier.This configuration explicitly highlights the challenge of designing efficient SC circuits as C LE approaches the values of non-dominant parasitic capacitors.To aid in the interpretation of the plot, two regions have been delineated using a C LE value of 10 pF as a boundary.Interestingly, the most efficient designs in the region where C LE < 10 pF are primarily based on two-stage architectures, whereas single-stage and three-stage configurations dominate in the region where C LE > 10 pF.However, previous FoMs do not take into account the role played by the amplitude of the input voltage step, V id , in affecting settling performances.Depending on the value of V id , determined by the intended application, the output voltage transient may either be entirely linear or exhibit slew-rate effects.Therefore, considering the diversity of SRE applications targeted by the works considered in this review, we hereby introduce a new FoM, denoted as FOM * , defined as follows: where I sup indicates the integrator static supply current.Conversely with respect to previous FoMs, FOM * is a dimensionless quantity.Moreover, under simplified assumptions, it can be shown that FOM * is a function of only topology-related parameters.Specifically, by simply assuming the output transient to be dominated by the slew-rate effect (t sr ≫ t lin ), up until reaching the required settling accuracy, the proposed FoM can be rearranged into Evidently, dependencies with respect to the main degrees of freedom of the SC integrator design (such as I sup , C L , and V id ) vanish.Residual dependencies only involve the settling accuracy specification (ϵ S ) (This term is ≪ 1, however, hence its influence is negligible in practical cases), parameters related to the capacitive feedback network (β FB , c 1 ), and the OTA efficiency in terms of maximum output current (I omax /I sup ).Clearly, as far as real circuit operation scenarios are concerned (where the linear settling phase is also relevant), (33) results in an oversimplified expression of FOM * ; yet, the proposed FOM * can be expected to hold its link to the intrinsic slew-rate efficiency of the SRE-assisted OTA.The state-of-the-art designs are mapped into the plot of Figure 9.As in the previous case, a design front is evident, reinforcing the thesis of the difficulty of reaching high efficiency when small C LE are targeted.

Cells and Methods for OTA Enhancement
The main functional components of the simple mirror-based OTA are identified in Figure 10.They consist of the biasing, the transconductive core, and the current mirroring section.For the standard OTA, the biasing is fixed to I 0 (Class-A operation), the transcodunductive core is based on the input differential pair, and the current mirroring section is implemented by the use of standard current mirrors.
Advanced OTA architectures modify one or more of the aforementioned functional components in order to enhance both the G m and/or the I omax parameters discussed in the simplified model of Section 2.1: the Flipped-Voltage Follower circuit, discussed in Section 3.2, allows Class-AB biasing; the current-recycling and the mirror-nesting techniques, presented in Section 3.3, are devoted to enhance the steering capability of the transconductive core; non-linear current mirrors, described in Section 3.4, are employed to implement Class-AB operation at the output branches of the gain stage only.Furthermore, Compound Body-biased Mosfets, discussed in Section 3.5, are included in this discussion as an interesting technique when targeting low-voltage applications.Finally, Paralleltype Slew-Rate Enhancers, which operate as auxiliary circuits around the main OTA, are discussed in Section 3.6.For the explanation of the advanced subcircuits, the simple yet powerful EKV model is employed, which is able to capture both strong and weak inversion operation of the MOS in an amenable design-oriented fashion [20].In this model, V T , U t , n, and β stand for the threshold voltage, thermal potential, subthreshold slope, and current factor, respectively.

Flipped Voltage Follower (FVF) Cell
The Flipped-Voltage Follower (FVF) is a versatile cell formally described in [86], although it was already introduced in [87].Among the many uses that this cell can offer, we will focus on its use as (i) current buffer loop for class-AB current biasing and (ii) lowvoltage current mirror.These functionalities are illustrated in Figure 11a and Figure 11b, respectively.
Looking at Figure 11a, two distinct FVFs are present: FVF1 is composed by M0c-M01-M0a, while FVF2 is composed by M0d-M02-M0b.Both FVF1 and FVF2 will be referred to as PMOS-FVFs to distinguish them from the complementary implementation (NMOS-FVF) of Figure 11b.M0a and M0b set the bias currents I 0 of FVF1 and FVF2, respectively.Thanks to the loop around M01-M0c (M02-M0d), a constant V GS is set for M01 and M02, regardless of the current absorbed at the sources of M1 and M2.Assuming V p and V n to follow V i = V p − V n and V cm = 1 2 (V p + V n ), the drain currents of the PMOS input devices, M1 and M2, can be found as follows: In these expressions, β is related to M1 and M2, while β 0 is related to M01 and M02.While the quiescent operating current of M1 and M2 is set to be (β/β 0 )I 0 , its magnitude grows quadratically with |V id |, hence implementing a compact class-AB biasing of the transcoductive core of the OTA.Eventually, M0e and M0f can be added to the structure to provide a copy of I p and I n directly to OTA output branches.The circuit is able to operate with V DD as low as |V T | + 2|V DSsat |.However, V cm needs to be set in order to maintain M0c and M0d in the saturation region of operation.For the PMOS-FVFs, this limit is also expressed by The FVF structure illustrated in Figure 11b implements a low-voltage current mirror.It is able to absorb high-dynamic currents I p , I n , eventually sourced from the transconductive core of Figure 11a, and to mirror towards the output device M0e.
A possible limitation concerning the FVF is the fact that voltages V cm and V ′ c need to be extracted from the inputs V p and V n by dedicated auxiliary circuits in order to maintain the correct biasing of the OTA, thus increasing its complexity.In some implementations, the gates M01 and M02 are connected to V n and V p , respectively, simplifying the extraction of V cm and incrementing also the class-AB current boosting, at the cost of even more limited input range.Regarding offset and input-referred noise, the configuration depicted in Figure 11 is evidently less effective than a standard differential pair, primarily due to the increased number of mismatch and noise contributors.Designers are therefore advised to carefully consider this aspect.

Current Recycling and Mirror Nesting
The current recycling technique was originally proposed in [49] in order to enhance both the transconductance and the slew-rate of a standard folded-cascode OTA.The transcoductive core of the circuit is illustrated in Figure 12a.New design parameters, related to geometrical ratios between the involved transistors, are introduced in this structure: k a , k b , n a , m a .These parameters affect both the bias and the signal-dependent components of the currents in the branches.The input pair bias current provided by M0 is now considered to be equal to 2(k a + k b )I 0 .In the following, we consider (M1a, M2a) and (M1b, M2b) as composed by a k a -parallel and k b -parallel, respectively, of identical (unitary) transistors.Each unitary transistor is biased, ideally, by a current equal to I 0 .The small-signal transconductance of unitary transistors will be referred to as g m0 .As indicated in Figure 12a, the bias current component in the folded branches (e.g., through MC3 and MC4) is as follows: which clearly imposes the following constraint: k b < n a m a k a .On the other hand, G m and I omax are easily found by circuit inspection as follows: The portion of the static supply current drawn by the transcoductive core results is as follows: At this point, simple efficiency metrics can be introduced: These metrics show that the slew-rate efficiency, η SR , of the recycling folded cascode can be increased by increasing k b /k a .Since the bias constraint imposes n a /m a > k b /k a , the transconductance efficiency, η G m , tends to saturate for large values of k b /k a .Moreover, large n a /m a ratios tend to erode the OTA phase margin due to the zero-pole doublet associated to the current mirrors formed by M3a-M5a and M4a-M6a.In practice, the set of parameters The transconductive core of the circuit in Figure 12b follows a similar principle.Differently from the previous case, the low-impedance nodes are all constituted by input sections of NMOS current mirrors.This peculiarity allows for repeating the same structure iteratively, furtherly boosting the transconductance and slew-rate efficiencies with respect to the standard current-recycling technique.Such a circuit topology is known as nestedmirror OTA.Similar considerations as for the recycling folded cascode apply also in this case, with the only difference that here one more zero-pole doublet is introduced for each nesting iteration.The structure was introduced in [88] and later expanded to multiple (>2) nesting iterations in [36], becoming a de facto reference structure for very large output capacitive loads.

Non-Linear Current Mirrors
Non-linear current mirrors are used to enhance the current mirroring section of the OTA of Figure 10.These mirrors intentionally establish a non-linear connection between the input current, I 1 , and the output current of the mirror, I 2 .This deliberate non-linearity is introduced to realize a specific I 2 (I 1 ) characteristic, often designed to exhibit a super-linear behavior.This feature is used to deliver high-class AB currents to the output of the OTA, without resorting to high-current amplification ratios of the current mirror, which would also result in amplification of the static bias currents.
Various techniques can be employed to achieve a non-linear I 2 (I 1 ) characteristic.To present a comprehensive overview, let us examine the current mirror cells depicted in Figure 13.In Figure 13a, a controlled voltage generator, V C , is placed in series with the input section gate voltage, V G .Referring to a general case, regardless of the circuital implementation of V G , we assume V C (I 1 ) is such that dV C /dI 1 > 0 in all the operation intervals of interest of I 1 .In this analysis, the drain of M2 is set to a suitable voltage in order to maintain the saturation region, Since the source and bulk terminals of M1 and M2 are grounded, I 1 and I 2 are determined by their respective gate voltages, V G and V G + V C .By means of a simple mathematical analysis, we find: Hence, I 2 (I 1 ) shows a super-linear characteristic, by virtue of the only condition dV C /dI 1 > 0. The circuit in Figure 13b introduces a minor modification compared to Figure 13a.In this configuration, V G remains constant at the quiescent value established by I 1Q , regardless of variations in I 1 .This circuit is introduced to mirror the circuit implementation depicted in Figure 14a, which will be discussed later.
In the source-series V C configuration depicted in Figure 13c, we can apply considerations that are analogous to those already discussed for the gate-series configuration in Figure 13a.Meanwhile, in Figure 13d, a straightforward implementation utilizes a resistor.For this specific scenario, V C = RI 1 , leading to dV C /dI 1 = R.However, this solution is sub-optimal for low-current (low-power) circuits, where a large value of R is required to achieve a significant class-AB boosting effect.In such cases, R can be replaced by triode-operating MOSFETs [89].
Triode-region operation (b) Finally, Figure 13e illustrates a scheme that realizes body modulation at the input device.Clearly, this technique can be implemented only if the active devices can be put in an isolated well.Both I 1 and V C (I 1 ) are forced by the external circuits: V G increases if either I 1 or V C increases.As a consequence, I 2 increases as well, tracking the increments of both I 1 and V C .Simple algebraic manipulations lead to the following: 2 (strong inversion), These expressions also follow the relationships outlined in (39).An important consideration in body biasing is the potential turn-on of the body-well junction when V C increases excessively.This must be avoided, as it would lead to increased power consumption without providing any additional boost to the output current I 2 .
Two significant implementations of the aforementioned techniques are presented in Figure 14a and Figure 14b, respectively.Both circuits employ a differential configuration in order to provide a compact implementation of the control voltage generator V C .
The circuit in Figure 14a is known as local common-mode feedback circuit (LCMF) and it is related to the configuration discussed in Figure 13b.Here, V 0 , corresponding to the gate voltage of the input devices, M1 and M2, is kept constant, provided that the input currents I p and I n have a constant common mode.In the quiescent operation point I p = I n , hence V q = V m = V 0 .Additionally, the quiescent point of the output currents, I pp and I nn , is nominally identical, defined according to the model of a standard current mirror.Any differential component flows through the 2R series, creating an imbalance between V q and V m , expressed by the following equations: which triggers the class-AB boosting of the output currents, discussed earlier.The choice of the R resistors values is crucial and should align with the bias levels of the quiescent currents.If R is too low with respect to 1/g m1,2 , it results in weaker class-AB boosting.Conversely, if R is excessively high, the nodes at the drain terminals of M1 and M2 start to be affected by inertial effects, potentially compromising the OTA phase margin.The implementation in Figure 14b provides a compact realization of the principle explained in Figure 13c [90].Devices M1, M2, M3, and M4 operate in saturation and are nominally identical, while devices M1a, M2a, M3a, M4a, M1b, and M2b typically operate in the triode region.The drain voltages of the latter group of devices are labeled as V 1 , V 2 , V 3 , and V 4 , as illustrated in Figure 14b.Additionally, these devices are characterized by the following aspect ratios: A for M1a and M2a, B for M1b and M2b, and C = A + B for M3a and M4a.
Assuming ideal matching of the devices, operated with I p = I n , and considering the electrical and geometrical symmetry of the circuit, it can be shown that Any imbalance in the input currents will manifest as an imbalance between V q and V m , as well as between the drain voltages of the triode-operating devices.While V 3 depends only on V q and V 4 depends only on V m , V 1 and V 2 depend on both V q and V m .As a result, any imbalance between V q and V m breaks the electrical symmetry of the mirrors, leading to non-linear characteristics in the resulting output currents with respect to the input currents (This consideration can be easily translated to input-referred offset worsening as far as mismatch is considered.Again, the designer is warned to carefully check this aspect).
Detailed analysis can be retrieved from the original work [35].Here, our analysis is limited to the case of perfectly balanced differential driving: where I cm represents the fixed common-mode current, and I dm represents the variable differential-mode current.This driving scheme mimics the standard class-A differential pair circuit.At this point, we observe that, despite their non-linear behavior, the triode-operating devices can be treated as source-degeneration resistors.Focusing on the current mirror composed by M1 and M3, we can identify the following degeneration resistors: R 3 (V q ), corresponding to M3a, R 1a (V q ), corresponding to M1a, and, finally, R 1b (V m ), corresponding to M1b.Since R 3 and R 1 are connected to the same controlling voltage V q , the ratio R 1a /R 3 = C/A = 1 + B/A depends only on the aspect ratios of the transistors.
As V m decreases, concurrently with an increase in V q , it follows that R 1b > R 1a .This mechanism, involving M1b, results from a positive feedback path.However, this is counterbalanced by the parallel negative feedback path involving M1a.As M1a and M1b are related to the geometrical parameters A and B, the amount of positive feedback becomes a degree of freedom in the hands of the designer.
Eventually, as I dm increases, the V q − V m imbalance reaches a condition where R 1b ≫ R 1a : at this point, the source degeneration of M3 and M1 is determined only by R 3 and R 1a , respectively.Since R 1a > R 3 by geometrical construction, I pp will be greater than I p by an amount related to B/A, i.e., the amount of positive over negative feedback.An excess of positive feedback may induce latching problems as well as slow response of the Class-AB currents, degrading the settling period.The exact relationship between I pp and I p , as derived in [35], depends on the operation region of the devices.

Compound Body-Biased Mosfets
The compound body-biased MOSFETs (CBBM) technique, introduced in [78], aims to enhance the transistor intrinsic DC gain without compromising the frequency response.This technique is not directly linked to slew-rate enhancing per se.However, since it can be amalgamated with the previous circuital techniques in low-voltage scenarios, it deserves to be discussed in this review.
The CBBM technique employs a pair of stacked MOSFETs, MA and MB, as illustrated in Figure 15.The configuration resembles a pseudo-cascode structure, with the only difference being that the bulk terminal of MA is utilized to bias MB into the saturation region.In a standard pseudo-cascode compound, V B = 0, forcing MB to operate in the triode region.However, in the CBBM structure, V B > 0 to reduce the effective threshold voltage of MA.When V B reaches a proper value, V B,min , V X becomes high enough to allow MB to exit the triode region.It is important to note that setting V B > 0 implies that the technology allows for isolated wells.Consequently, a maximum value of V B , denoted as V B,max , must be observed to avoid the turn-on of the body-well junctions.Therefore, this technique is viable only if V B,min < V B,max .In the upcoming discussion, we will demonstrate how V B,min can be established through design considerations by manipulating the sizing of MA and MB.We will focus on two distinct design cases: (i) both MA and MB operating in strong inversion, and (ii) both MA and MB operating in weak inversion.While mixed cases are also analyzable, they will not be covered in this text.To facilitate our discussion, we will adhere to the following simple convention: β A and I SA will be assigned to MA, while β B and I SB will be assigned to MB. ISA and I SB indicate the weak-inversion specific currents of the two transistors.Now, we can express the following equations, assuming V S = 0 (i.e., the CBBM source node corresponds to our reference node): For MB to operate in the saturation region, we can elaborate Equation (43) to explicitly express V X : The inequalities of Equation ( 44) can be elaborated to find expressions for V B,min : (weak inversion) (45) In the weak inversion case, a further condition applies: The advantages of the CBBM structure are evident when analyzing the small-signal parameters of the circuit.Referring to Figure 15b, we aim to derive the equivalent smallsignal parameters g m,eq , r d,eq and g x from the small-signal parameters of MA and MB.Through simple analytical considerations, we find: g m,eq ≈ g mb , r d,eq ≈ ng ma r da r db , g x ≈ ng ma . ( The expressions have been simplified considering the usual inequalities g ma , g mb ≫ 1/r da , 1/r db .We also observe that the node X can be used to provide a low-impedance path (g x ) for further flexibility.This node has been used in [78] to implement a (pseudo) cascode frequency compensation of a two-stage OTA.Based on (46), it is clear that as MB enters the saturation region, it dominates the overall transconductance.At the same time, its output impedance is magnified as in a traditional cascode configuration, hence boosting the intrinsic gain of the compound.Although the CBBM structure is not intrinsically related to slew-rate enhancing nor to transconductance boosting, we believe that future work exploring amalgamation of this technique with other techniques exposed here may result in high-performance OTA subunits.

Parallel-Type Slew-Rate Enhancer (PSRE)
The parallel slew-rate enhancer (PSRE) approach is schematically depicted in Figure 16a.The PSRE establishes a parallel path for signal propagation across the OTA only during the slewing transient.During this part of the settling transient, the PSRE is turned-on and delivers high currents at the output nodes, effectively enhancing the system slew-rate.Conversely, the PSRE is completely turned-off during the last part of the transient, when the OTA enters its linear response region.Consequently, the precise adjustment of the residual output voltage is delegated to the main OTA.Importantly, the adoption of PSREs does not impact the DC gain, offset, or noise of the OTA, effectively isolating the slew-rate from other specifications.
In practice, to avoid lags in the activation of the PSRE, which directly sums up to the settling time, part of the PSRE is kept turned-on, i.e., it works in Class-A.This fact leads to t set optimization scenarios, concerning the supply current distribution between the main OTA and the PSRE, when a total static current budget is assigned.An important aspect to consider when a PSRE is used is that its output currents are not intrinsically balanced around a common mode.Common-mode components may arise from V DS effects at the output transistors of the PSRE, as well as from internal mismatch between current mirroring blocks.Therefore, output common mode regulation is left to the main OTA, which in general is already equipped with this functionality.However, as the PSRE peak output currents increase, their common mode also increases.Consequently, the OTA-PSRE combination may require some additional, but generally simple, design step to attain the correct settling behavior.The PSRE in Figure 16b is activated when the absolute value of the input differential voltage surpasses a certain activation threshold, e.g., |V p − then the PSRE operates in its deadzone, not providing any output current due to its output devices being turned-off.The first concept of this kind of circuit was introduced by Nagaraj in [91].When the PSRE operates within the deadzone, all the available current provided by the differential pair is absorbed by the shunt current sources I th .To this purpose, the condition I th > I tail /2 needs to be set by design.Hence, within the deadzone, the current mirrors Mm3p-Mm5p, Mm4p-Mm6p, Mm3n-Mm5n, and Mm4n-Mm6n are off and the output nodes are rendered in high impedance.When the input pair is fully unbalanced by a large input voltage, a non-zero current is conveyed into the selected mirrors if I tail > I th .Hence, the circuit is constrained by the following inequalities: I tail /2 < I th < I tail , which can be satisfied by the mismatch-robust choice of I th = 3 4 I tail .For this choice, V A ≈ 1.855 2nI tail /β if the PSRE input pair is operated in strong inversion, or V A ≈ 1.099nU t , if operated in weak inversion.Furthermore, it is convenient to set the V dmax of the PSRE slightly less than V A in order to ensure that, outside of the deadzone, the PSRE delivers the maximum current to the output, which will result in I omax,PSRE = k(I tail − I th ) = kI tail /4.
As the desired output voltage level is approached, marked by the condition |V p − V n | = V A , the SRE output devices deactivate, causing the SRE outputs to transition to high impedance.Hence, neglecting inertial effects, the system obtains the maximum benefit if V A is set to be equal (or even less) than the V dmax of the OTA.
Neglecting inertial effects (hence neglecting also the effects of the C B element in Figure 16b), a high overall slew-rate is obtained by simply increasing the mirror amplification factor k instead of increasing the PSRE static current consumption (I sup,PSRE = 2I tail ).In such simplified conditions, an optimization of t S for a given current budget, I sup = I sup,OTA + I sup,PSRE , is possible.In the first place, let us introduce η as follows: Secondly, we relate I sup,OTA to the G m and I omax,OTA of the OTA block, as well as I sup,PSRE to I omax,PSRE of the PSRE: From the previous discussion, we are already able to calculate while V TE,OTA and k OTA are contingent upon the specific OTA topology.In particular, V TE,OTA assumes the same role in defying the G m efficiency of the OTA as in (20), previously discussed in Section 2.2.We are interested in the case where a maximum |V id | causes the system to produce a combination of slewing and linear settling phases in the output transient.Moreover, we indicate as t S,max the settling time related to the maximum input step, |V id | max .In these conditions, Equation ( 16) can be rewritten as follows: where the maximum output current during the slewing period is given as follows: Considering Equations ( 47), ( 48), ( 50) and ( 51), we can express t s,max as follows: , where: The settling time t S,max can be minimized looking at the derivative with respect to η: A valid η opt is found for k PSRE > k OTA (1 + k OTA t 2 /t 1 ).In practice, this condition implies a minimum k(= 8k PSRE ) to be employed.
Up to this point, inertial effects of PSRE have been neglected.However, the settling time is affected also by the turn-on and the turn-off lags of the PSRE output mirrors.These lags are more evident as k is increased excessively.This effect can be seen by looking at the total gate capacitance of the PSRE output mirror, C G , which is dominated by the gatesource and the gate-drain parasitic capacitances of the output device (whose overall area is proportional to k).The charge and discharge process of such a capacitance can be modeled considering the non-linear differential equation described in [92] and characterized by a time constant τ that can be expressed as follows: where g m0 and C G0 are, respectively, the transconductance and the gate capacitance associated to the input transistor of the mirror.It is worth noting that τ clearly refers to a linearized circuit model, which can, however, serve as a first-order approximation of the actual solutions derived from the large-signal model of the current mirror [92].
Since the PSRE activation is linked to a large-signal step, sensed either at the inputs or at the clock edge, dynamic biasing schemes are also possible for this circuit [93].Following this idea, a capacitance boosting technique is proposed in [15] by adding the capacitor C B between the source terminals of the complementary input differential pairs of the original Nagaraj's PSRE, as illustrated in Figure 16b.The boosting effect provided by C B can be analyzed as follows: the steady-state value of the common source voltages (V csn0 , V csp0 ) are determined by the input common mode V ic and I tail .Hence, C B is charged at C B (V csn0 − V csp0 ).At the onset of an input voltage step (either positive or negative), V csn is pulled up by a voltage difference ∆V csn , while V csp is pulled down by ∆V csp .This causes C B to promptly absorb an amount of charge ∆Q B = C B (∆V csn + ∆V csp ).Depending on the sign of the input voltage step, one of the following low-impedance paths is enabled: (i) M2n and M1p (positive input steps), or (ii) M1n and M2p (negative input steps).The low-impedance paths facilitate the transit of ∆Q B towards one of the respective output mirror sections.This mechanism effectively helps prompt turn-on of the corresponding mirror by charging their respective C G .The larger the voltage step, the more efficient the charge injection mechanism, hence a larger slew-rate boosting is obtained.
It is worth noting that the turn-off mechanism is much less efficient with respect to the turn-on mechanism, even in the presence of C B : at the transition of the PSRE back into the deadzone region, V n and V p have similar values, and the initial state of charge of C B is mostly restored by the I tail sources.The fraction of I tail that reaches the output mirror sections is then combined with I th to provide a net current that discharges C G , turning off the output mirrors.Hence, while the C B significantly shortens the turn-on delay of the PSRE, it leaves almost unvaried the turn-off delay.In practical designs, the use of C B is beneficial for moderate values of k, contingent upon the predominance of the turn-off delay, which in change is still strongly dependent on I tail (hence, on the static power consumption).The value of η opt found in Equation ( 53) can serve as a starting point before further optimization of both k and η is conducted.
The PSRE depicted in Figure 16c was first introduced in [52,94] for high-speed SC pipeline ADC applications, operating beyond 100 MHz sampling rates.Such a circuit resembles a two-stage OTA setup.The initial stage, comprising M1n-M2n and R 1 , operates as a low-gain preamplifier, while the subsequent stage consists of an RCbias tie push-pull arrangement [58,95].The output stage, devoid of any DC coupling, features output push-pull pairs (M3p-M3n, M4p-M4n) that are AC coupled through capacitive-resistive networks formed by C B and R B .DC biasing is ensured thanks to R B resistors providing bias voltages V BP and V BN .In this specific setup, these voltages are adjusted to maintain the output transistors in cut-off during steady-state conditions, thereby implementing class-B operation.This is crucial to prevent DC-gain degradation of the main OTA.
When considering the linearized equivalent circuit for differential signals, we can express the transfer function as follows: , where: H(s) express the transfer function across the preamplifier, operating in class-A, loaded by the RC-bias tee.The small-signal circuit model of the preamplifier is also shown in Figure 16c.C p1 and C p2 represent the parasitic capacitive loading at internal nodes.The approximated expressions of H(s), ω 1 , and ω 2 are obtained considering C p1 , C p2 ≪ C B and R 1 ≪ R B .Insofar, ω 1 ≪ ω 2 , and at intermediate frequencies the voltage gain is flat and determined by A 1 .In the design case of [52], ω 2 is set to be ×60 larger than ω 1 .The step response at the input of the class-B amplifier can be found as follows: The step response behavior is then determined by two components, these being the e −ω 1 t term and the e −ω 2 t term as the slow and the fast components, respectively.The slow response is, however, attenuated by the ratio ω 2 /ω 1 ≫ 1; hence, by design, its amplitude can be set below the activation threshold of the output transistors by choosing proper values of V BP and V BN .Thereby, only the fast component is processed by the PSRE.
The additional voltage required to activate the output transistors forms the basis of the deadzone mechanism, which is subsequently attenuated by the gain of the preamplifier.However, the preamplifier gain cannot be excessively high, as the input-referred deadzone needs to exceed the combined offsets of the main OTA and the PSRE.Failure to meet this condition would result in the system being unable to activate for neither positive nor negative input steps (Clearly, the width of the deadzone, or even its presence, is strongly affected by device mismatch.The latter can be expected to be large, since minimization of internal delays of the PSRE imposes the use of minimum length devices.This is a critical aspect of the PSRE that has to be addressed in the design phase by means of statistical analysis and/or Monte Carlo simulations).

Conclusions
In this review, we delved into the intricate landscape of slew-rate enhancement techniques for switched-capacitor amplifiers.A simplified settling model was introduced to provide a rational framework for understanding these advanced circuital techniques.Through this model, we elucidated the roles of small-signal and large-signal characteristics of amplifiers, shedding light on their influence on settling behavior.
Discussions were centered around power efficiency, particularly comparing singlestage and two-stage OTAs to offer practical guidelines for optimizing settling behavior under power constraints.An exhaustive review of state-of-the-art designs was conducted, initially using standard Figures of Merit regarding power efficiency.However, we also identified their scalability limits concerning the capacitive loading applied to amplifiers.A clear vision of the effectiveness of advanced slew-rate enhancer techniques is evident when considering the settling-time-based metrics.Based on this result, we introduced a novel Figure of Merit, which also takes into account the entity of the equivalent input step-like signal.This novel Figure of Merit allowed us to identify a design frontier, clearly illustrating the trade-offs between power efficiency and capacitive loading.
Finally, a detailed discussion at the transistor level of advanced circuit techniques was undertaken, revealing the strengths and weaknesses of each solution.By providing a comprehensive overview of these techniques and their implications, this review aims to guide future research in the field of switched-capacitor amplifier design.

Figure 1 .
Figure 1.Schematic diagram of an inverting fully-differential SC integrator.NominallyC 1p = C 1n = C 1 , C 2p = C 2n = C 2 , C 3p = C 3n = C 3 .Capacitors C 3p and C 3n represent the capacitive loads applied to the integrator.Phase ϕ reset is used to establish the initial conditions of the integrator state variable.The differential input of the integrator is represented by V ip − V in , while the differential output is represented by V op − V on .For simplicity, the output and the input common mode voltages are identical and equal to V cm .In this configuration, the output is valid at the end of ϕ 2 .

Figure 2 .
Figure 2. Chronograph of the differential output voltage V o (t) [from (3)] the relative error ϵ o (t)[from(7)] for a small input step and for a large input.

Figure 4 .
Figure 4. Numerical results for the settling-time model of Equation (16) with the following values: C 1 = 4 pF, C 2 = 32 pF, C 3 = 1 pF, C P = 0.2 pF, ϵ S = 100 ppm, V dmax = 50 mV.Design case (A) features G m = 1.33 mS, I omax = 66.7 µA, Design case (B) features G m = 1.33 mS, I omax = 266.7 µA, Design case (C) features G m = 2.67 mS, I omax = 133.3µA.Subplot (a) shows the t S , t lin and t sr behavour as function of V id .Subplot (b) shows the maximum |V id | for t S ≤ t ⋆lin as a function of the slew-rate enhancing ratio I omax /(G m V dmax ), where t ⋆ lin is defined as t ⋆ lin = t lin (V id = V dmax ).

Figure 6 .
Figure 6.Simplified single-ended equivalent circuit of a two-stage amplifier with capacitive feedback.
FoMs) Two commonly used Figures of Merit (FoMs) for capacitively loaded OTAs are known in the literature: FOMS and FOML.They are defined as follows:

Figure 10 .
Figure 10.Schematic diagram of the functional parts of a mirror-based OTA.

Figure 11 .
Figure 11.Schematic diagrams of Flipped Voltage Follower employed as (a) current buffer loop for class-AB current biasing, (b) low-voltage current mirror.

Figure 13 .
Figure 13.Schematic diagrams of non-linear current mirror configurations: (a) gate-series V C (I 1 ); (b) same as previous but with constant V G ; (c) source-series V C (I 1 ); (d) trivial implementation of the previous by the means of a resistor; (e) body modulation at the input device.

Figure 14 .
Figure 14.Schematic diagrams of differential non-linear currents mirrors: (a) structure based on local common-mode feedback circuit (LCMF) and (b) structure based on the non-linear source degeneration.

Figure 16 .
Figure 16.Schematic diagrams of (a) OTA+PSRE configuration; (b) Implementation of a PSRE based on current mirrors, enhanced by the boost capacitor C B ; (c) Implementation of a class-B PSRE based on RC-bias ties.

Figure
Figure 16b,c shows two different implementations of the PSRE, respectively, based on current mirrors and on a class-B output stage.The PSRE in Figure16bis activated when the absolute value of the input differential voltage surpasses a certain activation threshold, e.g.,|V p − V n | ≥ V A , while if |V p − V n | < V A ,then the PSRE operates in its deadzone, not providing any output current due to its output devices being turned-off.The first concept of this kind of circuit was introduced by Nagaraj in[91].When the PSRE operates within the deadzone, all the available current provided by the differential pair is absorbed by the shunt current sources I th .To this purpose, the condition I th > I tail /2 needs to be set by design.Hence, within the deadzone, the current mirrors Mm3p-Mm5p, Mm4p-Mm6p, Mm3n-Mm5n, and Mm4n-Mm6n are off and the output nodes are rendered in high impedance.When the input pair is fully unbalanced by a large input voltage, a non-zero current is conveyed into the selected mirrors if I tail > I th .Hence, the circuit is constrained by the following inequalities: I tail /2 < I th < I tail , which can be satisfied by the mismatch-robust choice of I th = 3 4 I tail .For this choice, V A ≈ 1.855 2nI tail /β if the PSRE input pair is operated in strong inversion, or V A ≈ 1.099nU t , if operated in weak inversion.Furthermore, it is convenient to set the V dmax of the PSRE slightly less than V A in order to ensure that, outside of the deadzone, the PSRE delivers the maximum current to the output, which will result inI omax,PSRE = k(I tail − I th ) = kI tail /4.As the desired output voltage level is approached, marked by the condition |V p − V n | = V A , the SRE output devices deactivate, causing the SRE outputs to transition to high impedance.Hence, neglecting inertial effects, the system obtains the maximum benefit if V A is set to be equal (or even less) than the V dmax of the OTA.Neglecting inertial effects (hence neglecting also the effects of the C B element in Figure16b), a high overall slew-rate is obtained by simply increasing the mirror amplification factor k instead of increasing the PSRE static current consumption (I sup,PSRE = 2I tail ).In such simplified conditions, an optimization of t S for a given current budget, I sup = I sup,OTA + I sup,PSRE , is possible.In the first place, let us introduce η as follows:

Table 2 .
Top performers in terms of FOMS and FOML.