1. Introduction
The driving of large capacitive loads—such as those encountered in display drivers (LCD/OLED), sensor interfaces, biomedical signal buffers, and communication circuits—demands operational transconductance amplifiers (OTAs) with high slew rate (SR) and fast settling. In display applications, stringent pixel charging requirements and energy constraints have led to various architectural strategies [
1]. Early multistage class-B topologies [
2,
3] laid the groundwork with transient-aware biasing, achieving µA-level quiescent currents and sub-3 µs settling for multi-nF loads. In addition, the inclusion of a resistor in series to the amplifier output allowed a zero-based frequency compensation in such multistage structures, increasing stability at higher capacitive loads [
4]. Dual-path push–pull structures emerged as highly efficient class-B mechanisms coupled to a class-AB main signal path [
5,
6,
7]. A compact and efficient class-AB mechanism based on a floating PMOS/NMOS mesh was proposed in [
8]. Innovations like distributed Miller compensation have also been introduced to enhance stability at all signal levels [
9].
Moving towards general-purpose techniques tailored for large capacitive load driving, a range of amplifier topologies has been developed to address the conflicting requirements of stability (small-signal behavior) and large-signal responsiveness (slew rate), with approaches varying significantly depending on whether a single-stage or multistage design strategy is adopted.
Single-stage architectures are often preferred in ultra-low-power applications due to their compactness, simplicity, and reduced static power consumption. However, they typically struggle to simultaneously deliver high gain, large output swing, and sufficient capacitive load drivability. Various techniques have been proposed to overcome these limitations:
Several designs embed dynamic current-boosting elements to improve large-signal performance. For example, the amplifier in [
10] employs a current-detection-based slew-rate enhancement (SRE) circuit to boost transient current during load events, leading to a >43× improvement in slew rate when driving a 470 pF load. Similarly, ref. [
11] proposes current starving and feeding to achieve a 600–1100% SR enhancement while maintaining low quiescent current. The amplifier in [
12] follows a super class-AB approach in a recycling folded cascode (RFC) topology, where an adaptive bias circuit and local CMFB further improve GBW and SR by factors of 3 and 34, respectively, without additional power. The work in [
13] introduces a non-linear nested current mirror to push the class-AB operation further in the single-stage regime, demonstrating significant improvements in slew rate and GBW without additional power overhead.
Some single-stage approaches focused on small-signal frequency-domain behavior. In [
14], the nested current mirror (NCM) method achieves GBW scalability across a 0.15–15 nF range without requiring compensation capacitors, offering 72–84 dB gain with >60° phase margin, exploiting current mirroring hierarchies. A similar balance between SR and small-signal performance is achieved in [
15,
16], which utilizes gain-enhanced current mirrors with class-AB push–pull stages. Another example is [
17], where a bulk-driven OTA with local positive feedback exploits body terminals and a dedicated SR enhancement block to sustain performance with loads up to 10 nF while maintaining quiescent current under 1.4 μA.
Multistage topologies allow higher gain and output swing but complicate frequency compensation, especially under large capacitive loads. Recent innovations mitigate these challenges through systematic compensation strategies or control-centric design frameworks:
In [
18,
19], Yan et al. proposed three-stage and two-stage amplifiers employing parasitic pole cancellation and embedded capacitor multipliers (CM), respectively. The CM technique enhances SR and area efficiency by leveraging a small sub-pF capacitor. In [
20], a local-feedback-enhanced compensation scheme for three-stage amplifiers is proposed, showing a GBW of 1.34 MHz and phase margin of 52.7° when loaded with 500 pF in unity-gain configuration. The amplifier in [
21] employs an adaptively varied medium-impedance node, enabling ultra-high slew rate (up to 8.67 V/µs) over a wide 0.1–15 nF load range.
A number of works improve the GBW and phase margin through novel compensation strategies. For instance, ref. [
22] uses a signal- and transient-current boosting (STCB) mechanism, extending GBW and improving SR while keeping the phase margin above 80°. Likewise, ref. [
23] implements a three-stage cascode Miller-compensated amplifier with local impedance attenuation for complex-pole control, achieving both high DC gain and fast settling times. The amplifier in [
24] introduces dynamic biasing and a systematic optimization of settling time using contour plot analysis, offering robust stability for three-stage designs. Ref. [
25] is also noteworthy, as it utilizes inner feedforward compensation in a two-stage amplifier to achieve high current efficiency and a GBW of 2.01 MHz at just 108 µW of consumption, targeting pF-range loads, but still relevant for scalable designs. Furthermore, ref. [
26] demonstrates how local positive feedback in a folded cascode architecture can enhance SR, GBW, and DC gain simultaneously, outperforming even recycling FC-OTAs. Innovations in compensation techniques have also been introduced for high-speed designs [
27]. Recently, an interesting technique for unconditional stability for any value of capacitive load in a two-stage architecture has been demonstrated, employing an output branch replica connected to the OTA virtual ground terminal when configured as a standard inverting amplifier [
28].
While both single-stage and multistage designs have demonstrated competitive figures of merit (FOMs), the choice of strategy often reflects tradeoffs in complexity, power, and target load range. Parallel-type slew-rate enhancers (PSREs) appear as an alternative strategy to deal with stability vs. slew-rate trade-offs by decoupling large-signal dynamics from static power—a principle pioneered by Nagaraj in 1990 [
29] and later refined for modern technology nodes (e.g., Catania et al.’s 0.18-µm design proposed in [
30]). Naderi et al. employed PSRE structures in high-speed pipelined analog-to-digital converters [
31,
32], demonstrating superior performance even with very light capacitive loads (500 fF). The work of Koh et al. [
33] stems from the PSRE idea, also introducing refined techniques for near-zero deadzone control to obtain class-C operation. The work in [
34] is inspired by similar principles and implements a feedforward path using a dynamic circuit with a capacitor-based hysteresis detector. However, PSRE efficiency hinges on activation latency and seamless reintegration during settling, challenges exacerbated in advanced displays where sub-1% settling must coexist with <20 µW power budgets. Recent PSREs, like Gagliardi et al.’s LCD-oriented design [
35] (2023), demonstrate rail-to-rail capability under 1 nF loads but reveal lingering compromises between achievable slew rate and overall settling performance—a gap this work aims to bridge. Importantly, in [
30], the capacitive boosting (CB) technique is introduced in the PSRE block, showing great turn-on improvement on fully differential structures. Nevertheless, to the best of our knowledge, CB boosting has not yet been demonstrated in single-ended architectures such as those employed for LCD and large external load drivers.
This paper is organized as follows:
Section 2 provides the theoretical basis for the PSRE-assisted OTA topologies;
Section 3 discusses, for the first time, the single-ended parallel-type slew-rate enhancer with standard capacitive boosting and its limitations, highlighting the trade-offs between achievable slew rate and the reduction in turn-on delay;
Section 4 introduces a key topological improvement to overcome these limitations, namely the split capacitive boosting (SCB);
Section 5 showcases SCB-PSRE-assisted OTAs under typical LCD-driver design specifications in 0.18-µm CMOS, validated through extensive electrical simulations. A state-of-the-art comparison against designs previously implemented in the same technological node is also provided. Finally,
Section 6 concludes the paper.
2. Parallel Slew-Rate Enhancer Principles
The principle of a PSRE-assisted OTA is schematically represented in the block diagrams of
Figure 1a, considering a single-ended capacitive load. The output currents of both the OTA and the PSRE are shown in
Figure 1b, qualitatively plotted as a function of the differential input voltage (DC transconductance characteristics). While the OTA typically shows great sensitivity to the input signal within a small region around the zero differential input, the PSRE provides large output current outside of this region, where the OTA output current saturates to a constant value. These characteristics allow the PSRE to increase the overall SR exclusively during the initial phase of such a transient, when the absolute value of the input differential voltage exceeds a specific activation threshold. During the transient [
Figure 1c], the input differential voltage decreases, eventually deactivating the PSRE, which then transitions to a high-impedance state. This ensures that the final fine-tuning of the output voltage is handled by the main OTA. Hence, the presence of the PSRE does not affect the OTA DC gain, offset, noise, and stability performances at the steady state, effectively isolating the SR from other specifications and simplifying the design procedure.
Figure 1d shows the functional diagram of the original PSRE, introduced by Nagaraj in 1990 [
29]. The circuit is based on a differential transconductive stage, supplied by the tail current
, which provides
and
currents at its output nodes. A fixed threshold current,
, is substracted from both
and
before hard-limiting blocks (‘
’ and ‘
’) and the final
current amplification. As shown in the sketched current characteristics of
Figure 1d, the resulting current after the hard limiter,
, is reduced by a factor ×4 due to the restricted interval of choice of
[
], with
representing a typical robust choice. The choice of
also determines the value of the PSRE activation voltage,
.
Since the first PSRE, numerous improvements have been proposed. For example, Sakurai et al. [
36] introduced a folded-cascode OTA with an internal SR enhancement circuit, while Lee and Mok [
37] proposed single-point-detection circuits for single-stage amplifiers. More recently, Gambhir et al. [
6] demonstrated a high-speed three-stage class-B output buffer for LCD-driving applications, achieving significant improvements in settling time and power efficiency. Similarly, Wen et al. [
4] developed a rail-to-rail buffer using dynamic-current feedback, which achieved a high SR and low power consumption for OLED source drivers.
Figure 1e shows an efficient PSRE implementation which incorporates a dynamic-threshold-current (DTC) technique [
35]. Here, instead of using a fixed
, an auxiliary transconductor stage, biased at a small portion (
) of the total quiescent current of the PSRE, and also sensitive to the differential input voltage
, provides signal-dependent currents
and
, which are purposely amplified (
) and subtracted from
and
, respectively. The condition
is guaranteed by making
. Insofar, the maximum value of
,
, reached when the PSRE input pairs are fully unbalanced, results to be equal to
, which can be made reasonably close to
. For example, with
, 90% of
is effectively delivered to the final
current amplification stage, leading to larger SR for the same static supply current when compared to the solution of
Figure 1d.
For optimal power efficiency, the always-on PSRE current,
, is often designed to be much smaller than the OTA quiescent current. This leads to possible turn-on delays owing to the
current amplifier of
Figure 1d,e, hence reducing the effectiveness of the PSRE itself. The capacitive boosting (CB) technique, consisting of providing an additional impulse-like boost to
, addresses this issue by accelerating turn-on delays. In prior works, this technique has been demonstrated in fully-differential PSREs [
30]. However, their applicability to single-ended configurations—ubiquitous in off-chip load driving—remains mostly unexplored, leaving a critical gap in power-efficient, high-speed, single-ended designs.
This work identifies a key limitation of standard CB in single-ended PSREs: slew-rate degradation due to common-mode input swings (
Section 3). We then propose a novel CB-PSRE topology optimized for single-ended operation, avoiding detrimental dynamic currents, induced by common-mode swings in standard CB-PSREs, to minimize settling time without compromising static power consumption (
Section 4). Electrical simulations of a SCB-PSRE-assisted OTA designed in 0.18-µm CMOS demonstrate a ×5.53 settling-time reduction compared to conventional CB-PSREs when driving 1-nF loads, achieving 2.56-µs 1% settling time at 4-µA total static current consumption. The proposed design also exhibits robustness to PVT variations, making it suitable for display drivers, MEMS interfaces, and other large-capacitive-load applications.
The contributions of this work include: (i) the first analysis of CB-induced SR limitations in single-ended PSREs, linking performance degradation of the PSRE (in terms of achievable SR) to common-mode dynamics, (ii) an optimized SCB-PSRE architecture for single-ended systems, validated by comparative simulations, and (iii) a power-efficient solution (13.2 µW) for 1-nF loads, with ×5.53 faster settling time than prior CB approaches.
3. Single-Ended Parallel-Type Slew-Rate Enhancer with Standard Capacitive Boosting
3.1. Implementation of the PSRE with Dynamic Threshold Currents
Figure 2 shows the transistor-level schematic view of the PSRE with dynamic threshold currents (DTC) [
35], augmented by the addition of the capacitive booster
, as proposed in [
30]. In the following, we refer to this configuration as the
standard CB configuration to distinguish it from the novel approach introduced in
Section 4.
The PSRE block is ideally designed to deliver a DC output current
described by:
where
is the differential input voltage,
is the maximum output current available during slewing transients, and
is the PSRE activation threshold (defining its “deadzone” around zero differential input). For optimal transient response,
is typically set near the boundary of the OTA linear region [see
Figure 1b].
The PSRE in
Figure 2 consists of: (
i) main transconductors (Mtn1-Mi1-Mi2, Mtp1-Mi3-Mi4), (
ii) auxialiary tranconductors (Mtn2-Ma1-Ma2-Msh3a-Msh3-Msh4a-Msh4, Mtp2-Ma3-Ma4-Msh1a-Msh1-Msh2a-Msh2), (
iii) current mirrors (Mm1p-Mm2pm, Mm1n-Mm2n, Mm3p-Mm4p, Mm3n-Mm4n) for hard-limiting and amplification, (
iv) the capacitive booster
between the common source nodes of the main input pairs.
Neglecting, for the moment, the role played by
, it can be easily shown that the PSRE delivers maximum output current equal to:
when both NMOS and PMOS pairs are active. However, when the supply or ground rail is approached during rail-to-rail input swings, one pair eventually turns off, reducing
by 50% (see
Section 3.3).
As analytically derived in [
35],
depends on various factors: (
i) the fraction
of the total PSRE quiescent current used to supply the auxiliary transconductor and (
ii) the current-mirror gain
J, defined as
, with
, where symbols
,
,
W and
L indicate the carrier mobility, the gate capacitance per unit area, the channel width and the channel length of MOS transistors. In weak-inversion operating conditions, the activation threshold
can be expressed as in (
3) after analytical manipulations relying on descriptions of MOS behavior based on the Enz-Krummenacher-Vittoz (EKV) model [
38,
39]:
where
n is the subthreshold slope factor and
is the thermal voltage (i.e.,
, with
indicating the Boltzmann constant,
T the absolute temperature and
q the elementary charge). On the other hand, in strong-inversion conditions and assuming
:
where symbols
and
indicate the transconductance factors related to the differential pair devices fed by
and
, respectively. To attain (
4), the square-law approximation derived from the EKV model of the MOS transistor was used [
38]. The assumption
is enforced directly by design through appropriate geometrical ratios for the Mi1–Mi2 and Ma1–Ma2 transistor groups in
Figure 2. Specifically,
. It is important to emphasize that the aforementioned expressions apply to asymptotic cases corresponding to either pure weak or strong inversion operation. Real-world designs may also operate in the moderate inversion region, where
exhibits a dependence on the design parameters
and
J that lies between the two asymptotic cases. In practical scenarios where PVT (process, voltage, temperature) variations may occur, the actual dependence of
on
and
J can shift within the design space bounded by Equations (
3) and (
4). It is also worth noting that in the PSRE-assisted OTA,
is selected relative to the sensitivity range of the main OTA differential pair (
). Therefore, PVT variations tend to affect both parameters similarly, resulting in a first-order robustness of the PSRE activation mechanism.
As highlighted in [
40], the PSRE current mirrors exhibit a turn-on delay (
), degrading settling time. This phenomenon is qualitatively represented in
Figure 3, where the total settling time is determined by the sequence of three distinct processes: (
i) PSRE turn-on, (
ii) slewing phase, and (
iii) linear phase. From [
40],
is estimated to be:
where
is the gate capacitance of Mm1p-Mm2p and Mm1n-Mm2n, while
and
are the transconductance factor and the input current of either Mm1p or Mm1n (depending on the sign of the transient) in slewing conditions. The approximated expression of
—a relation useful in the design process—is derived through several simplifications (e.g., neglecting the dependence of
on charge and assuming a constant
during turn-on). Clearly, these assumptions imply that the parameters in (
5) should be interpreted as average values over the turn-on interval. Further details can be found in [
40]. Thanks to the DTC approach,
(still neglecting, for the moment, the CB effect). The
term is dominated by the
parasitic capacitances of the transistors labeled with Mm, hence its effect on
is proportional to
. Therefore, by increasing
K in the attempt of boosting
and the SR,
also increases. This phenomenon eventually determines the minimum achievable settling time of the considered PSRE-assisted OTA topology, when the CB technique is not used.
Next, we analyze the capacitive boosting achieved by placing
between the common-source nodes of the two complementary differential pairs in the main transconductors (
Figure 2).
Section 3.2 examines the effects during the turn-on phase, while
Section 3.3 addresses the slewing phase effects. To the best of the authors’ knowledge, the analyses reported in such two sections have not been proposed in any prior work. As for the final linear settling phase, CB has no role in it, since the PSRE output branches result in being turned off.
Section 3.4 describes the main OTA implementation used alongside the PSRE of
Figure 2 for the electrical simulations in
Section 3.5. The limitations identified through both analytical and simulation approaches will establish the foundation for the improved CB technique proposed in
Section 4.
3.2. Capacitively-Boosted Turn-On Process
We first examine the primary effect of the CB technique: reducing the turn-on delay of the PSRE. For clarity in subsequent calculations,
Figure 4a reproduces the transconductive core of the PSRE from
Figure 2, with annotations showing the operating states of transistors Mi1-Mi4 at the onset of a low-to-high input voltage (
) transition. We assume that this transition occurs at
, with
switching from
(close to ground) to
(close to
), where
denotes the supply voltage.
Prior to the transition, the n-type differential pair is turned off, with its common-source voltage
near ground, due to the low voltage
applied at its inputs. Meanwhile, the common-source voltage
of the p-type pair approximately equals
, where
represents the source-gate voltage of Mi3-Mi4 at the steady state, [i.e., when the currents of both Mi3 and Mi4 are equal to
, as
]. Therefore, the initial voltage across
, defined as
, is equal to:
Following the transition (assuming negligible rise time of ), Mi4 immediately turns off, while Mi3 adjusts to a new operating point characterized by , corresponding to drain current . This new operating point is dynamic as it evolves with . However, for greater simplicity, we approximate it as an effective constant value corresponding to the average value of during the transient. We apply the same simplification to the gate-source voltage of Mi1.
With
during the initial phase of the transient,
. Simultaneously, Mi1 turns on due to
applied at its gate terminal, reaching gate-source voltage equal to
, determined by
. This yields
. The updated
voltage becomes:
Therefore, the voltage change occurring across
at the rising edge of
, derived from (
6) and (
7), results to be equal to:
The abrupt voltage variation across
gives rise to a charge injection phenomenon from
towards the input capacitance
of the current mirror Mm1p-Mm2p, expediting the turn-on process of transistor Mm2p, delivering the PSRE output current. Two main contributions are present: (
i) charge directly injected from
into
through the n-type input transistor Mi1, acting like a closed switch, and (
ii) charge injected from
through the p-type input transistor Mi3, the n-type mirror Mm3n-Mm4n and, finally, into
. The voltage variation at the gate terminal of Mm1p-Mm2p can be expressed as:
where we have also taken into account charge partitioning effects into the gate-source capacitances of Mi1 and Mi3,
and
, and into the input capacitance
of current mirror Mm3n-Mm4n. The assumption of the PSRE input transistors operating in the triode region during the capacitively-boosted turn-on process is later verified in
Section 3.5 based on electrical simulation results.
With sufficient
, Mm2p enters strong inversion during the PSRE turn-on phase, producing a large instantaneous output current, equal to:
where
and
are the threshold voltage and subthreshold slope factor of Mm1p-Mm2p, respectively. By designing
and Mm1p-Mm2p such that:
the PSRE output current achieved during the turn-on process (
10) overcomes the limit value it could reach without
, expressed by (
2). However, this current level persists only during the charge injection phenomenon, lasting approximately:
In the latter relationship, we assume that all input devices (Mi1-Mi4) are characterized by the same transconductance factor
, with
and
indicating the subthreshold slope factors of Mi1 (NMOS) and Mi3 (PMOS), respectively.
For high-to-low transitions, analogous expressions apply:
where
is the steady-state gate-source voltage of Mi1-Mi2 when
,
and
are the input capacitances of current mirrors Mm1n-Mm2n and Mm3p-Mm4p, respectively,
and
are the gate-source capacitances of transistors Mi2 and Mi4, respectively, and
is the threshold voltage of Mm1n-Mm2n. We assume identical transconductance factors
for Mm1p and Mm1n, and equal
for Mm2p and Mm2n.
Notably, (
10) and (
15) may yield asymmetric currents (
) despite matched
and
, mainly due to typically different threshold voltages of n-type and p-type transistors. As demonstrated by simulations reported in
Section 3.5, this asymmetry gives rise to unequal output steps upon PSRE turn-on events, namely:
where
and
occur at the beginning of low-to-high and high-to-low transitions of
, respectively. Nevertheless, both
and
contribute to reducing the settling time by bringing
closer to its steady-state value.
We conclude by noting that larger
values improve turn-on performance by increasing voltage steps
and
(
9), (
14) and increasing
(
12). However, as explained in the next section, enlarging
in the standard CB-PSRE degrades slew-rate performance.
3.3. Capacitive-Boosting Effects on the Slew Rate
To analyze the impact of
on the slew rate (SR), we refer to
Figure 4b in the following analysis. During the slewing phase, with
and
evolving towards
, the output voltage can be expressed as:
This expression incorporates two key approximations: (
i) the current
delivered by the CB-PSRE remains constant during slewing, and (
ii) the main OTA contribution is negligible.
During the slewing phase, both Mi1 and Mi3 remain active, with their drain currents comprising two components: the static tail current and the dynamic current flowing through . This second contribution, absent in the original PSRE configuration without CB, originates from the fact that the voltage across evolves together with . Notably, as analytically shown in the following, has a detrimental impact on the SR, as it drains a portion of the static tail currents , decreasing the current effectively delivered to the output mirrors of the PSRE.
For analytical tractability, we assume constant “effective” gate-source voltages of Mi1 and Mi3 during slewing conditions, denoted as
and
, respectively.
Section 3.5 eventually validates this approximation through electrical simulations. Under these assumptions, the voltage across
results to be equal to:
This yields the following dynamic current through
:
The current delivered to the Mm1p-Mm2p mirror equals
, hence resulting in the following value for the CB-PSRE output current:
Comparison of (
20) with (
2) reveals a reduction factor of the PSRE output current equal to
with respect to the PSRE without
. As anticipated, this negatively impacts the slew rate
. The degradation factor worsens as either
K or
increases.
The derivation of (
20) implicitly assumes
, which holds true if:
a condition satisfied for all positive values of
and
K.
We emphasize that this SR degradation uniquely occurs in single-ended CB-PSREs, due to the ramping behavior of
during slewing conditions, caused by either
or
following the output voltage evolution. Conversely, this effect is absent in fully-differential PSRE-assisted OTAs, thanks to common-mode control during differential-mode transients [
30].
Finally, it is worth mentioning some additional effects not included in the model of (
20). While (
20) provides a reasonable approximation at the onset of the slewing phase, the actual available current diminishes towards the end of the transient due to: (
i) Mi3 gradually turning off as
approaches
(because of the tail transistor of the p-type pair, Mtp2, entering the triode region and eventually deactivating, reducing the output current contribution from the p-type input pair); (
ii) Mi2 turning on and competing with Mi1 for the tail current; (
iii) gradual re-activation of the DTC mechanism for imposing the threshold currents (via Ma2-Msh4a-Msh4 in
Figure 2). The last two mechanisms dominate at the slewing phase conclusion, whereas the PSRE leaves control to the main OTA (linear settling phase). On the other hand, the first mechanism may manifest earlier, during the last part of the ramping evolution of
, halving
(and the SR with it) when the output current contribution related to
vanishes [see
Figure 4b].
3.4. Input-Common-Mode Gm-Stabilized Single-Stage OTA
The case-study OTA used in this work employs a rail-to-rail single-stage active-mirror topology. Its complete schematic view is shown in
Figure 5. To maintain consistent transconductance (
) across the full rail-to-rail input common-mode range (ICMR), we implemented a modified
-control technique based on [
35], which builds upon the approach originally presented in [
41].
For mid-range input common-mode voltages (), both n-type (M1n-M2n) and p-type (M1p-M2p) differential pairs remain active (as their respective supply transistors Mtn and Mtp are biased in saturation) and jointly contribute to the total . In this condition, transistors M3n and M3p remain off, keeping Mb1p, Mb2p, Mb1n, and Mb2n in the cutoff region.
As
approaches either rail (ground or
), one differential pair deactivates when its supply transistor enters the triode region. Without compensation, this would typically halve the OTA
. The circuit in
Figure 2b solves this through complementary compensation.
When , the n-type pair (M1n-M2n) deactivates due to insufficient across Mtn. However, in this condition, M3n activates, injecting current into the Mb1p-Mb2p current mirror. This provides supplemental tail current to the p-type pair (M1p-M2p), beyond the nominal . Proper sizing of M3n, Mb1p, and Mb2p restores the total to its mid-range value.
Conversely, as approaches , M3p activates the Mb2n-Mb1n path to boost current in the n-type pair. This symmetrical compensation maintains optimal OTA performance across the entire ICMR, particularly minimizing the linear settling phase duration during output transients.
3.5. Simulation Results
Electrical simulations were performed on the PSRE-assisted OTA with the standard CB technique (
Figure 2 and
Figure 5) using the Spectre simulator with device models from a 0.18-µm CMOS process by UMC. The design, with device parameters detailed in
Table 1 and
Table 2, targets LCD-driving applications with 1-nF load capacitance. I/O transistors (3.3-V supply) and metal-insulator-metal (MIM) capacitors were employed, with the OTA consuming 2.4 µA in quiescent conditions and the PSRE 1.6 µA. The PSRE was configured with current gain
K = 600 in its output current mirrors and DTC parameters
= 1/4 and
, achieving 1% settling time below 10 µs upon input swings between 10% and 90% of
. All of the simulations discussed in this section were performed in the typical process corner at room temperature (
°C).
To determine transistor sizes in
Table 1 and
Table 2, we employed well-known intuitive guidelines that are typically used in most OTA designs. Notably, these considerations apply to both the main OTA and the PSRE block, since even the PSRE architecture closely resembles an OTA. Specifically, input-pair transistors have been sized so that to be biased in moderate/weak inversion, to maximize the
parameter, hence optimising transconductance and GBW for the given current budget. On the other hand, current-mirror transistors are biased in strong inversion, to reduce their noise and mismatch impact, with overdrive voltages below 200 mV to simultaneously achieve wide ICMR and output swing.
PSRE parameters
,
J and
K have been chosen based on parametric sweep simulations, taking into account their role in the following trade-offs: (
i) smaller
maximizes the fraction of PSRE quiescent current employed to generate large output current in slewing conditions, but excessively small
might cause excessive delays in the DTC mechanism, possibly degrading settling time; (
ii)
J is bounded to verify
, to correctly enforce the PSRE deadzone (as discussed in
Section 2), but excessive
J might also cause delayed DTC action; (
iii) larger
K increas the maximum PSRE output current, but exacerbates the tail current draining effect due to standard CB in the single-ended PSRE (studied in this work for the first time), on top of worsening the turn-on/turn-off delays of the PSRE, as discussed in [
40].
Figure 6 illustrates the competing effects of capacitive boosting during low-to-high transients [
Figure 6a] and high-to-low transients [
Figure 6b]. Small
values effectively eliminate the PSRE turn-on delay, replacing it with an initial voltage step. However, they simultaneously reduce the slew rate during the main slewing phase, as predicted by (
20). The net effect on settling time depends on which phenomenon dominates. Furthermore, focusing on the low-to-high transients of
Figure 6a, we observe that when the output voltage exceeds ∼2.6 V, the p-type pair deactivates, creating a secondary reduced slewing regime before the final linear settling phase, where the OTA operates alone. This effect has been discussed in the conclusion of
Section 3.3.
For the simulated design, settling time marginally improves with increased
at low-to-high transients, while it seriously worsens with
at high-to-low transients. These effects are highlighted in
Figure 7, showing SR and settling time as a function of
. As previously anticipated, this asymmetry, occurring despite balanced transconductance factors of PMOS and NMOS transistors, is due to PMOS-NMOS threshold voltage differences, causing unequal current boosting during opposite transitions, as predicted by (
10), (
15) and (
16). This polarity-dependent behavior reveals that the standard CB technique is not effective when used for single-ended structures.
Figure 8 and
Figure 9 provide detailed validation of the analytical models presented in
Section 3.2 and
Section 3.3.
Figure 8 shows voltages
,
and
across a whole low-to-high transient.
and
exhibit the predicted behavior [
Figure 8a]:
rapidly settles to
at the input voltage step, while
tracks the output voltage as
. The resulting evolution of the voltage across
, shown in
Figure 8b, matches our theoretical predictions, giving rise to the dynamic current
(
19), which lowers the SR (
20).
Figure 9 shows other relevant PSRE internal voltages during the low-to-high transient. Specifically, traces marked as
,
and
correspond to the gate-source voltage of Mi1, drain-source voltage of Mi1 and gate voltage of Mm1p-Mm2p, while the voltage across
and the input and output voltages are also plotted for reference. Interestingly, in the magnified view of
Figure 9b, it appears evident how, at the very beginning of the transient, during the PSRE turn-on process, the input transistor Mi1 finds itself to shortly work in the triode region (
), while the output transistor Mm2p markedly reaches strong inversion, thanks to the large
step. These findings validate the analytical model proposed in
Section 3.2 to describe the CB-assisted PSRE turn-on process.
5. Results and Discussion
In this section, performances of the proposed SCB-PSRE-assisted OTA are showcased. In this set of simulations, performed with the Spectre simulator using device models from a 0.18-µm CMOS process by the UMC foundry, the PSRE and OTA blocks were implemented using the same device sizes of
Table 1 and
Table 2, respectively, and were supplied with static currents equal to 1.6 µA and 2.4 µA, respectively. The
and
capacitors included in the SCB-PSRE scheme (
Figure 10) were set equal to 2.25 pF each. According to the parametric sweep simulations of
Figure 11, that is the highest capacitance value avoiding overshoot phenomena, hence corresponding to optimum settling behavior. Unless differently specified, simulations were performed at room temperature (
°C) with
V in the typical process corner. Performances of the SCB-PSRE are first compared to prior PSRE implementations; then, they are benchmarked against previously published works on fast-settling drivers.
Figure 13 shows output voltage transients obtained in response to both low-to-high and high-to-low input voltage transitions. The following designs have been tested: (
i) the OTA alone, (
ii) the PSRE-assisted OTA with the standard CB technique (
Figure 2), with
pF, and (
iii) the PSRE-assisted OTA with the novel SCB technique. All three circuits were designed with a total quiescent supply current of
µA and were arranged in unity-gain configuration. The load capacitor was set equal to
nF, while the input steps span from 10% to 90% of
. The first design, only including the OTA of
Figure 5, exhibits very inefficient driving capability, requiring around 400 µA to settle to the steady-state output voltage. Conversely, the addition of the PSRE block, under the same total
, remarkably improves settling speed, especially with the novel SCB technique, achieving both low-to-high and high-to-low 1% settling times (indicated as
and
in the following) below 3 µs.
In
Figure 14, the same three designs are tested under sinusoidal stimuli, again in unity-gain configuration with a load capacitance of 1 nF. Specifically,
Figure 14a shows the waveforms obtained when the input signal is generated with 2-kHz frequency and peak-to-peak amplitude equal to 2 V
pp. While the OTA alone is unable to correctly follow the input signal, due to clear SR limitations giving rise to a triangle-wave output, the PSRE-assisted OTA correctly tracks
, with only small discrepancies before sine-wave peaks due to PSRE turn-off events.
Figure 14b–d show total harmonic distortion (THD), peak-to-peak amplitude, and average consumption, respectively, as a function of frequency, while the input sine-wave amplitude is kept equal to 2 V
pp. While the OTA alone incurs THD and amplitude degradation just above 1 kHz, the PSRE-assisted designs achieve more limited THD increase as a function of frequency, while the amplitude is roughly kept constant. As expected, no significant differences can be noticed, in this simulation, between the CB-PSRE and the SCB-PSRE designs, since the SCB technique is primarily meant to optimize responses to fast input voltage steps, rather than gradual, continuous variations of
as encountered with a sinusoidal stimulus. It is also worth noting that, while all designs were supplied with the same total quiescent current, the average dynamic consumption of the PSRE block increases as a function of frequency, as made evident in
Figure 14d; however, this portion of consumption is entirely employed to drive the load capacitance during slewing transients. Hence, this effect highlights how the PSRE block allows efficient automatic adjustment of the average consumption depending on the load current requirement.
Table 3 compares the SCB-PSRE against alternative implementations. In all PSRE-assisted OTA solutions, the OTA and PSRE blocks have quiescent supply currents equal to 2.4 µA and 1.6 µA, respectively. Using
= 2.25 pF, the proposed SCB-PSRE-assisted OTA achieves ×5.53 lower settling time than the standard CB-PSRE, and a ×162 settling time improvement over the standalone OTA with the same quiescent supply current. In terms of SR, the proposed SCB-PSRE improves over the standard CB-PSRE by 2.44 times.
Static parameters are also reported in
Table 3 for all designs, including frequency-response parameters (i.e., DC gain, gain-bandwidth product and phase margin, indicated as
, GBW and PM, respectively), common-more rejection-ratio (CMRR) characteristics (i.e., the CMRR value in DC and its −3 dB bandwidth, indicated as CMRR
DC and BW
CMRR, respectively), integrated noise, and standard deviation of the input-referred offset (
). Offset has been evaluated through sets of 500 statistical Monte Carlo simulations, including both global and local (mismatch) process errors. All of these parameters are not affected by the presence of the PSRE block, proving correct turn-off of the PSRE output devices at the steady state, where only the main OTA operates.
Table 4 summarizes the performances of the SCB-PSRE-assisted OTA across mixed PVT corners. Good resilience to PVT variations is demonstrated, with all key parameters (SR, settling time, DC gain, GBW, and phase margin) showing minimal deviations from typical conditions.
Layout parasitic components (not taken into account in this work) are expected to have relevant consequences mainly concerning the choice of optimal
. Specifically, considering that the PSRE output mirrors Mm1p-Mm2p and Mm1n-Mm2n have output devices with large multiplicity factors (
), their input capacitances are likely to be higher due to routing capacitances. However, rather than disrupting the PSRE performance, this would simply require an adjustment of the
value to achieve optimal transient waveforms (specifically, referring to
Figure 11 selecting the fastest rising/falling behavior not presenting any overshoot).
Finally, in
Table 5, the performances of the proposed design are thoroughly compared against recent state-of-the-art solutions implemented with 0.18-µm CMOS technologies. All compared designs are single-ended amplifiers driving large capacitive loads. To allow fair comparisons, two key figures of merit are employed: the settling-time figure of merit (FoMT =
) and the normalized charge efficiency (FoMN =
) [
42,
43]. The proposed SCB-PSRE achieves competitive performance with FoMT = 29.6 pF·µs
−1·µW
−1 and FoMN = 0.258 while driving a 1 nF load with just 4 µA current consumption. These results represent significant improvement over our previous work [
35] and compare favorably with other implementations, particularly considering the large input swing (80% of
) used in our simulations.
While the class-B implementation in [
4] demonstrates superior figures of merit through its innovative resistive compensation technique, our approach maintains important advantages in design simplicity and compliance with standard class-A OTAs. The SCB technique proves particularly valuable for display driver applications and switched-capacitor circuits, where it enables fast settling (2.56 µs) with modest power consumption (13.2 µW) and symmetrical performance for both rising and falling edges.