Next Article in Journal
Negative Photoconductivity in Ultranarrow-Gap Semiconductors
Previous Article in Journal
Synthesis, Antimicrobial and Anti-Inflammatory Activity of a Novel Styrylquinolinium Iodide Bearing a Naphthalene Moiety
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

CVD Monolayer MoS2 Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing

1
FG Nanotechnologie, Institut für Mikro- und Nanoelektronik, Institut für Mikro- und Nanotechnologien MacroNano®, Institut für Werkstofftechnik, TU Ilmenau, Postfach 100565, 98684 Ilmenau, Germany
2
FG Theoretische Physik 2, Institut für Physik, Institut für Mikro- und Nanotechnologien MacroNano®, TU Ilmenau, Postfach 100565, 98684 Ilmenau, Germany
*
Authors to whom correspondence should be addressed.
Crystals 2026, 16(2), 116; https://doi.org/10.3390/cryst16020116
Submission received: 16 December 2025 / Revised: 26 January 2026 / Accepted: 30 January 2026 / Published: 5 February 2026
(This article belongs to the Section Inorganic Crystalline Materials)

Abstract

Monolayer MoS 2 memtransistors offer gate-tunable hysteresis for neuromorphic reservoir computing, yet the role of operating window and fading-memory dynamics in CVD devices remains underexplored. We grow CVD monolayer MoS 2 , fabricate back-gated memtransistors, and use a single device as a time-multiplexed reservoir node for one-step Lorenz-63 prediction. Mobility, ON/OFF, hysteresis, and drift are quantified to identify stable, tunable bias regimes. We used a transistor with field-effect mobility on the order of 10   cm 2   V 1   s 1 , an ON/OFF ratio above 10 5 , and a moderate hysteresis window quantified by H 2.1   μ A·V at V DS   =   50 mV and H 17   μ A·V at V DS   =   500 mV over V GS [ 10 , 30 ] V. Performance is bias/memory-limited rather than FET-metric-limited. Sweeping gate-window and reservoir hyperparameters shows an optimum at intermediate hysteresis with moderate drift. Performance improves when the input clock matches the fading-memory time, achieving normalized root mean square error (NRMSE) = 0.09 for one-step Lorenz-63 x-prediction. Device-level statistics (discussed in the main text) show that, despite substantial scattering in electrical parameters, the resulting device-to-device NRMSE variation remains very small under fixed operating conditions. Classical FET metrics are not limiting here; NRMSE improvement instead requires engineering the hysteresis spectrum and gate stack. The demonstration of Lorenz-63 prediction using CVD-grown monolayer MoS 2 memtransistors highlights their potential as a wafer-scalable platform for compact chaotic time-series predictions.

1. Introduction

Two-dimensional (2D) transition metal dichalcogenides, and in particular monolayer MoS 2 , offer a direct band gap, atomically sharp interfaces, and excellent electrostatic control at the ultimate thickness limit, which makes them attractive candidates to extend device scaling beyond silicon [1,2,3,4]. Chemical vapor deposition (CVD) enables wafer-scale synthesis of monolayer MoS 2 and integration into circuits. Alternative synthesis routes include molecular beam epitaxy [5,6], atomic layer deposition [7,8], sulfurization of Mo layers [9], reactive sputtering [10], pulsed laser deposition [11,12], Langmuir–Blodgett coating [13], and printing [14]. For CVD, the resulting films often contain grain boundaries, point defects, and substrate-induced disorder that alter transport and introduce pronounced hysteresis in field-effect characteristics [4]. Recent work has shown that growth recipes, nucleation promoters, and substrate choice strongly influence crystalline quality and defect landscapes in CVD MoS 2 [15], yet the consequences for non-conventional computing schemes are still only partially quantified.
In conventional MOSFET logic, threshold-voltage drift and hysteresis caused by charge trapping at the MoS 2 /SiO 2 interface, oxide traps, and adsorbates are treated as reliability problems that degrade noise margins and bias stability [16]. For CVD MoS 2 , several studies attribute the dominant contribution to traps at grain boundaries and at the semiconductor–oxide interface, consistent with the intrinsic-oxide-trap picture developed for few-layer and thin-film devices [16,17]. The same history dependence that limits digital performance can, however, be harnessed in memristive operation: gate-tunable MoS 2 memtransistors exploit hysteresis to implement non-volatile states, analog weight updates, and rate-dependent responses that are useful for neuromorphic computing [18]. In previous work we showed that hysteresis in CVD MoS 2 memristors can be quantified via an energy-normalized hysteresis area and that energy-efficient operation windows emerge from an interplay of trap dynamics, bias range, and device geometry [18].
Neuromorphic computing aims to process information using distributed nonlinear dynamics and local memory rather than explicit instruction sequences. Reservoir computing (RC) is a particularly hardware-friendly framework in which a nonlinear dynamical system (the reservoir) is driven by an input signal and only a linear readout is trained. The reservoir may be implemented in software using recurrent neural networks, or physically using optical, electronic, or mechanical systems with suitable nonlinearity and fading memory. For chaotic time-series prediction, such as the Lorenz-63 system, RC can reconstruct attractors and achieve long prediction horizons when reservoir dynamics are matched to task timescales and remain stable but sufficiently rich [19,20]. Recent works show that even small reservoirs can reconstruct chaotic attractors reliably and that topology and spectral properties strongly influence long-term prediction performance [21,22].
Physical reservoirs based on memristive devices promise compact and low-power implementations [19,23]. Numerous material systems have been explored for chaotic prediction or classification, including oxide-based memristors [24,25,26,27,28,29], phase-change devices [30,31], organic synaptic transistors [32,33], and LiNbO 3 -based dynamic reservoirs [34,35] for chaotic prediction or classification tasks. At the same time, 2D-material-based memtransistors are emerging as attractive candidates due to their scalable growth, tunable interfaces, and compatibility with back-end-of-line thermal budgets [23]. Yet most demonstrations of MoS 2 -based neuromorphic devices either rely on exfoliated or transferred flakes [36], or focus on synaptic plasticity and classification tasks without a quantitative link between growth-induced defects, device hysteresis metrics, and system-level RC performance [37].
Device-focused studies on MoS 2 typically report field-effect mobility, subthreshold slope, ON/OFF ratio, and a qualitative measure of hysteresis, but they rarely specify temporal response, fading-memory time constants, or stable operating windows needed to embed these devices into concrete neuromorphic or RC architectures. Conversely, system-level neuromorphic and machine-learning works usually prescribe abstract requirements such as nonlinearity, state dimensionality, and memory depth, without translating them into constraints on current densities, gate-voltage ranges, or switching speeds that a fabrication process can realistically deliver. Chaotic systems are most often treated in dimensionless units, leaving open how to map state variables to physical voltages and currents for a given device technology. As a result, it is difficult to answer seemingly simple questions such as whether a given MoS 2 memtransistor, grown under a particular CVD recipe, is suitable as a reservoir node for a specified task and under which bias conditions.
In this work we address part of this gap by linking CVD growth, device-level hysteresis, and reservoir computing performance in a single, experimentally realized platform. We grow monolayer MoS 2 by atmospheric-pressure CVD on p + -Si/SiO 2 , fabricate back-gated memtransistors, and use a single device as a nonlinear elements in a time-multiplexed reservoir that performs one-step prediction of the Lorenz-63 x-component. We first characterize the crystalline quality of the CVD MoS 2 using atomic force microscopy (AFM), Raman spectroscopy, and photoluminescence (PL), confirming monolayer thickness. We then extract key device figures of merit from transfer and output characteristics. Finally, we implement a Lorenz-63 RC pipeline in which the memtransistor is driven by a masked, time-multiplexed input sequence, and a linear readout is trained by ridge regression to predict the next time step. By systematically varying the gate-voltage window, input scaling, and number of virtual nodes, we map NRMSE for Lorenz-63 prediction to experimentally accessible device parameters.
We find that intermediate hysteresis and moderate drift, corresponding to a specific range of gate biases and input amplitudes, maximize state richness and yield-normalized root mean square errors down to normalized root mean square error NRMSE 0.09 for one-step prediction of the Lorenz-63 x-component. Lower hysteresis reduces nonlinearity, while excessive hysteresis or strong drift degrades memory and leads to unstable trajectories. These results provide a concrete example of how CVD-grown monolayer MoS 2 memtransistors can be engineered and biased to serve as physical reservoir nodes for chaotic time-series prediction. These results provide a concrete example of how CVD-grown monolayer MoS 2 memtransistors can be engineered and biased to serve as physical reservoir nodes for chaotic time-series prediction: despite only moderate transistor figures of merit (field-effect mobility of order 10   cm 2   V 1   s 1 and ON/OFF > 10 5 ), trap-mediated fading memory on 1–4 s time scales enables Lorenz-63 one-step prediction down to NRMSE 0.09 under time-multiplexed operation (baseline NRMSE 0.46 without multiplexing). Because the platform relies on atmospheric-pressure CVD and a standard solid-state gate stack rather than transferred flakes or electrolyte gating, it is compatible with wafer-scale replication and array-style space multiplexing, which is essential for scalable and cost-effective neuromorphic hardware.

2. Materials and Methods

2.1. CVD Growth of Monolayer MoS 2

Monolayer MoS 2 was grown by chemical vapor deposition (CVD) in a horizontal two-zone quartz-tube reactor operated at atmospheric pressure with nitrogen ( N 2 ) as the carrier gas as demonstrated in Figure 1a. Degenerately doped p + -Si/SiO 2 substrates with a 90 nm thermally grown SiO 2 layer served as both the growth substrate and the global back gate in the final devices.
Prior to growth, the substrates underwent a standard solvent cleaning (acetone, isopropanol, isopropanol in ultrasonic bath, and deionized water), followed by drying with N 2 . The substrates were then treated in an O 2 plasma at 300 W for 10 min to remove residual organic contamination and activate the SiO 2 surface. Immediately after the plasma step, an aqueous KCl solution with a concentration of 0.01 M was dispensed onto the substrates and spread by spin coating at 4000 rpm. The coated substrates were subsequently dried on a hot plate at 80 °C for 1 min. This KCl layer acts as a seeding promoter for MoS 2 domain nucleation on SiO 2 .
Solid sulfur (S) and molybdenum trioxide ( MoO 3 ) powders were used as precursors. A total of 400 mg of S powder was loaded into a graphite crucible placed upstream in the low-temperature zone of the reactor, while 4 mg of MoO 3 powder and the KCl-treated p + -Si/SiO 2 (90 nm) substrates were placed side by side in the central hot zone of the furnace. The MoO 3 crucible and the substrates thus experienced the same nominal temperature profile. Sulfur solid source was heated indirectly by an external halogen lamp focused on the graphite crucible and maintained at approximately 140 °C during the growth, providing a stable S vapor flux into the reaction zone.
Before deposition the reactor was purged with N 2 at 1000 sccm for 1 h at room temperature to remove residual air and moisture. For the growth step, the N 2 flow was reduced to 500 sccm and kept constant throughout the temperature ramp, hold, and cool-down. The measured temperature profile during the CVD is shown in Figure 1b. Starting from room temperature, the furnace was ramped to 800 °C in 18 min. After reaching 800 °C, the temperature was held for 2 min to allow MoS 2 nucleation and lateral domain growth on the SiO 2 surface. At the end of the hold step, the furnace heating was switched off and the system was allowed to cool naturally to room temperature under continuous N 2 flow. Under these conditions, the growth yields predominantly monolayer MoS 2 on SiO 2 , as confirmed by the structural and optical characterization.

2.2. Memtransistor Fabrication

MoS 2 memtransistors were fabricated on the CVD-grown monolayer MoS 2 /p + -Si/SiO 2 (90 nm) substrates using optical lithography and reactive ion etching (RIE) followed by metal contact definition. All photolithography steps were performed on a maskless aligner MLA150 (Heidelberg Instruments Mikrotechnik GmbH, Heidelberg, Germany.).
In the first lithography step, the as-grown MoS 2 film was patterned into isolated channel regions. A positive-tone photoresist AZ 1518 was spin coated onto the substrates and exposed using the MLA150 to define the MoS 2 areas to be preserved. After development, the exposed MoS 2 was etched by RIE using the patterned AZ 1518 as an etch mask. Immediately after the MoS 2 etch, an O 2 descum step was carried out in the same chamber. To minimize damage to the remaining MoS 2 during resist stripping, the polymerized AZ 1518 was removed in a two-stage wet process: first, the samples were immersed in acetone for 10 min, followed by AZ 100 remover at 60 °C for 10 min, and then a second AZ 100 remover step for 5 min at room temperature. Finally, the substrates were rinsed in isopropanol and deionized water and dried with nitrogen.
Source and drain contacts were defined in a second photolithography step. A negative-tone photoresist AZ LNR-003 was spin coated and patterned to open contact pads and channel regions in the resistor. Gold contacts with a thickness of 100 nm were then deposited by electron-beam evaporation (CS400ES, VON ARDENNE GmbH, Dresden, Germany). No additional adhesion or capping layers were used in this process. After metallization, lift-off was carried out using the PS 3121 photoresist stripper (Intelligent fluids GmbH, Leipzig, Germany), which selectively removed the remaining AZ LNR-003 and the overlying metal, leaving well-defined Au source and drain electrodes contacting the MoS 2 channels. Channel geometries and additional process parameters are listed with the corresponding device statistics in and in the section “Geometrical Definition and Tolerances”.
As shown in Appendix D, the device architecture and fabricated layout are illustrated by a schematic cross-section and representative SEM micrographs of the patterned memtransistor structures.

Geometrical Definition and Tolerances

All memtransistors in this work were defined by the same mask layout (no intentional geometry variation) with nominal channel length L   =   2   μ m and width W   =   10   μ m . Because the channel is lithographically defined, the dominant geometrical uncertainties arise from mask-to-mask alignment accuracy, resist development bias, and contact edge definition. Based on the lithography tool specifications and optical inspection of representative structures across the chip, we estimate the effective dimensional tolerances to be within Δ L / L 5 % and Δ W / W 5 % for the present device set (i.e., sub-micrometer deviations around the nominal values). The source/drain electrodes were designed with a fixed overlap to the channel region to ensure robust contacting. The back-gate oxide thickness was fixed by the starting substrate with 90 nm SiO 2 thickness.

2.3. Structural and Electrical Characterization of MoS 2 Memtransistors

The as-grown MoS 2 films were first examined by atomic force microscopy (AFM) to confirm the surface morphology and layer thickness. AFM measurements were performed on a Dimension V system (Veeco Instruments Inc., Plainview, NY, USA) operated in tapping mode. Height and phase images were recorded over scan areas in the 25 μm range on multiple regions of each sample. The apparent step height between MoS 2 domains and the surrounding SiO 2 surface was used to verify monolayer thickness, while the absence of significant multilayer islands within the device channels was checked by combining height and phase contrast.
Raman and photoluminescence (PL) spectroscopy were used to further confirm the monolayer character and crystalline quality of the films. Spectra were acquired with a Alpha300 apyron Raman imaging microscope (WITEC GmbH, Kroppach, Germany) using a 532 nm excitation laser and an excitation power of 0.5 mW at the sample. Raman and PL maps were collected across representative device areas to assess spatial uniformity. The relative positions and intensities of the characteristic MoS 2 Raman modes, together with the strong direct-gap PL response, were used to identify monolayer regions.
Electrical measurements of the MoS 2 memtransistors were carried out using a shielded probe station integrated with a Keithley 4200-SCS Semiconductor Characterization System. All measurements reported in this work were performed in a nitrogen atmosphere at room temperature to minimize adsorption-related drifts and hysteresis. Transfer characteristics ( I D V GS ) and output characteristics ( I D V DS ) were recorded for multiple devices. Unless stated otherwise, the drain voltage was fixed at V DS   =   50 mV for transfer measurements, while the back-gate voltage was swept quasi-statically over the range relevant for reservoir computing operation. All electrical measurements were performed in a shielded probe station under a dry nitrogen atmosphere to minimize the influence of ambient humidity and adsorbates. Measurements were conducted under standard laboratory conditions (room temperature) with the device kept under nitrogen purge during transfer-curve, fading-memory, and time-series experiments.
The accuracy of the applied electrical biases is set by the source-measure units of the Keithley 4200-SCS; the resulting uncertainty in V GS and V DS is negligible compared to the voltage windows used throughout this work. The drain-current readout noise floor is far below the μ A-level currents relevant for the reported transfer curves and hysteresis metrics. Consequently, the dominant experimental uncertainties in extracted quantities arise from sweep discretization and numerical post-processing: the hysteresis area H is limited by the V GS step size and current noise propagation through numerical integration, while g m is limited by numerical differentiation and smoothing choices.
Figures of merit were extracted from the I D V GS curves at V DS   =   50 mV. The field-effect mobility was estimated in the linear regime using the channel geometry and the SiO 2 gate-oxide capacitance per unit area. The threshold voltage V TH was obtained by linear extrapolation of the I D V GS characteristics in the above-threshold region, and the ON/OFF current ratio was defined as the ratio of I D at the chosen ON- and OFF-state gate voltages within the accessible bias window.

2.4. Fading Memory: Definition and Experimental Extraction

Fading memory is the property that the influence of past inputs on the present device state decays with increasing delay. For a driven memtransistor used as a physical reservoir, this means that two input histories that differ only in the distant past produce reservoir states that converge in time. Here, we quantify fading memory directly from electrical measurements by estimating a characteristic decay time, denoted as the fading-memory time constant τ FM .
To quantify the intrinsic relaxation dynamics under the operating conditions used for reservoir computing, we estimated characteristic time constants from a drain-current decay trace recorded at fixed bias V GS   =   25   V and V DS   =   0.5   V . The measured I D ( t ) exhibits a gradual decrease over tens of seconds, consistent with bias-stress relaxation commonly associated with charge trapping/detrapping in the dielectric and at the MoS 2 /SiO 2 interface. Because the decay is not well described by a single exponential over the full acquisition window, we fit the data using a bi-exponential model from Equation (1):
I D ( t )   =   A 1 exp t t 1 + A 2 exp t t 2 + I 0 ,
where A 1 and A 2 are amplitudes, t 1 and t 2 are relaxation time constants, and I 0 is the long-time offset current. The resulting time constants capture a fast component ( t 1 , seconds) and a slow component ( t 2 , tens of seconds), which we use as an experimental proxy for the time scales that bound the usable fading memory in time-multiplexed operation. The full set of fitted parameters and uncertainties is provided in the Appendix B.

2.5. Reservoir Computing Protocol with MoS 2 Memtransistors

We implemented reservoir computing using single MoS 2 memtransistors driven by a scalar chaotic input derived from the Lorenz-63 system. The Lorenz dynamics [38] are defined by
d x d t   =   c 1 ( y x ) ,   d y d t   =   x ( c 2 z ) y ,   d z d t   =   x y c 3 z ,
with c 1   =   10 , c 2   =   28 , and c 3   =   8 / 3 , a shown in Figure 2a. The equations were integrated numerically using a python ODEint with a time step of 0.001 s to generate a long trajectory in the attractor. After discarding the initial transient, the x ( t ) component (denoted X in Figure 2b) was sampled at a uniform time-step to form a one-dimensional discrete-time sequence { X i } and used as the input signal with a varied time step. The integration and sampling parameters were chosen such that the dataset covered many visits to both lobes of the attractor. The full time series and scripts are provided in [39,40].
Because the raw Lorenz-X values do not match the gate-voltage range required for safe device operation, we applied an affine normalization to map X i into a chosen gate window. First, the samples were linearly rescaled to the unit interval,
X normalized   =   X i X min X max X min ,
where X min and X max are the minimum and maximum of the training portion of the sequence. The normalized values were then mapped to the gate range [ V GS , min , V GS , max ] as
V GS , i   =   V GS , min + V GS , max V GS , min   X normalized .
The specific voltage windows used in the non-time-multiplexed and time-multiplexed regimes (A–H) are summarized in Table A1. During measurements, the drain-to-source voltage V DS and the sampling time interval Δ t were kept constant within each regime, while the back gate was driven by piecewise-constant pulses corresponding to the sequence { V GS , i } .
Non-time-multiplexed (non-TM) operation was realized by directly sampling the memtransistor drain current I DS at the end of each gate pulse. For a given regime, the Lorenz-X sequence was applied as a train of V GS steps with dwell time Δ t and fixed V DS , and the stabilized current at the end of each dwell interval was recorded as the scalar reservoir state. We investigated four such configurations (A–D), which differ in V DS , Δ t , and the chosen gate window, corresponding to different trade-offs between subthreshold sensitivity, channel conduction, and hysteresis. The resulting current sequence was later aligned with the input sequence to form pairs of input states and targets for training and testing.
To increase the effective state dimensionality, we also implemented time-multiplexed (TM) operation with binary masks. In this case, the input sequence was first normalized to a symmetric range X c [ n ] [ 1 , 1 ] at a macro-step cadence indexed by n. Each macro-step was then expanded into N virtual nodes by driving the gate with a masked waveform
V GS [ n , k ]   =   V center + A   X c [ n ]   m k ,       k   =   1 , , N ,
where V center is the center of a narrow, device-safe gate window, A is the modulation amplitude, and m k { 1 , + 1 } is a Rademacher mask drawn once with a fixed pseudo-random seed. Within each macro-step of duration T H   =   N   Δ t , the gate was held at each V GS [ n , k ] for a dwell time Δ t , and the corresponding drain current I DS [ n , k ] was sampled at the end of the dwell. This produced N virtual nodes per macro-step. In the TM configurations (E–H), the device state for node k was defined as
z [ n , k ]   =   log 10 I DS [ n , k ] + I 0 ,
with I 0   =   10 11 A added to suppress numerical issues near the noise floor. In some regimes, a one-window lag was included by concatenating the present and previous window states, as detailed in Table A1.
For each macro-step n, the node states were collected into a window vector
s n   =   z [ n , 1 ] ,   z [ n , 2 ] ,   ,   z [ n , N ] R N ,
and the final feature vector for readout was formed as
x n   =   s n , non - lagged   TM   regime , s n , s n 1 , TM   regime   with   + 1   lag ,
after discarding a short washout of initial windows. The target for each macro-step was the Lorenz-X value at the next macro-step, y n   =   X [ n + 1 ] , so that the memtransistor reservoir performs one-step-ahead prediction at the macro cadence.
The linear readout was trained by ridge regression on standardized features. The sequence of windowed pairs { ( x n , y n ) } was split chronologically into training and test sets (e.g., 70/30 split), without shuffling, to respect the temporal structure of the data. On the training set, we computed the per-feature mean μ and standard deviation σ and standardized both training and test features accordingly. A bias term was included by augmenting the standardized feature matrix with an intercept column. The readout weights were obtained by minimizing a Tikhonov-regularized least-squares objective with penalty parameter chosen by grid search on the training set. Performance was evaluated on the held-out test set using the normalized root-mean-square error (NRMSE) and the coefficient of determination R 2 , defined as
RMSE   =   1 T t   =   1 T y t y ^ t 2 ,
NRMSE   =   RMSE σ y ,   σ y   =   1 T t   =   1 T y t y ¯ 2 ,
R 2   =   1 t   =   1 T y t y ^ t 2 t   =   1 T y t y ¯ 2 ,
where y t and y ^ t denote the true and predicted Lorenz-X values on the test set, y ¯ is their mean, and T is the number of test samples. The detailed hyperparameters and the full measurement logs for regimes A–H are provided in Table A1 and in the open dataset and code repository [39,40].
In many experimental forecasting scenarios, only a subset of the underlying dynamical state is available to the learning system (partial observation). In this setting, the instantaneous input at a single time step does not uniquely define the state of the task system, and accurate prediction requires access to recent history. Reservoir computing naturally addresses partial observation because the reservoir state depends on past inputs and can therefore act as an implicit delay embedding of the observed signal [41,42]. In the context of chaotic time-series prediction, this perspective clarifies why the relevant device requirement is not maximizing conventional transistor figures of merit, but matching the nonlinear fading-memory dynamics (memory depth and response timescales) to the sampling interval and the task dynamics.

3. Results

3.1. Crystal Growth and Monolayer Quality

The CVD process described in Section 2.1 yields discrete triangular MoS 2 domains with an average lateral size of about 50   μ m , which is large enough to accommodate the 10   μ m   ×   2   μ m transistor channels fully within a single flake. An optical micrograph of a representative chip after growth is shown in Figure 3a. Individual triangular flakes with lateral dimensions of tens of micrometres are visible on the SiO 2 surface, and the regions selected for device fabrication were chosen such that the entire channel resides inside a single monolayer domain.
The surface morphology and layer thickness were examined by AFM. Figure 3b presents a tapping-mode AFM height image of a typical MoS 2 flake, and Figure 3c shows a line profile across the MoS 2 /SiO 2 step. The apparent step height between the flake and the surrounding SiO 2 lies in the range of 0.7–0.9 nm, consistent with monolayer MoS 2 on SiO 2 when tip convolution and adsorbates are taken into account. Within the channel regions, we did not detect extended areas with multiples of this step height, indicating that the active devices are formed predominantly within monolayer flakes rather than multilayer aggregates or overlapped domains.
Raman spectroscopy confirms the monolayer character of the CVD MoS 2 and provides a first handle on strain, doping, and crystallinity (Figure 3d). The spectrum, measured with 532 nm excitation at 0.5 mW power, exhibits the characteristic in-plane E 2 g 1 and out-of-plane A 1 g modes. Single-Gaussian fits to the two main peaks yield positions of 385.0   cm 1 ( E 2 g 1 ) and 402.6   cm 1 ( A 1 g ), with full widths at half maximum (FWHM) of 3.7   cm 1 and 7.4   cm 1 , respectively. The peak separation Δ ω   =   ω A 1 ω E 2 GS 1 is 17.6   cm 1 . For exfoliated monolayers on SiO 2 , reported separations are typically in the 19– 20   cm 1 range, whereas thicker layers show larger Δ ω and broader A 1 g lines [1,43]. Our slightly reduced separation and modest A 1 g broadening therefore point to monolayer MoS 2 with a small tensile strain component and moderate n-type doping, but without strong degradation of crystallinity. The A 1 g intensity is only slightly higher than that of E 2 g 1 , with an amplitude ratio I A 1 / I E 2 GS 1 1.05 , which is comparable to values reported for high-quality monolayers and markedly different from heavily oxidized or disordered films where A 1 g broadens and dominates the spectrum [43].
Photoluminescence (PL) spectroscopy further corroborates the monolayer assignment and reveals the excitonic structure (Figure 3e). A three-Gaussian deconvolution of the PL spectrum in the 1.7–2.1 eV range identifies the A trion, A exciton, and B exciton contributions. The fitted peak energies are 1.80 eV for the A trion, 1.84 eV for the A exciton, and 1.96 eV for the B exciton. The corresponding FWHM values are approximately 0.12 eV (A trion), 0.09 eV (A exciton), and 0.12 eV (B exciton). The intensity ratios, normalized to the A exciton amplitude, are I T / I A 0.81 and I B / I A 0.29 . Canonical monolayer MoS 2 on SiO 2 exhibits a dominant A exciton at ∼1.85–1.90 eV, a weaker B exciton around 2.0 eV, and a trion shoulder whose strength scales with electron density [1,2]. Our spectra match this pattern: the strong direct-gap A exciton with sub-0.1 eV linewidth indicates relatively low inhomogeneous broadening and good crystalline quality, while the finite trion contribution is consistent with moderate n-type doping in the 10 12 10 13 cm 2 range, as commonly observed in CVD-grown monolayers [1].
The main Raman and PL parameters extracted from these fits are summarized in Table 1. Compared with reference data for exfoliated and CVD-grown monolayers, our values fall in the range associated with structurally intact, lightly doped monolayer MoS 2 , rather than strongly strained or defective material [1,2,43]. Together with the AFM and optical data, this supports the conclusion that the transistor channels are defined within high-quality monolayer flakes, providing a reproducible materials baseline for the memtransistor and reservoir computing experiments discussed in the following sections.
These structural and optical signatures demonstrate that the active regions of the devices are formed within isolated monolayer MoS 2 flakes of well-controlled thickness, strain, and excitonic response. This provides a reproducible materials baseline for the electrical characterization and reservoir computing experiments presented in the subsequent sections.

3.2. Memtransistor Characteristics and Hysteresis

We next quantify the DC characteristics of a representative CVD MoS 2 memtransistor and the strength of its hysteresis. Figure 4 shows the electrical response of a back-gated device with channel length L   =   2   μ m and width W   =   10   μ m on 90 nm SiO 2 . All measurements are performed in a nitrogen atmosphere at room temperature. The output characteristics I D ( V DS ) under different gate biases span drain currents from the sub-nanoampere range at V GS   =   0   V up to 1.1 × 10 4   A at V GS   =   40   V and V DS   =   20   V . At low drain bias the IV curves are nearly linear, with a small-signal on-state resistance on the order of 10 5   Ω at the highest gate bias, and then gradually evolve towards quasi-saturation as | V DS | increases, as expected for monolayer MoS 2 transistors on SiO 2 .
The transfer characteristics I D ( V GS ) at V DS   =   50 and 500   mV show n-type field-effect behavior with a clear turn-on around positive gate voltages. For the forward branch, the drain current at V DS   =   50   mV increases from ∼ 10 11   A at V GS   =   10   V to 1.9 × 10 6   A at V GS   =   30   V . At V DS   =   500   mV the corresponding current reaches 1.9 × 10 5   A . Depending on whether the forward or reverse branch is used for normalization, the ON/OFF ratio thus lies in the 10 5 10 6 range. Using the standard linear-regime expression,
μ FE   =   L W C ox V DS   d I D d V GS ,
with C ox 3.8 × 10 4   F   m 2 for 90 nm SiO 2 , and fitting the forward I D ( V GS ) branch between V GS   =   18 and 24   V at V DS   =   50   mV , we obtain a field-effect mobility μ FE 10   cm 2   V 1   s 1 . Linear extrapolation of the same fit to I D   =   0 yields a threshold voltage V TH 13   V . Both values are typical for back-gated CVD monolayer MoS 2 devices on 90 nm SiO 2 and indicate that the device remains transistor-like despite the presence of hysteresis.
Hysteresis appears as a systematic separation between the up- and down-sweep I D - V GS traces over the full 10 to 30   V gate window as shown in Figure 4b. We quantify this memory effect by the gate-sweep hysteresis loop area:
H   =   V G S , min V G S , max I up ( V GS ) I down ( V GS )   d V GS .
where I u p and I d o w n denote the currents for the forward and backward sweeps, respectively, and here V G , min   =   10   V and V G , max   =   30   V . Numerical integration of the measured traces on a 0.2 V grid gives H 2.1   μ A · V at V DS   =   50   mV and H 17   μ A · V at V DS   =   500   mV for the device used in the subsequent reservoir computing experiments. The area is dominated by the high-current region above 10   V , while the deep-off regime at negative gate voltages contributes negligibly to H.
An independent view of the memory window comes from output hysteresis at fixed gate bias (Figure 4d). Using
H   =   0 20   V I up ( V DS ) I down ( V DS )   d V DS .
we obtain loop areas of approximately 61   μ A · V , 0.90   mA · V , and 1.8   mA · V for V GS   =   20 , 30, and 40   V , respectively. The corresponding high-resistance to low-resistance state ratios at a small read bias are R HRS / R LRS 91 and 42 at V GS   =   0   V (for V DS   =   50 and 500   mV ), decreasing to 2.2 and 1.93 at V GS   =   10   V and approaching unity at V GS   =   30   V . Thus, the largest state contrast is available when reading near the subthreshold regime, while the hysteresis becomes almost invisible once the channel is strongly accumulated.
Recapitulating, the CVD MoS 2 memtransistor combines a technologically relevant mobility of order 10   cm 2   V 1   s 1 , an ON/OFF ratio above 10 5 , and a moderate but well-defined hysteresis window in both transfer and output characteristics. This balance between transistor performance and memory effect is essential for later use as a nonlinear, history-dependent element, but it also ensures that the device remains representative of CVD-grown MoS 2 field-effect transistors more broadly, keeping the focus of the manuscript on growth and device-level behavior. Device-to-device statistics for H, g m , max and μ FE , max extracted from the full transfer-curve dataset are summarized in Appendix C (Figure A2).

3.3. Lorenz-63 Prediction Without Time-Multiplexing

We first establish a non-time-multiplexed baseline where the Lorenz-63 signal directly drives a single MoS 2 memtransistor. The scalar Lorenz-X trajectory, preprocessed and mapped to the gate window as described above, is converted into a sequence of gate-to-source voltages V G S ( t ) , while the drain-to-source voltage V D S is held constant within each regime (Table A1). The drain current I D S ( t ) thus realizes a scalar, history-dependent nonlinear mapping of the chaotic input. After logarithmic transformation and standardization, we train a linear ridge-regression readout to perform one-step prediction of the Lorenz-X signal; performance is quantified by NRMSE and R 2 on a held-out test set.
Measurement results (purple dots) and applied gate-to-source voltage (yellow lines) inputs are presented in Figure 5. In the first pair of regimes (A and B, Figure 5a and Figure 5b, respectively) we probe a wide gate window that covers subthreshold and above-threshold operation. The normalized Lorenz-X is mapped to V G S [ 0   , 30 ] V, and the device is biased either at V D S   =   0.5 V with sampling interval Δ t   =   0.5 s (regime A) or at V D S   =   3 V with Δ t   =   2 s (regime B). At these settings the memtransistor output current spans several decades, from the subthreshold noise floor up to the microampere range. The best non-time-multiplexed predictions for these two regimes yield NRMSE 0.60 and R 2 0.64 (regime A) and NRMSE 0.61 with R 2 0.62 (regime B) for a 60/40 chronological train-test split. These values indicate that the device captures some structure of the Lorenz dynamics but leaves a large fraction of the variance unexplained.
To better exploit the steep part of the transfer characteristic, we confine the gate window to V G S [ 10 , 40 ] V in regimes C (Figure 5c) and D (Figure 5d) while keeping V D S   =   3 V. Regime C uses a relatively slow sampling interval Δ t   =   2 s, whereas regime D samples faster at Δ t   =   0.1 s. This focuses the input on the region of highest transconductance and reduces the time available for slow drift between samples. The best results in these regimes, obtained for a 70/30 split, improve to NRMSE 0.59 and R 2 0.65 (regime C) and NRMSE 0.46 with R 2 0.79 (regime D). Thus, concentrating V G S on the high-gain region and shortening Δ t both enhance the useful nonlinearity and effective memory encoded in I D S ( t ) , but the prediction quality remains well below the level typically targeted for high-fidelity Lorenz forecasting.
Overall, the non-time-multiplexed experiments confirm that a single CVD MoS 2 memtransistor, driven directly by a Lorenz-derived V G S ( t ) waveform, can provide a reproducible nonlinear transformation with moderate short-term memory. At the same time, the achievable NRMSE and R 2 in regimes A–D remain limited by the scalar nature of the node, the finite signal-to-noise ratio in the subthreshold range, and drift on the time scales of the measurement. We therefore treat these results as a device-level baseline and, in the next subsection, introduce time-multiplexed operation to increase the effective state dimensionality without modifying the device or growth process.

3.4. Time-Multiplexed Reservoir Operation

We next introduce time-multiplexed operation to increase the effective state dimensionality of the reservoir without changing the underlying device or growth process. Instead of using each Lorenz-X sample once, as in regimes A–D, we expand every macro-step into N short sub-steps of duration Δ t . Within each macro-step j, the Lorenz-derived scalar is first mapped into the gate window and then multiplied by a fixed Rademacher mask m k { 1 , + 1 } , k   =   1 , , N , to generate a masked waveform V G S ( t ) . The drain voltage V D S is held constant, and the resulting drain current I D S ( t ) is sampled at the end of each sub-step, yielding N virtual nodes per macro-step.
The left axis in Figure 6 shows typical masked V G S ( t ) traces (yellow lines), while the right axis shows the corresponding I D S ( t ) response for the four time-multiplexed regimes E–H (purple dots). The Lorenz-63 prediction performance for all regimes is summarized in Table A1. All time-multiplexed measurements are performed at V D S   =   3   V and in a narrow gate window V G S ( 31 ,   35 )   V , that is, close to the high-transconductance region of the transfer characteristic. This choice minimizes the influence of deep subthreshold noise and strongly accumulated on-state drift, and it keeps the instantaneous currents in a technologically reasonable range.
Regime E maximizes the number of virtual nodes under these conditions. We use N   =   112 nodes with Δ t   =   0.10   s , which gives a macro-step duration T H   =   N Δ t   =   11.2   s , and record M   =   36 macro-steps. The raw currents I D S are standardized node-wise (per-node z score) and used directly as the reservoir state. This configuration yields the largest state vector but also exposes the device to the longest effective memory window. Consistent with that, the test NRMSE is highly split-dependent: for a 60/40 split it remains high (NRMSE   =   0.85 , R 2   =   0.27 ), whereas for an 80/20 split it improves to NRMSE   =     0.19 and R 2   =   0.98 (Table A1, label E), indicating that long-term drift and low-frequency fluctuations limit the useful fading memory when T H approaches tens of seconds.
Regime F shortens the macro-step and compresses the current dynamic range. Here, we use N   =   16 and Δ t   =   0.10   s , so that T H   =   1.6   s , and increase the number of macro-steps to M   =   256 . Instead of raw current we feed log 10 ( I D S ) into the readout, which improves numerical conditioning and reduces the weight of rare high-current excursions. Across all chronological splits, the resulting test errors are low and relatively stable; the best result (60/40 split) reaches NRMSE   =   0.088 with R 2   =   0.99 (label F). This shows that, once T H is comparable to the intrinsic fading-memory time of the device and the state distribution is well conditioned, even a modest number of virtual nodes can support accurate short-horizon Lorenz-63 prediction.
Regime G increases the memory window at fixed dimensionality. We keep N   =   16 nodes but increase the sub-step duration to Δ t   =   0.25   s , so that T H   =   4.0   s , again with M   =   256 . As in regime F we use log 10 ( I D S ) but now augment the reservoir state with a one-step lag: the readout sees both the current and previous macro-step states. This explicit lag partially compensates for the shorter physical memory of the device. The best test metrics (60/40 split) reach NRMSE   =   0.16 and R 2   =   0.97 (label G). The time traces in Figure 6c show that this configuration still samples a broad range of I D S values while remaining less sensitive to slow drift than regime E.
Finally, regime H explores a larger state dimension at an intermediate memory window. We double the number of nodes to N   =   32 at Δ t   =   0.10   s , giving T H   =   3.2   s and M   =   128 , and retain both the log 10 ( I D S ) representation and the one-step lag. An explicit bias term (intercept) is included in the linear regression. The best result for regime H (80/20 split) achieves NRMSE   =   0.10 and R 2   =   0.99 (label H), comparable to regime F. Compared with N   =   16 , doubling N yields only a modest improvement, consistent with the expectation that, once the reservoir dimension exceeds the effective dimensionality of the task and the physical memory window, additional virtual nodes offer diminishing returns.
The numerical trends in Table A1 are reflected directly in the reconstructed Lorenz-63 trajectories. Figure 7 compares ground-truth and predicted Lorenz-X time traces for all regimes A–H, using the best-performing chronological split in each case. In the non-time-multiplexed configurations A–D (Figure 7a–d), the training segments can be fitted reasonably well, but the test trajectories show visibly damped oscillations and phase errors, consistent with the relatively high NRMSE of 0.46–0.61. In contrast, the time-multiplexed regimes F and H (Figure 7f,h) yield test traces that track both the amplitude and phase of the Lorenz dynamics over several oscillation periods before diverging, in line with their lower errors NRMSE 0.09 –0.10. These qualitative differences support the view that tuning the effective memory window T H , the number of virtual nodes N, and the current preprocessing is more important for RC performance than further incremental improvements in conventional FET metrics for the present devices.
Taken together, regimes E–H demonstrate that time multiplexing allows a single CVD MoS 2 memtransistor to reach NRMSE values near 0.09 and R 2 0.99 for short-horizon Lorenz-63 prediction, while keeping all device-level conditions (growth, contacts, and oxide stack) fixed. From a device perspective, the comparison highlights three trends. First, excessively long effective memory windows T H amplify drift and low-frequency noise and can degrade performance despite large N. Second, logarithmic current scaling is beneficial once the device spans several decades of I D S , because it balances on- and off-state contributions. Third, modest architectural tweaks such as lagged states can recover part of the effective memory without changing the physical device. We therefore view time-multiplexed RC here primarily as a characterization protocol for the nonlinear, history-dependent response of CVD-grown MoS 2 memtransistors, rather than as a fully optimized neuromorphic system.

4. Discussion

4.1. Benchmarking Against Existing Hardware Reservoirs

The best time-multiplexed regimes demonstrate that a single back-gated CVD MoS 2 memtransistor can reach one-step Lorenz-63 prediction errors on the order of 10 1 under realistic device-bias constraints. From Table A1, the optimal non-time-multiplexed configuration (regime D) saturates at NRMSE 0.46 and R 2 0.79 , whereas the best time-multiplexed settings (regimes F and H) reduce the error to NRMSE 0.09 –0.10 with R 2 0.99 . Thus, time multiplexing and simple pre-processing of I D S improve the prediction accuracy by roughly a factor of five relative to the direct, non-time-multiplexed baseline at the same drain bias, without modifying the device or growth stack. Table 2 consolidates the benchmarking of numerous experiments performed on the devices as well as simulations.
It is instructive to place these values alongside other hardware reservoirs that have tackled Lorenz-type tasks. State-of-the-art numerical reservoirs such as next-generation RC and symmetry-aware RC reach NRMSE values around 10 3 10 4 for Lorenz-63 short-term forecasting and variable inference in simulation, and optimized echo state networks can achieve comparable errors for multi-step autoregressive prediction [21]. Experimental photonic implementations of next-generation RC report NRMSE in the 10 2 range for Lorenz-63, similar to other optical reservoirs that trade device complexity for low error and long valid-prediction times [21]. Among memristive and ferroelectric platforms, dynamic CuInP 2 S 6 memories and α -In 2 Se 3 ferroelectric memtransistors have demonstrated Lorenz-type chaotic time-series prediction with reported errors between NRMSE 0.014 and 0.008 while operating at low voltages [49,50].
Closer to the present work, polymer electrolyte-gated MoS 2 transistors have recently been used as reservoir nodes for time-series processing, including Lorenz prediction [36]. In that platform, Li + intercalation drives a reversible 2H-1T phase transition that provides strong nonlinearity and long fading memory, and a time-multiplexed single-node reservoir achieved Lorenz NRMSE as low as ∼ 0.04 [36]. Our back-gated CVD MoS 2 memtransistor reservoir operates in a more conservative regime: V G S is confined to (31,35) V on 90 nm SiO 2 , V D S   =   3 V, and the device is read through a conventional solid-state gate stack without electrolytes, ionic motion, or phase transitions. Under these constraints the best errors, NRMSE 0.09 with R 2 0.99 , are about a factor of two above the polymer-electrolyte MoS 2 benchmark and an order of magnitude above the most optimized photonic or memristive reservoirs, but they are obtained on a standard CVD-grown, back-gated transistor structure.
From a device perspective this comparison highlights three points. First, the absolute RC performance of our platform is limited less by the crystalline quality of the monolayer and more by the choice of gate stack and biasing: dry SiO 2 confines the available nonlinearity and memory to trap-mediated charge storage, whereas ionic or ferroelectric media provide stronger history dependence at lower voltages. Second, within those constraints, the CVD MoS 2 memtransistor still attains NRMSE 0.09 as a single physical node with a purely linear readout and no network-level optimization or feedback. Third, because both the semiconductor and gate dielectric are wafer-process-compatible, the same device platform can in principle be scaled to multi-node arrays or 3D-integrated stacks without introducing new materials or processing steps.
We therefore view the Lorenz-63 benchmarks in this work not as an attempt to surpass the best reported RC error, but as a quantitative sanity check that links device-level metrics (mobility, hysteresis, and trap-mediated memory on sub- to multi-second scales) to an application-level figure of merit (NRMSE on a standard chaotic prediction task). In combination with the open time-series dataset and RC pipeline released with this work, this provides a reproducible baseline against which future MoS 2 growth, contact engineering, or gate-stack modifications can be judged in terms of their impact on both device characteristics and reservoir-computing performance.
We finally note that the Lorenz-63 benchmarks collected in Table 2 are not strictly equivalent in task difficulty to the configuration used here. Several of the best-performing numerical schemes operate in a “full observation” regime, where all three Lorenz coordinates ( x , y , z ) or at least two components are supplied to the model at each time step, and the readout learns the flow map X ( t ) X ( t + Δ t ) [20,21,44,45]. In our MoS 2 memtransistor implementation, the reservoir is driven by a single scalar input proportional to the x-component only, and the readout is trained to predict x ( t + Δ t ) from this one-dimensional observation and its fading memory. From a dynamical-systems perspective, this corresponds to forecasting from a partial observation rather than from the full state; the mapping x ( t ) x ( t + Δ t ) is generally many-to-one on the Lorenz attractor, so the reservoir must internally reconstruct the missing coordinates via its finite memory, in line with delay-embedding arguments for attractor reconstruction [21]. This makes the task more sensitive to noise and memory depth than full-state forecasts at the same sampling interval and reservoir size.
A second difference concerns the effective prediction horizon per step. Many Lorenz-63 RC studies integrate the equations with a small internal time step (for numerical accuracy) and then train on data sampled with Δ t 0.0001 0.02 [20,44]. In that regime, each one-step forecast covers only a short fraction of a Lyapunov time, and nearby trajectories separate relatively weak between successive samples, which typically leads to lower one-step NRMSE at fixed model capacity. In contrast, our best hardware configuration operates at a coarser sampling interval (here, Δ t   =   0.10 in Lorenz units), so that each prediction step spans a larger fraction of τ L . Because prediction errors in chaotic systems grow with elapsed time along the trajectory, a larger sampling interval intrinsically amplifies one-step errors and reduces the attainable NRMSE for a given reservoir size. The use of a single scalar input and a longer effective time step means that the NRMSE of the present MoS 2 memtransistor reservoir should be viewed as a conservative benchmark relative to numerical RC results obtained under full-state, finely sampled conditions.

4.2. Device Metrics and Design Window for MoS 2 Memtransistor Reservoirs

The experiments above show that, for the present CVD MoS 2 memtransistors, reservoir performance is governed more by how we bias and read the device than by further incremental improvements in classical FET metrics. The representative transistor combines a field-effect mobility of order 10   cm 2   V 1   s 1 , an ON/OFF ratio above 10 5 , and a moderate hysteresis window quantified by H 2.1   μ A · V at V D S   =   50   mV and H 17   μ A · V at V D S   =   500   mV over V G S [ 10 , 30 ]   V . Within this envelope, the non-time-multiplexed Lorenz-63 experiments in regimes A–D show that confining V G S to the high-transconductance region and shortening the sampling interval are more important for NRMSE than pushing mobility or ON/OFF higher. Form Figure 8 we can summarize that operating in V G S ( 10 , 40 )   V at V D S   =   3   V and Δ t   =   0.1   s (regime D) already reduces the error to NRMSE 0.46 , whereas wide gate windows that spend substantial time in deep subthreshold or strongly accumulated regimes perform worse despite similar static device figures of merit.
Time-multiplexed operation refines this picture. Regimes E–H keep the same physical device and gate stack but adjust the effective memory window T H   =   N Δ t , the number of virtual nodes N, and the preprocessing of I D S . Performance improves markedly when T H is made comparable to the intrinsic fading-memory time of the trap-mediated hysteresis, on the order of a few seconds, rather than being pushed to 11.2 s as in regime E. The best Lorenz-63 errors, NRMSE 0.09 –0.10 with R 2 0.99 in regimes F and H, occur for T H between 1.6 and 3.2 s, N   =   16 –32, and a narrow gate window V G S ( 31 ,   35 )   V at V D S   =   3   V , combined with a log 10 ( I D S ) representation and a simple one-step lag. In this bias range the device operates on the steep, nearly linear part of the transfer characteristic where small V G S variations are efficiently converted into current changes, while the hysteresis is strong enough to provide short-term memory but not so slow as to drift over the full duration of a macro-step. The stability of the best NRMSE across chronological splits in Table A1 supports the view that we are exploiting reproducible trap dynamics rather than uncontrolled long-term drift.
These observations suggest a practical design window for CVD MoS 2 memtransistor reservoirs. At the device level, it is sufficient to reach mobilities in the 10   cm 2   V 1   s 1 class and ON/OFF ratios above 10 4 10 5 ; further gains in these parameters are unlikely to translate into proportional reductions in NRMSE unless the hysteresis spectrum and gate stack are also engineered. More critical is to realize a controllable, intermediate hysteresis: large enough that R HRS / R LRS exceeds a few tens near the chosen read bias and that fading memory covers 1–4 s, but not so large that the device fails to wash out past inputs within the reservoir time horizon, defined by the macro-step duration of time-multiplexed reservoir. On the operation side, the results here point to narrow V G S windows around the high-transconductance region, moderate V D S (a few volts), and time-multiplexing parameters tuned so that T H matches the device’s intrinsic memory time. Within this window, the MoS 2 memtransistor behaves as a compact, wafer-process-compatible nonlinear node whose RC performance is acceptable to resolve trends when growth, contact engineering, or trap spectra are varied, while keeping the focus of the platform on scalable device physics rather than on RC optimization.

4.3. Stability, Reproducibility, and Scope of Structure–Performance Relations

All results reported here were obtained from a single CVD-grown chip processed in one fabrication run, thereby eliminating run-to-run variation and enabling a focused assessment of device-to-device reproducibility within a fixed process flow. In total, 65 transistor structures (monolayer CVD MoS 2 , L   =   2   μ m , W   =   10   μ m ) were patterned, contacted, and electrically characterized. Across this set, the hysteresis area H exhibits a broad distribution, consistent with device-to-device dispersion in trap populations and local electrostatics even on an optically uniform monolayer film, whereas the maximum transconductance and field-effect mobility show comparatively narrower spreads. To probe task-level reproducibility, we further recorded Lorenz-63 time-series measurements on n   =   17 devices selected from the higher-hysteresis subset under an identical input mapping and a fixed chronological 70/30 train/test split. Using log 10 ( | I D | ) as the reservoir state to mitigate slow baseline drift consistent with charge trapping/detrapping, the one-step prediction error shows low device-to-device scatter (range 0.58 0.61 ), motivating the selection of an average-performing device for the detailed demonstrations rather than an outlier. We note that the absolute NRMSE values in this multi-device dataset remain comparatively high because time multiplexing was not implemented for every device under the available measurement time; nevertheless, combining the same 17 devices as a space-multiplexed reservoir improves the one-step NRMSE to 0.57 , consistent with the expected benefit of increased state dimensionality.
Finally, we emphasize the scope of this work regarding structure–performance relationships. Devices were intentionally fabricated with a fixed channel geometry and monolayer thickness, and we therefore do not claim a systematic dependence of fading-memory time constants or prediction accuracy on thickness (1L vs multilayer) or geometry (L, W). These parameters are expected to influence response times through electrostatic scaling, trap occupancy, and RC time constants associated with charge trapping and dielectric relaxation, and they provide clear design parameters for the future studies.

4.4. Limitations and Outlook Toward CMOS-Compatible Neuromorphic Hardware

The present implementation has several clear limitations from a neuromorphic-hardware perspective. First, the back-gated geometry with 90 nm SiO 2 requires V G S up to 35–40 V and V D S   =   3 V, which is far above typical CMOS core voltages and is acceptable here only because we target a device-physics demonstration rather than an integrated circuit. Second, the relevant fading-memory times are on the order of 1–4 s and the sampling interval Δ t is at best 0.1 s, so the reservoir operates in a quasi-static regime set by trap dynamics and instrumentation rather than at radiofrequency or GHz bandwidths.
At the same time, the materials stack and fabrication flow are compatible with an eventual transition toward CMOS back-end integration. Moving from a global back gate to patterned top gates with thin high-k dielectrics or ferroelectric layers would reduce the required V G S by one to two orders of magnitude while preserving or even enhancing the useful hysteresis. The same CVD growth and contact scheme can, in principle, be adapted to BEOL-compatible thermal budgets, enabling local MoS 2 memtransistor arrays above conventional logic. On the architecture side, the single-node time-multiplexed reservoir demonstrated here can be extended to small multi-node ensembles, co-integrated with CMOS readout and training circuits, and exercised on a broader set of temporal tasks. In this sense, the current work should be viewed as a device-level benchmark that links CVD growth and hysteresis engineering to a standard RC task; future efforts will need to trade some of this simplicity for lower voltages, faster dynamics, and array-level integration to approach truly CMOS-compatible neuromorphic hardware.

5. Conclusions

We have demonstrated that back-gated memtransistors based on CVD-grown monolayer MoS 2 on 90 nm SiO 2 combine conventional transistor performance with sufficient hysteresis and fading memory to support chaotic time-series prediction via reservoir computing. The representative devices exhibit field-effect mobility on the order of 10   cm 2   V 1   s 1 , ON/OFF ratios above 10 5 , and a moderate hysteresis window quantified by gate-sweep loop areas H 2.1   μ A · V at V D S   =   50   mV and H 17   μ A · V at V D S   =   500   mV over V G S [ 10 , 30 ]   V . In a single-node reservoir configuration, direct (non-time-multiplexed) driving of the memtransistor by a Lorenz-63 waveform yields non-trivial one-step prediction with best NRMSE 0.46 , while time-multiplexed operation in a narrow high-transconductance window, combined with simple preprocessing and lagged states, improves the error to NRMSE 0.09 –0.10 with R 2 0.99 .
From a growth and device-engineering perspective, these results indicate that once a basic target is reached (mobility of order 10   cm 2   V 1   s 1 and ON/OFF above 10 4 10 5 ), further improvements in traditional FET figures of merit alone are unlikely to translate into proportional gains in reservoir performance. Instead, the shape and time scales of the hysteresis window, set by the trap spectrum in the MoS 2 /dielectric stack, become the key control parameters: the most useful operating points are those where V G S biases the device on the steep part of the transfer characteristic and where fading memory naturally spans the 1–4 s range probed by the time-multiplexed reservoir. In this sense, the Lorenz-63 task functions as a quantitative probe of trap-mediated dynamics rather than as an ultimate benchmark of prediction accuracy.
Device-to-device variability was quantified at both the DC and task level. Across the fabrication run, the hysteresis area spans nearly three orders of magnitude, indicating substantial dispersion in trap-mediated memory characteristics. For reservoir computing, measured devices, despite the broad hysteresis distribution, showed NRMSE scatters in the range from 0.58 to 0.61. The absolute NRMSE is comparatively high in this multi-device dataset because time multiplexing was not performed under the available measurement time.
With respect to CMOS compatibility, the present devices still operate at voltages well above typical logic levels because of the global back-gate geometry and thick SiO 2 dielectric. However, the material set and thermal budget are compatible with back-end-of-line integration, and the same CVD process can be combined with patterned top gates and thin high-k or ferroelectric dielectrics to reduce V G S by one to two orders of magnitude while preserving or enhancing the usable hysteresis. A practical path toward CMOS-compatible architectures is to reduce the operating voltages and increase reservoir dimensionality by device and array engineering rather than by complex mixed-signal circuitry. First, replacing the global back gate with a local top gate (thin high-k dielectric and short gate-to-channel spacing) directly increases C ox and enables voltage scaling, so that the same input mapping can be implemented with gate swings in the few-volt range. Second, geometrical scaling of the channel (shorter L and optimized W / L ) can reduce the required V DS and improve transconductance at lower bias. Third, array-style space multiplexing provides a scalable route to richer reservoir states under realistic measurement time. Multiple memtransistors fabricated on the same chip can be driven by a common V G S ( t ) waveform and read out as parallel nodes, converting device-to-device variability into state diversity and improving prediction accuracy without time multiplexing.
Together, voltage-scaled top-gated devices, modest geometric scaling, and space-multiplexed arrays define an experimentally accessible roadmap from the present back-gated proof of concept toward CMOS-constrained, wafer-scalable reservoir hardware. The open RC protocol and Lorenz-63 benchmark used here provide a reproducible framework against which such materials and device modifications can be evaluated in terms of both conventional transistor metrics and application-level reservoir computing performance.

Author Contributions

Conceptualization, V.K. and L.J.; methodology, V.K.; software, V.K. and L.J.; validation, V.K. and L.J.; formal analysis, V.K. and L.J.; investigation, V.K.; resources, J.P.; data curation, V.K.; writing—original draft preparation, V.K.; writing—review and editing, J.P.; visualization, V.K.; supervision, J.P.; project administration, J.P.; funding acquisition, J.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Carl Zeiss Foundation through the MemWerk® project (P2018-01-002). The work of V.K. was supported by the Thüringer Graduiertenförderung (AkZ 0051050-7150000163). L.J. is funded by the Carl Zeiss Foundation through the Nexus program.

Data Availability Statement

Data available in a publicly accessible repository: https://doi.org/10.5281/zenodo.16944410 and https://doi.org/10.5281/zenodo.16944485. Dataset is also available on request from the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Resulting test metrics for the performed measurements. The best result for each set of measurements is highlighted in bold.
Table A1. Resulting test metrics for the performed measurements. The best result for each set of measurements is highlighted in bold.
LabelTMN Δ t (s) TH (s)M V GS Range (V) V DS (V)SplitNRMSE R 2
ANo0.5(0,30)0.560/400.59970.6404
70/300.60050.6393
80/200.91360.1653
BNo2.0(0,30)3.060/400.61300.6242
70/300.63700.5942
80/200.97810.0433
CNo2.0(10,40)3.060/400.59190.6497
70/300.58930.6527
80/200.89130.2056
DNo0.1(10,40)3.060/400.47340.7759
70/300.46110.7874
80/200.60940.6287
EYes1120.1011.236(31,35)3.060/400.85470.2694
70/300.46870.7804
80/200.18500.9817
FYes160.101.6256(31,35)3.060/400.08840.9922
70/300.09940.9901
80/200.19610.9616
GYes160.254.0256(31,35)3.060/400.16030.9743
70/300.16700.9721
80/200.29700.9118
HYes320.103.2128(31,35)3.060/400.13150.9827
70/300.12200.9851
80/200.10180.9896

Appendix B

To provide an experimental estimate of the characteristic time scales associated with slow state relaxation in the MoS 2 memtransistor, we analyzed the drain-current decay recorded at constant bias V GS   =   25   V and V DS   =   0.5   V . The measured data is shown in Figure A1. The measured signal exhibits a gradual decrease of I D ( t ) over tens of seconds, consistent with bias-stress-induced relaxation that is commonly attributed to charge trapping/detrapping in the gate dielectric, at the MoS 2 /SiO 2 interface, and/or via adsorbate-mediated processes. Because the decay is not well captured by a single time constant over the full time window, we fit the relaxation using a bi-exponential model
y ( x )   =   A 1 exp x t 1 + A 2 exp x t 2 + y 0 ,
where x is time, y is the measured drain current I D , A 1 and A 2 are amplitudes, t 1 and t 2 are characteristic time constants, and y 0 is the long-time offset current.
Figure A1. CVD MoS 2 memtransistor relaxation trace measured at V GS   =   25   V and V DS   =   0.5   V . The circles on the graph represent the measured data, while the red line shows the bi-exponential model fit.
Figure A1. CVD MoS 2 memtransistor relaxation trace measured at V GS   =   25   V and V DS   =   0.5   V . The circles on the graph represent the measured data, while the red line shows the bi-exponential model fit.
Crystals 16 00116 g0a1
The fitted parameters are listed in the Table A2.
Table A2. Bi-exponential fit parameters for the relaxation trace measured at V GS   =   25   V and V DS   =   0.5   V using y ( x )   =   A 1 exp ( x / t 1 ) + A 2 exp ( x / t 2 ) + y 0 . Reported uncertainties correspond to the fit standard errors.
Table A2. Bi-exponential fit parameters for the relaxation trace measured at V GS   =   25   V and V DS   =   0.5   V using y ( x )   =   A 1 exp ( x / t 1 ) + A 2 exp ( x / t 2 ) + y 0 . Reported uncertainties correspond to the fit standard errors.
ParameterValueUnit
y 0 3.29 × 10 7 ± 1.48 × 10 9 A
A 1 7.24 × 10 8 ± 1.51 × 10 9 A
t 1 2.45 ± 0.10 s
A 2 1.21 × 10 7 ± 9.38 × 10 10 A
t 2 47.75 ± 1.38 s
The presence of two well-separated time constants suggests at least two dominant relaxation channels: a faster component on the order of a few seconds ( t 1 ) and a slower component on the order of 50   s ( t 2 ). In the context of reservoir computing, these relaxation processes set the time scales over which the device retains a measurable influence of past excitation. We emphasize that this constant-bias decay provides an effective estimate of the intrinsic relaxation dynamics under the selected operating point.

Appendix C

To quantify device-to-device variability beyond the representative device shown in the main text, we summarize key DC metrics extracted from the transfer loops measured at V DS   =   0.5 V for CVD MoS 2 memtransistors with L   =   2   μ m and W   =   10   μ m. Figure A2 reports the hysteresis area H, maximum transconductance g m , max , and maximum field-effect mobility μ FE , max across the available device set. In addition, we include the one-step Lorenz-63 prediction error (NRMSE) for the subset of devices for which a full time-series measurement was recorded under identical input mapping and a fixed 70/30 chronological split, using log 10 ( | I D | ) as the reservoir state. This appendix supports the discussion in the main text that, despite substantial dispersion in DC hysteresis, the measured one-step prediction performance exhibits comparatively low device-to-device scatter within the investigated subset.
Figure A2. Device-to-device variability of electrical figures of merit and Lorenz-63 prediction accuracy. (a) Hysteresis area H extracted from the bidirectional transfer loop ( V GS : 10 30 10 V) at V DS   =   0.5 V for CVD MoS 2 memtransistors with L   =   2   μ m and W   =   10   μ m. (b) Maximum transconductance g m , max obtained from the forward sweep by numerical differentiation. (c) Field-effect mobility μ FE , max calculated from g m , max using the linear-regime relation. (d) One-step-ahead Lorenz-63 prediction error (NRMSE) for devices measured under identical input mapping and readout protocol (chronological 70/30 split), using log 10 ( | I D | ) as the reservoir state.
Figure A2. Device-to-device variability of electrical figures of merit and Lorenz-63 prediction accuracy. (a) Hysteresis area H extracted from the bidirectional transfer loop ( V GS : 10 30 10 V) at V DS   =   0.5 V for CVD MoS 2 memtransistors with L   =   2   μ m and W   =   10   μ m. (b) Maximum transconductance g m , max obtained from the forward sweep by numerical differentiation. (c) Field-effect mobility μ FE , max calculated from g m , max using the linear-regime relation. (d) One-step-ahead Lorenz-63 prediction error (NRMSE) for devices measured under identical input mapping and readout protocol (chronological 70/30 split), using log 10 ( | I D | ) as the reservoir state.
Crystals 16 00116 g0a2

Appendix D

As shown in Figure A3a,b, the device architecture and fabricated layout are illustrated by a schematic cross-section and representative SEM micrographs of the patterned memtransistor structures.
Figure A3. Device architecture and fabrication overview. (a) Schematic illustration of the back-gated monolayer MoS 2 memtransistor on Si/SiO 2 with source/drain contacts. (b) Representative SEM image of a fabricated device showing the contact geometry and the lithographically defined channel region.
Figure A3. Device architecture and fabrication overview. (a) Schematic illustration of the back-gated monolayer MoS 2 memtransistor on Si/SiO 2 with source/drain contacts. (b) Representative SEM image of a fabricated device showing the contact geometry and the lithographically defined channel region.
Crystals 16 00116 g0a3

References

  1. Mak, K.F.; Lee, C.; Hone, J.; Shan, J.; Heinz, T.F. Atomically Thin MoS2: A New Direct-Gap Semiconductor. Phys. Rev. Lett. 2010, 105, 136805. [Google Scholar] [CrossRef]
  2. Splendiani, A.; Sun, L.; Zhang, Y.; Li, T.; Kim, J.; Chim, C.Y.; Galli, G.; Wang, F. Emerging Photoluminescence in Monolayer MoS2. Nano Lett. 2010, 10, 1271–1275. [Google Scholar] [CrossRef]
  3. Radisavljevic, B.; Radenovic, A.; Brivio, J.; Giacometti, V.; Kis, A. Single-Layer MoS2 Transistors. Nat. Nanotechnol. 2011, 6, 147–150. [Google Scholar] [CrossRef]
  4. Wang, Q.H.; Kalantar-Zadeh, K.; Kis, A.; Coleman, J.N.; Strano, M.S. Electronics and Optoelectronics of Two-Dimensional Transition Metal Dichalcogenides. Nat. Nanotechnol. 2012, 7, 699–712. [Google Scholar] [CrossRef]
  5. Fu, D.; Zhao, X.; Zhang, Y.; Li, L.; Xu, H.; Jang, A.-R.; Yoon, S.I.; Song, P.; Poh, S.M.; Ren, T.; et al. Molecular Beam Epitaxy of Highly Crystalline Monolayer Molybdenum Disulfide on Hexagonal Boron Nitride. J. Am. Chem. Soc. 2017, 139, 9392–9400. [Google Scholar] [CrossRef]
  6. Zhan, L.; Wan, W.; Zhu, Z.; Shih, Z.-M.; Cai, W. MoS2 materials synthesized on SiO2/Si substrates via MBE. J. Phys. Conf. Ser. 2017, 864, 012037. [Google Scholar] [CrossRef]
  7. Kim, D.H.; Park, J.C.; Park, J.; Cho, D.-Y.; Kim, W.-H.; Shong, B.; Ahn, J.-H.; Park, T. Wafer-Scale Growth of a MoS2 Monolayer via One Cycle of Atomic Layer Deposition: An Adsorbate Control Method. Chem. Mater. 2021, 33, 4099–4105. [Google Scholar] [CrossRef]
  8. Gonzalez, M.A.; Pareek, D.; Büsing, L.; Parisi, J.; Schäfer, S.; Gütay, L. Rapid formation of large-area MoS2 monolayers by a parameter resilient atomic layer deposition approach. APL Mater. 2021, 9, 051122. [Google Scholar] [CrossRef]
  9. Robertson, J.; Liu, X.; Yue, C.; Escarra, M.; Wei, J. Wafer-scale synthesis of monolayer and few-layer MoS2 via thermal vapor sulfurization. 2D Mater. 2017, 4, 045007. [Google Scholar] [CrossRef]
  10. Tao, J.; Chai, J.; Lu, X.; Wong, L.-M.; Wong, T.I.; Pan, J.; Xiong, Q.; Chi, D.; Wang, N. Growth of wafer-scale MoS2 monolayer by magnetron sputtering. Nanoscale 2015, 7, 2497–2503. [Google Scholar] [CrossRef]
  11. Tumino, F.; Casari, C.S.; Passoni, M.; Russo, V.; Bassi, A.L. Pulsed laser deposition of single-layer MoS2 on Au(111): From nanosized crystals to large-area films. Nanoscale Adv. 2019, 1, 643–655. [Google Scholar] [CrossRef]
  12. Midhun, P.S.; Kumar, K.R.; Jayaraj, M.K. Large area synthesis of mono/few-layer MoS2 thin films on thermal oxide silicon substrate by pulsed laser deposition technique. Thin Solid Films 2023, 782, 140030. [Google Scholar] [CrossRef]
  13. Zhang, Y.; Xu, L.; Walker, W.R.; Tittle, C.M.; Blackhouse, J.; Pope, M.A. Wafer-scale growth of monolayer MoS2 film by using chemical vapor deposition. J. Mater. Chem. C 2017, 5, 11275–11287. [Google Scholar] [CrossRef]
  14. Franco, M.; Kiazadeh, A.; Martins, R.; Lanceros-Mendez, S.; Carlos, E. Printed Memristors: An Overview of Ink, Materials, Deposition Techniques, and Applications. Adv. Electron. Mater. 2024, 10, 2400212. [Google Scholar] [CrossRef]
  15. Kurtash, V.; Eliseyev, I.; Davydov, V.; Mathew, S.; Thiele, S.; Reiprich, J.; Jacobs, H.O.; Pezoldt, J. Monolayer MoS2 CVD-growth on SiC substrates assisted with KCl. Appl. Surf. Sci. 2025, 681, 161373. [Google Scholar] [CrossRef]
  16. Guo, Y.; Vashishta, D.; Singh, A.K.; Rasool, P.K.; Pu, Y.; Shi, Y.; Ostrikov, K.K.; Xu, H. Charge Trapping at the MoS2–SiO2 Interface and Its Effects on the Characteristics of MoS2 Field-Effect Transistors. Appl. Phys. Lett. 2015, 106, 103109. [Google Scholar] [CrossRef]
  17. Sangwan, V.K.; Jariwala, D.; Kim, I.S.; Chen, K.S.; Marks, T.J.; Lauhon, L.J.; Hersam, M.C. Gate-Tunable Memristive Phenomena Mediated by Grain Boundaries in Single-Layer MoS2. Nat. Nanotechnol. 2015, 10, 403–406. [Google Scholar] [CrossRef]
  18. Kurtash, V.; Jacobs, H.O.; Pezoldt, J. Energy-Efficient Operation Conditions of MoS2-Based Memristors. Phys. Status Solidi A 2023, 220, 2200893. [Google Scholar] [CrossRef]
  19. Tanaka, G.; Yamane, T.; Hélie, J.P.; Nakane, R.; Tanaka, N.; Takeda, H.; Takano, H.; Ikeda, Y.; Nakano, T.; Hirose, A. Recent Advances in Physical Reservoir Computing: A Review. Neural Netw. 2019, 115, 100–123. [Google Scholar] [CrossRef]
  20. Gauthier, D.J.; Bollt, E.; Griffith, A.; Barbosa, W.A.S. Next Generation Reservoir Computing. Nat. Commun. 2021, 12, 5564. [Google Scholar] [CrossRef]
  21. Jaurigue, L. Chaotic attractor reconstruction using small reservoirs—The influence of topology. Mach. Learn. Sci. Technol. 2024, 5, 035058. [Google Scholar] [CrossRef]
  22. Jaurigue, L.; Lüdge, K. Utilizing rate-independent hysteresis for analog computing. Neuromorphic Comput. Eng. 2025, 5, 044007. [Google Scholar] [CrossRef]
  23. Rehman, M.M.; Samad, Y.A.; Gul, J.Z.; Saqib, M.; Khan, M.; Shaukat, R.A.; Chang, R.; Shi, Y.; Kim, W.Y. 2D materials-memristive devices nexus: From status quo to Impending applications. Prog. Mater. Sci. 2025, 152, 101471. [Google Scholar] [CrossRef]
  24. Moon, J.; Ma, W.; Shin, J.H.; Cai, F.; Du, C.; Lee, S.H.; Lu, W.; Lu, W.D. Temporal data classification and forecasting using a memristor-based reservoir computing system. Nat. Electron. 2019, 2, 480–487. [Google Scholar] [CrossRef]
  25. Xie, J.; Liu, B.; Liu, X. Delay-embedded reservoir computing with single memristor for scale-efficient temporal signal processing. Chaos Solitons Fractals 2026, 205, 117842. [Google Scholar] [CrossRef]
  26. Wang, A.-H.; Fan, X.; Zhang, Z.; Kapur, O.; Huang, R.; Simanjuntak, F.; Chong, H.M.H.; Thomas, D.B. Temporal dynamics of TiOx memristor for reservoir computing applications. J. Phys. D Appl. Phys. 2026, 59, 015101. [Google Scholar] [CrossRef]
  27. Zhong, Y.; Tang, J.; Li, X.; Gao, B.; Qian, H.; Wu, H. Dynamic memristor-based reservoir computing for high-efficiency temporal signal processing. Nat. Commun. 2021, 12, 408. [Google Scholar] [CrossRef]
  28. Singh, A.; Choi, S.; Wang, G.; Daimari, M.; Lee, B.-G. Analysis and fully memristor-based reservoir computing for temporal data classification. Neural Netw. 2025, 182, 106925. [Google Scholar] [CrossRef]
  29. Shi, T.; Gao, L.; Tang, S.; Liu, J.; Li, Y.; Zhou, R.; Cui, S.; Zhang, H.; Wu, Z.; Zhang, X.; et al. Memristor-based feature learning for pattern classification. Nat. Commun. 2025, 16, 913. [Google Scholar] [CrossRef]
  30. Park, S.; Koo, M.; Kim, S. Advancing phase-change random access memory: Materials innovation neuromorphic applications, and scalability challenges. J. Phys. D Appl. Phys. 2026, 59, 013001. [Google Scholar] [CrossRef]
  31. Syed, G.S.; Le Gallo, M.; Sebastian, A. Phase-Change Memory for In-Memory Computing. Chem. Rev. 2025, 125, 5163–5194. [Google Scholar] [CrossRef]
  32. Cucchi, M.; Gruener, C.; Petrauskas, L.; Steiner, P.; Tseng, H.; Fischer, A.; Penkovsky, B.; Matthus, C.; Birkholz, P.; Kleemann, H.; et al. Reservoir computing with biocompatible organic electrochemical networks for brain-inspired biosignal classification. Sci. Adv. 2021, 7, eabh0693. [Google Scholar] [CrossRef]
  33. Shao, L.; Zhao, Y.; Liu, Y. Organic synaptic transistors: The evolutionary path from memory cells to the application of artificial neural networks. Adv. Funct. Mater. 2021, 31, 2101951. [Google Scholar] [CrossRef]
  34. Zhao, Y.; Duan, W.; Wang, C.; Xiao, S.; Li, Y.; Li, Y.; An, J.; Li, H. LiNbO3 dynamic memristors for reservoir computing in neuromorphic engineering. Front. Neurosci. 2023, 17, 1177118. [Google Scholar] [CrossRef]
  35. Lee, Y.; Lee, S. Artificial Intelligence and Machine Learning for Memristor Modeling and Neuromorphic Computing. Nanomaterials 2024, 14, 1884. [Google Scholar] [CrossRef]
  36. Wan, X.; Yuan, Q.; Sun, L.; Chen, K.; Khim, D.; Luo, Z. Reservoir Computing Enabled by Polymer Electrolyte-Gated MoS2 Transistors for Time-Series Processing. Polymers 2025, 17, 1178. [Google Scholar] [CrossRef]
  37. Farronato, M.; Seri, M.; Titova, L.V.; Sangiovanni, D.; Oriani, M.; Bonaccorso, F.; Pecchia, A.; Frabboni, S.; Brivio, S.; Molle, A. Charge-Trap MoS2 Memories as Reservoir Computers. Adv. Mater. 2023, 35, 2205381. [Google Scholar] [CrossRef]
  38. Lorenz, E.N. Deterministic Nonperiodic Flow. J. Atmos. Sci. 1963, 20, 130–141. [Google Scholar] [CrossRef]
  39. Kurtash, V. Reservoir Computing Using CVD-MoS2 Memtransistor. Zenodo: 2025; Data Set. Available online: https://zenodo.org/records/16944410 (accessed on 16 December 2025).
  40. Kurtash, V. Reservoir Computing Using CVD-MoS2 Memtransistor Python Script, Version 1.0; Zenodo: 2025; Software; Available online: https://zenodo.org/records/16944485 (accessed on 16 December 2025).
  41. Takens, F. Detecting strange attractors in turbulence. In Dynamical Systems and Turbulence, Warwick 1980; Rand, D.A., Young, L.-S., Eds.; Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany, 1981; Volume 898, pp. 366–381. [Google Scholar] [CrossRef]
  42. Storm, L.; Gustavsson, K.; Mehlig, B. Constraints on parameter choices for successful time-series prediction with echo-state networks. Mach. Learn. Sci. Technol. 2022, 3, 045021. [Google Scholar] [CrossRef]
  43. Lee, C.; Yan, H.; Brus, L.E.; Heinz, T.F.; Hone, J.; Ryu, S. Anomalous Lattice Vibrations of Single- and Few-Layer MoS2. ACS Nano 2010, 4, 2695–2700. [Google Scholar] [CrossRef]
  44. Barbosa, W.A.S.; Griffith, A.; Riechers, K.; Bollt, E.M.; Gauthier, D.J. Symmetry-Aware Reservoir Computing. Phys. Rev. E 2021, 104, 045307. [Google Scholar] [CrossRef]
  45. Viehweg, J.; Worthmann, K.; Mäder, P. Parameterizing echo state networks for multi-step time series prediction. Neurocomputing 2023, 522, 214–228. [Google Scholar] [CrossRef]
  46. Jaurigue, L.; Robertson, E.; Wolters, J.; Lüdge, K. Reservoir Computing with Delayed Input for Fast and Easy Optimisation. Entropy 2021, 23, 1560. [Google Scholar] [CrossRef]
  47. Cox, N.; Murray, J.; Hart, J.; Redding, B. Photonic Implementation of Next-Generation Reservoir Computing. Chaos 2024, 34, 073111. [Google Scholar] [CrossRef]
  48. Wang, H.; Liu, Y.; Zhang, L.; Li, W.; Chen, X. Optical Next-Generation Reservoir Computing. Light Sci. Appl. 2025, 14, 12. [Google Scholar] [CrossRef]
  49. Wu, Y.; Duong, N.T.; Chien, Y.C.; Liu, S.; Ang, K.W. A Dynamic Memory for Reservoir Computing Utilizing Ion Migration in CuInP2S6. Adv. Electron. Mater. 2024, 10, 2300481. [Google Scholar] [CrossRef]
  50. Duong, N.T.; Chien, Y.C.; Xiang, H.; Li, S.; Zheng, H.; Shi, Y.; Ang, K. Dynamic Ferroelectric Transistor-Based Reservoir Computing for Spatiotemporal Information Processing. Adv. Intell. Syst. 2023, 5, 2300009. [Google Scholar] [CrossRef]
Figure 1. (a) MoS 2 CVD chamber schematics. (b) Measured temperature profiles for S and MoO 3 precursors.
Figure 1. (a) MoS 2 CVD chamber schematics. (b) Measured temperature profiles for S and MoO 3 precursors.
Crystals 16 00116 g001
Figure 2. Lorenz-63 chaotic system used for reservoir computing. (a) Three-dimensional Lorenz-63 trajectory integrated at the classical parameters σ   =   10 , ρ   =   28 , β   =   8 / 3 . (b) Lorenz-63 chaotic system X value plotted versus time.
Figure 2. Lorenz-63 chaotic system used for reservoir computing. (a) Three-dimensional Lorenz-63 trajectory integrated at the classical parameters σ   =   10 , ρ   =   28 , β   =   8 / 3 . (b) Lorenz-63 chaotic system X value plotted versus time.
Crystals 16 00116 g002
Figure 3. Crystal growth and morphology of CVD-grown MoS 2 . (a) Optical micrograph of triangular MoS 2 domains on p + -Si/SiO 2 (90 nm), showing typical lateral sizes of ∼ 50   μ m and homogeneous coverage in the 10   μ m   ×   2   μ m channel area. (b) Tapping-mode AFM height image of a representative monolayer flake. (c) AFM height profile across the MoS 2 /SiO 2 edge, showing a step height of 0.88 nm consistent with monolayer MoS 2 . (d) Raman spectrum measured with 532 nm excitation and 0.5 mW power, together with single-Gaussian fits to the E 2 g 1 and A 1 g modes. (e) Room-temperature PL spectrum with a three-Gaussian deconvolution into A-trion, A-exciton, and B-exciton contributions.
Figure 3. Crystal growth and morphology of CVD-grown MoS 2 . (a) Optical micrograph of triangular MoS 2 domains on p + -Si/SiO 2 (90 nm), showing typical lateral sizes of ∼ 50   μ m and homogeneous coverage in the 10   μ m   ×   2   μ m channel area. (b) Tapping-mode AFM height image of a representative monolayer flake. (c) AFM height profile across the MoS 2 /SiO 2 edge, showing a step height of 0.88 nm consistent with monolayer MoS 2 . (d) Raman spectrum measured with 532 nm excitation and 0.5 mW power, together with single-Gaussian fits to the E 2 g 1 and A 1 g modes. (e) Room-temperature PL spectrum with a three-Gaussian deconvolution into A-trion, A-exciton, and B-exciton contributions.
Crystals 16 00116 g003
Figure 4. Electrical characteristics of a representative CVD-grown MoS 2 memtransistor. (a) Measured transfer characteristics for different drain-to-source voltage V D S , demonstrating hysteresis loop area H = 2.1 μA·V for V D S = 50 mV and H = 17 μA·V for V D S = 500 mV. (b) Dependence of the high-resistance state to the low-resistance state on the applied gate-to-source voltage V G S . (c) Measured output characteristics for different gate-to-source voltage V G S . (d) Measured output characteristics for different gate-to-source voltage V G S , demonstrating hysteresis loop area H = 61 μA·V for V G S = 20 V, H = 0.9 mA·V for V G S = 30 V, and H = 1.8 mA·V for V G S = 40 V.
Figure 4. Electrical characteristics of a representative CVD-grown MoS 2 memtransistor. (a) Measured transfer characteristics for different drain-to-source voltage V D S , demonstrating hysteresis loop area H = 2.1 μA·V for V D S = 50 mV and H = 17 μA·V for V D S = 500 mV. (b) Dependence of the high-resistance state to the low-resistance state on the applied gate-to-source voltage V G S . (c) Measured output characteristics for different gate-to-source voltage V G S . (d) Measured output characteristics for different gate-to-source voltage V G S , demonstrating hysteresis loop area H = 61 μA·V for V G S = 20 V, H = 0.9 mA·V for V G S = 30 V, and H = 1.8 mA·V for V G S = 40 V.
Crystals 16 00116 g004
Figure 5. Direct (no time-multiplexing mask) Lorenz driving of the MoS 2 memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (ad) show the applied V GS (left axis) obtained by affine mapping of Lorenz X ( t ) and the measured drain current I D (right axis). Sampling step is Δ t = 0.5 s for panels (a,b), 2 s for panel (c), and 0.1 s for panel (d). The device operates in a nonlinear, history-dependent regime due to charge trapping, providing the short-term memory exploited by the readout.
Figure 5. Direct (no time-multiplexing mask) Lorenz driving of the MoS 2 memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (ad) show the applied V GS (left axis) obtained by affine mapping of Lorenz X ( t ) and the measured drain current I D (right axis). Sampling step is Δ t = 0.5 s for panels (a,b), 2 s for panel (c), and 0.1 s for panel (d). The device operates in a nonlinear, history-dependent regime due to charge trapping, providing the short-term memory exploited by the readout.
Crystals 16 00116 g005
Figure 6. Time-multiplexed reservoir operation of a CVD monolayer MoS 2 memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (ad) show representative traces of the masked gate-to-source voltage V G S ( t ) (left axis) and the corresponding drain current I D S ( t ) (right axis) for the time-multiplexed regimes E–H listed in Table A1. In each case, the Lorenz-63 input is mapped to the gate window V G S ( 31 ,   35 )   V and multiplied by a fixed Rademacher mask to generate N sub-steps of duration Δ t . The dense traces illustrate the full measurement over many macro-steps, while the insets zoom into short time intervals around selected segments of the waveform, highlighting synaptic-plasticity-like behavior.
Figure 6. Time-multiplexed reservoir operation of a CVD monolayer MoS 2 memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (ad) show representative traces of the masked gate-to-source voltage V G S ( t ) (left axis) and the corresponding drain current I D S ( t ) (right axis) for the time-multiplexed regimes E–H listed in Table A1. In each case, the Lorenz-63 input is mapped to the gate window V G S ( 31 ,   35 )   V and multiplied by a fixed Rademacher mask to generate N sub-steps of duration Δ t . The dense traces illustrate the full measurement over many macro-steps, while the insets zoom into short time intervals around selected segments of the waveform, highlighting synaptic-plasticity-like behavior.
Crystals 16 00116 g006
Figure 7. Training and test trajectories for Lorenz-63 prediction across all bias and time-multiplexing regimes. Each panel shows the ground-truth Lorenz-63 X signal (black—expected value) and the memtransistor-based reservoir prediction (colored) for a representative training segment (left part of the trace) and the corresponding held-out test segment (right part), using a 70/30 split. Panels (ad) correspond to the non-time-multiplexed regimes A–D, while panels (eh) correspond to the time-multiplexed regimes E–H.
Figure 7. Training and test trajectories for Lorenz-63 prediction across all bias and time-multiplexing regimes. Each panel shows the ground-truth Lorenz-63 X signal (black—expected value) and the memtransistor-based reservoir prediction (colored) for a representative training segment (left part of the trace) and the corresponding held-out test segment (right part), using a 70/30 split. Panels (ad) correspond to the non-time-multiplexed regimes A–D, while panels (eh) correspond to the time-multiplexed regimes E–H.
Crystals 16 00116 g007
Figure 8. Summary of predictive performance: R 2 versus NRMSE. Each point is one configuration (A–H). Shaded regions indicate qualitative regimes: green is high-fidelity prediction, light-green is usable prediction, yellow is limited usable, and red is underfit or poorly separable.
Figure 8. Summary of predictive performance: R 2 versus NRMSE. Each point is one configuration (A–H). Shaded regions indicate qualitative regimes: green is high-fidelity prediction, light-green is usable prediction, yellow is limited usable, and red is underfit or poorly separable.
Crystals 16 00116 g008
Table 1. Summary of Raman and PL parameters for CVD-grown monolayer MoS 2 . Raman peak positions and widths are obtained from single-Gaussian fits; PL parameters are extracted from a three-Gaussian fit to the A-trion, A-exciton, and B-exciton contributions.
Table 1. Summary of Raman and PL parameters for CVD-grown monolayer MoS 2 . Raman peak positions and widths are obtained from single-Gaussian fits; PL parameters are extracted from a three-Gaussian fit to the A-trion, A-exciton, and B-exciton contributions.
MeasurementMode/PeakPositionFWHMRelative Intensity
Raman E 2 g 1 385.0 cm 1 3.7 cm 1 I / I E 2 GS 1   =   1
Raman A 1 g 402.6 cm 1 7.4 cm 1 I / I E 2 GS 1 1.05
Raman Δ ω   =   ω A 1 ω E 2 GS 1 17.5 cm 1
PLA trion1.80 eV0.12 eV I / I A 0.81
PLA exciton1.84 eV0.09 eV I / I A   =   1
PLB exciton1.96 eV0.12 eV I / I A 0.29
Table 2. Experimental hardware reservoirs that run the Lorenz system and reported (or directly provided) normalized RMSE (NRMSE) using standard deviation σ of the ground truth for normalization; n.r. stands for “not reported”.
Table 2. Experimental hardware reservoirs that run the Lorenz system and reported (or directly provided) normalized RMSE (NRMSE) using standard deviation σ of the ground truth for normalization; n.r. stands for “not reported”.
Platform/MaterialNRMSETypeLorenz Axes (Input → Target) Δ t (Time Units)Ref.
Next-Generation RC (Lorenz-63 short-term forecasting)0.002Simulation x , y , z x , y , z n.r.[20]
Symmetry-aware RC (variable inference)0.0005Simulation x , y z 0.0001[44]
Echo state network (multi-step autoregressive prediction)0.0005Simulation x , y , z x , y , z 0.01[45]
Reservoir RC with delayed input0.003Simulation x x 0.02[46]
Chaotic attractor reconstruction with RC0.01Simulation x , y , z x , y , z n.r.[21]
Fiber-optic photonic NGRC0.022Experiment x y , z n.r.[47]
Optical NGRC (laser and SPL)0.0169Experiment x , y , z x , y , z n.r.[48]
CuInP2S6 (2-terminal memristor)0.014Experiment x , y z n.r.[49]
MoS 2 memtransistor RC (polymer electrolyte-gated)0.04Experiment x , y , z x , y , z 0.01[36]
MoS 2 memtransistor RC (back-gated SiO 2 )0.089Experiment x x 0.05–0.2This work
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kurtash, V.; Jaurigue, L.; Pezoldt, J. CVD Monolayer MoS2 Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing. Crystals 2026, 16, 116. https://doi.org/10.3390/cryst16020116

AMA Style

Kurtash V, Jaurigue L, Pezoldt J. CVD Monolayer MoS2 Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing. Crystals. 2026; 16(2):116. https://doi.org/10.3390/cryst16020116

Chicago/Turabian Style

Kurtash, Vladislav, Lina Jaurigue, and Jörg Pezoldt. 2026. "CVD Monolayer MoS2 Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing" Crystals 16, no. 2: 116. https://doi.org/10.3390/cryst16020116

APA Style

Kurtash, V., Jaurigue, L., & Pezoldt, J. (2026). CVD Monolayer MoS2 Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing. Crystals, 16(2), 116. https://doi.org/10.3390/cryst16020116

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop