CVD Monolayer MoS2 Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing

Kurtash, Vladislav; Jaurigue, Lina; Pezoldt, Jörg

doi:10.3390/cryst16020116

Open AccessArticle

CVD Monolayer MoS₂ Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing

by

Vladislav Kurtash

^1,*

,

Lina Jaurigue

² and

Jörg Pezoldt

^1,*

¹

FG Nanotechnologie, Institut für Mikro- und Nanoelektronik, Institut für Mikro- und Nanotechnologien MacroNano^®, Institut für Werkstofftechnik, TU Ilmenau, Postfach 100565, 98684 Ilmenau, Germany

²

FG Theoretische Physik 2, Institut für Physik, Institut für Mikro- und Nanotechnologien MacroNano^®, TU Ilmenau, Postfach 100565, 98684 Ilmenau, Germany

^*

Authors to whom correspondence should be addressed.

Crystals 2026, 16(2), 116; https://doi.org/10.3390/cryst16020116

Submission received: 16 December 2025 / Revised: 26 January 2026 / Accepted: 30 January 2026 / Published: 5 February 2026

(This article belongs to the Section Inorganic Crystalline Materials)

Download

Browse Figures

Versions Notes

Abstract

Monolayer

{MoS}_{2}

memtransistors offer gate-tunable hysteresis for neuromorphic reservoir computing, yet the role of operating window and fading-memory dynamics in CVD devices remains underexplored. We grow CVD monolayer

{MoS}_{2}

, fabricate back-gated memtransistors, and use a single device as a time-multiplexed reservoir node for one-step Lorenz-63 prediction. Mobility, ON/OFF, hysteresis, and drift are quantified to identify stable, tunable bias regimes. We used a transistor with field-effect mobility on the order of

10 {cm}^{2} V^{- 1} s^{- 1}

, an ON/OFF ratio above

10^{5}

, and a moderate hysteresis window quantified by

H \approx 2.1 μ

A·V at

V_{DS} = 50

mV and

H \approx 17 μ

A·V at

V_{DS} = 500

mV over

V_{GS} \in [- 10, 30]

V. Performance is bias/memory-limited rather than FET-metric-limited. Sweeping gate-window and reservoir hyperparameters shows an optimum at intermediate hysteresis with moderate drift. Performance improves when the input clock matches the fading-memory time, achieving normalized root mean square error (NRMSE) = 0.09 for one-step Lorenz-63 x-prediction. Device-level statistics (discussed in the main text) show that, despite substantial scattering in electrical parameters, the resulting device-to-device NRMSE variation remains very small under fixed operating conditions. Classical FET metrics are not limiting here; NRMSE improvement instead requires engineering the hysteresis spectrum and gate stack. The demonstration of Lorenz-63 prediction using CVD-grown monolayer

{MoS}_{2}

memtransistors highlights their potential as a wafer-scalable platform for compact chaotic time-series predictions.

Keywords:

MoS₂; memtransistor; memristor; reservoir computing; neuromorphic computing; hysteresis; CVD

1. Introduction

Two-dimensional (2D) transition metal dichalcogenides, and in particular monolayer

{MoS}_{2}

, offer a direct band gap, atomically sharp interfaces, and excellent electrostatic control at the ultimate thickness limit, which makes them attractive candidates to extend device scaling beyond silicon [1,2,3,4]. Chemical vapor deposition (CVD) enables wafer-scale synthesis of monolayer

{MoS}_{2}

and integration into circuits. Alternative synthesis routes include molecular beam epitaxy [5,6], atomic layer deposition [7,8], sulfurization of Mo layers [9], reactive sputtering [10], pulsed laser deposition [11,12], Langmuir–Blodgett coating [13], and printing [14]. For CVD, the resulting films often contain grain boundaries, point defects, and substrate-induced disorder that alter transport and introduce pronounced hysteresis in field-effect characteristics [4]. Recent work has shown that growth recipes, nucleation promoters, and substrate choice strongly influence crystalline quality and defect landscapes in CVD

{MoS}_{2}

[15], yet the consequences for non-conventional computing schemes are still only partially quantified.

In conventional MOSFET logic, threshold-voltage drift and hysteresis caused by charge trapping at the

{MoS}_{2} {/SiO}_{2}

interface, oxide traps, and adsorbates are treated as reliability problems that degrade noise margins and bias stability [16]. For CVD

{MoS}_{2}

, several studies attribute the dominant contribution to traps at grain boundaries and at the semiconductor–oxide interface, consistent with the intrinsic-oxide-trap picture developed for few-layer and thin-film devices [16,17]. The same history dependence that limits digital performance can, however, be harnessed in memristive operation: gate-tunable

{MoS}_{2}

memtransistors exploit hysteresis to implement non-volatile states, analog weight updates, and rate-dependent responses that are useful for neuromorphic computing [18]. In previous work we showed that hysteresis in CVD

{MoS}_{2}

memristors can be quantified via an energy-normalized hysteresis area and that energy-efficient operation windows emerge from an interplay of trap dynamics, bias range, and device geometry [18].

Neuromorphic computing aims to process information using distributed nonlinear dynamics and local memory rather than explicit instruction sequences. Reservoir computing (RC) is a particularly hardware-friendly framework in which a nonlinear dynamical system (the reservoir) is driven by an input signal and only a linear readout is trained. The reservoir may be implemented in software using recurrent neural networks, or physically using optical, electronic, or mechanical systems with suitable nonlinearity and fading memory. For chaotic time-series prediction, such as the Lorenz-63 system, RC can reconstruct attractors and achieve long prediction horizons when reservoir dynamics are matched to task timescales and remain stable but sufficiently rich [19,20]. Recent works show that even small reservoirs can reconstruct chaotic attractors reliably and that topology and spectral properties strongly influence long-term prediction performance [21,22].

Physical reservoirs based on memristive devices promise compact and low-power implementations [19,23]. Numerous material systems have been explored for chaotic prediction or classification, including oxide-based memristors [24,25,26,27,28,29], phase-change devices [30,31], organic synaptic transistors [32,33], and

{LiNbO}_{3}

-based dynamic reservoirs [34,35] for chaotic prediction or classification tasks. At the same time, 2D-material-based memtransistors are emerging as attractive candidates due to their scalable growth, tunable interfaces, and compatibility with back-end-of-line thermal budgets [23]. Yet most demonstrations of

{MoS}_{2}

-based neuromorphic devices either rely on exfoliated or transferred flakes [36], or focus on synaptic plasticity and classification tasks without a quantitative link between growth-induced defects, device hysteresis metrics, and system-level RC performance [37].

Device-focused studies on

{MoS}_{2}

typically report field-effect mobility, subthreshold slope, ON/OFF ratio, and a qualitative measure of hysteresis, but they rarely specify temporal response, fading-memory time constants, or stable operating windows needed to embed these devices into concrete neuromorphic or RC architectures. Conversely, system-level neuromorphic and machine-learning works usually prescribe abstract requirements such as nonlinearity, state dimensionality, and memory depth, without translating them into constraints on current densities, gate-voltage ranges, or switching speeds that a fabrication process can realistically deliver. Chaotic systems are most often treated in dimensionless units, leaving open how to map state variables to physical voltages and currents for a given device technology. As a result, it is difficult to answer seemingly simple questions such as whether a given

{MoS}_{2}

memtransistor, grown under a particular CVD recipe, is suitable as a reservoir node for a specified task and under which bias conditions.

In this work we address part of this gap by linking CVD growth, device-level hysteresis, and reservoir computing performance in a single, experimentally realized platform. We grow monolayer

{MoS}_{2}

by atmospheric-pressure CVD on

p^{+} {-Si/SiO}_{2}

, fabricate back-gated memtransistors, and use a single device as a nonlinear elements in a time-multiplexed reservoir that performs one-step prediction of the Lorenz-63 x-component. We first characterize the crystalline quality of the CVD

{MoS}_{2}

using atomic force microscopy (AFM), Raman spectroscopy, and photoluminescence (PL), confirming monolayer thickness. We then extract key device figures of merit from transfer and output characteristics. Finally, we implement a Lorenz-63 RC pipeline in which the memtransistor is driven by a masked, time-multiplexed input sequence, and a linear readout is trained by ridge regression to predict the next time step. By systematically varying the gate-voltage window, input scaling, and number of virtual nodes, we map NRMSE for Lorenz-63 prediction to experimentally accessible device parameters.

We find that intermediate hysteresis and moderate drift, corresponding to a specific range of gate biases and input amplitudes, maximize state richness and yield-normalized root mean square errors down to normalized root mean square error

NRMSE \approx 0.09

for one-step prediction of the Lorenz-63 x-component. Lower hysteresis reduces nonlinearity, while excessive hysteresis or strong drift degrades memory and leads to unstable trajectories. These results provide a concrete example of how CVD-grown monolayer

{MoS}_{2}

memtransistors can be engineered and biased to serve as physical reservoir nodes for chaotic time-series prediction. These results provide a concrete example of how CVD-grown monolayer

{MoS}_{2}

memtransistors can be engineered and biased to serve as physical reservoir nodes for chaotic time-series prediction: despite only moderate transistor figures of merit (field-effect mobility of order

10 {cm}^{2} V^{- 1} s^{- 1}

and ON/OFF

> 10^{5}

), trap-mediated fading memory on 1–4 s time scales enables Lorenz-63 one-step prediction down to NRMSE

\approx 0.09

under time-multiplexed operation (baseline NRMSE

\approx 0.46

without multiplexing). Because the platform relies on atmospheric-pressure CVD and a standard solid-state gate stack rather than transferred flakes or electrolyte gating, it is compatible with wafer-scale replication and array-style space multiplexing, which is essential for scalable and cost-effective neuromorphic hardware.

2. Materials and Methods

2.1. CVD Growth of Monolayer ${MoS}_{2}$

Monolayer

{MoS}_{2}

was grown by chemical vapor deposition (CVD) in a horizontal two-zone quartz-tube reactor operated at atmospheric pressure with nitrogen (

N_{2}

) as the carrier gas as demonstrated in Figure 1a. Degenerately doped

p^{+} {-Si/SiO}_{2}

substrates with a 90 nm thermally grown

{SiO}_{2}

layer served as both the growth substrate and the global back gate in the final devices.

Prior to growth, the substrates underwent a standard solvent cleaning (acetone, isopropanol, isopropanol in ultrasonic bath, and deionized water), followed by drying with

N_{2}

. The substrates were then treated in an

O_{2}

plasma at 300 W for 10 min to remove residual organic contamination and activate the

{SiO}_{2}

surface. Immediately after the plasma step, an aqueous KCl solution with a concentration of 0.01 M was dispensed onto the substrates and spread by spin coating at 4000 rpm. The coated substrates were subsequently dried on a hot plate at 80 °C for 1 min. This KCl layer acts as a seeding promoter for

{MoS}_{2}

domain nucleation on

{SiO}_{2}

.

Solid sulfur (S) and molybdenum trioxide (

{MoO}_{3}

) powders were used as precursors. A total of 400 mg of S powder was loaded into a graphite crucible placed upstream in the low-temperature zone of the reactor, while 4 mg of

{MoO}_{3}

powder and the KCl-treated

p^{+} {-Si/SiO}_{2}

(90 nm) substrates were placed side by side in the central hot zone of the furnace. The

{MoO}_{3}

crucible and the substrates thus experienced the same nominal temperature profile. Sulfur solid source was heated indirectly by an external halogen lamp focused on the graphite crucible and maintained at approximately 140 °C during the growth, providing a stable S vapor flux into the reaction zone.

Before deposition the reactor was purged with

N_{2}

at 1000 sccm for 1 h at room temperature to remove residual air and moisture. For the growth step, the

N_{2}

flow was reduced to 500 sccm and kept constant throughout the temperature ramp, hold, and cool-down. The measured temperature profile during the CVD is shown in Figure 1b. Starting from room temperature, the furnace was ramped to 800 °C in 18 min. After reaching 800 °C, the temperature was held for 2 min to allow

{MoS}_{2}

nucleation and lateral domain growth on the

{SiO}_{2}

surface. At the end of the hold step, the furnace heating was switched off and the system was allowed to cool naturally to room temperature under continuous

N_{2}

flow. Under these conditions, the growth yields predominantly monolayer

{MoS}_{2}

on

{SiO}_{2}

, as confirmed by the structural and optical characterization.

2.2. Memtransistor Fabrication

{MoS}_{2}

memtransistors were fabricated on the CVD-grown monolayer

{MoS}_{2} {/p}^{+} {-Si/SiO}_{2}

(90 nm) substrates using optical lithography and reactive ion etching (RIE) followed by metal contact definition. All photolithography steps were performed on a maskless aligner MLA150 (Heidelberg Instruments Mikrotechnik GmbH, Heidelberg, Germany.).

In the first lithography step, the as-grown

{MoS}_{2}

film was patterned into isolated channel regions. A positive-tone photoresist AZ 1518 was spin coated onto the substrates and exposed using the MLA150 to define the

{MoS}_{2}

areas to be preserved. After development, the exposed

{MoS}_{2}

was etched by RIE using the patterned AZ 1518 as an etch mask. Immediately after the

{MoS}_{2}

etch, an

O_{2}

descum step was carried out in the same chamber. To minimize damage to the remaining

{MoS}_{2}

during resist stripping, the polymerized AZ 1518 was removed in a two-stage wet process: first, the samples were immersed in acetone for 10 min, followed by AZ 100 remover at 60 °C for 10 min, and then a second AZ 100 remover step for 5 min at room temperature. Finally, the substrates were rinsed in isopropanol and deionized water and dried with nitrogen.

Source and drain contacts were defined in a second photolithography step. A negative-tone photoresist AZ LNR-003 was spin coated and patterned to open contact pads and channel regions in the resistor. Gold contacts with a thickness of 100 nm were then deposited by electron-beam evaporation (CS400ES, VON ARDENNE GmbH, Dresden, Germany). No additional adhesion or capping layers were used in this process. After metallization, lift-off was carried out using the PS 3121 photoresist stripper (Intelligent fluids GmbH, Leipzig, Germany), which selectively removed the remaining AZ LNR-003 and the overlying metal, leaving well-defined Au source and drain electrodes contacting the

{MoS}_{2}

channels. Channel geometries and additional process parameters are listed with the corresponding device statistics in and in the section “Geometrical Definition and Tolerances”.

As shown in Appendix D, the device architecture and fabricated layout are illustrated by a schematic cross-section and representative SEM micrographs of the patterned memtransistor structures.

Geometrical Definition and Tolerances

All memtransistors in this work were defined by the same mask layout (no intentional geometry variation) with nominal channel length

L = 2 μ m

and width

W = 10 μ m

. Because the channel is lithographically defined, the dominant geometrical uncertainties arise from mask-to-mask alignment accuracy, resist development bias, and contact edge definition. Based on the lithography tool specifications and optical inspection of representative structures across the chip, we estimate the effective dimensional tolerances to be within

Δ L / L ≲ 5 %

and

Δ W / W ≲ 5 %

for the present device set (i.e., sub-micrometer deviations around the nominal values). The source/drain electrodes were designed with a fixed overlap to the channel region to ensure robust contacting. The back-gate oxide thickness was fixed by the starting substrate with 90 nm

{SiO}_{2}

thickness.

2.3. Structural and Electrical Characterization of ${MoS}_{2}$ Memtransistors

The as-grown

{MoS}_{2}

films were first examined by atomic force microscopy (AFM) to confirm the surface morphology and layer thickness. AFM measurements were performed on a Dimension V system (Veeco Instruments Inc., Plainview, NY, USA) operated in tapping mode. Height and phase images were recorded over scan areas in the 25 μm range on multiple regions of each sample. The apparent step height between

{MoS}_{2}

domains and the surrounding

{SiO}_{2}

surface was used to verify monolayer thickness, while the absence of significant multilayer islands within the device channels was checked by combining height and phase contrast.

Raman and photoluminescence (PL) spectroscopy were used to further confirm the monolayer character and crystalline quality of the films. Spectra were acquired with a Alpha300 apyron Raman imaging microscope (WITEC GmbH, Kroppach, Germany) using a 532 nm excitation laser and an excitation power of 0.5 mW at the sample. Raman and PL maps were collected across representative device areas to assess spatial uniformity. The relative positions and intensities of the characteristic

{MoS}_{2}

Raman modes, together with the strong direct-gap PL response, were used to identify monolayer regions.

Electrical measurements of the

{MoS}_{2}

memtransistors were carried out using a shielded probe station integrated with a Keithley 4200-SCS Semiconductor Characterization System. All measurements reported in this work were performed in a nitrogen atmosphere at room temperature to minimize adsorption-related drifts and hysteresis. Transfer characteristics (

I_{D}

–

V_{GS}

) and output characteristics (

I_{D}

–

V_{DS}

) were recorded for multiple devices. Unless stated otherwise, the drain voltage was fixed at

V_{DS} = 50

mV for transfer measurements, while the back-gate voltage was swept quasi-statically over the range relevant for reservoir computing operation. All electrical measurements were performed in a shielded probe station under a dry nitrogen atmosphere to minimize the influence of ambient humidity and adsorbates. Measurements were conducted under standard laboratory conditions (room temperature) with the device kept under nitrogen purge during transfer-curve, fading-memory, and time-series experiments.

The accuracy of the applied electrical biases is set by the source-measure units of the Keithley 4200-SCS; the resulting uncertainty in

V_{GS}

and

V_{DS}

is negligible compared to the voltage windows used throughout this work. The drain-current readout noise floor is far below the

μ

A-level currents relevant for the reported transfer curves and hysteresis metrics. Consequently, the dominant experimental uncertainties in extracted quantities arise from sweep discretization and numerical post-processing: the hysteresis area H is limited by the

V_{GS}

step size and current noise propagation through numerical integration, while

g_{m}

is limited by numerical differentiation and smoothing choices.

Figures of merit were extracted from the

I_{D}

–

V_{GS}

curves at

V_{DS} = 50

mV. The field-effect mobility was estimated in the linear regime using the channel geometry and the

{SiO}_{2}

gate-oxide capacitance per unit area. The threshold voltage

V_{TH}

was obtained by linear extrapolation of the

I_{D}

–

V_{GS}

characteristics in the above-threshold region, and the ON/OFF current ratio was defined as the ratio of

I_{D}

at the chosen ON- and OFF-state gate voltages within the accessible bias window.

2.4. Fading Memory: Definition and Experimental Extraction

Fading memory is the property that the influence of past inputs on the present device state decays with increasing delay. For a driven memtransistor used as a physical reservoir, this means that two input histories that differ only in the distant past produce reservoir states that converge in time. Here, we quantify fading memory directly from electrical measurements by estimating a characteristic decay time, denoted as the fading-memory time constant

τ_{FM}

.

To quantify the intrinsic relaxation dynamics under the operating conditions used for reservoir computing, we estimated characteristic time constants from a drain-current decay trace recorded at fixed bias

V_{GS} = 25 V

and

V_{DS} = 0.5 V

. The measured

I_{D} (t)

exhibits a gradual decrease over tens of seconds, consistent with bias-stress relaxation commonly associated with charge trapping/detrapping in the dielectric and at the

{MoS}_{2} {/SiO}_{2}

interface. Because the decay is not well described by a single exponential over the full acquisition window, we fit the data using a bi-exponential model from Equation (1):

I_{D} (t) = A_{1} exp (- \frac{t}{t_{1}}) + A_{2} exp (- \frac{t}{t_{2}}) + I_{0},

(1)

where

A_{1}

and

A_{2}

are amplitudes,

t_{1}

and

t_{2}

are relaxation time constants, and

I_{0}

is the long-time offset current. The resulting time constants capture a fast component (

t_{1}

, seconds) and a slow component (

t_{2}

, tens of seconds), which we use as an experimental proxy for the time scales that bound the usable fading memory in time-multiplexed operation. The full set of fitted parameters and uncertainties is provided in the Appendix B.

2.5. Reservoir Computing Protocol with ${MoS}_{2}$ Memtransistors

We implemented reservoir computing using single

{MoS}_{2}

memtransistors driven by a scalar chaotic input derived from the Lorenz-63 system. The Lorenz dynamics [38] are defined by

\frac{d x}{d t} = c_{1} (y - x), \frac{d y}{d t} = x (c_{2} - z) - y, \frac{d z}{d t} = x y - c_{3} z,

(2)

with

c_{1} = 10

,

c_{2} = 28

, and

c_{3} = 8 / 3

, a shown in Figure 2a. The equations were integrated numerically using a python ODEint with a time step of 0.001 s to generate a long trajectory in the attractor. After discarding the initial transient, the

x (t)

component (denoted X in Figure 2b) was sampled at a uniform time-step to form a one-dimensional discrete-time sequence

{X_{i}}

and used as the input signal with a varied time step. The integration and sampling parameters were chosen such that the dataset covered many visits to both lobes of the attractor. The full time series and scripts are provided in [39,40].

Because the raw Lorenz-X values do not match the gate-voltage range required for safe device operation, we applied an affine normalization to map

X_{i}

into a chosen gate window. First, the samples were linearly rescaled to the unit interval,

X_{normalized} = \frac{X_{i} - X_{min}}{X_{max} - X_{min}},

(3)

where

X_{min}

and

X_{max}

are the minimum and maximum of the training portion of the sequence. The normalized values were then mapped to the gate range

[V_{GS, \min}, V_{GS, \max}]

as

V_{GS, i} = V_{GS, \min} + (V_{GS, \max} - V_{GS, \min}) X_{normalized} .

(4)

The specific voltage windows used in the non-time-multiplexed and time-multiplexed regimes (A–H) are summarized in Table A1. During measurements, the drain-to-source voltage

V_{DS}

and the sampling time interval

Δ t

were kept constant within each regime, while the back gate was driven by piecewise-constant pulses corresponding to the sequence

{V_{GS, i}}

.

Non-time-multiplexed (non-TM) operation was realized by directly sampling the memtransistor drain current

I_{DS}

at the end of each gate pulse. For a given regime, the Lorenz-X sequence was applied as a train of

V_{GS}

steps with dwell time

Δ t

and fixed

V_{DS}

, and the stabilized current at the end of each dwell interval was recorded as the scalar reservoir state. We investigated four such configurations (A–D), which differ in

V_{DS}

,

Δ t

, and the chosen gate window, corresponding to different trade-offs between subthreshold sensitivity, channel conduction, and hysteresis. The resulting current sequence was later aligned with the input sequence to form pairs of input states and targets for training and testing.

To increase the effective state dimensionality, we also implemented time-multiplexed (TM) operation with binary masks. In this case, the input sequence was first normalized to a symmetric range

X_{c} [n] \in [- 1, 1]

at a macro-step cadence indexed by n. Each macro-step was then expanded into N virtual nodes by driving the gate with a masked waveform

V_{GS} [n, k] = V_{center} + A X_{c} [n] m_{k}, k = 1, \dots, N,

(5)

where

V_{center}

is the center of a narrow, device-safe gate window, A is the modulation amplitude, and

m_{k} \in {- 1, + 1}

is a Rademacher mask drawn once with a fixed pseudo-random seed. Within each macro-step of duration

T_{H} = N Δ t

, the gate was held at each

V_{GS} [n, k]

for a dwell time

Δ t

, and the corresponding drain current

I_{DS} [n, k]

was sampled at the end of the dwell. This produced N virtual nodes per macro-step. In the TM configurations (E–H), the device state for node k was defined as

z [n, k] = {log}_{10} (I_{DS} [n, k] + I_{0}),

(6)

with

I_{0} = 10^{- 11}

A added to suppress numerical issues near the noise floor. In some regimes, a one-window lag was included by concatenating the present and previous window states, as detailed in Table A1.

For each macro-step n, the node states were collected into a window vector

s_{n} = [z [n, 1], z [n, 2], \dots, z [n, N]] \in R^{N},

(7)

and the final feature vector for readout was formed as

x_{n} = \{\begin{matrix} s_{n}, & non - lagged TM regime, \\ [s_{n}, s_{n - 1}], & TM regime with + 1 lag, \end{matrix}

(8)

after discarding a short washout of initial windows. The target for each macro-step was the Lorenz-X value at the next macro-step,

y_{n} = X [n + 1]

, so that the memtransistor reservoir performs one-step-ahead prediction at the macro cadence.

The linear readout was trained by ridge regression on standardized features. The sequence of windowed pairs

{(x_{n}, y_{n})}

was split chronologically into training and test sets (e.g., 70/30 split), without shuffling, to respect the temporal structure of the data. On the training set, we computed the per-feature mean

μ

and standard deviation

σ

and standardized both training and test features accordingly. A bias term was included by augmenting the standardized feature matrix with an intercept column. The readout weights were obtained by minimizing a Tikhonov-regularized least-squares objective with penalty parameter chosen by grid search on the training set. Performance was evaluated on the held-out test set using the normalized root-mean-square error (NRMSE) and the coefficient of determination

R^{2}

, defined as

\begin{matrix} RMSE & = \sqrt{\frac{1}{T} \sum_{t = 1}^{T} {(y_{t} - {\hat{y}}_{t})}^{2}}, \end{matrix}

(9)

\begin{matrix} NRMSE & = \frac{RMSE}{σ_{y}}, σ_{y} = \sqrt{\frac{1}{T} \sum_{t = 1}^{T} {(y_{t} - \bar{y})}^{2}}, \end{matrix}

(10)

\begin{matrix} R^{2} & = 1 - \frac{\sum_{t = 1}^{T} {(y_{t} - {\hat{y}}_{t})}^{2}}{\sum_{t = 1}^{T} {(y_{t} - \bar{y})}^{2}}, \end{matrix}

(11)

where

y_{t}

and

{\hat{y}}_{t}

denote the true and predicted Lorenz-X values on the test set,

\bar{y}

is their mean, and T is the number of test samples. The detailed hyperparameters and the full measurement logs for regimes A–H are provided in Table A1 and in the open dataset and code repository [39,40].

In many experimental forecasting scenarios, only a subset of the underlying dynamical state is available to the learning system (partial observation). In this setting, the instantaneous input at a single time step does not uniquely define the state of the task system, and accurate prediction requires access to recent history. Reservoir computing naturally addresses partial observation because the reservoir state depends on past inputs and can therefore act as an implicit delay embedding of the observed signal [41,42]. In the context of chaotic time-series prediction, this perspective clarifies why the relevant device requirement is not maximizing conventional transistor figures of merit, but matching the nonlinear fading-memory dynamics (memory depth and response timescales) to the sampling interval and the task dynamics.

3. Results

3.1. Crystal Growth and Monolayer Quality

The CVD process described in Section 2.1 yields discrete triangular

{MoS}_{2}

domains with an average lateral size of about

50 μ m

, which is large enough to accommodate the

10 μ m \times 2 μ m

transistor channels fully within a single flake. An optical micrograph of a representative chip after growth is shown in Figure 3a. Individual triangular flakes with lateral dimensions of tens of micrometres are visible on the

{SiO}_{2}

surface, and the regions selected for device fabrication were chosen such that the entire channel resides inside a single monolayer domain.

The surface morphology and layer thickness were examined by AFM. Figure 3b presents a tapping-mode AFM height image of a typical

{MoS}_{2}

flake, and Figure 3c shows a line profile across the

{MoS}_{2} {/SiO}_{2}

step. The apparent step height between the flake and the surrounding

{SiO}_{2}

lies in the range of 0.7–0.9 nm, consistent with monolayer

{MoS}_{2}

on

{SiO}_{2}

when tip convolution and adsorbates are taken into account. Within the channel regions, we did not detect extended areas with multiples of this step height, indicating that the active devices are formed predominantly within monolayer flakes rather than multilayer aggregates or overlapped domains.

Raman spectroscopy confirms the monolayer character of the CVD

{MoS}_{2}

and provides a first handle on strain, doping, and crystallinity (Figure 3d). The spectrum, measured with 532 nm excitation at 0.5 mW power, exhibits the characteristic in-plane

E_{2 g}^{1}

and out-of-plane

A_{1 g}

modes. Single-Gaussian fits to the two main peaks yield positions of

385.0 {cm}^{- 1}

(

E_{2 g}^{1}

) and

402.6 {cm}^{- 1}

(

A_{1 g}

), with full widths at half maximum (FWHM) of

3.7 {cm}^{- 1}

and

7.4 {cm}^{- 1}

, respectively. The peak separation

Δ ω = ω_{A_{1}} - ω_{E_{2 GS}^{1}}

is

17.6 {cm}^{- 1}

. For exfoliated monolayers on

{SiO}_{2}

, reported separations are typically in the 19–

20 {cm}^{- 1}

range, whereas thicker layers show larger

Δ ω

and broader

A_{1 g}

lines [1,43]. Our slightly reduced separation and modest

A_{1 g}

broadening therefore point to monolayer

{MoS}_{2}

with a small tensile strain component and moderate n-type doping, but without strong degradation of crystallinity. The

A_{1 g}

intensity is only slightly higher than that of

E_{2 g}^{1}

, with an amplitude ratio

I_{A_{1}} / I_{E_{2 GS}^{1}} \approx 1.05

, which is comparable to values reported for high-quality monolayers and markedly different from heavily oxidized or disordered films where

A_{1 g}

broadens and dominates the spectrum [43].

Photoluminescence (PL) spectroscopy further corroborates the monolayer assignment and reveals the excitonic structure (Figure 3e). A three-Gaussian deconvolution of the PL spectrum in the 1.7–2.1 eV range identifies the A trion, A exciton, and B exciton contributions. The fitted peak energies are 1.80 eV for the A trion, 1.84 eV for the A exciton, and 1.96 eV for the B exciton. The corresponding FWHM values are approximately 0.12 eV (A trion), 0.09 eV (A exciton), and 0.12 eV (B exciton). The intensity ratios, normalized to the A exciton amplitude, are

I_{T} / I_{A} \approx 0.81

and

I_{B} / I_{A} \approx 0.29

. Canonical monolayer

{MoS}_{2}

on

{SiO}_{2}

exhibits a dominant A exciton at ∼1.85–1.90 eV, a weaker B exciton around 2.0 eV, and a trion shoulder whose strength scales with electron density [1,2]. Our spectra match this pattern: the strong direct-gap A exciton with sub-0.1 eV linewidth indicates relatively low inhomogeneous broadening and good crystalline quality, while the finite trion contribution is consistent with moderate n-type doping in the

10^{12}

–

10^{13}

{cm}^{- 2}

range, as commonly observed in CVD-grown monolayers [1].

The main Raman and PL parameters extracted from these fits are summarized in Table 1. Compared with reference data for exfoliated and CVD-grown monolayers, our values fall in the range associated with structurally intact, lightly doped monolayer

{MoS}_{2}

, rather than strongly strained or defective material [1,2,43]. Together with the AFM and optical data, this supports the conclusion that the transistor channels are defined within high-quality monolayer flakes, providing a reproducible materials baseline for the memtransistor and reservoir computing experiments discussed in the following sections.

These structural and optical signatures demonstrate that the active regions of the devices are formed within isolated monolayer

{MoS}_{2}

flakes of well-controlled thickness, strain, and excitonic response. This provides a reproducible materials baseline for the electrical characterization and reservoir computing experiments presented in the subsequent sections.

3.2. Memtransistor Characteristics and Hysteresis

We next quantify the DC characteristics of a representative CVD

{MoS}_{2}

memtransistor and the strength of its hysteresis. Figure 4 shows the electrical response of a back-gated device with channel length

L = 2 μ m

and width

W = 10 μ m

on 90 nm

{SiO}_{2}

. All measurements are performed in a nitrogen atmosphere at room temperature. The output characteristics

I_{D} (V_{DS})

under different gate biases span drain currents from the sub-nanoampere range at

V_{GS} = 0 V

up to

\sim 1.1 \times 10^{- 4} A

at

V_{GS} = 40 V

and

V_{DS} = 20 V

. At low drain bias the I–V curves are nearly linear, with a small-signal on-state resistance on the order of

10^{5} Ω

at the highest gate bias, and then gradually evolve towards quasi-saturation as

| V_{DS} |

increases, as expected for monolayer

{MoS}_{2}

transistors on

{SiO}_{2}

.

The transfer characteristics

I_{D} (V_{GS})

at

V_{DS} = 50

and

500 mV

show n-type field-effect behavior with a clear turn-on around positive gate voltages. For the forward branch, the drain current at

V_{DS} = 50 mV

increases from ∼

10^{- 11} A

at

V_{GS} = - 10 V

to

1.9 \times 10^{- 6} A

at

V_{GS} = 30 V

. At

V_{DS} = 500 mV

the corresponding current reaches

1.9 \times 10^{- 5} A

. Depending on whether the forward or reverse branch is used for normalization, the ON/OFF ratio thus lies in the

10^{5}

–

10^{6}

range. Using the standard linear-regime expression,

μ_{FE} = \frac{L}{W C_{ox} V_{DS}} \frac{d I_{D}}{d V_{GS}},

(12)

with

C_{ox} \approx 3.8 \times 10^{- 4} F m^{- 2}

for 90 nm

{SiO}_{2}

, and fitting the forward

I_{D} (V_{GS})

branch between

V_{GS} = 18

and

24 V

at

V_{DS} = 50 mV

, we obtain a field-effect mobility

μ_{FE} \approx 10 {cm}^{2} V^{- 1} s^{- 1}

. Linear extrapolation of the same fit to

I_{D} = 0

yields a threshold voltage

V_{TH} \approx 13 V

. Both values are typical for back-gated CVD monolayer

{MoS}_{2}

devices on 90 nm

{SiO}_{2}

and indicate that the device remains transistor-like despite the presence of hysteresis.

Hysteresis appears as a systematic separation between the up- and down-sweep

I_{D}

-

V_{GS}

traces over the full

- 10

to

30 V

gate window as shown in Figure 4b. We quantify this memory effect by the gate-sweep hysteresis loop area:

H = \int_{V_{G S, min}}^{V_{G S, max}} |I_{up} (V_{GS}) - I_{down} (V_{GS})| d V_{GS} .

(13)

where

I_{u p}

and

I_{d o w n}

denote the currents for the forward and backward sweeps, respectively, and here

V_{G, min} = - 10 V

and

V_{G, max} = 30 V

. Numerical integration of the measured traces on a 0.2 V grid gives

H \approx 2.1 μ A \cdot V

at

V_{DS} = 50 mV

and

H \approx 17 μ A \cdot V

at

V_{DS} = 500 mV

for the device used in the subsequent reservoir computing experiments. The area is dominated by the high-current region above

\sim 10 V

, while the deep-off regime at negative gate voltages contributes negligibly to H.

An independent view of the memory window comes from output hysteresis at fixed gate bias (Figure 4d). Using

H = \int_{0}^{20 V} |I_{up} (V_{DS}) - I_{down} (V_{DS})| d V_{DS} .

(14)

we obtain loop areas of approximately

61 μ A \cdot V

,

0.90 mA \cdot V

, and

1.8 mA \cdot V

for

V_{GS} = 20

, 30, and

40 V

, respectively. The corresponding high-resistance to low-resistance state ratios at a small read bias are

R_{HRS} / R_{LRS} \approx 91

and 42 at

V_{GS} = 0 V

(for

V_{DS} = 50

and

500 mV

), decreasing to

\approx 2.2

and

1.93

at

V_{GS} = 10 V

and approaching unity at

V_{GS} = 30 V

. Thus, the largest state contrast is available when reading near the subthreshold regime, while the hysteresis becomes almost invisible once the channel is strongly accumulated.

Recapitulating, the CVD

{MoS}_{2}

memtransistor combines a technologically relevant mobility of order

10 {cm}^{2} V^{- 1} s^{- 1}

, an ON/OFF ratio above

10^{5}

, and a moderate but well-defined hysteresis window in both transfer and output characteristics. This balance between transistor performance and memory effect is essential for later use as a nonlinear, history-dependent element, but it also ensures that the device remains representative of CVD-grown

{MoS}_{2}

field-effect transistors more broadly, keeping the focus of the manuscript on growth and device-level behavior. Device-to-device statistics for H,

g_{m, max}

and

μ_{FE, max}

extracted from the full transfer-curve dataset are summarized in Appendix C (Figure A2).

3.3. Lorenz-63 Prediction Without Time-Multiplexing

We first establish a non-time-multiplexed baseline where the Lorenz-63 signal directly drives a single

{MoS}_{2}

memtransistor. The scalar Lorenz-X trajectory, preprocessed and mapped to the gate window as described above, is converted into a sequence of gate-to-source voltages

V_{G S} (t)

, while the drain-to-source voltage

V_{D S}

is held constant within each regime (Table A1). The drain current

I_{D S} (t)

thus realizes a scalar, history-dependent nonlinear mapping of the chaotic input. After logarithmic transformation and standardization, we train a linear ridge-regression readout to perform one-step prediction of the Lorenz-X signal; performance is quantified by NRMSE and

R^{2}

on a held-out test set.

Measurement results (purple dots) and applied gate-to-source voltage (yellow lines) inputs are presented in Figure 5. In the first pair of regimes (A and B, Figure 5a and Figure 5b, respectively) we probe a wide gate window that covers subthreshold and above-threshold operation. The normalized Lorenz-X is mapped to

V_{G S} \in [0, 30]

V, and the device is biased either at

V_{D S} = 0.5

V with sampling interval

Δ t = 0.5

s (regime A) or at

V_{D S} = 3

V with

Δ t = 2

s (regime B). At these settings the memtransistor output current spans several decades, from the subthreshold noise floor up to the microampere range. The best non-time-multiplexed predictions for these two regimes yield NRMSE

\approx 0.60

and

R^{2} \approx 0.64

(regime A) and NRMSE

\approx 0.61

with

R^{2} \approx 0.62

(regime B) for a 60/40 chronological train-test split. These values indicate that the device captures some structure of the Lorenz dynamics but leaves a large fraction of the variance unexplained.

To better exploit the steep part of the transfer characteristic, we confine the gate window to

V_{G S} \in [10, 40]

V in regimes C (Figure 5c) and D (Figure 5d) while keeping

V_{D S} = 3

V. Regime C uses a relatively slow sampling interval

Δ t = 2

s, whereas regime D samples faster at

Δ t = 0.1

s. This focuses the input on the region of highest transconductance and reduces the time available for slow drift between samples. The best results in these regimes, obtained for a 70/30 split, improve to NRMSE

\approx 0.59

and

R^{2} \approx 0.65

(regime C) and NRMSE

\approx 0.46

with

R^{2} \approx 0.79

(regime D). Thus, concentrating

V_{G S}

on the high-gain region and shortening

Δ t

both enhance the useful nonlinearity and effective memory encoded in

I_{D S} (t)

, but the prediction quality remains well below the level typically targeted for high-fidelity Lorenz forecasting.

Overall, the non-time-multiplexed experiments confirm that a single CVD

{MoS}_{2}

memtransistor, driven directly by a Lorenz-derived

V_{G S} (t)

waveform, can provide a reproducible nonlinear transformation with moderate short-term memory. At the same time, the achievable NRMSE and

R^{2}

in regimes A–D remain limited by the scalar nature of the node, the finite signal-to-noise ratio in the subthreshold range, and drift on the time scales of the measurement. We therefore treat these results as a device-level baseline and, in the next subsection, introduce time-multiplexed operation to increase the effective state dimensionality without modifying the device or growth process.

3.4. Time-Multiplexed Reservoir Operation

We next introduce time-multiplexed operation to increase the effective state dimensionality of the reservoir without changing the underlying device or growth process. Instead of using each Lorenz-X sample once, as in regimes A–D, we expand every macro-step into N short sub-steps of duration

Δ t

. Within each macro-step j, the Lorenz-derived scalar is first mapped into the gate window and then multiplied by a fixed Rademacher mask

m_{k} \in {- 1, + 1}

,

k = 1, \dots, N

, to generate a masked waveform

V_{G S} (t)

. The drain voltage

V_{D S}

is held constant, and the resulting drain current

I_{D S} (t)

is sampled at the end of each sub-step, yielding N virtual nodes per macro-step.

The left axis in Figure 6 shows typical masked

V_{G S} (t)

traces (yellow lines), while the right axis shows the corresponding

I_{D S} (t)

response for the four time-multiplexed regimes E–H (purple dots). The Lorenz-63 prediction performance for all regimes is summarized in Table A1. All time-multiplexed measurements are performed at

V_{D S} = 3 V

and in a narrow gate window

V_{G S} \in (31, 35) V

, that is, close to the high-transconductance region of the transfer characteristic. This choice minimizes the influence of deep subthreshold noise and strongly accumulated on-state drift, and it keeps the instantaneous currents in a technologically reasonable range.

Regime E maximizes the number of virtual nodes under these conditions. We use

N = 112

nodes with

Δ t = 0.10 s

, which gives a macro-step duration

T_{H} = N Δ t = 11.2 s

, and record

M = 36

macro-steps. The raw currents

I_{D S}

are standardized node-wise (per-node z score) and used directly as the reservoir state. This configuration yields the largest state vector but also exposes the device to the longest effective memory window. Consistent with that, the test NRMSE is highly split-dependent: for a 60/40 split it remains high (NRMSE

= 0.85

,

R^{2} = 0.27

), whereas for an 80/20 split it improves to NRMSE

= 0.19

and

R^{2} = 0.98

(Table A1, label E), indicating that long-term drift and low-frequency fluctuations limit the useful fading memory when

T_{H}

approaches tens of seconds.

Regime F shortens the macro-step and compresses the current dynamic range. Here, we use

N = 16

and

Δ t = 0.10 s

, so that

T_{H} = 1.6 s

, and increase the number of macro-steps to

M = 256

. Instead of raw current we feed

{log}_{10} (I_{D S})

into the readout, which improves numerical conditioning and reduces the weight of rare high-current excursions. Across all chronological splits, the resulting test errors are low and relatively stable; the best result (60/40 split) reaches NRMSE

= 0.088

with

R^{2} = 0.99

(label F). This shows that, once

T_{H}

is comparable to the intrinsic fading-memory time of the device and the state distribution is well conditioned, even a modest number of virtual nodes can support accurate short-horizon Lorenz-63 prediction.

Regime G increases the memory window at fixed dimensionality. We keep

N = 16

nodes but increase the sub-step duration to

Δ t = 0.25 s

, so that

T_{H} = 4.0 s

, again with

M = 256

. As in regime F we use

{log}_{10} (I_{D S})

but now augment the reservoir state with a one-step lag: the readout sees both the current and previous macro-step states. This explicit lag partially compensates for the shorter physical memory of the device. The best test metrics (60/40 split) reach NRMSE

= 0.16

and

R^{2} = 0.97

(label G). The time traces in Figure 6c show that this configuration still samples a broad range of

I_{D S}

values while remaining less sensitive to slow drift than regime E.

Finally, regime H explores a larger state dimension at an intermediate memory window. We double the number of nodes to

N = 32

at

Δ t = 0.10 s

, giving

T_{H} = 3.2 s

and

M = 128

, and retain both the

{log}_{10} (I_{D S})

representation and the one-step lag. An explicit bias term (intercept) is included in the linear regression. The best result for regime H (80/20 split) achieves NRMSE

= 0.10

and

R^{2} = 0.99

(label H), comparable to regime F. Compared with

N = 16

, doubling N yields only a modest improvement, consistent with the expectation that, once the reservoir dimension exceeds the effective dimensionality of the task and the physical memory window, additional virtual nodes offer diminishing returns.

The numerical trends in Table A1 are reflected directly in the reconstructed Lorenz-63 trajectories. Figure 7 compares ground-truth and predicted Lorenz-X time traces for all regimes A–H, using the best-performing chronological split in each case. In the non-time-multiplexed configurations A–D (Figure 7a–d), the training segments can be fitted reasonably well, but the test trajectories show visibly damped oscillations and phase errors, consistent with the relatively high NRMSE of 0.46–0.61. In contrast, the time-multiplexed regimes F and H (Figure 7f,h) yield test traces that track both the amplitude and phase of the Lorenz dynamics over several oscillation periods before diverging, in line with their lower errors

NRMSE \approx 0.09

–0.10. These qualitative differences support the view that tuning the effective memory window

T_{H}

, the number of virtual nodes N, and the current preprocessing is more important for RC performance than further incremental improvements in conventional FET metrics for the present devices.

Taken together, regimes E–H demonstrate that time multiplexing allows a single CVD

{MoS}_{2}

memtransistor to reach NRMSE values near

0.09

and

R^{2} \approx 0.99

for short-horizon Lorenz-63 prediction, while keeping all device-level conditions (growth, contacts, and oxide stack) fixed. From a device perspective, the comparison highlights three trends. First, excessively long effective memory windows

T_{H}

amplify drift and low-frequency noise and can degrade performance despite large N. Second, logarithmic current scaling is beneficial once the device spans several decades of

I_{D S}

, because it balances on- and off-state contributions. Third, modest architectural tweaks such as lagged states can recover part of the effective memory without changing the physical device. We therefore view time-multiplexed RC here primarily as a characterization protocol for the nonlinear, history-dependent response of CVD-grown

{MoS}_{2}

memtransistors, rather than as a fully optimized neuromorphic system.

4. Discussion

4.1. Benchmarking Against Existing Hardware Reservoirs

The best time-multiplexed regimes demonstrate that a single back-gated CVD

{MoS}_{2}

memtransistor can reach one-step Lorenz-63 prediction errors on the order of

10^{- 1}

under realistic device-bias constraints. From Table A1, the optimal non-time-multiplexed configuration (regime D) saturates at

NRMSE \approx 0.46

and

R^{2} \approx 0.79

, whereas the best time-multiplexed settings (regimes F and H) reduce the error to

NRMSE \approx 0.09

–0.10 with

R^{2} \approx 0.99

. Thus, time multiplexing and simple pre-processing of

I_{D S}

improve the prediction accuracy by roughly a factor of five relative to the direct, non-time-multiplexed baseline at the same drain bias, without modifying the device or growth stack. Table 2 consolidates the benchmarking of numerous experiments performed on the devices as well as simulations.

It is instructive to place these values alongside other hardware reservoirs that have tackled Lorenz-type tasks. State-of-the-art numerical reservoirs such as next-generation RC and symmetry-aware RC reach

NRMSE

values around

10^{- 3}

–

10^{- 4}

for Lorenz-63 short-term forecasting and variable inference in simulation, and optimized echo state networks can achieve comparable errors for multi-step autoregressive prediction [21]. Experimental photonic implementations of next-generation RC report

NRMSE

in the

10^{- 2}

range for Lorenz-63, similar to other optical reservoirs that trade device complexity for low error and long valid-prediction times [21]. Among memristive and ferroelectric platforms, dynamic

{CuInP}_{2} S_{6}

memories and

α

{-In}_{2} {Se}_{3}

ferroelectric memtransistors have demonstrated Lorenz-type chaotic time-series prediction with reported errors between

NRMSE \approx 0.014

and

0.008

while operating at low voltages [49,50].

Closer to the present work, polymer electrolyte-gated

{MoS}_{2}

transistors have recently been used as reservoir nodes for time-series processing, including Lorenz prediction [36]. In that platform,

{Li}^{+}

intercalation drives a reversible

{2H-1T}^{'}

phase transition that provides strong nonlinearity and long fading memory, and a time-multiplexed single-node reservoir achieved Lorenz

NRMSE

as low as ∼

0.04

[36]. Our back-gated CVD

{MoS}_{2}

memtransistor reservoir operates in a more conservative regime:

V_{G S}

is confined to (31,35) V on 90 nm

{SiO}_{2}

,

V_{D S} = 3

V, and the device is read through a conventional solid-state gate stack without electrolytes, ionic motion, or phase transitions. Under these constraints the best errors,

NRMSE \approx 0.09

with

R^{2} \approx 0.99

, are about a factor of two above the polymer-electrolyte

{MoS}_{2}

benchmark and an order of magnitude above the most optimized photonic or memristive reservoirs, but they are obtained on a standard CVD-grown, back-gated transistor structure.

From a device perspective this comparison highlights three points. First, the absolute RC performance of our platform is limited less by the crystalline quality of the monolayer and more by the choice of gate stack and biasing: dry

{SiO}_{2}

confines the available nonlinearity and memory to trap-mediated charge storage, whereas ionic or ferroelectric media provide stronger history dependence at lower voltages. Second, within those constraints, the CVD

{MoS}_{2}

memtransistor still attains

NRMSE \approx 0.09

as a single physical node with a purely linear readout and no network-level optimization or feedback. Third, because both the semiconductor and gate dielectric are wafer-process-compatible, the same device platform can in principle be scaled to multi-node arrays or 3D-integrated stacks without introducing new materials or processing steps.

We therefore view the Lorenz-63 benchmarks in this work not as an attempt to surpass the best reported RC error, but as a quantitative sanity check that links device-level metrics (mobility, hysteresis, and trap-mediated memory on sub- to multi-second scales) to an application-level figure of merit (NRMSE on a standard chaotic prediction task). In combination with the open time-series dataset and RC pipeline released with this work, this provides a reproducible baseline against which future

{MoS}_{2}

growth, contact engineering, or gate-stack modifications can be judged in terms of their impact on both device characteristics and reservoir-computing performance.

We finally note that the Lorenz-63 benchmarks collected in Table 2 are not strictly equivalent in task difficulty to the configuration used here. Several of the best-performing numerical schemes operate in a “full observation” regime, where all three Lorenz coordinates

(x, y, z)

or at least two components are supplied to the model at each time step, and the readout learns the flow map

X (t) \mapsto X (t + Δ t)

[20,21,44,45]. In our

{MoS}_{2}

memtransistor implementation, the reservoir is driven by a single scalar input proportional to the x-component only, and the readout is trained to predict

x (t + Δ t)

from this one-dimensional observation and its fading memory. From a dynamical-systems perspective, this corresponds to forecasting from a partial observation rather than from the full state; the mapping

x (t) \mapsto x (t + Δ t)

is generally many-to-one on the Lorenz attractor, so the reservoir must internally reconstruct the missing coordinates via its finite memory, in line with delay-embedding arguments for attractor reconstruction [21]. This makes the task more sensitive to noise and memory depth than full-state forecasts at the same sampling interval and reservoir size.

A second difference concerns the effective prediction horizon per step. Many Lorenz-63 RC studies integrate the equations with a small internal time step (for numerical accuracy) and then train on data sampled with

Δ t \approx 0.0001

–

0.02

[20,44]. In that regime, each one-step forecast covers only a short fraction of a Lyapunov time, and nearby trajectories separate relatively weak between successive samples, which typically leads to lower one-step NRMSE at fixed model capacity. In contrast, our best hardware configuration operates at a coarser sampling interval (here,

Δ t = 0.10

in Lorenz units), so that each prediction step spans a larger fraction of

τ_{L}

. Because prediction errors in chaotic systems grow with elapsed time along the trajectory, a larger sampling interval intrinsically amplifies one-step errors and reduces the attainable NRMSE for a given reservoir size. The use of a single scalar input and a longer effective time step means that the NRMSE of the present

{MoS}_{2}

memtransistor reservoir should be viewed as a conservative benchmark relative to numerical RC results obtained under full-state, finely sampled conditions.

4.2. Device Metrics and Design Window for ${MoS}_{2}$ Memtransistor Reservoirs

The experiments above show that, for the present CVD

{MoS}_{2}

memtransistors, reservoir performance is governed more by how we bias and read the device than by further incremental improvements in classical FET metrics. The representative transistor combines a field-effect mobility of order

10 {cm}^{2} V^{- 1} s^{- 1}

, an ON/OFF ratio above

10^{5}

, and a moderate hysteresis window quantified by

H \approx 2.1 μ A \cdot V

at

V_{D S} = 50 mV

and

H \approx 17 μ A \cdot V

at

V_{D S} = 500 mV

over

V_{G S} \in [- 10, 30] V

. Within this envelope, the non-time-multiplexed Lorenz-63 experiments in regimes A–D show that confining

V_{G S}

to the high-transconductance region and shortening the sampling interval are more important for NRMSE than pushing mobility or ON/OFF higher. Form Figure 8 we can summarize that operating in

V_{G S} \in (10, 40) V

at

V_{D S} = 3 V

and

Δ t = 0.1 s

(regime D) already reduces the error to

NRMSE \approx 0.46

, whereas wide gate windows that spend substantial time in deep subthreshold or strongly accumulated regimes perform worse despite similar static device figures of merit.

Time-multiplexed operation refines this picture. Regimes E–H keep the same physical device and gate stack but adjust the effective memory window

T_{H} = N Δ t

, the number of virtual nodes N, and the preprocessing of

I_{D S}

. Performance improves markedly when

T_{H}

is made comparable to the intrinsic fading-memory time of the trap-mediated hysteresis, on the order of a few seconds, rather than being pushed to 11.2 s as in regime E. The best Lorenz-63 errors,

NRMSE \approx 0.09

–0.10 with

R^{2} \approx 0.99

in regimes F and H, occur for

T_{H}

between 1.6 and 3.2 s,

N = 16

–32, and a narrow gate window

V_{G S} \in (31, 35) V

at

V_{D S} = 3 V

, combined with a

{log}_{10} (I_{D S})

representation and a simple one-step lag. In this bias range the device operates on the steep, nearly linear part of the transfer characteristic where small

V_{G S}

variations are efficiently converted into current changes, while the hysteresis is strong enough to provide short-term memory but not so slow as to drift over the full duration of a macro-step. The stability of the best NRMSE across chronological splits in Table A1 supports the view that we are exploiting reproducible trap dynamics rather than uncontrolled long-term drift.

These observations suggest a practical design window for CVD

{MoS}_{2}

memtransistor reservoirs. At the device level, it is sufficient to reach mobilities in the

10 {cm}^{2} V^{- 1} s^{- 1}

class and ON/OFF ratios above

10^{4}

–

10^{5}

; further gains in these parameters are unlikely to translate into proportional reductions in NRMSE unless the hysteresis spectrum and gate stack are also engineered. More critical is to realize a controllable, intermediate hysteresis: large enough that

R_{HRS} / R_{LRS}

exceeds a few tens near the chosen read bias and that fading memory covers 1–4 s, but not so large that the device fails to wash out past inputs within the reservoir time horizon, defined by the macro-step duration of time-multiplexed reservoir. On the operation side, the results here point to narrow

V_{G S}

windows around the high-transconductance region, moderate

V_{D S}

(a few volts), and time-multiplexing parameters tuned so that

T_{H}

matches the device’s intrinsic memory time. Within this window, the

{MoS}_{2}

memtransistor behaves as a compact, wafer-process-compatible nonlinear node whose RC performance is acceptable to resolve trends when growth, contact engineering, or trap spectra are varied, while keeping the focus of the platform on scalable device physics rather than on RC optimization.

4.3. Stability, Reproducibility, and Scope of Structure–Performance Relations

All results reported here were obtained from a single CVD-grown chip processed in one fabrication run, thereby eliminating run-to-run variation and enabling a focused assessment of device-to-device reproducibility within a fixed process flow. In total, 65 transistor structures (monolayer CVD

{MoS}_{2}

,

L = 2 μ m

,

W = 10 μ m

) were patterned, contacted, and electrically characterized. Across this set, the hysteresis area H exhibits a broad distribution, consistent with device-to-device dispersion in trap populations and local electrostatics even on an optically uniform monolayer film, whereas the maximum transconductance and field-effect mobility show comparatively narrower spreads. To probe task-level reproducibility, we further recorded Lorenz-63 time-series measurements on

n = 17

devices selected from the higher-hysteresis subset under an identical input mapping and a fixed chronological 70/30 train/test split. Using

{log}_{10} (| I_{D} |)

as the reservoir state to mitigate slow baseline drift consistent with charge trapping/detrapping, the one-step prediction error shows low device-to-device scatter (range

0.58

–

0.61

), motivating the selection of an average-performing device for the detailed demonstrations rather than an outlier. We note that the absolute NRMSE values in this multi-device dataset remain comparatively high because time multiplexing was not implemented for every device under the available measurement time; nevertheless, combining the same 17 devices as a space-multiplexed reservoir improves the one-step NRMSE to

0.57

, consistent with the expected benefit of increased state dimensionality.

Finally, we emphasize the scope of this work regarding structure–performance relationships. Devices were intentionally fabricated with a fixed channel geometry and monolayer thickness, and we therefore do not claim a systematic dependence of fading-memory time constants or prediction accuracy on thickness (1L vs multilayer) or geometry (L, W). These parameters are expected to influence response times through electrostatic scaling, trap occupancy, and RC time constants associated with charge trapping and dielectric relaxation, and they provide clear design parameters for the future studies.

4.4. Limitations and Outlook Toward CMOS-Compatible Neuromorphic Hardware

The present implementation has several clear limitations from a neuromorphic-hardware perspective. First, the back-gated geometry with 90 nm

{SiO}_{2}

requires

V_{G S}

up to 35–40 V and

V_{D S} = 3

V, which is far above typical CMOS core voltages and is acceptable here only because we target a device-physics demonstration rather than an integrated circuit. Second, the relevant fading-memory times are on the order of 1–4 s and the sampling interval

Δ t

is at best 0.1 s, so the reservoir operates in a quasi-static regime set by trap dynamics and instrumentation rather than at radiofrequency or GHz bandwidths.

At the same time, the materials stack and fabrication flow are compatible with an eventual transition toward CMOS back-end integration. Moving from a global back gate to patterned top gates with thin high-k dielectrics or ferroelectric layers would reduce the required

V_{G S}

by one to two orders of magnitude while preserving or even enhancing the useful hysteresis. The same CVD growth and contact scheme can, in principle, be adapted to BEOL-compatible thermal budgets, enabling local

{MoS}_{2}

memtransistor arrays above conventional logic. On the architecture side, the single-node time-multiplexed reservoir demonstrated here can be extended to small multi-node ensembles, co-integrated with CMOS readout and training circuits, and exercised on a broader set of temporal tasks. In this sense, the current work should be viewed as a device-level benchmark that links CVD growth and hysteresis engineering to a standard RC task; future efforts will need to trade some of this simplicity for lower voltages, faster dynamics, and array-level integration to approach truly CMOS-compatible neuromorphic hardware.

5. Conclusions

We have demonstrated that back-gated memtransistors based on CVD-grown monolayer

{MoS}_{2}

on 90 nm

{SiO}_{2}

combine conventional transistor performance with sufficient hysteresis and fading memory to support chaotic time-series prediction via reservoir computing. The representative devices exhibit field-effect mobility on the order of

10 {cm}^{2} V^{- 1} s^{- 1}

, ON/OFF ratios above

10^{5}

, and a moderate hysteresis window quantified by gate-sweep loop areas

H \approx 2.1 μ A \cdot V

at

V_{D S} = 50 mV

and

H \approx 17 μ A \cdot V

at

V_{D S} = 500 mV

over

V_{G S} \in [- 10, 30] V

. In a single-node reservoir configuration, direct (non-time-multiplexed) driving of the memtransistor by a Lorenz-63 waveform yields non-trivial one-step prediction with best

NRMSE \approx 0.46

, while time-multiplexed operation in a narrow high-transconductance window, combined with simple preprocessing and lagged states, improves the error to

NRMSE \approx 0.09

–0.10 with

R^{2} \approx 0.99

.

From a growth and device-engineering perspective, these results indicate that once a basic target is reached (mobility of order

10 {cm}^{2} V^{- 1} s^{- 1}

and ON/OFF above

10^{4}

–

10^{5}

), further improvements in traditional FET figures of merit alone are unlikely to translate into proportional gains in reservoir performance. Instead, the shape and time scales of the hysteresis window, set by the trap spectrum in the

{MoS}_{2}

/dielectric stack, become the key control parameters: the most useful operating points are those where

V_{G S}

biases the device on the steep part of the transfer characteristic and where fading memory naturally spans the 1–4 s range probed by the time-multiplexed reservoir. In this sense, the Lorenz-63 task functions as a quantitative probe of trap-mediated dynamics rather than as an ultimate benchmark of prediction accuracy.

Device-to-device variability was quantified at both the DC and task level. Across the fabrication run, the hysteresis area spans nearly three orders of magnitude, indicating substantial dispersion in trap-mediated memory characteristics. For reservoir computing, measured devices, despite the broad hysteresis distribution, showed NRMSE scatters in the range from 0.58 to 0.61. The absolute NRMSE is comparatively high in this multi-device dataset because time multiplexing was not performed under the available measurement time.

With respect to CMOS compatibility, the present devices still operate at voltages well above typical logic levels because of the global back-gate geometry and thick

{SiO}_{2}

dielectric. However, the material set and thermal budget are compatible with back-end-of-line integration, and the same CVD process can be combined with patterned top gates and thin high-k or ferroelectric dielectrics to reduce

V_{G S}

by one to two orders of magnitude while preserving or enhancing the usable hysteresis. A practical path toward CMOS-compatible architectures is to reduce the operating voltages and increase reservoir dimensionality by device and array engineering rather than by complex mixed-signal circuitry. First, replacing the global back gate with a local top gate (thin high-k dielectric and short gate-to-channel spacing) directly increases

C_{ox}

and enables voltage scaling, so that the same input mapping can be implemented with gate swings in the few-volt range. Second, geometrical scaling of the channel (shorter L and optimized

W / L

) can reduce the required

V_{DS}

and improve transconductance at lower bias. Third, array-style space multiplexing provides a scalable route to richer reservoir states under realistic measurement time. Multiple memtransistors fabricated on the same chip can be driven by a common

V_{G S} (t)

waveform and read out as parallel nodes, converting device-to-device variability into state diversity and improving prediction accuracy without time multiplexing.

Together, voltage-scaled top-gated devices, modest geometric scaling, and space-multiplexed arrays define an experimentally accessible roadmap from the present back-gated proof of concept toward CMOS-constrained, wafer-scalable reservoir hardware. The open RC protocol and Lorenz-63 benchmark used here provide a reproducible framework against which such materials and device modifications can be evaluated in terms of both conventional transistor metrics and application-level reservoir computing performance.

Author Contributions

Conceptualization, V.K. and L.J.; methodology, V.K.; software, V.K. and L.J.; validation, V.K. and L.J.; formal analysis, V.K. and L.J.; investigation, V.K.; resources, J.P.; data curation, V.K.; writing—original draft preparation, V.K.; writing—review and editing, J.P.; visualization, V.K.; supervision, J.P.; project administration, J.P.; funding acquisition, J.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Carl Zeiss Foundation through the MemWerk^® project (P2018-01-002). The work of V.K. was supported by the Thüringer Graduiertenförderung (AkZ 0051050-7150000163). L.J. is funded by the Carl Zeiss Foundation through the Nexus program.

Data Availability Statement

Data available in a publicly accessible repository: https://doi.org/10.5281/zenodo.16944410 and https://doi.org/10.5281/zenodo.16944485. Dataset is also available on request from the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Resulting test metrics for the performed measurements. The best result for each set of measurements is highlighted in bold.

Label	TM	N	$Δ t$ (s)	$TH$ (s)	M	$V_{GS}$ Range (V)	$V_{DS}$ (V)	Split	NRMSE	$R^{2}$
A	No	–	0.5	–	–	(0,30)	0.5	60/40	0.5997	0.6404
								70/30	0.6005	0.6393
								80/20	0.9136	0.1653
B	No	–	2.0	–	–	(0,30)	3.0	60/40	0.6130	0.6242
								70/30	0.6370	0.5942
								80/20	0.9781	0.0433
C	No	–	2.0	–	–	(10,40)	3.0	60/40	0.5919	0.6497
								70/30	0.5893	0.6527
								80/20	0.8913	0.2056
D	No	–	0.1	–	–	(10,40)	3.0	60/40	0.4734	0.7759
								70/30	0.4611	0.7874
								80/20	0.6094	0.6287
E	Yes	112	0.10	11.2	36	(31,35)	3.0	60/40	0.8547	0.2694
								70/30	0.4687	0.7804
								80/20	0.1850	0.9817
F	Yes	16	0.10	1.6	256	(31,35)	3.0	60/40	0.0884	0.9922
								70/30	0.0994	0.9901
								80/20	0.1961	0.9616
G	Yes	16	0.25	4.0	256	(31,35)	3.0	60/40	0.1603	0.9743
								70/30	0.1670	0.9721
								80/20	0.2970	0.9118
H	Yes	32	0.10	3.2	128	(31,35)	3.0	60/40	0.1315	0.9827
								70/30	0.1220	0.9851
								80/20	0.1018	0.9896

Appendix B

To provide an experimental estimate of the characteristic time scales associated with slow state relaxation in the

{MoS}_{2}

memtransistor, we analyzed the drain-current decay recorded at constant bias

V_{GS} = 25 V

and

V_{DS} = 0.5 V

. The measured data is shown in Figure A1. The measured signal exhibits a gradual decrease of

I_{D} (t)

over tens of seconds, consistent with bias-stress-induced relaxation that is commonly attributed to charge trapping/detrapping in the gate dielectric, at the

{MoS}_{2} {/SiO}_{2}

interface, and/or via adsorbate-mediated processes. Because the decay is not well captured by a single time constant over the full time window, we fit the relaxation using a bi-exponential model

y (x) = A_{1} exp (- \frac{x}{t_{1}}) + A_{2} exp (- \frac{x}{t_{2}}) + y_{0},

(A1)

where x is time, y is the measured drain current

I_{D}

,

A_{1}

and

A_{2}

are amplitudes,

t_{1}

and

t_{2}

are characteristic time constants, and

y_{0}

is the long-time offset current.

Figure A1. CVD

{MoS}_{2}

memtransistor relaxation trace measured at

V_{GS} = 25 V

and

V_{DS} = 0.5 V

. The circles on the graph represent the measured data, while the red line shows the bi-exponential model fit.

Figure A1. CVD

{MoS}_{2}

memtransistor relaxation trace measured at

V_{GS} = 25 V

and

V_{DS} = 0.5 V

. The circles on the graph represent the measured data, while the red line shows the bi-exponential model fit.

The fitted parameters are listed in the Table A2.

Table A2. Bi-exponential fit parameters for the relaxation trace measured at

V_{GS} = 25 V

and

V_{DS} = 0.5 V

using

y (x) = A_{1} exp (- x / t_{1}) + A_{2} exp (- x / t_{2}) + y_{0}

. Reported uncertainties correspond to the fit standard errors.

Table A2. Bi-exponential fit parameters for the relaxation trace measured at

V_{GS} = 25 V

and

V_{DS} = 0.5 V

using

y (x) = A_{1} exp (- x / t_{1}) + A_{2} exp (- x / t_{2}) + y_{0}

. Reported uncertainties correspond to the fit standard errors.

Parameter	Value	Unit
$y_{0}$	$3.29 \times 10^{- 7} \pm 1.48 \times 10^{- 9}$	A
$A_{1}$	$7.24 \times 10^{- 8} \pm 1.51 \times 10^{- 9}$	A
$t_{1}$	$2.45 \pm 0.10$	s
$A_{2}$	$1.21 \times 10^{- 7} \pm 9.38 \times 10^{- 10}$	A
$t_{2}$	$47.75 \pm 1.38$	s

The presence of two well-separated time constants suggests at least two dominant relaxation channels: a faster component on the order of a few seconds (

t_{1}

) and a slower component on the order of

\sim 50 s

(

t_{2}

). In the context of reservoir computing, these relaxation processes set the time scales over which the device retains a measurable influence of past excitation. We emphasize that this constant-bias decay provides an effective estimate of the intrinsic relaxation dynamics under the selected operating point.

Appendix C

To quantify device-to-device variability beyond the representative device shown in the main text, we summarize key DC metrics extracted from the transfer loops measured at

V_{DS} = 0.5

V for CVD

{MoS}_{2}

memtransistors with

L = 2 μ

m and

W = 10 μ

m. Figure A2 reports the hysteresis area H, maximum transconductance

g_{m, max}

, and maximum field-effect mobility

μ_{FE, max}

across the available device set. In addition, we include the one-step Lorenz-63 prediction error (NRMSE) for the subset of devices for which a full time-series measurement was recorded under identical input mapping and a fixed 70/30 chronological split, using

{log}_{10} (| I_{D} |)

as the reservoir state. This appendix supports the discussion in the main text that, despite substantial dispersion in DC hysteresis, the measured one-step prediction performance exhibits comparatively low device-to-device scatter within the investigated subset.

Figure A2. Device-to-device variability of electrical figures of merit and Lorenz-63 prediction accuracy. (a) Hysteresis area H extracted from the bidirectional transfer loop (

V_{GS}

:

- 10 \to 30 \to - 10

V) at

V_{DS} = 0.5

V for CVD

{MoS}_{2}

memtransistors with

L = 2 μ

m and

W = 10 μ

m. (b) Maximum transconductance

g_{m, max}

obtained from the forward sweep by numerical differentiation. (c) Field-effect mobility

μ_{FE, max}

calculated from

g_{m, max}

using the linear-regime relation. (d) One-step-ahead Lorenz-63 prediction error (NRMSE) for devices measured under identical input mapping and readout protocol (chronological 70/30 split), using

{log}_{10} (| I_{D} |)

as the reservoir state.

Figure A2. Device-to-device variability of electrical figures of merit and Lorenz-63 prediction accuracy. (a) Hysteresis area H extracted from the bidirectional transfer loop (

V_{GS}

:

- 10 \to 30 \to - 10

V) at

V_{DS} = 0.5

V for CVD

{MoS}_{2}

memtransistors with

L = 2 μ

m and

W = 10 μ

m. (b) Maximum transconductance

g_{m, max}

obtained from the forward sweep by numerical differentiation. (c) Field-effect mobility

μ_{FE, max}

calculated from

g_{m, max}

using the linear-regime relation. (d) One-step-ahead Lorenz-63 prediction error (NRMSE) for devices measured under identical input mapping and readout protocol (chronological 70/30 split), using

{log}_{10} (| I_{D} |)

as the reservoir state.

Appendix D

As shown in Figure A3a,b, the device architecture and fabricated layout are illustrated by a schematic cross-section and representative SEM micrographs of the patterned memtransistor structures.

Figure A3. Device architecture and fabrication overview. (a) Schematic illustration of the back-gated monolayer

{MoS}_{2}

memtransistor on

{Si/SiO}_{2}

with source/drain contacts. (b) Representative SEM image of a fabricated device showing the contact geometry and the lithographically defined channel region.

Figure A3. Device architecture and fabrication overview. (a) Schematic illustration of the back-gated monolayer

{MoS}_{2}

memtransistor on

{Si/SiO}_{2}

with source/drain contacts. (b) Representative SEM image of a fabricated device showing the contact geometry and the lithographically defined channel region.

References

Mak, K.F.; Lee, C.; Hone, J.; Shan, J.; Heinz, T.F. Atomically Thin MoS₂: A New Direct-Gap Semiconductor. Phys. Rev. Lett. 2010, 105, 136805. [Google Scholar] [CrossRef]
Splendiani, A.; Sun, L.; Zhang, Y.; Li, T.; Kim, J.; Chim, C.Y.; Galli, G.; Wang, F. Emerging Photoluminescence in Monolayer MoS₂. Nano Lett. 2010, 10, 1271–1275. [Google Scholar] [CrossRef]
Radisavljevic, B.; Radenovic, A.; Brivio, J.; Giacometti, V.; Kis, A. Single-Layer MoS₂ Transistors. Nat. Nanotechnol. 2011, 6, 147–150. [Google Scholar] [CrossRef]
Wang, Q.H.; Kalantar-Zadeh, K.; Kis, A.; Coleman, J.N.; Strano, M.S. Electronics and Optoelectronics of Two-Dimensional Transition Metal Dichalcogenides. Nat. Nanotechnol. 2012, 7, 699–712. [Google Scholar] [CrossRef]
Fu, D.; Zhao, X.; Zhang, Y.; Li, L.; Xu, H.; Jang, A.-R.; Yoon, S.I.; Song, P.; Poh, S.M.; Ren, T.; et al. Molecular Beam Epitaxy of Highly Crystalline Monolayer Molybdenum Disulfide on Hexagonal Boron Nitride. J. Am. Chem. Soc. 2017, 139, 9392–9400. [Google Scholar] [CrossRef]
Zhan, L.; Wan, W.; Zhu, Z.; Shih, Z.-M.; Cai, W. MoS₂ materials synthesized on SiO₂/Si substrates via MBE. J. Phys. Conf. Ser. 2017, 864, 012037. [Google Scholar] [CrossRef]
Kim, D.H.; Park, J.C.; Park, J.; Cho, D.-Y.; Kim, W.-H.; Shong, B.; Ahn, J.-H.; Park, T. Wafer-Scale Growth of a MoS₂ Monolayer via One Cycle of Atomic Layer Deposition: An Adsorbate Control Method. Chem. Mater. 2021, 33, 4099–4105. [Google Scholar] [CrossRef]
Gonzalez, M.A.; Pareek, D.; Büsing, L.; Parisi, J.; Schäfer, S.; Gütay, L. Rapid formation of large-area MoS₂ monolayers by a parameter resilient atomic layer deposition approach. APL Mater. 2021, 9, 051122. [Google Scholar] [CrossRef]
Robertson, J.; Liu, X.; Yue, C.; Escarra, M.; Wei, J. Wafer-scale synthesis of monolayer and few-layer MoS₂ via thermal vapor sulfurization. 2D Mater. 2017, 4, 045007. [Google Scholar] [CrossRef]
Tao, J.; Chai, J.; Lu, X.; Wong, L.-M.; Wong, T.I.; Pan, J.; Xiong, Q.; Chi, D.; Wang, N. Growth of wafer-scale MoS₂ monolayer by magnetron sputtering. Nanoscale 2015, 7, 2497–2503. [Google Scholar] [CrossRef]
Tumino, F.; Casari, C.S.; Passoni, M.; Russo, V.; Bassi, A.L. Pulsed laser deposition of single-layer MoS₂ on Au(111): From nanosized crystals to large-area films. Nanoscale Adv. 2019, 1, 643–655. [Google Scholar] [CrossRef]
Midhun, P.S.; Kumar, K.R.; Jayaraj, M.K. Large area synthesis of mono/few-layer MoS₂ thin films on thermal oxide silicon substrate by pulsed laser deposition technique. Thin Solid Films 2023, 782, 140030. [Google Scholar] [CrossRef]
Zhang, Y.; Xu, L.; Walker, W.R.; Tittle, C.M.; Blackhouse, J.; Pope, M.A. Wafer-scale growth of monolayer MoS₂ film by using chemical vapor deposition. J. Mater. Chem. C 2017, 5, 11275–11287. [Google Scholar] [CrossRef]
Franco, M.; Kiazadeh, A.; Martins, R.; Lanceros-Mendez, S.; Carlos, E. Printed Memristors: An Overview of Ink, Materials, Deposition Techniques, and Applications. Adv. Electron. Mater. 2024, 10, 2400212. [Google Scholar] [CrossRef]
Kurtash, V.; Eliseyev, I.; Davydov, V.; Mathew, S.; Thiele, S.; Reiprich, J.; Jacobs, H.O.; Pezoldt, J. Monolayer MoS₂ CVD-growth on SiC substrates assisted with KCl. Appl. Surf. Sci. 2025, 681, 161373. [Google Scholar] [CrossRef]
Guo, Y.; Vashishta, D.; Singh, A.K.; Rasool, P.K.; Pu, Y.; Shi, Y.; Ostrikov, K.K.; Xu, H. Charge Trapping at the MoS₂–SiO₂ Interface and Its Effects on the Characteristics of MoS₂ Field-Effect Transistors. Appl. Phys. Lett. 2015, 106, 103109. [Google Scholar] [CrossRef]
Sangwan, V.K.; Jariwala, D.; Kim, I.S.; Chen, K.S.; Marks, T.J.; Lauhon, L.J.; Hersam, M.C. Gate-Tunable Memristive Phenomena Mediated by Grain Boundaries in Single-Layer MoS₂. Nat. Nanotechnol. 2015, 10, 403–406. [Google Scholar] [CrossRef]
Kurtash, V.; Jacobs, H.O.; Pezoldt, J. Energy-Efficient Operation Conditions of MoS₂-Based Memristors. Phys. Status Solidi A 2023, 220, 2200893. [Google Scholar] [CrossRef]
Tanaka, G.; Yamane, T.; Hélie, J.P.; Nakane, R.; Tanaka, N.; Takeda, H.; Takano, H.; Ikeda, Y.; Nakano, T.; Hirose, A. Recent Advances in Physical Reservoir Computing: A Review. Neural Netw. 2019, 115, 100–123. [Google Scholar] [CrossRef]
Gauthier, D.J.; Bollt, E.; Griffith, A.; Barbosa, W.A.S. Next Generation Reservoir Computing. Nat. Commun. 2021, 12, 5564. [Google Scholar] [CrossRef]
Jaurigue, L. Chaotic attractor reconstruction using small reservoirs—The influence of topology. Mach. Learn. Sci. Technol. 2024, 5, 035058. [Google Scholar] [CrossRef]
Jaurigue, L.; Lüdge, K. Utilizing rate-independent hysteresis for analog computing. Neuromorphic Comput. Eng. 2025, 5, 044007. [Google Scholar] [CrossRef]
Rehman, M.M.; Samad, Y.A.; Gul, J.Z.; Saqib, M.; Khan, M.; Shaukat, R.A.; Chang, R.; Shi, Y.; Kim, W.Y. 2D materials-memristive devices nexus: From status quo to Impending applications. Prog. Mater. Sci. 2025, 152, 101471. [Google Scholar] [CrossRef]
Moon, J.; Ma, W.; Shin, J.H.; Cai, F.; Du, C.; Lee, S.H.; Lu, W.; Lu, W.D. Temporal data classification and forecasting using a memristor-based reservoir computing system. Nat. Electron. 2019, 2, 480–487. [Google Scholar] [CrossRef]
Xie, J.; Liu, B.; Liu, X. Delay-embedded reservoir computing with single memristor for scale-efficient temporal signal processing. Chaos Solitons Fractals 2026, 205, 117842. [Google Scholar] [CrossRef]
Wang, A.-H.; Fan, X.; Zhang, Z.; Kapur, O.; Huang, R.; Simanjuntak, F.; Chong, H.M.H.; Thomas, D.B. Temporal dynamics of TiO_x memristor for reservoir computing applications. J. Phys. D Appl. Phys. 2026, 59, 015101. [Google Scholar] [CrossRef]
Zhong, Y.; Tang, J.; Li, X.; Gao, B.; Qian, H.; Wu, H. Dynamic memristor-based reservoir computing for high-efficiency temporal signal processing. Nat. Commun. 2021, 12, 408. [Google Scholar] [CrossRef]
Singh, A.; Choi, S.; Wang, G.; Daimari, M.; Lee, B.-G. Analysis and fully memristor-based reservoir computing for temporal data classification. Neural Netw. 2025, 182, 106925. [Google Scholar] [CrossRef]
Shi, T.; Gao, L.; Tang, S.; Liu, J.; Li, Y.; Zhou, R.; Cui, S.; Zhang, H.; Wu, Z.; Zhang, X.; et al. Memristor-based feature learning for pattern classification. Nat. Commun. 2025, 16, 913. [Google Scholar] [CrossRef]
Park, S.; Koo, M.; Kim, S. Advancing phase-change random access memory: Materials innovation neuromorphic applications, and scalability challenges. J. Phys. D Appl. Phys. 2026, 59, 013001. [Google Scholar] [CrossRef]
Syed, G.S.; Le Gallo, M.; Sebastian, A. Phase-Change Memory for In-Memory Computing. Chem. Rev. 2025, 125, 5163–5194. [Google Scholar] [CrossRef]
Cucchi, M.; Gruener, C.; Petrauskas, L.; Steiner, P.; Tseng, H.; Fischer, A.; Penkovsky, B.; Matthus, C.; Birkholz, P.; Kleemann, H.; et al. Reservoir computing with biocompatible organic electrochemical networks for brain-inspired biosignal classification. Sci. Adv. 2021, 7, eabh0693. [Google Scholar] [CrossRef]
Shao, L.; Zhao, Y.; Liu, Y. Organic synaptic transistors: The evolutionary path from memory cells to the application of artificial neural networks. Adv. Funct. Mater. 2021, 31, 2101951. [Google Scholar] [CrossRef]
Zhao, Y.; Duan, W.; Wang, C.; Xiao, S.; Li, Y.; Li, Y.; An, J.; Li, H. LiNbO₃ dynamic memristors for reservoir computing in neuromorphic engineering. Front. Neurosci. 2023, 17, 1177118. [Google Scholar] [CrossRef]
Lee, Y.; Lee, S. Artificial Intelligence and Machine Learning for Memristor Modeling and Neuromorphic Computing. Nanomaterials 2024, 14, 1884. [Google Scholar] [CrossRef]
Wan, X.; Yuan, Q.; Sun, L.; Chen, K.; Khim, D.; Luo, Z. Reservoir Computing Enabled by Polymer Electrolyte-Gated MoS₂ Transistors for Time-Series Processing. Polymers 2025, 17, 1178. [Google Scholar] [CrossRef]
Farronato, M.; Seri, M.; Titova, L.V.; Sangiovanni, D.; Oriani, M.; Bonaccorso, F.; Pecchia, A.; Frabboni, S.; Brivio, S.; Molle, A. Charge-Trap MoS₂ Memories as Reservoir Computers. Adv. Mater. 2023, 35, 2205381. [Google Scholar] [CrossRef]
Lorenz, E.N. Deterministic Nonperiodic Flow. J. Atmos. Sci. 1963, 20, 130–141. [Google Scholar] [CrossRef]
Kurtash, V. Reservoir Computing Using CVD-MoS₂ Memtransistor. Zenodo: 2025; Data Set. Available online: https://zenodo.org/records/16944410 (accessed on 16 December 2025).
Kurtash, V. Reservoir Computing Using CVD-MoS₂ Memtransistor Python Script, Version 1.0; Zenodo: 2025; Software; Available online: https://zenodo.org/records/16944485 (accessed on 16 December 2025).
Takens, F. Detecting strange attractors in turbulence. In Dynamical Systems and Turbulence, Warwick 1980; Rand, D.A., Young, L.-S., Eds.; Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany, 1981; Volume 898, pp. 366–381. [Google Scholar] [CrossRef]
Storm, L.; Gustavsson, K.; Mehlig, B. Constraints on parameter choices for successful time-series prediction with echo-state networks. Mach. Learn. Sci. Technol. 2022, 3, 045021. [Google Scholar] [CrossRef]
Lee, C.; Yan, H.; Brus, L.E.; Heinz, T.F.; Hone, J.; Ryu, S. Anomalous Lattice Vibrations of Single- and Few-Layer MoS₂. ACS Nano 2010, 4, 2695–2700. [Google Scholar] [CrossRef]
Barbosa, W.A.S.; Griffith, A.; Riechers, K.; Bollt, E.M.; Gauthier, D.J. Symmetry-Aware Reservoir Computing. Phys. Rev. E 2021, 104, 045307. [Google Scholar] [CrossRef]
Viehweg, J.; Worthmann, K.; Mäder, P. Parameterizing echo state networks for multi-step time series prediction. Neurocomputing 2023, 522, 214–228. [Google Scholar] [CrossRef]
Jaurigue, L.; Robertson, E.; Wolters, J.; Lüdge, K. Reservoir Computing with Delayed Input for Fast and Easy Optimisation. Entropy 2021, 23, 1560. [Google Scholar] [CrossRef]
Cox, N.; Murray, J.; Hart, J.; Redding, B. Photonic Implementation of Next-Generation Reservoir Computing. Chaos 2024, 34, 073111. [Google Scholar] [CrossRef]
Wang, H.; Liu, Y.; Zhang, L.; Li, W.; Chen, X. Optical Next-Generation Reservoir Computing. Light Sci. Appl. 2025, 14, 12. [Google Scholar] [CrossRef]
Wu, Y.; Duong, N.T.; Chien, Y.C.; Liu, S.; Ang, K.W. A Dynamic Memory for Reservoir Computing Utilizing Ion Migration in CuInP₂S₆. Adv. Electron. Mater. 2024, 10, 2300481. [Google Scholar] [CrossRef]
Duong, N.T.; Chien, Y.C.; Xiang, H.; Li, S.; Zheng, H.; Shi, Y.; Ang, K. Dynamic Ferroelectric Transistor-Based Reservoir Computing for Spatiotemporal Information Processing. Adv. Intell. Syst. 2023, 5, 2300009. [Google Scholar] [CrossRef]

Figure 1. (a)

{MoS}_{2}

CVD chamber schematics. (b) Measured temperature profiles for S and

{MoO}_{3}

precursors.

Figure 1. (a)

{MoS}_{2}

CVD chamber schematics. (b) Measured temperature profiles for S and

{MoO}_{3}

precursors.

Figure 2. Lorenz-63 chaotic system used for reservoir computing. (a) Three-dimensional Lorenz-63 trajectory integrated at the classical parameters

σ = 10

,

ρ = 28

,

β = 8 / 3

. (b) Lorenz-63 chaotic system X value plotted versus time.

Figure 2. Lorenz-63 chaotic system used for reservoir computing. (a) Three-dimensional Lorenz-63 trajectory integrated at the classical parameters

σ = 10

,

ρ = 28

,

β = 8 / 3

. (b) Lorenz-63 chaotic system X value plotted versus time.

Figure 3. Crystal growth and morphology of CVD-grown

{MoS}_{2}

. (a) Optical micrograph of triangular

{MoS}_{2}

domains on

p^{+} {-Si/SiO}_{2}

(90 nm), showing typical lateral sizes of ∼

50 μ m

and homogeneous coverage in the

10 μ m \times 2 μ m

channel area. (b) Tapping-mode AFM height image of a representative monolayer flake. (c) AFM height profile across the

{MoS}_{2} {/SiO}_{2}

edge, showing a step height of 0.88 nm consistent with monolayer

{MoS}_{2}

. (d) Raman spectrum measured with 532 nm excitation and 0.5 mW power, together with single-Gaussian fits to the

E_{2 g}^{1}

and

A_{1 g}

modes. (e) Room-temperature PL spectrum with a three-Gaussian deconvolution into A-trion, A-exciton, and B-exciton contributions.

Figure 3. Crystal growth and morphology of CVD-grown

{MoS}_{2}

. (a) Optical micrograph of triangular

{MoS}_{2}

domains on

p^{+} {-Si/SiO}_{2}

(90 nm), showing typical lateral sizes of ∼

50 μ m

and homogeneous coverage in the

10 μ m \times 2 μ m

channel area. (b) Tapping-mode AFM height image of a representative monolayer flake. (c) AFM height profile across the

{MoS}_{2} {/SiO}_{2}

edge, showing a step height of 0.88 nm consistent with monolayer

{MoS}_{2}

. (d) Raman spectrum measured with 532 nm excitation and 0.5 mW power, together with single-Gaussian fits to the

E_{2 g}^{1}

and

A_{1 g}

modes. (e) Room-temperature PL spectrum with a three-Gaussian deconvolution into A-trion, A-exciton, and B-exciton contributions.

Figure 4. Electrical characteristics of a representative CVD-grown

{MoS}_{2}

memtransistor. (a) Measured transfer characteristics for different drain-to-source voltage

V_{D S}

, demonstrating hysteresis loop area H = 2.1 μA·V for

V_{D S}

= 50 mV and H = 17 μA·V for

V_{D S}

= 500 mV. (b) Dependence of the high-resistance state to the low-resistance state on the applied gate-to-source voltage

V_{G S}

. (c) Measured output characteristics for different gate-to-source voltage

V_{G S}

. (d) Measured output characteristics for different gate-to-source voltage

V_{G S}

, demonstrating hysteresis loop area H = 61 μA·V for

V_{G S}

= 20 V, H = 0.9 mA·V for

V_{G S}

= 30 V, and H = 1.8 mA·V for

V_{G S}

= 40 V.

Figure 4. Electrical characteristics of a representative CVD-grown

{MoS}_{2}

memtransistor. (a) Measured transfer characteristics for different drain-to-source voltage

V_{D S}

, demonstrating hysteresis loop area H = 2.1 μA·V for

V_{D S}

= 50 mV and H = 17 μA·V for

V_{D S}

= 500 mV. (b) Dependence of the high-resistance state to the low-resistance state on the applied gate-to-source voltage

V_{G S}

. (c) Measured output characteristics for different gate-to-source voltage

V_{G S}

. (d) Measured output characteristics for different gate-to-source voltage

V_{G S}

, demonstrating hysteresis loop area H = 61 μA·V for

V_{G S}

= 20 V, H = 0.9 mA·V for

V_{G S}

= 30 V, and H = 1.8 mA·V for

V_{G S}

= 40 V.

Figure 5. Direct (no time-multiplexing mask) Lorenz driving of the

{MoS}_{2}

memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (a–d) show the applied

V_{GS}

(left axis) obtained by affine mapping of Lorenz

X (t)

and the measured drain current

I_{D}

(right axis). Sampling step is

Δ t

= 0.5 s for panels (a,b), 2 s for panel (c), and 0.1 s for panel (d). The device operates in a nonlinear, history-dependent regime due to charge trapping, providing the short-term memory exploited by the readout.

Figure 5. Direct (no time-multiplexing mask) Lorenz driving of the

{MoS}_{2}

memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (a–d) show the applied

V_{GS}

(left axis) obtained by affine mapping of Lorenz

X (t)

and the measured drain current

I_{D}

(right axis). Sampling step is

Δ t

= 0.5 s for panels (a,b), 2 s for panel (c), and 0.1 s for panel (d). The device operates in a nonlinear, history-dependent regime due to charge trapping, providing the short-term memory exploited by the readout.

Figure 6. Time-multiplexed reservoir operation of a CVD monolayer

{MoS}_{2}

memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (a–d) show representative traces of the masked gate-to-source voltage

V_{G S} (t)

(left axis) and the corresponding drain current

I_{D S} (t)

(right axis) for the time-multiplexed regimes E–H listed in Table A1. In each case, the Lorenz-63 input is mapped to the gate window

V_{G S} \in (31, 35) V

and multiplied by a fixed Rademacher mask to generate N sub-steps of duration

Δ t

. The dense traces illustrate the full measurement over many macro-steps, while the insets zoom into short time intervals around selected segments of the waveform, highlighting synaptic-plasticity-like behavior.

Figure 6. Time-multiplexed reservoir operation of a CVD monolayer

{MoS}_{2}

memtransistor. Yellow lines indicate applied gate-to-source volatge, while purple dots indicate measured drain current. Panels (a–d) show representative traces of the masked gate-to-source voltage

V_{G S} (t)

(left axis) and the corresponding drain current

I_{D S} (t)

(right axis) for the time-multiplexed regimes E–H listed in Table A1. In each case, the Lorenz-63 input is mapped to the gate window

V_{G S} \in (31, 35) V

and multiplied by a fixed Rademacher mask to generate N sub-steps of duration

Δ t

. The dense traces illustrate the full measurement over many macro-steps, while the insets zoom into short time intervals around selected segments of the waveform, highlighting synaptic-plasticity-like behavior.

Figure 7. Training and test trajectories for Lorenz-63 prediction across all bias and time-multiplexing regimes. Each panel shows the ground-truth Lorenz-63 X signal (black—expected value) and the memtransistor-based reservoir prediction (colored) for a representative training segment (left part of the trace) and the corresponding held-out test segment (right part), using a 70/30 split. Panels (a–d) correspond to the non-time-multiplexed regimes A–D, while panels (e–h) correspond to the time-multiplexed regimes E–H.

Figure 8. Summary of predictive performance:

R^{2}

versus NRMSE. Each point is one configuration (A–H). Shaded regions indicate qualitative regimes: green is high-fidelity prediction, light-green is usable prediction, yellow is limited usable, and red is underfit or poorly separable.

Figure 8. Summary of predictive performance:

R^{2}

versus NRMSE. Each point is one configuration (A–H). Shaded regions indicate qualitative regimes: green is high-fidelity prediction, light-green is usable prediction, yellow is limited usable, and red is underfit or poorly separable.

Table 1. Summary of Raman and PL parameters for CVD-grown monolayer

{MoS}_{2}

. Raman peak positions and widths are obtained from single-Gaussian fits; PL parameters are extracted from a three-Gaussian fit to the A-trion, A-exciton, and B-exciton contributions.

Table 1. Summary of Raman and PL parameters for CVD-grown monolayer

{MoS}_{2}

. Raman peak positions and widths are obtained from single-Gaussian fits; PL parameters are extracted from a three-Gaussian fit to the A-trion, A-exciton, and B-exciton contributions.

Measurement	Mode/Peak	Position	FWHM	Relative Intensity
Raman	$E_{2 g}^{1}$	385.0 ${cm}^{- 1}$	3.7 ${cm}^{- 1}$	$I / I_{E_{2 GS}^{1}} = 1$
Raman	$A_{1 g}$	402.6 ${cm}^{- 1}$	7.4 ${cm}^{- 1}$	$I / I_{E_{2 GS}^{1}} \approx 1.05$
Raman	$Δ ω = ω_{A_{1}} - ω_{E_{2 GS}^{1}}$	17.5 ${cm}^{- 1}$	–	–
PL	A trion	1.80 eV	0.12 eV	$I / I_{A} \approx 0.81$
PL	A exciton	1.84 eV	0.09 eV	$I / I_{A} = 1$
PL	B exciton	1.96 eV	0.12 eV	$I / I_{A} \approx 0.29$

Table 2. Experimental hardware reservoirs that run the Lorenz system and reported (or directly provided) normalized RMSE (NRMSE) using standard deviation

σ

of the ground truth for normalization; n.r. stands for “not reported”.

Table 2. Experimental hardware reservoirs that run the Lorenz system and reported (or directly provided) normalized RMSE (NRMSE) using standard deviation

σ

of the ground truth for normalization; n.r. stands for “not reported”.

Platform/Material	NRMSE	Type	Lorenz Axes (Input → Target)	$Δ t$ (Time Units)	Ref.
Next-Generation RC (Lorenz-63 short-term forecasting)	0.002	Simulation	$x, y, z \to x, y, z$	n.r.	[20]
Symmetry-aware RC (variable inference)	0.0005	Simulation	$x, y \to z$	0.0001	[44]
Echo state network (multi-step autoregressive prediction)	0.0005	Simulation	$x, y, z \to x, y, z$	0.01	[45]
Reservoir RC with delayed input	0.003	Simulation	$x \to x$	0.02	[46]
Chaotic attractor reconstruction with RC	0.01	Simulation	$x, y, z \to x, y, z$	n.r.	[21]
Fiber-optic photonic NGRC	0.022	Experiment	$x \to y, z$	n.r.	[47]
Optical NGRC (laser and SPL)	0.0169	Experiment	$x, y, z \to x, y, z$	n.r.	[48]
CuInP₂S₆ (2-terminal memristor)	0.014	Experiment	$x, y \to z$	n.r.	[49]
${MoS}_{2}$ memtransistor RC (polymer electrolyte-gated)	0.04	Experiment	$x, y, z \to x, y, z$	0.01	[36]
${MoS}_{2}$ memtransistor RC (back-gated ${SiO}_{2}$ )	0.089	Experiment	$x \to x$	0.05–0.2	This work

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kurtash, V.; Jaurigue, L.; Pezoldt, J. CVD Monolayer MoS₂ Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing. Crystals 2026, 16, 116. https://doi.org/10.3390/cryst16020116

AMA Style

Kurtash V, Jaurigue L, Pezoldt J. CVD Monolayer MoS₂ Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing. Crystals. 2026; 16(2):116. https://doi.org/10.3390/cryst16020116

Chicago/Turabian Style

Kurtash, Vladislav, Lina Jaurigue, and Jörg Pezoldt. 2026. "CVD Monolayer MoS₂ Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing" Crystals 16, no. 2: 116. https://doi.org/10.3390/cryst16020116

APA Style

Kurtash, V., Jaurigue, L., & Pezoldt, J. (2026). CVD Monolayer MoS₂ Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing. Crystals, 16(2), 116. https://doi.org/10.3390/cryst16020116

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

CVD Monolayer MoS₂ Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing

Abstract

1. Introduction

2. Materials and Methods

2.1. CVD Growth of Monolayer ${MoS}_{2}$

2.2. Memtransistor Fabrication

Geometrical Definition and Tolerances

2.3. Structural and Electrical Characterization of ${MoS}_{2}$ Memtransistors

2.4. Fading Memory: Definition and Experimental Extraction

2.5. Reservoir Computing Protocol with ${MoS}_{2}$ Memtransistors

3. Results

3.1. Crystal Growth and Monolayer Quality

3.2. Memtransistor Characteristics and Hysteresis

3.3. Lorenz-63 Prediction Without Time-Multiplexing

3.4. Time-Multiplexed Reservoir Operation

4. Discussion

4.1. Benchmarking Against Existing Hardware Reservoirs

4.2. Device Metrics and Design Window for ${MoS}_{2}$ Memtransistor Reservoirs

4.3. Stability, Reproducibility, and Scope of Structure–Performance Relations

4.4. Limitations and Outlook Toward CMOS-Compatible Neuromorphic Hardware

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

Appendix C

Appendix D

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

CVD Monolayer MoS2 Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing

Abstract

1. Introduction

2. Materials and Methods

2.1. CVD Growth of Monolayer MoS 2

2.2. Memtransistor Fabrication

Geometrical Definition and Tolerances

2.3. Structural and Electrical Characterization of MoS 2 Memtransistors

2.4. Fading Memory: Definition and Experimental Extraction

2.5. Reservoir Computing Protocol with MoS 2 Memtransistors

3. Results

3.1. Crystal Growth and Monolayer Quality

3.2. Memtransistor Characteristics and Hysteresis

3.3. Lorenz-63 Prediction Without Time-Multiplexing

3.4. Time-Multiplexed Reservoir Operation

4. Discussion

4.1. Benchmarking Against Existing Hardware Reservoirs

4.2. Device Metrics and Design Window for MoS 2 Memtransistor Reservoirs

4.3. Stability, Reproducibility, and Scope of Structure–Performance Relations

4.4. Limitations and Outlook Toward CMOS-Compatible Neuromorphic Hardware

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

Appendix C

Appendix D

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

CVD Monolayer MoS₂ Memtransistors for Chaotic Time-Series Prediction via Reservoir Computing

2.1. CVD Growth of Monolayer ${MoS}_{2}$

2.3. Structural and Electrical Characterization of ${MoS}_{2}$ Memtransistors

2.5. Reservoir Computing Protocol with ${MoS}_{2}$ Memtransistors

4.2. Device Metrics and Design Window for ${MoS}_{2}$ Memtransistor Reservoirs