We simulated resonance shifts due to the incremental concentration of a CD33-like biomolecular layer deposited on the graphene and MoS2 surfaces to evaluate AML biomarker detection performance, and the materials showed monotonic red-shifts, which is in line with their sensitivity to the dielectric loading caused by the AML biomarkers. The MoS2 coatings demonstrated a larger Δf because they have higher intrinsic conductivity and stronger plasmonic confinement. A Bayesian neural network (BNN) was trained based on the simulated dataset to comprehend the biomarker concentration from the spectral response. The BNN exhibited high predictive performance, and the uncertainty intervals increased at low concentrations (<10 ng/mL), which is in agreement with weaker resonance perturbations.
Figure 2 exhibits a distinctly resonant red-shift in the Healthy (n = 1.35) vs. AML (n ≈ 1.381) situations for both graphene- and MoS
2-coated DSSRR biosensors. The shift agrees with dielectric loading theory: the increase in the analyte refractive index results in pushing the resonance to lower frequencies of transmission. MoS
2 has a larger Δf due to its stronger light–matter interaction and more extended field penetration, whereas graphene yields a narrower and sharper resonance; thus, confirming its higher Q-factor and spectral confinement. The negative S21 values are in line with the expectations as the plot shows transmission magnitude in dB, not frequency shift; hence, the resonance dip is negative, while the extracted Δf is positive.
Figure 3 shows the comparative absorption spectra of graphene- and MoS
2-coated sensors that were exposed to the same conditions. At resonance, MoS
2 demonstrated a more significant absorption, which indicates a more considerable plasmonic damping, and a more enhanced field–analyte interaction, which is the immediate cause of the higher sensitivity (GHz/RIU) that it possesses.
4.2. Comparison with Baseline Models
To demonstrate the value of Bayesian Deep Learning, we compared our BNN against three baseline models trained on the same dataset: (1) a deterministic neural network (DNN) with identical architecture but no dropout, (2) a Gaussian Process (GP) regression model with radial basis function kernel, and (3) an ensemble of 10 deterministic networks (Deep Ensemble). As shown in
Table 5, the BNN outperformed all baselines in predictive uncertainty calibration (measured via Negative Log Likelihood) while maintaining comparable RMSE. Only the BNN provided an explicit decomposition of epistemic and aleatoric uncertainty, which is essential for robust material selection.
In
Figure 9, the robustness of the Bayesian neural network (BNN) predictions is tested under the condition of real-world manufacturing inconsistencies, where the process tolerances are modeled as ±5% perturbations in critical geometric parameters. The results show the difference in the stability of the sensitivity and the quality factor between graphene and MoS
2 coatings. In the case of sensitivity for biosensors with a graphene coating, the value changed from 2500 to 2580 GHz/RIU, whereas for MoS
2 it changed in a wider range from 3420 to 3548 GHz/RIU, which implies that the sensitivity of MoS
2 is more influenced by geometrical fluctuations. In a similar manner, the quality factor for graphene was almost constant (46.8–48.5), while MoS
2 had a wider range of (30.2–32.1); hence, the figure reinforces graphene’s superior structural robustness. The loss of sensitivity is studied in
Figure 9b, which also shows the resilience: at 5% tolerance, graphene displayed a 10.1% degradation, while MoS
2 degraded by 15.3% under the same perturbations, with a 23% relative increase in performance volatility. These findings highlight a key trade-off: although the use of MoS
2 allows for greater initial sensitivity, it is also less tolerant to process-induced deviations, which may lead to the reproducibility of mass-manufactured sensors being impacted. A Monte Carlo simulation with 100 stochastic realizations was used to make more inferences about the statistical stability. The outcomes presented in
Figure 9c indicate that graphene predictions are more tightly clustered around the nominal sensitivity, while MoS
2 exhibits higher variance across runs. This suggests that epistemic uncertainty was amplified under MoS
2 configurations, indicating that the model may be dealing with complex or less-known areas of the parameter space.
Figure 10 illustrates the measured sensitivity values with those that were predicted by the Bayesian model for graphene as well as for MoS
2 biosensors. Additionally, residuals and predictive uncertainties (σ) are presented. The Bayesian model prediction demonstrates a high level of consistency with the ground truth, keeping residuals around ±25 GHz/RIU for most of the samples. Graphene prediction is characterized by lower residual spread and smaller uncertainty bounds (σ ≈ 42 GHz/RIU), which reflects the confidence of the model for the deterministic response of this material. By contrast, MoS
2 predictions are characterized by higher residual variance and broader uncertainty (σ up to 130 GHz/RIU), thereby signifying that the source of the uncertainty is the lack of knowledge of the material interactions as a result of more complicated material interactions. Through this basis of residuals, the BNN implements a test of being able to provide accurate and well-calibrated predictions, as the uncertainty estimates can be trusted in the sense that they correspond well with the prediction errors that have been observed.
Figure 11 shows the expected resonance frequency of refractive index (n = 1.35–1.39) for both graphene- and MoS
2-coated biosensors, along with the uncertainty calculated from Monte Carlo dropout. Both materials show a near-linear redshift in resonance with increasing refractive index, which aligns well with dielectric loading theory. Graphene is characterized by the tightest uncertainty bounds (±0.008–0.012 THz), while the uncertainty for MoS
2 is more than twice as large (±0.015–0.022 THz), pointing towards the fact that MoS
2 is more sensitive to the dielectric perturbations and the model has lower confidence for this material due to more complex electromagnetic interactions. The above are confirmed by the trends that the BNN presented. The uncertainty estimates also provide actionable insight for model trust in clinical deployment, while
Figure 12 illustrates the model-predicted sensitivity for low, medium, and high biomarker concentrations, along with the derived nonlinearity factors. At high concentrations, the graphene sensitivity increases slightly from 2500 to 2650 GHz/RIU, with a nonlinearity factor of 1.32. On the other hand, MoS
2 exhibits a more rapid increase, going from 3400 to 3680 GHz/RIU, and a steeper nonlinearity factor, which indicates a stronger optical response saturation. This material-dependent nonlinear behavior is essential for precision biosensing as it outlines the possibility of overestimation of biomarker levels if linear approximations are assumed. Thus, these findings imply that although MoS
2 can achieve higher raw sensitivity, but graphene has higher resonance frequency.
Figure 13 delves into the connection between model-predicted sensitivity and the uncertainty that comes along with it for different sensor setups. From the picture, a definite pattern can be seen: higher sensitivity predictions, in particular for MoS
2, are coupled with a higher predictive variance that is a sign of extrapolation into the less represented regions of the training set. On the contrary, graphene keeps lower uncertainty even at moderate sensitivity levels, which is a strong point of this material and of the BNN’s calibration for it. With this analysis, the users are able to measure the reliability of high-sensitivity predictions so as to ensure that the decisions about the sensors are made not only according to the absolute performance but also to the confidence intervals.
Figure 14 depicts the changes in the utility function for graphene and MoS
2 biosensors, depending on the weighting scheme applied to the parameters’ sensitivity, Q-factor, and predictive uncertainty. The change implies the essential compromise between the high spectral precision and the raw detection sensitivity. On the one hand, it is observed that MoS
2 prevails when the emphasis is laid on sensitivity, but on the other hand, it is lowered in uncertainty-aware or resolution-focused regimes. This utility-based optimization endows risk-informed material selection, where performance goals and uncertainty constraints were co-optimized, allowing for a versatile framework for biosensor design in various clinical and engineering situations. This study has several limitations: (1) the dataset, though augmented, is limited to 500 simulation samples; (2) the model assumes ideal periodicity and does not include all fabrication-level defects. Future work will expand the dataset with experimental measurements, incorporate more detailed defect models, and validate the framework with fabricated sensors.