To validate the efficacy of the proposed SCG-optimized neural network for Doppler-resilient link adaptation, a comprehensive simulation-based evaluation was conducted. This section details the experimental setup, performance metrics, dataset characteristics, training process, and comparative analysis against the baseline Auto Rate Fallback (ARF) and SampleRate.
6.1. Simulation Setup
The simulation was implemented as an original MATLAB (2022b) script, consistent with the PHY abstraction methodology recommended by Wu et al. [
29] and Anwar et al. [
30] for link-level vehicular channel modeling. All channel impairment models were implemented directly from the closed-form equations presented in
Section 3,
Section 4 and
Section 5, ensuring complete traceability between the theoretical derivations and the numerical outputs.
The simulation follows a three-level nested loop structure. The outer loop iterates over three Doppler Shift values: DS ∈ {5, 750, 1500} Hz. At the start of each Doppler iteration, the state variables of all stateful methods (ARF and SampleRate) are re-initialized to their default starting conditions to ensure independence between Doppler scenarios. The middle loop iterates over 15 SNR values from 2 dB to 30 dB in steps of 2 dB. The inner loop executes 6 independent simulation runs per (Doppler, SNR) combination, with 16 packets transmitted per run. All three methods—ARF, NN (SCG-trained), and SampleRate—are evaluated under identical channel conditions within each run: the same Doppler scale factor, the same fading realization sequence, and the same packet size. The total number of simulation instances is 3 Doppler × 15 SNR × 6 runs × 3 methods = 810 simulation instances, producing 4320 packet evaluations per method.
Table 2 provides the complete simulation parameter specification. All PHY parameters follow the IEEE 802.11p standard [
25]. The channel model is Rayleigh fading at the packet level, appropriate for urban NLOS conditions where no dominant specular component is present [
18,
34]. Per-packet independent fading is applied, reflecting the condition T > Tc at DS ≥ 100 Hz. The Doppler spectrum follows the Jakes isotropic scattering model. Urban NLOS delay spread profiles are drawn from Paier et al. [
35] and Nilsson et al. [
34]: τrms ∈ {0, 100 ns, 500 ns, 1 µs} covering highway, suburban, and dense urban propagation environments, respectively. MAC-layer retransmissions and contention (CSMA/CA, backoff, RTS/CTS) are abstracted at the PHY layer through the PER model, consistent with standard PHY abstraction practice for system-level vehicular simulations [
29,
30]. Explicit MAC simulation incorporating contention dynamics, hidden node effects, and broadcast storm behavior is identified as a natural extension of this work requiring integration with a vehicular traffic simulator such as Veins/SUMO [
36].
6.2. Real-Time Deployment Feasibility
For the proposed NN-based link adaptation scheme to be viable in practice, MCS decisions must be completed within the channel coherence time Tc at the operating Doppler frequency. If the decision latency exceeds Tc, the channel state will have changed before the selected MCS is applied, rendering the adaptation ineffective. This subsection provides quantitative evidence that the proposed scheme satisfies this real-time requirement under all tested mobility conditions.
The inference latency of the trained network was measured as the wall-clock time to execute a single forward pass (input normalization → hidden layer computation → output mapping) on a standard desktop CPU (Intel Core processor, no GPU acceleration). The measured latency is 0.028 ms per MCS decision. This represents a conservative upper bound on deployment hardware: a dedicated OBU digital signal processor or FPGA implementation would achieve lower latency by an order of magnitude or more.
Table 3 presents the comparison between decision latency and coherence time at each of the three tested Doppler conditions.
At all three Doppler conditions, the decision latency of 0.028 ms is substantially smaller than the coherence time, satisfying the real-time constraint Tdecision ≪ Tc. Even at the most demanding condition (1500 Hz, Tc = 0.282 ms), the NN consumes less than 10% of the coherence window, leaving over 90% for channel estimation, SNR and Doppler measurement, and transmission setup.
The end-to-end decision pipeline, from the receipt of pilot symbols to the application of the selected MCS, includes the inference latency plus the channel estimation overhead. IEEE 802.11p allocates 4 pilot subcarriers per OFDM symbol, enabling standard pilot-based SNR and Doppler estimation within 2 to 3 OFDM symbols (16–24 µs) [
25]. The total end-to-end decision time is therefore approximately 16–24 µs (estimation) + 0.028 ms (inference) ≈ 0.044–0.052 ms, which remains well within Tc = 0.282 ms even at 1500 Hz Doppler.
Regarding memory requirements: the trained network contains 41 parameters (2 × 10 input-to-hidden weights + 10 × 1 hidden-to-output weights + 10 hidden biases + 1 output bias). In float32 representation this requires 164 bytes of storage which is negligible for any modern OBU, which typically provides megabytes of RAM for communication stack operations. This lightweight footprint is consistent with the PHY abstraction design principles advocated by Anwar et al. [
30] and with the resource constraints documented for commercial DSRC OBUs by Hartenstein and Laberteaux [
37].
6.3. Dataset Description and Preprocessing
The model was trained and tested on a specialized dataset derived from extensive prior simulations of the V2V channel [
12]. This dataset was generated using a physical-layer simulator that modeled the IEEE 802.11p standard under varying SNR and Doppler conditions. The optimal MCS for each (SNR, DS) pair was determined by selecting the highest MCS that maintained a Packet Error Rate (PER) below the 5% outage probability target, a standard reliability requirement for safety-critical VANET messages [
36]. The dataset comprises 120 sample points, each containing three key features: Signal-to-Noise Ratio (SNR), Doppler Shift (DS), and the optimal Modulation and Coding Scheme (MCS). The total dataset size is therefore 120 entries.
Although the dataset is compact, it is physics-driven and densely sampled along the dominant SNR–DS axes, which mitigates the risk of overfitting. In addition, it was demonstrated in [
10] that a focused, domain-specific dataset, when coupled with a clear understanding of the underlying physical phenomena (e.g., Doppler effects) can be sufficient for building an effective model. The data were partitioned into three subsets: 70% for training, 15% for validation, and 15% for testing. Feature normalization was applied to accelerate convergence during training.
A critical analysis of the dataset’s underlying relationships was conducted to inform model design and interpretability.
Figure 2 presents a multi-dimensional scatter plot visualizing the interdependence between the three core variables: Modulation and Coding Scheme (MCS), Signal-to-Noise Ratio (SNR), and Doppler Shift (DS). This visualization reveals several non-linear, physically consistent constraints governing effective communication in high-mobility environments:
SNR as a Primary Enabler: A strong positive correlation exists between SNR and achievable MCS. The data confirms that robust modulation schemes (MCS ≥ 4) are predominantly viable only when SNR exceeds 15 dB, with the highest-order scheme (MCS7) requiring near-optimal conditions (SNR ≈ 30 dB). This aligns with fundamental communication theory, where higher data rates demand greater signal fidelity.
Doppler Shift as a Limiting Factor: The plot clearly demonstrates the detrimental impact of mobility-induced Doppler Shift. Even at peak SNR (30 dB), achieving MCS7 is only possible when DS remains below 100 Hz. As DS increases, the maximum achievable MCS decays, illustrating the “Doppler wall” effect. For instance, when DS surpasses 1300 Hz, the system is constrained to a maximum of MCS5, regardless of SNR. This quantifies the critical trade-off between spectral efficiency (high MCS) and mobility resilience (high DS).
Joint SNR-DS Decision Region: The visualization underscores that optimal MCS selection is not a function of SNR or DS in isolation, but of their joint state. The operational envelope forms a complex, bounded region in the SNR-DS feature space, justifying the need for a machine learning model capable of learning this multi-variate, non-linear mapping.
Following feature normalization, the distribution of prediction errors was analyzed via the histogram in
Figure 3. The error profile is highly concentrated, with most residuals for training, validation, and test sets clustering near zero (between −0.08 and 0.53). This indicates the model’s strong overall fit and generalizability. The presence of a limited number of outlier bins seven in training and one in testing corresponds to sparse regions in the original feature space (e.g., very high DS coupled with very high SNR). These samples, underrepresented in the validation/test splits due to random partitioning, contributed disproportionately to the error. This is an expected artifact when working with a compact, physics-grounded dataset where extreme channel states are rare but informative. The concentration of error near zero across all datasets confirms that the model successfully learned the predominant channel dynamics without overfitting to these outliers.
6.4. Training and Validation Performance
The SCG-optimized neural network was trained using the pre-processed dataset. The training dynamics and final performance are illustrated in
Figure 4,
Figure 5,
Figure 6 and
Figure 7.
Figure 4 depicts the training performance curve, measured by Mean Squared Error (MSE). The model rapidly converged from an initial MSE of 1.0 to an optimal minimum of 0.0087288 at epoch 13, demonstrating the efficiency of the SCG optimizer. Beyond this point, the validation error began to increase, signaling the onset of overfitting and confirming that 13 epochs represented the optimal stopping point for training.
The gradient magnitude throughout training, shown in
Figure 5, provides insight into the optimization landscape. Starting from a high initial value (>1), the gradient decreased monotonically as the algorithm navigated the error surface. By epoch 19, it approached near-zero, indicating convergence to a flat region of the loss function, a characteristic of a well-trained model where further parameter updates yield minimal improvement.
The validation check curve (
Figure 6) was used to implement an early stopping mechanism to prevent overfitting. Training was automatically halted at epoch 19 after six consecutive validation checks showed no improvement in error. This ensured the model retained its generalization capability by avoiding excessive training on noise within the training set.
The final performance of the model is summarized in
Figure 7, which compares predicted versus target MCS values on the combined test and validation sets. The data points align closely with the ideal fit line (y = x), with a calculated prediction accuracy of 95.81%. This high level of accuracy is particularly notable given the compact size of the dataset and underscores the model’s ability to learn the complex, non-linear relationship between SNR, DS, and optimal MCS. Minor deviations observed correspond to the previously noted outlier regions in the feature space, which are inherently challenging to predict due to sparse data representation.
This robust training outcome confirms the suitability of the shallow NN architecture and the SCG optimizer for the link adaptation task. The model demonstrates both high accuracy and strong generalization, providing a reliable foundation for performance evaluation in realistic VANET simulations, as detailed in the following section.
6.5. Performance Metrics and Comparative Analysis
The trained model was evaluated against the conventional Auto Rate Fallback (ARF) protocol using the following metrics, calculated per Equations (27)–(30): Successfully Transmitted Packets, Packet Error Rate (PER), Transmission Duration, Transmission Rate (Throughput). The throughput rate for each method is computed as the ratio of total successfully delivered information bits to total transmission time, expressed in megabits per second:
where
Nsucc is the number of successfully received packets,
LPSDU = 600 bytes is the fixed payload length, the factor 8 converts bytes to bits,
∑Tpkt,i is the sum of all individual packet transmission durations in seconds (including both successful and failed packets, since failed transmissions consume channel time), and the factor 10
−6 converts bits per second to megabits per second. This formulation correctly accounts for the efficiency loss due to failed transmissions: a method that transmits at a high MCS but incurs frequent failures will have a large ∑
Tpkt,i and a small N
succ, resulting in a lower effective rate than a method that selects a more reliable MCS at moderate throughput. As a sanity check, at the theoretical limit where all packets succeed and the selected MCS is MCS 7 (64-QAM, rate 3/4, 27 Mbps), the formula yields R = (16 × 1500 × 8)/(16 × 112×10
−6)/10
6 = 192,000/0.001792/10
6 = 27.0 Mbps, exactly matching the IEEE 802.11p MCS 7 theoretical maximum [
25]. All per-SNR-point rates reported in
Table 4,
Table 5 and
Table 6 fall within the 0–27 Mbps physical bounds of the IEEE 802.11p standard.
Goodput is the net information delivery rate, computed as the ratio of successfully received information bits to total transmission time. In fact, Goodput is the fraction of Throughput that represents successfully delivered data.
where the PER is defined as
And the packet duration (per-packet, computed from selected MCS):
where Tsymbol is the OFDM symbol duration which is 8 us in IEEE802.11p
The performance was evaluated across three DS regimes: 5 Hz (static/low mobility), 750 Hz (moderate mobility ~137 km/h), and 1500 Hz (high mobility ~274 km/h).
Figure 8 reveals that the SampleRate algorithm does not provide competitive PER performance across the evaluated scenarios. Contrary to expectations, SampleRate consistently exhibits higher error rates than both ARF and SCG, even under low Doppler conditions. Its performance becomes increasingly unstable at moderate Doppler (750 Hz), where non-monotonic PER behavior is observed, indicating poor adaptation to channel variations. Under high Doppler (1500 Hz), SampleRate again shows inferior performance, maintaining high PER across most SNR values.
In contrast, SCG achieves the fastest PER reduction at low Doppler and maintains strong performance across mid-SNR regions under mobility. ARF, while conservative, demonstrates robustness at high SNR under severe Doppler conditions, occasionally outperforming SCG. These results indicate that SampleRate’s reliance on historical transmission statistics limits its effectiveness in both stable and rapidly varying channels, whereas SCG provides more adaptive and context-aware rate selection.
The transmission duration of the three methods is presented in
Figure 9. At lower mobility when the SNR is than 10 dB, the SampleRate offers the lower transmission duration over its peers. But when the SNR becomes greater than 10 dB, the SCG consistently offer lower transmission duration in comparison to its peers. At moderate and higher mobility (DS of 750 to 1500 Hz), the best performer is SCG followed by SampleRate. Overall, this figure shows that as the mobility increases, the SCG method becomes the most suitable for fast data transmission.
Figure 10 presents the transmission rate (throughput) as a function of SNR under varying Doppler conditions. The results reveal a strong dependency of algorithm performance on channel dynamics. At low Doppler (5 Hz), the ARF scheme achieves the highest throughput followed by the SCG until when the SNR was above 26 dB. At moderate Doppler (750 Hz) and high Doppler (1500 Hz) the SCG offers the higher rate over the entire SNR range and thereby outperforming both SampleRate and ARF in terms of fast transmission under higher mobility.
The decisive advantage of the SCG-based strategy is evident in latency and spectral efficiency metrics.
Figure 9 shows that SCG achieved consistently shorter transmission durations across all mobility scenarios, reducing latency by up to 38.32%. This directly translates to faster delivery of time-sensitive safety messages. Most strikingly,
Figure 10 demonstrates that SCG delivered orders-of-magnitude higher transmission rates (throughput). Specifically at 750 Hz DS (moderate mobility ~137 km/h), and 1500 Hz (~274 km/h) where SCG maintained a 23 to 34% gain over SampleRate and ARF, respectively, while completing transmissions much faster than its peers.
Figure 11 presents the total throughput across all SNR values and Doppler conditions for all three methods. The figure confirms three key findings. First, the NN (SCG) consistently outperforms ARF and SampleRate at moderate-to-high Doppler (750–1500 Hz) across the mid-SNR range (14–26 dB), which represents the most practically relevant operating regime for urban VANET deployments. Second, at near-static conditions (5 Hz), ARF outperformed both SampleRate and SCG, confirming that the Doppler feature provides negligible benefit when ICI is absent. This result is consistent with the ablation study results in
Table 7. Third, the flat plateau behavior exhibited by ARF at 750 Hz and 1500 Hz is a consequence of its inability to distinguish Doppler-induced ICI from thermal noise as compared to the NN curves, which adapt MCS continuously based on the joint channel state. All throughput values fall within the IEEE 802.11p theoretical bounds of 3–27 Mbps, confirming the physical validity of the corrected throughput formula.
Figure 12 presents the total goodput across all tested conditions. Three observations are particularly important. First, ARF achieves the highest goodput at near-static conditions (5 Hz, ~17 Mbps at 30 dB SNR) due to its highly conservative MCS selection keeping PER near zero, but this advantage collapses entirely at 750 Hz and 1500 Hz where Doppler-induced ICI traps it at low MCS indices. Second, the NN (SCG) delivers the highest goodput at moderate-to-high Doppler (750–1500 Hz) across the mid-to-high SNR range, confirming that the Doppler-aware feature representation translates directly into improved information delivery rather than merely higher raw transmission attempts. Third, all methods maintain goodput well above the ETSI ITS-G5 CAM requirement of 1.6–6.4 kbps even at 1500 Hz Doppler (274.6 km/h), confirming that the proposed scheme maintains sufficient information delivery capacity for safety-critical vehicular messaging throughout the tested mobility range. The discrepancy formula as the function of SCG vs. ARF is defined as:
6.6. Performance and Comparative Analysis
The trained model was evaluated against ARF and SampleRate using four metrics: Successfully Transmitted Packets, PER, Transmission Duration, and Transmission Rate (Throughput). The performance was evaluated across three DS regimes: 5 Hz (near-static, 0.9 km/h), 750 Hz (moderate mobility, 137.3 km/h), and 1500 Hz (high mobility, 274.6 km/h).
The NN achieves the highest overall transmission rate of 1.77 Mbps, outperforming both ARF (+34.6%) and SampleRate (+23.8%) across all 4320 packet evaluations. Critically, this advantage increases monotonically with Doppler severity: +16.1% at 5 Hz, +21.7% at 750 Hz, and +35.2% at 1500 Hz. This mobility-dependent performance scaling is behavior that no conventional threshold-based method can reproduce, because ARF and SampleRate react to observed packet outcomes rather than anticipating channel degradation from mobility state.
Note that while SampleRate achieves fewer successful packets than the NN (1365 vs. 1547), its throughput is lower at 750 Hz and 1500 Hz. This reflects SampleRate’s tendency to select conservative, low-rate MCS indices that succeed reliably but deliver fewer bits per transmission interval. The NN’s higher throughput despite lower absolute packet success count at high Doppler reflects its ability to identify and exploit channel windows where higher-rate MCS indices are transiently viable.
The results from
Table 5 show that the proposed scheme (SCG) not only transmits faster than ARF, but also outperforms it, achieving a throughput gain of 16–35% and an overall goodput gain of 10%.
Further observations from
Table 6 show that SCG not only performs better than ARF but also outperforms SampleRate in terms of overall throughput and goodput, achieving respective gains of 23.8% and 42% over the SampleRate scheme. The overall goodput presented in
Table 6 provides a complementary metric that normalizes successfully delivered bits against total transmission time, directly quantifying the net information rate available to upper-layer applications: R_goodput = R_throughput × (1 − PER).
The NN achieves the highest absolute goodput of 0.64 Mbps despite its lower net efficiency ratio relative to ARF. This reflects the NN’s strategy of selecting higher MCS indices to maximize information delivery when channel conditions transiently permit, accepting a moderately elevated PER in exchange for higher bit rates during favorable channel windows. For safety-critical VANET applications, the ETSI ITS-G5 standard [
36] defines CAMs transmitted at 1–10 Hz with typical payload 200–800 bytes (1.6–6.4 kbps average data rate requirement). The NN goodput of 0.64 Mbps exceeds the CAM data rate requirement by a factor of approximately 100–400×, demonstrating that even at 1500 Hz Doppler (274.6 km/h) the proposed scheme maintains more than sufficient information delivery capacity for safety messaging.
6.7. Ablation Study: Contribution of the Doppler Feature
To formally isolate the contribution of the Doppler Shift feature to the performance of the proposed FFNN, an ablation study was conducted by retraining an identical network architecture using only the SNR as input (SNR-only model) and comparing its performance against the full SNR + Doppler model across all three Doppler conditions. Both models used the same training dataset, the same SCG optimizer configuration, the same train/validation split, and the same MCS mapping rule. The only difference was the removal of the Doppler Shift input from the feature vector.
The results are presented in
Table 7. At DS = 5 Hz (near-static channel), the two models perform equivalently, which is consistent with theoretical expectations: at 5 Hz Doppler, the normalized Doppler frequency ρ =
× Tsym = 5 × 6.4 × 10
−5 = 3.2 × 10
−4, producing ICI power of approximately (π
2/6)ρ
2 ≈ 5 × 10
−7 relative to signal power, a negligible impairment that the SNR input alone is sufficient to characterize. As Doppler increases to 750 Hz and 1500 Hz, the SNR-only model increasingly overestimates channel reliability because it lacks awareness of the growing ICI component. It selects MCS indices calibrated for the nominal SNR level but physically appropriate only for low-mobility conditions, resulting in elevated PER and reduced goodput. The joint SNR + Doppler model, by contrast, correctly recognizes the high-mobility regime and selects conservative MCS index that maintain throughput while controlling error rate. The Doppler feature provides a +67% throughput gain at 750 Hz and a +78% throughput gain at 1500 Hz relative to the SNR-only baseline. These results confirm that the performance advantage of the proposed method is driven primarily by the Doppler-aware feature representation, not by the neural network architecture itself. An SNR-only neural network with identical structure and training provides no meaningful advantage over the SNR-only Sample Rate baseline at high Doppler, whereas the joint model maintains clear superiority throughout. This finding validates the core design decision of the proposed approach and establishes the Doppler Shift as an essential input feature for ML-based link adaptation in high-mobility VANET environments [
21,
38].