1. Introduction
Video streaming constitutes more than 65% of the total Internet traffic [
1], and video dominates mobile downstream consumption with over 70% share in most regions [
2]. To sustain playback quality under time-varying network conditions, major streaming platforms such as YouTube, Netflix, and Disney+ rely on Dynamic Adaptive Streaming over HTTP (DASH) [
3]. In DASH, each video is encoded into multiple bitrate representations, and the client selects one representation for every video chunk, typically every 5–10 s. This adaptive bitrate (ABR) decision directly affects the user’s Quality of Experience (QoE), primarily through video quality, rebuffering duration, and bitrate-switching frequency.
The deployment of 4G LTE and 5G New Radio (NR) has made ABR control increasingly challenging. Although LTE and 5G NR provide substantially higher average throughput than earlier mobile networks, their throughput dynamics are far from smooth. In urban LTE traces, handovers can induce sharp throughput drops that persist for several seconds [
4]. In 5G NR, beam-management events and beam blockage can cause even more abrupt and nonlinear disruptions, including short near-outage periods [
5]. The collected mobile trace datasets exhibit the same behavior: the LTE dataset [
4] yields 15–60 Mbps throughput, whereas the 5G datasets [
5] spans 7–121 Mbps, and both contain transient disruptions that are severe enough to affect chunk-level bitrate decisions. Thus, modern mobile ABR must simultaneously exploit high available bandwidth during stable periods and avoid over-aggressive decisions during sudden state changes.
Existing ABR schemes address this trade-off from different perspectives but still have limitations in mobile LTE/5G environments. For example, although buffer-based schemes such as BBA [
6] are robust to transient bandwidth drops because they rely primarily on buffer occupancy, this robustness comes at the cost of conservative bitrate selection and underutilized channel capacity. Prediction-based and model-predictive control schemes [
7,
8] achieve higher bandwidth utilization by explicitly forecasting near-future throughput, but their decisions are sensitive to prediction errors. In particular, smoothing- or history-based predictors can lag behind abrupt throughput transitions, causing the controller to overestimate available bandwidth at the moment of handovers or beam-related disruptions. Deep reinforcement learning (DRL) approaches such as Pensieve [
9] learn bitrate policies directly from data, but their neural policies incur non-negligible per-chunk inference overhead on mobile devices and still produce frequent bitrate oscillations under rapidly changing wireless conditions.
To address these issues, an ABR controller must assess not only the predicted throughput value but also the reliability of that prediction. This reliability is particularly important in concurrent LTE and 5G environments, where a short sequence of recent throughput samples may be consistent with either a stable channel or the onset of a handover or beam-blockage event. Many prediction-based ABR controllers therefore apply a safety margin to the predicted throughput before selecting a bitrate so that the chosen bitrate remains below the estimated available bandwidth. However, a fixed safety margin cannot handle these cases simultaneously: a large margin wastes capacity during stable periods, whereas a small margin increases the risk of rebuffering during transient disruptions. Therefore, an ABR framework needs to adapt this safety margin based on the reliability of the current throughput prediction while remaining lightweight enough for per-chunk execution on mobile clients.
This paper presents NeUA, a framework using a neural network with Uncertainty Awareness for mobile video streaming over LTE and 5G NR. NeUA employs a bidirectional LSTM throughput predictor with Monte Carlo (MC) Dropout to quantify epistemic uncertainty in future throughput estimates. This uncertainty is translated into a dynamic safety factor: when the predictor is uncertain, NeUA selects bitrates more conservatively to reduce rebuffering risk; when the prediction is stable, it relaxes the safety margin to improve bandwidth utilization. The uncertainty-aware predictor is coupled with a single-step model predictive control layer for bitrate selection and a counter-based hysteresis mechanism to suppress unnecessary bitrate oscillations.
The main contributions of this paper are as follows:
We formulate per-chunk epistemic uncertainty as a control signal for ABR by estimating prediction uncertainty with MC Dropout and mapping it to a dynamic safety factor for throughput-aware bitrate selection.
We propose NeUA, an ABR framework that integrates an uncertainty-aware BiLSTM throughput predictor with single-step MPC and counter-based hysteresis to improve bandwidth utilization while reducing rebuffering risk and bitrate oscillations.
We conduct trace-driven evaluations using two public mobile LTE/5G datasets and show that NeUA achieves the highest QoE among the evaluated prediction-based ABR schemes, improving QoE by up to 8.0% and reducing bitrate-switching frequency by up to 28.8% relative to the evaluated baselines.
The remainder of this paper is organized as follows.
Section 2 reviews related work on adaptive video streaming.
Section 3 introduces the DASH streaming model and motivates throughput uncertainty-aware ABR in LTE/5G environments.
Section 4 presents the design of the proposed NeUA framework.
Section 5 describes the experimental setup and evaluates NeUA using public LTE/5G traces. Finally,
Section 6 concludes the paper and discusses future research directions.
5. Evaluation
5.1. Experimental Setup
Trace datasets. We evaluate all schemes using two public mobile throughput datasets. The LTE evaluation uses the 4G/LTE dataset from [
4], which contains 40 traces collected in Ghent under static, pedestrian, bus, and car mobility scenarios. The dataset has an overall mean throughput of 31.8 Mbps, with session-level mean throughput ranging from 15.2 to 59.7 Mbps. The 5G evaluation uses the 5G dataset reported in [
5], from which we select 18 commercial Vodafone Ireland 5G traces after applying a minimum-throughput quality filter of 6 Mbps. The selected 5G traces have an overall mean throughput of 44.3 Mbps, with session-level mean throughput ranging from 6.9 to 121.0 Mbps. All traces are parsed at 1-second resolution. Traces shorter than the playback horizon are tiled to support a 1800 s streaming session, following common practice in trace-driven ABR evaluation. Tiling is applied uniformly to all schemes, so any artificial periodicity affects all schemes equally and does not systematically favour NeUA. For the shorter 5G traces that require tiling, the resulting periodicity is acknowledged as a limitation shared with prior ABR work [
7,
9].
Video configuration. Each simulation streams a 1800 s video with chunk length s and maximum playback buffer s. We use a 12-tier bitrate ladder consisting of 0.27, 0.70, 1.20, 2.50, 4.30, 8.90, 15.00, 25.00, 40.00, 60.00, 85.00, and 120.00 Mbps, covering a wide range from low-resolution mobile video to high-bitrate UHD content. This ladder intentionally extends beyond the average throughput of both datasets so that ABR algorithms must avoid over-aggressive selections during transient LTE/5G disruptions while still exploiting high-capacity periods.
Compared schemes. We compare NeUA with six representative ABR schemes: BBA [
6], BOLA-E [
10], RobustMPC [
7], SODA [
8], LQE [
14], and Pensieve [
9]. BBA uses reservoir and cushion parameters of 20 s and 70 s, respectively. BOLA-E follows the Lyapunov-based BOLA formulation with a Holt–Winters efficiency cap. RobustMPC uses harmonic-mean throughput prediction with a 0.9 safety factor. SODA uses squared log-utility switching-cost optimization with
. LQE uses
s and hysteresis threshold
.
NeUA Parameters. NeUA uses history window
,
MC Dropout passes, the uncertainty sensitivity coefficient
, and MPC parameters
,
, and hysteresis threshold
, selected by an exhaustive parameter sweep on the training traces, which are held separate from the test traces used in the final evaluation reported in
Table 3 and
Table 4. The search space covers
,
,
, and
, for a total of 72 combinations evaluated on both LTE and 5G traces using the same video configuration and QoE metric defined in
Section 5.1. The final configuration is the combination that achieves the highest mean QoE averaged over all training-set sessions. A sensitivity analysis of each parameter is reported in
Section 5.2.
Metrics. We evaluate streaming performance using the QoE metric adopted by Pensieve [
9]:
where
is the bitrate selected for chunk
k,
is the minimum bitrate in the ladder,
is the rebuffering duration, and
measures bitrate variation between consecutive chunks. The penalty coefficients, 4.3 for rebuffering duration and 0.3 for quality variation, are fixed constants adopted from [
9] and are applied identically to all evaluated schemes. These evaluation coefficients are independent of the scheme-specific decision parameters used internally by each ABR algorithm.
In addition to QoE, we report the rebuffering rate, defined as the percentage of simulated sessions in which at least one rebuffering event ( for some chunk k) occurred, as a session-level indicator of stall prevalence.
5.2. Parameter Sensitivity Analysis
The four control parameters of NeUA, such as the uncertainty sensitivity coefficient , the rebuffering penalty , the smoothness penalty , and the hysteresis threshold M, were selected by an exhaustive parameter sweep on the training traces. The sweep evaluates , , , and , for a total of 72 combinations. The final configuration (, , , ) was selected as the combination that achieves the highest mean QoE averaged over both LTE and 5G sessions on the training set. The risk of overfitting the sweep to the training traces is mitigated by keeping the training and test trace sets completely disjoint. The final evaluation results are computed solely on held-out test traces that were not used during parameter selection.
The uncertainty sensitivity coefficient controls the steepness of the mapping from MC Dropout standard deviation to the dynamic safety factor . A larger causes the safety factor to drop more sharply as uncertainty increases, resulting in more conservative bitrate selections during high-variance periods such as handovers and beam-blockage events. Values below the selected produce insufficient conservatism and fail to suppress over-aggressive bitrate selections, while values above 8.0 introduce excessive conservatism that reduces average bitrate even under stable channel conditions.
The rebuffering penalty and smoothness penalty control the MPC objective weighting. Increasing causes the MPC layer to prioritize rebuffering avoidance more aggressively, reducing average bitrate at the cost of fewer stall events. Increasing penalizes bitrate oscillation more heavily, resulting in fewer switches but slower adaptation to improving channel conditions. The selected values and achieve the best QoE by balancing these competing objectives across the full evaluation trace set.
The hysteresis threshold M has the most direct effect on bitrate-switch count. Reducing M from 3 to 2 allows faster upward bitrate transitions after disruption recovery, improving average bitrate at the cost of more frequent oscillations. The selected prioritizes bitrate stability, consistent with the paper’s design goal of reducing unnecessary switching while preserving competitive QoE.
5.3. QoE Performance in LTE Environments
Table 3 presents the LTE evaluation, and
Figure 3 shows the per-session bitrate and rebuffering distributions. BBA achieves the highest QoE (1547.0) by conservative bitrate selection (26.68 Mbps median vs. 31.8 Mbps mean channel), effectively exploiting the 60 s buffer to absorb handover events.
Table 3.
Performance comparison: LTE dataset.
Table 3.
Performance comparison: LTE dataset.
| Scheme | Bitrate Med. (Mbps) | Rebuf. Dur. Med. (s) | Changes Med. | Rebuf. Rate (%) | QoE Mean |
|---|
| BBA [6] | 26.68 | 1.0 | 122 | 40.0 | 1547.0 |
| BOLA-E [10] | 4.09 | 1.0 | 28 | 72.5 | 273.3 |
| RobustMPC [7] | 21.32 | 1.0 | 209 | 42.5 | 1397.8 |
| SODA [8] | 20.07 | 1.0 | 215 | 37.5 | 1380.5 |
| LQE [14] | 18.08 | 1.0 | 109 | 40.0 | 1386.0 |
| Pensieve [9] | 20.45 | 1.0 | 222 | 40.0 | 1385.5 |
| NeUA | 22.38 | 1.0 | 94 | 40.0 | 1418.2 |
| NeUA rank (excl. BBA): | QoE: 1st/Switches: 1st (fewest) |
Figure 3.
Whisker plots (min/Q1/median/Q3/max) of per-session average bitrate (a,b) and 5G rebuffering duration (c).
Figure 3.
Whisker plots (min/Q1/median/Q3/max) of per-session average bitrate (a,b) and 5G rebuffering duration (c).
Among throughput-prediction-based schemes, NeUA leads in QoE (1418.2) and achieves the fewest bitrate switches (94), compared with 109 for LQE, 209 for RobustMPC, and 222 for Pensieve. Relative to LQE, NeUA improves QoE by 32.2 (+2.3%), median bitrate by 4.30 Mbps (+23.8%), and reduces switches by 15 (−13.8%) while maintaining the same 40.0% rebuffering rate.
The 40.0% rebuffering rate is uniform across most schemes, indicating that rebuffering is driven by the LTE channel characteristics (handover-induced bandwidth drops) rather than by ABR scheme choice. The NeUA uncertainty-driven conservative estimate prevents premature upward transitions during post-handover recovery, reducing unnecessary switches without increasing rebuffering.
The QoE advantage of NeUA over LQE originates from two complementary mechanisms. The BiLSTM window captures temporal patterns that precede handover events in the LTE traces, such as gradually declining throughput over several consecutive chunks. When these patterns appear, the MC Dropout ensemble produces diverging predictions across the forward passes, raising and driving toward . The MPC layer then selects a bitrate tier that remains viable even under a substantial throughput reduction, avoiding the over-selection that contributes to the higher switch counts observed in RobustMPC and SODA. When the channel recovers after a handover, remains elevated for several chunks as the predictor converges to the new stable throughput level. The counter-based hysteresis () further delays upward tier transitions until three consecutive recommendations agree, suppressing the oscillatory switches observed in LQE and Pensieve.
5.4. QoE Performance in 5G Environments
Table 4 presents the 5G evaluation. All seven schemes experience 100% session rebuffering in the 5G trace, reflecting the severity of beam-management disruptions. In this regime, QoE differentiation is driven primarily by average bitrate during non-disruptive periods and rebuffering duration. NeUA achieves the second-highest QoE (1201.0) and the fewest bitrate switches of all prediction-based schemes (99), compared with 139 for LQE and 259 for Pensieve. Relative to LQE, the proposed scheme improves QoE by 88.8 (+8.0%) while reducing changes by 40 (−28.8%). BBA leads in QoE (1256.5) through conservative selection (17.98 Mbps), while NeUA delivers 25.71 Mbps, representing 43% more bitrate than BBA with only 4.4% lower QoE.
Table 4.
Performance comparison: 5G dataset.
Table 4.
Performance comparison: 5G dataset.
| Scheme | Bitrate Med. (Mbps) | Rebuf. Dur. Med. (s) | Changes Med. | Rebuf. Rate (%) | QoE Mean |
|---|
| BBA [6] | 17.98 | 8.0 | 164 | 100.0 | 1256.5 |
| BOLA-E [10] | 8.99 | 4.0 | 61 | 100.0 | 384.4 |
| RobustMPC [7] | 25.42 | 10.0 | 244 | 100.0 | 1125.9 |
| SODA [8] | 25.93 | 9.5 | 254 | 100.0 | 1051.3 |
| LQE [14] | 24.81 | 11.5 | 139 | 100.0 | 1112.2 |
| Pensieve [9] | 25.08 | 12.0 | 259 | 100.0 | 1064.8 |
| NeUA | 25.71 | 8.0 | 99 | 100.0 | 1201.0 |
| NeUA rank (excl. BBA): | QoE: 1st/Switches: 1st (fewest) |
The QoE advantage of BBA reflects its conservative buffer-based philosophy. BBA deliberately underutilizes bandwidth to maintain a large buffer that absorbs disruptions without rebuffering, thereby avoiding the large rebuffering penalty in the QoE metric. NeUA, by design, pursues bandwidth utilization during stable periods at the cost of occasional rebuffering, yielding 43% higher median bitrate than BBA with 4.4% lower QoE in 5G. Integrating buffer-state feedback as a secondary signal when prediction uncertainty is high represents a promising direction to narrow this gap.
The larger QoE gain in 5G (8.0%) compared with LTE (2.3%) reflects the wider throughput variation in the 5G traces (6.9–121.0 Mbps session range versus 15.2–59.7 Mbps for LTE). During a beam-blockage event, the MC Dropout passes produce a wide spread of predictions, yielding a high that drives to its minimum. The resulting conservative bitrate selection reduces chunk size, which accelerates buffer recovery after the zero-throughput period ends, directly reducing median rebuffering duration from 11.5 s (LQE) to 8.0 s. Once the beam is restored, NeUA delays upward switches until both prediction uncertainty subsides and the hysteresis counter reaches , preventing the burst of transitions that drives LQE and Pensieve to 139 and 259 switches, respectively.
5.5. Bitrate Switching Behavior
Figure 4 shows the distribution of per-session bitrate-change counts across all traces. NeUA achieves median 94 and 99 changes in LTE and 5G, respectively, the lowest of all prediction-based schemes in both environments. In LTE, NeUA reduces change counts from 109 (LQE) to 94 (
), indicating that the conservative estimate of LSTM prevents premature up-switches during handover recovery while maintaining the same overall switch frequency as controlled by the channel-driven rebuffering events. In 5G, the 28.8% reduction in changes relative to LQE is more pronounced because the higher-bandwidth 5G environment provides more opportunities for the BiLSTM regime-change detection to prevent unnecessary transitions.
The asymmetry between the LTE and 5G reduction rates reflects the difference in disruption character between the two environments. In LTE, handovers produce recognizable throughput trajectories that the BiLSTM partially anticipates, so uncertainty rises moderately and the hysteresis mechanism drives most of the switch reduction. In 5G, beam disruptions are more abrupt and produce sharper uncertainty spikes, causing to drop more steeply and remain low for longer post-disruption periods. This stronger and more sustained conservatism is the primary driver of the larger relative reduction in 5G.
5.6. Statistical Validation
To assess whether the QoE differences between NeUA and the prediction-based baselines are statistically significant, we apply a paired two-sided
t-test over per-session QoE values. Because every scheme is evaluated on the same set of traces, the per-session QoE differences form matched pairs; the paired design removes trace-level throughput variation as a confound and yields higher statistical power than an independent-samples test. BBA is excluded from significance testing because it operates under a fundamentally different control philosophy (buffer based rather than prediction based) and consistently dominates on this metric, making the comparison uninformative.
Table 5 reports the mean QoE with standard deviation and the paired
t-test outcome for NeUA against each prediction-based baseline.
In LTE, NeUA achieves a mean QoE of and significantly outperforms all four prediction-based baselines ( in every case). The largest gap occurs against RobustMPC (, ), which accumulates excessive bitrate switches under handover-induced throughput fluctuations. In 5G, NeUA achieves a mean QoE of and significantly outperforms RobustMPC (, ) and LQE (, ). The comparison with SODA (, ) and Pensieve (, ) does not reach statistical significance. This outcome reflects the high variance of the 5G environment: beam-management disruptions produce large per-session QoE swings that reduce test power with only 18 traces. Importantly, NeUA achieves the shortest median rebuffering duration in 5G (8.0 s vs. 9.5–12.0 s for all prediction-based schemes), a difference that directly benefits user experience even when mean QoE differences are not statistically distinguishable.
5.7. QoE–Bitrate Switching Trade-Off
Figure 5 compares the evaluated ABR schemes in terms of mean QoE and median bitrate-switching count per session on the LTE and 5G datasets. The upper-left region is the ideal operating region, corresponding to high QoE with few bitrate switches. In practice, however, ABR schemes often exhibit an empirical QoE–switching trade-off: moving toward higher QoE typically requires more bitrate changes, while reducing switches often lowers QoE by forcing more conservative bitrate choices. Therefore, schemes closer to the upper-left region of this trade-off plane provide a better balance between quality improvement and bitrate stability.
On the LTE dataset, BBA achieves the highest mean QoE and a low switching count, reflecting its conservative buffer-based behavior. Because BBA primarily relies on buffer occupancy rather than throughput prediction, it avoids many aggressive bitrate decisions during LTE handovers. Among prediction-based schemes, NeUA achieves competitive QoE relative to LQE, RobustMPC, SODA, and Pensieve, while producing fewer bitrate switches. This result demonstrates that NeUA provides the best balance among prediction-based schemes by maintaining competitive QoE while improving bitrate stability.
The 5G results show a similar QoE–switching trade-off. BBA achieves the highest mean QoE, but it also incurs a large number of bitrate switches, indicating that its high QoE is obtained at the expense of bitrate stability. RobustMPC yields higher QoE than NeUA but requires substantially more bitrate changes. In contrast, NeUA produces fewer bitrate switches than the other prediction-based schemes while maintaining competitive QoE. Both SODA and Pensieve exhibit more bitrate switches and lower QoE than NeUA. These results suggest that, in 5G traces with high peak throughput and sudden beam-related disruptions, controllers that pursue higher instantaneous quality can easily produce frequent bitrate oscillations, whereas NeUA suppresses unnecessary switching by incorporating prediction uncertainty into bitrate selection.
Overall,
Figure 5 demonstrates that NeUA improves the QoE–switching trade-off of prediction-based ABR. Rather than maximizing QoE by allowing frequent quality changes, NeUA reduces unnecessary bitrate oscillations while preserving competitive QoE. This behavior stems from the uncertainty-aware dynamic safety factor, which mitigates over-aggressive bitrate choices when prediction confidence is low, and from counter-based hysteresis, which filters short-lived bitrate recommendations. Although BBA achieves high absolute QoE in both environments, its bitrate-switching behavior differs across LTE and 5G and does not consistently represent the best QoE–stability balance. In contrast, NeUA achieves the most favorable QoE–stability balance among the evaluated prediction-based schemes in both LTE and 5G environments.
5.8. Inference Overhead and Mobile Deployability
In neural ABR frameworks such as NeUA and Pensieve, the machine learning predictor is the sole computationally intensive component: once the predictor produces a throughput estimate, the remaining MPC optimization involves only lightweight arithmetic. Consequently, the inference latency and memory footprint of the bandwidth predictor account for virtually all of the computational overhead associated with mobile deployment.
Deploying NeUA on mobile clients therefore reduces to assessing whether a lightweight, quantized version of the BiLSTM predictor is practically feasible without sacrificing prediction accuracy. TensorFlow Lite (TFLite) post-training INT8 quantization converts FP32 weights to 8-bit integers without retraining, reducing model size and inference latency by up to 4× while exploiting the higher arithmetic density of 128-bit ASIMD registers (16 INT8 values vs. 4 FP32 values per instruction [
34,
35]).
Table 6 summarizes the estimated inference latency, model size, and prediction accuracy (MAE on the held-out test traces) of the NeUA bandwidth predictor in FP32 and INT8 configurations, derived analytically from the Snapdragon 778G SoC specification (Cortex-A78, 2.4 GHz) [
36]. FP32 latency is estimated from the predictor’s operation count (17.2 MFLOPs for
N = 20 MC Dropout passes) divided by the Cortex-A78 peak FP32 throughput, with a 30% SIMD utilization factor applied to account for the sequential data dependency of LSTM recurrent computation.
The INT8 predictor reduces per-chunk inference latency from 3.6 ms to 0.9 ms, representing less than 0.02% of the 5-second chunk interval and confirming negligible playback pipeline overhead. The 60 KB footprint fits within the L2 cache of mid-range Android SoCs, eliminating memory-bandwidth bottlenecks during inference. Post-training INT8 quantization introduces only
MAE degradation relative to the FP32 baseline on the held-out test traces, confirming that the BiLSTM throughput predictor is robust to 8-bit weight quantization [
34]. By contrast, the Pensieve actor network comprises approximately 265,000 parameters [
9], occupying roughly 1038 KB in FP32—more than 17× the size of the NeUA INT8 variant—which makes direct on-device deployment substantially more challenging on memory-constrained mobile clients. Since inference energy scales proportionally with latency, the 4× latency reduction implies a corresponding reduction in per-inference energy consumption; quantitative on-device profiling is left for future work. For further validation of practical on-device deployment, we are preparing an experimental Android prototype that integrates the INT8-quantized predictor with androidx.media3 library; the repository for this preliminary work-in-progress implementation is hosted at
https://github.com/neua-abr/neua-android (accessed on 22 June 2026).
6. Conclusions
We proposed NeUA, an ABR scheme built around an uncertainty-aware BiLSTM predictor that estimates per-chunk epistemic uncertainty via Monte Carlo Dropout and couples it to a dynamic safety factor, replacing the fixed safety margins of prior prediction-based ABR, together with an MPC rate-selection layer and counter-based hysteresis. Validated on 40 real LTE and 18 5G traces, NeUA is the highest-QoE scheme among all prediction-based baselines in both environments, achieving QoE gains of 2.3% (LTE) and 8.0% (5G) over LQE with 13.8% and 28.8% fewer bitrate switches, respectively.
Among all seven evaluated schemes including BBA, NeUA ranks second in QoE in both environments, achieves the highest QoE and fewest bitrate switches among all prediction-based schemes in both LTE and 5G, and delivers substantially higher median bitrate than BBA in 5G (25.71 Mbps vs. 17.98 Mbps), reflecting the bandwidth utilization advantage of prediction-based control. The INT8-quantized BiLSTM variant occupies approximately 60 KB and completes per-chunk inference in 0.9 ms on a Snapdragon 778G SoC (Cortex-A78, spec-based estimate), making NeUA deployable on mid-range mobile clients with substantially lower inference overhead than DRL-based baselines; quantitative energy characterization is left for future on-device profiling.
An empirical finding of independent interest is that MAE improvement alone is insufficient for ABR QoE gains. A calibrated safety factor that induces controlled conservatism during disruption recovery is essential. This suggests future neural ABR predictors should incorporate asymmetric loss functions that penalize over-prediction more heavily than under-prediction.
Four directions remain open for future work. First, asymmetric Huber loss training will be explored to reduce reliance on the hand-tuned safety factor by penalizing over-prediction more heavily during model training. Second, Transformer-based predictors will be investigated to capture longer-range channel dependencies than the BiLSTM window allows. Third, the QoE gap with BBA will be further analyzed to understand whether it can be closed through improved buffer modeling while preserving the bandwidth utilization advantage of prediction-based control. Fourth, a calibration study will be conducted to assess whether the MC Dropout standard deviation is positively correlated with actual per-chunk prediction error across diverse channel conditions, using rank correlation analysis on held-out traces.