Efficient Deep Learning-Based M-PSK Detection for OFDM V2V Systems Using MobileNetV3

Tonix-Gleason, Luis E.; Del-Puerto-Flores, José A.; Peña-Campos, Fernando; del Puerto-Flores, Dunstano; López-Pimentel, Juan-Carlos; Del-Valle-Soto, Carolina; Vela-Garcia, Luis René

doi:10.3390/a19030210

Open AccessArticle

Efficient Deep Learning-Based M-PSK Detection for OFDM V2V Systems Using MobileNetV3

by

Luis E. Tonix-Gleason

^1,†

,

José A. Del-Puerto-Flores

^1,*

,

Fernando Peña-Campos

^2,†

,

Dunstano del Puerto-Flores

³

,

Juan-Carlos López-Pimentel

¹

,

Carolina Del-Valle-Soto

¹

and

Luis René Vela-Garcia

²

¹

Facultad de Ingeniería, Universidad Panamericana, Álvaro del Portillo 49, Zapopan 45010, Mexico

²

Department of Electrical Engineering, Communications Section, Cinvestav, Guadalajara 45019, Mexico

³

Department of Mechanical-Electrical Engineering, CUCEI, University of Guadalajara, Guadalajara 44430, Mexico

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Algorithms 2026, 19(3), 210; https://doi.org/10.3390/a19030210

Submission received: 30 January 2026 / Revised: 7 March 2026 / Accepted: 9 March 2026 / Published: 11 March 2026

(This article belongs to the Special Issue Algorithmic Innovations: Bridging Theoretical Foundations and Practical Applications)

Download

Browse Figures

Versions Notes

Abstract

This paper investigates M-PSK symbol detection in Orthogonal Frequency Division Multiplexing (OFDM) systems for wideband Vehicle-to-Vehicle (V2V) communications using lightweight convolutional neural networks. In doubly dispersive channels, Inter-Carrier Interference (ICI) degrades subcarrier orthogonality, rendering conventional equalization ineffective. Current ICI mitigation techniques face a trade-off between Bit-Error Rate (BER) performance and computational complexity, limiting their applicability in dynamic vehicular scenarios. To address this issue, a low-complexity MobileNetV3-based receiver is proposed, incorporating a signal-model-driven preprocessing stage that compensates for Doppler-induced phase distortions responsible for ICI. Simulation results show that the proposed receiver improves BER performance compared to conventional equalizers and recent neural-based schemes in the low-SNR regime (below 15 dB) while maintaining computational complexity close to linear least-squares detection.

Keywords:

deep learning; DNN; ICI; neural network; OFDM; V2V

1. Introduction

In contemporary Intelligent Transportation Systems (ITS), the exchange of real-time information is fundamental. In this context, Vehicle-to-Infrastructure (V2I) and Vehicle-to-Vehicle (V2V) communication schemes must be efficiently integrated. However, these links typically operate over Doubly Selective Channels (DSCs), which arise from multipath propagation, environmental scatterers, and the high mobility of vehicles, thereby posing significant challenges to the design and performance of communication systems [1,2,3,4].

In V2V environments, vehicular motion induces significant Doppler spreads that lead to the loss of orthogonality among Orthogonal Frequency Division Multiplexing (OFDM) subcarriers, making Inter-Carrier Interference (ICI) particularly pronounced [5,6,7,8]. As a result, the induced ICI degrades the effectiveness of OFDM-based V2V systems and hinders fundamental receiver tasks, such as channel estimation, data detection, and error correction, which are required to mitigate their effects. These tasks typically entail high computational complexity, exceeding

O (N^{3})

, thereby complicating their real-time implementation and operation. These limitations have motivated considerable research interest in developing approaches that reduce computational complexity while improving performance compared to conventional equalization and data detection schemes [9,10,11].

Linear detectors, including Least Squares (LS) and Linear Minimum Mean Square Error (LMMSE), have been extensively investigated [12,13]. Although these schemes are attractive due to their simplicity and suitability for hardware implementation, their Bit-Error Rate (BER) performance is often limited compared to nonlinear methods. This performance degradation is mainly attributed to the fact that linear detectors perform symbol detection without exploiting the discrete structure of the signal constellation [4,14], which leads to suboptimal decisions, particularly in V2V scenarios characterized by low signal-to-noise ratios. Despite these limitations, their low computational complexity and reduced processing requirements continue to make them appealing candidates for practical systems, thereby highlighting the need for improved schemes that preserve these advantages while enhancing performance.

Nonlinear detectors exploit the finite symbol alphabet to improve performance. Common examples include Ordered Successive Interference Cancellation (OSIC) [15] and Decision Feedback Equalization (DFE) [16,17], and Maximum Likelihood (ML) detection [14], which typically achieve lower BER than LMMSE; however, both linear and nonlinear methods experience significant performance degradation in vehicular channels due to ICI. Although nonlinear detectors provide strong performance, this is often achieved at the expense of high computational complexity, typically exceeding

O (N^{3})

, which hinders real-time implementation. To alleviate this complexity, suboptimal ML variants restrict the search to a lower-dimensional latent space or perform candidate sequence pruning.

Neural Networks (NN) constitute a natural bridge between linear and nonlinear approaches by offering a favorable trade-off between performance and computational complexity [18,19,20]. On the one hand, nonlinear activations enable the modeling of complex decision boundaries, leading to significant performance gains; on the other hand, the underlying operations largely reduce to matrix-vector products, which are highly amenable to hardware acceleration and efficient real-time execution. Within this framework, NN are built from two fundamental blocks: fully connected layers and convolutional layers. Convolutional layers are especially attractive because they reuse the same weights and focus on local regions, allowing for faster inference with a lower computational cost. In the context of equalizer design, this efficiency translates into MobileNetV3-based architectures that achieve low latency, good scalability, and strong performance, outperforming linear NN. Moreover, MobileNetV3 incorporates lightweight activation functions such as hard-swish and hard-sigmoid, along with depthwise-separable convolutions, which drastically reduce the number of parameters and operations compared to standard convolutions [21], further reinforcing its suitability for real-time vehicular applications.

Neural networks have recently been integrated into OFDM receiver stages in an effort to increase detection accuracy while maintaining practical deployment viability. Most NN-based methods for rapidly time varying multipath channels focus primarily on channel estimation [22,23,24]. While others like ComNet explicitly combine estimation and equalization [25], none of these cutting-edge techniques takes advantage of the channel matrix to perform a dedicated preprocessing step prior to the network input.

Contribution and Article Structure

This work addresses the problem of low-complexity symbol detection in OFDM-based V2V communication systems operating over doubly dispersive channels, where inter-carrier interference severely degrades performance. The main contributions of this paper are summarized as follows:

MobileNetV3-based neural equalization for V2V links: A lightweight convolutional neural network based on MobileNetV3 is proposed for M-PSK detection in OFDM systems affected by ICI, adapting an architecture originally designed for mobile devices to the vehicular communications context.
Statistics-informed ICI-aware preprocessing: A channel-prediction-based preprocessing stage is introduced to mitigate ICI prior to neural equalization, relieving the network from directly suppressing interference and allowing it to focus on detection.
Signal-model-guided phase distortion compensation: The signal model proposed during the preprocessing stage compensates for Doppler-induced phase distortions, thereby facilitating MobileNetV3 to perform detection by efficiently exploiting phase–data correlations.
Favorable performance–complexity trade-off: The receiver proposed achieves a lower bit-error rate than conventional linear, nonlinear, and recent neural-based approaches, while maintaining computational complexity close to linear least-squares detection.

Overall, the proposed framework provides an efficient and scalable detection solution for ICI-impaired V2V OFDM systems, bridging the gap between the high performance of nonlinear methods and the low complexity of conventional linear receivers.

The remainder of this paper is organized as follows. Section 2 presents the system model of the proposed OFDM framework. Section 3 describes the proposed receiver architecture based on MobileNetV3 for OFDM signal detection. Section 4 analyzes the computational complexity and presents the performance results of the proposed receiver. Additionally, Section 5 provides a discussion of the experimental results. Finally, the conclusions of this work are drawn in Section 6.

2. Model System

The model includes the OFDM modulation of the 802.11p standard. In this context, the signal

y [n]

at the receiver side in complex baseband, following synchronization and cyclic prefix (CP) removal, is denoted by:

y [n] = \sum_{l = 0}^{L - 1} h [n, l] x [{〈 n - l 〉}_{N}] + w [n],

(1)

using

{〈 \cdot 〉}_{N}

to denote N-modulus indexing, the transmitted signal at the n-th sample is denoted as

x [n]

. The channel’s impulse response at time n for the l-th preceding sample is represented as

h [n, l]

, where L denotes the number of channel taps; whereas

w [n]

signifies the complex Additive White Gaussian Noise (AWGN) characterized by zero mean and variance

σ_{w}^{2} = N_{0} / 2

, where

N_{0}

denotes the noise power spectral density. The circular convolution, as delineated in (1), between the impulse response

h [n, l]

and

x [n]

, can be reformulated in a matrix-vector representation as follows:

y = H x + w,

(2)

where

H

denotes a

N \times N

matrix, indexed by

0, 1, \dots, N - 1

in each dimension, with its elements formulated from the Channel Impulse Response (CIR) coefficients as follows:

{[H]}_{n, n^{'}} = h [n, {〈 n - n^{'} 〉}_{N}],

(3)

for

n, n^{'} \in {0, 1, \dots, N - 1}

. The CIR is taken to be zero whenever

{〈 n - n^{'} 〉}_{N} > L - 1

. After CP removal, the received OFDM symbol in the frequency domain is obtained by applying the normalized Discrete Fourier Transform (DFT):

\begin{matrix} u & = G s + z, \end{matrix}

(4)

where

u

,

s

, and

z

are the DFTs of

y

,

x

, and

w

, respectively. The channel frequency matrix is

G = {FHF}^{H}

, with

F

denoting the normalized DFT matrix. The data vector

s = {[s [0], s [1], \dots, s [N_{D} - 1]]}^{T}

contains

N_{D}

M-PSK symbols, which can be written in terms of magnitude A and phases

θ [n]

as:

s = | α | {[e^{j θ [0]}, e^{j θ [1]}, \dots, e^{j θ [N_{D} - 1]}]}^{T},

(5)

where

e^{j θ [n]}

denotes the n-th transmitted complex value in the discrete frequency domain. In a typical OFDM receiver with ICI cancellation, the estimate of

G

is used to perform data detection via LMMSE, e.g., in the form (where

σ

is the noise-regularization term equivalent to

σ = 1 / SNR

for unit-power symbols):

\hat{s} = {(G G^{H} + σ I)}^{- 1} G^{H} u .

(6)

This structure is taken as a reference for the proposed NN-based detector to compute the MobileNetV3 input vector:

u_{d} = G^{H} u = G^{H} G s + G^{H} z,

(7)

where

{(\cdot)}^{H}

denotes the Hermitian transpose. Here,

u_{d}

denotes the matched-filter preprocessed received vector, obtained as

u_{d} = G^{H} u

and used as the NN input. The operation

G^{H} u

corresponds to a Matched-Filter (MF) preprocessing step with respect to the effective frequency-domain channel

G

, since it correlates the received vector with the conjugate channel response. The matched filter does not cancel interference and therefore does not suppress ICI completely.

In doubly selective channels,

G

is generally non-diagonal, so

G^{H} G

contains off-diagonal terms that represent residual ICI. Nevertheless, applying

G^{H}

partially compensates channel-induced phase distortions and tends to tighten the received constellation, as illustrated in Figure 1. Overall, this step reduces the learning burden on the neural network and enables a more streamlined architecture with fewer layers, while still leaving residual ICI that can limit performance at high SNR.

3. Deep Learning MobileNetV3

CNNs effectively capture spatial correlations; however, their computational cost increases significantly with the number of feature extraction channels. MobileNetV3 addresses this limitation through the use of depthwise separable convolutions, enabling a more computationally efficient architecture.

In this work, MobileNetV3 is adapted for M-PSK detection in OFDM systems operating over doubly dispersive channels. Since standard CNN implementations operate in the real domain, the complex channel matrix

G \in C^{48 \times 48}

is first transformed into its polar representation. The magnitude and phase components are then arranged as two input channels forming the tensor

χ \in R^{48 \times 48 \times 2}

. Specifically,

χ_{0} = | G |

(8)

χ_{1} = ∠ G

(9)

The input tensor is constructed by stacking these components along the channel dimension; i.e.,

χ = stack (χ_{0}, χ_{1}) .

(10)

The complete end-to-end processing chain—including preprocessing, network inference, and postprocessing—is illustrated in Figure 2 and described in detail in Algorithms 1 and 2. The network output is defined as

\hat{ρ} \in R^{1 \times 48}

and is projected onto the complex unit circle to generate the phasor

e^{j \hat{ρ}}

. This phasor acts as a learned phase correction term and is applied elementwise, via the Hadamard operator ⊙, to the Matched-Filter (MF) preprocessing output

u_{d}

defined in (7). The angular component of the resulting compensated signal provides the final phase estimate, which is expressed as:

\hat{θ} = arg (u_{d} ⊙ e^{j \hat{ρ}})

(11)

The training objective does not aim to directly regress

\hat{ρ}

, as this variable serves only as an intermediate latent representation produced by the network. Instead, the optimization is formulated over the final phase estimate defined in (11), which directly impacts the symbol decision process. In this way, the learning procedure is explicitly aligned with the system performance metric.

Specifically, the goal is to minimize the discrepancy between the estimated phase

\hat{θ}

and the true phase

θ = ∠ (s)

, where

θ

denotes the ground-truth phase of the transmitted symbol vector. To achieve this, it is necessary to define a loss function that properly quantifies this discrepancy in the angular domain.

A straightforward approach would be to employ a direct Mean-Squared Error (MSE) between angles,

e = MSE (\hat{θ}, θ),

(12)

however, this metric is not appropriate due to the inherent

2 π

periodicity of phase. When

\hat{θ}

and

θ

lie on opposite sides of the wrapping boundary (e.g., near

- π

and

+ π

), direct subtraction may yield an artificially large error, even though the phases are physically close. To properly account for the circular nature of phase, one could define the wrapped angular difference as

Δ θ = wrap (\hat{θ} - θ),

(13)

and compute the mean-squared wrapped phase error over a batch of size N as

e_{wrap} = \frac{1}{N} \sum_{n = 1}^{N} {(Δ θ_{n})}^{2} .

(14)

However, in the proposed approach, an equivalent reformulation in the complex domain is adopted. Since

e^{j θ}

represents a unit phasor on the complex circle, the phase discrepancy can be measured as the Euclidean distance between the corresponding unit phasors. Under this formulation, the loss function is defined as:

L_{phasor} = \frac{1}{N} \sum_{n = 1}^{N} {∥e^{j {\hat{θ}}_{n}} - e^{j θ_{n}}∥}_{2}^{2}

(15)

expanding the squared norm in the complex plane yields

{∥e^{j \hat{θ}} - e^{j θ}∥}_{2}^{2} = 2 (1 - cos (\hat{θ} - θ)),

(16)

so that the loss can be equivalently expressed as:

L_{phasor} = \frac{2}{N} \sum_{n = 1}^{N} (1 - cos ({\hat{θ}}_{n} - θ_{n})) .

(17)

This formulation is inherently

2 π

-periodic and directly penalizes the true circular distance between phases, avoiding ambiguities associated with angular discontinuities.

Given the phase aware loss defined above, we can now describe how the proposed receiver is trained. Algorithm 1 details the end-to-end procedure used to generate training frames, apply the Matched-Filter (MF) preprocessing, form the real-valued channel representation

χ

, and forward it through MobileNetV3 to obtain the phase-correction vector

\hat{ρ}

. The correction is mapped to a unit phasor

e^{j \hat{ρ}}

and applied elementwise to the MF output, yielding the phase estimate

\hat{θ}

in (11). The wrapped phase discrepancy between

\hat{θ}

and the ground truth

θ

is then evaluated with the proposed loss and used to update the network weights via backpropagation. Algorithm 2 follows the same pipeline at test time but omits the loss computation and weight updates.

Algorithm 1 Training of the proposed MobileNetV3 receiver
Input: Number of frames $N_{f}$ ; subcarriers $M = 48$ ; channel matrix $G \in C^{M \times M}$ ; constellation $A$ ; SNR.
Output: Trained weights $W$ .
1:	for $k = 1$ to $N_{f}$ do
2:	Draw symbols $s \in A^{M}$ .
3:	Generate noise $z \sim CN (0, σ^{2} I)$ according to SNR.
4:	Receive $u = G s + z$ .
5:	MF preprocessing: $u_{d} = G^{H} u$ .
6:	Build network input from $G$ :
	$Ø_{0} \leftarrow norm (\| G \|) \in R^{M \times M}$	▹ magnitude channel normalized
	$Ø_{1} \leftarrow (∠ G + π) / (2 π) \in R^{M \times M}$	▹ phase channel normalized
	$Ø \leftarrow stack (Ø_{0}, Ø_{1}) \in R^{2 \times M \times M}$ .
7:	Forward pass: $\hat{ρ} \leftarrow MobileNetV 3 (Ø; W) \in R^{M}$ .
8:	Apply phase correction: $\tilde{u} = u_{d} ⊙ e^{j \hat{ρ}}$ .
9:	Predict phases $\hat{θ} \leftarrow ∠ (\tilde{u})$ and target $θ \leftarrow ∠ (s)$ .
10:	Wrapped phase error: $d \leftarrow \hat{θ} - θ$ .
	$d_{w} \leftarrow atan2 (sin d, cos d)$ .
11:	Loss $L \leftarrow MSE (d_{w})$ .
12:	Update $W$ using backpropagation.
13:	end for

Algorithm 2 Inference mode of the proposed MobileNetV3 receiver
Input: Trained weights $W$ ; channel matrix $G \in C^{M \times M}$ ; received vector $u \in C^{M}$ .
Output: Detected symbols $\hat{s}$ .
1:	Build network input from $G$ :
	$Ø_{0} \leftarrow norm (\| G \|)$ , $Ø_{1} \leftarrow (∠ G + π) / (2 π)$
	$Ø \leftarrow stack (Ø_{0}, Ø_{1}) \in R^{2 \times M \times M}$
2:	MF preprocessing: $u_{d} = G^{H} u$ .
3:	Forward pass: $\hat{ρ} \leftarrow MobileNetV 3 (Ø; W)$
4:	Apply Phase correction: $\tilde{u} = u_{d} ⊙ e^{j \hat{ρ}}$ .
5:	Phase Correction $\tilde{u} = u_{d} ⊙ e^{j \hat{ρ}}$
6:	Estimate $\hat{θ} \leftarrow ∠ (\tilde{u})$

Rather than explicitly learning to cancel ICI entirely, the preprocessing stage partially mitigates ICI through an initial matched-filter step. By reducing the complexity of the interference, the network can focus on correcting residual distortions, primarily due to noise, by leveraging phase consistency among the received signals.

The effectiveness of this preprocessing approach is clearly demonstrated in Figure 1:

Before preprocessing (Blue scatters): Symbols are widely scattered due to ICI and Gaussian noise, limiting accurate identification.
After preprocessing (Orange scatters): Symbols are clearly clustered around the most effective QPSK constellation points, suggesting a substantial reduction in interference. This simplifies the learning process for MobileNetV3 correction.

For reproducibility, Table 1 details the dataset construction and stored tensors used in training: the transmitted 4-QAM (QPSK) symbols, the matched-filtered noisy received vectors, the per-frame channel matrices, and the binary LOS/NLOS indicator that controls how channel realizations are selected across frames, together with the fixed SNR and modulation order. In particular, LOS and NLOS channel snapshots are alternated (toggled) during dataset generation to expose the network to both propagation conditions; however, this indicator is used only for bookkeeping and analysis and is not provided as an input feature. Therefore, the proposed model is not explicitly aware of whether the current received frame corresponds to LOS or NLOS, and it must learn a unified mapping that generalizes across both channel regimes.

Table 2 complements this by listing all learning-related settings for the proposed MobileNetV3 receiver, including the backbone choice, tensor shapes, wrapped phase-MSE objective, optimizer and learning rate, batch/epoch schedule, validation protocol, numerical precision, and magnitude/phase normalization used to build the network input.

4. Results and Experiments

The MobileNetV3-based equalizer was assessed in simulation for computational complexity (

O

), BER, and Block-Error Rate (BLER). Performance was evaluated against OFDM systems utilizing both linear and nonlinear detectors. A WINNER II V2V channel model was employed to simulate a doubly selective environment [26]. All performance experiments were conducted under ideal channel state information conditions. The complete channel and system configurations are detailed in Table 3.

The proper dataset SNR for the network training was determined by running a series of simulations under several fixed and mixed SNR values. The results in Figure 3 show that training with a wider range of SNR values (or with mixed-SNR combinations) does not improve the final BER performance. In contrast, training only at 15 dB achieves a comparable BER while converging faster and requiring fewer computational resources.

The Big

O

Benchmark chart (Figure 4) presents the computing complexity of several equalization algorithms applied to OFDM signals. The x-axis represents N, indicating the number of symbols, while the y-axis measures ‘Operations’ in terms of real products, with values displayed on a logarithmic scale. The red dashed line at

N = 48

delineates the particular complexity associated with the 802.11p standard parameters [27].

Figure 4 shows frame size (x-axis,

N = 48

) and operation count (y-axis, log scale). Key points follow:

The OSIC approach demonstrates a significant escalation in operational complexity as N increases. In contrast to prior observations, it exhibits the lowest complexity among the evaluated strategies. Concurrently, the growth trajectories of NearML stabilize as N escalates. Significantly, NearML exhibits a increased complexity compared to LMMSE.
MobileNetV3-Small complexity: For an $N \times N$ input with 2 channels, each depthwise-separable block at layer ℓ with cumulative stride $s_{ℓ}$ costs $\frac{N^{2}}{s_{ℓ}^{2}} (k_{ℓ}^{2} C_{ℓ} + C_{ℓ} C_{ℓ + 1})$ MACs, where $k_{ℓ}$ is the kernel size and $C_{ℓ}$ , $C_{ℓ + 1}$ are the input/output channel counts. Since MobileNetV3 uses fixed channel widths and small kernels (e.g., $k = 3$ ), the complexity grows mainly with the spatial size $N^{2}$ . Early downsampling ( $s_{ℓ} > 1$ ) reduces the spatial size of feature maps (e.g., $48 \times 48 \to 24 \times 24$ for stride 2), so later layers operate on fewer pixels and the total operations drop significantly. Thus MobileNetV3 scales roughly with $N^{2}$ but is nearly constant for a fixed N (e.g., 48), unlike fully connected layers whose cost can grow quadratically with input size [21].
Occupying the most complex is LMMSE, bearing the highest complexity, scaling as $N^{3}$ due to the need to invert a matrix.

The graph shows a comparison of both BER (Figure 5) and BLER (Figure 6) performance between PhaseNet (Figure 2) and classical equalization methods—MobileNetV3, ComNet, LMMSE, OSIC, and NearML—across various SNR values in an OFDM system. In the context of LTE, BLER is defined as the ratio of the number of erroneous blocks to the total number of received blocks. This metric is calculated using a Cyclic Redundancy Check (CRC) evaluation, where each transport block has an attached CRC. At the receiver side, the transport block is considered successfully decoded if the CRC calculated by the receiver matches the CRC sent by the transmitter. In BLER metrics, MobileNetV3 behaves pretty similar to the classical methods and does not show a significant improvement. However, its capability to being parallelized makes it a suitable approach for real-time implementation.

Low SNR Range (5–15 dB): In this range, MobileNetV3 has lower BER than classical methods. The BER for MobileNetV3 decreases significantly, reaching values below $10^{- 2}$ by 10 dB, whereas methods such as OSIC, LMMSE, and NearML remain closer to $10^{- 1}$ .
Moderate SNR Range (15–20 dB): As the SNR increases, MobileNetV3 slows its decreasing rate compared to traditional methods around 17 dB. ComNet shows only limited improvement, with a slow reduction in BER and an almost flat trend. By 20 dB, the MobileNetV3 BER approaches $10^{- 3}$ , which is higher than the classical methods, it does not surpass the performance of the classical methods.
High SNR Range (20–25 dB): At higher SNR values, the performance of MobileNetV3 flattens and stabilizes around $10^{- 3}$ , while classical methods continue to decrease and achieve better results under noise. ComNet shows a similar stabilization but with a noticeable offset of about one order of magnitude, reaching BER values near $10^{- 2}$ . Traditional detectors such as LMMSE and OSIC achieve slightly lower error rates, although the gap becomes minimal in this regime. Neural networks demonstrate their limitations beyond the trained SNR range but remain competitive with classical approaches.

We also extended the evaluation scope to higher-order constellations. Figure 7 and Figure 8 report the BER and BLER performance, respectively, when the same receiver architecture is tested on QPSK, 8-PSK, and 16-QAM.

As the constellation order increases, constellation points become more densely packed for a fixed average symbol energy. This reduces the effective separation between decision regions, making symbol detection more sensitive to additive noise as well as to residual phase and amplitude errors. As a result, for the same SNR, higher-order constellations generally exhibit higher symbol error rates and, under Gray mapping, higher BER.

This trend is consistent with our results: compared to QPSK, the 8-PSK BER curve is shifted upward by approximately two orders of magnitude on average, and moving from 8-PSK to 16-QAM introduces roughly one additional order of magnitude. The BLER curves follow the same ordering, indicating that the network maintains the same relative behavior when evaluated on blocks. This is expected because BLER treats a block as incorrect if it contains even a single bit error.

5. Discussion

MobileNetV3 is an efficient neural network architecture designed to achieve fast inference by relying primarily on lightweight convolutional operations. Its depthwise separable convolution strategy is particularly well suited for embedded and real-time applications, as it significantly reduces computational cost while maintaining strong feature extraction capability. In this work, we explore a new application of MobileNetV3 in a regression setting, where the objective is to estimate a latent representation of the channel as a single vector. Conceptually, this can be interpreted as extracting a diagonal like summary of the channel matrix, while simultaneously incorporating the most informative features from the entire channel.

Beyond the architectural efficiency of MobileNetV3, the novelty of the proposed receiver lies in how channel knowledge is exploited and how the network output is integrated into the classical OFDM chain. Instead of directly estimating transmitted symbols or learning a full equalizer, the network processes the complex channel matrix

G \in C^{48 \times 48}

by forming the real-valued two-channel tensor

χ = stack (| G |, ∠ G) \in R^{48 \times 48 \times 2}

and regresses a phase-correction vector

\hat{ρ} \in R^{1 \times 48}

. The model is trained with an MSE objective that minimizes the discrepancy between the corrected phase

∠ (u_{d} ⊙ e^{j \hat{ρ}})

and the ideal phase

∠ s

. During inference,

\hat{ρ}

is mapped to a unit-magnitude phasor

e^{j \hat{ρ}}

and applied elementwise to the matched-filter output

u_{d}

(which already partially mitigates ICI), yielding the final phase estimate

\hat{θ} = arg (u_{d} ⊙ e^{j \hat{ρ}})

. This structured design targets residual phase distortions after model-based preprocessing, rather than replacing the detection or equalization pipeline.

Furthermore, a matched-filter preprocessing stage is explicitly introduced prior to the neural network to partially mitigate channel-induced distortions and tighten the received constellation, thereby reducing the learning burden of the network. In contrast to end-to-end architectures that implicitly learn estimation and equalization jointly, the proposed approach enforces a structured decomposition: the channel matrix is explicitly used to construct the preprocessing stage and the network input representation, while training is performed using a wrapped phase-MSE criterion to properly handle phase periodicity. This model-aware design differentiates the proposed receiver from existing NN-based OFDM detectors such as ComNet.

One limitation of the current design is that the network can only be trained over a limited SNR range, and it does not incorporate temporal context from previous OFDM frames. As a consequence, residual inter carrier interference cannot be fully eliminated when the interference originates from adjacent V2V frames. Nevertheless, the model effectively reduces BER levels over low SNR values and various channel impairments within the considered operating range.

In terms of BLER performance, the network behaves similarly to the classical methods baseline. This is a promising result as it indicates that the network is capable of learning and reproducing the behavior of a traditional signal processing method. It is also worth noting that neural networks are highly parallelizable, even on embedded platforms. With current state-of-the-art hardware accelerators such as NPUs, FPGAs, and lightweight AI inference engines, deploying MobileNet-based architectures in practical V2V systems is becoming increasingly feasible.

6. Conclusions

In this paper, a low-complexity detection scheme for M-PSK symbols in OFDM systems operating over doubly dispersive V2V channels is proposed. The MobileNetV3-based detector emerges as a suitable solution for mitigating the effects of severe ICI induced by Doppler spread in vehicular communication links. Unlike classical linear and nonlinear equalizers, which rely on computationally expensive matrix inversions or iterative interference cancellation, the proposed receiver introduces a lightweight preprocessing stage based on the signal model that compensates for Doppler-induced phase distortions. This preprocessing enables the neural network to focus on symbol detection by exploiting phase-data correlations rather than directly suppressing ICI. The proposed MobileNetV3-based receiver is particularly effective in the low-to-mid SNR regime, where ICI degrades the performance of conventional OFDM systems. Simulation results show that the proposed receiver improves BER performance compared to linear and nonlinear detectors, as well as ComNet, in the low-SNR regime (below 15 dB). At high SNR values, a mild performance saturation is observed, which is consistent with the absence of temporal memory in the network architecture. Nevertheless, this trade-off is justified by the reduced computational complexity of the proposed solution.

Overall, the proposed receiver distinguishes itself from existing equalization and neural-based schemes through its compact architecture, inherent parallelism, and reduced computational burden. These characteristics make the MobileNetV3-based approach a strong candidate for real-time V2V receivers and hardware-constrained platforms such as FPGA- or ASIC-based embedded systems.

Author Contributions

Conceptualization, L.E.T.-G., J.A.D.-P.-F. and F.P.-C.; methodology, L.E.T.-G., J.A.D.-P.-F. and F.P.-C.; software, L.E.T.-G., J.A.D.-P.-F. and F.P.-C.; validation, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G.; formal analysis, L.E.T.-G., J.A.D.-P.-F. and F.P.-C.; investigation, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G.; resources, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G.; data curation, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G.; writing—original draft preparation, L.E.T.-G., J.A.D.-P.-F. and F.P.-C.; writing—review and editing, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G.; visualization, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G.; supervision, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G.; project administration, D.d.P.-F., J.-C.L.-P., C.D.-V.-S. and L.R.V.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The code used to run the simulations presented in this study is openly available through the following GitHub repository (accessed on 29 January 2026): https://github.com/Tonix22/PhdDegreeCode/tree/main/MatlabCode/ZeroFocingMobileNet.

Acknowledgments

The authors acknowledge at Universidad Panamericana.

Conflicts of Interest

The authors declare no conflicts of interest.

References

IEEE P1609.2.1/D3; IEEE Approved Draft Standard for Wireless Access in Vehicular Environments (WAVE)—Certificate Management Interfaces for End Entities. IEEE: New York, NY, USA, 2026; pp. 1–259.
Hartenstein, H.; Laberteaux, L. A tutorial survey on vehicular ad hoc networks. IEEE Commun. Mag. 2008, 46, 164–171. [Google Scholar] [CrossRef]
Gutiérrez, C.A.; Contreras-Ponce, O.; Ornelas-Lizcano, J.C.; Pätzold, M.; Castillo-Soria, F.R. Polarimetric Modeling of Mobile Fading Channels. IEEE Trans. Veh. Technol. 2025, 74, 21–37. [Google Scholar] [CrossRef]
Del Puerto-Flores, J.A.; Peña-Campos, F.; Parra-Michel, R.; Del-Valle-Soto, C. Carrier Diversity Incorporation to Low-Complexity Near-ML Detection for Multicarrier Systems over V2V Radio Channel. Sensors 2021, 21, 6067. [Google Scholar] [CrossRef]
Acosta-Marum, G.; Ingram, M.A. Six time- and frequency- selective empirical channel models for vehicular wireless LANs. IEEE Veh. Technol. Mag. 2007, 2, 4–11. [Google Scholar] [CrossRef]
Nguyen, T.H.; Nguyen, T.H.; Yoon, T.; Jung, W.S.; Yoo, D.; Ro, S. An ICI Suppression Analysis Testbed for Harbor Unmanned Ground Vehicle Deployment. IEEE Access 2019, 7, 107757–107768. [Google Scholar] [CrossRef]
Del Puerto-Flores, J.A.; Castillo-Soria, F.R.; Gutiérrez, C.A.; Peña-Campos, F. Efficient Index Modulation-Based MIMO OFDM Data Transmission and Detection for V2V Highly Dispersive Channels. Mathematics 2023, 11, 2773. [Google Scholar] [CrossRef]
Ko, K.; Lim, S. Generalized Maximum Delay Estimation for Enhanced Channel Estimation in IEEE 802.11p/OFDM Systems. Electronics 2025, 14, 2404. [Google Scholar] [CrossRef]
Toland, K.; Taiwo, P.; Cole-Rhodes, A. Towards Equalization of Mixed Multi-user OFDM Signals Over a Doubly-Dispersive Channel. In Proceedings of the 2023 57th Annual Conference on Information Sciences and Systems (CISS); IEEE: New York, NY, USA, 2023; pp. 1–5. [Google Scholar] [CrossRef]
Liu, X.; Anand, K.; Guan, Y.L.; Deng, L.; Fan, P.; Zhou, Z. BEM-PSP for Single-Carrier and SC-FDMA Communication Over a Doubly Selective Fading Channel. IEEE Trans. Wirel. Commun. 2020, 19, 3924–3937. [Google Scholar] [CrossRef]
Peña-Campos, F.; Parra-Michel, R.; Kontorovich, V. MIMO Multicarrier Transmission Over Doubly Selective Channels with Virtual Trajectories Receiver. IEEE Trans. Veh. Technol. 2019, 68, 9330–9338. [Google Scholar] [CrossRef]
Ramadan, K.; Aqeel, I.; Hassan, E.S. Robust OTFS Detection via MMSE-DFE Equalization for ISAC in Doubly Dispersive Channels. Mathematics 2025, 13, 3545. [Google Scholar] [CrossRef]
Li, M.; Wang, W. Phase Noise Effects on OFDM Chirp Communication Systems: Characteristics and Compensation. Information 2024, 15, 221. [Google Scholar] [CrossRef]
Kim, K.H. Low-Complexity Suboptimal ML Detection for OFDM-IM Systems. IEEE Wirel. Commun. Lett. 2023, 12, 416–420. [Google Scholar] [CrossRef]
Vlachos, E.; Lalos, A.S.; Berberidis, K. Low-Complexity OSIC Equalization for OFDM-Based Vehicular Communications. IEEE Trans. Veh. Technol. 2017, 66, 3765–3776. [Google Scholar] [CrossRef]
Zhang, X.; Xing, L.; Wu, H.; Ji, B.; Zhang, G. Low-Complexity Decision Feedback Equalization for Single-Carrier Massive MIMO Systems. IEEE Trans. Veh. Technol. 2024, 73, 17316–17330. [Google Scholar] [CrossRef]
Velázquez, R.; Pissaloux, E.; Del-Valle-Soto, C.; Arai, M.; Valdivia, L.J.; Del Puerto-Flores, J.; Gutiérrez, C.A. Performance Evaluation of Active and Passive Haptic Feedback in Shape Perception. In Proceedings of the 2019 IEEE 39th Central America and Panama Convention (CONCAPAN XXXIX); IEEE: New York, NY, USA, 2019; pp. 1–6. [Google Scholar] [CrossRef]
Mohammed, A.F.Y.; Sultan, S.M.; Patni, S. Collaborative Beamforming with DQN for Interference Mitigation in 5G and Beyond Networks. Telecom 2024, 5, 1192–1204. [Google Scholar] [CrossRef]
Aziz, M.A.; Rahman, M.H.; Tabassum, R.; Sejan, M.A.S.; Baek, M.S.; Song, H.K. A Hybrid Deep Learning Framework for OFDM with Index Modulation Under Uncertain Channel Conditions. Mathematics 2024, 12, 3583. [Google Scholar] [CrossRef]
Tonix-Gleason, L.E.; Del-Puerto-Flores, J.A.; Castillo-Soria, F.R.; Parra-Michel, R.; Campos, F.P. Neural Network Aided M-PSK Detection in 802.11P V2V OFDM Systems Under ICI Conditions. IEEE Wirel. Commun. Lett. 2025, 14, 3420–3424. [Google Scholar] [CrossRef]
Howard, A.; Sandler, M.; Chu, G.; Chen, L.C.; Chen, B.; Tan, M.; Wang, W.; Zhu, Y.; Pang, R.; Vasudevan, V.; et al. Searching for MobileNetV3. arXiv 2019, arXiv:1905.02244. [Google Scholar] [CrossRef]
Gümüş, M.; Duman, T.M. Channel Estimation and Symbol Demodulation for OFDM Systems Over Rapidly Varying Multipath Channels with Hybrid Deep Neural Networks. IEEE Trans. Wirel. Commun. 2023, 22, 9361–9373. [Google Scholar] [CrossRef]
Gizzini, A.K.; Chafii, M. Deep Learning Based Channel Estimation in High Mobility Communications Using Bi-RNN Networks. In Proceedings of the ICC 2023—IEEE International Conference on Communications; IEEE: New York, NY, USA, 2023; pp. 2607–2612. [Google Scholar] [CrossRef]
Almeida, I.; Guerreiro, J.; Dinis, R. On Deep Learning Hybrid Architectures for MIMO-OFDM Channel Estimation. Electronics 2025, 14, 4692. [Google Scholar] [CrossRef]
Gao, X.; Jin, S.; Wen, C.K.; Li, G.Y. ComNet: Combination of Deep Learning and Expert Knowledge in OFDM Receivers. IEEE Commun. Lett. 2018, 22, 2627–2630. [Google Scholar] [CrossRef]
MathWorks. WINNER II Channel Documentation. 2026. Available online: https://la.mathworks.com/help/comm/ug/winner-ii-channel.html (accessed on 26 February 2026).
Unapproved Draft Std P802.11p /D11.0; IEEE Draft Standard for Information Technology—Telecommunications and Information Exchange Between Systems Local and Metropolitan Area Networks Specific Requirements Part 11: Wireless Lan Medium Access Control (MAC) and Physical Layer (PHY) Specifications Amendment: Wireless Access in Vehicular Environments. IEEE: New York, NY, USA, 2010.

Figure 1. Received symbol constellation before and after preprocessing.

Figure 2. Block diagram of the proposed MobileNetV3 receiver.

Figure 3. BER performance over different training SNR values.

Figure 4. Complexity benchmark.

Figure 5. MobileNetV3 vs. classical methods in BER performance.

Figure 6. MobileNetV3 vs. classical methods in BLER performance.

Figure 7. BER performance across modulation orders (QPSK, 8-PSK, 16-QAM).

Figure 8. BLER performance across modulation orders (QPSK, 8-PSK, 16-QAM).

Table 1. Data Generation.

Name	Type/Shape	Description
`s_all`	$C^{48 \times 100000}$	Tx 4-QAM $s$ (unit average power)
`Mf_all`	$C^{48 \times 100000}$	Matched-filtered observations $G^{H} u$
`G_all`	$C^{48 \times 48 \times 100000}$	Channel matrices $G$
`isLOS`	${0, 1}^{100000 \times 1}$	$1 = LOS, 0 = NLOS$
`SNR`	scalar	15 dB
`modorder`	scalar	4 (Gray-mapped QAM)

Table 2. Hyperparameters for custom OFDM MobileNetV3.

Hyperparameter	Value
Backbone	MobileNetV3-Small (`torchvision`)
Input shape	$2 \times 48 \times 48$
Output dim	48
Loss	Wrapped phase MSE on $∠ (u_{d} ⊙ e^{j \hat{θ}})$ vs. $∠ (s)$
Optimizer	AdamW
Learning rate	$1 \times 10^{- 3}$
Batch size	64
Max epochs	10
Validation split	$0.1$
Num. workers	16
Precision	`“32-true”`
$χ_{0}$ magnitude norm	Per-sample min–max to $[0, 1]$
$χ_{1}$ phase norm	Map $[- π, π] \to [0, 1]$
$u_{d}$ preprocessing	Real/imag, no normalization
Target phase	$∠ (s) \in [- π, π]$
Checkpointing	Top-3 by `val_loss`; optional warm start/resume

Table 3. Channel and system parameters.

Parameter (Units)	Value
Path Delay (µs)	${0, 0.1, 0.2, 0.3, 0.4, 1}$
Path Power (dB)	${0, - 3, - 5, - 7, - 9, - 15}$
Bandwidth (MHz)	10
Taps number L	6
Doppler (kHz)	Jakes-0.5
Block length ( $N, N_{D}$ )	${64, 48}$
Constellation size	4-PSK
Spectral efficiency (b/Hz/s)	1.5

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tonix-Gleason, L.E.; Del-Puerto-Flores, J.A.; Peña-Campos, F.; del Puerto-Flores, D.; López-Pimentel, J.-C.; Del-Valle-Soto, C.; Vela-Garcia, L.R. Efficient Deep Learning-Based M-PSK Detection for OFDM V2V Systems Using MobileNetV3. Algorithms 2026, 19, 210. https://doi.org/10.3390/a19030210

AMA Style

Tonix-Gleason LE, Del-Puerto-Flores JA, Peña-Campos F, del Puerto-Flores D, López-Pimentel J-C, Del-Valle-Soto C, Vela-Garcia LR. Efficient Deep Learning-Based M-PSK Detection for OFDM V2V Systems Using MobileNetV3. Algorithms. 2026; 19(3):210. https://doi.org/10.3390/a19030210

Chicago/Turabian Style

Tonix-Gleason, Luis E., José A. Del-Puerto-Flores, Fernando Peña-Campos, Dunstano del Puerto-Flores, Juan-Carlos López-Pimentel, Carolina Del-Valle-Soto, and Luis René Vela-Garcia. 2026. "Efficient Deep Learning-Based M-PSK Detection for OFDM V2V Systems Using MobileNetV3" Algorithms 19, no. 3: 210. https://doi.org/10.3390/a19030210

APA Style

Tonix-Gleason, L. E., Del-Puerto-Flores, J. A., Peña-Campos, F., del Puerto-Flores, D., López-Pimentel, J.-C., Del-Valle-Soto, C., & Vela-Garcia, L. R. (2026). Efficient Deep Learning-Based M-PSK Detection for OFDM V2V Systems Using MobileNetV3. Algorithms, 19(3), 210. https://doi.org/10.3390/a19030210

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Efficient Deep Learning-Based M-PSK Detection for OFDM V2V Systems Using MobileNetV3

Abstract

1. Introduction

Contribution and Article Structure

2. Model System

3. Deep Learning MobileNetV3

4. Results and Experiments

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI