1. Introduction
OFDM has been standardized in 5G wireless communications, exhibiting greater resistance to frequency-selective fading compared to single-carrier systems [
1]. In contrast to single-carrier systems, which have a broad and leaky spectrum (such as raised cosine shapes with an excess factor), the OFDM spectrum is rectangular [
1]. This rectangular configuration confines nearly all the energy within the defined spectrum, resulting in a narrower, higher spectral efficiency. On the other hand, optical-orthogonal frequency division multiplexing (O-OFDM) can effectively eliminate intersymbol interference (ISI) caused by fiber chromatic dispersion (CD) using simple, single-tap equalization [
2,
3,
4,
5]. Consequently, O-OFDM reduces computational complexity and cost compared to single-carrier m-QAM. The latter requires multi-tap equalizers, such as feed-forward equalization or time-domain equalization (TDE-MIMO), implemented within a more complex DSP unit [
3]. In DP-QAM systems, polarization mode dispersion (PMD) is stochastic. Therefore, the finite impulse response (FIR) taps in TDE-MIMO must be continuously updated for each received symbol. In contrast, O-OFDM incorporates a straightforward cyclic prefix (CP) for each symbol to minimize ISI resulting from either CD or PMD. This enhancement demonstrates the potential of DP O-OFDM to serve as a replacement for traditional DP-QAM systems. Other benefits of OFDM include the parallelization with fast Fourier transform—a benefit for field programmable gate arrays (FPGAs)—and signal-to-noise ratio (SNR) monitoring in MHz or kHz granularity at any point in the spectrum [
6].
Optical fast-OFDM (F-OFDM) has emerged as a promising updated multicarrier solution to OFDM, in order to further reduce the subcarrier spacing to half the symbol rate per subcarrier [
7,
8]. This innovative approach offers significant advantages over conventional OFDM, including improved spectral efficiency, lower energy consumption, and reduced latency [
7,
8]. Additionally, coherent optical F-OFDM enhances the frequency offset compensation in coherent detection and boosts resilience to phase noise in self-coherent detection [
7]. In traditional OFDM, MHz-spaced parallel subcarriers form low-capacity data transmission signals. On the other hand, F-OFDM, which utilizes the compression properties of the inverse fast cosine transform (IFCT), reduces the frequency spacing between subcarriers, offering improved bandwidth efficiency compared to standard O-OFDM, as it employs half the subcarrier spacing [
7,
8]. However, the sub-MHz spacing can cause substantial distortion in phase-modulated signals, making it crucial to use single-dimensional modulation formats like amplitude shift-keying (ASK) [
9]. Previous demonstrations have shown IFCT-based optical F-OFDM for long-haul coherent transmission using optical double-sideband signals [
7], using a special configuration for 16QAM modulation [
10], and alternatively using multi-tap equalization for performance improvement, which sacrifices complexity [
11]. As a more straightforward implementation, direct-detected optical F-OFDM has also been applied in cost-sensitive local networks utilizing multimode fiber links, given the technique’s cost-effectiveness [
8,
9]. Nonetheless, like conventional O-OFDM, F-OFDM signals suffer from a high peak-to-average power ratio, leading to comparable transmission performance under similar signal capacities. In the work reported by Giacoumidis et al. [
12], the dynamic parameter requirements of sub-ranging quantizers—identified as digital-to-analog and analog-to-digital converters (DACs/ADCs)—involved in optical F-OFDM signals were also analyzed, serving as a proof-of-concept for future real-time implementations.
To further increase signal capacity, space division multiplexing (SDM) employing m-QAM has been proposed mainly in the realm of few-mode fibers (FMFs) and multicore fibers [
13,
14,
15]. By leveraging multiple spatial modes within a single fiber, SDM-QAM and SDM-OFDM [
13,
14,
15,
16] enhance data transmission capacity while maintaining high spectral efficiency. This approach is particularly beneficial for expanding bandwidth in optical communication systems, with direct relevance to high-capacity short-reach scenarios such as data center interconnects, as well as scalable metro and backbone networks envisioned for future 6G infrastructures [
17,
18]. Additionally, it promotes the efficient use of existing fiber infrastructure, reducing costs and power consumption while increasing overall network throughput. These are key requirements for both energy-efficient data centers and ultra-dense 6G communication frameworks. O-OFDM for SDM applications has been implemented for 314-Tbit/s (576 × 380.16 MHz 5G OFDM signals) wavelength-division-multiplexing (WDM)-based intermediate frequency (IF)-over-fiber transmission for analog mobile fronthaul [
14]. SDM superchannel transmission over FMF with a data rate of 100 Gb/s and beyond has been recently demonstrated through coherent O-OFDM and single-carrier superchannel transmission [
15]. For equalization purposes in SDM, Volterra-based nonlinear equalization has been proposed in [
19] to reduce inter-modal nonlinear penalties in two-mode fiber (TMF). It was shown that for ~260 Gb/s DP coherent O-OFDM at 1040 km, the TMF inter-modal nonlinear penalty can be reduced by up to ~4 dB [
19].
On the other hand, machine learning (ML) has become prevalent across various fields, and its use in optical communications is rapidly increasing due to the growing global internet traffic. Specifically, in the task of enhanced equalization, it has been utilized for coherent and direct-detection optical systems. In O-OFDM systems, unsupervised clustering techniques and advanced supervised ML techniques have been used for enhanced equalization, such as deep neural networks (DNNs) and support vector machines (SVMs) [
20,
21,
22,
23]. In [
23], direct-detection SDM-OFDM was considered using an SVM-based equalizer. A convolutional neural network (CNN) for nonlinear equalization has also been implemented in conventional 16QAM OFDM, as reported in [
24]. For F-OFDM, a low computational load artificial neural network (ANN) design was compared to the benchmark inverse Volterra series transfer function (IVSTF)-based compensator for long-haul single-mode fiber links at 9.69 Gb/s [
25]. Results showed that ANN enabled an 80 km extension in transmission reach, effectively addressing stochastic nonlinear impairments like parametric noise amplification [
25]. Additionally, ANN relaxed the dynamic parameter requirements of sub-ranging quantizers, reducing the optimum clipping ratio and quantization bits by 2 dB and 2 bits compared to both linear equalization and the IVSTF equalizer [
25].
Despite the growing body of work on ML-based equalization in optical systems, prior studies have primarily focused on single-mode fiber (SMF) transmission or conventional O-OFDM systems [
20,
21,
22,
23,
24,
25]. Similarly, while F-OFDM has been investigated in both direct-detected and coherent optical links [
7,
8,
9,
10,
11,
12,
25], its application has largely been limited to single-channel or single-mode scenarios, without considering the additional complexity introduced by SDM over FMFs. To the best of our knowledge, this work represents the first comprehensive investigation of coherent optical F-OFDM combined with ML-based equalization in an SDM-FMF transmission environment. The novelty of this work lies in three key aspects:
- (i)
The integration of coherent F-OFDM with SDM over FMFs, enabling the evaluation of multicarrier transmission under strong inter-modal crosstalk and modal dispersion;
- (ii)
The application and comparison of supervised ML equalizers, specifically DNN, CNN, and bidirectional recurrent neural networks (RNN) and long-short term memory (LSTM) architectures within a high-dimensional MIMO-SDM receiver framework;
- (iii)
A systematic performance and complexity analysis highlighting the trade-off between BER improvement and computational cost in SDM systems, which is not addressed in prior ANN/DNN-based optical equalization studies.
Unlike previous ML-based equalization approaches that target predominantly temporal impairments in SMF systems, the proposed approach addresses both temporal and spatial impairments inherent to SDM transmission, including inter-modal coupling and differential mode delay. This distinction is critical in demonstrating the effectiveness and limitations of ML equalizers in next-generation high-capacity optical networks.
2. Methods
A coherent optical F-OFDM system was generated, similar to that described in [
10] for 16QAM. Two bipolar 4ASK data sets were encoded using Gray code in Matlab (Matlab
® R2023b). A general transmission link diagram for the x-polarization is shown in
Figure 1a for both coherent optical OFDM and F-OFDM cases. The IFCT and FCT processes implemented 128 points, with 100 subcarriers designated for F-OFDM data transmission and 6 for phase estimation. The first two subcarriers were left unmodulated to facilitate the use of AC-coupled drive amplifiers and receivers. The generation and decoding processes were conducted using Matlab and integrated into the VPIphotonics system for the optical components and the fiber. Key steps in the F-OFDM signal creation included serial-to-parallel (S/P) conversion, symbol mapping, and an optimum 10% CP was inserted to mitigate ISI. For fair comparison, identical parameters were selected for the conventional coherent optical OFDM system.
The F-OFDM/OFDM system generated an optical polarization-multiplexed QAM signal using a pair of single-drive MZMs [
26] to independently modulate each polarization component of the carrier wave as depicted in
Figure 1a. The same constellation was used for both polarizations. An ideal polarization beam splitter was used to maintain orthogonality among X- and Y-polarizations, and then an ideal polarization beam combiner. The bit sequence generation (pseudo-random binary sequence, PRBS) for the encoding of the m-QAM F-OFDM/OFDM was set at 2
21, employing a virtually random sequence using Mersenne Twister in Matlab to avoid bit pattern recognition by the ML algorithms. For optical modulation, the in-phase/quadrature (IQ) single-drive architectures implemented for the X- and Y-polarization transmitters permitted advanced shaping of the modulator’s driving signals. The square-root-raised-cosine filtering at the transmitter side was selected, together with the matched filter at the receiver, to reduce the amount of detected noise, keeping ISI at a minimum. For the transmitter’s laser continuous wave source, a 300 kHz linewidth was set. In the inset of
Figure 1a, the 4QAM for OFDM and the combined 4ASK for the F-FODM constellation diagrams are illustrated (qualitative showcase).
For the coherent receiver as depicted in
Figure 1b, an optical coherent quadrature receiver model was used for signal detection. It incorporated a local oscillator (LO), optical hybrids, post-detection electrical filters, and ADCs. Enhanced accuracy was provided via clock recovery. Clock recovery ensured optimal sampling times in back-to-back configurations. The electrical received signal was digitized using four ADCs, outputting them as an
N × 4 float matrix, where
N was the product of the simulation time window and the sampling rate. Ideal optical hybrids were used without considering imbalances and losses. The ADC resolution for each channel was set at 8 bits. The LO laser linewidth was also set at 300 kHz with a frequency offset from the transmitter laser of 500 kHz. Balanced PIN photodiodes were used with the following parameters: a responsivity of 0.9 A/W, dark current of 10 nA, and thermal noise of 20 pA/Hz
0.5, including shot noise. Low-pass filters (LPFs) were implemented using a Bessel transfer function, featuring a bandwidth of 0.8 times the symbol rate. This bandwidth was selected based on a symbol rate of 12 Gbaud per channel, utilizing a 4th-order filter. At the digital receiver side, a MIMO processor was implemented to effectively mitigate channel crosstalk in SDM systems before OFDM/F-OFDM demodulation, as shown in
Figure 1b.
In
Figure 2, the total SDM F-OFDM QAM transceiver system is illustrated, where the principles of SDM over FMFs are shown, which support four LP modes, totaling twelve orthogonal modes (LP01, LP11a, LP11b, LP21a, LP21b, and LP02), each accommodating two polarizations. Furthermore, the Erbium-doped fiber amplifier (EDFA)-SDM model effectively accounts for mode-dependent gain and noise, enhancing the overall performance of the transceiver system. Ideal polarization coupling and splitting were used for the modes. For the FMF, the following parameters were utilized: The supported modes were (0, 1), (1, 1), (2, 1), and (0, 2). The attenuation was set to 0.2 dB/km. The relative mode delay was accounted for, and the CD was set at 20 ps/nm/km. The dispersion slope was set at 0.06 s/km
3. The differential group delay (DGD) value for each mode was 0, 1.3 × 10
−13, 1.5 × 10
−14, and 2 × 10
−13 s/m, respectively. The PMD was set at 0.01 s/km
0.5, while the intra-mode group delay deviation was 5 × 10
−15 s/m. The simulation study also includes mode coupling. The reference frequency set at the FMF was 1552 nm (193.1 THz). The mean section length in the FMF model was set at 500 m, with a section length deviation of 5 m. The inverse transmission matrix of the link was used, which is harnessed in the MIMO equalizer. The optical spectrum of the SDM channels at the reference frequency after the optical multi-mode amplifier and before FMF transmission is shown in
Figure 2b. Where “SB” denotes single-band simulation in the VPIphotonics simulator, which considers all channels collectively in the frequency domain, thereby addressing inter-channel crosstalk effects.
For the ML algorithms, both DNN and RNN models were employed in conjunction with the DSP, as illustrated in
Figure 1b. The DNN model was based on a fully connected architecture, as described in [
27]. This model utilizes the sliding window method to preprocess time-domain signals. The DNN structures are designed to operate with real numbers; consequently, arrays of complex numbers are converted into arrays of real numbers by stacking the real and imaginary parts into a single dimension. The initial hyperparameters selected for the model include: number of hidden of layers set to 4, number of neurons set to 8, using ReLU activation function, Adam-based stochastic gradient descent for the back-propagation adaptation, a learning rate of 0.001, loss function defined as minimum mean squared error (MMSE), a batch size of 32, the number of epochs set to 30, Kernel initializer based on He-Uniform, and a zeros bias initializer. Both DNN and RNN used 75% training data, 15% for testing, and 10% for validation. The RNN model was built upon a standard architecture as outlined in [
27]. Similar to the DNN model, it employs the sliding window technique for pre-processing time domain signals. Utilizing the
Keras framework, the RNN adopted a bidirectional RNN architecture, which processes input sequences in both forward and backward directions. This design enables the model to capture information from both past and future contexts. The architecture is structured as many-to-one, where multiple input values are used to predict a single output value. The inclusion of average nodes allows the output from different time steps to be combined, thereby enhancing the model’s capacity to understand and learn from the sequence data [
27]. Identical hyperparameters to the DNN model were adopted for fair comparison. To ensure reproducibility, the input to the model consists of complex 16QAM samples, which are decomposed into their in-phase and quadrature components and concatenated into real-valued input vectors. A sliding window of length 5–7 samples (empirically optimized) is used to capture short-term temporal dependencies in the signal. The dataset was also split into training, validation, and testing subsets using a 75%/10%/15% ratio across all models for fair comparison. Training was performed using the Adam optimizer with a learning rate of 0.001, batch size of 32, and 30 epochs, while the loss function was defined as the mean squared error (MSE). Identical hyperparameters to the DNN model were adopted for fair comparison. The CNN architecture was designed following standard deep learning practices, including the use of small convolutional kernels (size 3), 32 filters, and stacked convolutional layers with max-pooling operations for hierarchical feature extraction, as reported in [
28]. The LSTM model was implemented based on the standard architecture proposed by Hochreiter and Schmidhuber [
29], as well as its application in coherent optical systems reported in [
30]. The LSTM model incorporates input, forget, and output gates to regulate information flow within the memory cell. A configuration of 64 units with dropout regularization (0.2) was adopted following hyperparameter tuning to ensure stable training behavior [
30] and consistent comparison with the simpler RNN architecture.
4. Discussion
In this paper, the potential of coherent optical F-OFDM was explored as an advanced multicarrier solution, demonstrating its superiority over traditional coherent optical OFDM. The results highlighted the advantages of utilizing F-OFDM in combination with supervised ML equalization techniques, particularly in improving BERs in SDM-based FMF m-QAM systems. The findings indicate that while RNNs outperform DNNs in terms of BER under certain conditions, the complexity of RNNs presents a notable challenge. Specifically, the computational demands of RNNs, which demonstrated approximately 101.3 times more FLOPS compared to DNNs, make them less practical for real-time applications, particularly in cost-sensitive environments. On the other hand, as the number of neurons increases, the RNN shows improved performance; however, the diminishing returns in accuracy beyond a certain threshold suggest that simpler architectures like DNNs may be more efficient. It also may be prudent to consider adaptive CP length management [
31] based on the evolution curve against the FMF distance, highlighting that optimal configurations can enhance robustness while also accommodating the limits of system complexity. This adaptive management helps balance system performance and resource consumption. It also enables more efficient and scalable optical communication networks.
Furthermore, while the simulation framework incorporates realistic component parameters (e.g., DAC/ADC resolution, laser linewidth, fiber impairments), real-time deployment would introduce additional constraints such as processing latency, memory requirements, and hardware resource limitations in FPGA/application-specific integrated circuit (ASIC) platforms. The results indicate that although RNN- and LSTM-based equalization provides a superior BER performance, its significantly higher computational complexity (e.g., >100 × FLOPS compared to the DNN) may limit its immediate applicability in real-time systems, especially in power- and cost-constrained environments. In contrast, DNN-based equalizers offer a more favorable trade-off between performance and implementation complexity, making them more suitable for near-term hardware realization. Therefore, this work should be viewed as a proof-of-concept study that establishes performance bounds and design trade-offs, while future work will focus on experimental validation and hardware-oriented optimization, including reduced-complexity ML architectures and FPGA/ASIC implementation strategies.
To further justify the selection of ML architectures and evaluate the impact of model complexity on F-OFDM performance, an ablation-style comparison is presented in
Table 2.
Table 2 is related to the summary of the F-OFDM results shown in
Figure 8. In this context, the term “ablation-style” refers to a systematic comparison across different neural network architectures, where each model introduces a distinct structural modification (e.g., recurrence, convolution, or gating mechanisms) relative to a fully connected baseline network. This approach enables the isolation of the contribution of each architectural feature to the overall system performance. Specifically, the DNN serves as the baseline model, providing global feature learning through fully connected layers. The RNN extends this baseline by incorporating temporal dependencies via recurrent connections, while the LSTM model further enhances the recurrent structure by introducing gated memory mechanisms designed to capture long-term dependencies. In contrast, the CNN represents a fundamentally different approach, focusing on localized feature extraction through convolutional operations. As shown in
Table 2, the results demonstrate that increasing the architectural complexity does not necessarily lead to performance improvements in the considered F-OFDM equalization task. While the RNN achieves better performance than the DNN, it does so at a significantly higher computational cost. Similarly, the LSTM model, despite its more advanced memory structure, does not provide measurable gains over the simpler RNN, indicating that long-term dependencies are limited in this application. The CNN, on the other hand, exhibits inferior performance, suggesting that local feature extraction is less effective than global feature learning for this specific signal processing task. In general, the ablation-style comparison highlights a clear complexity–performance trade-off, confirming that lower-complexity models such as the DNN and RNN are sufficient to capture the underlying signal characteristics, while more sophisticated architectures introduce unnecessary computational overhead without the corresponding performance benefits.
5. Conclusions
This paper demonstrates the viability and advantages of optical F-OFDM as a next-generation multicarrier transmission solution. The innovative approach of reducing subcarrier spacing, combined with the application of ML techniques, specifically RNNs and DNNs, has shown promising results in enhancing spectral efficiency and improving the BER under various channel conditions, respectively. While the RNNs exhibited superior performance in managing ISI and inter-carrier interference and adapting to complex channel dynamics, their high computational complexity poses challenges for potential future real-time implementation. On the other hand, DNNs, with their simpler architecture, provide a compelling alternative that balances performance with efficient processing requirements.
To further assess the impact of model complexity, higher-complexity architectures, namely the CNN and the LSTM network, were also investigated. The results indicate that, despite their advanced structural capabilities for spatial and temporal feature extraction, these models do not provide additional performance gains for the considered F-OFDM scenario, while introducing significantly higher computational complexity.
This study also highlighted the importance of selecting appropriate system configurations, such as adaptive CP management and subcarrier number selection, to optimize performance while minimizing complexity. Overall, the findings reinforce the potential of F-OFDM integrated with ML strategies to meet the demands of modern SDM-based optical communication systems, while emphasizing that lower-complexity models can offer the most practical and efficient solutions. This paves the way for more efficient, scalable, and robust network implementations.