Hard-Decision Coded Modulation for High-Throughput Short-Reach Optical Interconnect

Coded modulation (CM), a combination of forward error correction (FEC) and high order modulation formats, has become a key part of modern optical communication systems. Designing CM schemes with strict complexity requirements for optical communications (e.g., data center interconnects) is still challenging mainly because of the expected low latency, low overhead, and the stringent high data rate requirements. In this paper, we propose a CM scheme with bit-wise hard-decision FEC and geometric shaping. In particular, we propose to combine the recently introduced soft-aided bit-marking decoding algorithm for staircase codes (SCCs) with geometrically-shaped constellations. The main goal of this CM scheme is to jointly boost the coding gain and provide shaping gain, while keeping the complexity low. When compared to existing CM systems based on M-ary quadrature-amplitude modulation (MQAM, M=64,128,256) and conventional decoding of SCCs, the proposed scheme shows improvements of up to 0.83 dB at a bit-error rate of 10−6 in the additive white Gaussian noise channel. For a nonlinear optical fiber system, simulation results show up to 24% reach increase. In addition, the proposed CM scheme enables rate adaptivity in single-wavelength systems, offering six different data rates between 450 Gbit/s and 666 Gbit/s.


Introduction
Modern optical fiber communication systems have been revolutionized by digital coherent technology via techniques such as pulse shaping, digital signal processing (DSP), and forward error correction (FEC) [1]. In order to support the Internet's exponential traffic growth and improve the transmission reach, the most natural way of achieving this goal is using high spectral efficiency (SE) modulation formats and powerful FEC techniques. The combination of modulation format and FEC is also known as coded modulation (CM) [2,3].
The purpose of CM is to increase the number of information bits per channel use (or information rate). High-order modulation formats can boost the information rates; however, there is often a gap between the In this paper, we propose a geometrically-shaped soft-aided SCC (GS-SA-SCC) coded modulation scheme. In particular, similar to the works in [17,31], we optimize the constellations and mapping strategy in terms of GMI. We then apply the SABM decoding algorithm for SCCs to yield additional coding gains. We show that SCCs with SABM decoding and GS provide gains up to 0.83 dB with respect to the baseline scheme with equidistant QAM and standard decoding for SCCs. Furthermore, we evaluate the proposed CM scheme in both single-span and multi-span wavelength-division multiplexing (WDM) optical fiber communication system, which covers a wide range of distances up to 950 km and line rates up to 666 Gbit/s. By evaluating the AIRs and post-FEC bit error rate (BER) results, we show that the proposed GS-SA-SCC CM scheme achieves up to 24% reach increases with respect to the system using equidistant QAM and standard decoding for SCCs for multiple line rates between 450 Gbit/s and 666 Gbit/s. The paper is structured as follows. In Section 2, we discuss general aspects of AIRs, GS, and SCCs with SABM decoding as well as the proposed GS-SA-SCC CM scheme. In Section 3, we present and analysis the advanced CM scheme with geometrically-shaped constellations and SABM for SCCs over additive white Gaussian noise (AWGN) channel. The simulation results of optical CM transmission system for both single-span system and multi-span systems are further presented in Section 4. Finally, we conclude this paper in Section 5.

Achievable Information Rates (AIRs)
Achievable Information Rates (AIRs) have emerged as practical tools to design optical fiber communication systems, i.e., to design modulation formats and predict the performance of FEC [49][50][51]. The channel MI represents an upper limit on the AIRs for a given channel when a given modulation format is used along with an information-theoretic optimum receiver. However, the implementation of such a decoder is in general avoided for practical reasons. A widespread choice for performance prediction of (binary) SD-FEC is the GMI, which is suitable for bit-wise (BW) decoder (i.e., a standard BICM receiver) [3,50].
Throughout this paper, random (row) vectors are denoted by X and their corresponding realizations are denoted by x. Matrices are denoted by X. The transmitted bits are assumed to be independent and uniformly distributed, which implies uniform symbols X (no PS is used). The receiver assumes a memoryless channel and also uses a bit-wise decoder (i.e., a standard BICM receiver). Under the assumptions above, the AIRs for SD-FEC and BW decoder is GMI where the bits mapped to the channel input X are represented by the random variables B = [B 1 , B 2 , . . . , B m ], m = log 2 M is the number of bits per constellation point, Y represents channel output, f Y|B i (Y|B i ) is the channel law and I(B i ; Y) is the MI between the bits and the symbols. As for HD-FEC schemes, we neglect the bit position asymmetry by assuming an ideal interleaver between BW demapper and HD-FEC decoder. Under the assumptions above, the AIRs are given by [52] whereB i represents the estimated bit after demapper and p corresponds to the average pre-FEC bit error rate (BER) across the m bit positions. Note that an average binary symmetric channel capacity might be a pessimistic AIR for generic HD-BW decoders. However, it is a more suitable AIR for HD-FEC decoders that disregard bit position asymmetry. Even though AIRs can be used to predict key metrics of communication systems, a gap between the AIR and the corresponding performance in practice is inevitable. In order to achieve a higher AIR, these penalties should be avoided. In the following, three SNR gaps, including the gap between HD-FEC and SD-FEC, shaping gap, and practical coding gap, are shown via an example. (1) and (2) for 64QAM are shown in Figure 1 (solid red and solid blue curves, respectively). These results show the theoretical gap (blue shaded area) of 1-2 dB. The SNR gap caused by using HD-FEC instead of SD-FEC is relatively small in the high-SNR regime (≈1 dB) with respect to the low-SNR regime (≈2 dB). Figure 1 also shows the gap (gray shaded area) between the Shannon capacity and ideal SD-FEC, which can be reduced by using a high-order and shaped modulation with an optimal distribution.  AIR analysis relies on an ideal assumption that there exists a FEC code with infinitely long codewords, which achieves an arbitrarily low BER at a code rate R = AIR. However, codes with a finite block length and a complexity-limited implementation will lead to a loss with respect to the AIR prediction. Here, we show the performance of LDPC codes using soft-decision decoding and SCC codes using hard-decision decoding (without any soft information) to achieve a post-FEC BER below 10 −6 . A simple concatenated outer code, i.e., BCH, can bring the post-SCC BER down to 10 −15 . The results are shown with markers in Figure 1. Binary LDPC codes are simulated with code rates R ∈ {3/5, 2/3, 3/4, 5/6, 8/9} from the DVB-S2 standard, with a block length of N = 64800 bits, and 50 decoding iterations. SCCs are simulated with higher code rates R ∈ {0.83, 0.85, 0.90, 0.92} and block size w ∈ {114, 126, 192, 252}, where w follows the notation in ( [47], Figure 1b). The component codes we chose for these SCCs are the most popular ones, i.e., binary Bose-Chaudhuri-Hocquenghem (BCH) codes but extended with 1 extra parity bit. The error-correcting capability of these BCH codes are 2. Decoding is done using iterative bounded-distance decoding (BDD) with a window length of 8 and 7 iterations per window. More details about this are given in Section 2.2. Figure 1 shows that both LDPC and SCC have approximately 1 dB gap to their corresponding AIRs. It is also reported in [53][54][55] (SC-LDPC) that hardware-implementable state-of-the-art SD-FEC codes also have approximately 0.5-1.5 dB penalty from the AIR because of implementation limitations such as power consumption and memory limitations. The gap between AIR and practical FEC is due to the subopimalty of the codes and can be decreased by increasing the code length or performing a more efficient decoding algorithm. Although AIRs give a good first indication of performance, post-FEC BER simulations are necessary to evaluate the performance of a given FEC.

Example 1. AIRs in Equations
From the example above, we can see that there exist opportunities to enhance the achieved data rate in practice by improving the CM scheme to reduce the gaps. Considering the strict latency and complexity requirements, simple but powerful CM systems (e.g., CM with HD-FEC) are desirable. Optimizing the modulation format and improving the FEC performance is therefore of great importance.

Improved Modulation
GS constellations are generated by optimizing a cost function under certain constraints and typically for a given channel. The cost functions could be symbol error probability [56], MI [17,22,24], GMI [17,28], etc. Throughout this paper, we consider BICM, which is one of the most popular CM schemes due to its advantages of flexibility, simplicity, and parallelizability. The transmitted symbols X symbols with two real dimensions drawn uniformly from a discrete constellation X with cardinality M = 2 m = |X |. The jth constellation point is denoted by s j = s j,1 , s j,2 ∈ R 2 with j = 1, 2, . . . , M. We use the M × 2 matrix S = [s T 1 , s T 2 , . . . , s T M ] T to denote the 2D constellation. The jth constellation point s j is labeled by the length-m binary bit sequence which contains all unique length-m binary sequences. The 2D constellation and its binary labeling are fully determined by the pair of matrices {S, B}.
Without the loss of generality, we consider only one of the polarizations for the shaping optimization; however, we emphasize that all results in this paper is performed for 2D constellations on each polarization. Therefore, we can optimize the two-dimensional constellation coordinates and the corresponding labeling by maximizing the cost function. For BW decoders, the AIR-based optimization problem is to find a constellation S * and labeling B * for a given channel law f Y|X and an energy constraint, i.e., where G(S, B, f Y|X ) is equal to either I SD for SD-FEC or I HD for HD-FEC; σ 2 x represents the transmitted power; and S * and B * indicate the optimal constellation and labeling, respectively. In this paper, average power constraint is considered in the optimization. Besides that, peak-to-average power ratio constraint is commonly used in optical fiber communication to maximize the signal generation performance for the systems with limited resolution of digital-to-analog converters and power budget of the links.
Note that the optimization problem is a single objective function (AIR maximization) with multiple parameters. Many optimization methods can be used to find the optimal solution. This includes for example pairwise optimization algorithm [16,27,57], genetic algorithms [58], and machine learning approaches [24,[59][60][61][62]. It is well known that a constellation with Gray-mapping has the minimum loss between MI and GMI. However, due to the unavailability of Gray-mapping for most GS constellations, only jointly-optimized coordinates and labeling is capable of reducing the impact of non-Gray mapping constellation, and thus reduce the gap between GMI and MI.
The main advantage of GS over PS is its low implementation complexity, as only the mapping and demapping blocks of the transponder need to be modified by giving a look-up table. However, the GS constellations may result in more levels in both in-phase and quadrature branches. Therefore, the digital-to-analog converter/analog-to-digital converter may be required to be implemented with higher resolutions. In order to reduce the practical requirements, different constraints in Equation (3), such as symmetry, constant number of phase level, or amplitude, can also be used.

Improved FEC
SCC is constructed with a sequence of binary matrices C i ∈ {0, 1} w×w , i = 0, 1, 2, . . ., which are ordered like a staircase, as shown in ( [47], Figure 1b). Following the notations in [47], let (n c , k c , t) denote the parameters of BCH code C, where n c is the codeword length, k c is the information length, and t is the error-correcting capability. The code rate of SCC with component code of C is R = 2k c /n c − 1, while the size of C i is w = n c /2. Standard decoding of SCCs is performed using a sliding window, wherein BDD is used to iteratively decode each component code. BDD is very simple, however, its error correcting capability is limited to t = d 0 −1 2 errors, where d 0 is the minimum Hamming distance. Sometimes, BDD will erroneously decode a received sequence with more than t errors, a situation known as a miscorrection. Miscorrections are highly undesirable and are known to degrade the performance of iterative BDD by introducing additional errors into the iterative decoding process.
In order to identify miscorrections and address those sequences with larger than t errors, SABM decoder exploits the channel soft information (see the two red highlighted blocks in Figure 2). The SABM decoder calculates the absolute value of LLR (|λ|) to know about the reliability of each bit (a higher value of |λ| indicates a more reliable bit). According to the values of |λ|, the SABM decoder marks some highly reliable bits (HRBs) and highly unreliable bits (HUBs). If |λ| is larger than a threshold δ, the corresponding bit will be marked as a HRB. The selection of δ is discussed in [47] (Section 6). On the other hand, the d 0 − t − 1 bits with smallest |λ| values in each row of SCC block C i are marked as HUBs.

CM Transmitter
Optical Fiber Channel  With the marked information, once the component BDD decoder has finished decoding, the SABM decoder will check its decoding status. Only when BDD succeeds and the output is not in conflict with HRBs and/or zero-syndrome component codewords, the output will be accepted by the SCC decoder. Otherwise, it will be marked as a miscorrection and the suggested bit flips will be rejected. If miscorrection happens, SABM flips the d 0 − t − w H (e) most unreliable bits from the HUBs, where e is the error pattern detected by BDD and w H denotes Hamming weight. For BDD decoding failures, the decoder will also flip the most unreliable bit from the HUBs. The intuition is that this approach in some cases will make the new sequence close enough to the transmitted codeword, i.e., with only t errors (which can be handled by the component BDD decoder).
The main feature of the SABM algorithm is its low complexity, as explained in what follows. The SABM algorithm only requires modifications to the decoding structure of the last block of each decoding window. Furthermore, in the SABM algorithm, each component code needs to be decoded at most twice. The relative increased complexity around the optimum threshold δ is only~4% at the post-FEC BER of 10 −4 with respect to standard decoding for SCC, in terms of the number of additional calls to the component BDD decoder. Also, the algorithm is based on marking bits only, which simply requires to store very small part of the soft bits (LLRs). Finally, marked bits do not need to be tracked and updated during the iterative process either. The complexity of the SABM algorithm has been discussed in details in [47].

GS-SA-SCC
In the paper, we propose a CM scheme via combining geometric shaping and soft-aided SCC (GS-SA-SCC). As shown in Figure 2, information bits are encoded by a staircase encoder and then mapped to symbol x k from an obtained GS constellation with labeling via (3) as matrices pair {S * , B * }, where k is the discrete time index. The receiver for the GS-SA-SCC uses an HD-based demapper to estimate the bits. The HD-estimated bitsĉ 1,k , . . . ,ĉ m,k are then decoded by a staircase decoder implemented with SABM decoding. The LLR values λ 1,k , . . . , λ m,k for the m bits per symbol are calculated as in [50] (Equation (25)), according to the channel information y k and also the transmitted modulation format. The marked information is denoted byq i,k , i = 1, . . . , m, which can be HRB, HUB, or unmarked.
Typically, GMI-optimized constellations translate into improved performance of the binary SD-FEC. However, it also has been shown that the GMI-optimized constellations could provide a considerable gain for a HD decoder [31,63]. In addition, the FEC we considered in this paper is a hybrid HD/SD FEC. Due to the use of a soft-aided decoding algorithm, it should in principle also benefit from the GMI-optimized constellations. The intuition behind the chosen performance metric is that for a hybrid HD/SD decoding, the more soft information is harvested in the FEC decoder, more potential gains can be achieved by using GMI-based optimization. For these reasons, we optimize the 2D constellation by maximizing I SD instead of I HD in (3). In the next sections, we will investigate how much improvement will be achieved by using the proposed GS-SA-SCC CM scheme.

Performance Over AWGN Channel
In this section, we show the performance over the most commonly used channel for optical communications, i.e., AWGN channel, which neglects the correlation caused by dispersion and nonlinear interference. Figure 2 shows a schematic diagram of the proposed CM communication system. The channel block in Figure 2 is modeled under a circularly symmetric Gaussian noise assumption, i.e., p Y|X is fully determined by the variance σ 2 z , which corresponds to the total noise power per one complex dimensions. By solving the optimization problem in Equation (3), we obtained three groups of geometrically-shaped constellations with SEs of 6 bit/2D-sym, 7 bit/2D-sym, and 8 bit/2D-sym as follows.
(a) 128-GS. Red and blue circles indicate the bits b i = 0 and b i = 1, respectively (i = 1, 2, . . . , 7).  For SCCs, two component codes, i.e., BCH codes (504, 485, 2) and (228, 209, 2), are considered. They correspond to the acceptable overhead range between 7% and 20% in optical fiber communications, which are the highest and lowest code rates investigated in Figure 1. These parameters are obtained by shortening the 1 bit-extended BCH code (512, 493, 2) by 8 and 284 bits, respectively. The extended bit is a parity bit and added at the end of standard BCH code (511, 493, 2). These two BCH codes result in SCC code rates R ≈ 0.92 and R ≈ 0.83, respectively, and SCC block size w = 252 and w = 114, respectively The decoding window size is 8, and the number of iterations is 7. The threshold δ for marking HRBs is 5. Note that the threshold here is equivalent to δ = 10 in [47] due to the different definition of the SNR. Here, the SNR is defined as the energy per complex symbol over the noise variance over one complex dimension. Figure 5 shows In Figure 6, we depict the AIRs in Equation (2) for HD-FEC schemes with MQAM and M-GS. As can be seen, the GS constellations lead to a SNR gain in terms of SNR compared to uniform QAM in the case of multiple SEs. For a normalized AIR of 0.9 (diamond markers), the shaping gains of GS are 0.3 dB, 0.6 dB, and 0.35 dB for M = {64, 128, 256} with respect to MQAM, respectively. Note that the AIRs of 64QAM and 256QAM in Equation (1) for SD-FEC schemes are also shown as baselines in Figure 6. The gaps between I SD and I HD indicate a potential performance region for hybrid HD/SD FEC schemes. Figure 6 also shows (with markers) the SNR required for different CM schemes to achieve a post-FEC BER of 10 −6 . We remark that such a SNR gain is due to a twofold advantage of GS-SA-SCC over conventional SCC with QAM: (i) the shaping gains, which comes from the optimized constellation matrices pair {S * , B * }, and (ii) the improved decoding capability provided by the SABM decoding. For GS and SCC with code rate of 0.83, we observe the shaping gains of GS are 0.3 dB, 0.6 dB, and 0.35 dB, which is in excellent agreement with the prediction of AIR curves. Thanks to the processing of SABM, it enables an additional gain on top of GS. The overall (shaping plus coding) gains for M = {64, 128, 256} with GS and SABM decoding are up to 0.55 dB, 0.83 dB and 0.63 dB with respect to MQAM with standard decoding, respectively. Note that GS-SA-SCC with 128-GS and code rate of 0.92 even achieved almost the same rate as I HD with 128QAM (see green square marker and blue dotted curve in Figure 6). Comparing to the results in Figure 1, GS-SA-SCC with code rate of 0.83 reduced 0.55 dB of the~2 dB gap to LDPC with very small added complexity. In general, the I HD is the AIR of decoding system operating solely based on HD decoding, while the soft-aided decoders operate beyond this. In other words, the overall gains shown here are larger than I HD prediction. An interesting research is to find the performance metric of the soft-aided decoders, which is left for future work.

Numerical Analysis for an Optical Transmission System
In this section, numerical results of optical fiber transmission are presented by considering the channel block in Figure 2 as a nonlinear optical channel. As shown in Figure 2, the simulated system consists of an optical link comprising either unrepeatered single standard single-mode fiber (SSMF) span N sp = 1 or multiple SSMF spans with erbium-doped fiber amplifier (EDFA). The investigated link lengths are the typical distances of data center interconnects (< 200 km), metro, and regional links (200-1000 km). To support the targeted net data rates of >600 Gbit/s and >400 Gbit/s per wavelength for these two distance ranges above, two sets of constellations are investigated. The first set uses 256QAM and 256-GS for single span transmission, and second set uses 64QAM, 128QAM, 64-GS, and 128-GS for multi-span transmission. The fiber propagation is simulated through the split-step Fourier method numerical solution of the nonlinear Manakov equation, where the step size was uniformly set to 0.1 km. The maximum-reach gains are defined as the reach increase observed at fixed net data rate when switching from 64QAM, 128QAM, and 256QAM constellations to 64-GS, 128-GS, and 256-GS. The QAM or GS symbols are independently modulated over the two orthogonal polarization channels via polarization multiplexing. The simulation parameters are shown in Table 1.

Single-Span Fiber System Setup
We first simulated a single-span transmission link over SSMF. At the transmitter, the information bits of each polarization are independently encoded and the modulator maps the encoded bits either to 256QAM or to 256-GS symbols. After the CM encoder, the signal was modulated using a root raised cosine (RRC) filter. Dual-polarization 11 × 45 GBd WDM channels are generated with 50 GHz spacing, yielding 720 Gbit/s raw data rate per wavelength. The corresponding line rate per wavelength is 720 · R, where R is the code rate of FEC code. For each polarization of each WDM channel, independent sequences of 2 18 symbols were transmitted. An ideal coherent receiver is used for detection and analog-to-digital conversion. Chromatic dispersion is digitally compensated by a fast Fourier transform based filter, then the signal is matched filtered and downsampled at the Nyquist rate.

AIR Results
Results after propagation over single-span SSMF are shown in Figure 7 as AIR vs. launch power per WDM channel (P ch ). At the distance of 110 km, the GS gain, which is the difference (in AIR) between 64QAM and 64-GS constellation, is 0.17 bits/sym at the optimal launch power (2.5 dBm). At the distance of 120 km, the GS gain increases to 0.21 bits/sym at the optimal launch power (3.5 dBm).

Post-FEC BER Results
In order to evaluate the reach increase for the proposed CM for single-span transmission, we implemented SCCs for the central WDM channel (number 6) at the optimum launch power, which is obtained from Figure 7. Figure

Multi-Span Fiber System Setup
In the considered multi-span transmission link scenario, each SSMF span of length 80 km was followed by an EDFA compensating exactly for the link loss (16 dB) and with a noise figure of 5 dB. The receiver DSP algorithms utilised are identical to the ones described in single-span transmission scenario. I HD of four different modulation formats are calculated to predict the shaping gain and finding the optimal launch power. In addition, pre-FEC BER and post-FEC BER are measured to verify the performance of the CM scheme.

AIR Results
In Figure 9, two sets of results are shown for AIRs versus launch power over two multi-span transmission link, respectively. The first set (see Figure 9a) show the AIR for SE of 14 bits/sym, i.e., 128QAM and 128-GS, over 4 × 80 km SSMF and 6 × 80 km SSMF. Around the optimum launch power regime where linear and nonlinear propagation effects are comparable, 128-GS modulation gives 0.24 bit/sym and 0.5 bit/sym AIR gains with respect to 64QAM for over 4 × 80 km SSMF and 6 × 80 km SSMF, respectively The second set (see Figure 9b) show the AIRs for SE of 12 bits/sym, i.e., 64QAM and 64-GS, over 10 × 80 km and SSMF 15 × 80 km SSMF. At the optimum launch power, 64-GS modulation gives 0.17 bit/sym and 0.19 bit/sym AIR gains with respect to 256QAM for over 10 × 80 km SSMF and 15 × 80 km SSMF, respectively Note that for higher launch power regime where nonlinearity is dominant, performance improvements are still remaining.

Post-FEC BER Results
In order to evaluate the reach increase for the proposed CM for multi-span transmission, we implemented SCCs for multiple transmission distances and compared the post-SCC BER of MQAM and M-GS modulation formats (M ∈ {64, 128}). Two code rates, R 1 s = 233/252 and R 2 s = 95/114, are used for SCCs. Therefore, the line rates of 450 Gbit/s, 500 Gbit/s, 525 Gbit/s, and 583 Gbit/s can be obtained. In addition, we concatenated LDPC codes with code rate R l = 0.9 (as inner code) and a SCC with code rate R 1 s (as outer code) as a SD-FEC CM scheme baseline. The LDPC codes are the ones from the DVB-S2 standard, with a block length of N = 64800 bits, and 50 decoding iterations. The decoded bits from the LDPC decoder are used as input to the staircase decoder. The overall concatenated code rate is R = R 1 s · R l ≈ R 2 s , which is the code rate as SCC with R 2 s = 95/114.
In Figure 10, post-FEC BER performance of central WDM channel at the optimum launch power is shown for MQAM and M-GS modulation formats (M ∈ {64, 128}) at different line rates. The transmission reach extension of 64-GS with HD-FEC (SCC) at 500 Gbit/s and 450 Gbit/s are measured to be 80 km (13%) and 140 km (17%), respectively. Considering the higher complexity of the CM system combining 64QAM and SD-FEC (LDPC+SCC) as a baseline at the date rate of 450 Gbit/s, we observe only a gain of 75 km compared to the proposed scheme with 64-GS and SABM decoding. Therefore, the combination of HD-FEC and GS constellation offers a good complexity-gain trade-off. For a transmission range between 300 km and 500 km, the reach extension of 128-GS with HD-FEC (SCC) at 583 Gbit/s and 525 Gbit/s, are measured to be 70 km (24%) and 80 km (20%), respectively. show BER between two amplifiers (obtained by noise loading). SCC rates are R 1 s = 0.925 and R 2 s = 0.833. LDPC rate is R l = 0.9.

Discussion
The performance improvements of the proposed GS-SA-SCC CM schemes are summarized in Table 2, when compared to the existing CM schemes with standard SCC decoding and QAM modulations. Due to the greater sensitivity to the fiber nonlinear effect, the performance degradation of the system with the higher order modulation format is more serious than that of system with the lower order modulation format. Therefore, the observed distance increase is not significant as the SNR gain in the AWGN channel for constellation with 256 points. However, we can see that the proposed CM scheme can cover the distance from 100 km to 1000 km and provide a net data rate of up to 666 Gbit/s, which are suitable for short-reach DCI and metro links. One of the advantages of the proposed CM schemes is the 50 GHz grid compatibility, which is very common in most of optical network. For future the transponder with a 75 GHz fixed grid, these CM scheme could be also extended to beyond 1 Tbps without significant modification.

Conclusions
In this paper, a coded modulation scheme based on geometric shaping and soft-aided staircase code (GS-SA-SCC) was proposed and studied. The GS-SA-SCC coded modulation scheme is based on simple modification of hard-decision staircase decoder with the help marking bits and optimized constellations with corresponding labeling. The analysis was performed both in terms of achievable information rates and post-FEC BER. Up to 0.83 dB gains compared to standard SCC decoding with conventional QAM were obtained with a low added complexity. In this paper, we showed that multiple line rates can be obtained by using three geometrically-shaped constellations and two code rates. By combining these constellations with two HD-FEC codes, six line rates between 450 Gbps and 666 Gbps can be obtained. Furthermore, reach increases of up to 24% for a wide range of distances (from 100 km to 950 km) for both single-span and multi-span systems were reported. We believe that the proposed GS-SA-SCC scheme is a promising candidate for next generation data center interconect compatible coherent 800 Gbps and 1 Tbps system with higher baud rate, while providing considerable reach increase.