Next Article in Journal
Design of All-Optical Ternary Inverter and Clocked SR Flip-Flop Based on Polarization Conversion and Rotation in Micro-Ring Resonator
Previous Article in Journal
Generation of Four-Beam Output in a Bonded Nd:YAG/Cr4+:YAG Laser via Fiber Splitter Pumping
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Correction

Correction: Zhou et al. Research Progress on Modulation Format Recognition Technology for Visible Light Communication. Photonics 2025, 12, 512

1
Guangxi Key Laboratory of Functional Information Materials and Intelligent Information Processing, Nanning Normal University, Nanning 530001, China
2
Guangxi Geographical Indication Crops Research Center of Big Data Mining and Experimental Engineering Technology, Nanning Normal University, Nanning 530001, China
3
Guangxi Key Laboratory of Earth Surface Processes and Intelligent Simulation, Nanning Normal University, Nanning 530001, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Photonics 2025, 12(8), 761; https://doi.org/10.3390/photonics12080761
Submission received: 2 July 2025 / Accepted: 3 July 2025 / Published: 29 July 2025
Text Corrections
The following text corrections have been made to the original publication [1]:
1. 
The expression of Equation (7) in Section 2.2 has been revised to enhance its generality. The original simplified formulation placed the summation symbol outside the Bessel function and exponential term, implying an uncommon assumption that the carrier phases of symbols are independently distributed. The updated equation, which refers to Equation (19) in [16], provides a more comprehensive model.
A correction has been made to Section 2.2, Equation (7):
Γ R H i = E s n k , i | n = 0 N 1 I 0 n = 0 N 1 2 r n * s n k , i σ 2 × exp n = 0 N 1 s n k , i 2 σ 2 ,
2. 
The original expression of Equation (8) represented the GLRT function for a single received sample rather than the overall function. It would be more appropriate to use the overall GLRT likelihood function here.
A correction has been made to Section 2.3, Equation (8):
Γ GLRT = n = 0 N 1 max u i Γ r n H i , u i ,
3. 
The expression of Equation (9) has been generalized to better accommodate the various estimation methods applicable in the HLRT method. We have removed m a x I g and used I g ^ to represent the estimated value of I g .
Corrections have been made to Section 2.4, Equation (9) and its explanatory text:
Γ HLRT = E S g Γ R H i , I g ^ , S g ,
where R represents the multi-dimensional matrix of received signals, and Γ R H i , I g ^ , S g represents the joint likelihood function given the modulation hypothesis H i , the estimated active subcarrier index I g ^ , and the symbols S g .
4. 
A sentence was misplaced during the translation process. A sentence describing the HLRT method appeared incorrectly in Section 2.3. It has been moved to its proper location in Section 2.4 with the subject corrected to HLRT.
A correction has been made to Section 2.3 Paragraph 1:
The last sentence of the paragraph, which read “Additionally, the GLRT method has a high dependence on the initial parameter estimates; inaccuracies in these estimates can adversely affect the final recognition results.”, has been deleted.
A correction has been made to Section 2.4 Paragraph 1:
However, the complexity of the HLRT method remains high, especially when multiple parameters need to be addressed simultaneously. Additionally, the HLRT method has a high dependence on the initial parameter estimates; inaccuracies in these estimates can adversely affect the final recognition results.
5. 
Section 3.1, Paragraph 3 has been revised for greater clarity and technical precision. The original text’s explanation of the distinguishability of QAM/PAM signals was ambiguous; it now clearly states that the cumulant difference arises from the complex nature of the original signal. The symbol for normalized cumulants has been corrected from C 40 ~ to C 4 i ~ . To better describe the experiment in [19], the word “establish” was changed to “testing”. The original term was inappropriate as the threshold was pre-determined and then used for comparison, not established during the simulation. The description of complexity has been refined by removing the mention of Monte Carlo simulations, as its role in the cited work is limited to performance testing and does not affect the algorithm’s operational complexity.
Corrections have been made to Section 3.1, Paragraph 3:
By leveraging the asymmetric clipping characteristic of the ACO-OFDM signal, there is an essential difference in the cumulants of QAM signals compared to PAM signals, which stems from the original signal’s real/imaginary parts. The normalized cumulants C 4 i ~ of different-order PAM signals exhibit separable statistical boundaries. By testing classification thresholds through Monte Carlo simulations, a correct recognition rate of 88.9% can be achieved with 2000 samples when the SNR exceeds 15 dB. However, performance drops sharply when the SNR falls below 5 dB due to noise dominance (as shown in Figure 4). This study confirms that HOS possess both noise robustness and modulation sensitivity. However, the high computational complexity of HOS limits its feasibility in dynamic real-time applications.
6. 
The sentence preceding Equation (14) has been refined to enhance clarity. The previous phrasing could be misinterpreted as implying a temporal action of “reconstructing... before noise interference.” The intended meaning was to convey a state of matching the ideal and noise-free signal constellation. To express this concept more precisely, the text has been streamlined to “matching the ideal constellation.”
A correction has been made to Section 3.2, Paragraph 2, in the sentence before Equation (14):
They calculated the Euclidean distance between each symbol point and the cluster centers of the templates and classified the symbols into the nearest cluster, thereby matching the ideal constellation distribution.
7. 
The wording concerning the threshold processing function and feature extraction in Section 3.3 has been revised for greater accuracy. The term “adaptive” has been revised to clarify that the threshold is designed for flexible operator selection, not automatic adaptation. And our original description of feature extraction for the method in [21] was inaccurate regarding the information of wavelet domain and statistical measures.
Corrections have been made to Section 3.3, Paragraph 1:
The received signal is analyzed for time-frequency localization, combined with a threshold processing function based on noise statistics … to achieve the nonlinear suppression of high-frequency noise components. Here, δ is the selected threshold based on noise statistical characteristics,
Corrections have been made to Section 3.3, Paragraph 2:
In terms of feature extraction, information from the time domain and frequency domain is integrated. The maximum amplitude spectral density is extracted as the pivotal feature, while the normalized peak spacing is amalgamated to augment the separability of modulation modes.
8. 
In Section 3.4, the terminology for performance metrics has been standardized by correcting “sensitivity” to “accuracy”. The description of SVM performance was not clear. It has been made more specific to provide a clearer comparison.
Corrections have been made to Section 3.4, Paragraph 2:
The k-nearest neighbors (KNN) method, which employs dynamic neighborhood search, performs best at medium distances (2.32 m), maintaining an accuracy of 79.99% at 3 m. The support vector machine (SVM), which maximizes the classification margin through the hyperplane y = w T x + b , achieves comparable performance to LM at medium distances (2.25–2.86 m),
9. 
The interpretation of the symbols in Equation (18) has been revised. Our original explanation was based on a slight misinterpretation of the variable definitions in the cited work. The text has been corrected to accurately describe the definitions in the original reference.
A correction has been made to Section 3.6, Paragraph 1:
where X i z represents the i-th frequency-domain signal sample after FFT and FDE processing. z is the index of the OFDM symbol, and i is the sequence number of the subcarrier.
10. 
In our introduction to [25] in Section 4.1, the notation of the signal length has been corrected from n to N . The description of the model has been refined. Although the convolution kernel used in the work can be abstractly considered as a 3 × 1 temporal convolution kernel, it is now specified as 3 × FI × FO to more accurately reflect the original expression, and the text now clarifies that the channel design is configurable.
Corrections have been made to Section 4.1, Paragraph 1, The text below Equation (19):
where the signal length N is 128... For network architecture, a seven-layer convolutional network was designed with 3 × FI × FO convolutional kernels and configurable channel dimensions.
11. 
A repeated sentence following Equation (22) was deleted, and the word “reconstructs” has been clarified to “transforms” to better describe the process.
A correction has been made to Section 4.1, Paragraph 5:
where D is the distance from the projection source to the image center… This process transforms the constellation image, significantly enhancing inter-class separability.
12. 
In Section 4.1, our description of the work by Gu et al. [29] has been refined for greater accuracy. The term “alternating” was replaced by “utilizing” to clarify the application of two pooling methods. The discussion of robustness was also refined to emphasize the network’s inherent improvement. As for the description of the dropout layer, while its placement in the final layer is a reasonable logical inference based on the model’s structure, this detail was not explicitly stated in the reference. Therefore, it has been omitted to adhere strictly to the reference.
Corrections have been made to Section 4.1, Paragraph 7:
The network architecture consisted of a six-layer convolutional block structure, which balanced feature abstraction and detail retention by utilizing max pooling and average pooling. In strong turbulence scenarios, robustness was enhanced by integrating average pooling and batch normalization techniques, with a dropout rate of 0.6.
13. 
Our description of the work by Wang et al. [31] has been revised for greater accuracy. The “mAP” performance metric is now clearly specified in both the text and Table 4. Furthermore, the parameter count of the improved model has been corrected to 24.17MB, as the previously cited value was for the base model. This required revising the model’s description and updating the parameter reduction calculation to 79.3%.
Corrections have been made to Section 4.1, Paragraph 9:
At 20 dB high SNR, it reached an mAP of 0.993, all while maintaining a model parameter size of 24.17 MB, which represents a 79.3% reduction compared to YOLOv3.
14. 
The wording following Equation (23) has been refined by clarifying the “signal constellation”. It has been made clear that the Hough transform was applied to the constellation diagram.
A correction has been made to Section 4.1, Paragraph 11:
This transformation projects the modulated signal constellation into the ρ , θ parameter space, enhancing the amplitude and phase features of modulation formats such as QPSK and QAM.
15. 
Our description of the training process and data augmentation of the work by Zhang et al. [34] was inaccurate. It has been revised to more accurately reflect the reference and improve clarity. And "noise injection" is no longer categorized as data augmentation, as it is applied in the communication simulation.
Corrections have been made to Section 4.3, Paragraph 1:
In the early stages of training, tasks from fewer batches are utilized to learn the common features associated with channel fading. Subsequently, more batches including more diverse task combinations are gradually introduced, creating a progressive knowledge path… Coupled with data compensation strategies such as rotation augmentation, the framework effectively captures essential features,
16. 
The use of the word “integrate” in our introduction of the work by Zhao et al. could cause ambiguity. We changed it to “introduce” for better clarity.
Corrections have been made to Section 4.3, Paragraph 2:
Zhao et al. [36] proposed an enhanced solution that introduces active learning (AL) and transfer learning (TL) to address the challenge of acquiring labeled data in VLC systems… The results indicated that in an extremely small sample scenario with only 60 labeled samples, the introduction of AL and TL achieved accuracy improvements of 6.82% and 14.6%, respectively.
17. 
Our description of the work by Li et al. [37] has been revised for greater precision. In the feature extraction method, the terminology has been corrected from “image” to “map”. Additionally, our originally described voltage condition was relatively conservative; it has been updated to 0.2–1.3 V to more accurately reflect the full performance capabilities demonstrated in [37]’s source figures.
A correction has been made to Section 4.3, Paragraph 3:
For the original complex signal y i , they extracted both the IQ constellation map in the Cartesian coordinate system and the joint feature map of amplitude and phase in the polar coordinate system.
A correction has been made to Section 4.3, Paragraph 4:
Experiments demonstrated that within the LED bias range of 0.2–1.3 V, the system achieved over 90% recognition accuracy for six modulation types.
18. 
There was a typing error in the symbols following Equation (31). They have been corrected from “ b T and b S ” to “ p T ^ and p S ^
A correction has been made to Section 4.3, Paragraph 5:
here, p T ^ and p S ^ represent the non-target class distributions after removing the target class probability p t T ,
19. 
In our introduction of the dataset of reference [34], our original description of the query set was inaccurate. Unlike the validation set mentioned earlier, the query set contains unknown samples of existing classes, not unknown classes.
A correction has been made to Section 4.4, last sentence of paragraph 8:
The query set contained samples of the same classes for aggregating and optimizing parameters to achieve knowledge transfer and generalization.
20. 
Our performance description of the work in [51] has been updated to include necessary conditions for the stated accuracy, making the description more precise.
A correction has been made to Section 4.5, Paragraph 2:
The ternary weight quantization technique in [51], which restricts neural network parameters to {−1, 0, +1} at inference, combined with the MobileNetV3 architecture, achieves an average recognition accuracy of 90.1% on an ASIC.
Revision to Table 4
The entry for Wang Y. et al. [31] in Table 4 has been revised to more clearly distinguish its unique performance metric. To improve clarity and align with standard representation, the entry now explicitly includes “(mAP)” and presents the value in decimal format (0.993). The corrected Table 4 appears below.
Table 4. Summary and comparison of deep learning-based modulation recognition methods.
Table 4. Summary and comparison of deep learning-based modulation recognition methods.
ReferenceYearInputModelModulation TypesTypical Accuracy (%)Typical Conditions
Liu W. et al.
[27]
2020Pseudo constellation diagramGoogLeNet V3BPSK, 4/8/16/32/64QAM98SNR = 15 dB
Mortada B. et al.
[28]
2022Fan-beam constellation diagramAlexNet2/4/8/16PSK, 8/16/32/64QAM100OSNR = 15 dB
Gu Y. et al.
[29]
20222 × 1024 IQ sequenceCNNOOK, BPSK, QPSK, 16QAM99.98SNR = 10~30 dB
Gao W. et al.
[30]
20222 × 2N matrixDrCNN4/8/16QAM,8PSK, OOK, 16APSK98.3SNR = 20 dB
Wang Y. et al.
[31]
2024Time-frequency diagramYOLOv5sBPSK, QPSK, 8PSK, 16QAM0.993(mAP)SNR = 20 dB
Arafa N. et al.
[32]
2024Hough transform constellation diagramPre-Trained AlexNetQPSK, 8/16PSK, 4/8/16/32/64QAM100SNR = 12 dB
Gao W. et al.
[33]
20244 × N sequenceBiGRU2/4/8/16/32/64QAM, 8/16/32/64APSK>96Linear working area
Zhang L. et al.
[34]
2020Constellation diagramPGML-CNN4/8/16/32/64/128/256 QAM, 2/4/8ASK, 2/4/8/16/32 PSK95.63SNR = 6 dB
Zhao Z. et al.
[36]
2022Contour stellar imageAlexNet-AL2/4/8/16/32/64QAM88.78SNR = 0~15 dB
Li F. et al.
[37]
2023IQ samples processed by coordinate transformation and folding algorithm.Reservoir ComputingOOK, 4QAM, 8QAM-DIA, 8QAM-CIR, 16APSK, 16QAM>90Linear working area
Yao L. et al.
[39]
2024Constellation diagramCIKD-CNNPAM4, QPSK, 8QAM-CIR, 8QAM-DIA, 16/32QAM, 16/32APSK100Ideal working area
Zheng X. et al.
[40]
2024Constellation diagramTCN-LSTM + MMAnet4/8/16/32/64QAM99.2SNR = 4 dB
Updated Funding
The name and number of the first project in the Funding section has been updated:
Innovation Project of Guangxi Graduate Education (No. YCSW2025507)
The authors apologize for any inconvenience these corrections may cause. The authors state that the scientific conclusions are unaffected. The corrections were approved by the Academic Editor. The original publication has also been updated.

References

  1. Zhou, S.; Du, W.; Li, C.; Liu, S.; Li, R. Research Progress on Modulation Format Recognition Technology for Visible Light Communication. Photonics 2025, 12, 512. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhou, S.; Du, W.; Li, C.; Liu, S.; Li, R. Correction: Zhou et al. Research Progress on Modulation Format Recognition Technology for Visible Light Communication. Photonics 2025, 12, 512. Photonics 2025, 12, 761. https://doi.org/10.3390/photonics12080761

AMA Style

Zhou S, Du W, Li C, Liu S, Li R. Correction: Zhou et al. Research Progress on Modulation Format Recognition Technology for Visible Light Communication. Photonics 2025, 12, 512. Photonics. 2025; 12(8):761. https://doi.org/10.3390/photonics12080761

Chicago/Turabian Style

Zhou, Shengbang, Weichang Du, Chuanqi Li, Shutian Liu, and Ruiqi Li. 2025. "Correction: Zhou et al. Research Progress on Modulation Format Recognition Technology for Visible Light Communication. Photonics 2025, 12, 512" Photonics 12, no. 8: 761. https://doi.org/10.3390/photonics12080761

APA Style

Zhou, S., Du, W., Li, C., Liu, S., & Li, R. (2025). Correction: Zhou et al. Research Progress on Modulation Format Recognition Technology for Visible Light Communication. Photonics 2025, 12, 512. Photonics, 12(8), 761. https://doi.org/10.3390/photonics12080761

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop