Unified Stochastic Differential Equation Modeling and Fuzzy-RL Control for Turbulent UWOC
Abstract
1. Introduction
- (1)
- From Physics to Stochastic Dynamics: Starting from the RTE, a physically motivated SDE representation is derived via diffusion approximation. An inverse Fokker–Planck equation approach is introduced to demonstrate that the proposed SDE acts as a unified stochastic generator for a broad class of stationary channel statistics, with the inverse Fokker–Planck formulation providing a mathematical route for reconstructing drift fields from target stationary PDFs.
- (2)
- Coupled Nonlinear Modeling: The coupling between AUV attitude dynamics and channel fading is explicitly modeled. Consequently, the beam alignment problem is reformulated as the stabilization of a nonlinear system subjected to multiplicative noise, extending classical wave propagation theories [34] to mobile engineering applications.
- (3)
- Hybrid Intelligent Control: A lightweight hybrid Fuzzy-RL controller is designed to provide robust adaptive beam-pointing control under the reduced plant model, with mean-square boundedness and asymptotic convergence to a bounded neighborhood established under Assumptions A1–A3 stated in Appendix A.1. By integrating the logical interpretability of fuzzy rules with the adaptive optimization of tabular Q-learning, this strategy effectively mitigates deep fading events with minimal computational overhead compared to deep RL approaches.
2. Materials and Methods
2.1. AUV Platform Model
2.2. Channel-Pointing Coupling Model
- : path-loss term based on the Beer–Lambert law.
- : extinction coefficient. This parameter is water-dependent and represents the sum of absorption and scattering. It is measured in , with values approximately ranging from 0.1 to 1 in the blue-green wavelength band.
- slant range, representing the Euclidean distance between the AUV and the receiver, which varies over time.
- : current gain.
- : pointing-recovery term. Here, is the scaling factor, is the current gimbal pointing vector, is the ideal alignment angle, and is the bm-divergence-related angular parameter. This Gaussian penalty function approaches when , and decays exponentially as the pointing deviation increases, thereby simulating power loss due to misalignment.
- : increment of the Wiener process.
- : diffusion coefficient. For weak turbulence where the Rytov variance satisfies , it can be approximated as
2.3. Parameter Mapping and Statistical Averaging
2.4. From Physics to Statistics: Development and Statistical Validation of the RTE–SDE Channel Model
Validity Conditions and Scope of the Diffusion-Based Reduction
- (1)
- : Inverse-Gamma distribution.
- (2)
- : Gamma distribution, dominated by additive noise.
- (3)
- : Generalized Gamma distribution family.
- (1)
- Multiplicative Noise Regime (): In this baseline scenario, turbulence acts as a multiplicative modulator. The diffusion intensity scales quadratically with signal amplitude (), leading to a heavy-tailed inverse-Gamma distribution. This regime corresponds to the classical saturated turbulence model where variance is dominated by diffusion.
- (2)
- Additive Noise Regime (): As approaches 1, the noise dependency becomes linear, approximating a Gamma distribution. This regime reflects scenarios where background additive noise or scattering becomes comparable to turbulent fluctuations.
- (3)
- Intermediate Regime (): By tuning continuously, the system traverses the generalized Gamma family. This flexibility allows the SDE to dynamically adapt to varying turbulence strengths—from the stability of deep water to the volatility of coastal zones.
2.5. Numerical Implementation and Baseline Verification
- (1)
- Numerical Stability: The Euler–Maruyama solver maintains stability without divergence even under high-variance conditions.
- (2)
- Physical Consistency: The SDE correctly reproduces the classical statistical behavior of saturated turbulence when configured in the multiplicative noise regime.
2.6. Numerical Fitting with Existing Models
3. Fuzzy-RL Hybrid Adaptive Controller
3.1. Formulation of the Adaptive Optimization Problem
- (1)
- State variables define the state as . Here, represents the angular error of the laser spot relative to the target center, and denotes its time derivative. The state undergoes fuzzy partitioning before being input to the Fuzzy-RL controller. This reflects fuzzy membership degrees across different error levels.
- (2)
- Action variable (action): Let . This represents the angular fine-tuning increment for the current step. It is adjusted by the reinforcement learning module based on fuzzy outputs to achieve a dynamic balance between rapid response and steady-state performance.
- (3)
- State transition: The system dynamics are jointly determined by the closed-loop response of the gimbal servo mechanism and channel disturbances: where denotes random disturbances caused by turbulence and observation noise.
3.2. Fuzzy-RL Controller Design
- (1)
- TS Rule Form: The fuzzy rule is expressed as follows. Rule : If is AND is , then , where . The final baseline output is the weighted sum of linear consequents.
- (2)
- Membership Functions: Gaussian membership functions are employed. Their centers are determined by offline clustering, and widths are set according to coverage and overlap criteria.
- (3)
- Fuzzy Parameter Strategy: By default, the membership structures are frozen, and only the consequent parameters are fine-tuned to ensure reproducibility. As an optional control experiment, the consequent parameters may undergo online projection updates with a small learning rate.
3.3. Reinforcement Learning Component Design
3.4. Compact Pseudocode
3.5. Computational Complexity Analysis
3.6. Theoretical Analysis and Innovations
4. Results
4.1. Numerical Simulation and Controller Benchmarking
- (1)
- Channel Dynamics: A baseline moderate turbulence is generated. A forced “bubble blockage” event is introduced between s and s, during which the channel gain drops to near zero (), simulating a complete signal interruption.
- (2)
- Sensor Noise: During the blockage, the vis feedback is assumed to be unreliable, outputting random measurement noise or drifting values.
- (3)
- Baselines: the pre-trained Fuzzy-RL controller was benchmarked against PID, anti-windup PID, ADRC, and MPC under the same plant and disturbance setup.
4.2. Controller Robustness and Parameter Sensitivity Analysis
4.3. Hardware-in-the-Loop (HIL) Experimental Setup
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A. Mathematical Analysis of Closed-Loop Boundedness and Convergence
Appendix A.1. Assumptions and Existence of SDE Solutions
Appendix A.2. Proof of Theorem A1: Convergence to Suboptimal Policy
Appendix A.3. Proof of Theorem A2: Closed-Loop Stochastic Stability
Appendix B. Derivation of SDE Drift Fields via Inverse Fokker–Planck Equation
Appendix B.1. The Correct Inverse Fokker–Planck Formulation
Appendix B.2. Reconstruction for Generalized Gamma (GG) Turbulence
Appendix C. Auxiliary Derivations for the Fokker–Planck and Moment Formulations
- (1)
- the Fokker–Planck and moment equations associated with the channel SDE introduced in Section 2.2;
- (2)
- the intermediate reduction from the diffusion-form Radiative Transfer Equation (RTE) to the scalar stochastic differential equation used in Section 2.4.
Appendix C.1. Fokker–Planck and Moment Equations Associated with the Channel SDE
Appendix C.2. Intermediate Reduction from the Diffusion-Form RTE to the Scalar SDE
- The diffusion-form RTE provides the physical transport background.
- The master equation and Kramers–Moyal expansion provide the probabilistic bridge from transport to stochastic dynamics.
- Stochastic averaging yields the low-dimensional parametric drift-diffusion form used for analysis and control design.
Appendix C.3. Relation to the Main Text
References
- Ishimaru, A. Wave Propagation and Scattering in Random Media; Academic Press: Cambridge, MA, USA, 1978. [Google Scholar]
- Garnier, J. Wave propagation in random media: Beyond Gaussian statistics. ESAIM Proc. Surv. 2023, 74, 63–89. [Google Scholar] [CrossRef]
- Wynn, R.B.; Huvenne, V.A.I.; Le Bas, T.P.; Murton, B.J.; Connelly, D.P.; Bett, B.J.; Ruhl, H.A.; Morris, K.J.; Peakall, J.; Parsons, D.R.; et al. Autonomous Underwater Vehicles (AUVs): Their past, present and future contributions to the advancement of marine geoscience. Mar. Geol. 2014, 352, 451–468. [Google Scholar] [CrossRef]
- Yan, T.; Xu, Z.; Yang, S.X.; Gadsden, S.A. Formation control of multiple autonomous underwater vehicles: A review. Intell. Robot. 2023, 3, 1–22. [Google Scholar] [CrossRef]
- Jamali, M.V.; Mirani, A.; Parsay, A.; Abolhassani, B.; Nabavi, P.; Chizari, A.; Khorramshahi, P.; Abdollahramezani, S.; Salehi, J.A. Statistical studies of fading in underwater wireless optical channels in the presence of air bubble, temperature, and salinity random variations. IEEE Trans. Commun. 2018, 66, 4706–4723. [Google Scholar] [CrossRef]
- Qiu, H.; Huang, Z.; Xu, J.; Zhai, W.; Gao, Y.; Ji, Y. Unified statistical thermocline channel model for underwater wireless optical communication. Opt. Lett. 2023, 48, 636–639. [Google Scholar] [CrossRef]
- Boluda-Ruiz, R.; García-Zambrana, A.; Castillo-Vázquez, B.; Hranilovic, S. Impact of angular pointing error on BER performance of underwater optical wireless links. Opt. Express 2020, 28, 34606–34622. [Google Scholar] [CrossRef] [PubMed]
- Ijeh, I.C.; Khalighi, M.A.; Elamassie, M.; Hranilovic, S.; Uysal, M. Outage probability analysis of a vertical underwater wireless optical link subject to oceanic turbulence and pointing errors. J. Opt. Commun. Netw. 2022, 14, 439–453. [Google Scholar] [CrossRef]
- Al-Habash, M.A.; Andrews, L.C.; Phillips, R.L. Mathematical model for the irradiance probability density function of a laser beam propagating through turbulent media. Opt. Eng. 2001, 40, 1554–1562. [Google Scholar] [CrossRef]
- Zedini, E.; Oubei, H.M.; Kammoun, A.; Hamdi, M.; Ooi, B.S.; Alouini, M.S. Unified statistical channel model for turbulence-induced fading in underwater wireless optical communication systems. IEEE Trans. Commun. 2019, 67, 2893–2907. [Google Scholar] [CrossRef]
- Arya, V. A comprehensive review on underwater optical wireless communication (UOWC) systems: Present and future prospective. J. Opt. Commun. 2025. [Google Scholar] [CrossRef]
- Lou, Y.; Cheng, J.; Nie, D.; Qiao, G. Performance of underwater wireless optical communications in presents of cascaded mixture exponential-generalized gamma turbulence. arXiv 2020, arXiv:2008.02868. [Google Scholar] [CrossRef]
- Alheadary, W.G.; Park, K.H.; Alouini, M.S. Performance analysis of multihop heterodyne free-space optical communication over general Malaga turbulence channels with pointing error. Optik 2017, 151, 34–47. [Google Scholar] [CrossRef]
- Le-Tran, M.; Kim, S. Performance analysis of multi-hop underwater wireless optical communication systems over exponential-generalized gamma turbulence channels. IEEE Trans. Veh. Technol. 2022, 71, 6214–6227. [Google Scholar] [CrossRef]
- Cochenour, B.; Mullen, L.; Muth, J. Temporal response of the underwater optical channel for high-bandwidth wireless laser communications. IEEE J. Ocean. Eng. 2013, 38, 730–742. [Google Scholar] [CrossRef]
- Vali, Z.; Michelson, D.; Ghassemlooy, Z.; Noori, H. A survey of turbulence in underwater optical wireless communications. Optik 2025, 320, 172126. [Google Scholar] [CrossRef]
- Meyers, B.; Mangelson, J.G. Testing and Evaluation of Underwater Vehicle Using Hardware-in-the-Loop Simulation with HoloOcean. arXiv 2025, arXiv:2511.07687. [Google Scholar]
- Orjales, F.; Rodríguez-Cortegoso, J.; Fernández-Pérez, E.; Romero, A.; Diaz-Casas, V. Towards a Digital Twin for Open-Frame Underwater Vehicles Using Evolutionary Algorithms. Appl. Sci. 2025, 15, 7085. [Google Scholar] [CrossRef]
- Almonacid, L.; Játiva, P.P.; Azurdia-Meza, C.A.; Dujovne, D.; Soto, I.; Firoozabadi, A.D.; Gaitan, M.G. On the path loss performance of underwater visible light communication schemes evaluated in several water environments. In 2023 South American Conference On Visible Light Communications (SACVLC); IEEE: Piscataway, NJ, USA, 2023; pp. 12–16. [Google Scholar]
- Martínez, G.; Játiva, P.P.; Gaitán, M.G.; Meza, C.A.; Boettcher, N.; Zabala-Blanco, D. On the Performance of an Air-Water Visible Light Communication System. In Advanced Research in Technologies, Information, Innovation and Sustainability; Springer Nature: Cham, Switzerland, 2024; pp. 380–394. [Google Scholar]
- Dimitropoulos, D.; Jalali, B. Stochastic differential equation approach for waves in a random medium. Phys. Rev. E Stat. Nonlinear Soft Matter Phys. 2009, 79, 036606. [Google Scholar] [CrossRef]
- Borcea, L.; Garnier, J. Derivation of a one-way radiative transfer equation in random media. Phys. Rev. E 2016, 93, 022115. [Google Scholar] [CrossRef]
- Enghiyad, N.; Sabbagh, A.G. Impulse response of underwater optical wireless channel in the presence of turbulence, absorption, and scattering employing Monte Carlo simulation. J. Opt. Soc. Am. A 2022, 39, 115–126. [Google Scholar] [CrossRef] [PubMed]
- Mobley, C.D. Light and Water: Radiative Transfer in Natural Waters; Academic Press: Cambridge, MA, USA, 1994. [Google Scholar]
- Gao, C.-Z.; Yin, J.-W.; Cai, Y.; Fan, Z.-F.; Wang, P.; Wang, J.-G. Stochastic radiative transfer in random media. III. Effective opacity. Phys. Rev. E 2025, 111, 044115. [Google Scholar] [CrossRef] [PubMed]
- Fu, Q.; Liou, K.N. On the correlated k-distribution method for radiative transfer in nonhomogeneous atmospheres. J. Atmos. Sci. 1992, 49, 2139–2156. [Google Scholar] [CrossRef]
- Ambrocio, J. A Self-Consistent Obstacle Scattering Theory for the Diffusion Approximation of the Radiative Transport Equation. Ph.D. Thesis, University of California, Merced, CA, USA, 2008. [Google Scholar]
- Qiu, H.; Huang, Z.; Xu, J.; Motani, M.; Ji, Y. Channel modelling, performance analysis, and probabilistic shaping for underwater wireless optical communications. IEEE J. Sel. Areas Commun. 2025, 43, 1568–1581. [Google Scholar] [CrossRef]
- Gkoura, L.K.; Roumelas, G.; Nistazakis, H.E.; Sandalidis, H.G.; Vavoulas, A.; Tsigopoulos, A.D.; Tombras, G.S. Underwater optical wireless communication systems: A concise review. In Turbulence Modelling Approaches—Current State, Development Prospects, Applications; IntechOpen: London, UK, 2017. [Google Scholar]
- Nami, O.F.; Widaryanto, A.; Rasuanta, M.P.; Pramudya, T.; Firdaus, M.Y.; Widati, P.L.; Anggraeni, S.P.; Dwiyanti, H.; Rahmadiansyah, M.; Purwoadi, M.A.; et al. Performance Comparison of PID, FOPID, and NN-PID Controller for AUV Steering Problem. J. Elektron. Dan Telekomun. 2024, 24, 72–79. [Google Scholar] [CrossRef]
- Qu, X.; Jiang, Y.; Zhang, R.; Long, F. A deep reinforcement learning-based path-following control scheme for an uncertain under-actuated autonomous marine vehicle. J. Mar. Sci. Eng. 2023, 11, 1762. [Google Scholar] [CrossRef]
- Pang, Z.; Wang, H.; Cheng, J.; Tang, S.; Park, J.H. Stability and Fuzzy Optimal Control for Nonlinear Itô Stochastic Markov Jump Systems via Hybrid Reinforcement Learning. IEEE Trans. Fuzzy Syst. 2024, 32, 6472–6485. [Google Scholar] [CrossRef]
- Liu, K.; Ban, X.; Xie, S. Fuzzy reinforcement learning based control of linear systems with input saturation. ISA Trans. 2025, 158, 405–414. [Google Scholar] [CrossRef]
- Abdullaev, F.K.; Hensen, J.H.; Bischoff, S.; Sørensen, M.P.; Smeltink, J.W. Propagation and interaction of optical solitons in random media. J. Opt. Soc. Am. B 1998, 15, 2424–2432. [Google Scholar] [CrossRef]










| Symbol | Physical Interpretation | Unit | Representative Range/Typical Form |
|---|---|---|---|
| Effective linear attenuation rate, encompassing absorption and out-scattering losses along the propagation path | 0.10–0.50 | ||
| Restoring/compensation coefficient, representing the combined effect of beam re-alignment recovery and equivalent source compensation | if is normalized; otherwise | 0.05–0.30 | |
| Diffusion intensity coefficient, characterizing the strength of turbulence-induced irradiance fluctuations | if is normalized | 0.15–0.65 | |
| State-dependent noise exponent, governing the nonlinear scaling of multiplicative diffusion with respect to the instantaneous channel state | (dimensionless) | 1.0–2.0 | |
| Reduced stochastic state variable in the averaged SDE; equivalent received optical intensity used for modeling and control | (dimensionless, normalized) | ||
| Normalized instantaneous channel gain, defined as the ratio of received optical irradiance/power to the transmit reference level | (dimensionless) | ||
| Additive receiver-side noise variance, aggregating thermal noise, shot noise, and residual background contributions | in physical IM/DD form; dimensionless in normalized proxy form | – (normalized) | |
| Increment of the Wiener process driving the continuous stochastic fluctuations | |||
| Jump term used to capture abrupt rare events such as bubble blockage or transient occlusion | same as | sparse/event-driven | |
| Optical extinction coefficient, equal to the sum of absorption and scattering coefficients | 0.08–1.0 | ||
| Slant propagation distance between transmitter and receiver | m | 1–5 | |
| Angular spread/beam-divergence-related parameter in the Gaussian pointing-loss term | or normalized angular variance | application-dependent |
| Parameter | Value/Specification | Remarks |
|---|---|---|
| Gimbal actuator | 2804 BLDC + AS5600 | SimpleFOC closed-loop control, feedback every 5 ms |
| Camera | 640 × 480 @ 30 fps | Measured frame delay ≈ 92 ms, FOV 60°/45° |
| Camera–target distance | 0.18 m | Used for ArUco and spot detection |
| Laser–target distance | 2.0 m | Tank-scale propagation path |
| Laser source | 520 nm, 80 mW | Constant-current driven |
| Water type | Clear water | Attenuation coefficient |
| ArUco physical spacing | 110 mm | Used for pixel-scale calibration |
| Control cycle | 5 ms (embedded)/~33 ms (camera) | Fusion of serial and visual feedback |
| Metric | Value |
|---|---|
| Average alignment error | |
| Standard deviation | |
| Median |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Si, B.; Hou, J.; Ning, D.; Gong, Y.; Yi, M.; Zhang, F. Unified Stochastic Differential Equation Modeling and Fuzzy-RL Control for Turbulent UWOC. J. Mar. Sci. Eng. 2026, 14, 792. https://doi.org/10.3390/jmse14090792
Si B, Hou J, Ning D, Gong Y, Yi M, Zhang F. Unified Stochastic Differential Equation Modeling and Fuzzy-RL Control for Turbulent UWOC. Journal of Marine Science and Engineering. 2026; 14(9):792. https://doi.org/10.3390/jmse14090792
Chicago/Turabian StyleSi, Bowen, Jiaoyi Hou, Dayong Ning, Yongjun Gong, Ming Yi, and Fengrui Zhang. 2026. "Unified Stochastic Differential Equation Modeling and Fuzzy-RL Control for Turbulent UWOC" Journal of Marine Science and Engineering 14, no. 9: 792. https://doi.org/10.3390/jmse14090792
APA StyleSi, B., Hou, J., Ning, D., Gong, Y., Yi, M., & Zhang, F. (2026). Unified Stochastic Differential Equation Modeling and Fuzzy-RL Control for Turbulent UWOC. Journal of Marine Science and Engineering, 14(9), 792. https://doi.org/10.3390/jmse14090792

