An Atmospheric Turbulence Compensation Algorithm Based on FSM-DM Cascaded AO Architecture for FSO Communications

: Free-space optics (FSO) communication enjoys desirable modulation rates at unexploited frequency bands, however, its application is hindered by atmospheric turbulence which causes phase shifting in laser links. Although a single deformable mirror (DM) adaptive optics (AO) system is a good solution, its performance remains unsatisfactory as the proportion of tilts aberrations becomes relatively high. This condition happens when the incident angle of the laser beam for the optical receiver dynamically shifts. To tackle this problem, we introduce a fast steering mirror (FSM), DM cascaded AO architecture, based upon which we also propose an atmospheric turbulence compensation algorithm. In this paper, we compare the compensation ability of FSM and DM towards tilts aberrations. Furthermore, we gain model matrices for FSM and DM from testbed and simulatively verify the effectiveness of our work. For a Kolmogorov theory-based atmospheric turbulence disturbed incident laser beam where the tilt components take up 80% of the total proportion of wavefront aberrations, our proposed architecture compensates the input wavefront to a residual wavefront root mean square (RMS) of 116 wavelength, compared to 16 wavelength for single DM architecture. The study intends to overcome atmospheric turbulence and has the potential to guide the development of future FSO communications.


Introduction
The explosive proliferation of wireless communications poses a shortage of frequency band resources [1]. FSO communications have been increasingly attractive due to the large capacity brought by unexploited spectrum as well as high security [2]. Nevertheless, despite their promising prospects, atmospheric turbulence has long been the principal encumbrance clogging the development of the subject. Spatially, inhomogeneities of temperature, humidity, momentum and other atmospheric components lead to a space-varying atmospheric refractive index along the transmission link causing phase shifting. Meanwhile, the existence of atmosphere turbulence significantly enhances the vertical and horizontal exchange of the above atmospheric components which is greater than the exchange intensity of molecular motion. Such influence is generalized as atmospheric turbulence. Displayed in the dynamic variation of the amplitude and phase of the received laser signal, the influence gradually becomes prominent when distance between communication terminals goes beyond 1 km [3]. According to [4], various models of the power spectrum were established to describe statistical features including the Kolmogorov power-law spectrum, the Tatarskii spectrum, the von Karman spectrum, and the modified atmospheric spectrum.
In this study, we focus on a ground-to-ground scenario where communications happen between base stations. The study conducts theoretical pre-research for the subsequent use of laser communication base stations instead of microwave relay base stations so as to achieve 40∼100 Gbps communication rate. As a basic method to improve signal-to-noise ratio (SNR) at the receiver so as to enhance service quality, enlarging SNR from the sender would be an effective method but it is not always the most agreeable solution. On the contrary, correcting phase shifting, compensating wavefront aberrations would benefit the sender in terms of energy saving. Therefore, AO aiming at extracting real-time wavefront distortion and dynamically compensating wavefront aberrations rises up as a prospective countermeasure which could reduce the likelihood of signal fading and mitigate intensity variations (scintillation) [5,6]. Existing studies [7][8][9] have shown that AO techniques are of significant practical value in satellite-to-ground optical communication. Our findings are not only applicable to ground-to-ground FSO communications; as long as it involves dynamically compensating the incident light with a continuously changing phase, our algorithm can be effective with wavefront correctors with suitable stroke. For example, in optical modulation, beam shaping, astronomy, and microscopic imaging [10][11][12].
In general, an AO system consists of a wavefront reconstructor, a wavefront sensor, and a wavefront corrector [13], where the wavefront sensor generates real-time feedback of wavefront aberrations and the wavefront reconstructor computes commands instructing an appropriate deformation of the corrector. The Shark-Hartmann wavefront sensor (SHWS) which gauges atmospherically induced wavefront aberrations across an aperture is widely used in diverse studies [14][15][16][17]. In addition, embedded development boards free from the operating system with better instantaneity and a flexible size would be better suited for a wavefront reconstructor than computers in engineering applications. As for the wavefront corrector, DMs are the most frequently employed optical hardware, among which, micromachined membrane DMs (MMDM) [18] taking advantage of low hysteresis and negligible power consumption are applicable in high-precision scenarios. By contrast, piezoelectric DMs (PDM) [19] benefit from large strokes and high response frequency, satisfying the demand of large-scale aberration compensation. Apart from the above two DMs, unimorph DMs [20] and liquid DMs [21] are indispensable devices used in many AO architectures as well. In addition to DMs, some studies [22] also introduce a tip-tilt mirror into the system to compensate for tilt aberration.
For double wavefront correctors' AO architecture, the distribution scheme of the workload determines the performance of an algorithm. Additionally, the modeling strategy affects the implementation of modeling and the computation of controlling voltages. Ref. [23,24] used a Lagrange-multiplier (LM)-based damped least-squares (DLS) control method based on a function of wavefront slopes for controlling voltages. It solved controlling voltages by multiplying the inverse of the influence matrix to the left of the sensed wavefont slopes. The method receives effective results, though, it cannot thoroughly remove the coupling between wavefront correctors. Therefore, the damping factor should be designed carefully. Otherwise, either a conservative iteration step is selected or the stability of the algorithm may suffer a loss. In [25,26], the authors decouple low-order aberrations from high-order aberrations by separating Zernike coefficients and distributing a reasonable workload to wavefront correctors. However, the modeling strategy for FSM considers angle value as a variable of Zernike coefficients. Similarly, the model for DM depicts the influence of controlling voltages towards Zernike coefficients. Firstly, the Zernike coefficients cannot be directly observed from SHWS which may introduce a fitting error into the model by using a wavefront restoration matrix fitting Zernike coefficients from wavefront slopes. Secondly, controlling voltages are the direct signal when using an embedded development board which means that another mapping relationship can be built when controlling FSM. To make the dependent and independent variables in a modeling strategy more intuitive, ref. [27,28] built a linear mapping relationship between controlling voltages and wavefront slopes and used a digital PI controller when solving controlling voltages. The latter arranges the Zernike polynomials into a matrix and makes a linear transformation to make the algorithm adapt to irregular pupil regions. Nevertheless, according to our previous work [29], the spots offsets (which are proportional to the wavefront slopes) are a linear function of the square of the controlling voltages.
Consequently, we separate the Zernike coefficients to distribute the workload considering the balance of complexity and performance and use a complete second-order modeling strategy to depict the mapping relationship between spots offsets and controlling voltages. Furthermore, due to the nonlinearity of the model structure, we use an iterative method to solve controlling voltages.
The main contribution of this paper is that we use Zernike mode decomposition to decompose the wavefront into 2nd, 3rd and 4th∼15th orders, and use FSM to compensate for low-order aberrations with DM compensating for high-order aberrations, which improves the compensation range and response rate to tilts aberrations of the AO architecture allowing the AO architecture to high-dynamically compensate aberrations. Then, the DM is modeled with the complete second-order model and the FSM is modeled with the coupled second-order model to ensure compensation accuracy. Finally, we use the iterative compensation algorithm to solve the controlling voltages.
The remainder of this paper is structed as follows: Section 2 introduces the proposed FSM-DM cascaded architecture. Section 3 introduces our proposed high-dynamic atmospheric turbulence compensation algorithm which is our main contribution. Section 4 illustrates experimental simulative results collected from our testbed which is our third contribution. Finally, conclusions are drawn in Section 5.

FSM-DM Cascaded AO Architecture
In this section, we demonstrate the design of our proposed AO architecture and compare the theoretical potential of DM and FSM compensating tilts aberrations.

Proposed Architecture Description
The suggested AO architecture is displayed in Figure 1 which could be embedded into optical communication receiving terminals as a wavefront compensation module. When the system is powered on, an ideal incident laser beam is needed for calibration modeling. After the accomplishment of it, the terminal performs closed-loop compensation. During the procedure of closed-loop compensation, images from SHWS are passed to the wavefront controller which drives FSM and MMDM simultaneously.

Theoretical Compensation Ability of FSM and DM Compensating Tilts Aberrations
For MMDM, the reflective surface is a static electricity controlled membrane shown in Figure 2a. Configured by the wavefront reconstructor, controlling voltages are passed to the electrodes which produce electrostatic force pulling and pushing the reflective film to form a desired shape. The advantages of MMDM lie in its cabinet size allowing it to work in a narrow environment. However, due to the absence of a physical connection between the flexible membrane and electrodes, the maximum deflection of a membrane varies from electrode to electrode. Meanwhile an inhomogeneous distribution of the tension on the mirror surface thwarts the stablization of deviation caused by marginal electrodes. Hence, the mirror could scarcely form an inclined plane to compensate tilts aberrations with the whole area. Generally the most suitable use scenario for MMDM is high-frequency compensation. As for the PDM, a brief schematic is displayed in Figure 2b. Unlike MMDM, the flexible surface is physically linked with piezoelectric ceramics. Consequently, even marginal electrodes share nearly the same rigorous mapping relationship between digital voltages configured to the electrode and stretching the measurement of the corresponding piezoelectric ceramics with the centre electrodes. Nevertheless, the physical characteristics of piezoelectric ceramics demand a hysteresis model for better precision depicting the influence of the reflective surface for the spots offsets. Owing to the balanced performance in terms of stroke and spatial resolution, it is widely used in single DM AO systems as the only wavefront corrector. For FSM, a schematic of it supported by piezoelectric ceramics is depicted in Figure 2c. It consists of a rigid non-deformable mirror. Correspondingly, the FSM cannot express any other aberrations except tilts aberrations with two controlling signals which determine the tipping and tilting angle of the mirror. Commonly, FSMs are used to either process low-order aberrations or fine tracing.

Base
Controlling Electrodes On our testbed, the diameter of MMDM is 15 mm and the number of electrodes is 37. As for the PDM, the diameter of it is 30 mm with same 37 electrodes. In addition, the FSM has two electrodes controlling a circular area with a diameter of 19 mm. According to the literature, the maximum deflection of the mirror center is 9.4 µm. Therefore, by assuming MMDM could form an ideal tilting surface within a 90-degree sector when the center electrode is configured to the maximum stroke, 1 4 of the mirror reaches the extreme compensation scope. Represented in Figure 3a, MMDM could compensate a tilts aberrations of 4.23 wavelength at the cost of sacrificing 3 4 correction area. When electrodes are configured by the same digital voltages, the deformation of the panel shares the same magnitude. This leads to the feasibility of using the whole area to frame an inclined plane to compensate tilts aberrations. The technique documents indicate that the maximum stroke of PDM is 8 µm. According to the simulation results shown in Figure 3b, the compensation scope of PDM is 1.8 wavelength. Similarly, the FSM can also use the whole surface to compensate tilts aberrations. Due to the non-deformable reflective surface, a literal depiction of its technical parameters focuses on inclination generation capability. According to the literature, the maximum deflection angle of FSM is 5 mrad, as shown in Figure 3c. This means a capability of 33.75 wavelength when compensating tilts aberrations.  As shown in Figure 3, the FSM has a desirable compensation ability for tipping and tilting as well as a swift response rate of 8 mrad within 1ms which is faster than higher spatial resolution wavefront corrcetors. For DMs, MMDM enjoys better local deformation ability. Besides, considering an absence of the hysteresis effect, the control of MMDM would be easier. Therefore, we cascade FSM and DM as the solution for our focused scenario.

High-Dynamic Atmospheric Turbulence Compensation Algorithm
In this section, we introduce our Zernike polynomials based method by which we interpret image information acquired from FSM and SHWS to wavefront phase screen so as to evaluate the quality of the incident and emergent laser beam. Secondly, a coupled secondorder modeling strategy for FSM and a complete second-order modeling scheme for DM are demonstrated. Both modeling plans map controlling voltages on electrodes to offsets of spots. Subsequently, in view of the calibrated models, we apply an iterative method to solve the models obtaining optimal controlling voltages for electrodes on wavefront correctors. Finally, the workload distribution is illustrated which is the core of the high-dynamic atmospheric turbulence compensation algorithm.

Zernike Polynomials Based Phase Reconstruction Method
Collected from SHWS, the information of wavefront would be displayed in the coordinates of spots arrayed in every single aperture of the micro lens array shown in Figure 4. By computing the offsets of the spots, the disturbance of the wavefront can be acquired. Therefore, we adopt the mapping relationship between spots offset and wavefront slopes suggested by [30]. Assume there are totally N spots on the SHWS, and let slope be the vector of partial derivatives of phase. Correspondingly, o f f set represents the offset vector of spots. Both slope and o f f set contain 2 × N elements. This process is expressed as where λ stands for wavelength and f indicates focus length of micro lens array on SHWS. Furthermore, following the modal algorithm presented in [31], the coefficients of assigned Zernike terms are fitted by a least-mean-square method, yielding where, A denotes the fitted coefficients vector and Z is the wavefront restoration matrix.

Modeling Strategy
In the context of our testbed, the wavefront reconstructor is a digital signal processor (DSP) which controls a digital-analog-converter (DAC) to produce an analog voltage of 0∼2.226 volts which is subsequently amplified to 0∼187 volts by a high voltage amplifier when controlling DM. When it comes to FSM, the 0∼2.226 volts output of DAC is amplified to 0∼10 volts. Due to the fact that the input of DAC is produced by DSP using a 24-bit data segment among which, there are 12 bits responsible for the voltage level of the output analog. Therefore, controlling voltages are expressed in a digital voltages range from 0 to 4095 within the capability of a 12-bit data segment. Literally, the relationship between the output analog voltages and input digital voltages of DAC follows Equation (3), where V analog is the level of output analog voltage, V re f erence is the reference voltage which is 1.25 volts in our context, and V digital represents the digital voltage.
When powered on, the initial digital voltages for the DM and FSM are 2048 and 2448, respectively. Furthermore, in order to choose a reasonable stroke of the wavefront correctors, the range of digital voltages are limited to −1600∼+1600 and −400∼+400 for the DM and FSM. Additionally, we ensure the precision of the digital voltages during the process of iteration by normalizing them to −1∼1. Letting v i and s i represent the digital voltage and the normalized digital voltage of the ith electrode on DM or FSM, the normalization process is shown in Equation (4), where v initial symbolises the digital voltages as our testbed is powered on 2048 for DM and 2448 for FSM. Meanwhile, v norm is the normalization radius of the electrodes on DM which is designed to 1600 for DM and 400 for FSM in our experiment.
Because of the attribute of DMs, the response of DMs is a function with respect to the square of controlling voltages. Nevertheless, ref. [29] points out that in practical testing, measured data are always accompanied with noise. Due to the tension of the continuous membrane introducing reflective jitter and system-level noise, the spots offsets are not exactly proportional to the square of controlling voltages. Hence, we adopt their modeling strategy adding first-order terms and constant terms to the model for more accurate fitting results. Subsequently, by assuming there are totally M electrodes on DM, the model for o f f set i which is the ith element in o f f set is shown as follows where, a i,j , b i,j and c i,j symbolise coefficients of second-order terms, first-order terms, and constant terms, respectively. Additonally, the 37 electrodes are modeled one by one sequentially. During the procedure of modeling, we configure the electrodes for 10 cycles of calibration voltages, and record the spots offsets captured by SHWS. One cycle of calibration voltages contains 34 elements. Therefore, we gain 340 spots offsets vectors expressed as o f f set 1 ∼ o f f set 340 . Correspondingly, each spots offsets vector corresponds to an calibration voltage vector marked as S f ull,1 ∼ S f ull,340 . Equation (6) depicts the process of single electrode modeling.
where, P DM is the model coefficients matrix for the currently modeled electrode. However, when modeling the jth electrode, the constant term of it describes the comprehensive effect of all M electrodes. To handle this, we consider the contribution of each electrode on spots offsets as equal. Therefore, the kth calibration voltage vector is supposed to be designed as Equation (7) Referring to the above complete second-order model, we build a model for FSM containing second-order terms, first-order terms, and constant terms. In addition, considering the fact that the reflective surface of FSM is continuous and flat with tough rigidity, such characteristics would cause FSM electrodes to have a fierce coupling effect and this becomes increasingly evident when the stroke of the wavefront corrector rises. Individually building models for a single electrode and executing a linear summation cannot completely capture the effect brought by the collaboration of the two electrodes thus producing modeling errors. Correspondingly, an application of coupling terms keeps the computational complexity bearable. Meanwhile, it would not pose unaffordable burdens on solving models. The following equation is our constructed model for FSM.
where, a i , b i , c i , d i , e i , f i indicate the model coefficients of the ith spots coordinate. s FSM,1 s FSM,2 are the called coupling terms with whom the reliability of the model bewriting the cooperation of the two electrodes is guaranteed.

Computing Controlling Voltages
Due to the nonlinearity of model as well as the overdetermination of the equation set, multiplying a pseudo-inverse of the model coefficients matrix on the left of the detected wavefront value vector could not determine an agreeable vector of controlling voltages. Therefore, the Gauss-Newton method which has the advantages of low computational complexity and excellent convergence performance is adopted.
Consider the following overdetermined equation set with 2 × N equations and M variables o f f set Target = f (s) + e where o f f set Target is the information vector collected from SHWS, and f (s) is a 2 × N model value vector, symbolises independent variables, while e stands for an error vector. Furthermore, the equation below is used in order to evaluate the global error of the overdetermined equation set under a certain set of solutions.
The Gauss-Newton method solves the following optimization problem.
By repeatedly computing the step length in Equation (13) and adding it to Equation (10) until it reaches the convergence condition of Equation (11) or iteration chances are exhausted, the final Equation (10) would be denormed and configured to the electrodes of wavefront correctors.
J in Equation (13) is expressed as follows, In practical applications, the Gauss-Newton method runs as Algorithm 1

Capturing Aberrations
In the previous subsection, we apply the Gauss-Newton method to compute controlling voltages for the electrodes of wavefront correctors. However, an input is supposed to be provided as the target of iteration which is the o f f set Target in Equation (9). In this subsection, we introduce our method of providing input for the Gauss-Newton method.
For single DM AO architecture, all workloads are put on the only wavefront corrector. Traditional iterative wavefont correction algorithms building on the relationship between controlling voltages and spots offsets linearly combine current spots offsets, wavefront residual of the previous turn and model value as o f f set Target . The algorithm captures accurate wavefront aberrations and according to [29], the wavefront residual remains under 1 8 wavelength after compensation. Nevertheless, in double wavefront correctors architecture, distribution of the workload on FSM and DM separately determines the precision of the compensated results.
In order to divide the labor of woofer and tweeter which are the FSM and DM in our context, the Lagrange-multiplier method [23,24] multiplied a multiplier λ to the left of the response matrix of tweeter as shown in Equation (15) o f f set = P woo f er × s woo f er + λP tweeter × s tweeter (15) where P woo f er and P tweeter are response matrices for the woofer and tweeter, while s woo f er and s tweeter are actuator vectors for the woofer and tweeter. The basic solving process are expressed in Equation (16) P T woo f er P woo f er λP T woo f er P tweeter λP T tweeter P woo f er λ 2 P T tweeter P tweeter However, by marking [P woo f er λP tweeter ] as P global , the irreversibility of P T global P global determines that a damped least-square method is needed for solving actuator vectors as shown in Equation (17). s woo f er s tweeter = P T woo f er P woo f er + β 1 λP T woo f er P tweeter λP T tweeter P woo f er λ 2 P T tweeter P tweeter + β 2 where β 1 and β 2 are damping parameter matrices. In Equation (17), λ, β 1 and β 2 are manually set. The empirically selected λ can suppress coupling between the woofer and tweeter. However, Equation (15) is neither a necessary nor sufficient condition for decoupling high and low-order aberrations. By contrast, in our FSM-DM cascaded AO architecture, the workload of FSM and DM consists of decoupled current spots offsets, decoupled wavefront residual of the previous turn and model value. Model value can be computed with coefficient matrices and current controllilng voltages. Decoupled current spots offsets and decoupled wavefront residual of the previous turn are acquired from current spots offsets and wavefront residual. By changing them to wavefront slopes using Equation (1) the coefficients of Zernike polynomials can be fitted with Equation (2). Subsequently, after separated Zernike coefficients performing the inverse process of Equations (1) and (2) in turn, the decoupled current spots offsets, and the decoupled wavefront residual of the previous turn can be gained. It should be noted that, the current spots offsets are calculated from the current wavefront on SHWS while the wavefront residuals are acquired from the compensated wavefront output of the previous turn on SHWS.
According to [32], consider the 2nd and 3rd order of Zernike polynomials in Cartesian coordinate system marked as Zernike 2 and Zernike 3 , as shown in Equation (18).
where x and y are coordinate vectors within the normalized area. The formulas above reveal that the 2nd and 3rd of the Zernike polynomials vary monotonically with x and y. They depict the horizontal and vertical tilt of the phase screen which demands barely any capability of local deformation. This fits the deformation capability of FSM shown in Figure 3c which forms a tilt shape with a spatial resolution of two. In contrast, the phase screens described by the fourth and above Zernike polynomials are not a plane, for example, the 4th∼6th order of Zernike polynomials in Equation (19). There are local fluctuations in the phase screen, which require a higher spatial resolution to be expressed. Therefore, the workload allocated to FSM is to compensate the 2nd and 3rd order Zernike aberrations while the rest of the wavefront aberrations belong to the DM.
Assuming the coefficients of Zernike polynomials are, where subscripts symbolise order. Decoupled offsets expressed in o f f set Decoupled,low and o f f set Decoupled,high (tilts aberrations and high-order aberrations, respectively) are gained according to the following procedure.
where, Z is the wavefront restoration matrix which corresponds to 2nd∼15th order Zernike polynomials, while I Low and I High are shown as follows Denoting As we are expected to drive the wavefront correctors eliminating the detected offsets, the target should be set opposite to the decoupled current spots or decoupled wavefront residual.
The above method effectively suppresses the wavefront residual and distributes the workload. Meanwhile, it saves the stroke of DM by passing the pressure of the stroke to FSM.

Implementation and Experimental Results
In this section, we introduce the implementation of the calibration modeling as well as the simulative performance of our algorithm.

Calibration Modeling
The modeling procedure is examined on our testbed displayed in Figure 5, from which we gain the coefficient matrices of DM and FSM describing the mapping relationship between spots offsets and controlling voltages. On our testbed, a 800 nm wavelength laser beam with a diameter of 15 mm is used as the ideal incident light. As for the AO system, the internal topology is shown in Figure 1. Within it, the wavefront sensor is a charge coupled device (CCD) camera with a micro lens array covered in the front of it which has a focus of 18.8 mm. Moreover, the effective area of the wavefornt sensor contains 1088 × 1088 pixels whose size is 5.5 µm, while the size of a single aperture of the micro lens array is 300 µm which means the wavefont sensor can capture spots in an around 19 × 19 rectangle area. However, in our experiment environment, the detected spots on the wavefront sensor cover a circular area with a diameter of 17 apertures. Additionally, the PDM in front of the light entrance is not drived on the current stage and it is set for a future experiment. On the left of the AO system, devices are depolyed for observing the emitting light. For DM, the calibration voltages in one period are shown in Figure 6. During the process of modeling the DM, we repeatedly configure the calibration voltages on every electrode for 10 periods. Each calibration voltage produces 20 data samples (two data per period). After receiving the 20 data samples, we use the least square method to solve pre-term coefficients for the current affecting electrode. The calibration voltages take 200 as the step and 2048 as center, traversing the voltage range of ±1600. According to Equation (3), a digital voltage ranging from 448 to 3648 can be linearly interpreted to an analog voltage ranging from 0.273 volts to 2.226 volts.  Different from the modeling structure of the DM, the structure of FSM contains coupling terms depicting the collaborative effects of electrodes on spots offsets. This leads to the requirement that when modeling, the two electrodes should be modeled together. Therefore, the horizontal electrode is sequentially configured with a period of calibration voltages. Whenever the horizontal electrode is configured with a calibration voltage, the vertical electrode is configured with a period of calibration voltages in turn. Furthermore, due to the large maximum stroke of FSM, the used range of it is limited to ±400 in digital voltages, as displayed in Figure 8.

Simulation Results
Firstly, we examine the Zernike mode decomposition method by driving FSM-DM cascaded AO architecture and single DM architecture approaching our generated phase screen. The result presented in Figure 10 illustrates that when approaching a phase screen containing tilts aberrations, FSM-DM cascaded AO architecture can form a surface shape that is closer to the generated phase screen than the single-DM AO architecture. The PV of Figure 10a is 15.76 wavelength. As for Figure 10b,c, the PV values are 10.99 wavelength and 6.75 wavelength, respectively. Furthermore, due to the response rate of the FSM, the FSM-DM cascaded AO architecture is bound to have a swifter response towards tilts aberrations. Secondly, to illustrate the merits of FSM-DM cascaded AO architecture against single DM AO architecture, we produced a single phase screen as the input wavefornt according to the Kolmogorov theory [32]. Subsequently, generated phase screen is interpreted to spots offsets which is the direct information we can gain from the wavefront of our sensor. Finally, we observe the phase screen before compensation, phase screens compensated by FSM-DM cascaded AO architecture, and single DM architecture. The result is shown in Figure 11. Figure 11a shows the simulative input wavefront of the system where a peak phase value of 1.13 wavelength can be generated while the valley phase value is −0.953 wavelength. After the compensation of FSM-DM cascaded AO architecture with the proposed algorithm, the convergent phase plane reaches a peak phase value of 0.196 wavelength and a valley phase value of −0.158 compared to single DM AO architecture with the traditional algorithm whose reconstructed phase plane claims a peak phase value of 0.360 wavelength and a valley phase value of −0.302 wavelength. In addition, although both the suggested and traditional architecture can compensate wavefont aberrations, the reconstructed phase planes demonstrate that the results produced by raised architecture with raised algorithms are more gentle which reflects the necessity of an FSM.
In addition, by analysing the iterative procedure of compensating a single wavefront input, Figure 12 reveals that for a wavefront input of 0.53 wavelength, both FSM-DM cascaded architecture and single DM AO architecture converges using the iterative compensation algorithm within one iteration. Nevertheless, as for the convergence level, single DM architecture converges to 0.12 wavelengths after compensation compared to the results of our proposed architecture which is approaching 0.09 wavelengths.   Another verification for the consecutive compensating capability of our architecture with the algorithm is conducted. Firstly, a series of discrete phase screens according to Kolmogorov theory [32] are produced. Additionally, in order to simulate a dynamic atmospheric turbulence disturbed phase screen, we perform interpolation using a parallel temporal filter following [33] and produce 1000 distorted wavefront inputs in total. During the process, Zernike polynomials are also used to mathematically represent phase screens and change them to spots offsets. The results are shown in Figure 13. The figure consists of three curves representing input wavefront aberrations, wavefront residuals compensated by single DM architecture, and wavefront residuals compensated by FSM-DM cascaded architecture. The blue curve represents the wavefront value when the laser beam has not been compensated by any AO system, which is also the quality of optical communication without the AO system in FSO communications. It fluctuates from around 1 5 to 4 5 wavelength which in our context is 1.6 µm to 6.4 µm. By contrast, considering the red curve in Figure 13, it represents the compensation performance of traditional AO architecture which contains only one DM. The curve fluctuates around 1 6 ∼ 1 10 wavelength with sharp glitches which occur because the model of DM cannot depict the overwhelmingly large tilts aberrations within the effective arguments range leading to a divergent result. This fact also demonstrates the advantage of FSM deployed in front of the DM with which the probability of divergence caused by parameter range limits are fully supressed. As shown in Figure 13, apart from the above curves, a green curve illustrates the stable tendency of wavefront residuals under 1 16 wavelength which is also displayed. Although, there are glitches, the reasons for their occurence lies in the coupling of tilts aberrations with high-order aberrations in the DM model. A calibration modeling strategy truthfully reflects the mapping relationship of spots offsets with voltages configured on the electrodes, however, tilts aberrations and high-order aberrations are coupled in the DM model. This demands the DM shoulder a part of the workload for tilts aberrations when conducting wavefront compensation. When generating random wavefront input using the DM model, the proportion of the tilt aberration of certain groups of inputs will be relatively high. As this part of the tilt aberrations are distributed to the workload of FSM, the glitch appears in the compensation of the remaining workload by the DM.

Conclusions
In conclusion, this paper clarifies the use of Zernike mode decomposition to decompose wavefronts into 2nd, 3rd and 4th∼15th orders which are compensated by FSM and DM, respectively. The decomposition ensures the compensation capability of FSM-DM cascade AO architecture. Meanwhile, modeling strategies of FSM and DM and the iterative compensation algorithm guarantee the precision of the compensation. In addition, the performance of the algorithm has been simulatively verified. For compensating a static input wavefront, the wavefornt residuals are better supressed around 1 16 wavelength than 1 6 wavelength for single DM AO architecture. Conversely, for consecutively compensating the Kolmogorov theory-based dynamic atmospheric turbulence disturbed incident laser beam, the algorithm reaches a convergence level under 1 16 wavelength better than 1 10 to 1 6 for the single AO architecture when glitches are not counted. This study intends to provide a guideline to design the AO architecture for future FSO communications.
Author Contributions: Conceptualization, investigation and methodology-J.Z. and S.Y.; experimentation, data acquisition and analysis-H.M. and X.Y.; writing-original draft preparation-H.M. and J.Z.; writing-review and editing-J.Z., G.L. and P.X. All authors have read and agreed to the published version of the manuscript.