1. Introduction
Driven by the growing strategic demands in fields such as marine resource exploration, underwater communication, and environmental monitoring, the detection and recognition of underwater acoustic signals have become a primary research focus. Benefiting from the advantages of miniaturization and batch fabrication enabled by MEMS technology, MEMS-based underwater acoustic sensors have attracted considerable attention [
1,
2,
3]. Most of the MEMS underwater acoustic sensors rely on electrical sensing mechanisms [
4,
5,
6,
7], mainly including piezoelectric, piezoresistive, and capacitive types [
8,
9,
10]. Piezoelectric underwater acoustic sensors based on thin-film materials have attracted significant attention due to their high sensitivity [
11]. However, electrical sensors are inherently active devices, which impose stringent requirements on electrical insulation and hermetic packaging in complex underwater environments. Moreover, they are susceptible to electromagnetic interference, thereby limiting their practical applications.
Fiber-optic MEMS sensors offer distinct advantages [
12,
13], including strong immunity to electromagnetic interference, corrosion resistance, and excellent electrical insulation, providing a promising alternative for acoustic detection in complex underwater environments. According to their sensing principles and measured physical quantities, these sensors can be classified into interferometric types [
14,
15,
16], diffractive types [
17,
18,
19], acoustic pressure types and acoustic vector types [
20]. Li et al. improved the dynamic range of a fiber-optic F-P underwater acoustic sensor by optimizing the corrugated structure parameters of a polyethylene terephthalate (PET) diaphragm, while maintaining a high sensitivity of −149.24 dB re 1 rad/μPa [
21]. Zhang et al. employed a metallic diaphragm with an Archimedean spiral configuration as the sensing element, achieving relatively flat signal-to-noise ratio and sensitivity within the frequency range of 1 kHz to 20 kHz, with an average sensitivity of −172.52 dB re 1 rad/μPa [
22]. Wang et al. developed an ultra-high-sensitivity hollow F-P ultrasonic sensor using 3D printing technology, exhibiting a sensitivity of 797 mV/kPa, a bandwidth of 1.2 MHz, and a central response frequency of 1.5 MHz [
23]. Despite these advances in improving the acoustic performance of fiber-optic sensors, most reported designs employ sealed air cavities in the sensing unit to enhance sensitivity. However, in underwater environments, such sealed diaphragm-based structures lead to severe pressure imbalance between the internal and external surfaces, resulting in excessive deformation of the diaphragm. This not only reduces the linear operating range of the sensor but also significantly limits its applicable depth in deep-sea environments. Therefore, mitigating the detrimental effects of high hydrostatic pressure on the sensing structure remains a critical challenge in underwater acoustic detection.
In this paper, a silicon MEMS-based fiber-optic F-P underwater acoustic sensor with a composite diaphragm structure consisting of a central boss and a micro-hole array is proposed. The micro-hole array enables automatic hydrostatic pressure equalization be-tween the inside and outside of the F-P cavity, effectively eliminating the initial cavity length drift and measurement range limitations induced by hydrostatic pressure. Meanwhile, the central boss structure significantly enhances the quality of the interference signal. The sensing unit is fabricated using MEMS processes combined with CNC machining, providing feasibility for batch production. Adhesive-free integration of the sensing probe is achieved by CO2 laser fusion splicing. A hybrid packaging structure composed of 316L stainless steel and polyether ether ketone (PEEK) is employed as the housing, ensuring long-term stability and structural reliability in liquid environments. Finally, an underwater acoustic testing platform is established to evaluate the hydrostatic pressure resistance and acoustic detection performance of the proposed sensor.
3. Sensor Fabrication and Packaging
The fabrication process of the silicon MEMS-based fiber-optic F-P underwater acoustic sensor is illustrated in
Figure 4, which mainly includes three steps: MEMS fabrication of the sensing diaphragm, silicon–glass anodic bonding and dicing, and adhesive-free sensor integration. The detailed MEMS fabrication process of the sensing diaphragm is shown in
Figure 4a. A 4-inch silicon wafer with a thickness of 300 μm is selected, and deep reactive ion etching (DRIE) is employed to form the central boss, micro-hole array, and groove structures. A Cr/Au thin film, with thicknesses of 30 nm for Cr and 300 nm for Au, is deposited on the inner surface of the groove structure using DC magnetron sputtering, where the Cr layer serves as an adhesion layer to enhance the bonding strength of the gold film. The anodic bonding and subsequent dicing of the sputtered silicon wafer and the borosilicate glass are illustrated in
Figure 4b. A double-side polished Pyrex glass wafer with through-holes fabricated by CNC machining is first bonded to the silicon wafer with the diaphragm structure via anodic bonding. Subsequently, an external boss structure is machined at the glass through-hole region, and individual sensing units are finally obtained by wafer dicing. The sensor integration process is shown in
Figure 4c. A single-mode fiber is first fusion-spliced to a capillary glass tube using a CO
2 laser splicer. The fiber with the glass tube is then inserted into the through-hole of the sensing unit. The fiber end is connected to an optical interrogator (TV125, Tongwei Sensing, Beijing, China) via a patch cord. The insertion depth of the fiber is manually adjusted, and when the interference spectrum exhibits the maximum fringe visibility, the Pyrex glass substrate is permanently bonded to the capillary tube using CO
2 laser fusion splicing, thereby completing the adhesive-free integration of the sensor.
The sensor housing and the armored fiber tail are fabricated from 316L stainless steel. The sensing unit is securely bonded to the metal housing using epoxy adhesive (353ND, Epoxy Technology Inc., Billerica, MA, USA) to enhance structural stability. The metal package is then assembled with a PEEK insulating cap via threaded connection and immersed in an ultrasonic cleaner, where vibration facilitates the filling of the F-P cavity with liquid. Deionized water is used to ensure a clean and impurity-free internal environment. When the optical cavity length of the interference spectrum increases to 1.33 times its initial value, the ultrasonic excitation is stopped. After the spectrum stabilizes, a dedicated water-proof acoustic-transparent membrane (thermoplastic polyurethane, TPU, ~0.012 mm thickness, >80% acoustic transmission) is attached to the front end of the plastic cap to isolate external impurities while reducing acoustic transmission loss, thereby completing the sensor packaging. A photograph of the packaged sensor is shown in
Figure 5a. Before packaging, the F-P cavity is filled with air, whereas after packaging it is filled with water. A comparison of the corresponding interference spectra is presented in
Figure 5b. The optical cavity length in water increases by a factor of 1.33, while the fringe visibility decreases, which is consistent with the theoretical simulation results shown in
Figure 2.
4. Experiments and Analysis
A schematic diagram of the constructed underwater acoustic testing platform is shown in
Figure 6. The system consists of a rectangular water tank, a cylindrical acoustic source transducer (CT1(2), China Shipbuilding Industry Corporation No. 715 Research Institute, Hangzhou, China), a standard hydrophone (RHSA-5, China Shipbuilding Industry Corporation No. 715 Research Institute, Hangzhou, China), a signal generator (DG4062, RIGOL Technologies, Suzhou, China), a power amplifier (ATA-6013C, Aigtek, Xi’an, China), a hydrophone power supply (China Shipbuilding Industry Corporation No. 715 Research Institute, Hangzhou, China), an oscilloscope (DSOX2004A, Keysight Technologies, Santa Rosa, CA, USA), and a three-wavelength dynamic demodulation system. The dimensions of the rectangular water tank are 1.5 m × 1.1 m × 0.8 m. The three-wavelength dynamic demodulation system serves as the core unit for real-time extraction and processing of underwater acoustic signals. It mainly consists of an amplified spontaneous emission (ASE) broadband light source (LSM-ASE-CL-F, Junfeng Technology, Tianjin, China), an optical fiber circulator, optical filters, photodetectors (PDs, GaAs PIN photodiodes, Mingguang Technology, Beijing, China), and an analog-to-digital converter (ADC, AD7667, Analog Devices, Norwood, MA, USA). During the experiment, the broadband continuous spectrum generated by the ASE source enters the sensor probe through the optical fiber circulator and undergoes interference within the F-P cavity. The interference spectrum is modulated by the external acoustic pressure. The modulated interference signal returns to the circulator and is then separated into three optical channels by optical filters. These signals are converted into electrical signals by a PD array, and the resulting voltage signals are acquired by the ADC module. The raw data are processed to extract the phase variation in the interference spectrum, which is dis-played in real time on the host computer. After further processing, the cavity length variation in the fiber-optic F-P underwater acoustic sensor is reconstructed, enabling high-sensitivity measurement of the external acoustic pressure. The demodulation system features high accuracy and fast response.
To evaluate the hydrostatic pressure resistance of the silicon MEMS-based fiber-optic F-P underwater acoustic sensor in a liquid environment, the sensor under test was vertically immersed in a rectangular water tank. Starting from the water surface (0 cm), the immersion depth was increased in steps of 10 cm, and the sensor was held at each depth for 10 s to ensure stability. The interference spectrum at each depth was acquired using the optical interrogator, and the initial cavity length shift at each measurement point was calculated using the host computer demodulation program. The relationship between water depth and hydrostatic pressure can be expressed as:
where
is the hydrostatic pressure applied to the sensor,
is the density of water, taken as 1000 kg/m
3,
is the gravitational acceleration, taken as 9.8 m/s
2, and
is the immersion depth of the sensor.
According to Equation (10), an immersion depth of 10 cm corresponds to a hydrostatic pressure of approximately 980 Pa. Based on this, the hydrostatic pressure corresponding to each test depth is calculated. The relationship between the initial cavity length shift and the hydrostatic pressure of the sensor under test is shown in
Figure 7.
The results in
Figure 7. indicate that when the diaphragm is fluidically connected on both sides and the F-P cavity is filled with water, the cavity length exhibits excellent stability within the tested pressure range, with the cavity length shift remaining nearly constant. In contrast, for the sealed air-filled cavity sensor used as a control group, the cavity length of the F-P cavity decreases significantly under hydrostatic pressure, and the cavity length shift increases linearly with increasing hydrostatic pressure. These results verify the effectiveness of the micro-hole array structure, which enables fluidic communication between the internal and external regions of the diaphragm, thereby establishing an automatic pressure-balancing mechanism between the inside and outside of the F-P cavity. This ensures reliable operation and measurement range stability of the sensor in deep-water environments.
To evaluate the dynamic performance of the silicon MEMS-based fiber-optic Fabry–Perot underwater acoustic sensor, a sinusoidal excitation signal is generated by a signal generator and amplified to drive the transducer, producing acoustic signals at specific frequencies. When the acoustic signal reaches the sensor, the diaphragm deformation modulates the interference signal, which is then transmitted to the three-wavelength dynamic demodulation system for real-time extraction of cavity length variations. A standard hydrophone simultaneously receives the acoustic signal. The peak-to-peak voltage measured by the oscilloscope connected to the hydrophone, combined with its calibrated sensitivity, is used to calculate the acoustic pressure amplitude as the reference value. During the experiment, the acoustic source transducer, the standard hydrophone, and the sensor are positioned at the same horizontal depth. To ensure far-field conditions and satisfy the plane-wave approximation, the distance between the source and the receivers is set to be greater than 30 cm, thereby ensuring that both the sensor and the hydrophone are subjected to an equivalent acoustic pressure field.
Three frequencies, 1 kHz, 5 kHz, and 10 kHz, are selected as test points. The output is adjusted to maintain an acoustic pressure of 1 kPa. The response of the sensor under sinusoidal excitation at different frequencies is shown in
Figure 8.
Figure 8a–c present the output waveforms of the sensor within 1 ms at each test frequency, while
Figure 8d–f show the corresponding frequency spectra. The experimental results demonstrate that the output waveforms closely match the excitation signals, exhibiting good sinusoidal characteristics.
The output frequency of the signal generator is set to 10 kHz, and the driving voltage is increased from 5 V to 50 V in steps of 5 V, corresponding to a reference acoustic pressure range of 136 Pa to 1209 Pa as measured by the hydrophone. To ensure data reliability, the sensor output waveform at each pressure level is recorded for no less than 3 s. The test results of the sensor at 10 kHz are shown in
Figure 9.
Figure 9a presents the sensor output waveforms under different acoustic pressures, exhibiting standard sinusoidal characteristics consistent with the excitation signal.
Figure 9b shows the corresponding output spectra, with the frequency maintained at 10 kHz, matching the excitation frequency.
Figure 9c illustrates the relationship between acoustic pressure and the sensor output. Lin-ear fitting yields a sensitivity of 75.78 nm/kPa, with a maximum nonlinearity of 0.48% F.S. Based on Equation (6), the sensitivity of the sensor at 10 kHz is calculated to be −181.79 dB re 1 rad/μPa.
Figure 9d shows the repeatability test results of the sensor. At 10 kHz, three complete loading–unloading cycles are performed with the same pressure step, and each pressure level is maintained for 3 s. The processed results indicate a repeatability error of 0.15% F.S. These experimental results demonstrate that the sensor exhibits excellent linear response characteristics, negligible hysteresis, and high repeatability at 10 kHz, indicating strong reliability for acoustic detection.
Within the frequency range of 1 kHz to 10 kHz, test points are selected according to the one-third octave standard. At each test point, the frequency is kept constant, and the driving voltage of the signal generator is increased from 5 V to 50 V in steps of 5 V to excite the acoustic source. By synchronously recording the demodulated output of the sensor and the acoustic pressure amplitude measured by the reference hydrophone, the mechanical sensitivity at each frequency is obtained. The phase sensitivity of the sensor is then calculated based on Equation (6), and a comparison is conducted with that of a sealed air-filled cavity sensor. The measured frequency response characteristics are shown in
Figure 10. The results indicate that although filling the F-P cavity with water leads to a slight reduction in sensitivity compared with the air-filled configuration, the proposed sensor exhibits an excellent flat frequency response within ±1.5 dB over the range of 1 kHz to 10 kHz.
In the frequency range from 1 kHz to 10 kHz, test points were selected according to the 1/3-octave standard. The driving voltage of the signal generator was maintained at 50 V, and the demodulated output of the sensor was recorded. By applying FFT and incorporating the resolution bandwidth (RBW), the power spectral density of the sensor was obtained, as shown in
Figure 11.
Figure 11a shows the power spectral density of the sensor at different frequencies from 1 kHz to 10 kHz. The average signal-to-noise ratio (SNR) within this range is 86.35 dB, indicating good signal processing capability and noise immunity in the tested frequency band.
Figure 11b shows the power spectral density of the sensor at 10 kHz. The results indicate that the THD at 10 kHz is −48.5 dB, which is well below −30 dB, confirming that the sensor operates within the linear regime under a 50 V excitation, and the maximum linear operating acoustic pressure is 181.65 dB re 1 μPa. At 10 kHz, based on the sensor’s SNR and phase sensitivity, the MDP is calculated to be 11 mPa/
, corresponding to 80.82 dB re 1 μPa/
. Since the THD is −48.5 dB, the dynamic range of the sensor at 10 kHz is 100.83 dB.
Table 2 shows a comparison of the key characteristics of underwater acoustic sensors with different sensitive diaphragm materials, where some parameter details are not mentioned in the referenced papers. Although the sensor designed in this study has a lower sensitivity, it demonstrates excellent performance in terms of signal-to-noise ratio and dynamic range. In practical applications, the sensor’s parameters can be adjusted by modifying diaphragm size or optimizing the demodulation system to meet specific requirements.