On the Effect of Channel Knowledge in Underwater Acoustic Communications: Estimation, Prediction and Protocol

Petroni, Andrea; Scarano, Gaetano; Cusani, Roberto; Biagi, Mauro

doi:10.3390/electronics12071552

Open AccessArticle

On the Effect of Channel Knowledge in Underwater Acoustic Communications: Estimation, Prediction and Protocol

¹

Fondazione Ugo Bordoni, 00184 Rome, Italy

²

Deptartment of Information, Electronics and Telecommunications (DIET) Engineering, Sapienza University of Rome, Via Eudossiana 18, 00184 Rome, Italy

^*

Author to whom correspondence should be addressed.

Electronics 2023, 12(7), 1552; https://doi.org/10.3390/electronics12071552

Submission received: 24 February 2023 / Revised: 20 March 2023 / Accepted: 24 March 2023 / Published: 25 March 2023

(This article belongs to the Special Issue Underwater Optical and Acoustic Communications: Research and Challenges)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Underwater acoustic communications are limited by the following channel impairments: time variability, narrow bandwidth, multipath, frequency selective fading and the Doppler effect. Orthogonal Frequency Division Modulation (OFDM) is recognized as an effective solution to such impairments, especially when optimally designed according to the propagation conditions. On the other hand, OFDM implementation requires accurate channel knowledge atboth transmitter and receiver sides. Long propagation delay may lead to outdated channel information. In this work, we present an adaptive OFDM scheme where channel state information is predicted through a Kalman-like filter so as to optimize communication parameters, including the cyclic prefix length. This mechanism aims to mitigate the variability of channel delay spread. This is cast in a protocol where channel estimation/prediction are jointly considered, so as to allow efficiency. The performance obtained through extensive simulations using real channels and interference show the effectiveness of the proposed scheme, both in terms of rate and reliability, at the expense of an increasing complexity. However, this solution is significantly preferable to the conventional mechanism, where channel estimation is performed only at the receiver, with channel coefficients sent back to the transmit node by means of frequent overhead signaling.

Keywords:

underwater acoustic communications; OFDM; interference; estimation; prediction; Kalman filtering

1. Introduction

In spite of their long and consolidated history, underwater acoustic communications (UWACs) have recently experienced a peak of interest from the scientific community, as they represent an effective technology for a wide range of applications [1]. Real-time control and communication with underwater remote instrumentation and autonomous vehicles [2], coastal surveillance [3] and shipping traffic management [4], environmental monitoring [5] and data collection [6] are some of the most widespread activities where the use of UWACs has been shown to be suitable.

Regarding communication aspects, UWAC links are affected by several impairments that significantly limit the transmission rate. In fact, besides the intrinsic (technological) bandwidth limitation, due to transmitter capabilities, underwater acoustic channels are also characterized by severe multipath delays and highly selective frequency responses, as well as non-negligible propagation delays due to the relatively slow speed of sound in water [7]. The UWAC propagation scenario exhibits a peculiar time-varying behavior [8,9], due to sea temperature and salinity gradients, sea stream, and wind speed, as well as relative positional changes due to motion of the terminals [10]. Multipath time delay and frequency-selective deep fading affect the UWAC channel in different ways, depending on the propagation environment, typically classified as shallow, medium or deep water [11]. The shallow water channel is characterized by several paths of comparable length, so the corresponding channel impulse response (CIR) presents a large number of close coefficients with similar amplitudes. On the other hand, deep water CIR exhibits longer delays and small coefficients because of the long distances traveled by the signal echoes with respect to the line-of-sight wave. The medium water environment presents delay spread and path gain features between the above two cases.

The mitigation of channel impairments is fundamental to provide a reliable communication link. Typical countermeasures consider the use of equalization [12,13] in order to restore the quality of the received signal. Alternatively, adaptive modulation and coding schemes are developed in order to match the transmission parameters with the channel behavior [14,15,16].

In this regard, Orthogonal Frequency Division Modulation (OFDM) offers desirable features, like robustness to multipath delay spread, sufficiently high data rate, large bandwidth efficiency, and adaptivity of the modulation format to channel conditions [17]. In OFDM, modulated symbols are transmitted over different sub-carriers that, ideally, do not interfere with each other when propagating over frequency-selective and time-invariant channels, so that simple symbol-by-symbol detection can be adopted. In real cases, the potential inter-carrier interference can be mitigated through equalization [18]. Furthermore, it is worth highlighting that, in radio-frequency (RF) OFDM systems, the intersymbol interference (ISI) arising from multipath propagation is typically mitigated through the use of a cyclic prefix (CP), the length of which is set according to the channel delay spread [19]. On the other hand, the sudden variability of channel behavior, as well as the long CIR, make the use of a fixed CP impracticable in UWACs, since ISI may not be properly mitigated [20]. CP length optimization has also been investigated, but mainly in reference to RF OFDM scenarios [21,22]. However, such solutions cannot be employed directly in UWACs, due to the significant differences between RF and underwater acoustic channels. Therefore, channel issues must be addressed in a different fashion. In this regard, OFDM adaptivity to the medium conditions is a relevant feature for the harsh propagation scenario offered by UWACs, allowing a significant increase in transmission rate. In [23], the authors propose an OFDM sub-carrier power optimization mechanism based on channel knowledge, achieved with a length-adaptive estimation technique. In [24], second-order statistics of the channel are exploited to derive the signal-to-interference-plus-noise ratio in each sub-carrier, so to realize adaptive coding and bit-power loading. An OFDM acoustic modem is presented in [25], with adaptive modulation being performed based on the channel delay and Doppler spread, measured by using chirp pilots. An information-dependent sub-carrier mapping is proposed for underwater video transmission in [26], so that important data are conveyed on the most reliable OFDM sub-channels, while less useful data are transmitted through the lower quality sub-carriers. The authors in [27] proposed a continuous phase modulation-based OFDM, outperforming standard implementations, and, thus, demonstrating itself to be more suited to UWACs. In [28], a novel fractional fast Fourier transform (FFT) OFDM system is presented, with amplitude shift keying (ASK) employed for sub-carriers’ modulation. Specifically, the use of ASK achieves a better bandwidth efficiency with respect to other conventionally considered schemes and, furthermore, the use of fractional FFT in place of the conventional one allows problems related to carrier frequency offset to be more efficiently mitigated.

All the solutions for OFDM optimization rely on channel state information (CSI), made available from estimation or prediction. Overall, channel estimation typically considers a bi-directional transmission of pilot signals between the communication nodes, in order to measure the corresponding CIR [29] and feed back the channel coefficients. For example, in [30] a novel adaptive denoising scheme is proposed to achieve a reliable channel estimation in OFDM-based UWACs affected by strong noise. The authors in [31] propose an optimized pilot-assisted channel estimation for OFDM, even though performance was measured under the unrealistic assumption of ideal synchronization and Doppler compensation. Unfortunately, due to the low speed of sound, estimation procedures are time consuming and they must be performed frequently in order to keep the CSI updated. So, the communication rate may be significantly reduced. Furthermore, the available channel coefficients may be outdated, since, during the estimation process, the channel may have already changed [32]. In order to overcome such problems, channel prediction can be exploited in place of estimation. Predictors are typically based on channel statistical models [33] and are developed in different approaches, such as recursive least squares (RLS) [34] and deep neural networks [35]. Channel prediction does not impact on data rate since it allows the reduction of overhead information to be transmitted. On the other hand, if the predicted channel deviates from the real one, inaccurate system optimization occurs and problems of reliability may occur. The work in [36] considers, instead, the joint use of channel estimation and prediction. However, it is proposed for a single-carrier binary phase shift keying (BPSK) scheme where transmission parameter adaptation is not addressed. The worthiness of estimation and/or prediction is discussed in [37] by comparing minimum mean square error channel estimation and auto-regressive (AR) channel prediction. Furthermore, the authors propose a mechanism for bandwidth adaptation, in order to provide channel-tailored performance and avoid interchannel interference (ICI). Finally, a possible approach for channel estimation may rely on deep-learning, as presented in [38]. The proposed solution is designed for Multiple-Input Multiple-Output (MIMO) OFDM systems, even though not specifically related to the underwater acoustic case.

Motivation and Goals of the Work

As previously stated, both OFDM and single carrier modulations require channel knowledge for transmitter processing (e.g., bit-loading, pre-equalization) or receiver processing (e.g., equalization), aimed at counterbalancing the propagation impairments. Most of the proposed solutions assume perfect channel knowledge at the receiver and/or transmitter side, so the achieved performance cannot match that expected in a real scenario. Actually, the accuracy of CSI unavoidably impacts on the communication performance in terms of trade off between reliability and data rate. However, to the best of our knowledge, only a few works in the literature address such an issue. Furthermore, despite the availability of several channel estimation and prediction techniques, we highlight the absence of a real protocol that drives the joint use of these approaches in an adaptive fashion, so as to maximize the performance in terms of bit error rate (BER), while minimizing the rate reduction caused by the overhead signaling. Finally, it is worth noting that CSI acquisition and transmission optimization are typically performed under the optimistic assumption of absence of interference. The presence of superposing external acoustic sources (mammals’ communication, vessels’ engines and so forth) may not be negligible when dealing with channel estimation and signal detection [39], and, hence, it should be conveniently taken into account.

Aimed at the above considerations, in this work we propose an adaptive OFDM scheme able to manage the cyclic prefix length adaptation by resorting to channel knowledge acquired thanks to both estimation and prediction techniques, with the goal of reducing ISI. The employment of estimation, followed by a channel predictive step, is peculiarly tailored to the underwater environment, since it allows reduction in overhead signaling due to training/pilot transmission. Such a saving is not negligible in an UWACs, characterized by very scarce resources. Summarizing, the main contributions of this work are the following:

an adaptive mechanism to tune the OFDM cyclic prefix, based on the channel estimation phase;
a channel tracking mechanism, based on channel estimation and a prediction stage, by resorting to Kalman filtering;
a two-side contemporary processing, that is, transmitter and receiver signal processing is operated independently, thus avoiding the need for a feedback link to continuously communicate measures operated at the receiver side;
a mechanism to decide when re-estimation is needed, based on channel behavior;
a frame structure/protocol supporting the adaptivity of the whole system and allowing a practical implementation of both the channel estimation and prediction.

The approach herein presented differs from the literature as it effectively considers practical situations related both to interference and channel. In fact, these two impairments are generally considered but their temporal features are assumed as static or quasi-static. The proposed study investigates how to handle interference variations and channel variability, in terms of delay spread and amplitude coefficients. The resulting link optimization can also be seen as an implementation guideline for UWACs.

About the manuscript’s organization, in Section 2 we introduce the system model, including the initial interference statistics acquisition, channel statistics estimation procedures, and the CP adaptation mechanism. Section 3 describes the core of the proposed communication protocol, including the two-side contemporary processing based on Kalman filtering prediction and data transmission phase. In Section 4 numerical results and performance comparisons are presented and discussed. Finally, Section 5 concludes the paper.

2. System Model and Connection Setup

Let us refer to an underwater link between two nodes, namely

u_{1}

and

u_{2}

, where data communication can be potentially bidirectional. Therefore,

u_{1}

and

u_{2}

can act as both transmitter and receiver. The transmission is considered to be OFDM-based and frame oriented, with frame length defined as

T_{f}

. By definition, each frame is then organized in slots of duration equal to

T_{s}

. Due to the time-varying nature of the underwater acoustic channel, we assume

T_{f}

to be shorter than the channel coherence time

T_{c o h}

, so that

T_{f} \leq T_{c o h}

.

The interaction between

u_{1}

and

u_{2}

is organized in two different phases, referred to as connection setup (CS) and established connection (EC), respectively, described in Figure 1. During CS, both the nodes are involved in interference and channel acquisition, including the evaluation of the statistical properties of both interference and channel, necessary for the initial communication setup and for managing the whole communication. The EC stage instead concerns data transmission and detection, with some (small) time intervals dedicated to refreshing the channel and interference statistics. The stages of CS and EC are detailed in the following:

Before starting the description of the whole signaling, we report in Table 1 all the time interval definitions we use in this work.

2.1. Interference Statistics Acquisition Stage

As previously stated, and without loss of generality, let us refer to

u_{1}

and

u_{2}

as the transmitting and receiving nodes, respectively, placed at the left and right side of the time diagram in Figure 1 (however, transmitting and receiving roles can be interchanged). The first stage of CS is the acquisition of interference (represented by the vertical blocks in Figure 1). This stage must be used by both the nodes interested in setting up the communication, and it deals with the acquisition of interference statistics that can require a long time, and even multiple frames. It is worth noting that, depending on the (possibly) different propagation conditions, the result of interference acquisition is different at the transmit and receive sides. During interference statistics acquisition, lasting

T_{a, CS}

seconds, no transmission is acted on by the nodes that remain in listening mode. Hence, the continuous-time received signal is:

r_{a, CS}^{(u_{i})} (t) = z^{(u_{i})} (t) = w^{(u_{i})} (t) + χ^{(u_{i})} (t), i = 1, 2

(1)

where

w^{(u_{i})} (t)

is the zero mean

N_{0}

-variance Additive White Gaussian noise at node

u_{i}

and

χ^{(u_{i})} (t)

is the possibly present interference due to acoustic sources (e.g., mammals and/or ship engines) plus the ambient noise described in [40]. Moreover, the term

χ^{(u_{i})} (t)

in Equation (1) can be detailed as follows:

χ^{(u_{i})} (t) = \sum_{n = 1}^{N_{I}} ψ_{n} (t) * h_{n, u_{i}} (t)

(2)

with

ψ_{n} (t)

being the n-th interfering source (in a number of

N_{I}

), * the convolution operator and

h_{n, u_{i}} (t)

the impulse response describing the channel from the n-th interference source to the node

u_{i}

. It is important to underline that both

χ_{n} (t)

and

h_{n, u_{i}} (t)

change, frame by frame, due to sea stream changes, the Doppler effect and multipath changes. This is the reason for performing long acquisition during the connection setup.

Furthermore, it is important to emphasize that, at regime, the time-varying nature of interference may be considered limited within multiple frames, thus meaning that statistical features do not change among some consecutive frames, let us say in a number of

Q_{a}

. At first sight, such an assumption may appear unreasonable. In fact, while for periodic and quasi-periodic interferences (e.g., engine of a ship) it is simple to prove that statistics do not change within a certain number of frames, a really different scenario is met when dealing, for instance, with mammal sounds. However, in this regard, the analysis reported in [41] proves that sound capture lasting in the order of hundreds of milliseconds allows a reliable collection of the statistical features of several interference sources, including also the sporadic ones (e.g., mammals).

So, during the first

T_{a, CS} = Q_{a} T_{f}

seconds, both the nodes can proceed with the estimation of interference autocorrelation by simply correlating the

T_{c}

-sampled received signal as follows:

c_{z}^{(C S)} [m] = \frac{1}{2 K_{a, CS} - 1} \sum_{p = 1}^{K_{a, CS}} r_{a, CS}^{*} [p] r_{a, CS} [p + m]

(3)

where the symbol

^{*}

means conjugation and

K_{a, CS} = Q_{a} T_{f} / T_{c}

is the length of autocorrelation. We remark that Equation (3) generically refers to the interference acquired by any of the considered nodes, but, of course, the measured values may not be the same due to different propagation conditions (that is,

h_{n, u_{1}} (t) \neq h_{n, u_{2}} (t)

, as detailed in Equation (2)). As the interference changes in time, statistics must be necessarily refreshed. Therefore, during EC, a time interval

T_{a, EC} = M_{a} T_{s}

is reserved for autocorrelation update by using Equation (3), with

M_{a}

being the number of frame slots reserved for such an operation. Given

K_{a, EC} = M_{a} T_{s} / T_{c}

as the number of samples considered for the update, we have:

c_{z}^{(E C)} [m] = \frac{1}{2 K_{a, CS} - 1} \sum_{p = 1}^{K_{a, CS}} r_{a}^{(μ) *} [p] r_{a}^{(μ)} [p + m]

(4)

being

r_{a}^{(μ)} [p]

defined in the vector format as:

r_{a}^{(μ)} = [r_{a}^{(μ - 1)} (K_{a} : K_{a, CS}) r_{a}]

(5)

where the superscript

(μ)

indicates the ordered number of updating procedure, the position

r_{a}^{(0)} = r_{a, CS}

is assumed, and

r_{a}^{(μ - 1)} (K_{a} : K_{a, CS})

refers to the elements of the vector

r_{a}^{(μ)}

ranging from the

K_{a}

-th till to the

K_{a, CS}

-th one.

Interference acquisition during CS may last several frames since sufficiently large statistics must be collected to reliably initialize the communication. On the other hand, interference acquisition at EC stage is performed to achieve a partial update of the statistics available from CS. So, its duration is only limited to some frame slots. A different way of defining Equation (5) is that the new acquisition of interference

r_{a}

(that is, the sampled version of the signal already described in Equation (1) during CS stage) gathers the most recent

K_{a}

samples that update the vector

r_{a}^{(μ)}

, while the oldest

K_{a}

ones are removed from the sequence

r_{a}^{(μ - 1)}

.

2.2. Channel Model, Estimation and Statistics Evaluation

After interference acquisition, both the nodes must proceed with channel estimation as the second phase of CS, depicted in Figure 1. So, node

u_{1}

starts sending some known pilot symbols to

u_{2}

, that replies with an identical transmission back to

u_{1}

. The pilot is modeled as:

x (t) = \sqrt{E_{s_{0}}} s (t)

(6)

where

E_{s_{0}}

is the associated energy and

s (t)

is the equivalent baseband signal given by:

s (t) = \frac{1}{\sqrt{2 π α}} e^{\frac{- {(t - \frac{T_{s}}{2})}^{2}}{2 γ}}, 0 \leq t \leq T_{s}

(7)

where

γ

takes care of signal shape (and, consequently, its bandwidth), while

α

is set so to have a unit energy signal

s (t)

, since the signal

x (t)

in Equation (6) has

E_{s_{0}}

energy. The pilot length is supposed to be equal to

T_{s}

, with

s (t)

being a single carrier signal, so as to allow the estimation of the whole channel. The choice of a Gaussian-like shape, described in Equation (7), is not mandatory and depends on the transmitting device capability to generate different shaping signals. Changing the signal shape (from Gaussian-like to Nyquist-like) would reflect on performance variations that are negligible if the bandwidth and transmit power levels do not change. If the term

γ

is sufficiently large, the signal is wide in bandwidth, since in the time domain it presents quick amplitude variations. At this point, such single-carrier pulse sent for estimation risks appearing a bit counter-intuitive since we base the communication on OFDM. However, by using such a pilot, we focus on a whole channel impulse response in order to optimize the cyclic prefix. Additionally, through the use of filter-bank (equating the number of OFDM sub-carriers) we are able to estimate the time response of each sub-channel. This scheme does not avoid the frequency domain description of the channel through the Discrete Fourier Transform (DFT).

With such bidirectional pilots transmission, each node can perform its own channel estimation. Especially regarding multipath, signal propagation along the two communication directions is different, that is, the channel is, in general, not reciprocal [42]. As a consequence, channel estimates available at the link sides may be different as well. However, since the largest part of channel energy is generally carried by the first path [43], reciprocity can be reasonably assumed. Moreover, what is important to underline is that, when reciprocity cannot be assumed, the channels are different. In other words, forward and backward channels can be different. However, it is highly probable that the differences do not impact on performance in a severe way. Hence, the conclusion is that reciprocity is worth it, especially when very particular propagation scenarios are not present [42], like, for example, obstacles that reflect a signal in a specific direction and block the signal in the opposite direction.

In order to describe the received signal during pilot transmission, let us now introduce the channel model. In detail, the underwater acoustic channel is typically affected by a frequency-selective fading phenomenon that scatters the energy of the transmitted signal over a (usually) not so small number of paths generated by reflections from ground and sea surfaces, or due to propagation effects that can be described as curved rays. By also taking into account its time-variability, the channel can be modeled according to the following expression [40]:

h (t; τ) ≜ \sum_{d = 1}^{ρ (t)} β_{d} (t) a_{d} δ (t - τ_{d} (t)) e^{j 2 π ν_{d} t}

(8)

where

a_{d}

is the complex coefficient accounting for losses over the d-th path [40],

ρ (t)

is the time-varying number of paths, each one characterized by a propagation delay

τ_{d} (t)

. Furthermore,

β_{d} (t)

considers the shadowing effect of sea stream, dives, and fish schooling, according to [44], while

ν_{d} = v cos (ϕ_{d}) f_{c} / v_{c}

takes care of the Doppler effect through the direction

ϕ_{d}

, with v being the node speed,

v_{c}

the speed of sound and

f_{c}

the reference frequency.

For some time intervals and propagation scenarios, the channel can be considered time-invariant. As outlined at the beginning of this section, we assumed

T_{f} \leq T_{c o h}

to reasonably meet such conditions. Hence, we have

ρ_{p} (t) = ρ

within a frame, and the received analog signal obtained from the transmission of a pilot can be represented as:

r_{e} (t) = \sum_{d = 1}^{ρ} β_{i} d (t) a_{d} x (t - τ_{d}) e^{j 2 π ν_{d} t} + z (t)

(9)

that represents an exhaustive model, since it includes several propagation effects, from multipath to shadowing, due to obstacles or fish schooling.

2.3. Channel Statistics Estimation

During CS, channel statistics estimation may take a long time to achieve reliable information. Indeed, transmitting several symbols in a short time interval does not allow for observation of remarkable channel changes. On the other hand, using long guard intervals (in the order of the frame time

T_{f}

) between two consecutive pilots allows the receiving nodes to acquire more significant channel statistics. In principle, there are several ways to estimate the channel coefficients starting from the received signal in Equation (9). Based on the a priori knowledge of the nodes about the signal shape

x (t)

, the channel autocorrelation can be retrieved from an observation time

T_{e, CS} = Q_{e} T_{f}

, with

Q_{e}

, being the number of frames spent for channel acquisition. In fact, similar to the interference statistics estimation in Equation (3), the channel autocorrelation can be evaluated as:

c_{h}^{(C S)} [m] = \frac{1}{E_{s 0} (2 K_{e, CS} - 1)} \sum_{p = 1}^{K_{e, CS}} r_{e, CS}^{*} [p] r_{e, CS} [p + m]

(10)

with

K_{e, CS} = Q_{e} T_{f} / T_{c}

being the number of samples used for estimation.

Analogously to the interference autocorrelation estimation, during EC, channel autocorrelation, updates can be performed exploiting a shorter time interval

T_{e, EC} = M_{e} T_{s}

(

M_{e}

, being the number of employed pilots/frame slots) and, according to:

c_{h}^{(E C)} [m] = \frac{1}{E_{s 0} (2 K_{e, CS} - 1)} \sum_{p = 1}^{K_{e, CS}} r_{e}^{(μ) *} [p] r_{e}^{(μ)} [p + m]

(11)

being

r_{e}^{(μ)} [p]

defined in vector format as:

r_{e}^{(μ)} = [r_{e}^{(μ - 1)} (K_{e} : K_{e, CS}) r_{e}],

(12)

where

K_{e} = M_{e} T_{s} / T_{c}

is the number of most recent autocorrelation samples. Similarly to the analysis reported in Equation (5), the superscript

(μ)

in Equation (12) indicates the number of updating procedure, the position

r_{e}^{(0)} = r_{e, CS}

is assumed, and

r_{e}^{(μ - 1)} (K_{e} : K_{e, CS})

refers to the elements of the vector

r_{e}^{(μ)}

ranging from the

K_{e}

-th till to the

K_{e, CS}

-th one. Finally,

r_{e}

defines the new estimation used for updating the channel autocorrelation, that is the sampled version of Equation (9).

2.4. Cyclic Prefix Length Adaptive Tuning

The cyclic prefix is a fundamental element in OFDM, allowing the system to be (hopefully) ISI- and ICI-free [45]. Due to possible relative motion between nodes, as well as sea stream changes, the channel results are unavoidably time-varying over consecutive frames. Therefore, the use of a fixed CP length during the whole communication may be ineffective to counterbalance multipath. On the other hand, the capability of tuning the CP length allows the transmission to be adapted to the propagation conditions.

In this regard, we have the number of sub-carriers

N_{S C}

employed for OFDM signaling defining the symbol length

T_{x} = 1 / Δ_{f}

, where

Δ f

is the sub-channel width obtained by partitioning the whole system bandwidth B in

N_{S C}

portions, so that

Δ_{f} = B / N_{S C}

. However, when CP is considered, the whole OFDM symbol time becomes

T_{s} = T_{x} + T_{C P}

, where

T_{C P}

is the time length of cyclic prefix that must be set according to the channel delay spread

τ_{d s}

. So, the OFDM symbol length corresponds to the slot time length

T_{s}

within a communication frame. Here, we recall that the delay spread is a measure of the length of the channel response, that is, the time difference between the earliest channel path and the last one. Hence, analytically speaking,

T_{C P}

value can be a multiple integer of delay spread. Of course, the use of CP unavoidably causes transmission rate reduction. In order to provide a theoretical example, if the system must be set up to grant a rate of R bits/s, the information bits must be allocated so that

\sum_{c = 1}^{N_{S C}} b_{c} = R T_{s}

, where

b_{c}

is the number of bits allocated on the c-th sub-channel. The presence of CP entails

T_{C P} > 0

, leading

T_{s}

to increase. From this fact, it appears evident that when the OFDM symbol length grows, due to delay spread (cyclic prefix), the use of richer modulation formats is required in the sub-channels to achieve the target data rate. As a consequence, higher power must be spent in order to guarantee robustness to ISI and ICI. So, communication reliability is paid for in terms of power efficiency.

The channel knowledge is the key to estimate the delay spread, that can be obtained by measuring the memory of the channel; that is, how long the effect of a signal emission is present at the receiver. Based on Equation (9), describing the pilot received during the estimation phase, it suffices to measure the cross-correlation between the emitted signal and the received one in order to test the signal time spread due to propagation. Analytically speaking, the cross-correlation is defined as:

y_{r_{e} s} [m] = \frac{1}{E_{s 0} (2 T_{s} / T_{c} - 1)} \sum_{p = 1}^{T_{s} / T_{c}} r_{e}^{(μ) *} [p] s [p + m]

(13)

where

y_{r_{e} s} [0]

corresponds to the signal energy on the first path. Then, by means of a threshold

ϑ

,

0 < ϑ < 1

, we can also define a reference energy level expected for the signal on the last path, expressed as a fraction,

y_{r_{e} s} [0]

. In other words, once

y_{r_{e} s} [0]

is selected as the cross-correlation value referred to the first path, we define, as delay spread, the time to achieve a percentage in terms of cross energy equal to

ϑ y_{r_{e} s} [0]

. Hence, formally we have:

{\tilde{τ}}_{d s} = T_{c} arg min_{m = 1, . . ., T_{s} / T_{c}} | y_{r_{e} s} [m] - ϑ y_{r_{e} s} [0] |

(14)

returns the estimate of delay spread.

3. Established Connection Phase

As stated before, the channel is assumed to be static within a frame. During the EC stage, we can have two different types of frames. The first contains only data, thus, the time spent for data transmission,

T_{d} = M_{d} T_{s}

, with

M_{d}

being the number of data symbols per frame, equates to the entire frame length; that is

T_{f} = T_{d}

(note that, since data are transmitted only during EC, we neglect the use of the corresponding subscript on

T_{d}

in order to simplify the notation). Such a frame can be easily recognized, highlighted in green in Figure 1. In this sense, the transmission of frames is continuous, while the channel prediction mechanism proceeds in the background. However, still from Figure 1, it is possible to appreciate that some frames are different, since they are organized in three different fields. The first one, of

T_{a, EC} = M_{a} T_{s}

-length, is dedicated to the interference acquisition. The second one, of

T_{e, EC} = M_{e} T_{s}

-length, is used for channel estimation purposes, while the last field, of

T_{d}

-length, is dedicated to sending information data. The whole time duration of a frame is

T_{f} = T_{a, EC} + T_{e, EC} + T_{d}

. Hence, for a fixed

T_{f}

, spending time on interference and channel statistics update unavoidably leads the data transmission time to be reduced within a frame.

3.1. Channel Re-Estimation and Prediction

The channel estimation can be performed in the discrete frequency domain. Referring to the model presented in Equation (9), by means of the DFT, the received pilot signal is:

R_{k} = H_{k} X_{k} + Z_{k} 0 \leq k \leq N_{S C} - 1

(15)

where

X_{k}

and

Z_{k}

are the DFT of the sampled version of

x (t)

and

z (t)

, respectively,

H_{k}

is the channel representation in the discrete frequency domain, and the number of samples to compute DFT is exactly the number of sub-channels

N_{S C}

. According to the orthogonal projection lemma criterion, detailed in [46], the Minimum Mean Square Error estimation of the channel can be obtained via the following relationship:

{\tilde{H}}_{k} = R_{k} \frac{X_{k}^{*} {| H_{k} |}^{2}}{E_{s 0} {| H_{k} |}^{2} + Z_{k}^{2}} 1 \leq k \leq N_{D F T}

(16)

where, we recall,

E_{s 0}

is the energy associated to the training pilot, while

Z_{k}^{2}

is the effect of disturbing signal

z (t)

. By observing Equation (16), we can argue that the

| H_{k} |^{2}

values are not available, since

H_{k}

are the quantities to be estimated. This problem is solved by posing

| H_{k} |^{2} = 1

, that leads to reliable estimates if

E_{s 0} > Z_{k}^{2}

.

Starting from Equation (16), the channel evolution can be described by a V-order AR model [47,48,49] as follows:

h_{k} (ℓ) = Ψ h_{k} (ℓ - 1) + z (ℓ)

(17)

where

h_{k} (ℓ)

is a vector describing the channel coefficients related to the k-th sub-carrier, and

Ψ

is the

[V \times V]

matrix related to the V-order AR model [47,48,49]. Based on such representation, we resort to Kalman filtering [50] to predict the channel evolution.

A possible drawback of this approach concerns the system complexity. In fact, tracking the channel evolution via Kalman filtering is not so costly for a single sub-channel, while it may become significant if a huge number of samples is considered. However, it is important to note that, at this stage, once the number of sub-carriers

N_{S C}

is chosen, the Kalman-based prediction operates exclusively on

N_{S C}

samples. So, the matrix dimensions used for handling the Kalman filtering is

[V N_{S C} \times V N_{S C}]

and, even though it may appear to lead to a huge processing cost, it is composed of all

[V \times V]

matrices on the diagonal, hence, presenting

V^{2} (N_{S C}^{2} - N_{S C})

zeros. Since no matrix inversions are present in the processing, the largest part of the computational cost is related to product and sums, to be performed with a processing time in the order of frame duration (hundreds of milliseconds).

Finally, we want to highlight here that channel prediction is operated both at the transmitter and receiver sides in place of channel estimation performed with overhead pilot signaling between nodes. Hence, prediction allows the reduction of latency, which represents a crucial issue in UWACs. Before proceeding, it is important to consider some key elements. First, it must be considered that the performance of the predictor (that is, the estimation accuracy) decreases in terms of prediction error variance when the coefficient to predict (in a time sense) is far from the last performed estimation. Hence, this fact suggests that, after predicting the channel for several frames, the receiver must proceed to re-estimate the channel by means of some pilot transmissions (as detailed in Figure 1). The motivation leading to requiring estimation after some prediction is twofold. First, after the CP tuning, it is expected that the delay spread may change. Second, the memory (in the sense of channel correlation) drops, thus meaning unreliability of the prediction. This is the reason for also introducing, in Figure 1, the pilot transmission during the EC stage, so as to provide the estimation to aid prediction. Therefore, a mechanism for re-estimating the channel and feeding the predictor with new estimated values is required.

In this regard, we need to regulate the switch from channel prediction to a new channel estimation. Let us consider

V_{k}

as the length of the AR model, related to the k-th sub-channel. Please note that it is not assured that all the

V_{k}

coincide. Moreover, delay spread may change during re-estimation, being smaller or larger with respect to the previously measured one. The delay spread being (considerably) changed indicates that the re-estimation was performed too late. On the other hand, if the changes measured are not sensible, it means that the re-estimation was performed early with respect to the changes. Hence, we introduce the variable

L_{ϵ}

to evaluate the measure of delay spread changes, initialized to a high value so that

L_{ϵ} > > V_{k}

. Then, in order to switch from prediction to estimation, we consider re-estimation as performed after a number of frames

L_{a d a p t}

given by:

L_{a d a p t} = min \{min_{k} {V_{k}}, L_{ϵ}\}

(18)

thus, implicitly meaning that estimation is performed if

L_{ϵ}

frames have passed, or if the prediction becomes unreliable for at least one sub-channel (

{min}_{k} {V_{k}}

). In other words, the event occurring first drives the need for a new estimation. It is important to note that, after a re-estimation is performed, if the channel delay spread has remained essentially the same, then

L_{ϵ}

is increased by a factor that we set as corresponding to 10% of its previous value. Otherwise, having delay spread as already changed suggests that we needed to re-estimate the channel earlier. Hence,

L_{ϵ}

is updated by decreasing it by 10%.

Finally, we would highlight that the proposed hybrid mechanism for channel estimation and prediction can be fruitfully exploited to deal with cross-talk mitigation, which represents a challenging issue for MIMO-OFDM systems. However, in such a more complex scenario, the sole channel equalization does not suffice to achieve good performance, since spatial ISI must also be mitigated. As a consequence, the number of channels to consider grows as well as the complexity of frequency and space equalization.

3.2. Information Data Stage and Detection

Having described the process related to interference and channel information acquisition, we now detail the data transmission stage, lasting

T_{d}

, as shown in Figure 1. Before symbol emission, bit-loading is performed. In this regard, we do not provide further details, since proposing a new mechanism was not our aim. However, it is fundamental to highlight that channel knowledge (and its reliability) is a key element to realizing bit-loading, especially when procedures based on waterfilling-like algorithms are considered [51]. Furthermore, channel knowledge is also necessary for signal detection. Specifically, we can express the OFDM data symbol as:

g (t) = \sqrt{E_{s_{0}}} \sum_{k = 0}^{N_{S C} - 1} G (k) e^{j 2 π k t Δ f}, 0 \leq t \leq T_{x}

(19)

where

T_{x}

is the above-mentioned signal length without the insertion of CP. Moreover, the

G (k)

term is related to the inverse-DFT of the symbol emitted on the k-th sub-channel. Note that symbols on different sub-channels may belong to different constellations (that is, the modulation order employed on sub-carriers may be different) if bit-loading is considered. The OFDM symbol is completed by CP insertion before transmission. If the CP length has been suitably adapted to completely avoid ISI, it follows that echoes related to the previously emitted symbol fall into the current symbol CP window, and, thus, do not affect the detection of carried data. Under such an assumption, the received analog signal related to an OFDM symbol after CP removal can be written as:

r (t) = \sum_{d = 1}^{P} β_{d} (t) a_{d} g (t - τ_{d}) e^{j 2 π ν_{d} t} + z (t), 0 \leq t \leq T_{x}

(20)

that collects the component

z (t)

related to background noise and other external interference, and the signal echoes coming from the propagation over P paths. Regarding this latter aspect, according to Equation (8), multipath was initially characterized by

ρ

paths. However, since delay spread may be longer than the OFDM symbol duration

T_{x}

, it is likely that the late echoes fall within the next symbol CP window, while early arrivals are, instead, superposed to the currently received signal, thus giving rise to auto-interference. So, in Equation (20), we refer to

P \leq ρ

as the number of secondary paths acting as interference on the current symbol.

Finally, concerning detection, the receiver computes the DFT of the signal passed through analog-to-digital conversion. Then, by processing each sub-carrier component, the symbol

\hat{G} (k)

(with k = 0, 1, …,

N_{S C} - 1

) is decided according to the Maximum Likelihood criterion (by remembering that, on different sub-channels, we can have different modulation formats) [18].

3.3. Remark—Protocol Summary and Efficiency

The scheme here presented considers that, despite the CS stage, lasting

T_{a, CS} + T_{e, CS} = Q_{a} T_{f} + Q_{e} T_{f}

seconds, being necessary, it is somewhat limiting from the point of view of transmission efficiency. Such a performance metric, referred as

η

, can be measured by considering the ratio for an assigned number of bits to be transmitted, between the time spent for an (ideal) OFDM communication with no connection setup, no estimation or prediction, and the presented case. Once set, the frame duration

T_{f}

, the number of slots composing a frame is equal to

M_{f} = T_{f} / T_{s}

. We recall from Section 2.4 that

T_{s}

corresponds also to the symbol duration, which, in turn, is dependent on the CP length

T_{C P}

. For the sake of simplicity, let us initially consider

T_{C P}

as fixed (that is, the case where CP tuning is not performed), so that

T_{s}

and

M_{f}

result can be considered fixed as well. Hence, the time requested to transmit

N_{b}

data bits in the ideal case, given R as the transmission rate, is:

T_{N_{b}}^{(i d e a l)} = \frac{N_{b}}{R}

(21)

since no overhead information is considered during the communication. On the other hand, the use of the proposed protocol makes the data transmission time given:

T_{N_{b}}^{(p r o p o s e d)} = (Q_{a} + Q_{e}) T_{f} + \frac{N_{b}}{R} + N_{o c c} (M_{a} + M_{e}) T_{f}

(22)

thus, taking into account the

(Q_{a} + Q_{e})

frames used for CS and those related to EC, while

N_{o c c}

refers to the number of times interference and channel estimation refreshing is performed, allowing a reliable prediction to take place (

M_{a}

and

M_{e}

are recalled as the number of frame slots dedicated to interference and channel statistics update during the EC stage, respectively). The proposed mechanism efficiency is calculated from the ratio between the terms in Equations (21) and (22) that, after some mathematical manipulation, is expressed as:

η^{(p r o p o s e d)} = \frac{1}{1 + \frac{R}{N_{b}} [(Q_{a} + Q_{e}) T_{f} + N_{o c c} (M_{a} + M_{e}) T_{f}]}

(23)

where it is possible to observe that

η^{(p r o p o s e d)}

increases with the amount of data to be transmitted

N_{b}

. This is due to the fact that the growth of

N_{b}

makes the CS stage duration ever shorter than the EC time.

Hence, the proposed method approaches the ideal case performance where no initial setup is considered. Furthermore, another aspect must be highlighted. By removing the initial assumption about fixed CP length, if

T_{s}

changes due to the tuning of

T_{C P}

, the whole time to transmit data also changes. Specifically, independently of

N_{b}

, the growth of

T_{C P}

leads R to decrease and, interestingly,

η^{(p r o p o s e d)}

to increase. This is because a larger time is needed to transmit the information, and this growth is more important with respect to the time spent in CS. Another interesting result is the following. From Equation (22), the CS time is invariant, since

Q_{a}

,

Q_{e}

and

T_{f}

are fixed. So, CP adaptation impacts only on data transmission time, and this is true for both the ideal and proposed cases. Moreover, Equations (21) and (22) show the same dependency on R. The only difference regarding data transmission is given by the quantity

N_{o c c} (M_{a} + M_{e})

, that is only related to the proposed mechanism. Therefore, we can conclude from Equation (23) that the time spent in CS and re-estimating cannot be considered as marginal when R is very high with respect to the data to be sent.

For the sake of comparison, we also discuss a reference case where channel estimation was systematically performed every frame, without resorting to prediction. Moreover, no CS and interference acquisition during EC were considered. Hence, at the beginning of each frame some pilots for channel estimation were sent and, due to the channel non-reciprocity, the transmitter had to wait for two-way propagation time

2 T_{p r o p}

before CSI was available. Note that

2 T_{p r o p}

is a function of the communication distance and may be very long. For example, in a 700 m link, the waiting time is about 1 s. In such a scenario, the data transmission time is calculated as:

T_{N_{b}}^{(e s t i m a t i o n - o n l y)} = \frac{N_{b}}{R} + \frac{N_{b}}{R T_{f}} (T_{s} + 2 T_{p r o p})

(24)

where the second term accounts for the time slot and propagation delay spent in transmitting the pilot symbols for channel estimation. So, based on Equations (21)–(24), we have that:

η^{(e s t i m a t i o n - o n l y)} = \frac{1}{1 + \frac{T_{s} + 2 T_{p r o p}}{T_{f}}}

(25)

represents the spectral efficiency for the case of estimation performed at each frame. Specifically, it can be appreciated from Equation (25) that, in this case, spectral efficiency did not scale with

N_{b}

, thus, meaning that it was independent of the volume of data to be transmitted. Moreover, as expected,

η^{(e s t i m a t i o n - o n l y)}

tended to zero when the propagation delay between the communicating nodes increased. So, we can conclude that the proposed mechanism, based on channel estimation and prediction results, is more convenient, in terms of efficiency, than a conventional approach, based only on the periodic channel estimation.

In order to summarize the whole mechanism, the flowchart reported in Figure 2 describes all the functional steps to be performed in the proposed communication framework. It is worth noting that, after a very large number of frames, referred as

Q_{m a x}

, where the communication is ongoing, it is reasonable to expect the channel to change significantly. In this case, it would be preferable to perform a complete refresh of interference and channel statistics by re-initializing the transmission with a new CS stage. Otherwise, as shown in Figure 2, a shorter acquisition and estimation during EC is sufficient to drive the Kalman filtering-based channel prediction.

4. Numerical Results

In this section, we present the analysis of performance related to the proposed channel prediction-based adaptive transmission scheme. Simulations were performed by merging typical parameters of UWAC systems concerning the transmitter in terms of power and bandwidth, and real data coming from measurement campaigns involving both channel and interference. Specifically, real multipath channel impulse responses, taken from the Watermark database [52], were considered to model the time-varying propagation. Channels were measured in a 740 m shallow water stretch of Oslofjorden, with sounding operated in a frequency range from 10 kHz to 18 kHz. The acquisition was performed by the authors of the measurement campaigns [52] by acquiring signals in raw data during a continuous time acquisition. Hence, consecutive time-variant channel impulse responses were measured and, due to the relative movement of transmitter and receiver, it was possible to infer that the average relative speed was 4 km/h. Furthermore, the interference generated by acoustic sources were taken from some recordings available in the literature [53] and directly added to the received signal. The results were strictly dependent on the communication bandwidth, chosen to be equal to 8 kHz. The other simulation parameters are summarized in Table 2.

One of the goals of the analysis was to prove the effectiveness of a hybrid channel prediction-estimation-based approach with respect to a conventional pure estimation mechanism. So, as further detailed, the performance comparison between two such cases is also provided.

In order to, firstly, evaluate the accuracy of channel estimation, we report, in Figure 3, the mean square error (MSE) regarding the channel tracking as a function of the length of the statistics acquisition phase during the CS.

Overall, it can be observed that, the longer the initial acquisition was (measured in terms of number of frames

Q_{e}

), the lower the MSE was. In detail, the use of only a frame (

Q_{e} = 1

) did not lead a sufficiently reliable channel estimation and tracking with Kalman filtering, while, on the other hand, spending 20 frames for statistics acquisition allowed the MSE to be reduced by one order of magnitude.

Furthermore, regarding the tracking operation, we show in Figure 4 the value of

L_{a d a p t}

in Equation (18) when different, consecutive channel realizations were considered. In particular, we discuss how the switching mechanism between tracking and estimation works. We recall that the need for re-estimation is driven by the minimum between the memory of the auto-regressive model and the measurement of delay spread changes. In our simulation, we set the auto-regressive model memory as equal to 8 (frames) and, from Figure 4, it was possible to appreciate how such a value resulted, in general, as being the most appropriate one. In fact, most times, channel re-estimation results after 7 or 8 frames after prediction were exploited instead.

However, for some channel realizations exhibiting significant changes in propagation characteristics (e.g., delay spread),

L_{a d a p t}

= 8 was too high and the accuracy of prediction might lower. Hence, a more frequent re-estimation was required. The particular case where

L_{a d a p t}

= 0 refers to the occurrence where estimation is required on two consecutive frames.

Regarding performance comparison, we considered the competitor of the proposed adaptive transmission scheme to be one employing OFDM where pilot symbols are transmitted each frame to perform estimation, without considering prediction. Moreover, we assumed that, due to the channel non-reciprocity, estimation operated only at the receiver side, with channel coefficients being sent back to the transmitter via a bipolar modulation. Specifically, the information about the channel coefficients in the frequency domain was quantized with 32 bits before being fed back to the transmitter.

During this signaling phase, potential errors might occur, impacting on bit-loading process as well.

As a performance metric, we introduced

σ

as the difference between the number of bits allocated via the ideal case (channel perfectly known at both the transmitter and the receiver) and the methods under investigation (that is, the proposed one and the frame-by-frame estimation with quantized feedback, respectively). The results are shown for two different channel realizations, referred as case (a) and (b) in Figure 5, respectively. The numerical evaluation is reported on a per OFDM sub-carrier basis, that is,

σ_{k} = b_{i d e a l} (k) - b_{p r o p o s e d} (k)

are represented with red circled markers, while

σ_{k} = b_{i d e a l} (k) - b_{e s t i m a t i o n - o n l y} (k)

are highlighted with blue markers (k = 1, 2, …,

N_{s c}

). From Figure 5, it is possible to appreciate that the mismatch between ideal and proposed bit-loading was always equal to zero in channel case (a), while only a few errors were made in case (b). Considering the scheme with frame-by-frame channel estimation, we can observe that there were several mis-allocated bits; in fact, the number of non-zero differences were 20 and 17 for cases (a) and (b), respectively.

Mis-allocation in terms of bits leads to a situation that can be problematic. In fact, if the transmitter and receiver share mismatched channel information or, even worse, different information about the modulation format, the performance, in terms of rise in error rate, rapidly lowers. Such a result can be inferred from Figure 6, where we report the BER values by considering the evolution in terms of channel realization. In this regard, we want to emphasize that, in the bit-loading procedure we set an SNR-margin [54] granting a BER value of

10^{- 6}

. By investigating Figure 6, for most of the channel realizations, target BER performances were granted by the proposed method. When this was not possible (due to time-varying propagation conditions not being perfectly compensated) the reliability lowered and BER increased to values around less than

10^{- 4}

, with the exception of very few realizations where the channel was really bad. On the other hand, OFDM transmission with frame-by-frame estimation and quantized feedback was not able to achieve those values. In fact, the average BER was a bit higher than

2 \times 10^{- 4}

, as is possible to appreciate by the distribution of blue markers in Figure 6.

Another interesting aspect to evaluate is the impact of the number of FFT points

N_{S C}

, that is, the number of considered OFDM sub-channels, on the communication performance. In this regard, by referring to different values of

N_{S C}

, we report in Table 3 the average BER evaluated among all the channel realizations, as well as the maximum BER and minimum one.

From the results, it can be emphasized that, using an increasing number of

N_{S C}

led to a reduction in the average BER. This was true for

N_{S C} \leq 128

. However, when

N_{S C} = 256

, the average BER increased. We can explain this non-monotonic behavior as follows. When the number of sub-channels increases, the frequency domain description of the channel is more accurate, and the same is true for the prediction, since variations of the frequency response are measured with a dense sampling. On the other hand, having few FFT points (low values of

N_{S C}

) leads to a poor description of the channel in the frequency domain. So, channel changes in time are badly tracked and likewise for the equalization. The reason why for

N_{S C} = 256

the average BER increased, is that the Doppler effect became more important, since the sub-channel bandwidth became narrower, and this induced mis-detection pf events. By looking at the third column of Table 3, we note that the maximum BER followed the same behavior as the average BER, and such a high value was justified by the fact that sometimes the channel can change radically and very quickly. Finally, the fourth column of the table reports the same values for the minimum BER, since that was the target considered as the constraint for bit-loading.

Furthermore, another important result must be highlighted. We recall that the proposed transmission framework considers transmitting and receiving nodes as operating separately on interference acquisition and channel estimation, without any mutual feedback. So, it may happen that the bit-loading performed at the transmit side does not match with that expected at the receiver. This fact may lead signal detection to completely fail, being based on a symbol constellation that may be different from that actually employed for transmission.

However, the results in Figure 6 demonstrate the high reliability of the proposed hybrid channel estimation-prediction method, outperforming, also, the case where frequent channel estimation operated.

Such an advantage can be finally appreciated in terms of communication efficiency as well. By referring to

η

, as defined in Equations (23)–(25), for the transmission schemes under investigation, we show in Figure 7 that, when the message length increased, the OFDM system, where frame-by-frame estimation was performed, was unable to become more efficient, since the amount of pilot symbols to be sent increased with the number of transmitted frames. On the other hand, the proposed system paid off, in terms of efficiency, when the file was very short, since the time spent acquiring the statistics (CS stage) was long with respect to the transmission of a few bytes (10–30). However, the time spent in CS lost importance when the amount of information increased, as the largest part of the time was spent sending data (with an additional small percentage involved in re-estimating the channel).

5. Conclusions

In this contribution, we proposed an adaptive OFDM scheme, in which interference and channel statistics are initially acquired. Subsequently, in place of proceeding with very frequent channel estimation, as considered in most of the literature, we propose a mechanism to predict the channel and also a protocol to rule the re-estimation of the channel when necessary. This mechanism also solves the problem related to potentially outdated channel information due to long propagation delays. Simulation results demonstrated that the proposed approach is able to provide reliable channel tracking, also reflected in high performance in terms of BER and rate. Moreover, the designed communication protocol avoids the exchange of overhead information between transmitting and receiving nodes, thus, allowing the achievement of highly efficient communication, especially when the amount of data to be transmitted is large.

Author Contributions

Conceptualization, M.B.; Methodology, R.C.; Software, M.B.; Investigation, G.S.; Writing—original draft, A.P.; Writing—review & editing, A.P.; Supervision, M.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the European Union under the Italian National Recovery and Resilience Plan (NRRP) of NextGenerationEU, partnership on “Telecommunications of the Future” (PE0000001-program “RESTART”).

Conflicts of Interest

The authors declare no conflict of interest.

References

Jouhari, M.; Ibrahimi, K.; Tembine, H.; Ben-Othman, J. Underwater Wireless Sensor Networks: A Survey on Enabling Technologies, Localization Protocols, and Internet of Underwater Things. IEEE Access 2019, 7, 96879–96899. [Google Scholar] [CrossRef]
Zhang, D.; N’Doye, I.; Ballal, T.; Al-Naffouri, T.Y.; Alouini, M.S.; Laleg-Kirati, T.M. Localization and Tracking Control Using Hybrid Acoustic–Optical Communication for Autonomous Underwater Vehicles. IEEE Internet Things J. 2020, 7, 10048–10060. [Google Scholar] [CrossRef]
Munafo, A.; Canepa, G.; LePage, K.D. Continuous Active Sonars for Littoral Undersea Surveillance. IEEE J. Ocean. Eng. 2019, 44, 1198–1212. [Google Scholar] [CrossRef]
Petroni, A.; Biagi, M.; Colonnese, S.; Cusani, R.; Scarano, G. Vessels traffic estimation through image processing applied to acquisitions by hydrophones. In Proceedings of the OCEANS 2015-Genova, Genova, Italy, 18–21 May 2015; pp. 1–4. [Google Scholar] [CrossRef]
Wang, K.; Gao, H.; Xu, X.; Jiang, J.; Yue, D. An Energy-Efficient Reliable Data Transmission Scheme for Complex Environmental Monitoring in Underwater Acoustic Sensor Networks. IEEE Sens. J. 2016, 16, 4051–4062. [Google Scholar] [CrossRef]
Zhang, Y.; Chen, Y.; Zhou, S.; Xu, X.; Shen, X.; Wang, H. Dynamic Node Cooperation in an Underwater Data Collection Network. IEEE Sens. J. 2016, 16, 4127–4136. [Google Scholar] [CrossRef]
Zhou, J.; Jiang, H.; Wu, P.; Chen, Q. Study of Propagation Channel Characteristics for Underwater Acoustic Communication Environments. IEEE Access 2019, 7, 79438–79445. [Google Scholar] [CrossRef]
Sun, W.; Wang, Z. Online Modeling and Prediction of the Large-Scale Temporal Variation in Underwater Acoustic Communication Channels. IEEE Access 2018, 6, 73984–74002. [Google Scholar] [CrossRef]
Campagnaro, F.; Toffolo, N.; Zorzi, M. Modeling Acoustic Channel Variability in Underwater Network Simulators from Real Field Experiment Data. Electronics 2022, 11, 2262. [Google Scholar] [CrossRef]
Czapiewska, A.; Luksza, A.; Studanski, R.; Zak, A. Analysis of Impulse Responses Measured in Motion in a Towing Tank. Electronics 2022, 11, 3819. [Google Scholar] [CrossRef]
Baggeroer, A. Acoustic telemetry An overview. IEEE J. Ocean. Eng. 1984, 9, 229–235. [Google Scholar] [CrossRef]
Yang, G.; Wang, L.; Qiao, P.; Liang, J.; Chen, T. Joint Multiple Turbo Equalization for Harsh Time-Varying Underwater Acoustic Channels. IEEE Access 2021, 9, 82364–82372. [Google Scholar] [CrossRef]
Scarano, G.; Petroni, A.; Biagi, M.; Cusani, R. Blind Fractionally Spaced Channel Equalization for Shallow Water PPM Digital Communications Links. Sensors 2019, 19, 4604. [Google Scholar] [CrossRef] [Green Version]
Su, W.; Lin, J.; Chen, K.; Xiao, L.; En, C. Reinforcement Learning-Based Adaptive Modulation and Coding for Efficient Underwater Communications. IEEE Access 2019, 7, 67539–67550. [Google Scholar] [CrossRef]
Huang, J.; Diamant, R. Adaptive Modulation for Long-Range Underwater Acoustic Communication. IEEE Trans. Wirel. Commun. 2020, 19, 6844–6857. [Google Scholar] [CrossRef]
Basavaraju, P.H.; Lokesh, G.H.; Mohan, G.; Jhanjhi, N.Z.; Flammini, F. Statistical Channel Model and Systematic Random Linear Network Coding Based QoS Oriented and Energy Efficient UWSN Routing Protocol. Electronics 2022, 11, 2590. [Google Scholar] [CrossRef]
Radosevic, A.; Ahmed, R.; Duman, T.M.; Proakis, J.G.; Stojanovic, M. Adaptive OFDM Modulation for Underwater Acoustic Communications: Design Considerations and Experimental Results. IEEE J. Ocean. Eng. 2014, 39, 357–370. [Google Scholar] [CrossRef]
Murad, M.; Tasadduq, I.A.; Otero, P. Pilot-Assisted OFDM for Underwater Acoustic Communication. J. Mar. Sci. Eng. 2021, 9, 1382. [Google Scholar] [CrossRef]
Tonello, A.M.; D’Alessandro, S.; Lampe, L. Cyclic Prefix Design and Allocation in Bit-Loaded OFDM over Power Line Communication Channels. IEEE Trans. Commun. 2010, 58, 3265–3276. [Google Scholar] [CrossRef]
Cobacho-Ruiz, P.; Cañete, F.J.; Martos-Naya, E.; Fernández-Plazaola, U. OFDM System Design for Measured Ultrasonic Underwater Channels. Sensors 2022, 22, 5703. [Google Scholar] [CrossRef]
Chang, Y.P.; Lemmens, P.; Tu, P.M.; Huang, C.C.; Chen, P.Y. Cyclic Prefix Optimization for OFDM Transmission over Fading Propagation with Bit-Rate and BER Constraints. In Proceedings of the 2011 Second International Conference on Innovations in Bio-inspired Computing and Applications, Shenzhen, China, 16–18 December 2011; pp. 29–32. [Google Scholar] [CrossRef]
Naderi, S.; Costa, D.B.d.; Arslan, H. Channel Randomness-Based Adaptive Cyclic Prefix Selection for Secure OFDM System. IEEE Wirel. Commun. Lett. 2022, 11, 1220–1224. [Google Scholar] [CrossRef]
Yasong, L.; Shengliang, H.; Chengxu, F.; Jijin, T. Power optimization algorithm for OFDM underwater acoustic communication using adaptive channel estimation. J. Syst. Eng. Electron. 2019, 30, 662–671. [Google Scholar] [CrossRef] [Green Version]
Zhang, R.; Ma, X.; Wang, D.; Yuan, F.; Cheng, E. Adaptive Coding and Bit-Power Loading Algorithms for Underwater Acoustic Transmissions. IEEE Trans. Wirel. Commun. 2021, 20, 5798–5811. [Google Scholar] [CrossRef]
Mangione, S.; Galioto, G.E.; Croce, D.; Tinnirello, I.; Petrioli, C. A Channel-Aware Adaptive Modem for Underwater Acoustic Communications. IEEE Access 2021, 9, 76340–76353. [Google Scholar] [CrossRef]
Hegazy, R.; Kadifa, J.; Milstein, L.; Cosman, P. Subcarrier Mapping for Underwater Video Transmission Over OFDM. IEEE J. Ocean. Eng. 2021, 46, 1408–1423. [Google Scholar] [CrossRef]
Tasadduq, I.A.; Murad, M.; Otero, P. CPM-OFDM Performance over Underwater Acoustic Channels. J. Mar. Sci. Eng. 2021, 9, 1104. [Google Scholar] [CrossRef]
Ashri, R.; Shaban, H.; El-Nasr, M. A Novel Fractional Fourier Transform-Based ASK-OFDM System for Underwater Acoustic Communications. Appl. Sci. 2017, 7, 1286. [Google Scholar] [CrossRef] [Green Version]
Khan, M.R.; Das, B.; Pati, B.B. Channel estimation strategies for underwater acoustic (UWA) communication: An overview. J. Frankl. Inst. 2020, 357, 7229–7265. [Google Scholar] [CrossRef]
Cho, Y.H.; Ko, H.L. Channel Estimation Based on Adaptive Denoising for Underwater Acoustic OFDM Systems. IEEE Access 2020, 8, 157197–157210. [Google Scholar] [CrossRef]
Liu, D.N.; Yerramalli, S.; Mitra, U. On Efficient Channel Estimation for Underwater Acoustic OFDM Systems. In Proceedings of the Fourth ACM International Workshop on UnderWater Networks, New York, NY, USA, 3 November 2009. [Google Scholar] [CrossRef]
Qiao, G.; Liu, L.; Ma, L. Analysis of Outdated Channel State Information in Underwater Acoustic Downlink OFDMA System. In Proceedings of the 2018 OCEANS—MTS/IEEE Kobe Techno-Oceans (OTO), Kobe, Japan, 28–31 May 2018; pp. 1–5. [Google Scholar] [CrossRef]
Liu, B.; Jia, N.; Huang, J.; Guo, S.; Xiao, D.; Ma, L. Autoregressive model of an underwater acoustic channel in the frequency domain. Appl. Acoust. 2022, 185, 108397. [Google Scholar] [CrossRef]
Radosevic, A.; Duman, T.M.; Proakis, J.G.; Stojanovic, M. Channel prediction for adaptive modulation in underwater acoustic communications. In Proceedings of the OCEANS 2011 IEEE-Spain, Santander, Spain, 6–9 June 2011; pp. 1–5. [Google Scholar] [CrossRef] [Green Version]
Liu, L.; Cai, L.; Ma, L.; Qiao, G. Channel State Information Prediction for Adaptive Underwater Acoustic Downlink OFDMA System: Deep Neural Networks Based Approach. IEEE Trans. Veh. Technol. 2021, 70, 9063–9076. [Google Scholar] [CrossRef]
Zhang, Y.; Venkatesan, R.; Dobre, O.A.; Li, C. Efficient Estimation and Prediction for Sparse Time-Varying Underwater Acoustic Channels. IEEE J. Ocean. Eng. 2020, 45, 1112–1125. [Google Scholar] [CrossRef]
Biagi, M.; Rinauro, S.; Cusani, R. Channel estimation or prediction for UWA? In Proceedings of the 2013 MTS/IEEE OCEANS-Bergen, Bergen, Norway, 10–14 June 2013; pp. 1–7. [Google Scholar] [CrossRef]
Khan, A.U.; Choi, W.; Sambo, Y.; Imran, M. Soft-Output Deep-LAS Detection for Coded MIMO System: A Learning-Aided LLR Approximation. IEEE Techrxiv 2022. [Google Scholar] [CrossRef]
Biagi, M.; Petroni, A.; Colonnese, S.; Cusani, R.; Scarano, G. On Rethinking Cognitive Access for Underwater Acoustic Communications. IEEE J. Ocean. Eng. 2016, 41, 1045–1060. [Google Scholar] [CrossRef]
Stojanovic, M.; Preisig, J. Underwater acoustic communication channels: Propagation models and statistical characterization. IEEE Commun. Mag. 2009, 47, 84–89. [Google Scholar] [CrossRef]
Fristrup, K.M.; Watkins, W.A. Marine Animal Sound Classification; Woods Hole Oceanographic Institution: Falmouth, MA, USA, 1993. [Google Scholar]
Petroni, A.; Pergoloni, S.; Ko, H.L.; Im, T.H.; Cho, Y.H.; Cusani, R.; Scarano, G.; Biagi, M. Channel reciprocity analysis for bi-directional shallow water acoustic communications. In Proceedings of the OCEANS 2017-Anchorage, Anchorage, AK, USA, 18–21 September 2017; pp. 1–5. [Google Scholar]
Chen, P.; Rong, Y.; Nordholm, S.; He, Z.; Duncan, A.J. Joint Channel Estimation and Impulsive Noise Mitigation in Underwater Acoustic OFDM Communication Systems. IEEE Trans. Wirel. Commun. 2017, 16, 6165–6178. [Google Scholar] [CrossRef]
Domingo, M.C. Overview of channel models for underwater wireless communication networks. Phys. Commun. 2008, 1, 163–182. [Google Scholar] [CrossRef]
Hara, S.; Prasad, R. Multicarrier Techniques for 4G Mobile Communications; Artech House, Inc.: Norwood, MA, USA, 2003. [Google Scholar]
Poor, H. An Introduction to Signal Detection and Estimation; Springer Texts in Electrical Engineering; Springer: New York, NY, USA, 1998. [Google Scholar]
Eggen, T.; Baggeroer, A.; Preisig, J. Communication over Doppler spread channels. Part I: Channel and receiver presentation. IEEE J. Ocean. Eng. 2000, 25, 62–71. [Google Scholar] [CrossRef]
Polprasert, C.; Ritcey, J.A.; Stojanovic, M. Capacity of OFDM Systems Over Fading Underwater Acoustic Channels. IEEE J. Ocean. Eng. 2011, 36, 514–524. [Google Scholar] [CrossRef] [Green Version]
Qarabaqi, P.; Stojanovic, M. Modeling the large scale transmission loss in underwater acoustic channels. In Proceedings of the 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 28–30 September 2011; pp. 445–452. [Google Scholar] [CrossRef]
Grewal, M.S.; Andrews, A.P. Kalman Filtering: Theory and Practice; Prentice-Hall, Inc.: Hoboken, NJ, USA, 1993. [Google Scholar]
Baccarelli, E.; Biagi, M. Optimal integer bit-loading for multicarrier ADSL systems subject to spectral-compatibility limits. Signal Process. 2004, 84, 729–741. [Google Scholar] [CrossRef]
van Walree, P.A.; Socheleau, F.X.; Otnes, R.; Jenserud, T. The Watermark Benchmark for Underwater Acoustic Modulation Schemes. IEEE J. Ocean. Eng. 2017, 42, 1007–1018. [Google Scholar] [CrossRef] [Green Version]
Barisic, M.; Vukic, Z.; Miskovic, N.; Nagy, G. Developing the Croatian Underwater Robotics Research Potential. IFAC Proc. Vol. 2006, 43, 431–436. [Google Scholar] [CrossRef] [Green Version]
Garcia-Armada, A. SNR gap approximation for M-PSK-Based bit loading. IEEE Trans. Wirel. Commun. 2006, 5, 57–60. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Graphical description of the three phases for interference and channel estimation and data transmission/detection during CS and ES.

Figure 2. Protocol summary for CS and EC scenarios.

Figure 3. MSE for different lengths of acquisition phase time.

Figure 4. Values of

L_{a d a p t}

in Equation (18) when different channel realizations are considered.

Figure 4. Values of

L_{a d a p t}

in Equation (18) when different channel realizations are considered.

Figure 5. Difference between ideal bit-loading and the proposed scheme compared with the difference between ideal bit-loading and quantized information feedback link in channel realizations (a,b).

Figure 6. BER for different channel realizations for the proposed scheme and the case with frame-by-frame estimation and quantized feedback.

Figure 7. Efficiency for different lengths of data to be sent for the proposed scheme and the case with frame-by-frame estimation with quantized feedback.

Table 1. List of variables related to time intervals.

Symbol	Definition
$T_{f}$	Transmission frame length
$T_{s}$	Slot time of a frame, equating the OFDM symbol length (including CP)
$T_{c}$	Sampling time
$T_{x}$	OFDM symbol length (without CP)
$T_{C P}$	OFDM CP length
$T_{a, CS}$	Interference acquisition time during CS
$T_{a, EC}$	Interference acquisition time during EC
$T_{e, CS}$	Channel acquisition time during CS
$T_{e, EC}$	Channel acquisition time during EC
$T_{d}$	Frame portion of time dedicated to data transmission during EC

Table 2. Simulation parameters.

Bandwidth (B)	8 kHz
OFDM sub-carriers ( $N_{s c}$ )	128
Transmit power ( $P_{t x}$ )	183 dB @ 8V
Noise variance ( $N_{0}$ )	1.2 × 10⁻²³ W/Hz
Frame duration ( $T_{f}$ )	20 ms
Slot duration ( $T_{s}$ )	125 μs
Interference acquisition frames ( $Q_{a}$ )	10
Channel acquisition frames ( $Q_{e}$ )	10

Table 3. BER performance as a function of

N_{S C}

.

Table 3. BER performance as a function of

N_{S C}

.

FFT Points	Average BER	Maximum BER	Minimum BER
$N_{S C}$ = 16	$9.7 \times 10^{- 6}$	$5.6 \times 10^{- 2}$	$10^{- 6}$
$N_{S C}$ = 32	$6.1 \times 10^{- 6}$	$2.3 \times 10^{- 2}$	$10^{- 6}$
$N_{S C}$ = 64	$4.7 \times 10^{- 6}$	$1.4 \times 10^{- 2}$	$10^{- 6}$
$N_{S C}$ = 128	$3.6 \times 10^{- 6}$	$8.7 \times 10^{- 3}$	$10^{- 6}$
$N_{S C}$ = 256	$4.3 \times 10^{- 6}$	$1.1 \times 10^{- 2}$	$10^{- 6}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Petroni, A.; Scarano, G.; Cusani, R.; Biagi, M. On the Effect of Channel Knowledge in Underwater Acoustic Communications: Estimation, Prediction and Protocol. Electronics 2023, 12, 1552. https://doi.org/10.3390/electronics12071552

AMA Style

Petroni A, Scarano G, Cusani R, Biagi M. On the Effect of Channel Knowledge in Underwater Acoustic Communications: Estimation, Prediction and Protocol. Electronics. 2023; 12(7):1552. https://doi.org/10.3390/electronics12071552

Chicago/Turabian Style

Petroni, Andrea, Gaetano Scarano, Roberto Cusani, and Mauro Biagi. 2023. "On the Effect of Channel Knowledge in Underwater Acoustic Communications: Estimation, Prediction and Protocol" Electronics 12, no. 7: 1552. https://doi.org/10.3390/electronics12071552

APA Style

Petroni, A., Scarano, G., Cusani, R., & Biagi, M. (2023). On the Effect of Channel Knowledge in Underwater Acoustic Communications: Estimation, Prediction and Protocol. Electronics, 12(7), 1552. https://doi.org/10.3390/electronics12071552

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On the Effect of Channel Knowledge in Underwater Acoustic Communications: Estimation, Prediction and Protocol

Abstract

1. Introduction

Motivation and Goals of the Work

2. System Model and Connection Setup

2.1. Interference Statistics Acquisition Stage

2.2. Channel Model, Estimation and Statistics Evaluation

2.3. Channel Statistics Estimation

2.4. Cyclic Prefix Length Adaptive Tuning

3. Established Connection Phase

3.1. Channel Re-Estimation and Prediction

3.2. Information Data Stage and Detection

3.3. Remark—Protocol Summary and Efficiency

4. Numerical Results

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI