Vocalization Source Level Distributions and Pulse Compression Gains of Diverse Baleen Whale Species in the Gulf of Maine

Abstract: The vocalization source level distributions and pulse compression gains are estimated for four distinct baleen whale species in the Gulf of Maine: fin, sei, minke and an unidentified baleen whale species. The vocalizations were received on a large-aperture densely-sampled coherent hydrophone array system useful for monitoring marine mammals over instantaneous wide areas via the passive ocean acoustic waveguide remote sensing technique. For each baleen whale species, between 125 and over 1400 measured vocalizations with high Signal-to-Noise Ratios (SNR > 10 dB) after coherent beamforming and localized with high accuracy (<10% localization error) over ranges spanning roughly 1 km–30 km are included in the analysis. The whale vocalization received pressure levels are corrected for broadband transmission losses modeled using a calibrated parabolic equation-based acoustic propagation model for a random range-dependent ocean waveguide. The whale vocalization source level distributions are characterized by the following means and standard deviations, in units of dB re 1 μPa at 1 m: 181.9 ± 5.2 for fin whale 20-Hz pulses, 173.5 ± 3.2 for sei whale downsweep chirps, 177.7 ± 5.4 for minke whale pulse trains and 169.6 ± 3.5 for the unidentified baleen whale species downsweep calls. The broadband vocalization equivalent pulse-compression gains are found to be 2.5 ± 1.1 for fin whale 20-Hz pulses, 24 ± 10 for the unidentified baleen whale species downsweep calls and 69 ± 23 for sei whale downsweep chirps. These pulse compression gains are found to be roughly proportional to the inter-pulse intervals of the vocalizations, which are 11 ± 5 s for fin whale 20-Hz pulses, 29 ± 18 s for the unidentified baleen whale species downsweep calls and 52 ± 33 s for sei whale downsweep chirps. The source level distributions and pulse compression gains are essential for determining signal-to-noise ratios and hence detection regions for baleen whale vocalizations received passively on underwater acoustic sensing systems, as well as for assessing communication ranges in baleen whales.


Introduction
The vocalization behaviors of diverse marine mammal species [1] were simultaneously monitored over vast areas of the Gulf of Maine using the Passive Ocean Acoustic Waveguide Remote Sensing (POAWRS) technique [1][2][3] from 19 September to 6 October 2006. The marine mammal vocalizations were received on a large-aperture densely-sampled coherent hydrophone array system that provides orders of magnitude higher array gain [4] than a single sensor, enabling whale vocalizations to be detected, localized and classified over an approximately 100,000 km² region instantaneously by POAWRS without aliasing in time and space (see the POAWRS detection region for whale vocalizations from diverse species in Figure 3a of [1]).
Here, we estimate the vocalization source levels of four distinct baleen whale species from simultaneous recordings of their vocalizations in the Gulf of Maine. The four baleen whale species analyzed here are fin whale (Balaenoptera physalus), sei whale (Balaenoptera borealis), minke whale (Balaenoptera acutorostrata) and an Unidentified Baleen Whale Species (UBWS). Each baleen whale species was identified from its characteristic vocalization type: the fin whales were identified from their short duration 20-Hz center frequency calls [5][6][7][8]; the sei whales from their downsweep chirps occurring singly or as doublets with roughly a 4-s separation or sometimes as triplets [9][10][11]; and the minke whales were identified from their pulse trains [12][13][14] comprised of a series of click sequences (see Figure 1 here and also the Extended Data Figures 1-4 of [1]). The unidentified baleen whale species vocalized downsweep signals in the 30-60 Hz frequency range over a 2-3-second duration (see Figure 1J here and the Extended Data Figures 1B and 3A of [1]). These vocalizations have distinct bearing versus time trajectories and localizations that do not follow or coincide well with those of the other baleen whale species present in the area, namely fin, sei, humpback and minke (see the Extended Data Figure 4 of [1]). The unidentified baleen whale species downsweep signals most closely resemble the audible downsweep, burp and grunt calls of blue whales recorded in the Gulf of St. Lawrence [15], a neighboring region to the Gulf of Maine, and they were attributed to blue whales in [1].
For each baleen whale species, between 125 and over 1400 measured vocalizations with high Signal-to-Noise Ratios (SNR > 10 dB) are included in the source level estimation. The source level of each baleen whale vocalization is estimated from the received vocalization pressure level by compensating for the corresponding broadband transmission loss [1,16,17,18] from the whale location to the receiver array center location [19][20][21] in the temporally- and spatially-varying Gulf of Maine environment. The whale locations for each species were previously determined using the moving array triangulation [2,22,23], the bearings-migration minimum mean square error and the array invariant techniques [2,22,23,24] from the measured bearing versus time trajectories of sequences of vocalizations from that species [1]. The marine mammal species-dependent vocalization source level is an important parameter for estimating the marine mammal detection region for a given species in any passive underwater acoustic sensing system [1,2,25]. It is also employed in distance sampling estimates of marine mammal call density and abundance [26][27][28][29][30]. Vocalization source level is also essential for determining marine mammal communication ranges, which are key considerations in assessing the impact of anthropogenic sound on marine mammal behavior [31,32].
Previous estimates of vocalization source level for the baleen whale species considered here include fin whales off the Western Antarctic Peninsula and near Juan de Fuca Ridge of the northeast Pacific Ocean [33,34]; sei whales on the continental shelf off New Jersey [35]; minke whales near the Great Barrier Reef, Hawaii, and the Stellwagen Bank area of the Gulf of Maine [28,36,37]; and blue whales distributed in multiple ocean areas, including both the Pacific and Atlantic Ocean [15,33,38,39,40]. Previous vocalization source level estimates typically focused on a single species based on vocalization sample sizes ranging from a few tens to a few hundred. Transmission losses TL were often modeled previously using the azimuthally-symmetric formula $TL = X \log_{10} R$, with the transmission loss coefficient X varying between the limits of spherical spreading (X = 20) and cylindrical spreading (X = 10) for source-receiver range separation R depending on the environment [15,33,36,37]. In [35], a normal-mode based ocean acoustic propagation model was employed to correct for transmission losses in estimating sei whale vocalization source level in the shallow New Jersey shelf environment.
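As a rough illustration of the geometric spreading formula used in the earlier studies, the following sketch evaluates TL = X log10 R for the two limiting coefficients. The function name and the example ranges are illustrative assumptions and are not taken from the cited work.

```python
import math


def spreading_loss_db(range_m, coeff):
    """Geometric spreading loss TL = X * log10(R), in dB re 1 m.

    coeff is the transmission loss coefficient X: 20 for spherical
    spreading, 10 for cylindrical spreading. range_m must be positive.
    """
    if range_m <= 0:
        raise ValueError("range must be positive")
    return coeff * math.log10(range_m)


# Illustrative ranges only (1 km, 10 km and 30 km).
for r in (1_000.0, 10_000.0, 30_000.0):
    spherical = spreading_loss_db(r, 20.0)
    cylindrical = spreading_loss_db(r, 10.0)
    print(f"R = {r / 1000:5.1f} km: spherical {spherical:5.1f} dB, "
          f"cylindrical {cylindrical:5.1f} dB")
```

The gap between the two limits grows with range, which is one reason a single spreading coefficient becomes a coarse approximation at long ranges in a range-dependent environment.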
Here, we provide vocalization source level estimates for each of the baleen whale species considered using vocalization sample sizes that range from several hundred to a couple of thousand. The transmission losses calculated here are broadband and employ a calibrated [16,41] parabolic equation-based Range-dependent Acoustic propagation Model (RAM) [42] to compute the acoustic field moments in a fluctuating ocean waveguide with complex bathymetry. The model accounts for significant azimuth- and range-dependent variation in transmission losses for the Gulf of Maine environment, where water depths can vary drastically from greater than 200 m in the basins to less than 30 m on the banks. While the azimuthally-symmetric ocean acoustic transmission loss formulation used in previous studies is valid for short ranges and for environments with negligible range dependence, here we find it necessary to employ a range- and depth-dependent acoustic propagation model [42] to handle the effects of significant bathymetric variations and depth-dependent water-column sound speed structure on the propagated marine mammal vocalization intensities received at long ranges.

Figure 1.
[…] frequencies defined as the −10 dB end points in the power spectrum. The received vocalization pressure levels were estimated from the root-mean-square value of the maximum instantaneous beamformed bandpass-filtered pressure-time series. A high gain of up to 18 dB can be achieved after beamforming the data measured on a 64-element sub-aperture of the 160-element hydrophone array, enabling vocalizations from sei whales, minke whales and the unidentified baleen whale species to be detected above the ambient noise. In contrast, the sei whale, minke whale and unidentified baleen whale species vocalizations could not be consistently detected on a single hydrophone. For fin whales, since the acoustic wavelengths of the vocalizations are large, the array aperture is not long enough to provide gains larger than 5 dB. The high-intensity fin whale 20-Hz pulses are detectable even without coherent beamforming.
The pulse compression gains of the broadband vocalizations are estimated for three baleen whale species: fin whale, the unidentified baleen whale species and sei whale. The pulse compression gain is quantified as the ratio of the signal duration to the width of the main lobe after vocalization frequency modulation to the baseband and matched filtering operations. Sonar, radar and ultrasonic systems [43,44] often employ pulse compression to enhance signal-to-noise ratios in signal detection and range resolution in imaging applications. Marine mammal vocalization pulse compression gains are required for determining detection regions in underwater passive single sensor [45][46][47][48] or array sensor systems [1,2,22] that employ matched-filter operations to enhance vocalization detection, as well as vocalization arrival time and bearing estimation for localization applications [1,3,22].

Gulf of Maine 2006 Experiment Acoustic Data Collection
The Gulf of Maine is an important North Atlantic marine mammal foraging ground and contains a number of significant spawning areas for various fish species [49][50][51], including the Atlantic herring (Clupea harengus) [52][53][54]. The Atlantic herring is a keystone prey species, common in the diets of many marine mammals, piscivorous fish and seabirds of the region [52,55]. The spawning activity of Atlantic herring on the northern flank of Georges Bank during the fall season each year has been observed [52,53,56,57,58] and recorded by the U.S. National Marine Fisheries Service (NMFS) for over 30 years, and the NMFS annual survey of the Georges Bank herring stock coincides with this period each year.
The Gulf of Maine 2006 Experiment [1,2,16,17,22,23,41,59,60,61] was conducted from 19 September to 6 October 2006 in conjunction with the U.S. NMFS annual Atlantic herring acoustic survey of the Gulf of Maine and Georges Bank. The Atlantic herring areal population densities were monitored over instantaneous wide areas using active OAWRS imaging [1,2,16,17,60] and calibrated with coincident conventional ultrasonic fisheries echo sounding measurements [16,17,60], with fish species identification and physiological parameters extracted from trawl samples collected over the course of the experiment [56,62]. The overall Georges Bank Atlantic herring stock estimate for autumn 2006 based on the OAWRS survey has been found to match well (with 80%-90% agreement) with independent NMFS stock estimates for 2006 and 2007 [61].
During the experiment, acoustic recordings were acquired using a 160 hydrophone-element horizontal receiver line-array towed behind a research vessel along designated tracks north of Georges Bank [2,16,17,41]. To minimize the effect of tow ship noise on the recorded acoustic data, the coherent hydrophone array was towed approximately 375-405 m behind the research vessel so as to confine this noise to the forward end-fire direction of the array. The tow ship noise in directions away from the forward end-fire was negligible after coherent beamforming. The omnidirectional ambient noise spectral levels in the frequency band of the vocalizations for each baleen whale species considered here are provided in the Supplementary Information Section I of [1]. The acoustic recordings of the coherent hydrophone array system contained marine mammal vocalizations from over eight distinct whale species [1,2] that include fin, humpback, sei, minke, orca, pilot and sperm whales, as well as other unidentified baleen and toothed whale species. Here, we focus our analysis on vocalizations of the fin whale, sei whale, minke whale and an unidentified baleen whale species recorded on the coherent hydrophone array.
Data from all 160 hydrophone elements nested into four sub-apertures are used, where each sub-aperture contains 64 hydrophones for spatially- and temporally-unaliased sensing up to 4 kHz (the sampling rate of POAWRS was 8 kHz). Detailed specifications of the coherent hydrophone array and data acquisition system used here are provided in [1,2,16,22,23,41,63], including array layout and aperture nesting. The low-frequency (LF) aperture, with inter-element spacing of 1.5 m, was used to analyze baleen whale vocalizations with fundamental frequency content below 500 Hz. The instantaneous receiver array center positions are determined from the shipboard Global Positioning System (GPS). The water-column temperature and salinity were measured using Expendable Bathythermographs (XBTs) and Conductivity-Temperature-Depth (CTD) sensors. Other details about the measurement geometry and oceanographic properties of the environment are provided in Section II of [16] and also in [1,2,17,22,23,41,59,64].
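The link between the 1.5-m element spacing and the 500-Hz limit of the LF aperture, and between the 8-kHz sampling rate and the 4-kHz limit, can be checked with a short sketch. It assumes the standard half-wavelength spacing criterion for a uniform line array and a nominal sound speed of 1500 m/s; neither assumption is stated explicitly in the text, and the function names are illustrative.

```python
def spatial_nyquist_hz(spacing_m, sound_speed_mps=1500.0):
    """Highest frequency sampled without spatial aliasing.

    Assumes the half-wavelength criterion d <= lambda / 2 for a uniform
    line array and a nominal sound speed (both are assumptions here).
    """
    return sound_speed_mps / (2.0 * spacing_m)


def temporal_nyquist_hz(sample_rate_hz):
    """Highest frequency sampled without temporal aliasing."""
    return sample_rate_hz / 2.0


lf_limit = spatial_nyquist_hz(1.5)        # LF aperture spacing from the text
time_limit = temporal_nyquist_hz(8000.0)  # POAWRS sampling rate from the text
print(f"LF aperture spatial limit: {lf_limit:.0f} Hz")
print(f"Temporal Nyquist limit:    {time_limit:.0f} Hz")
```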

Baleen Whale Vocalization Detection and Classification
Acoustic pressure-time series measured by sensors across the receiver array were converted to two-dimensional (2D) beam-time series by conventional time-domain beamforming [4] and further converted to spectrograms by short-time Fourier transform (0.26-s length, 75% overlap, Hanning window). The baleen whale vocalizations were automatically extracted from the beamformed spectrograms using a threshold detector (>5.6 dB SNR) and checked by visual inspection [1]. The azimuthal bearing of each extracted vocalization was subsequently determined by selecting the bearing in which the beamformed, bandpass-filtered pressure-time series contained maximum energy during the time duration of the vocalization and in the same frequency band. With our densely-sampled, large-aperture coherent POAWRS receiver array, a high gain of up to $10 \log_{10} n = 18$ dB, where n = 64 is the number of hydrophones in each sub-aperture, can be achieved, enabling the detection of baleen whale vocalizations up to two orders of magnitude more distant in range in the shallow water environment than a single omnidirectional hydrophone, which has no array gain (Figure 1). The actual array gain, which may be smaller than the full 18-dB array gain, is dependent on noise coherence and vocalization wavelength relative to array aperture length.
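A minimal sketch of the two gain limits mentioned above follows. The ideal gain is the 10 log10 n value quoted in the text. The wavelength-limited value uses the textbook long-line-array approximation DI ≈ 10 log10(2L/λ), which is only a rough guide when the aperture L is comparable to the wavelength λ; this approximation, the nominal 1500 m/s sound speed and the function names are assumptions and are not taken from the paper.

```python
import math


def ideal_array_gain_db(n_elements):
    """Ideal gain 10*log10(n) for n elements in spatially white noise."""
    return 10.0 * math.log10(n_elements)


def line_array_di_db(aperture_m, freq_hz, sound_speed_mps=1500.0):
    """Rough directivity index 10*log10(2L/lambda) of a line array.

    Textbook approximation, used here as an assumption; it is floored
    at 0 dB when the aperture is shorter than half a wavelength.
    """
    wavelength_m = sound_speed_mps / freq_hz
    return 10.0 * math.log10(max(2.0 * aperture_m / wavelength_m, 1.0))


n = 64
spacing_m = 1.5                   # LF aperture spacing from the text
aperture_m = (n - 1) * spacing_m  # 94.5 m end to end
print(f"Ideal gain, n = {n}: {ideal_array_gain_db(n):.1f} dB")
print(f"Rough DI at 20 Hz: {line_array_di_db(aperture_m, 20.0):.1f} dB")
```

Under these assumptions the 20-Hz value comes out near 4 dB, in line with the statement that the aperture cannot provide gains larger than about 5 dB for fin whale calls.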
From the beamformed spectrograms, the time-frequency characteristics of each baleen whale vocalization were extracted via pitch tracking [1,10,65,66] and applied for species classification. A pitch track describes the time-variation of the fundamental frequency in the vocalization signal. It consists of a time series of fundamental frequency values, determined using a time-frequency peak detector from the beamformed spectrogram, which is created from short-time Fourier transforms of the audio data (sampling frequency = 8000 Hz, frame = 526 samples, overlap = 1/2, Hann window). A combination of extracted features from pitch tracking, orthogonalized via Principal Component Analysis (PCA) [67], was used to optimize the vocalization species classification employing k-means [68] and Bayesian-based Gaussian mixture model clustering approaches [1]. The number of clusters can be determined via the Bayesian Information Criterion (BIC). The eight features extracted from baleen whale vocalization pitch tracking are provided in the Extended Data Table 2 of [1] for the species examined here. The bearing-time trajectories of each closely-associated series of vocalizations were also taken into account to ensure consistent classification.

Localization of Baleen Whale Vocalizations
The horizontal location of each detected baleen whale vocalization consists of a range and a bearing estimate. The estimated azimuthal bearings of sequences of baleen whale vocalizations form multiple bearing-time trajectories (Figure 2). These bearing-time trajectories are utilized to determine the ranges of the baleen whale vocalizations from the horizontal receiver array center employing the Moving Array Triangulation (MAT) [2,22,23] and the bearings-migration Minimum Mean Square Error (MMSE) methods [22]. Position estimation error, or the root mean squared (rms) distance between the actual and estimated location, is a combination of range and bearing errors quantified for this array in [2,22,23]. Range estimation error, expressed as the percentage of the range from the source location to the horizontal receiver array center, for the MAT and MMSE is roughly 2% at array broadside and gradually increases to 10% at 65° from broadside and 25% near or at end-fire. Bearing estimation error of the time-domain beamformer ranges from 0.1° to 1.4° at array broadside and gradually increases to between 0.7° and 5.3° at end-fire depending on the frequency of the vocalizations [1,16,69] for the given array aperture. These errors are determined at the same experimental site and time period as the marine mammal position estimates presented here, from thousands of controlled source signals transmitted by a source array, and are based on absolute GPS ground truth measurements of the source array's position [22,23], which are accurate to within 3 m-10 m. More than 80% of vocalizations are found to originate from between 0° and 65° of the array broadside direction, where both the MAT and MMSE offered reliable and consistent localization estimates. Vocalizations for which the MAT mean and the MMSE localization estimates differed by more than 10% of the estimated range were removed from all further analysis.
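The consistency check between the two range estimators can be sketched as a simple filter. The text does not say which range the 10% is measured against, so the sketch assumes the average of the MAT and MMSE ranges; the function name and the example values are hypothetical.

```python
def keep_localization(mat_range_km, mmse_range_km, max_rel_diff=0.10):
    """Return True when the MAT and MMSE range estimates agree.

    The reference range is taken as the average of the two estimates,
    which is an assumption made for this sketch.
    """
    reference_km = 0.5 * (mat_range_km + mmse_range_km)
    return abs(mat_range_km - mmse_range_km) <= max_rel_diff * reference_km


# Hypothetical (MAT, MMSE) range pairs in km.
estimates = [(10.0, 10.5), (10.0, 12.0), (25.0, 24.0)]
kept = [pair for pair in estimates if keep_localization(*pair)]
print(kept)
```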

Figure 2.
The bearing-time trajectories of four distinct baleen whale species: fin whale, minke whale, sei whale and unidentified baleen whale species. One thousand four hundred and ten fin whale vocalizations, 431 minke whale pulse trains, 125 sei whale vocalizations and 417 unidentified baleen whale species vocalizations were selected to estimate the species-dependent baleen whale vocalization source levels. These vocalizations are a subset of the larger set of baleen and toothed whale vocalizations measured by the POAWRS receiver array for each species (see the Extended Data Figures 1-4 of [1]).
The bearings are measured with respect to true north.

Broadband Transmission Loss Modeling
The corresponding one-way broadband acoustic transmission loss from the estimated location of each whale vocalization to the center of the POAWRS receiver array was calculated following the approach described in Section I of the Supplementary Information of [1]. A calibrated [16,41] parabolic equation-based Range-dependent Acoustic propagation Model (RAM) [42] was employed to calculate the broadband transmission loss via [16,18,41,70]:

$$\mathrm{TL}(r_0) = 10 \log_{10} \left( \int_{f_L}^{f_U} Q(f) \, \langle |G(r|r_0, f)|^2 \rangle \, df \right) \qquad (1)$$

where $G(r|r_0, f)$ is the waveguide Green function at frequency $f$ for a whale located at $r_0$ and the receiver at $r$, $Q(f)$ is the normalized vocalization spectrum and $f_U$ and $f_L$ are the upper and lower frequencies used for the bandpass filter. The model takes into account the environmental parameters, such as the range-dependent water depth and sound speed profiles, to stochastically compute the propagated acoustic intensities (Figure 3) via Monte Carlo simulations following the approach of [16,18,70]. The mean magnitude-squared waveguide Green function is obtained by averaging over multiple whale depths from the sea surface to the sea floor and over multiple Monte Carlo simulations to account for the unknown whale depth and waveguide fluctuations. The broadband transmission loss standard deviations are calculated in the log-transformed domain using the broadband transmission loss at each potential whale depth from the sea surface to the seafloor.
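The spectral weighting and depth averaging described above can be sketched numerically. The sketch takes the broadband transmission loss as 10 log10 of the spectrally-weighted, depth-averaged magnitude-squared Green function, which is how the quantity is described in the Results section. The Green function values below are made-up placeholders rather than RAM output, and all function names are hypothetical.

```python
import math


def trapezoid(y, x):
    """Trapezoidal-rule integral of samples y over abscissae x."""
    return sum(0.5 * (y[i] + y[i + 1]) * (x[i + 1] - x[i])
               for i in range(len(x) - 1))


def depth_averaged_green_sq(green_sq_by_depth):
    """Average |G|^2 over candidate whale depths, frequency by frequency."""
    n_depths = len(green_sq_by_depth)
    return [sum(column) / n_depths for column in zip(*green_sq_by_depth)]


def broadband_tl_db(freqs_hz, q_spectrum, mean_green_sq):
    """10*log10 of the Q(f)-weighted mean |G|^2 over the call band.

    Q(f) is renormalized to unit integral inside the function, so any
    positive scaling of the spectrum gives the same result.
    """
    norm = trapezoid(q_spectrum, freqs_hz)
    weighted = trapezoid([q * g for q, g in zip(q_spectrum, mean_green_sq)],
                         freqs_hz)
    return 10.0 * math.log10(weighted / norm)


# Placeholder inputs: a flat spectrum over 13-34 Hz and |G|^2 at three depths.
freqs = [13.0 + i for i in range(22)]
q = [1.0] * len(freqs)
green_sq = depth_averaged_green_sq([[1.0e-8] * len(freqs),
                                    [2.0e-8] * len(freqs),
                                    [3.0e-8] * len(freqs)])
print(f"Broadband TL: {broadband_tl_db(freqs, q, green_sq):.1f} dB")
```

With this sign convention the transmission loss is a negative number of decibels, so subtracting it from a received level raises the level back to the source.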

Source Level Estimation
The baleen whale vocalization source level SL is estimated (Figures 4-7) using the passive sonar equation [2,19,71],

$$\mathrm{SL} = \mathrm{RL}(r_0) - \mathrm{TL}(r_0) \qquad (2)$$

where $\mathrm{RL}(r_0)$ is the received whale vocalization pressure level and $\mathrm{TL}(r_0)$ is the one-way broadband transmission loss. The received whale vocalization pressure level was estimated as the root mean squared (rms) value of the maximum instantaneous time-domain signal bandpass-filtered between upper $f_U$ and lower $f_L$ frequencies and beamformed to the azimuthal bearing of the vocalization, over a time window [72] encompassing 90% of the total signal energy (Figure 1). The upper $f_U$ and lower $f_L$ frequencies are determined as the −10 dB end points relative to the signal peak in the power spectrum. Due to their high intensities, the fin whale 20-Hz pulses were also detectable in the unbeamformed bandpass-filtered pressure-time series measured by each omnidirectional element of the hydrophone array (Figure 1C). The received pressure levels of fin whale vocalizations without beamforming ($RL_{unbf}$) are estimated as the maximum value out of the 160 received bandpass-filtered pressure levels on each element of the hydrophone array. The unbeamformed received pressure levels are compared to the received pressure levels with beamforming. For the pulse trains from minke whales, source level results are reported as $SL_{click}$ and $SL_{max}$, which are the source levels of the individual clicks or units within pulse trains and the maxima of all of the clicks or units in a pulse train, respectively. The baleen whale species source level estimates here are based on rms quantities.
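The received level and source level steps can be sketched as follows. The 90% energy window is taken here as the span between the 5% and 95% points of the cumulative signal energy, which is one common convention and an assumption about the windowing of [72]. The transmission loss is treated as a negative quantity (10 log10 of the mean magnitude-squared Green function), so the source level is the received level minus the transmission loss. The synthetic pressure samples and the function names are hypothetical.

```python
import math


def energy_window(samples, fraction=0.90):
    """Return (start, stop) indices of the window holding `fraction`
    of the total energy, trimming equal energy from both ends."""
    energy = [s * s for s in samples]
    total = sum(energy)
    low_target = 0.5 * (1.0 - fraction) * total
    high_target = total - low_target
    cumulative, start, stop = 0.0, None, len(samples)
    for i, e in enumerate(energy):
        cumulative += e
        if start is None and cumulative >= low_target:
            start = i
        if cumulative >= high_target:
            stop = i + 1
            break
    return start, stop


def rms_level_db(samples, ref_pa=1.0e-6):
    """rms pressure level in dB re 1 uPa for samples given in pascals."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(rms / ref_pa)


def source_level_db(rl_db, tl_db):
    """Passive sonar equation with TL expressed as a negative dB value."""
    return rl_db - tl_db


# Synthetic beamformed, bandpass-filtered pulse: silence, tone burst, silence.
pulse = [0.0] * 10 + [math.sin(0.3 * i) for i in range(200)] + [0.0] * 10
start, stop = energy_window(pulse)
rl = rms_level_db(pulse[start:stop])
print(f"window = [{start}, {stop}), RL = {rl:.1f} dB re 1 uPa")
print(f"SL = {source_level_db(rl, -70.0):.1f} dB re 1 uPa at 1 m")
```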

Pulse Compression Gain Estimation
Pulse compression [43,44] is widely employed in radar, sonar and ultrasonic systems to enhance signal-to-noise ratios in signal detection and range resolution in imaging applications. A frequency- or phase-modulated long pulse can be compressed by matched filtering the received signal with a replica of the modulated pulse signal. The Pulse Compression Gain (PCG) [43] $\gamma$ is a measure of the degree to which the pulse can be compressed and is defined as the ratio of the original uncompressed pulse width $\tau$ to the pulse width $\tau_c$ after pulse compression. A typical Linear Frequency Modulated (LFM) pulse with duration $\tau$ can be compressed to a duration $\tau_c = 1/B$ after matched filtering, where $B$ is the modulated pulse spectral bandwidth [43]. The LFM pulse then has a pulse compression gain given by $\gamma = \tau/\tau_c = \tau B$, which is the time-bandwidth product [73,74]. Each baleen whale vocalization can be considered as a nonlinear frequency modulated pulse, and the PCG is estimated by matched filtering the received signal with a normalized replica generated following the approach described in Appendix B of [22]. Vocalization bandwidth $B$ is calculated as the difference between the upper $f_U$ and lower $f_L$ frequencies used for the bandpass filter during source level estimation. The uncompressed pulse width $\tau$ is estimated as the length of the time window encompassing 90% of the total signal energy. After modulating or shifting the broadband vocalizations within $[f_L, f_U]$ to baseband $[0, f_U - f_L]$, the matched filter output (Figure 8) is calculated using Equation (3) of [18]:

$$\Psi(r|r_0, t) = \int_0^{f_U - f_L} \Phi(r|r_0, f) \, H(f|t_M) \, e^{-j 2 \pi f t} \, df \qquad (3)$$

where $\Psi(r|r_0, t)$ is the received vocalization pressure at time $t$ at receiver location $r$ from a whale at $r_0$ with vocalization complex spectral amplitude $\Phi(r|r_0, f)$ at frequency $f$. The normalized matched filter is given by $h(t|t_M) = K q(t_M - t)$, and its Fourier transform is

$$H(f|t_M) = K Q^*(f) \, e^{j 2 \pi f t_M} \qquad (4)$$

where $t_M$ is the delay time of the matched filter, $q(t)$ is the received vocalization signal, $Q(f)$ is the Fourier transform of that signal and $K = \left( \int |Q(f)|^2 \, df \right)^{-1/2}$ is the normalizing factor. The compressed baleen whale vocalization pulse width $\tau_{c,-6\mathrm{dB}}$ is estimated from the matched filter output as the time duration between the −6 dB points on both sides of the peak in 10 log10 of the matched filter output (Equation (3)). The compressed baleen whale vocalization equivalent pulse width $\tau_{c,eq}$ is also reported. The mean PCGs, $\gamma_{-6\mathrm{dB}}$ and $\gamma_{eq}$, estimated from the vocalizations of each baleen whale species are compared with the time-bandwidth product $\tau B$, which is the pulse compression gain for LFM pulses. In the vocalization pulse compression gain estimation, the time-domain signal was always bandpass filtered and further beamformed to the azimuthal bearing of the vocalization in the same way as in the source level estimation.

Figure 8.
[…] (E-H) unidentified baleen whale species and (I-L) fin whale. Sub-plots (A,E,I) show the beamformed pressure-time series for each species. The corresponding spectrograms (0.26-s length, 75% overlap) are shown in (B,F,J), respectively. Sub-plots (C,G,K) show the normalized spectrum for each vocalization over a time window encompassing 90% of the total energy. After matched filtering each vocalization signal with a corresponding replica generated following the approach described in Appendix B of [22], the compressed pulse signals are plotted in (D,H,L), respectively.
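The pulse compression gain estimate can be illustrated on a synthetic signal. The sketch below compresses a baseband LFM chirp with its own normalized replica and measures the width of the main lobe at the −6 dB points. An LFM chirp stands in for a whale call here, the −6 dB threshold is applied to 20 log10 of the output magnitude, and the sampling rate, duration and bandwidth are arbitrary; all of these choices and the function names are assumptions made for illustration.

```python
import cmath
import math


def lfm_baseband(duration_s, bandwidth_hz, fs_hz):
    """Complex baseband LFM chirp sweeping from 0 Hz to bandwidth_hz."""
    n = int(round(duration_s * fs_hz))
    rate = bandwidth_hz / duration_s  # sweep rate in Hz per second
    return [cmath.exp(1j * math.pi * rate * (i / fs_hz) ** 2)
            for i in range(n)]


def matched_filter_magnitude(signal):
    """Magnitude of the correlation of the signal with its normalized replica."""
    n = len(signal)
    norm = math.sqrt(sum(abs(s) ** 2 for s in signal))
    replica = [s.conjugate() / norm for s in signal]
    output = []
    for lag in range(-(n - 1), n):
        acc = 0j
        for i in range(max(0, -lag), min(n, n - lag)):
            acc += signal[i + lag] * replica[i]
        output.append(abs(acc))
    return output


def width_at_level(mag, fs_hz, level_db=-6.0):
    """Main-lobe width between the level_db crossings around the peak."""
    peak = max(range(len(mag)), key=mag.__getitem__)
    thresh = mag[peak] * 10.0 ** (level_db / 20.0)

    def crossing(step):
        i = peak
        while 0 <= i + step < len(mag) and mag[i + step] >= thresh:
            i += step
        j = i + step
        if not 0 <= j < len(mag):
            return float(i)
        # Linear interpolation between the last sample above the threshold
        # and the first sample below it.
        return i + step * (mag[i] - thresh) / (mag[i] - mag[j])

    return (crossing(+1) - crossing(-1)) / fs_hz


tau, bandwidth, fs = 1.0, 50.0, 400.0
mf = matched_filter_magnitude(lfm_baseband(tau, bandwidth, fs))
tau_c = width_at_level(mf, fs)
pcg = tau / tau_c
print(f"tau_c = {tau_c * 1000:.1f} ms, PCG = {pcg:.1f}, tau*B = {tau * bandwidth:.0f}")
```

For an LFM pulse the −6 dB width is somewhat wider than 1/B, so the gain measured this way falls slightly below the time-bandwidth product; nonlinear frequency modulation, as in the whale calls, can shift it further.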

Results
The baleen whale species-dependent vocalization source levels are estimated using 1410 fin whale vocalizations, 125 sei whale vocalizations, 431 minke whale pulse trains and 417 unidentified baleen whale species vocalizations. These vocalizations were selected based on several criteria: (1) they had high Signal-to-Noise Ratios (SNR > 10 dB); (2) they could be reliably localized with high accuracy (the MAT mean and the MMSE localization estimates differed by less than 10% of the estimated range); and (3) their bearing-time trajectories and spectra did not overlap with those of other significant sound sources. The bearing-time trajectories of all selected and classified vocalizations are shown in Figure 2. These vocalizations are a subset of the full set of baleen and toothed whale vocalizations simultaneously measured, detected and classified using the POAWRS approach for each species (see the Extended Data Figures 1-4 of [1] showing the distinct vocalization frequency range and bearing versus time trajectories for a wide variety of marine mammal species detected).

Fin Whales
The fin whales are identified from their characteristic 20-Hz center frequency high intensity calls [5][6][7][8] that have been associated with communication among fin whale individuals [75] and also found to be uttered by males as breeding displays in their mating grounds [5,76] (Figure 1A). Instantaneous azimuthal bearing estimates of the selected 1410 fin whale vocalizations are associated into 20 distinct bearing-time trajectories (Figure 2). The selected fin whale vocalizations are localized to areas on northern Georges Bank and west of Georges Basin, with ranges spanning between 1.9 and 25.8 km (Figure 4A) from the receiver array center.
The corresponding one-way broadband transmission losses are calculated and plotted in Figure 4B as a function of the distance of the MMSE estimated center location of a sequence of fin whale vocalizations (from each bearing-time trajectory) to the center of the POAWRS receiver array. The transmission loss variation during the time duration of a single bearing-time trajectory is ignored. The transmission loss is calculated as 10 log10 of the mean spectrally-weighted magnitude-squared waveguide Green function, which is obtained by averaging over multiple whale depths and over five Monte Carlo simulations per whale depth. The transmission loss standard deviations, minimum and maximum values are calculated assuming the fin whales are located at each potential depth from the sea surface to near the seafloor.
The received fin whale vocalization pressure levels estimated from the beamformed bandpass-filtered time-domain signals are plotted in Figure 4D as a function of the distance of the estimated instantaneous MAT location of each whale vocalization to the receiver array center. The received pressure level standard deviations are calculated for each bearing-time trajectory. This standard deviation is a combination of fluctuations from varying whale depth and range, propagation scintillation in a shallow water waveguide, as well as the source level variation of the vocalizations. The received pressure level standard deviations are expected to be larger than the transmission loss standard deviations. The received vocalization pressure levels are next corrected for the corresponding one-way broadband transmission losses, leading to the fin whale vocalization source level distribution shown in Figure 4C. The average source level estimated from this distribution is 181.9 ± 5.2 dB re 1 µPa at 1 m over the 13-34-Hz vocalization frequency band of the fin whale.
The received fin whale vocalization pressure levels $RL_{unbf}$ estimated from the bandpass-filtered time-domain signals without beamforming are shown in Figure 4F. The corresponding fin whale source level $SL_{unbf}$ distribution derived from these measurements is characterized by a mean and standard deviation of 187.9 ± 5.6 dB re 1 µPa at 1 m (Figure 4E). The fin whale source level mean derived from the beamformed data is approximately 6 dB smaller than that derived from the unbeamformed data. This is because the time-domain signal after beamforming represents a spatially-averaged signal across the hydrophone elements of the array. In contrast, the source level derived from the unbeamformed data is based on the maximum vocalization pressure level received on the 160 hydrophone elements of the receiver array. Note that the received fin whale vocalization pressure levels shown in Figure 4D,F originate from locations that span a wide range of azimuths about the receiver array, whose locations also vary (Figure 4A). As a result, the received fin whale vocalization pressure levels in Figure 4D,F undergo different transmission loss versus range trends (Figure 3B), leading to the non-monotonic decay with range seen in Figure 4D,F.

Sei Whales
The sei whales were identified from their downsweep calls [9][10][11], hypothesized to be long-range contact calls potentially enabling coordinated activities, such as feeding [1,10] or breeding [10]. They usually occur singly or as doublets with approximately a 4-s separation (Figure 1D) and sometimes as triplets. Instantaneous bearing estimates of 125 sei whale downsweep calls are associated with four bearing-time trajectories (Figure 2). The sei whale vocalization spatial locations shown in Figure 5A vary between 3.4 and approximately 16 km from the receiver array center. The four vocalization sequences are charted to areas within Franklin Basin. The corresponding one-way broadband transmission losses are plotted in Figure 5B.
The vast majority of received sei whale vocalizations are not intense enough to be detected above the ambient noise floor at each hydrophone without beamforming (see Figure 1D-F). The received sei whale vocalization pressure levels estimated from bandpass-filtered beamformed time-domain signals are shown in Figure 5D. The average source level is estimated to be 173.5 ± 3.2 dB re 1 µPa at 1 m over the 28-92-Hz frequency band of the sei whale vocalizations after correcting for corresponding one-way broadband transmission losses (Figure 5C).

Minke Whales
The minke whales were identified from their pulse trains (Figure 1G) comprised of a series of click sequences [12][13][14].Four hundred thirty one pulse trains, consisting of 5390 clicks, are associated with 16 bearing-time trajectories (Figure 2).The vocalization pulse train sequences from all 16 bearing-time trajectories are spatially charted to a focused area on north-central Georges Bank (Figure 6A), with ranges varying between 19.3 and 28.4 km from the receiver array center.The corresponding one-way broadband transmission losses are plotted in Figure 6B.
The pressure levels of individual clicks in the minke whale pulse train, estimated from bandpass-filtered beamformed time-domain signals, are shown in Figure 6D. The received pressure levels of most individual minke whale clicks were not high enough to be detectable from the time-domain signal without coherent beamforming. The minke whale source level $SL_{click}$ distribution derived from individual minke whale clicks has a mean of 172.2 ± 5.3 dB re 1 µPa at 1 m over the 66-463-Hz frequency band of the vocalizations (Figure 6C).
The maximum source level of minke whale pulse trains $SL_{max}$, derived from the maxima of the beamformed clicks in each pulse train, is also calculated (Figure 6F). This source level distribution $SL_{max}$ has a mean of 177.7 ± 5.4 dB re 1 µPa at 1 m (Figure 6E). Different calling patterns and pulse train types [14] were not separated during the calculations. Thus, the source level distributions shown here are an average over all minke whale vocalization pulse train types detected in the region.
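The reduction from individual click levels to one maximum level per pulse train can be sketched as a simple grouping operation. The function name `max_level_per_train`, the pulse train identifiers and the click levels below are hypothetical and serve only to illustrate the bookkeeping; they are not the processing code or data of the study.

```python
def max_level_per_train(train_ids, click_levels_db):
    """Return the largest click level (dB) found in each pulse train."""
    best = {}
    for train_id, level in zip(train_ids, click_levels_db):
        # Keep the running maximum for this pulse train.
        if train_id not in best or level > best[train_id]:
            best[train_id] = level
    return best

# Hypothetical click source levels (dB re 1 uPa at 1 m) from two pulse trains.
train_ids = [0, 0, 0, 1, 1]
click_levels = [170.1, 175.3, 172.0, 168.4, 171.9]

sl_max = max_level_per_train(train_ids, click_levels)
print(sl_max)
```

Because each pulse train contributes only its largest click, the resulting distribution is expected to have a higher mean than the distribution over all clicks, in line with the 177.7 dB and 172.2 dB means reported above.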

An Unidentified Baleen Whale Species
Instantaneous bearing estimates of 417 downsweep calls of the unidentified baleen whale species are associated with eight bearing-time trajectories (Figure 2) with estimated locations shown in Figure 7A. The unidentified baleen whale species vocalization spatial locations partially overlap with those of the fin whale and have ranges varying between 0.9 and 16.7 km from the POAWRS receiver array. The corresponding one-way broadband transmission losses are plotted in Figure 7B.
The received unidentified baleen whale species downsweep calls are not intense enough to be consistently detected above the ambient noise floor at each hydrophone without beamforming (see Figure 1J-L). Therefore, the received unidentified baleen whale species vocalization pressure levels and standard deviations (Figure 7D) are all estimated from the beamformed bandpass-filtered time-domain signal. The average source level is estimated to be 169.6 ± 3.5 dB re 1 µPa at 1 m over the 25-70-Hz frequency band of the unidentified baleen whale species downsweep vocalizations after correcting for corresponding one-way broadband transmission losses.
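The need for beamforming noted above can be related to the array gain quoted in the Figure 1 caption (up to 18 dB on a 64-element sub-aperture). The sketch below evaluates the textbook ideal gain 10 log10(N) of an N-element coherent array in spatially uncorrelated noise. This idealization and the helper name `ideal_array_gain_db` are assumptions made for illustration and are not the array gain calculation used in the study.

```python
import math

def ideal_array_gain_db(n_elements):
    """Ideal coherent array gain in dB, assuming spatially uncorrelated noise."""
    if n_elements < 1:
        raise ValueError("n_elements must be a positive integer")
    return 10.0 * math.log10(n_elements)

# 64-element sub-aperture mentioned in the Figure 1 caption.
gain_64 = ideal_array_gain_db(64)
print(f"Ideal gain for 64 elements: {gain_64:.1f} dB")
```

For N = 64 the ideal value is about 18 dB, consistent with the upper limit quoted in the caption. The smaller gain reported for fin whale 20-Hz pulses reflects the long acoustic wavelength relative to the array aperture, which this simple formula does not capture.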

Pulse Compression Gains of Vocalizations from Baleen Whale Species
The mean pulse compression gains $\gamma_{-6\,\mathrm{dB}}$ and $\gamma_{eq}$ of the vocalizations from the three baleen whale species, namely sei whale downsweep chirps, unidentified baleen whale species downsweep signals and fin whale 20-Hz pulses, are calculated (see Table 1). For comparison, the pulse compression gains of the 50-Hz bandwidth Tukey windowed [77] 1-s duration LFM signals centered at various frequencies from 300-2000 Hz commonly used in OAWRS (Ocean Acoustic Waveguide Remote Sensing) imaging [16,17,41,60,69,78] are also tabulated. For the Tukey windowed LFM signal, the equivalent pulse compression gain is equal to the time-bandwidth product. The $\gamma_{-6\,\mathrm{dB}}$ is smaller than the time-bandwidth product for the LFM signal and is a measure of the effective bandwidth due to bandwidth reduction from Tukey windowing. For the baleen whale species vocalizations, the $\gamma_{-6\,\mathrm{dB}}$ values are larger than the corresponding equivalent pulse compression gains by roughly a factor of two and closer to the time-bandwidth products of these vocalization signals. The duration of each click in the minke whale pulse train is approximately 20 ms, which is roughly equivalent to the compressed pulse width of the sei whale downsweep chirps, so pulse compression analysis is not done here for minke whale click vocalizations. Among the three baleen whale species investigated, the sei whale downsweep chirps have the largest pulse compression gain (Figure 8A-D), which is roughly 2.5 times larger than that of the unidentified baleen whale species downsweep calls and roughly 30 times larger than that of the fin whale 20-Hz pulses. This implies that the detection of sei whale chirp vocalizations can be significantly enhanced over ambient noise by employing pulse compression in passive marine mammal sensing systems. The pulse compression SNR enhancement is expected to be moderate for the unidentified baleen whale species downsweep vocalizations and insignificant for fin whale 20-Hz vocalizations.
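To give a sense of scale for these gains, the following sketch converts the mean equivalent pulse compression gains reported in this study to decibels and evaluates the time-bandwidth product of the 50-Hz, 1-s LFM signal. It assumes that the SNR enhancement from matched filtering in white noise is 10 log10 of the gain, which is a common idealization and is not a result stated in the text. The names `gain_db` and `eq_gain` are hypothetical.

```python
import math

def gain_db(linear_gain):
    """Convert a linear pulse compression gain to dB (assumed SNR enhancement)."""
    return 10.0 * math.log10(linear_gain)

# Mean equivalent pulse compression gains reported in this study.
eq_gain = {"fin": 2.5, "UBWS": 24.0, "sei": 69.0}

# Time-bandwidth product of the reference LFM signal: 50 Hz x 1 s.
lfm_time_bandwidth = 50.0 * 1.0

gains_db = {name: gain_db(g) for name, g in eq_gain.items()}
lfm_db = gain_db(lfm_time_bandwidth)

for name, value in gains_db.items():
    print(f"{name}: {value:.1f} dB")
print(f"LFM reference: {lfm_db:.1f} dB")
```

Under this assumption the equivalent gains correspond to roughly 4 dB, 14 dB and 18 dB for the fin whale, unidentified baleen whale species and sei whale signals, respectively, compared with about 17 dB for the LFM reference.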
The inter-pulse intervals of the fin whale, unidentified baleen whale species and sei whale vocalizations are roughly proportional to their pulse compression gains. The mean inter-pulse intervals of repetitive fin whale 20-Hz pulses, unidentified baleen whale species downsweep signals and sei whale downsweep chirp vocalizations are roughly 11, 29 and 52 s, respectively (Figure 9).

Discussion
The fin whale 20-Hz centered vocalization source level estimates obtained here in the Gulf of Maine compare well with previous estimates from other ocean areas, including the western Antarctic Peninsula [33], Northeast [34] and Central [79] Pacific Ocean. In general, the ranges of fin whale vocalization source level estimates from previous studies either overlap well with [5,80] or lie fully within [33,34,79] the range of fin whale vocalization source level estimates obtained here and shown in Figure 4E.
The mean value of the sei whale downsweep chirp vocalization source level distribution obtained here, 173.5 ± 3.2 dB re 1 µPa at 1 m, is smaller than a previous estimate for sei whale downsweep vocalizations measured on the nearby continental shelf off New Jersey (179 ± 4 dB re 1 µPa at 1 m [35]).
The previously-reported [37] peak-to-peak source levels for Type c2 (181.6 ± 6.6 dB re 1 µPa at 1 m) and Type sd3 (176.7 ± 4.2 dB re 1 µPa at 1 m) averaged over minke whale click vocalizations have equivalent rms values of 178.6 ± 6.6 dB re 1 µPa at 1 m and 173.7 ± 4.2 dB re 1 µPa at 1 m, respectively, for comparison to the study here. These rms values from the previous study [37] for minke whales in the Gulf of Maine lie well within the span of minke whale click vocalization rms source levels found here, which range from roughly 160-190 dB re 1 µPa at 1 m.
Our stochastic broadband transmission loss model calculations have been extensively calibrated and verified with thousands of one-way and two-way transmission loss measurements made during the same Gulf of Maine 2006 experiment at the same time and location [16,17,41]. They are also verified by roughly one hundred two-way transmission loss measurements made from calibrated targets with known scattering properties [59] during the same experiment at the same time and location, indicating that our transmission loss measurements did not create a bias. Therefore, the observed difference between this study and previous ones is not likely caused by biased transmission loss measurements.
The POAWRS coherent hydrophone array's instantaneous marine mammal detection region extends over 100,000 km², which is roughly 100 times larger than that of a single omnidirectional hydrophone. The vocalizing marine mammal population from each species instantaneously detected by POAWRS is therefore expected to be larger than that detected by a single omnidirectional hydrophone. Based on historical visual surveys [81,82], the fall season areal population density of marine mammals in units of abundance per 1000 km² in their densest areas on or near northern Georges Bank (see Figures 1 and 2 of [1] for the locations of these dense areas during the experiment) is expected to range from roughly 10-22 for fin whales, 4-16 for minke whales and 3-24 for sei whales (refer to Supplementary Information Section IV of [1] for details). The vocalization source level estimates obtained here represent an average over multiple vocalizing marine mammal individuals for each species within the roughly 100,000-km² POAWRS detection region.
Baleen whale vocalizations are generally regarded as communication or contact signals for purposes such as coordinated feeding, migration and mating. Baleen whales are not known to produce sounds for echolocation or navigation, which is a capability of toothed whales. Some studies have suggested a potential for echolocation [25,83] or navigation [25,84] in select baleen whale species, but this potential is highly dependent on vocalization type [83] and on environmental and prey conditions [25], and these studies do not consider pulse compression gains since this ability is not known to be present in baleen whales. Here, the pulse compression gains are quantified for passive acoustic marine mammal sensing systems that use pulse compression to enhance whale vocalization signal detection or bearing-time estimation for whale localization.

Conclusions
The vocalization source level distributions and pulse compression gains have been estimated for fin whale, sei whale, minke whale and an unidentified baleen whale species in the Gulf of Maine. The vocalization source level distributions are based on measurements made using a large-aperture densely-sampled coherent hydrophone array system that provides high SNR in signal detection, large sample sizes, as well as robust array-based methods for whale localization using vocalization bearing-time measurements over areas spanning 100,000 km². An azimuth and range-dependent ocean acoustic propagation model calibrated for the Gulf of Maine environment was employed to correct the received vocalization pressure levels from the various whale species for transmission losses. The whale species vocalization source level distributions are found to be characterized by the following rms means and standard deviations, in units of dB re 1 µPa at 1 m: 181.9 ± 5.2 for fin whale 20-Hz pulses, 173.5 ± 3.2 for sei whale downsweep chirps, 177.7 ± 5.4 for minke whale pulse trains and 169.6 ± 3.5 for the unidentified baleen whale species downsweep calls. The baleen whale species vocalization equivalent pulse compression gains have been estimated and found to be roughly 2.5 ± 1.1 for fin whale 20-Hz pulses, 24 ± 10 for the unidentified baleen whale species downsweep signals and 69 ± 23 for sei whale downsweep chirps. The pulse compression gains, source levels and inter-pulse intervals estimated here can be used as inputs for modeling the signal-to-noise ratios and hence detection regions of vocalizations from baleen whale species received passively on underwater acoustic sensing systems [1,2], as well as for assessing the communication ranges of baleen whales.

Figure 1. Example vocalizations from (A-C) fin whale, (D-F) sei whale, (G-I) minke whale and (J-L) the unidentified baleen whale species. Sub-plots (A,D,G,J) show the beamformed spectrogram of the vocalizations from each species. Corresponding beamformed pressure-time series used for plotting the spectrograms are shown in (B,E,H,K), respectively. (C,F,I,L) show the pressure-time series from a single omnidirectional hydrophone. All signals were bandpass filtered between upper $f_U$ and lower $f_L$ frequencies defined as the −10 dB end points in the power spectrum. The received vocalization pressure levels were estimated from the root-mean-square value of the maximum instantaneous beamformed bandpass filtered pressure-time series. A high gain of up to 18 dB can be achieved after beamforming the data measured on a 64-element sub-aperture of the 160-element hydrophone array, enabling vocalizations from sei whales, minke whales and the unidentified baleen whale species to be detected above the ambient noise. In contrast, the sei whale, minke whale and unidentified baleen whale species vocalizations could not be consistently detected on a single hydrophone. For fin whales, since the acoustic wavelengths of the vocalizations are large, the array aperture is not long enough to provide gains larger than 5 dB. The high intensity fin whale 20-Hz pulses are detectable even without coherent beamforming.

Figure 3. Example of broadband transmission losses calculated by a calibrated [16,41] parabolic equation-based range-dependent acoustic propagation model [42] along three propagation paths with the following directions: −13° (roughly northwards crossing Georges Basin), 106° (roughly eastwards across Georges Bank) and 194° (roughly southwards across Georges Bank), as shown in (A). The transmission losses for two distinct broadband signals centered at 20 Hz and 120 Hz are plotted in (B,C), respectively, as a function of propagation range. The modeled broadband waveguide Green functions averaged over 15 Monte Carlo simulations for each signal along the three propagation directions are shown in (D-F,G-I), respectively, for the two broadband signals. The Green functions are used to calculate the transmission losses shown in (B,C) by averaging over multiple depths from the sea surface to near the sea floor to account for unknown whale depth.

Figure 4. (A) The MMSE estimated center locations of sequences of fin whale vocalizations from 20 distinct bearing-time trajectories containing a total of 1410 vocalizations. The localization range and bearing errors are shown by the ellipse; (B) Corresponding one-way broadband transmission losses from the MMSE estimated fin whale vocalization sequence center locations to the POAWRS receiver array center. The transmission loss standard deviations (solid bar), minimum and maximum values (dotted bar) are calculated assuming the whales are located at each potential depth from the sea surface to near the seafloor; (C) Distribution of fin whale vocalization source level derived from bandpass-filtered beamformed signals has a mean of 181.9 ± 5.2 dB re 1 µPa at 1 m; (D) The received fin whale vocalization pressure levels estimated from beamformed data are plotted as a function of distance from estimated instantaneous MAT locations of each vocalization to the receiver array center. These data are used to derive the source level distribution shown in (C); (E) Distribution of fin whale vocalization source level derived from bandpass-filtered unbeamformed signals has a mean of 187.9 ± 5.6 dB re 1 µPa at 1 m; (F) The maximum value of the fin whale vocalization pressure levels received on 160 hydrophone elements of the POAWRS receiver array used to derive the source level distribution shown in (E). The blue bars in (D,F) represent one standard deviation in the pressure levels for each bearing-time trajectory.

Figure 6. (A) The MMSE estimated center locations of sequences of minke whale pulse train vocalizations from 16 distinct bearing-time trajectories consisting of a total of 431 pulse trains and 5390 clicks; (B) Corresponding one-way broadband transmission losses from the MMSE estimated minke whale vocalization sequence center locations to the POAWRS receiver array center. The transmission loss standard deviations (solid bar), minimum and maximum values (dotted bar) are calculated assuming the whales are located at each potential depth from the sea surface to near the seafloor; (C) The distribution of the individual minke whale click vocalization source level estimated from bandpass-filtered beamformed signals has a mean of 172.2 ± 5.3 dB re 1 µPa at 1 m; (D) The corresponding received click pressure levels are plotted as a function of distances from instantaneous MAT location estimates to the receiver array center. The blue bars represent one standard deviation in the pressure levels estimation; (E) The distribution of the maximum minke whale vocalization source level obtained from the maxima of the click source level in each pulse train has a mean of 177.7 ± 5.4 dB re 1 µPa at 1 m; (F) The maximum beamformed received pressure levels of each minke whale pulse train used to derive the distribution in (E). The blue bars in (D,F) represent one standard deviation in the pressure levels for each bearing-time trajectory.

Figure 7. (A) The MMSE estimated center locations of sequences of the unidentified baleen whale species downsweep vocalizations from 8 distinct bearing-time trajectories containing a total of 417 vocalizations; (B) Corresponding one-way broadband transmission losses from the MMSE estimated unidentified baleen whale species vocalization sequence center locations to the POAWRS receiver array center. The transmission loss standard deviations (solid bar), minimum and maximum values (dotted bar) are calculated assuming the whales are located at each potential depth from the sea surface to near the seafloor; (C) The distribution of the unidentified baleen whale species vocalization source level derived from bandpass-filtered beamformed signals has a mean of 169.6 ± 3.5 dB re 1 µPa at 1 m; (D) The received unidentified baleen whale species vocalization pressure levels estimated from beamformed data plotted as a function of the distance from estimated instantaneous MAT locations of each vocalization to the receiver array center. These data are used to derive the source level distribution shown in (C). The blue bars in (D) represent one standard deviation in the pressure levels for each bearing-time trajectory.

Figure 8. Example vocalizations and corresponding matched filter outputs for (A-D) sei whale, (E-H) unidentified baleen whale species and (I-L) fin whale. Sub-plots (A,E,I) show the beamformed pressure-time series for each species. The corresponding spectrograms (0.26-s length, 75% overlap) are shown in (B,F,J), respectively. Sub-plots (C,G,K) show the normalized spectrum for each vocalization over a time-window encompassing 90% of the total energy. After matched filtering each vocalization signal with a corresponding replica generated following the approach described in Appendix B of [22], the compressed pulse signals are plotted in (D,H,L), respectively.

Figure 9. Distribution of Inter-Pulse Intervals (IPIs) for (A) fin whale 20-Hz pulses; (B) unidentified baleen whale species downsweep calls; and (C) sei whale downsweep chirps. For sei whales, the IPIs of their vocalizations in a doublet or triplet are not included. All IPIs larger than two minutes are excluded from the analysis; (D) The pulse compression gains, $\gamma_{-6\,\mathrm{dB}}$ and $\gamma_{eq}$, are plotted as a function of the IPIs. The IPI distributions are characterized by the following means and standard deviations, in units of seconds: 11 ± 5 for fin whale, 29 ± 18 for unidentified baleen whale species and 52 ± 33 for sei whale.

Table 1. The potential pulse compression gains, $\gamma_{-6\,\mathrm{dB}}$ and $\gamma_{eq}$, of the baleen whale species vocalization signals with corresponding uncompressed pulse width $\tau$, spectral bandwidth $B$ and time-bandwidth product $\tau \cdot B$. These parameters are also tabulated for a Tukey-windowed LFM pulse for comparison. The unidentified baleen whale species is indicated as UBWS.