Seasonal Variation of Captive Meagre Acoustic Signalling: A Manual and Automatic Recognition Approach.

Abstract: Many species rely on acoustic communication to fulfil several functions such as advertisement and mediation of social interactions (e.g., agonistic, mating). Therefore, fish calls can be an important source of information, e.g., to recognize reproductive periods or to assess fish welfare, and should be considered a potential non-intrusive tool in aquaculture management. Assessing fish acoustic activity, however, often requires long sound recordings. To analyse these long recordings, automatic methods are invaluable tools for detecting and extracting the relevant biological information. Here we present a study to characterize meagre (Argyrosomus regius) acoustic activity during social contexts in captivity using an automatic pattern-recognition methodology based on the Hidden Markov Model. Calls produced by meagre during the breeding season showed a richer repertoire than previously reported. Besides the dense choruses composed of grunts already known for this species, meagre emitted successive series of isolated pulses, audible as 'knocks'. Grunts with a variable number of pulses were also registered. Overall acoustic activity increased concurrently with the number of spawning events. Diel call rhythms exhibited a peak of calling activity from 15:00 to midnight. In addition, grunt acoustic parameters varied significantly throughout the reproductive season. These results open the possibility of using meagre vocal activity to predict breeding and approaching spawning periods in aquaculture management.


Introduction
Many species of fish are soniferous, producing sound in different activities, such as feeding, territorial behaviour and during reproductive activities [1,2]. In some species, the vocalizations associated with the latter are often produced in choruses [3]. These sounds can provide information about the caller to the recipient as they are associated with differences between species, between individual callers of the same species and between call types emitted by the same individual under different behavioural contexts [4]. The species-specificity of fish calls and intra-specific variation can also be important for researchers as it may enable the recognition of the vocal species and, for example, whether the animal is breeding or not.
While traditional survey methods are expensive, time consuming and dependent on weather conditions and human effort, passive acoustic monitoring (PAM) coupled with automatic recognition methods is emerging as a cost-effective, non-intrusive method to monitor vocal animals in the wild as well as habitat health and biodiversity at temporal and geographical scales [5][6][7][8][9][10]. Consequently, PAM can be an efficient method to detect vocal fish and identify the onset, duration, and periodicity of reproductive activities and possibly changes in fish abundance [11,12]. This information is crucial for the management of exploited species (e.g., seasonal fishery closures designed to protect spawning fish) and conservation of essential fish habitat. However, to develop automatic recognition methods the acoustic repertoire of the target species needs to be characterised [8,13][14][15][16][17].
Sciaenids, known as croakers and drums, are soniferous fishes that produce breeding sounds (drums or grunts) through the contraction of a pair of sonic muscles typically present in males, and occasionally in both sexes, that cause the swimbladder to vibrate [18]. Sound production during the reproductive season has been reported in a number of sciaenid species, such as the weakfish, Cynoscion regalis [19], meagre, Argyrosomus regius [20], Japanese croaker, Argyrosomus japonicus [13] and red drum, Sciaenops ocellatus [21]. The profusion of advertisement calls produced by the representatives of this family during the reproductive season suggests that sound production might play an important role in their reproduction, and some studies have shown such association [22][23][24][25].
The meagre (Argyrosomus regius, Asso 1801) is one of the world's largest marine teleosts and has a high commercial value for recreational and small-scale commercial fisheries and aquaculture [26]. Advertisement calls produced during spawning aggregations have been characterised for A. regius by Lagardère and Mariani [20]. According to these authors, meagre calls were produced in dense choruses in which two sound types could be identified: long and short grunts. However, there is still a lack of information on the daily and seasonal variation of sound production by adult meagre. In addition, it is still not known how sound features vary throughout the spawning season.
Several methods have been reported for studying extensive bioacoustic recordings. The most common are supervised detection methods that can use, for example, energy thresholds or a matched filter to uncover the sounds. In recent decades, interest in automatic speech recognition has driven the development of sound pattern recognition methods that have become increasingly fast, accurate and robust. Unsupervised methods using machine learning, such as Gaussian mixture models (GMMs) [27], artificial neural networks (ANN) [28,29] and hidden Markov models (HMMs) [30][31][32][33][34], have been reported to successfully recognize and classify human and other animals' vocalizations. In particular, HMMs can be used to statistically model both temporal and spectral variations of acoustic patterns through robust algorithms that extract the relevant information and ultimately classify the acoustic patterns. Table 1 lists examples of automatic recognition systems for calls of different marine animals.
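As a minimal illustration of the energy-threshold idea mentioned above (it is not the method used in this study), the sketch below finds runs of samples in an amplitude envelope that exceed a threshold. The function name, the `min_len` parameter and the toy envelope values are hypothetical and serve only to show the principle.

```python
from typing import List, Tuple


def detect_events(envelope: List[float], threshold: float,
                  min_len: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) index pairs of runs where envelope > threshold.

    `end` is exclusive. Runs shorter than `min_len` samples are discarded.
    """
    events = []
    start = None
    for i, value in enumerate(envelope):
        if value > threshold and start is None:
            start = i                      # a run above threshold begins
        elif value <= threshold and start is not None:
            if i - start >= min_len:
                events.append((start, i))  # the run ends before sample i
            start = None
    # Close a run that is still open at the end of the signal.
    if start is not None and len(envelope) - start >= min_len:
        events.append((start, len(envelope)))
    return events


# Toy envelope in arbitrary units (hypothetical values):
# two bursts standing above a noise floor.
toy_envelope = [0.1, 0.1, 0.9, 1.2, 0.8, 0.1, 0.1, 0.7, 0.1]
print(detect_events(toy_envelope, threshold=0.5))
```

A detector of this kind only marks where energy is present; it does not separate call types, which is where statistical models such as HMMs come in.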
In this study, we characterised the calls produced by adult meagre and developed an automatic recognition system to analyse the round-the-clock (24 h/day) recordings made during 7 months in aquaculture facilities. Acoustic studies carried out in captive environments can provide valuable insights to the vocal behaviour of wild populations (e.g., [22,35]). The specific objectives were to (i) create a library of calls; (ii) create an automated recognition system to detect and classify meagre calls and apply this to the acoustic files recorded between January and July; (iii) compare results between the automatic and the manual approach; and (iv) investigate diel and seasonal patterns of calling activity, namely in relation to the breeding season.

Table 1. Examples of automatic recognition systems for calls of different marine animals.
Abbreviations: ANN, artificial neural network; C, call type; GMM, Gaussian mixture model; HMM, hidden Markov model; I, individual; KNN, K-nearest neighbours; LPCC, linear prediction cepstral coefficients; MFCC, Mel-frequency cepstral coefficients; MRAF, multiresolution acoustic features; S, species; SCF, spectrogram correlator filter; Sparse, Sparse classification; SPL, sound pressure level; SVM, support vector machine.
Notes: (a) vector composed of several sound coefficients/parameters; (b) each vocalization was characterized by its simultaneous modulations in duty cycle and peak frequency; (c) features were selected using a local discriminant basis; (d) average logarithmic spectrum on the backpropagation network input layer; (e) a wavelet coefficient matrix, plus frequency features and a time feature; (f) SPL feature-based signal detector using a correlation coefficient to measure the matching with the training selected data; (g) a contour-based classifier that applies a number of noise-cancellation techniques to a spectrogram and then searches for connected regions of data which rise above a pre-determined threshold; (h) a generalized tonal sound detector for extracting representative frequencies of delphinid whistles; (i) cepstral coefficient features with first and second derivatives, unpredictability measure feature and MUSIC algorithm feature.

Results
Captive meagre produced low-frequency pulsed calls with most energy below 1 kHz (see Materials and Methods). The number of pulses in a sound varied from 1 to 160 (mean ± SD, 46.1 ± 33.0), sound duration from 38 to 2480 ms (799.6 ± 587.5), pulse period from 11.8 to 24 ms (17.4 ± 2.0) and peak frequency from 100 to 627 Hz (226.9 ± 147.5) (N = 222 calls). For this study, calls were manually classified into 6 different categories based on the number of pulses to accommodate previous descriptions [20]: long grunts (>30 pulses), intermediate grunts (7-29 pulses), short grunts (4-6 pulses), and 1 pulse, 2 pulses and 3 pulses. The shortest sound categories (1 pulse, 2 pulses and 3 pulses) were joined into one class for the automatic recognition system (see Materials and Methods for more details). Many grunts recorded in the tanks presented irregular pulse periods and amplitude modulation in the first pulses (cf. Materials and Methods). Grunts often began with shorter pulse periods.
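The pulse-number categories above can be written as a simple rule. The sketch below is only an illustration of that rule, with hypothetical function names. Note that the stated ranges (7-29 and >30) leave a call with exactly 30 pulses unassigned; the sketch places it with the long grunts, which is an assumption.

```python
def classify_call(n_pulses: int) -> str:
    """Assign one of the 6 manual categories from the number of pulses."""
    if n_pulses < 1:
        raise ValueError("a call must contain at least one pulse")
    if n_pulses <= 3:
        return f"{n_pulses} pulse" + ("s" if n_pulses > 1 else "")
    if n_pulses <= 6:
        return "short grunt"
    if n_pulses <= 29:
        return "intermediate grunt"
    # ASSUMPTION: exactly 30 pulses is not covered by the stated ranges;
    # it is treated as a long grunt here.
    return "long grunt"


def recognition_class(n_pulses: int) -> str:
    """Category used for the automatic system: 1-3 pulses form one class."""
    if n_pulses <= 3:
        classify_call(n_pulses)  # reuse the validity check
        return "pulses"
    return classify_call(n_pulses)


for n in (1, 3, 5, 12, 46, 160):
    print(n, classify_call(n), "|", recognition_class(n))
```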

Validation of the Automatic Recognition System: Automatic Versus Manual Detection
The 20-2000 Hz bandwidth allowed the best recognition rate of meagre calls by the defined categories on the recognition system. A mean identification rate of 43.3% (and an accuracy of 2.2%) was obtained for the overall system using as test the recordings from 6 days (March 28th; April 18th and 28th; May 18th and 28th; and June 18th). However, when joint categories were considered ('long grunts + intermediate grunts' and 'short grunts + pulses'), the overall mean rose to 78.8% (with an accuracy of 76.6%) due to the type of substitution errors observed. For example, most of the long grunts with fewer than 45 pulses were classified as intermediate grunts. In addition, some mistakes were observed at the boundary between intermediate and short grunts (e.g., short grunts with 6 pulses were classified as intermediate grunts). The majority of 1-3 pulse calls were misclassified as short grunts (Figure 1); as such, in the seasonal and diel analyses, the short grunt category will be referred to as the 'short grunt + pulses' category. Some intermediate grunts were also segmented into 2 short grunts (1 substitution error and 1 false positive). Most false negatives reported for pulses were observed in choruses with sequences of pulses, due to misclassification of several pulses as one short grunt (e.g., 4 sequential pulses classified as a short grunt, producing one substitution error and 3 false negatives). Most calls with a signal-to-noise ratio under circa 4 dB were correctly ignored by the system, avoiding calls produced by fish housed in adjacent tanks. Table 2 shows the confusion matrix.
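The text does not define 'identification rate' and 'accuracy'. One convention that is common in HMM-based recognition scoring counts substitutions (S), deletions or false negatives (D) and insertions or false positives (I) against the number of reference calls (N). The sketch below assumes that convention, which may differ from the one actually used, and applies it to the two error patterns described above. Function names and the mixed example are hypothetical.

```python
def percent_correct(n_ref: int, substitutions: int, deletions: int) -> float:
    """ASSUMED definition: 100 * (N - S - D) / N."""
    return 100.0 * (n_ref - substitutions - deletions) / n_ref


def accuracy(n_ref: int, substitutions: int, deletions: int,
             insertions: int) -> float:
    """ASSUMED definition: 100 * (N - S - D - I) / N (can be negative)."""
    return 100.0 * (n_ref - substitutions - deletions - insertions) / n_ref


# 4 sequential pulses recognised as one short grunt:
# 1 substitution and 3 false negatives out of 4 reference calls.
print(percent_correct(4, 1, 3), accuracy(4, 1, 3, 0))

# 1 intermediate grunt segmented into 2 short grunts:
# 1 substitution and 1 false positive out of 1 reference call.
print(percent_correct(1, 1, 0), accuracy(1, 1, 0, 1))

# Hypothetical mixed case: 10 reference calls, 1 substitution, 2 insertions.
print(percent_correct(10, 1, 0), accuracy(10, 1, 0, 2))
```

Under this convention, insertions lower the accuracy but not the percentage of correctly identified calls, which is one way an accuracy can fall well below the identification rate.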

Seasonal Changes in Calling Rate
From January to May, the number of calls produced increased markedly (Figure 2A,B). The lowest number of detected calls occurred from January to March. In January, February, March, April and May, in the days/hours sampled, we manually counted a mean of 134, 115, 123, 317 and 2839 calls per day, respectively. In June, the number of calls detected manually decreased to a mean of 210 calls (Figure 2B). Similarly, the automatic recognition system, using all available recordings, showed an analogous call rate trend throughout the spawning season. In July, the number of calls detected by the recognition system increased concurrently with the number of spawning events. This is consistent with the occurrence of peaks in acoustic activity during the breeding season (Figure 2A). Note that, even ignoring most calls with a signal-to-noise ratio under circa 4 dB, the recognition system still detected some calls produced by fish in adjacent tanks.
We assessed the variation of the different call categories throughout the breeding season using the automatic recognition system. The mean daily percentage of calls with 1-3 pulses, which were not discriminated by this system, increased from January (0.3%) to July (7.5%) (Figure 2). Detailed inspection, using manual annotation, of the recordings obtained from April to June revealed that calls composed of 2 and 3 pulses co-occurred with one-pulse sequences. The calls with 2 and 3 pulses were, however, in a much lower proportion (Figure 1). The increase in the occurrence of these calls was more evident than depicted in Figure 2C, since many of the calls classified as 'short grunt + pulses' by the automatic recognition system were indeed calls with 1-3 pulses (see Figure 1). March, April and May were the months with the highest percentage of long grunts (4.7, 4.2 and 4.8% of all calls, respectively), and May showed the highest daily mean number of long grunts (562). The proportion of intermediate grunts increased from February (14.0%) to April (24.4%), and then decreased until July (10.8%). The highest percentage of short grunts + pulses occurred in February (83.4%), but the highest number of short grunts + pulses was observed in May (a mean of 9366 calls per day).
Notice that the mean water temperatures, computed per month, steadily increased from January to July (18 °C to 22 °C; Figure 2F).

Figure 2.
Seasonal variation of (A) the number of calls per day, (B) the mean number of calls per month, (C) the proportion of sound categories identified by the automatic recognition system, (D) the number of days with spawning events per month, (E) the eggs' total weight and (F) mean water temperature per month. A total of 7492 calls were counted and categorized manually from 3-hour recordings, between 18:00 and 21:00, in two days per month, from January to June 2018. The values obtained with the automatic recognition system represent monthly means of the calls counted on the full round-the-clock recordings. Notice that in (A) the horizontal axis is a continuous scale of days at each month. In addition, note that spawning (D and E) occurs in discrete events.

Seasonal Changes in Sound Features
Acoustic parameters of long grunts varied significantly among months. Sound duration (One-Way ANOVA; F(4,75) = 5.22, p < 0.001) and number of pulses (One-Way ANOVA; F(4,75) = 2.20, p = 0.08) decreased from March to June, and increased in July (Figure 3A,C; Table 3). Pulse period (One-Way ANCOVA; F(4,75) = 8.24, p < 0.001) decreased from March to May, increased in June and decreased again in July. The covariate temperature only had a significant effect on the pulse period; the pulse period decreased with water temperature (F(1,75) = 10.56, p = 0.002) (Figure 3B; Table 3). On the sampled days, water temperatures increased rapidly from March to May (18 to 22 °C), decreased in June (from 22 to 20 °C) and increased again in July (21 °C). Peak frequency showed a similar trend to the pulse period (Kruskal-Wallis test, H = 36.64, p < 0.001) (Figure 3D; Table 3).

Diel Changes in Calling Rate
Mean call rates varied with time of day, peaking near sunset and dropping off before midnight (Figure 4A). Interestingly, preliminary visualization of calling activity suggested at least two types of rhythms, depending on the overall daily number of calls. Thus, days were ranked by the number of calls counted per day and classified in percentiles. On the days with lower acoustic activity (25th percentile) and no chorusing behaviour, calls were more evenly produced throughout the 24-hour cycle (Figure 4C). When the acoustic activity increased (75th percentile), the peak in call rates between 15:00 and midnight was clearly pronounced, reaching a mean calling rate of 1480 calls/hour at 21:00 (Figure 4B). Furthermore, the longer sound categories seem to appear later in the night (Figure 4B). Note that the distinction of sound categories depicted in Figure 4 was obtained by the automatic recognition system, and, as mentioned above, it recognised a high proportion of pulse calls as short grunts.
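The ranking of days by activity and the hourly call counts described above can be sketched as follows. This is an illustration only: the nearest-rank percentile definition, the function names and the daily totals are assumptions, since the text does not state how the percentiles were computed.

```python
import math
from collections import Counter
from typing import Dict, List, Tuple


def hourly_counts(call_hours: List[int]) -> List[int]:
    """Count calls in each clock hour (0-23) from a list of call hours."""
    counts = Counter(call_hours)
    return [counts.get(h, 0) for h in range(24)]


def split_days_by_activity(daily_totals: Dict[str, int]) -> Tuple[List[str], List[str]]:
    """Return (low_days, high_days) using the 25th and 75th percentiles.

    ASSUMPTION: nearest-rank percentiles of the daily call totals.
    """
    ordered = sorted(daily_totals.values())
    n = len(ordered)
    p25 = ordered[max(0, math.ceil(0.25 * n) - 1)]
    p75 = ordered[max(0, math.ceil(0.75 * n) - 1)]
    low = sorted(d for d, c in daily_totals.items() if c <= p25)
    high = sorted(d for d, c in daily_totals.items() if c >= p75)
    return low, high


# Hypothetical daily totals (calls per day) for eight recording days.
totals = {"d1": 100, "d2": 120, "d3": 300, "d4": 2800,
          "d5": 9000, "d6": 150, "d7": 200, "d8": 4000}
low_days, high_days = split_days_by_activity(totals)
print(low_days, high_days)

# Hypothetical call times (hour of day) for one day.
print(hourly_counts([21, 21, 20, 3])[20:22])
```

Averaging the hourly counts separately over the low-activity and high-activity days gives the two diel profiles that are compared in the text.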

Discussion
Our study registered calls from captive meagre during 7 months and shows that A. regius has a greater variety of vocalizations than previously thought. According to Lagardère and Mariani [20], this species produces two sound types, short grunts (4-6 pulses) and long grunts (30-112 pulses). In this study, however, 6 putative categories (i.e., 1, 2 and 3 pulses, short, intermediate and long grunts) were observed, all of which appeared to be associated with the breeding season. The different sound categories mainly differed in the number of pulses (and hence duration) as is common for the majority of sciaenids (e.g., number of pulses and call duration [19,53,54]). The distinction of meagre calls in validated sound types still warrants a more detailed analysis of its repertoire [55] but a richer variability of the meagre's acoustic repertoire is evident. Noticeably, calls made with 1-3 pulses, observed in long sequences, are clearly heard as knocking sounds in contrast with calls with more pulses that are heard as drumming sounds. Long grunts produced in captivity were similar to the ones found in the Gironde estuary [20] but in our study these calls exhibited a higher number of pulses, with a maximum of 160 pulses, in contrast with the 112 pulses described for the Gironde estuary. Interestingly, many grunts started with a faster pulse rate and amplitude modulation. This might be caused by changes in the sonic muscles' contraction period and strength, or in the bilateral coordination of the sonic muscles that might alternate in the beginning and later become synchronized. This is likely dependent on the central pattern generator.
We used the manually detected calls to create a library to train the HMMs in the automatic recognition system. We considered the same categories indicated above, except that the 1, 2 and 3 pulse calls were joined into a single category. Our automatic recognition system had difficulty discriminating some of the categories of meagre calls correctly. For instance, some short grunts with 5 and 6 pulses, and the majority of long grunts with fewer than 45 pulses, were classified as intermediate grunts. Moreover, many 1-3 pulse calls were classified as short grunts. The meagre system successfully detected all call types, but the observed substitution errors probably occurred because we tried to classify a continuum in sound pulse numbers into an artificial discrete set of categories that may not constitute distinct call types. The lowest identification rate was 6.6%, for the pulses, which were for the most part incorporated into the short grunt category. Notice, however, that almost all calls classified as pulses were pulses. In addition, the noise in the aquaculture facility caused a reduction in the signal-to-noise ratio of calls, leading to some misclassified calls. When the four call categories in Table 2 were grouped into only two categories, i.e., 'long + intermediate grunts' and 'short grunt + pulses', the automatic recognition system showed a higher and usable identification rate. The identification rates obtained for these joint categories are similar to the one obtained with the system developed for the boatwhistle calls of Lusitanian toadfish, which had an identification rate higher than 90%, but which failed to correctly recognize other call types (i.e., croaks and grunts) [15].
The application of automatic classification methods to fish calls is still scarce ( Table 1). The work of Malfante et al. [42], using support vector machine algorithms (SVM), showed high accuracy with continuous recordings (average accuracy reaches 93.4%). Notice that, in contrast to meagre calls that exhibit a constant frequency spectrum, those authors classified call types with very distinct temporal and spectral features, which should make a correct automatic classification easier. Moreover, the same authors analysed consecutive fixed segments of 0.5 s not taking into account adjacent time windows, thus considering a sound encompassing more than one segment as more than one hit. This is an important limitation of the recognition system, e.g., for call counting operations or assessing call boundaries. Instead, HMM-based systems statistically model both temporal and spectral sound variations along a recording that permits recognition of the start and end time of each call, thus allowing call-counting operations. Monczak et al. [38], working on an estuarine soundscape in the May River (South Carolina, USA), distinguished the calls from four different sciaenid species with a good identification rate (higher than 61%), but did not discriminate call types within each species. Note that their validation method used a qualitative manual approach. Contrary to the work of Monczak et al. [38], we were not faced with the challenge of calling overlaps. Calling overlaps, however, occur in choruses by wild meagre aggregations, and these impose a challenge for acoustic monitoring. Analysing choruses with a high rate of overlaps demands a recognition system trained with chorus data sets, possibly incorporating dedicated methods (e.g., Lin et al. [40]), and will certainly not allow individual call identification. Harakawa et al. [41] also presented an interesting hybrid automatic system to discriminate the sounds of some sciaenid species.
In this study, automatic and manual approaches consistently showed that acoustic activity increased concurrently with the number of spawning events. During the peak of the breeding season, choruses made up of grunts and pulse calls were observed. The former were already described for meagre [20], consisting of sequences of grunts or drums made by different individuals, frequently overlapping. In the present study, the drumming was so loud that it could clearly be heard without the aid of hydrophones even on a different floor. Many sciaenids have been reported to produce drumming calls during the spawning season [2,[56][57][58][59], suggesting that it may play an important role in reproduction, including formation of spawning aggregates and courtship behaviour. It is possible that meagre calling behaviour could be triggered by different cues, such as rising temperature, time of day, the presence of gravid females or calling activity by other males. The increase in knocking activity from March onwards is most likely linked to spawning as it concurred with an increase in the number of spawning events. These results call for a more detailed study to ascertain which sounds are made just before, during and after spawning, as in the present study we only analysed the overall mean activity.
Variability found in acoustic parameters of long grunts analysed from March to July revealed a seasonal pattern, with temperature being a major contributor to the pulse period variation. Because most fishes are ectotherms, the capability and speed of metabolic and physiological processes are influenced by the surrounding water temperature. Sound characteristics are expected to change with temperature, since it influences muscle contraction properties [60][61][62]. Generally, rising temperatures increase muscle contraction velocity and shorten muscle twitches, allowing the accommodation of a faster rate of pulse/call emissions that may be caused by faster oscillations of the central pattern generator neural network [63]. In this case, rising temperatures should cause a decrease in pulse duration and period, and an increase in fundamental/peak frequency [64][65][66][67]. Pulse period decreased with rising water temperatures from March to May (18 to 22 °C), significantly increased when, in June, temperatures declined (to 20 °C) and decreased again when temperature rose (21 °C) in July. These findings indicate that changes in water temperature as low as 1 °C can considerably affect the sound characteristics of this species. Peak frequency followed a similar seasonal pattern to pulse period. Note that, because calls presented many frequency peaks with similar energy, it is possible for different peaks to have slightly greater energy than the rest, thus adding considerable variability to the data. Results concerning the seasonality of long grunt peak frequency should thus be seen with caution. In general, the seasonal pattern observed in the acoustic parameters is likely also related to photoperiod, circulating hormones and other seasonal factors such as sonic muscle hypertrophy [2,67]. This variation in acoustic parameters of long grunts did not cause a noticeable impairment of the automatic system.
On the days with higher calling activity, captive meagre exhibited diel periodicity, with acoustic activity generally beginning at dusk. These observations are in accordance with what is known for several other sciaenid species [23,68]. Atlantic croaker (Micropogonias undulatus), sand seatrout (Cynoscion arenarius) and red drum (Sciaenops ocellatus), for example, show similar calling patterns with sound production increasing at laboratory simulated dusk [69,70]. Saucier and Baltz [24] showed that spotted seatrout sound production occurred from 17:00 to 01:00 and that 92% of the drumming occurred between 19:00 and 23:00. Maintaining daily patterns of calling may be an advantage to broadcast spawners like sciaenids by assuring that a large number of fish would be in spawning condition at the same time of day, thereby maximizing fertilization of the high number of eggs released into the water column. Dawn or daylight spawning occurs in several species that have visual courtship displays [71,72] but sciaenids primarily use sound for courtship displays, so spawning is probably not dependent on light [73]. Mass spawning at dusk in sciaenids is most likely an adaptation to limit the predation on eggs [22,69], since planktivores are inactive during this time [74][75][76]. Interestingly, we also observed long sequences of mostly single pulses audible as knocks, included in the 'short grunt + pulses' category. These series of pulses could last for more than one hour and preceded the choruses of long and intermediate grunts. Biological functions for these knocks are unknown.
On the days with lower calling activity, fish acoustic activity showed a less pronounced diel rhythm. These observations suggest the existence of periods where the calls are used for nonreproductive behaviours such as feeding or agonistic behaviour [77].
In conclusion, we show that automatic recognition methods based on hidden Markov models coupled with PAM are a valid option for monitoring and studying fish vocal activity. This tool presented good identification rates, being much more cost- and time-effective than manual detection and classification of sounds. In addition, despite the different sampling methods, the automatic and manual approaches presented similar results (seasonal patterns of acoustic activity), thus further supporting its value as a tool for monitoring fish species. Furthermore, this kind of automatic recognition system can have other applications, from monitoring of biological activity in the natural habitat to characterization of disturbances due to human activities.
This study revealed that the meagre has a richer acoustic repertoire than previously thought. Not only do we describe a richer variability for grunts [20] but we also report knocking calls composed of a single pulse and sometimes up to 3 pulses. These sounds were very abundant and preceded long grunt meagre choruses. In fact, we have previously found these sounds in nature (Tagus estuary), without knowing the source species, and have only been able to confirm the source by studying captive fish.
Both manual and automatic analysis showed a similar seasonal pattern of the overall calling rate associated with the occurrence of spawning events, suggesting that monitoring acoustic activity should be useful to quantify the spawning occurrence of a community. Furthermore, we illustrated how passive acoustic monitoring combined with automatic recognition is a reliable, powerful, and non-invasive tool that can be used to identify the presence of vocal fish aggregations and characterize the acoustic behaviour of adult meagre in captivity and ultimately in the field. For example, our findings on the daily and seasonal cycles of sound production in A. regius can be used to interpret the movements/activity of meagre in spawning grounds in nature. As in the present study we did not know the exact time of egg deposition, future experiments should focus on a more detailed analysis considering a finer temporal scale of the acoustic activity and call features in association with the spawning events. Moreover, behavioural observations of chorusing species such as sciaenids are lacking. It would be interesting to carry out visual observations to investigate if the different sound types are related to different behavioural contexts or just a result of different degrees of sexual arousal. In addition, the effects of temperature on A. regius call characteristics should be studied in more detail. This increase in knowledge could provide an invaluable tool to enable the prediction of spawning events in aquaculture facilities and in the field.

Fish Maintenance
Sound recordings were performed from a group of sexually mature meagre hosted at the aquaculture facilities of Instituto Português do Mar e da Atmosfera - Estação Piloto de Piscicultura de Olhão (IPMA-EPPO), Portugal (37°02′ N, 7°49′ W). Fish were reared in an indoor concrete parallelepipedic 11 m³ tank (3 m², 120 cm deep) under natural photoperiod, natural temperature, ranging between 14 and 23 °C (measured once a day), continuous water supply, controlled pH (8 ± 0.4), salinity (37 ± 0.2 psu) and oxygen levels close to saturation (80 ± 7.6%). Meagre were fed several times a day with inert semi-moist feeds. Individuals housed in the two tanks studied (n = 18) were 9 and 6 years old, exhibited sex ratios of 6:2 and 4:6 (M:F) and an average total length (TL) of 87 cm (69-102 cm).
IPMA is an organization certified to perform experimental work with animals. The institution has authorization from DGAV (the Portuguese National Authority for Animal Health), according to EU legislation, for EPPO to breed, use and supply aquatic animals for scientific experimental work (DGAV reference 0421/000/000/2018).

Acoustic Recordings
The data set consisted of 156 days of round-the-clock recordings of sounds from two tanks, obtained from January 16 to July 14, 2018, except from February 4 to 21 and April 24 to 27.
A hydrophone was positioned vertically at the centre of each tank at approximately 30 cm from the bottom and connected to a stand-alone 16-channel datalogger (LGR-5325, Measurement Computing Corp., Norton, MA, USA; 12 kHz sampling rate, 16 bit, ±1 V range). We used one custom-made hydrophone [78] and one High Tech 94 SSQ hydrophone (sensitivity of -165 dB re 1 V/µPa, flat frequency response up to 6 kHz ± 1 dB).
Overall, the dataset contained calls produced by several fish (including calls from meagre in adjacent tanks) and an almost constant noise mostly produced by the flowing water and pumps.

Detection of Spawning Events
Eggs were collected from a skimming port in front of the tank that drained into an egg collection container equipped with a 250 micron mesh net. These nets were checked daily (early morning and late afternoon) for the presence of eggs.

Manual Sound Detection, Classification and Feature Measurement
Captive meagre were previously observed to exhibit diel periodicity of sound emissions, generally beginning at dusk, reaching a maximum at approximately 20:00/21:00, and ending at 24:00. For this reason, the manual approach was focused on this time frame.
Meagre calls were selected from sound files using Adobe Audition 3.0 (Adobe Systems Inc., CA, USA) and each call was subsequently analysed with Raven 1.5.1 for Windows (Bioacoustic Research Program, Cornell Laboratory of Ornithology, Ithaca, NY, USA).

Sound Detection and Classification
Vocalizations were detected visually (spectrogram and oscillogram) and aurally in each file. Vocalizations were then classified into six categories based on pulse number: long grunts (more than 30 pulses; Figure 5A), intermediate grunts (7-29 pulses; Figure 5B), short grunts (4-6 pulses; Figure 5C), 1 pulse, 2 pulses and 3 pulses (Figure 5D). Sounds in which the number of pulses could not be clearly determined, because part of the sound had an unclear pulsed structure, were labelled as indeterminate. The latter could be caused by body movements of the vocalising fish. This classification attempted to include both the short and long grunt categories previously reported [20] and the calls that were registered in the present study.

Acoustic Features Measurements
Temporal features and peak frequency were measured in the selected long grunts. Only sounds with a good signal-to-noise ratio recorded from the selected tank, which was equipped with a custom-made hydrophone [78], were used in the analyses. The following temporal and spectral parameters of sounds were measured: call duration (ms), as the time from the onset of the first pulse to the offset of the last pulse; number of pulses, obtained by manually counting the pulses in each call; pulse period (ms), the time interval between the peaks of two consecutive pulses in a call, obtained by dividing the call duration by the number of pulses minus one; and peak frequency (Hz), the frequency presenting the highest energy level in the call. Temporal parameters were measured from oscillograms, while peak frequency was measured from power spectra (sampling frequency 12 kHz, Fast Fourier Transform (FFT) size 1024 points, Hamming window, time overlap 50%). Calls presented several frequency peaks with similar energy (Figure 6). Consequently, small differences in relative energy meant that the peak frequency corresponded to different frequency peaks in different calls, imparting considerable variability to the data. See Figure 6 for a representation of some of the measured acoustic parameters.
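As an illustration of the pulse period calculation described above, the following minimal sketch divides the call duration by the number of inter-pulse intervals. The function name and the example values are hypothetical and are not taken from the study data.

```python
def pulse_period_ms(call_duration_ms: float, n_pulses: int) -> float:
    """Mean pulse period: call duration divided by the number of pulses minus one."""
    if n_pulses < 2:
        raise ValueError("a pulse period needs at least two pulses")
    return call_duration_ms / (n_pulses - 1)


# Hypothetical long grunt: 31 pulses spanning 600 ms (illustrative values only).
example_period = pulse_period_ms(600.0, 31)
print(example_period)  # 20.0
```

Because the duration runs from the onset of the first pulse to the offset of the last, this is an average estimate of the peak-to-peak interval rather than a direct measurement of each interval.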

Automatic Recognition
The proposed automatic recognition system was adapted from that described by Vieira and colleagues [15] and uses hidden Markov models (HMMs). For sound recognition, an HMM is a statistical linear state model in which sequential states represent spectral configurations along time. Each HMM is characterized by the probabilities of transitioning from one state to the next, and by the probabilities of a particular feature observation occurring while in each state. Briefly, one HMM is trained for each defined category using sounds of that category, and the recordings are then classified according to the model with the highest likelihood. The overall flowchart of the method is shown in Figure 1 in [15].

Signal Processing
The waveform signal must be divided into a sequence of elementary segments according to a predefined window duration (see Figure 1, cf. [15]). This window should be longer than a cycle of the lowest relevant frequency, but short enough to provide temporal resolution. After some preliminary tests, we chose a window of 32 ms with a 50% overlap to avoid losing information on the transition between two consecutive elementary segments [79]. We used the following acoustic features: cepstrum, Mel-frequency cepstral (MFC), delta and acceleration coefficients. The selected frequency bandwidth ranged from 20 to 2000 Hz.
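The segmentation step can be sketched as follows. This is a minimal illustration, assuming the 12 kHz sampling rate of the datalogger, so that a 32 ms window holds 384 samples and a 50% overlap gives a step of 192 samples. The function name and the choice to drop a trailing partial window are assumptions, and feature extraction (cepstral, delta and acceleration coefficients) is not shown.

```python
def frame_signal(samples, sample_rate_hz=12_000, window_ms=32, overlap=0.5):
    """Split a waveform into fixed-length, overlapping elementary segments."""
    window = int(round(sample_rate_hz * window_ms / 1000))  # 384 samples at 12 kHz
    hop = int(round(window * (1 - overlap)))                # 192 samples for 50% overlap
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    frames = []
    # Assumption: a trailing partial window is dropped rather than zero-padded.
    for start in range(0, len(samples) - window + 1, hop):
        frames.append(samples[start:start + window])
    return frames


# One second of silence at 12 kHz, used only to show the framing arithmetic.
one_second = [0.0] * 12_000
frames = frame_signal(one_second)
print(len(frames), len(frames[0]))  # 61 384
```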

The HMM Time Alignment Structure
A human phoneme is usually modelled by three states [80]. However, because fish calls do not contain phonemes, we assumed that the number of states should be equal to or higher than the number of different consecutive parts of the sound. Note that we used models with a linear topology in which each state could transition to itself, to the next state or to the one following it (except the initial and final states, where self-transitions are meaningless as they only serve as signal boundary markers).
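The linear topology can be pictured as a transition matrix whose only non-zero entries allow a state to repeat, advance by one, or skip one state ahead. The sketch below builds such a matrix under several assumptions: "the following one" is read as the state after the next, the boundary states are treated as non-emitting entry and exit states, and the uniform starting probabilities are placeholders that training would re-estimate. The function name is hypothetical.

```python
def left_to_right_transitions(n_emitting: int):
    """Initial transition matrix for a linear HMM with self, next and skip-one moves.

    State 0 is a non-emitting entry state and state n_emitting + 1 is a
    non-emitting exit state; neither has a self-transition.
    """
    if n_emitting < 1:
        raise ValueError("at least one emitting state is required")
    n = n_emitting + 2
    matrix = [[0.0] * n for _ in range(n)]
    matrix[0][1] = 1.0  # the entry state always moves to the first emitting state
    for i in range(1, n - 1):
        # Allowed targets: stay, advance one state, or skip one state.
        targets = [j for j in (i, i + 1, i + 2) if j < n]
        for j in targets:
            matrix[i][j] = 1.0 / len(targets)  # placeholder values before training
    return matrix


# Five emitting states, as used here for the shorter sound models.
for row in left_to_right_transitions(5):
    print([round(p, 2) for p in row])
```

No entry below the diagonal is ever filled, which is what prevents the model from moving backwards in time.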
We created four models to classify meagre calls into different types based on pulse number: long grunts (more than 30 pulses; Figure 4A), intermediate grunts (7-29 pulses; Figure 4B), short grunts (4-6 pulses; Figure 4C) and pulses (1, 2 or 3 pulses; Figure 4D). After some preliminary tests, we used 14 states for the long grunt and intermediate grunt models, and 5 states for the short grunt, pulse and background noise (silence) models. We added extra 5-state models for non-biological patterns with high energy and short duration (e.g., consecutive non-biological pulses with high energy).
For each sound type, a representative subset of samples was used to train the HMMs. The transition probabilities and the elementary segment probability densities of each state were estimated with the Baum-Welch algorithm [81].
In the recognition phase, each sound was matched against the estimated HMM of every sound type. This was achieved by using the Viterbi algorithm [82], which produced a likelihood measure for each HMM.
For computations we used the HMM Toolkit (HTK, University of Cambridge, UK), a group of modules written in C to create automatic recognition systems for human speech [83].
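The recognition step can be illustrated with a toy example in which each model scores a sound by its best state path and the highest-scoring model wins. This is a simplified sketch, not the HTK implementation: it assumes discrete observation symbols (0 for a quiet segment, 1 for a pulse-like segment) instead of continuous feature vectors, and the model names and all probabilities are invented for illustration.

```python
import math

NEG_INF = float("-inf")


def _log(p):
    """Natural log that maps zero probability to minus infinity."""
    return math.log(p) if p > 0 else NEG_INF


def viterbi_log_likelihood(obs, start, trans, emit):
    """Log-probability of the single best state path for a discrete-observation HMM."""
    n_states = len(start)
    score = [_log(start[s]) + _log(emit[s][obs[0]]) for s in range(n_states)]
    for symbol in obs[1:]:
        score = [
            max(score[p] + _log(trans[p][s]) for p in range(n_states))
            + _log(emit[s][symbol])
            for s in range(n_states)
        ]
    return max(score)


def classify(obs, models):
    """Return the name of the model whose best path has the highest score."""
    return max(models, key=lambda name: viterbi_log_likelihood(obs, *models[name]))


# Hypothetical toy models: (start probabilities, transition matrix, emission matrix).
models = {
    "pulses": ([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]], [[0.1, 0.9], [0.9, 0.1]]),
    "noise": ([1.0], [[1.0]], [[0.9, 0.1]]),
}

print(classify([1, 0, 0], models))  # pulses
print(classify([0, 0, 0], models))  # noise
```

Working in the log domain avoids numerical underflow on long sequences, which matters when sounds span many elementary segments.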

System Training and Testing Process
An automatic HMM-based system was prepared to recognize and separate meagre calls into four categories based on pulse number (long grunts, intermediate grunts, short grunts and pulses). The training set used to produce the recognition system included 129 meagre calls from a previously manually classified dataset of 222 calls (including indeterminate calls not used in the training) produced by fish housed in each tank. The system was tested with circa 18 hours of continuous recordings made between 18:00 and 21:00 on 6 days (March 28, April 18 and 28, May 18 and 28, and June 18).

Evaluation of the Recognition System
For each optimal alignment, we determined the number of substitution errors (S, when one signal type is recognised as another signal type), deletion errors (D, when a sound type occurs but is not detected by the system, i.e., a false negative), insertion errors (I, when a signal is detected by the system but did not occur, i.e., a false positive) and the total number of labels in the reference transcriptions (N) (Young et al. 2000). The performance of the recognition systems was then evaluated by computing the percentage of correctly recognized sounds (identification rate) using:

$$\text{Identification rate (\%)} = \frac{N - D - S}{N} \times 100,$$

or by computing the recognition accuracy, which additionally penalizes insertion errors, using:

$$\text{Recognition accuracy (\%)} = \frac{N - D - S - I}{N} \times 100.$$
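The two performance measures can be computed directly from the error counts. The sketch below assumes the standard HTK definitions, in which the identification rate ignores insertion errors while the recognition accuracy also subtracts them. The function names and the example counts are hypothetical.

```python
def identification_rate(n: int, d: int, s: int) -> float:
    """Percentage of reference labels correctly recognized: (N - D - S) / N x 100."""
    if n <= 0:
        raise ValueError("N must be positive")
    return 100.0 * (n - d - s) / n


def recognition_accuracy(n: int, d: int, s: int, i: int) -> float:
    """Accuracy that also penalizes insertions: (N - D - S - I) / N x 100."""
    if n <= 0:
        raise ValueError("N must be positive")
    return 100.0 * (n - d - s - i) / n


# Illustrative counts only: 200 reference labels, 10 deletions,
# 20 substitutions and 30 insertions.
print(identification_rate(200, 10, 20))       # 85.0
print(recognition_accuracy(200, 10, 20, 30))  # 70.0
```

Because insertions are subtracted, the accuracy can become negative when a system produces many false positives, whereas the identification rate cannot.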

Seasonal Changes in Calling Rate
The daily number of each type of vocalization was obtained with both manual and automatic methods. Using the automatic recognition system, we analysed all the round-the-clock recordings available from January 16 to July 14, 2018. No recordings were available from February 4 to 21 and April 24 to 27.
For manual detection, we restricted the sampling effort to two days in each month, the 18th and 28th, between 18:00 and 21:00. A total of 7492 calls were manually counted.

Seasonal Changes in Call Features
Temporal features and peak frequency were measured in 10 to 20 randomly selected long grunts, excluding calls with a low signal-to-noise ratio, recorded on up to two consecutive days of every month from March to July, depending on the presence or absence of calls (March 27 and 28, April 29, May 18 and 19, June 4, and July 5 and 6). Long grunts were not detected in the periods sampled in February. The four temporal and spectral acoustic features measured are described above.

Diel Changes in Calling Rate
The number of each type of vocalization present in the recordings was obtained from the automatic recognition system. Using this system, we counted the number of calls per hour in all round-the-clock recordings available from January to July 2018.
Preliminary visualization of calling activity suggested at least two types of rhythm, depending on the overall daily number of calls. Thus, days were ranked by the number of calls detected on each day and classified into percentiles. Here we show the diel activity for the days below the 25th and above the 75th percentiles.
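The percentile-based grouping of days can be sketched as follows. This is a minimal illustration that assumes a linear-interpolation percentile and strict inequalities at the cut-offs, neither of which is specified in the text. The function names, day labels and call counts are hypothetical.

```python
def percentile(values, q):
    """Linear-interpolation percentile (q between 0 and 100) of a non-empty list."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (pos - lower)


def split_days_by_activity(calls_per_day):
    """Return the days below the 25th and above the 75th percentile of daily calls."""
    counts = list(calls_per_day.values())
    low_cut = percentile(counts, 25)
    high_cut = percentile(counts, 75)
    low_days = sorted(day for day, n in calls_per_day.items() if n < low_cut)
    high_days = sorted(day for day, n in calls_per_day.items() if n > high_cut)
    return low_days, high_days


# Hypothetical daily call totals (illustrative values only).
daily_calls = {
    "day1": 0, "day2": 10, "day3": 20, "day4": 30, "day5": 40,
    "day6": 50, "day7": 60, "day8": 70, "day9": 80,
}
low, high = split_days_by_activity(daily_calls)
print(low)   # ['day1', 'day2']
print(high)  # ['day8', 'day9']
```

Hourly call counts would then be averaged separately within each group to obtain the two diel activity profiles.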

Statistical Analysis
Statistical analysis was conducted using the software Statistica (version 13, TIBCO Software Inc., Palo Alto, CA, USA). A significance level of P = 0.05 was used for all analyses [84]. Probability plots, residual vs. predicted plots and Levene tests were used to assess the normality of the data and the homogeneity of variances.
For the study of seasonal changes in sound features, we tested whether temperature had a significant effect on the acoustic variables of long grunts with a one-way analysis of covariance (ANCOVA). Water temperature was included as a continuous variable (covariate) because it can influence acoustic parameters [85]. When the effect of this covariate was not significant, one-way ANOVAs were conducted instead. The pulse period was log-transformed to meet the ANOVA assumptions. The homogeneity of variances assumption was not met for peak frequency, even after applying common data transformations [84]. In this case, a non-parametric Kruskal-Wallis test was performed.