1. Introduction
Understanding the function of animal vocalisations requires careful assessment of both the behavioural and environmental contexts in which they are produced and the behavioural response they induce in the receiver. While recording and analysing the vocal behaviour of the sender provides first indications about the communicative function of a vocalisation, playback experiments have proven to be a powerful tool to experimentally test these hypotheses by simulating natural vocal interactions and examining how the receiver responds to specific sounds under controlled conditions [
1,
2,
3].
Vocalisations that serve a communicative function by conveying information relevant to the receiver are typically expected to elicit clear, measurable responses in listeners. Being produced with the potential to influence the behaviour of others, these signals are considered social features [
4]. Common examples include contact and mating calls that facilitate reproduction, agonistic calls used to defend resources, and alarm calls that warn conspecifics of potential threats [
5]. Because these call types are critical for survival and reproductive success, they often encode rich information about the sender, enabling receivers to discern individual identity ([
6,
7,
8,
9], reviewed in [
10]), kinship [
11,
12,
13], sex [
14,
15,
16], and to discriminate between species [
17,
18,
19,
20,
21].
In contrast, other acoustic outputs do not necessarily serve an immediate communicative purpose, even though they can be frequently produced. These include signals such as sighs, sneezes or snorts, which are often emitted independently of a specific audience and may primarily fulfil non-communicative physiological functions [
22,
23,
24,
25].
In recent years, snorts specifically have received growing attention due to emerging evidence across a broad range of taxa suggesting that, beyond their presumed physiological origins, they may also carry functional relevance in emotional and social contexts. Across species, snorts have been linked to a variety of arousal states, including positive affective states [
26,
27], frustration and anxiety [
28], aggression [
29], and alertness and irritation [
30,
31,
32], as well as to behavioural contexts such as alarm signalling [
33,
34,
35], reproduction [
36], and close-range affiliative interactions [
37]. Within the order of Perissodactyla, snorts occur in all three extant families. In horses (
Equus ferus caballus), for example, snorts are considered to be an acoustic marker of positive emotional states [
26,
27], while in plains zebras (
Equus quagga), long snorts are associated with contentment, whereas short ones are emitted in alarm contexts [
38]. South American tapirs (
Tapirus terrestris) utter snorts in agonistic contexts [
39], while in the Indian rhinoceros (
Rhinoceros unicornis), snorts serve as contact calls [
40]. Taken together, this cross-species evidence suggests that while snorts may be structurally similar across taxa, their function is likely shaped by emotional state, context, and species-specific social systems.
In southern white rhinoceroses (
Ceratotherium simum simum), a semi-social species with highly territorial males and females that form temporarily stable social groups, snorts are the most frequently produced call type [
41] but remain the least understood. Acoustically, snorts in white rhinoceroses, similar to those in horses [
27], are characterised as noisy nasal exhalations comprising two structural subtypes, a non-pulsed and pulsed variation [
42], that are produced by individuals of all ages and both sexes [
41,
43]. While snorts have been consistently described in various studies on the vocal repertoire [
42,
43,
44] and have primarily been documented in non-social contexts such as feeding and resting [
42,
43], their communicative value, irrespective of the subtype, remains unknown.
In contrast, the communicative function of the pant call and its social and behavioural contexts are well established. Characterised as a repetitive sequence of inhalations and exhalations, pants primarily function as contact calls during social approaches or separations [
43,
44,
45]. They have been shown to exhibit the highest degree of individual distinctiveness among white rhinoceros call types [
45,
46] and to convey various information about the sender, including sex [
47] and territorial status [
48]. Playback experiments have also demonstrated that pants enable species-specific discrimination, with individuals able to differentiate between the two white rhinoceros subspecies based on acoustic cues [
47]. Moreover, a male-specific variation in the pant, known as the “Hic,” occurs during interactions with receptive females, further underscoring the call’s social relevance and functional plasticity in reproductive contexts [
44,
47,
48]. While these findings highlight the diverse types of information encoded in pants, further playback experiments are required to assess whether additional acoustic correlates, such as indicators of age or testosterone levels, are also present in this call type.
The aim of this study was to gain deeper insight into the communicative function of vocalisations in southern white rhinoceroses. Specifically, we investigated the function of snort vocalisations, which, despite being the most prevalent call type, lack a clear functional interpretation. To this end, we conducted a series of playback experiments to assess behavioural responses to both conspecific and heterospecific snorts, as well as to compare reactions to snorts versus pants. Given the high production rate of snorts and the observed structural variation within this call type, our approach aimed to determine whether snorts serve a specific communicative function and whether they contribute to species recognition.
We hypothesised that if snorts carried communicative value, the study animals would show stronger behavioural responses to conspecific snorts than to heterospecific snorts or control stimuli. Furthermore, if pants function as socially relevant calls coordinating interactions within and between individuals, we expected the animals to respond more strongly to pants than to snorts, which have primarily been observed in non-social contexts. In addition, we explored whether behavioural responses to pants differed depending on the characteristics of the sender, particularly comparing pants produced by juvenile males versus adult males with varying testosterone levels.
2. Materials and Methods
2.1. Subjects and Study Sites
The playback study was conducted on a total of 15 adult southern white rhinoceroses housed in three zoological institutions: Planète Sauvage in France (
n = 4), Serengeti-Park Hodenhagen in Germany (
n = 7), and Givskud Zoo in Denmark (
n = 4) (
Table 1).
At Planète Sauvage, playback experiments were carried out with four white rhinoceroses. Individuals were housed in two groups (Jambo with Sana and Dinari with Goliath), which were alternated between two non-connected outdoor enclosures, allowing the playback experiments to be conducted in the smaller one of the two (approx. 200 m2).
At Serengeti-Park Hodenhagen, seven study animals were tested. The playback experiments were carried out in an outdoor enclosure (approx. 230 m2) adjacent to the stables and the larger outdoor area. As a result, the study animals typically maintained visual, acoustic, and/or olfactory contact with the other conspecifics. Adjusting to the typical rotation schedule of the animals between the different compartments, the two study males, Martin and Ekozu, and the females Makena and Moana, were tested separately, whereas Claudia, Kianga, and Uzuri were tested together with their respective calves.
At Givskud Zoo, playback experiments were conducted with four study females in an outdoor enclosure (approx. 450 m2). While Mazumba and Samia were tested separately, Sofie and Inger were tested together.
All playback experiments were conducted during the daily housing routine, and all study animals had continuous access to water and hay and/or pasture.
2.2. Recordings and Preparation of Playback Stimuli
Playback stimuli for white rhinoceros vocalisations were obtained from recordings of previous studies [
41,
42,
46]. Playback stimuli for horses were provided from the Ethology Department at the University of Rennes. All senders were unfamiliar to the study animals. All playback stimuli were prepared using
Praat software (version 6.0.52, [
49]). For the snort playbacks (SPBs), five different sets were created, each containing one stimulus per category (a white noise control, a pulsed and non-pulsed conspecific snort of a white rhinoceros, and a pulsed and non-pulsed heterospecific snort of a horse). Single snorts were cut with a duration of approximately 0.83 s (±0.15 s) and ramped by a 0.3 s fade-in ramp at the beginning to prevent abrupt signal truncation by the loudspeaker. The white noise stimulus was synthesised with a length of 0.6 s. Each stimulus consisted of one single presentation of the sound (
Figure 1).
For the pant playbacks (PPBs), two different sets were created, each containing one conspecific stimulus per category: a non-pulsed snort call as control, a juvenile pant, a high-testosterone pant (high T-pant), and a low-testosterone pant (low T-pant). The faecal testosterone metabolite (fTM) levels of the sender males had been determined in a previous study [
50]: The average fTM concentration in adult, reproductively successful males selected as high-T pant senders was 25.85 ng/g faeces. Non-reproductive adult males selected as low-T pant senders had an average concentration of 11.74 ng/g faeces, while juvenile males selected to represent the juvenile pant stimulus exhibited a mean fTM level of 10.67 ng/g faeces.
Pants were selected with a duration of approximately 1.78 s (±0.65 s). Since the pilot tests of the snort playbacks revealed a low response intensity, the pant playbacks were also designed as a control to estimate whether the low response intensity was due to the single presentation or the low sound level of the played stimuli. Thus, for the pant playbacks, a stimulus chain consisting of three repetitions of a stimulus call with a 3 s pause between each repetition was created (
Figure 2). Additionally, a 0.3 s ramp was added at the beginning of each stimulus to prevent abrupt signal truncation. In total, the stimulus chains had a duration of 11.29 s (±1.92 s).
Stimuli for the snort playback were sound pressure levelled to 75 dB (±5 dB) re 20 μPa, and for the pant playback to 85 dB (±5 dB) re 20 μPa in a sound chamber at a distance of 1 m using a Precision Impulse Sound Level Type 2233 from Brüel & Kjaer (Nærum, Denmark). Due to a delay in loudspeaker activation, each stimulus/stimulus chain was preceded by a 300 ms 22 kHz sinusoidal tone outside the animals’ assumed hearing range, followed by a 20 ms silent interval before onset to guarantee a smooth playback.
2.3. Playback Setup and Procedure
The playback stimuli were played back from a laptop connected to a loudspeaker (JBL Xtreme 2, by Harman, Stamford, CT, USA). The response behaviour of the study animals was both video- and audio-recorded for five minutes following playback onset. Video recordings were made with two to three digital camcorders (Sony DCR-SR36E, Sony Corporation, Tokyo, Japan) covering the entire area of the outdoor enclosures. In Planète Sauvage, two digital camcorders were installed at the front and back of the outdoor enclosure. In Serengeti-Park Hodenhagen, three camcorders were installed along two sides of the enclosure, with an additional handheld camcorder used to adjust the focus on the study animals as needed. In Givskud Zoo, one camcorder (Panasonic HDC-SD600, Panasonic Holdings Corporation, Kadoma, Japan) was installed at the front of the outdoor enclosure, and an additional handheld camcorder (Sony HDR-CX240E, Sony Corporation, Tokyo, Japan) was used to adjust the focus on the study animals.
For audio recordings, an omnidirectional Sennheiser MKH 8020 microphone (Sennheiser electronic GmbH & Co. KG, Wedemark-Wennebostel, Germany; flat frequency response of 10–20,000 Hz ± 5 dB) or a directional Sennheiser ME64 microphone (flat frequency response: 40–20,000 Hz ± 2.5 dB) was used. Both microphones were fitted with a windshield and connected to a digital recording device (Sound Devices 702T State Recorder, Sound Devices LLC, Reedsburg, WI, USA; frequency response: 10–40,000 Hz; settings: 44.1 kHz sampling rate, 16-bit, uncompressed .wav format). In Givskud Zoo, a directional Sennheiser MKH 816 P48 microphone (Sennheiser electronic GmbH and Co. KG, Wedemark-Wennebostel, Germany; flat frequency response of 40–20,000 Hz ±5 dB) that was connected to a Marantz PMD561 audio recorder (Marantz, Carlsbad, CA, USA; frequency response: 20–24,000 Hz; settings: 44.1 kHz sampling rate, 16-bit, uncompressed .wav format) was used.
Except for the four females in Givskud Zoo, who were only presented with the pant playback, all study animals were tested in both the snort and the pant playback experiments (
Table 1). The acoustic stimuli were played at a distance of one to three body lengths between the loudspeaker and the study animals at an angle of approximately 180 degrees. The loudspeaker was permanently installed in Planète Sauvage and Givskud Zoo, whereas in Serengeti-Park Hodenhagen, its position was adjusted based on the animals’ locations. The playback stimuli were presented in a randomised order with an inter-playback interval of at least ten minutes at all three of the study sites. The maximum number of playback stimuli presented to a study animal per day was three in Planète Sauvage and six in Serengeti-Park Hodenhagen and Givskud Zoo. Some playbacks were repeated to ensure optimal positioning of the animals relative to the loudspeaker and to maximise the general audibility of the stimuli if unexpected acoustic disturbance occurred.
2.4. Vocal and Behavioural Analysis
For behavioural coding, video recordings were synchronised with respective audio recordings and analysed using the
Observer XT software (version 12, Noldus Information Technology, Wageningen, The Netherlands, Noldus, L. [
51]). The analysis was conducted by two different observers (MH: Planète Sauvage, Serengeti-Park Hodenhagen 2021, Givskud Zoo; JJ: Serengeti-Park Hodenhagen 2022). The Cohen’s Kappa coefficient was determined among the observers by comparing eight randomly selected observations. The mean coefficient of 0.65 indicated a substantial agreement between the observers [
52].
For each study animal that was presented with both the snort and the pant playback, a total of nine observations were analysed (five for SPB and four for PPB). For the four individuals from Givskud Zoo, four observations from the pant playbacks were analysed. Overall, 115 observations were included in the statistical analysis.
The following behaviours were coded during a five-minute observation period following the onset of the playback stimulus: ear movement, head movement, body turns towards the loudspeaker, approaching the loudspeaker, locomotion, and vocalisations (focusing on snort and pant vocalisations).
Based on a previous playback study on white rhinoceroses by Cinková and Policht [
47], a 5-stage scoring scale was adopted to evaluate the intensity of the behavioural reaction of the study animals, with higher scores reflecting more intense behavioural responses (
Table 2; for a detailed description,
Supplementary Table S1). Focusing on the
immediate reactions, only behavioural responses within the first 15 s after playback of the stimuli were analysed for the assessment of the score. Each individual was assigned the highest score based on the most intense behaviour displayed during this initial period.
In addition, the duration of locomotion (in seconds), the latency of approaching the loudspeaker (in seconds), and the call rate (per five minutes) were calculated as indicators for increased vigilance and attention in the study animals.
2.5. Statistical Analysis
Statistical tests were calculated in ‘RStudio’, version 2024.12.1.563 [
53]. The significance level was set at
p ≤ 0.05.
For the analysis of the effects of playbacks on the
behavioural score and the
call rate (= number of calls uttered in the 5 min observation period), we fitted generalised linear mixed-effects models using the ‘glmmTMB’ function (‘glmmTMB’ package, version 1.1.11), initially assuming a Poisson distribution (‘poisson’) appropriate for count data, e.g., [
54]. We assessed model assumptions by simulating residuals with the ‘simulateResiduals’ function (‘DHARMa’ package, version 0.4.7). If over- or underdispersion were indicated, we refitted the model using a generalised Poisson distribution (‘genpois’). For both distribution families, the log link function was used. Further, we tested for zero inflation using the ‘testZeroInflation’ function (‘DHARMa’ package) and, when detected, accounted for it by specifying a zero-inflation formula (‘ziformula’) within the model. If a model failed to converge, we examined the variance components using the ‘VarCorr’ function and removed the random effect with an estimated variance of zero to simplify the model.
For the continuous response variables locomotion and latency to approach the loudspeaker, we used linear mixed-effects models fitted with the ‘lmer’ function (‘lme4’ package, version 1.1-37). We tested residuals for normality using the K-S test (‘DHARMa’ package). If the test revealed a significant deviation from normality, we applied a square root transformation to the response variable.
For the Snort playback experiments, we employed two complementary modelling approaches. In the first, we investigated the effects of stimulus type (control, horse, and rhinoceros) and group type (single or paired) as well as the sex of the receiver (male or female) on the behavioural variables. Animal ID and zoo were included as random effects to account for repeated measures and potential site-level variation. In the second approach, we investigated the effect of species (conspecific, heterospecific) and pulsation (pulsed, non-pulsed), excluding the control trials, which did not differ in pulsation. The response variable locomotion was square rooted to meet the assumption of normality.
For the pant playback experiments, we used the predictor variables stimulus type (snort, juvenile pant, low T-pant, and high T-pant), group type (single or paired), and sex of the receiver (female or male), again including animal ID and zoo as random effects.
All models were initially fitted with a full set of fixed effects and interaction terms. We applied backward elimination by sequentially removing the highest non-significant interaction term, using the ‘anova’ function to compare model performance at each step. This process continued until (a) only main effects remained, (b) all remaining interactions were significant, or (c) the reduced model significantly differed from the previous model.
Statistical significance of predictor variables was assessed using the ‘Anova’ function (‘car’ package, version 3.1-3). For models with a significant effect of stimulus type or an interaction term, we conducted post hoc comparisons using the ‘emmeans’ function (‘emmeans’ package, version 1.11.1, ‘fdr’ adjustment for multiple comparisons, ‘response’ type for back-transforming from log to response scale).
4. Discussion
In this study, we used controlled playback experiments to investigate the communicative function of snort and pant vocalisations in southern white rhinoceroses under captive conditions. By presenting conspecific and heterospecific snorts as well as male pants differing in the sender’s age and faecal testosterone levels, we aimed to assess how these cues are perceived by receivers. Our findings provide insights into the functional relevance of snorts and pants and the effects of social context and sex on behavioural responses. In the following, we discuss these findings in light of previous research, identify possible limitations, and propose directions for future studies.
4.1. Communicative Function of Snorts
The findings from the snort playback experiments indicate that snort vocalisations did not elicit differentiated behavioural responses depending on whether the calls originated from conspecific or heterospecific individuals. This outcome suggests that snort calls either lack species-specific acoustic signatures, making them acoustically ambiguous to the receiver, or that, irrespective of the sender species, snort calls convey minimal communicative information and therefore evoke similarly low levels of response behaviour under all conditions.
The first explanation of an inadequate species signature is supported by findings in previous studies describing snorts as being rather unspecific and exhibiting the lowest level of individual distinctiveness compared to other common call types in southern white rhinoceroses, such as pants, hisses, and grunts [
45,
46]. It is therefore likely that their acoustic structure also possesses few to no species-related properties, as no difference in any of the response behaviour parameters between the snorts and the white noise control was observed. At the same time, the lack of a difference compared to the white noise control can also be interpreted as an indicator for the snorts generally lacking social or contextual information. These observations are consistent with previous descriptions of snorts in southern white rhinoceroses, which attribute either a hygienic function to this call type [
42] or one that occurs in non-social contexts such as during resting or feeding [
43,
44,
46]. Corresponding to this are the descriptions of snorts in black rhinoceroses (
Diceros bicornis), which, similar to the snorts in white rhinoceroses, do not seem to have a behavioural context and do not elicit any reactions from the conspecifics [
55]. Meanwhile, in the Indian rhinoceros, the snort call has been described as an initial contact call that is most often directed at adult males [
40]. While this observation suggests a clear communicative function for the snort, it should be noted that the description of the snort call type in Indian rhinoceros research does not necessarily correspond to the acoustic properties of the snorts described in other studies on the vocal repertoire in the Rhinocerotidae family: while Laurie [
40] characterises the snort in the Indian rhinoceroses as a “series of quick bursts through the lips and nostrils”, the snort call type in the white rhinoceros has rather been associated with single air blows through the nostrils [
42,
43]. Thus, while the terms may be the same, they may describe different vocalisations depending on the study or species, and direct comparisons should therefore be made with caution.
Overall, the non-existing differences among the responses to different snort stimuli suggest that snorts do not have a distinct communicative function. This conclusion is supported by comparative data from the pant playback experiments, where snorts consistently elicited significantly lower response intensities than any of the pant stimuli. Notably, reactions to snorts rarely exceeded the body turn toward the loudspeaker on the behavioural response scale. In addition, only one-third of the study animals approached the loudspeaker at all following the snort stimulus, and only one of them actually did so immediately after the playback (within the first 15 s). These rather low-level responses suggest that, although the stimuli were perceived and may have briefly attracted attention due to their novelty, they failed to sustain interest or provoke further engagement, such as orienting or approaching the sound source.
Importantly, the consistency of low response levels to snorts across both playback designs affirms the validity of the experimental approach. The low reactivity cannot be attributed to the design itself—such as the use of single-call playback or the low sound level—but rather to the intrinsic nature of the snort as a stimulus. In addition, across a total of 55 snort playback sessions, observable behavioural responses (a score of 1 and higher on the behavioural response scale) were recorded in the vast majority of trials. In only seven sessions, no visible reaction to the acoustic stimulus could be observed. Given the high consistency of visible responses in the remaining sessions, it is reasonable to assume that the stimuli were clearly audible and perceived by the study animals. Thus, the few instances of no response are more likely attributable to the nature of the stimulus rather than to issues with stimulus delivery. This is further supported by the pant playback trials, where the only two instances of no observable behavioural response occurred following a snort stimulus, again reinforcing the notion that snorts elicit notably less engagement in the receiver compared to pants.
Nevertheless, it cannot be ruled out that snorts have an internal emotional function, signalling at least a relaxed state in southern white rhinoceros. Studies in horses showed that snort production is not random but seems to be related to emotional positive contexts and occur more often when welfare conditions increased [
26,
27]. Thus, further studies are needed to investigate the link between the internal emotional state and snort production rate.
4.2. Effect of Group Type
Group composition was found to significantly affect responses in both playback experiments. Individuals tested alone displayed higher response intensity, increased vigilance in the form of locomotion, and a greater frequency of vocalisations following playback. One possible explanation is that solitary animals were more aroused by the unexpected presence of vocalisations in the absence of visible conspecifics. In contrast, animals tested in pairs were likely habituated to conspecific vocalisations due to the constant presence of others, both before and during playback, resulting in a more attenuated reaction.
Similar effects of social environment have also been described in playback studies in other species. For instance, in rats (
Rattus norvegicus), a higher heart rate, locomotor activity, and vocalisation rate after a 50 kHz appetitive stimulus compared to a 22 kHz aversive stimulus were particularly pronounced in solitarily housed animals [
56]. Compared to the paired-housed group, singly housed rats also tended to spend more time close to the loudspeaker.
It is important to note that the present analysis did not differentiate between adult females housed with their calves and study animals paired with other adult conspecifics during the experiments. Maternal status can have a considerable influence on the behavioural responses, as shown in a playback study in black bears [
57]: mother bears accompanied by their cubs advanced more frequently towards the sound source and emitted more grunts following a playback of a predatory competitor compared to solitary individuals. In a similar way, females with calves in the presented study might also have shown behavioural patterns distinct from non-maternal individuals, potentially reflected in differences in vigilance or call rate. However, for our dataset, the limited number of subjects in each subgroup did not allow for sufficient statistical power to assess such an effect reliably. Future research should consider maternal status and social pairing context more explicitly, as these factors may affect how individuals perceive and respond to social acoustic cues.
More broadly, these findings emphasise the importance of considering the social environment when interpreting playback responses. To date, much of the research on audience effects has focused on how the social environment shapes the signalling behaviour of the sender, e.g., [
58,
59], whereas comparatively few studies have addressed how social context and social cues modulate the responses of receivers. Investigating both perspectives is essential for a more comprehensive understanding of animal communication.
4.3. Effect of Sex
Sex also emerged as a significant predictor of response behaviour, but only in the pant playback experiment. Males exhibited greater activity levels than females, which correlates with the functional relevance of pant calls as mating or territorial signals. Since pants are typically associated with courting males approaching oestrous females [
41,
44,
45,
47,
50], the playback may have triggered heightened movement behaviour in study males, potentially in an attempt to locate or monitor the unseen sender. Notably, increased locomotion occurred only in solitary males, which further supports the interpretation of an alarmed or investigative response toward a potential rival, unfamiliar caller, or intruder. These sex-specific differences in behavioural responses are consistent with recent findings from a playback study conducted in wild white rhinoceroses in Botswana, where males were reported to approach the loudspeaker more frequently than females following the playback of pant vocalisations, suggesting a heightened sensitivity or motivation among males toward these calls [
60].
A sex-specific difference was also observed in call rate. However, in contrast to the other behavioural response parameters, it was the females that produced the specific pant vocalisation more frequently than males. This appears to contradict earlier results from Jenikejew et al. [
41], who reported a higher pant call rate in males. One possible explanation for this discrepancy may lie in the age of the vocalising individuals. In this present study, the most vocal females were subadults or relatively young adults (between five and 14 years), potentially exhibiting greater social motivation or general vocal reactivity. Another relevant factor might be the testing condition: all four of these particularly vocal females were tested alone during the playback trials. It is therefore plausible that the increased pant production reflected attempts to re-establish contact with familiar conspecifics, given that the pant is commonly used as a contact call [
42,
43].
Findings from a recent playback study by Cinková and Shrader [
61] suggest that oestrus can influence female response to pant vocalisations. They demonstrated that female white rhinoceroses that were not in oestrus exhibited limited vigilance and looking behaviour in response to the courtship-related variation in the pant call, known as the hic. These findings may help explain the comparatively low levels of movement and approaching activity observed among females in the present study. Although oestrus status was not systematically assessed for our study females, it could be confidently ruled out for approximately half of the females based on reproductive history, such as the presence of dependent calves or advanced age. Moreover, none of the females displayed behavioural indicators typically associated with oestrus, such as urine spraying or elevated tail posture [
62].
At the same time, the observation that females showed higher call rates but less bodily movement and exploratory behaviour in response to pant playbacks, compared to males, may also reflect underlying social dynamics characteristic of the species. In the wild, females typically form temporally stable associations with other females and subadults [
63,
64] and thus may be more inclined to maintain vocal contact when separated from group members. In contrast, males are predominantly solitary and engage in extensive movement to establish and defend territories [
65,
66,
67], a behavioural pattern that could manifest as greater locomotor activity during playback experiments. Taken together, these observations support the interpretation that the lack of strong approach responses among females may be linked to reproductive state as well as sex-specific social strategies. It is plausible that females in oestrus would show greater responsiveness to pant playbacks, potentially narrowing or reversing the observed sex-based differences in response behaviour. Future studies that explicitly account for oestrus status and social context would provide valuable insights into the modulatory role of reproductive condition and social organisation in shaping responses to vocal signals and help clarify the functional relevance of pant vocalisations in different behavioural contexts.
4.4. Effect of Androgen Level
No significant differences were found between behavioural responses to high T- and low T-pant stimuli. As a previous study showed that faecal testosterone levels were significantly higher in territorial white rhinoceros males than in nonterritorial ones [
68], high T-pants—presumably representing more territorial or competitive males—were expected to elicit stronger behavioural responses in both female and male recipients. In line with that, a previous playback study on white rhinoceroses revealed that males were able to distinguish territorial from subordinate males based on their pant calls and reacted with higher attentiveness towards territorial senders [
48]. Taken together, the two previous studies suggest that territorial and non-territorial white rhinoceros males differ in both their testosterone levels and the acoustic properties of their pant calls. In contrast, the results of the present study suggest that features potentially related to circulating testosterone levels, such as male quality or social dominance and thus, territoriality, were either not necessarily encoded in the acoustic properties of the pant stimuli or have simply not been perceived by the study animals. These contrasting results might be explained by the different experimental conditions: While both Rachlow et al. [
68] and Cinková & Shrader [
48] conducted their studies on free-ranging white rhinoceroses in natural settings of national parks or reserves in Southern Africa, the present study has been carried out under captive conditions.
In captive environments, male white rhinoceroses do not need to establish or defend territories, and consequently, cues related to territoriality may not be as distinctly encoded in their vocalisations compared to wild conspecifics. Furthermore, neither females nor males in the study population were likely to anticipate territorial intrusions, potentially reducing their attentiveness or responsiveness to any territoriality-related information embedded in pant calls.
A further consideration is the basis for stimulus selection. The high T- and low T-pant stimuli used in this study were classified solely based on the faecal testosterone metabolite (fTM) concentrations of the senders. On the one hand, it may be that the variation in fTM concentrations in the males available from the previous study (
n = 16, [
50]) was not broad enough to cover the range between actually low and high values. On the other hand, it remains unknown whether these hormonal profiles would reliably correspond to truly territorial versus non-territorial males under natural conditions. It is therefore possible that the androgen level differences between the selected stimuli were not sufficiently distinct to elicit clear behavioural discrimination among the receivers.
Future studies could provide deeper insights by employing playback stimuli from a larger sample size of wild individuals whose territorial status and androgen profiles are independently verified. Comparing responses to such stimuli could help clarify whether captive rhinoceroses retain sensitivity to social or territorial cues in pant vocalisations and whether such information is more salient when derived from naturally territorial males.