Social Reward Behaviour in Two Groups of European Grey Wolves (Canis lupus lupus)—A Case Study

Simple Summary Prosocial behaviour is shown by a variety of mammals. Prosocial behaviours can also be part of social feedback mechanisms. For group-living mammals, group coordination and group cohesion are crucial factors for survival. Therefore, group-living mammals display a broad variety of social behaviours towards their group members. Canidae are known to be socially organised mammals. Wolves are one of the most cooperative and social canine species. In this case study, we investigated social feedback mechanisms in two European grey wolf groups. The social feedback was observed after novel object interaction, bold behaviour, and the individual behaviours of single individuals. The purpose of this paper is to focus on prosocial behaviour that might serve as social reward. We assume that there is social reward behaviour as a category that falls within social feedback in grey wolves. Abstract Prosociality occurs in many species and is likely to be a crucial factor for the survival of group-living animals. Social feedback is an important mechanism for the coordination of group decisions. Since group-living animals with specific personality axes, i.e., boldness, are known to provide certain benefits for their group, bold actions might receive more prosocial feedback than other actions. Our case study aims to determine whether bold behaviour, i.e., novel object interaction (Nobj), might be answered more frequently with prosocial behaviours. We investigated the differences in the frequency of occurrence in prosocial behaviours after three different individual actions in two groups of grey wolves. We aim to outline the development of a social reward behavioural category as part of social feedback mechanisms. We used Markov chain models for probability analyses, and a non-parametric ANOVA to test for differences between the influences of individual behaviours on the probability of a prosocial behaviour chain. We additionally tested for the potential influences of age, sex and personality on the frequency of Nobj. Our results suggest that bold interactions are more often responded to with prosocial behaviour. Bold behaviour might be more often socially rewarded because of its benefits for group-living animals. More research is needed to investigate whether bold behaviour is more frequently responded to prosocially, and to investigate the social reward phenomenon.


Introduction
Personality traits, which are consistent inter-individual differences in behaviour within populations, are a widespread phenomenon across various animal species [1,2]. Studies have shown that mammals [3], birds [4], fish [5,6], arthropods [7][8][9][10][11], amphibians [12] and cephalopods [13] exhibit consistent individual differences in behavioural traits, which are expressed in specific situations [14]. Extraversion [15,16] and boldness [16][17][18][19][20] are two of the most well-known phenomena in animal personality research. Boldness describes a consistent difference between individuals in their response to perceived risk [1]. Boldness is considered to be part of a major 'proactive-reactive' axis of personality variation, where boldness is one of a suite of behaviours which includes exploration, activity and aggression, which correlate positively with each other [12,21]. When presented with a startling stimulus or a novel object, individuals may differ consistently in their responses over repeated observations, a behavioural complex that might influence species dispersal as well as the approach to obtaining food in a task involving a novel object. Bolder individuals might have greater food intake when foraging [22], or take more risks to acquire food, investigate novel objects, or explore new environments. Previous research has shown that bolder individuals may be more likely to lead [23][24][25], whereas shy individuals may be more likely to group [26] and to respond to the decisions of their bolder conspecifics [27][28][29]. As bolder individuals are more prone to be neophilic and to take risks, they potentially are beneficial for the rest of their group, as they might serve as foragers, investigators, guardians or warners, since each individual profits from such division of labour.
Across taxa, individuals show the cognitive ability to use their observations of the social interactions of others to inform their own behaviours, including several primate species [30,31], ravens [32], hyenas [33] and fish [34,35]. While it has been shown that individuals observe each other and react to these observations, we do not currently understand how individuals integrate this information on the outcomes of their own interactions with observations of others' interactions, on their future actions. Social feedback, the tendency of a group to answer prosocially or agonistically to another group member's individual behaviour, is an important mechanism for the coordination of group decisions. We suggest that there might be a special type of social feedback: social reward. We define social reward as prosocial behaviours that are shown in direct response to a desirable and group-beneficial behaviour, i.e., bold behaviour, and aim to encourage the behaviour. This reward mechanism could be partly intended in order to increase the frequency of the occurrence of the displayed behaviour, or to encourage other group members to engage in the behaviour. Bold behaviours may be risky for individuals, but they may also prove to be beneficial to the group, and might therefore be supported by social feedback. In distinction to prosocial behaviour, which in addition to confirmation, has various other functions such as reproduction, rank verification, group cohesion, togetherness etc., we define social reward behaviour as a behavioural category that exclusively serves positive social feedback and is thus part of the social feedback mechanism. Social reward, as prosocial behaviours sent by the group towards an individual of the group to give feedback on a previous action, could be a special tool to encourage the actions of a group member that are in the interest of the group but that cannot be carried out by other group members, such as actions that require boldness. Social feedback or social reward might be useful within groups to engage in group-benefiting actions.
Emotional contagion, the transmission of emotional states from one animal to another, might also be involved in social feedback processes and group decision-making. Most studies on emotional contagion in non-human animals have focused on the transmission of negative emotional states, e.g., fear, but observations on farm animals suggest that animals may transmit both negative and positive emotions: respectively, pleasant and unpleasant emotions such as joy or fear [36,37]. Pigs, for example, responded differently to being reunited with group mates who had experienced either a negative or a positive treatment. They exhibited decreased activity and exploration when group mates received a negative treatment, and increased social contact and exploration when group mates received a positive treatment [37]. Since bolder animals, such as fast-bold explorers, show a reduced stress-induced glucocorticoid release compared to slow-shy [38], prosocial acts towards bold-behaving group members could also be influenced by mechanisms of emotional contagion.
Here, we take a look at the relationship between bold actions and social feedback mechanisms in two groups of European grey wolves (Canis lupus lupus), to address the question of how bold behaviours can influence the frequency of prosocial interactions towards the bold individual. We observed two groups of European grey wolves in two Animals 2023, 13, 872 3 of 17 different zoological facilities in Germany. The observations were part of a large-scale project on cooperative behaviour in canids, which also included Arctic wolves, Hudson Bay wolves and Timber wolves, as well as various other canids.

Study Animals
Our case study included two different wolf groups, with 5 individuals per group, at Zoo Wingst and Schwarze Berge Wildlife Park (see Table 1).

Novel Object
An apparatus of 1.80 m × 0.60 m × 1.20 m was installed in front of the enclosure (see Figure 1). The purpose of the apparatus was to study foraging cooperation in Canidae; therefore, it was equipped with food tubes, ropes for pulling and food flaps that were connected to the ropes (see Supplementary Materials Figures S1 and S2). The European grey wolf groups did not use the apparatus communally, and no cooperative food procurement was shown.
question of how bold behaviours can influence the frequency of prosocial interactions towards the bold individual. We observed two groups of European grey wolves in two different zoological facilities in Germany. The observations were part of a large-scale project on cooperative behaviour in canids, which also included Arctic wolves, Hudson Bay wolves and Timber wolves, as well as various other canids.

Study Animals
Our case study included two different wolf groups, with 5 individuals per group, at Zoo Wingst and Schwarze Berge Wildlife Park (see Table 1).

Novel Object
An apparatus of 1.80 m × 0.60 m × 1.20 m was installed in front of the enclosure (see Figure 1). The purpose of the apparatus was to study foraging cooperation in Canidae; therefore, it was equipped with food tubes, ropes for pulling and food flaps that were connected to the ropes (see Supplementary Materials Figures S1 and S2). The European grey wolf groups did not use the apparatus communally, and no cooperative food procurement was shown.

Observational Methodology
Behavioural observations were made in total for 192 h from June to August 2020, and from May to June 2021, via means of a rare event ("all-occurrence") sampling [39], which led to 242 behavioural observations and the record of 140 behaviours of interest in total. We used the ethogram of Goodmann et al. (2002) [40] to place the behaviours that we observed to a behavioural category (see Table 2). For a detailed version of the ethogram,

Observational Methodology
Behavioural observations were made in total for 192 h from June to August 2020, and from May to June 2021, via means of a rare event ("all-occurrence") sampling [39], which led to 242 behavioural observations and the record of 140 behaviours of interest in total. We used the ethogram of Goodmann et al. (2002) [40] to place the behaviours that we observed to a behavioural category (see Table 2). For a detailed version of the ethogram, see [40]. The behaviours of interest are categorised under submissive behaviour, care-giving behaviour, locomotion/exploratory behaviour and feeding (see Figure 2).

Definitions of Behaviours of Interest
We define behaviours of interest, referring to those behaviours as initial behaviours (see Table 3) that have led to a social reward behaviour chain (see Table 4). The social reward behaviour chain was recorded as such if a subsequent element directly (<15 s) followed the previous behaviour. Table 3. Definitions of initial behaviours.

Initial Behaviours Description of Observed Behaviours
Novel object interaction (Nobj) Approaching the novel object (apparatus) up to maximum 1 m distance, and sniffing on the rope or interacting with it by trying to pull the rope/pulling the rope Forage Walking around the enclosure, searching for leftovers of food or dead animals, eating, or hiding food Run Running around individually in the enclosure from one area to another Table 4. Definitions of the social reward behaviour states in a behavioural chain.

State Description of Observed Behaviours
Group formation (GrpFrmt) Individuals grouping together, observing the individual performing an "initial behaviour" Greeting (Greet) Tail wagging, body rubbing and face/body sniffing of group formation towards "active performer" Submissive behaviour (LickSn) Snout licking, lowered body posture, whimpering, lying on back/exposing belly towards the individual performing the "initial behaviour" Combined submissive behaviours (C.SubmBhv) Body rubbing, ears back, opening group formation and including the individual performing the "initial behaviour" to the group Leaving together (LeaveTgh) Leaving proximity of "initial behaviour performance location" as a united group Initial behaviours commonly led to a complete or partially performed behavioural chain, which included the behavioural states listed in Table 4. Behavioural states that were performed with the same frequency and that fell under the same behavioural category in our ethogram (see Figure 2), were classified as a combined state.

Markov Chain Modelling
Markov chains quantify the dependence of an event on preceding events [41,42] or an on initial state. There are several degrees of dependence. If sequencing events are independent, they are described by a zero-order Markov chain. In the case where an event depends only on the immediately preceding one, it fits a first-order Markov chain. If an event depends on the two most preceding events, it is a second-order Markov chain, and so on. We decided to assess the difference in transition from one event to another, and the probability to reach the succeeding state of the behavioural chain, depending on the initial state of the observed sequence. To simplify the analytical design, we concentrated only on a first-order Markov chain model. Transition probabilities (from initial to preceding to succeeding behaviours) were determined in all three chains using: where i is the initial behaviour, j is the succeeding behaviour (i and j range from 1 to 6, because there are six behavioural states in the chain), aij is the number of transitions observed from behaviour i to j, and pij is the transition probability from i to j in the Markov chain. Since the initial state influenced the succeeding state, we used an initial state vector of [0, 1]. We then calculated the steady-state vector (v) for our transition matrix. To determine the state of the system after one step (A1), the following applies: A1 = P − A0. P is a transfer matrix; it can be used to describe how a system changes over time from an initial-state vector A0. To determine the state of the system after one step (A1), the following applies: Animals 2023, 13, 872 7 of 17 A1 = P − A0. Thus, to calculate any step t + 1, it only needs the transfer matrix P and the vector in the previous step At + 1. To be exact: At + 1 = P − At. Using Python version 2022.2.4 (PyCharm Community Edition), we have calculated any number of steps by multiplying the vector of the current step by the transfer matrix P to reach the next step. At some point of P n , the vector no longer changes with the steps, and thus remains the same after each step, and so v = P − v is true.

Statistical Evaluation of Markov Chain Probabilities
The results of the Markov chains were used to determine whether there were differences in the influence of the initial state on the success of the behavioural chain. We used a non-parametric ANOVA, the Kruskal-Wallis test, to determine whether differences between initial states existed. The influence on the initial state of a successful behavioural chain (reaching the succeeding state) were calculated using the R version 4.2.2. (R Development Core Team, R Foundation for Statistical Computing, Vienna, Austria), with the "ggpubr" package for Windows. Normality was assessed using the Kolmogorov-Smirnov test. The relationship between the initial state and the success score was estimated using the Kruskal-Wallis test. To compare variables with significant inter-group variability, the Wilcoxon rank-sum was used for data with non-normal normal distribution, and the Dunn's test for pairwise comparisons to test for significant means. The returned p-values were adjusted using Bonferroni corrections for multiple comparisons. The results were presented as a median. The level of statistical significance was preset to p < 0.05.

Influence Factors on the Frequency of Novel Object Interaction
Factors such as age, sex and personality traits are known to influence behaviour. Therefore, we tested the influences of age, sex and the personality traits of the individuals on the frequency of novel object interaction.
To determine whether there was a statistically significant mean difference in the frequency of novel object interaction between female and male European grey wolves, we ran an independent samples t-test using package "ggpubr" in R. To estimate the relationship between the age of the individuals and the frequency of novel object interaction, a Wilcoxon rank sum test was performed using the package "coin". We estimated the data of 5 female and 5 male European grey wolves from 2 different groups. For age differences in novel object interaction, we compared the age data of the same groups, creating two different age groups (mean 2.5 | 9).
In order to determine the influence of personality factor on the frequency of novel object interaction, the DOGS questionnaire and the Monash Canine Personality Questionnaire-Revised (MCPQ-R) were given to the animal keepers of each group. We adapted the DOGS questionnaire [43] and the MCPQ-R for canids living in zoological institutions, and modified the questions accordingly (see Supplementary Materials). The DOGS personality questionnaire [43] includes the factors of trainability, sociability, extraversion and calmness. The MCPQ-R [44] contains the factors extraversion, self-assuredness/motivation, training focus, amicability and neuroticism. Since the personality factors of the questionnaires are similar, we grouped the factors extraversion/extraversion and sociability/amicability into personality dimensions. Since the trainability factor contains items of playfulness and intelligence, and is comparable to the Big Five dimension of Openness [45], we split this factor into two factors: Curiosity/Openness and Playfulness. Curiosity refers to an interest in new things, perceptiveness, learnability and cognitive flexibility. Playfulness refers to social play with conspecifics, playful interest in people and object-related play (see Table 5). Table 5. Questions and items related to personality factors curiosity and playfulness in [43,44].

Factor Questions and Items to Determine Factor Values
Curiosity/Openness "is inventive and resourceful when it comes to finding or reaching hidden food or toys" "does not have many interests apart from eating and sleeping, e.g., is unexplorative, uncurious, show limited interest in new objects, humans or animals" "has a good grasp of things and learns quickly" "attentive" "intelligent" "clever" Playfulness "is enthusiastic and encourages his peers to play" "is easy to get excited about new play ideas" "often does not understand what is being asked of him in play situations" We also calculated a David's Score (DS) [46] as an alternative method to access dominance rank. DS is a type of cardinal rank (dominance rank is an ordinal type of ranking) that is calculated from an individual's proportion of wins and losses in relation to the wins and losses of its opponents; ranging from −3 to 3 for triads, where −3 represents a maximum proportion of losses and 3 represents a maximum proportion of wins [47]. The DS was combined with the personality traits as a possible influencing factor on the frequency of novel object interaction. As per Ley et al. [44] and Turcsán et al. [43], the raw scores for each adjective within each personality factor subscale were summed and divided by the maximum score possible for the subscale. The result was converted to a percentage, thereby creating a percentage score for each of the five personality factors for every individual. To evaluate the influence of the personality factors and DS on the frequency of novel object interaction, a correlation matrix was created, and a multiple correlation analysis was performed. The multiple correlation analysis was carried out using the package "Hmsic". A linear regression analysis was then performed using the packages "dplyr", "broom" and ggpubr".

Observations
Despite the animals' neophobia and object-related fear, four individual group members showed repeated approaches and exploratory behaviours towards the novel object (max. 16, min. 1). Some group members-two in one group, three, partially, in anotherindependently approached the novel object directly and interacted with it by sniffing on it or pulling the rope located in the enclosure. There was only one animal at a time approaching the apparatus while the others kept distance, with a maximum of 14 interactions of a single individual in one group, and 16 interactions of another individual in the other group. After pulling, the animals immediately left the vicinity of the novel object and returned to their group. When returning to the group, the animals that interacted directly with the apparatus were welcomed by the group, while the animals that were in the close-up range as observers were not greeting, but they contributed to group formation. The "interactors'" muzzles were licked, they were greeted with a tail wag, the muzzle was licked by every other group member, and submissive behaviour was shown, after which the group retreated in a body. This behavioural chain was shown after each direct approach to the novel object, and was uniform in sequence in both wolf groups. This behavioural chain reinforced and increased the approach behaviour to the novel object, but did not reduce object-related fear sufficiently enough for observations of foraging cooperation. In one group, these behaviours seemed to result in another animal occasionally approaching the apparatus independently, which was then greeted in the same way, but were ultimately prevented from doing so by the "first interactor". Behavioural chain: Initial (A0) → group formation (A1) → greet. and sub. gest. (A2, A3, A4, A5) → group leaving together (A6).

Markov Probabilities of Initial Behaviours on the Succeeding Behaviours
When the behavioural chain started with the initial state Nobj (Novel object interaction), the probability to reach the succeeding state was 56% after seven steps, which was the same as after 24 steps (vector [0.56, 0.44]). Starting with individual behaviours, the calculated probability of reaching the succeeding state was 08% for Indbhv1 and 01% for Indbhv2 after seven steps (see Figure 3), and remaining steady after 8 and 24 steps, respectively. Success (A6) was defined as the completion of the behavioural chain, while failure (fail) was defined as the probability of the termination or non-completion of the behavioural chain; for example, if the chain stops at an any state before (A6). The relationship between the initial state and the success score was 0.63 for Nobj (Novel object interaction), 0.264 for Indbhv1 (run), and 0.266 for Indbhv2 (forage) (see Table 6).
ject, but did not reduce object-related fear sufficiently enough for observations of foraging cooperation. In one group, these behaviours seemed to result in another animal occasionally approaching the apparatus independently, which was then greeted in the same way, but were ultimately prevented from doing so by the "first interactor".

Markov Probabilities of Initial Behaviours on the Succeeding Behaviours
When the behavioural chain started with the initial state Nobj (Novel object interaction), the probability to reach the succeeding state was 56% after seven steps, which was the same as after 24 steps (vector [0.56, 0.44]). Starting with individual behaviours, the calculated probability of reaching the succeeding state was 08% for Indbhv1 and 01% for Indbhv2 after seven steps (see Figure 3), and remaining steady after 8 and 24 steps, respectively. Success (A6) was defined as the completion of the behavioural chain, while failure (fail) was defined as the probability of the termination or non-completion of the behavioural chain; for example, if the chain stops at an any state before (A6). The relationship between the initial state and the success score was 0.63 for Nobj (Novel object interaction), 0.264 for Indbhv1 (run), and 0.266 for Indbhv2 (forage) (see Table 6).
Since the data consist of a behavioural chain where states can merge into each other, and since states can be omitted, but succeeding states cannot merge into preceding states, it is represented as a probability chain and not a probability matrix.   Table 4. Initial behaviours are defined in Table 3.
Since the data consist of a behavioural chain where states can merge into each other, and since states can be omitted, but succeeding states cannot merge into preceding states, it is represented as a probability chain and not a probability matrix.
The difference between the initial states were only statistically significant for Nobj and Indhbv2; the statistical significance was p = 0.015 before and p = 0.04 after Bonferroni correction; for Indbhv1 and Indbhv2 = p = 1, for Nobj and Indbhv1 = p = 0.20 after Bonferroni corrections (see Figure 4).  Table 4. Initial behaviours are defined in Table 3.

Influence of Sex, Age and Personality Traits on the Frequency of Novel Object Interaction
The results of the relationship between sex and novel object interaction were (0.75 ± 0.957) for female wolves compared to the male group (6.60 ± 7.797). There was no statistically relevant difference between the frequency of novel object interaction in male and female wolves (t = 1.04, df = 8, p = 0.18) (see Figure 5). star symbol indicates the statistically significant values, the dot symbol indicates the outliers.

Influence of Sex, Age and Personality Traits on the Frequency of Novel Object Interaction
The results of the relationship between sex and novel object interaction were (0.75 ± 0.957) for female wolves compared to the male group (6.60 ± 7.797). There was no statistically relevant difference between the frequency of novel object interaction in male and female wolves (t = 1.04, df = 8, p = 0.18) (see Figure 5). For age differences in novel object interaction, we compared the age data of the same groups, creating two different age groups (mean 2.5 | 9). We found no effect of age on frequency of novel object interaction (W = 11, Z = 1.22, p = 0.33) (see Figure 6).  For age differences in novel object interaction, we compared the age data of the same groups, creating two different age groups (mean 2.5 | 9). We found no effect of age on frequency of novel object interaction (W = 11, Z = 1.22, p = 0.33) (see Figure 6). star symbol indicates the statistically significant values, the dot symbol indicates the outliers.

Influence of Sex, Age and Personality Traits on the Frequency of Novel Object Interaction
The results of the relationship between sex and novel object interaction were (0.75 ± 0.957) for female wolves compared to the male group (6.60 ± 7.797). There was no statistically relevant difference between the frequency of novel object interaction in male and female wolves (t = 1.04, df = 8, p = 0.18) (see Figure 5). For age differences in novel object interaction, we compared the age data of the same groups, creating two different age groups (mean 2.5 | 9). We found no effect of age on frequency of novel object interaction (W = 11, Z = 1.22, p = 0.33) (see Figure 6).  Personality factor means were relatively low with high standard derivation, suggesting that wolves may generally have lower scores in behavioural traits such as playfulness or motivation, or they may score lower in these categories (see Table 7). In particular, the scores for playfulness can be explained, as wolves do not exhibit paedomorphism. We performed a multiple t-test to test for the influence of sex and age on each personality dimension, no statistically relevant results were found. All personality traits showed a positive correlation with the frequency of novel object interaction, but this effect was only statistically significant for the factor "curiosity/openness" (CRS) (r = 0.73, p = 0.02). All other dimensions or factors showed a positive trend, which was marginal. The other two factors with a higher correlation to frequency of novel object interaction (NOBJ) were "playfulness" (PLAY) (r = 0.52, p = 0.15) and "motivation" (MOTV) (r = 0.54, p = 0.12). The lowest correlation was found in David's Score, the "Dominance Score" (DS) (r = 0.25, p = 0.51) (see Figure 7a).

Curiosity
42.77 ± 18.72 Sociability 40.75 ± 14.38 All personality traits showed a positive correlation with the frequency of novel objec interaction, but this effect was only statistically significant for the factor "curiosity/open ness" (CRS) (r = 0.73, p = 0.02). All other dimensions or factors showed a positive trend which was marginal. The other two factors with a higher correlation to frequency of nove object interaction (NOBJ) were "playfulness" (PLAY) (r = 0.52, p = 0.15) and "motivation" (MOTV) (r = 0.54, p = 0.12). The lowest correlation was found in David's Score, the "Dom inance Score" (DS) (r = 0.25, p = 0.51) (see Figure 7a).
Due to the statistically significant positive correlation between "curiosity" and the frequency of novel object interaction (Nobj), we also conducted a linear regression analysis (see Figure 8). The results show a moderate correlation between the personality factor "curiosity" and the frequency of novel object interaction (R 2 = 0.48, df = 8, p = 0.02).

Discussion
The application of transition matrix analyses to the study of behaviour provided more information than standard techniques would have; therefore, it proved to be useful as a tool for the probability analysis of behavioural chains. As animals are not infinite-state automata, Markov analyses might have limitations. We have to take into account that with a transition matrix, where every state can change into any other state of the matrix, different probabilities would probably have been obtained than with our behaviour chain, in which transitions only run in one direction. In addition, we excluded other possible initial behaviours that never led to completion: for example, success, of the behavioural chain, in order not to potentially drive up the significance level of the results. As we started collecting the data based on the observed chain of behaviour, a bias can be assumed. Behavioural states are generally difficult to sample adequately in the field without observer bias; therefore, we categorised the observed behaviours to established categories in the reliable [40] ethogram to avoid possible miscategorisation or strong assumptions about the functions of the observed behaviours. Since many of the behaviours we observed are part of the [40] ethogram, we consider categorisation errors and interpretation errors to be, for the most part, unlikely. Our results are mainly not statistically significant, which may be due to the sample size, as well as the excluded data. Nevertheless, our results show that the influence of bold behaviour, such as an interaction with a novel object, more often led to a behavioural chain of prosocial behaviours, whereas other individual behaviours that are also attention-grabbing, such as running around or foraging, did not regularly lead to the success of such a behavioural chain. Despite the low statistical power of the results, we would like to point out the relevance of the findings, as social reward is an unmentioned concept so far, but our results provide first indications that this behavioural category might exist as a subcategory of social feedback. Wolves are known to be one of the most cooperative canine species. Likely, the cooperative propensity is derived from the fact that each individual needs its other group members for survival. The group functions as a unit in which each individual collaborates in territory defence, hunting and the rearing of offspring [48]. One example for bold behaviour benefiting the group is cooperation during intergroup conflicts. Individuals actively dealing with conflicts for the benefit of the group, regardless of possible disadvantages for themselves, provide an example of a cooperative behaviour that is costly to participants, because it involves a considerable expenditure of energy and risk of injury, and that often results in benefits to both

Discussion
The application of transition matrix analyses to the study of behaviour provided more information than standard techniques would have; therefore, it proved to be useful as a tool for the probability analysis of behavioural chains. As animals are not infinite-state automata, Markov analyses might have limitations. We have to take into account that with a transition matrix, where every state can change into any other state of the matrix, different probabilities would probably have been obtained than with our behaviour chain, in which transitions only run in one direction. In addition, we excluded other possible initial behaviours that never led to completion: for example, success, of the behavioural chain, in order not to potentially drive up the significance level of the results. As we started collecting the data based on the observed chain of behaviour, a bias can be assumed. Behavioural states are generally difficult to sample adequately in the field without observer bias; therefore, we categorised the observed behaviours to established categories in the reliable [40] ethogram to avoid possible miscategorisation or strong assumptions about the functions of the observed behaviours. Since many of the behaviours we observed are part of the [40] ethogram, we consider categorisation errors and interpretation errors to be, for the most part, unlikely. Our results are mainly not statistically significant, which may be due to the sample size, as well as the excluded data. Nevertheless, our results show that the influence of bold behaviour, such as an interaction with a novel object, more often led to a behavioural chain of prosocial behaviours, whereas other individual behaviours that are also attention-grabbing, such as running around or foraging, did not regularly lead to the success of such a behavioural chain. Despite the low statistical power of the results, we would like to point out the relevance of the findings, as social reward is an unmentioned concept so far, but our results provide first indications that this behavioural category might exist as a subcategory of social feedback. Wolves are known to be one of the most cooperative canine species. Likely, the cooperative propensity is derived from the fact that each individual needs its other group members for survival. The group functions as a unit in which each individual collaborates in territory defence, hunting and the rearing of offspring [48]. One example for bold behaviour benefiting the group is cooperation during intergroup conflicts. Individuals actively dealing with conflicts for the benefit of the group, regardless of possible disadvantages for themselves, provide an example of a cooperative behaviour that is costly to participants, because it involves a considerable expenditure of energy and risk of injury, and that often results in benefits to both the cooperating and noncooperating group members, in terms of increased access to contested resources [49]. Individuals with more affiliative partners are more likely to act boldly in social contexts [50]. It could be assumed that not only the already existing number of social affiliations, but also the expected feedback in terms of social prestige might play a role in decision-making and action processes. Expected social prestige could promote risky, bold behaviour. In addition, pleasant emotional states might result from successful risk taking, e.g., caused by a dopaminergic reward response. The likeliness of the social reward-chain after novel object interaction could also be partly explained by social support, which plays a role when group members are exposed to stress [51]. Social support, which can express through seeking contact or rapprochement, as well as affiliative behaviour, can have a stress-reducing effect [52,53]. Therefore, calming mechanisms may also have played a role as an answer to the novel object interaction, given that such was potentially stressful for generally neophobic animals such as wolves. However, due to the lower stress response in risky or novel situations that bold individuals generally show [38], emotional contagion may also have played a role, in that group members who felt stressed by the sight of the new object, but then perceived the approach of a bold group member, felt less stressed and thus also showed a positive response to the approach. This could also have been a factor in the social reward response. Thus, a combination of different social behaviours and mechanisms may well play a role; both the reinforcement of bold behaviour and the reduction in stress on the individual and group levels.
Yet, it is known that boldness has several advantages on the group-level [49,54], but also some on the individual level, including that bold individuals often have a comparatively high status in the group, as aggressiveness, exploratory behaviour and boldness are positively correlated across individuals [16,54,55]. In our results, extraversion and dominance score were positively correlated. Since extraversion is associated with boldness, as the shyness-boldness continuum shares traits with the extraversion-introversion axis [56,57], it is not surprising that individuals that scored higher in extraversion also had the higher status within their group, such as the highest success in conflicts. This also supports the hypothesis that boldness is positively related to status. Although boldness is generally age-dependent-as has been shown in dogs (Canis lupus familiaris) [58]-as well as sex-dependent [58][59][60][61], our results did not show a significant influence of age or sex on bold behaviour, i.e., the frequency of novel object interaction. Possible explanations are that wolves might be less variable in their personality or behavioural traits, as with dingoes [62], but the results are likely also influenced by the small sample size, as well as the large age gaps between the wolves.
It is difficult to observe prosocial behaviours that follow bold individual behaviours in observations of wild animals, or even in observations of the social behaviours of captive animals, without a special experimental set-up. Therefore, it is likely that social reward is more frequently exhibited in wolves, as well as mammals in general, but it may be categorised as prosocial individual behaviours. It is likely that social reward does not only occur as a result of bold actions, but refers to diverse behaviours that can be beneficial to the group, such as cooperative behaviours. To better understand animal behaviour, it is important to be able to identify it as accurately and as precisely as possible. Therefore, we suggest that the phenomenon of social feedback should be investigated in more detail, and potential subcategories such as social reward should be explored. If we can identify reward mechanisms in group processes, this may prove helpful in assessing the animals' individual and group-oriented decisions, their social ecology, and possibly also the development of group dynamics. This can be useful, for example, in assessing which animal might take which future rank in captive animal groups, to detect conflicts between captive group-living animals at an early stage, or to contribute to the understanding of the processes and the adaptive mechanisms of wild animals. By identifying behaviour that is worth rewarding, it might even be possible to draw conclusions about evolutionary drivers.
As social reward and social-reward chains are not an established concept, we suggest that this potential phenomenon should be addressed, and that social reward is worth considering as part of prosociality. Due to our small sample size, more research is needed, with larger group sizes and in wider contexts, to investigate social reward. However, we hypothesise that social reward mechanisms exist among animals, or wolves, and that they can contribute to behavioural reinforcement, particularly in the area of group decisionmaking and the benefits of individual actions to groups or populations.
As our findings are part of a case study, the data are limited. Until more research is conducted, our results should be treated as tentative.

Conclusions
Our findings suggest that there may be specific behavioural chains as part of social feedback in grey wolves. The prosocial behaviours of those behavioural chains could have reward functions, which is potentially due to the benefits of an individual's bold actions for the group. Our results may be interpreted to indicate that bold behaviour in social mammals, such as wolves, is responded to with social reward and prosocial behaviour, which is potentially via reason of the benefits that individual bold actions can implicate for the group.
Due to the small number of groups and the potential bias in the observation, our results should be interpreted with caution. Further research is needed to investigate social feedback mechanisms in wolves and other social animals.
Supplementary Materials: The following supporting information can be downloaded at: https:// www.mdpi.com/article/10.3390/ani13050872/s1, Figure S1: Technical drawing of the apparatus and its function; Figure S2: Photo of the apparatus in use on another wolf group.  Institutional Review Board Statement: Ethical review and approval were waived for this study due to not involving manipulation, medical testing, laboratory animals or practices that could affect or harm animals. As we only conducted behavioural observations in zoological institutions without animal contact, there is no requirement for ethical review and approval.
Informed Consent Statement: Informed consent was obtained from all subjects involved in the study.

Data Availability Statement:
The data obtained during the case study are available upon request.