1. Introduction
Animal-assisted interventions (AAIs), or animal-assisted services (AAS) to use new uniform terminology at the international level [
1], is the umbrella term for structured human–animal interactions with a therapeutic, rehabilitative, educational, and recreational purpose, that involves the engagement of domestic animals to improve the health and well-being of people with various physical, neuromotor, mental, and psychological disorders [
2]. These kinds of activities can even involve healthy people to improve their social or relational skills, their knowledge or awareness around a topic, or to stimulate participation and motivation [
2]. AAIs can be divided, depending upon who is delivering the service, the intended outcomes, and how it is delivered, into three categories:
Animal-assisted therapy (AAT) [
2] or animal-assisted treatment AATx [
1]: A therapeutic intervention/treatment aimed at addressing and managing disorders affecting physical, neuropsychomotor, cognitive, emotional, and relational domains, intended for individuals with physical, psychological, sensory, or multiple conditions. These kinds of interventions are usually intended for an individual or a very small group and are tailored to the users. According to the Italian regulatory framework, AAT involves the designation of two professional figures: a medical specialist or psychologist–psychotherapist, responsible for defining the project objectives, and a qualified healthcare professional, responsible for managing the patient during the sessions.
Animal-assisted education (AAE) [
2]: An education-oriented intervention designed to promote, activate, and support individuals’ internal resources and growth potential, fostering their capacity for personal development, relational engagement, and social inclusion. This approach can be delivered individually or in group settings and aims to enhance overall well-being within the person’s living environment, especially in institutional contexts where adaptive skills are required. AAEs also facilitate behavioural re-education processes, contributing to improved quality of life and strengthening self-esteem in participants. AAE interventions are applicable in a wide range of contexts.
Animal-assisted activity (AAA) [
2] or animal-assisted support programmes (AASP) [
1]: Interventions with recreational, leisure, and socialization purposes, designed to enhance quality of life and foster appropriate human–animal interactions. These activities do not include competitive or sporting engagements involving animals. In AAA, the interaction with the animal serves as a source of knowledge, sensory stimulation, and emotional engagement. Activities may be conducted individually or in groups and aim to promote within the community the value of the human–animal relationship as a means of achieving mutual well-being.
In Italy, the regulatory framework for AAIs is relatively stringent, as national guidelines were developed in 2015 in collaboration with the Ministry of Health [
2]. Furthermore, a National Reference Center for AAIs was established to standardize operational protocols. Regarding animals involved in these interventions, the guidelines describe the required behavioural and health conditions, and state that each animal must be certified and approved by a veterinarian expert in AAIs (i.e., a veterinarian with specific training as defined in the guidelines themselves). The suitability of each animal for a specific project is assessed based on the species as well as on individual characteristics.
The guidelines emphasize strict health requirements; behavioural suitability is equally important. Animals must be free of behavioural disorders and demonstrate sociability, adaptability, the ability to spontaneously, or under guidance, interact with humans and other animals, and they must be docile. They may also undergo targeted training to develop skills for social engagement, cooperation with the handler, and motivation for activities [
3,
4]. Animals with recent histories of abandonment or mistreatment, including those from shelters or kennels, are generally excluded, unless they have successfully completed a rehabilitation and socialization programme supervised by a qualified behaviourist. Only adult animals are permitted, and females cannot participate during oestrus, lactation, or advanced pregnancy [
3]. All health and behavioural assessments, as well as monitoring records, are carefully documented in the clinical record of each animal.
The popularity of AAIs in various settings is increasing due to their impact as an innovative tool that is able to enhance the health and well-being of different kind of users (e.g. children, elderly people, prisoners, people with disabilities).
However, due to the heterogeneity of settings [
5] and to the logistics of AAIs per se, these interventions might provoke stress in the dogs involved [
4]. In fact, researchers and AAI practitioners generally agree that animal welfare issues related to AAIs require special attention and official regulations. As a result, the International Association of Human–Animal Interaction Organizations IAHAIO published new guidelines to reduce work-related stress during AAIs (IAHAIO White Paper 2014, updated 2018) [
6]. From a One Welfare perspective, these activities should not only improve the health of users but also safeguard the well-being of participating dogs and their handlers and, at best, be beneficial for the animals involved. Developing a multiparametric approach to monitor animal welfare and well-being during AAIs is, therefore, crucial to maintain the One Welfare concept and its prescriptions [
7].
One of the best ways to prevent stress in dogs during AAIs is to choose the right and most suitable individuals for this kind of intervention. In the scientific literature, a few protocols have been developed to assess the suitability of dogs for AAIs [
8].
Mongillo et al. (2015) [
9] validated a two-step procedure, consisting of a behavioural examination performed by a veterinary behaviourist, followed by a role-playing simulation of an AAI session, in which dogs were exposed to unexpected social and environmental stimuli. This test represented an important step toward scientific evaluation and standardization; however, given its 10 min duration and the rapid succession of stimuli, the test may not adequately reflect the complexity and real-world conditions of actual AAI environments.
The Italian National Institute of Health proposed a set of forms and guidelines [
3] for veterinarians certifying animals to be involved in AAIs. These forms are a valid method to focus on health status and broad behavioural suitability, ensuring that animals are sociable, docile, and free from behavioural disorders. However, the proposed checklists remain rather general and qualitative, offering limited detail on how to evaluate dogs’ responses to specific stimuli that may occur during interventions, and they rely heavily on the examiner’s interpretation. In fact, this scoring method may not show small differences among suitable dogs.
The Ethotest by Lucidi et al. [
10] was designed to assess suitability for AAA/AAT in a three-step protocol for shelter dogs. While innovative in opening opportunities for sheltered dogs with an unknown past, it is based on a rigid scoring system. In practice, some dogs excluded by the Ethotest were later assessed as suitable according to the Delta Society (now Pet Partners) tests, highlighting the potential risks of false negatives when applying strict criteria without a margin of interpretation.
Finally, the PADA (Personality Assessment of Domestic Animals, 2023) [
11] protocol proposes a highly structured protocol, including numerous practical tests (e.g., exposure to wheelchairs, crutches, screaming persons, and crowds) assessed by certified evaluators, with results stored in an international database [
11]. This approach ensures transparency and standardization but requires specialized teams, resources, and certification processes, which may limit its widespread application in everyday AAI settings.
Additionally, the C-BARQ questionnaire used by Sakurama et al. [
12] was proposed as a preliminary screening tool to identify promising candidates before undergoing an aptitude test. By quantifying traits such as aggression, excitability, fear, and attachment behaviours, it helps filter out unsuitable dogs early on. However, when considered alone, this approach relies entirely on caregiver reports, and thus, it does not directly measure how dogs respond to stimuli, but it does comprehend other stimuli commonly present as real-life challenges in AAI sessions.
Although existing protocols led to important advances, they still present some gaps. Current tools may not always reflect the variety of real-life AAI settings [
9], rely mainly on qualitative evaluations [
3], apply rigid score systems [
10], depend only on subjective caregiver perception [
12], or demand extensive personnel resources and certification procedures [
11]. In this study, we tried to fill the procedural gaps of current AAI testing protocols by developing a novel multiparametric test, the
Suitability Test (SuiTe) for dogs, intended to be used as a pre-screening aptitude test across all types of AAIs without distinction. By combining direct behavioural observation across a wide range of controlled but realistic stimuli, independent scoring systems, caregiver questionnaires, and physiological measures (salivary cortisol), the
SuiTe test aims at increasing reliability, objectivity, and feasibility. Given the existence of national guidelines under a ministerial mandate, the Italian context is relatively advanced and more stringent compared to other countries. Consequently, the present data are not only relevant at the national level but could also serve as a reference point for the development of comparable frameworks in other countries.
2. Materials and Methods
2.1. Subjects
Dog–caregiver dyads were recruited by word of mouth and included both dogs already involved in AAIs and dogs exclusively kept as companions.
Dogs of all breeds and mixed-breeds, of both sexes, and regardless of neutering status, were included. Dogs could not be recruited if they were less than 1 year of age or over 14 years, in case of a current diagnosed disease (acute or chronic), as well as during ongoing pregnancy, lactation, or oestrus. Caregivers were required to be legally adults, over 18 years old, and the person who gives the most care to the dog.
The initial sample was formed by 44 dog–human dyads; three were excluded due to health issues that emerged during or immediately after the test; and another three were excluded due to the absence of a completed questionnaire by the caregiver after three reminders. The final sample size was N = 38.
Dogs were purebred (5 Labradors, 4 Golden Retrievers, 3 Poodles, 3 Jack Russell Terriers, 2 Maremmano-Abruzzese Sheepdog, 1 Dachshund, 1 Border Collie, 1 Breton, 1 French Bulldog, 1 Parson Russell Terrier, 1 German Shepherd working line, 1 Australian Shepherd, 1 Italian Hound, and 1 Weimaraner) and mixed-breed (12), and 39.5% of dogs were male (N = 15; 11 neutered and 4 intact) and 60.5% female (N = 23; 11 neutered and 12 intact) of different ages (mean = 6.26 ± 3.24 years; range = 1–14 years). Six of them had been previously certified as therapy dogs and participated in AAI/AAS with their caregivers. Caregivers were 6 male and 32 female of different ages (mean = 37.24 ± 16.57 years; range = 20–68 years). Of these, 25 worked with animals or had expertise in the field, while 13 did not.
2.2. Location and Equipment
The tests were carried out from February to June 2025 at the Department of Veterinary Sciences—University of Pisa, Italy. The room where the test was performed measured 4 m × 5 m, with a ceramic floor where three “X” (Points 1, 2, and 3 in
Figure 1) marks were made using paper adhesive tape. The three points were selected during the design of the test to facilitate its execution: Points 1 and 3 indicate where the caregiver should stand, depending on the phase being performed, while Point 2 is used to position the stimulus at the appropriate distance, as detailed in the protocol below. In addition, the points were chosen based on the position of the cameras to ensure that the dog was always visible. The room has large windows on one side and two doors (A and B), as shown in
Figure 1. A desk, sofa, wardrobe, and sink were present. A box was placed on the desk, containing palatable food, toys (balls and ropes), paper sheets, and a brush; the box remained closed when not used by the operator. A paper trash bin, a metallic spoon, and two GoPro
® (Hero7 Black) cameras with tripods for videorecording the procedure were also required, as indicated below. Outside the room, behind door B, a wheelchair, a cleaning cart, a crutch, a white coat, a surgical face mask, sunglasses, and a baby doll were placed and kept ready for the test.
2.3. Experimental Protocol (SuiTe)
For this protocol, a minimum of four operators were required, as follows: (a) Operator 1 guided the protocol and remained in the room throughout the test; (b) Operator 2 assisted in forming the group and wore a white coat and surgical face mask; (c) the female stranger was a woman unfamiliar to the dog; and (d) the male stranger was a man unfamiliar to the dog.
2.3.1. Preparation Phase
Before the test, caregivers were asked to bring a preferred toy and a palatable food for their dog; if they did not bring any, the researchers provided beef treats and various toys (e.g., balls, ropes). The caregiver was briefed about the protocol and asked to sign the informed consent in a room separated from where the test occurred. They were also informed that the test would be presented step-by-step by Operator 1, who would remain in the room for the entire procedure. Operator 1 explained that the dog would be gradually exposed to the test stimuli and that the protocol could be interrupted (temporarily or permanently) by leaving the room if the dog shows signs of high stress, fear, or aggression or if requested by the caregiver.
The caregiver was asked to provide information regarding the dog’s food allergies or intolerances, current health diseases or pain, and ongoing medication.
Saliva collection (T0) was then performed by the caregiver, following the instructions provided below. After this, the caregiver and the dog were accompanied to the test room and entered through entrance door A, marking the start of the procedure.
2.3.2. Test Protocol
The
SuiTe protocol was developed based on the authors’ previous experience in the field of AAIs, as well as on other behavioural tests with similar purposes, both published [
9] and unpublished [
3,
11]. The test is divided into two main phases:
Environmental stimuli: Where the dog interacts with the environment, food, toys, and unusual objects to assess the dog’s motivation and behaviour in their presence;
Social stimuli: Where social attraction towards unfamiliar people of different genders, tolerance to handling, and reactions to “unusual” social stimuli are evaluated.
Each phase consisted of various sub-phases, described below in detail.
Phase 1: Environmental stimuli. Sub-phases:
- a.
Room exploration: The dyad enters through entrance door A. The dog on a leash can move around the room and the caregiver follows it. After 1 min, the leash is removed, and the dog is allowed to explore independently for another minute (total phase time: 2 min). The leash is then placed back on the dog; however, if it is too short, a choke chain, or too long, Operator 1 provides the caregiver with a leash of 1.5 m in length.
- b.
Food motivation: Operator 1 places a palatable food on a paper sheet on point 2. The caregiver stands on point 1 with the dog on the leash and prevents the dog from reaching the food for 10 s. After that, the dog is allowed to approach and eat the food, and the caregiver is asked to stop the dog, verbally at first; if unsuccessful in stopping the dog, they may use the leash, avoiding coercion. Finally, after removing the paper and the food, a piece of food is given to the dog from the caregiver’s open hand.
- c.
Toy motivation: The dog’s preferred toy (if present or chosen between a ball and a rope toy) is placed on point 2. The caregiver stands on point 1 with the dog on the leash, preventing the dog from reaching the toy for 10 s. The dog is then allowed to interact freely with the toy and play, either alone or involving the caregiver, who may invite the dog to play as they usually do. The caregiver will then ask the dog to release the toy, first verbally and, if unsuccessful, physically remove it from the dog’s mouth in a calm and non-coercive manner and without punishment. The toy is returned to Operator 1.
The caregiver and the dog exit the room through door A and wait for the Operator to call them to enter. During this time, three unusual objects (a wheelchair, a cleaning cart, and a crutch) are placed in the room, as shown in
Figure 2. The caregiver and the dog enter the room:
- d.
Unusual static objects: The caregiver and the dog, on a leash, can move around the room and explore the objects for around 10 s each.
- e.
Sounds: While the dog explores the objects and does not pay attention to Operator 1, a paper trash bin and then a metallic spoon are dropped to the ground. Operator 1 checks whether the dog shows signs of high stress, in which case the test is temporarily interrupted to take a break, leaving the room.
- f.
Unusual moving objects: While the caregiver–dog pair moves around the room, Operator 1 pushes first the cleaning cart and then the wheelchair. After a few seconds, the operator gradually approaches the dog and walks nearby.
The caregiver and the dog exit the room through door A and wait for the operator to call them to re-enter (3 min pause; the dog can drink water) after the objects are removed.
Phase 2: Response to social stimuli. Sub-phases:
- g.
Separation: The dog, without the leash, is free to move. The caregiver greets the dog as is usually done when leaving a familiar place (e.g., home), exits the room through door B, and leaves the leash inside. Operator 1 remains in the room with the dog, positioning himself on the wall opposite door B. Operator 1 does not initiate contact with the dog unless the dog seeks it. After 2 min, Operator 1 goes to the centre of the room, calls the dog by name, and interacts if the dog wants to and approaches (1 min).
- h.
Social attraction (caregiver): After 3 min outside, the caregiver enters the room through door B and stops at point 1, without interacting with or greeting the dog for 30 s. After that, the caregiver may greet the dog as usual.
- i.
Handling (caregiver): The dog is touched in the most sensitive areas (mouth, ears, paws, and tail), first in a gentle and then in a rougher manner; the first serves to check the potential aggressive response of the dog and the second is to simulate manipulation in AAI that might not follow common advice on how to gently handle dogs.
- j.
Brush (caregiver): The dog is brushed along the back for 10 s.
- k.
Play invitation (caregiver): Operator 1 gives the preferred toy to the caregiver to play with the dog, and then the caregiver asks the dog to release the toy verbally and, if necessary, physically.
- l.
Social attraction (female stranger): The dog on a leash and the caregiver stand on point 3 and wait for the entrance of the female stranger. In this phase, the dog on the leash may freely approach the stranger, and the caregiver is instructed not to prevent the dog from reaching her. The female stranger positions herself at point 1 and waits without interacting with or greeting the dog for 30 s. After that, she calls the dog by name while remaining standing, lowering her upper body, and greeting the dog by placing a hand on his head (replicating the typical and incorrect way unfamiliar people greet an unknown dog).
- m.
Handling (female stranger): The dog on a leash is touched in the most sensitive areas (mouth, ears, paws, and tail), first in a gentle and then in a rougher manner; the first serves to check the potential aggressive response of the dog and the second is to simulate manipulation in AAI that might not follow common advice on how to gently handle dogs.
- n.
Brush (female stranger): Operator 1 gives the brush to the stranger, and the dog is brushed along the back for about 10 s.
- o.
Play invitation (female stranger): Operator 1 gives the preferred toy to the female stranger to play with the dog, and then the dog is asked to release the toy verbally and, if necessary, physically. After that, the female stranger leaves the room.
- p.
Social attraction (male stranger): The dog is on a leash, and the caregiver stands on point 3 and waits for the entrance of the male stranger. In this phase, the dog on the leash may freely approach the stranger, and the caregiver is instructed not to prevent the dog from reaching him. The male stranger positions himself on point 1 and waits without interacting with or greeting the dog for 30 s. After that, he calls the dog by name while remaining standing, lowering his upper body, and greeting the dog by placing a hand on its head (replicating the typical and incorrect way unfamiliar people greet an unknown dog).
- q.
Handling (male stranger): The dog on a leash is touched in its most sensitive areas (mouth, ears, paws, and tail), first in a gentle and then in a rougher manner; the first serves to check the potential aggressive response of the dog and the second is to simulate manipulation in AAI that might not follow common advice on how to gently handle dogs.
- r.
Brush (male stranger): Operator 1 gives the brush to the stranger, and the dog is brushed along the back for about 10 s.
- s.
Play invitation (male stranger): Operator 1 gives the preferred toy to the male stranger to play with the dog, and then the dog is asked to release the toy verbally and, if necessary, physically.
- t.
Social attraction (group): The female stranger and Operator 2 enter the room and place themselves near the male stranger, forming a group. The group remains quiet for 10–15 s while standing, then talks to each other with a normal tone of voice for 10–15 s, and, finally, talks to each other in a loud voice for 10–15 s. In this phase, the dog on the leash and the caregiver stand on point 3 and may freely approach the group; the caregiver is instructed not to prevent the dog from reaching them.
- u.
Handling (group): The group calls the dog by name. In this phase, the dog on the leash may freely approach the group, and the caregiver is instructed not to prevent the dog from reaching them if the dog wants to. If the dog approaches, the group greets and pets the dog simultaneously. The group exits the room (door B).
- v.
White coat and surgical face mask: Operator 2 enters the room (door B) wearing a white coat and a surgical face mask, stands on point 1, and does not interact with the dog for 30 s. The operator then leaves the room (door B).
- w.
Crutches and sunglasses: The male stranger enters the room (door B) and walks with a limp, moving around and passing near the dog twice. He then leaves the room (door B).
- x.
Baby doll: The female stranger enters the room (door B) holding a baby doll in her arms. After a few seconds, she puts it down and pretends to make it walk by holding it by the arms, as if it were a child [
13]. She then leaves the room (door B).
- y.
Training and concentration: The caregiver asks the dog to perform some commands (e.g., sit, stay, paw, or others the dog knows). In the middle of the exercise, an operator from outside the room knocks on door B and briefly appears, says a phrase, and then closes the door. The caregiver tries to keep the dog focused on the exercise.
- z.
Sofa: The caregiver asks the dog to go up the sofa.
After the test, the dog and the caregiver remain in the room to immediately collect the second saliva sample (T1). The entire test protocol takes approximately 45 min.
2.3.3. Assessment of the Dog Behavioural Responses
Two independent evaluators observed the video-recorded tests using ad hoc scoring grids and a classification. The
SuiTe protocol and relative scoring grids were developed by adapting and integrating items from a validated test protocol [
9], as well as from protocols not yet validated [
3,
10,
11], and from the prior experience of the authors in the field of AAIs and behavioural testing.
The
SuiTe evaluation consisted of a mixed objective–subjective approach, based on X-Y scores derived from “valence” and “arousal” dimensions of the circumplex model of affect [
14] and on the descriptions provided in the four-quadrant Cartesian grid. The score attribution was performed using two primary dimensions, valence and arousal. These dimensions provide a framework for organizing and interpreting emotional responses [
15]:
Valence: Refers to the positive or negative character of an emotion [
16]. Positive valence refers to approach, retaining, tolerance, and acquisition; negative valence refers to withdrawal, escape, refusal, and aggression [
16]. In other words, it reflects whether an experience was perceived as unpleasant, e.g., fear or frustration, or pleasant, e.g., contentment or play [
17].
Arousal: Represents the activation level of an emotional response, independent of its valence. High levels of arousal can be manifested as excitement or agitation, while low arousal corresponds to calm or passive states [
18].
The evaluators’ scoring grid was structured as a Cartesian plane, with the x-axis representing Valence (−5 ≤ x ≤ +5; positive x > 0 and negative x < 0) and the y-axis representing Arousal (−5 ≤ y ≤+5; y > 0 for high levels and y < 0 for low levels) of the behaviours observed during the test. For sub-phase a-x, the evaluators assigned a coordinate (x; y) based on the dog’s behaviour, following a training period with these grids. To facilitate evaluation, a description of each quadrant was provided, illustrating the extreme behavioural expressions at the coordinates (−5,−5, e.g., the dog does not want to interact with people), (−5, +5, e.g., the dog tries to escape the room and actively avoids interaction), (+5, +5, e.g., intense attempts to seek physical contact with a stranger, such as jumping), and (+5, −5, e.g., the dog remains calm, does not actively seek physical contact, and shows interest in the environment). A “Notes” section was also provided to the evaluators to record observed behaviours, including aggressiveness when it occurred. Aggressiveness was scored as −5 for valence and +5 for arousal. The same valence score was also assigned when the dog refused to participate in or complete a sub-phase (e.g., remaining distant from unfamiliar people and avoiding interaction or physical contact). For sub-phases y–z, an independent score was assigned using another grid, which will be discussed in a future investigation.
After an overall evaluation, each dog was classified as suitable (S), unsuitable (U), or pending suitability (P), similar to a classification used in a previously validated test for the suitability of dogs in AAIs [
9]. Translating the characteristics of a “suitable dog” into scores was not simple; the evaluation needed to recognize as suitable both slightly excitable and active dogs, as well as calmer and more relaxed ones, recognizing that these two contrasting profiles are not mutually exclusive in terms of suitability. Furthermore, the capacity for recovery is one of the aspects contributing to dog suitability and influencing the score assigned to each phase. For instance, during the scoring of the “Sounds” phase, a dog that was initially startled but quickly recovered and approached the stimulus received a score indicating a more positive valence, compared to a dog that remained unsettled after the noise and did not return to a calm state.
In the
SuiTe, a “suitable dog” was defined as one displaying behaviours with a positive valence and an arousal level fluctuating between positive and negative values, gradually increasing as valence shifted toward more positive levels. This pattern formed a triangular area with the vertex at 0,0 and the other two vertices approximately at +5 for valence and +3/−3 for arousal, as illustrated in
Figure 3. Dogs classified as suitable (S) were expected to obtain the majority of their scores within this triangular green area (
Figure 3), with particular attention given to performance in the most relevant sub-phases (e.g., exposure to unusual objects such as wheelchairs, responses to sounds, or handling by unfamiliar people). The triangular area represents dogs ranging from proactive in social interaction and slightly excitable to calm and relaxed when approached, without being easily excitable. If a dog obtained all scores within the triangular area except in the food-motivation sub-phase, the dog could still be classified as suitable (S), provided that overall valence remained positive. Conversely, if a dog obtained most scores within the area but exhibited high arousal in certain sub-phases (e.g., intense attempts to seek physical contact with a stranger, such as jumping), the dog could be classified as P, with this aspect highlighted to the caregiver so that the dyad could work on it. Finally, if a dog obtained the majority of scores outside the triangular area and with negative valence, the dog was classified as unsuitable (U). The evaluators were responsible for assessing each case individually and considering the overall behaviour of the dog–caregiver dyad throughout the entire test.
2.4. Questionnaire
In addition, caregivers were asked to complete an online, purposefully developed questionnaire incorporating items from the validated Italian version of the C-BARQ [
19], as previously performed by [
12,
20], as well as newly introduced items designed to assess dogs’ behaviour in response to the stimuli administered during the behavioural test. A copy of the questionnaire is provided in the
Supplementary Materials.
2.5. Saliva Sample Collection and Cortisol Analysis
Saliva samples were collected from the dog using Salivette
® (Sarstedt, Rommelsdorft, Germany) swabs, which were chewed for approximately 90 s between the molars, both before the test (T0) and immediately after its end (T1). The saliva samples were refrigerated at −20 °C and stored at this temperature in the laboratory of the Department of Veterinary Sciences (University of Pisa, Italy) until analysis. Cortisol quantification was performed using the Salimetrics Cortisol Enzyme Immunoassay Kit
® (Salimetrics, Segrate, Italy), as reported in a previous study [
21].
2.6. Statistical Analysis
Descriptive statistics were performed, and data are presented as median (with interquartile range [IQR] and minimum–maximum values), unless otherwise stated. Data were analyzed using RStudio software (Version 2024.12.0+467).
SuiTe: The inter-rater reliability between two independent evaluators was calculated using the Intraclass Correlation Coefficient (ICC). A non-parametric Kruskal–Wallis test (p < 0.05), followed by a post hoc Dunn test with Holm correction, was performed to assess potential differences in valence and arousal median scores between the three groups (suitable, unsuitable, or pending suitability). To further investigate these aspects and groups’ differences, and to increase group sizes for more appropriate statistical analysis (N = 14 vs. 9 and 5), the suitable and pending suitability groups were merged, and a non-parametric Mann–Whitney U test (p < 0.05) was performed (suitable and pending suitability vs. unsuitable).
Salivary cortisol: Normality was assessed using the Shapiro–Wilk test, which revealed that salivary cortisol values were not normally distributed (T0: W = 0.68568, p = 2.249 × 10−7; T1: W = 0.80792, p = 2.928 × 10−5). Because the data were not normally distributed, a non-parametric Wilcoxon paired signed-rank test (p < 0.05) was performed to examine potential differences in cortisol levels at T0 and T1 between the three (suitable, unsuitable, and pending suitability) and also between the two groups (suitable and pending suitability vs. unsuitable). Delta cortisol levels (T1−T0) were calculated, and a non-parametric Kruskal–Wallis test (p < 0.05) was conducted to assess potential differences in delta cortisol levels between the three groups and a non-parametric Wilcoxon signed-rank sum test (p < 0.05) between the two groups.
Questionnaire scores: A non-parametric Kruskal–Wallis test (p < 0.05), followed by a post hoc Dunn test with Holm correction, was conducted to assess potential differences in mean scores between the three (suitable, unsuitable, and pending suitability) and also between the two groups (suitable and pending suitability vs. unsuitable), considering the different sections of the questionnaire (trainability and obedience, separation, aggression, fear and anxiety, excitability, attachment behaviours, and other behaviours).
In addition, the open-ended responses provided by the caregivers to the final question of the Italian questionnaire were considered in the analysis:
“In your opinion, would your dog be suitable for participating in Animal-Assisted Interventions (also known as pet therapy)? Please answer Yes/No and explain the reason for your response”. The open-ended responses provided by the caregivers, reported in the Results section, were simplified and classified by a researcher into four categories: “yes”, “yes after training/course”, “maybe”, and “no.” A qualitative analysis was performed to evaluate the level of concordance between the caregivers’ perception of suitability and the results of the SuiTe test, and Cohen’s kappa was calculated.
4. Discussion
This preliminary study aimed to assess the potential of two combined tools, a behavioural aptitude test (Suitability Test SuiTe) and a survey (an ad hoc revised questionnaire incorporating C-BARQ), to be used as pre-screening for dog suitability in AAIs/AAS. In addition, salivary cortisol levels were measured to include a physiological indicator of the dogs’ stress response to the stimuli presented during the test. Results showed that differences exist between dogs classified as suitable (S) or pending suitability (P) versus unsuitable (U), both in the valence scores obtained in SuiTe and in separation, attachment, and fear/anxiety behaviours as assessed by the questionnaire; and these differences were aligned. However, suitability as assessed by the SuiTe resulted in lower values compared to suitability assessed by caregivers through an open question.
Dogs judged suitable obtained scores with more positive values compared to unsuitable ones. In fact, when considering the mean valence scores across all sub-phases together, values were >2 for S and P dogs, whereas they were close to zero for U dogs. Therefore, the test could overall be considered not excessively stressful or challenging. On the other hand, if the SuiTe—designed as a pre-screening for suitability—were too demanding for the dogs in terms of emotional state, more dogs (especially U) would likely have exhibited negative valence during the test. Although it would have been useful to analyze the responses to each stimulus separately within the groups, in order to determine which of them had the greatest impact, this will be investigated in the future with a larger sample. For this pilot study, a different approach was chosen considering that, in a real AAI session, stimuli can occur altogether, at the same time, or in a limited time, and thus the overall valence is relevant. For instance, an unsuitable dog that is scared by a sudden noise may subsequently stop interacting with people, given the possible presence of a carryover effect. One of the strengths of suitable dogs should be the ability to manage even situations that can lead to mild discomfort, particularly with the support of their caregiver. Another strength of suitable dogs is that they can quickly recover in case something unpredictable and unpleasant occurs, as this sometimes may happen in AAI sessions.
The results of this study, in fact, suggest that suitable dogs have characteristics like interspecific sociability and tolerance, curiosity toward stimuli, and willingness to approach without fear, as well as focus and engagement with stimuli around them. As for arousal levels, suitable dogs varied, shifting towards more central ranges, but no statistically significant differences were found between S, P, and U dogs, either in their test behaviours or in the questionnaire scores (excitability scale). It may reflect the diversity among dogs considered suitable in the test (ranging from slightly positive and slightly negative arousal), from calm and relaxed individuals to more lively and energetic ones, matching the type of people they are expected to work with and the type of activities (individual or group). Indeed,
SuiTe aimed to detect dogs with the appropriate aptitude to participate in AAIs in a preliminary phase of selection, and including dogs that exhibit moderate variations in arousal levels seems appropriate. Companion dogs do not request to participate in AAIs, nor do they voluntarily consent to be involved. Assessing their emotional state during these activities—ensuring that they do not experience stress and, ideally, that they enjoy the interaction [
22]—is a fundamental aspect of both the selection and the ongoing re-evaluation process, representing an essential responsibility on our part [
23,
24].
Cortisol concentrations in saliva were within the physiological range [
25,
26], with mean values higher than those reported in a previous study conducted under similar conditions and in the same laboratory [
21]. The analysis did not reveal differences based on suitability; it can be hypothesized that, when negative valence occurred, the dog’s physiological response was not strong enough to elicit marked cortisol peaks, even in unsuitable dogs. The observed differences between the reported raw
p-values and the Holm-adjusted
p-values (
Table 4) may result from the multiple pairwise comparisons among the three groups and the application of the Holm correction, which increases
p-values, particularly those close to the significance threshold. In addition, the lack of a significant difference may suggest that the test is valid from an animal welfare perspective, as it does not elicit strong stress responses [
27]. At the same time, however, since it does not elicit strong stress, it may have limited effectiveness for assessment, as it does not provide additional information beyond the behavioural response of the dog, which is indeed what is strictly necessary for assessing suitability in AAIs. Data obtained from a bigger sample should confirm these results or give a more nuanced picture.
Dogs classified in the test as suitable and pending suitability showed lower mean scores of separation, attachment, and fear/anxiety sections, as similarly reported by Sakurama et al. [
12], except for the attachment. Separation and attachment sections were linked to
SuiTe sub-phase g, as the separation response, which typically is a stressful moment for most dogs and may be influenced by the quality of the attachment bond established within the dyad [
21]. Moreover, the tendency in the attachment section may suggest different attachment behaviours between S and U. The fear/anxiety section is expressed throughout the test (in response to both environmental and social stimuli), and whenever fear was observed, these corresponded to negative valence scores. Thus, the confirmation of the existing difference in caregivers’ perception of behaviours between S and U may indicate that dogs in S (and S + P) exhibit less fear/anxiety than those in the U group, as well as fewer problems with separation from the caregiver, as previously reported in the literature [
12]. The next step of this project will consist of better assessing the relevance of the bond between dogs and their caregivers [
28], which might give new insights for the interpretations of these findings.
Interestingly, when analyzing S, P, and U separately, the aggressiveness section of the questionnaire did not reveal any significant differences; however, when S and P were combined, a significant difference emerged (higher in U vs. S + P). Indeed, the presence of aggressiveness is one of the most important factors leading to the classification of a dog as unsuitable [
9,
12]. Ensuring proper selection, training, and management of dogs in AAIs is essential not only for safeguarding their welfare but also for preventing undesirable scenarios, including aggression and potential bites, during sessions [
29].
Taking into consideration the results of both the tests and the questionnaires, it appears that a moderate agreement exists. These findings are consistent with previous studies, in which the C-BARQ was considered a useful tool to help caregivers evaluate whether their dogs had a temperament making them suitable for becoming therapy dogs [
12]. However, the agreement decreased for the overall suitability. In fact, in this study, we also asked caregivers to say whether and why their dogs were, in their opinion, suitable for participating in animal-assisted interventions; it emerged that the overall concordance with
SuiTe was fair. This means that only half of the caregivers provided an accurate assessment, according to our
SuiTe evaluations. When caregivers considered the dogs unsuitable,
SuiTe provided the same evaluation in almost all cases. Conversely, only 41% of those who believed their dogs were suitable were correct based on
SuiTe evaluations. A possible explanation is that caregivers may have a subjective perception of their dogs and, wishing they are suitable, unconsciously overestimate their abilities, not knowing what AAIs consist of, and therefore what their dogs would be exposed to. This aspect may suggest that, in accordance with Mongillo et al. [
9], behavioural evaluation through testing represents the better assessment for identifying suitability for AAIs. Moreover, results indicated that caregivers may overestimate desirable behaviours for AAIs in their dogs or underestimate behaviours that render dogs unsuitable for AAIs (e.g., the dog does not want to be pet by a stranger, does not approach a group of unfamiliar people when called, shows fear or anxiety in the presence of unusual objects, or becomes stressed by unfamiliar people [
9,
12]), but more investigations are needed. These aspects can be particularly concerning when considering the everyday life of the dyad and the welfare of the dog. Caregivers may not always be able to detect stressful situations for their animals or recognize certain stress signals exhibited by the dog [
30]; this can, in turn, lead caregivers to force the dog to cope with difficulties, possibly without the caregiver’s support [
31,
32]. From this perspective,
SuiTe could also represent a valuable tool for dogs that will never participate in AAIs, as it may help identify potential difficulties in their everyday life.
SuiTe presents some innovation compared to previous tests. The overall duration of the test was extended to approximately 45 min, and a defined time to each sub-phase was established in order to improve standardization and repeatability. Differently from the test published by [
9], which includes a role-playing component that is probably closer to real-life situations because different stimuli are presented together, and in
SuiTe the environmental and social stimuli were separated into two distinct phases and subsequently into sub-phases. This approach allows for a more precise identification of the type of stimuli to which the dog’s response is related; in the future, an evaluation assigning a specific score to each sub-phase might provide valuable information, although the current study showed that dogs were assessed as unsuitable when the overall valence was low, suggesting that the negative score for one stimulus was usually associated with negative scores to other stimuli. Moreover, this tool, thanks to its graphical output, allows researchers and caregivers to clearly identify which
SuiTe stimuli are most challenging for the dog.
The ultimate aim of SuiTe is not to determine whether a dog is currently ready to participate in AAIs, but rather to assess his aptitude as a screening test before starting a training programme. In fact, if a caregiver wishes to begin an educational programme to become a handler in AAIs and to certify the dog, SuiTe can help identify the specific aspects that require further work.
Future analyses will attempt to address some limitations of this study. The sample size will be expanded by including new dyads, and a more detailed and deeper evaluation will be conducted on the comparison between each sub-phase of the SuiTe and the scores of specific correlated questions. Such analyses may provide a more accurate assessment of the efficiency of individual sub-phases in reflecting what is reported by caregivers. Moreover, some stimuli are missing in SuiTe, for ethical reasons, such as interactions with children and older adults, as well as other additional situations like an off-leash moment with strangers, food being offered by a stranger, and interactions with people in a wheelchair or lying in bed, which could be added to further refine the protocol. However, the SuiTe should maintain agility and feasibility; therefore, this represents a compromise, and integration with the survey is necessary to cover some unexplored aspects. In addition, a possible follow-up after one year or more, repeating the test, could provide important information on whether and how these responses change over time, particularly in young dogs. Finally, a better evaluation of the dog–caregiver bond might provide relevant inputs.