3.1. Speech Interaction in “Mother–Child” Dyads: Expert Analysis of the Audio Fragments of Speech Samples
It was shown that mothers of TD children and children with ASD and DS used different strategies of speech interaction with their child. Progress in the speech development of TD children correlated with MS features: if the mother spoke clearly, F (19.270) = 234.42, p < 0.0001, R2 = 0.943, the child spoke clearly (Beta = 0.150) and used phrases (Beta = 0.3297); if the mother encouraged the child, F (19.270) = 16.667, p < 0.0001, R2 = 0.540, and asked questions, F (19.270) = 208.41, p < 0.0001, R2 = 0.936, the child answered the mother’s questions (Beta = 0.584), asked questions (Beta = 0.160), and used phrases (Beta = 0.154); if the mother made pauses between phrases, F (19.270) = 40.477, p < 0.0001, R2 = 0.740, the child used phrases (Beta = 0.276) and asked questions (Beta = 0.100)—Multiple Regression analysis.
The repetition of the words spoken by the TD children by the mothers, F (19.270) = 35.568, p < 0.0001, R2 = 0.715; the highlighting of certain words, F (19.270) = 26.909, p < 0.0001, R2 = 0.654; and the stretching of sounds in words, F (19.270) = 5.625, p < 0.0001, R2 = 0.233, were due to the low levels of speech of the TD children—the speech was indistinct (Beta = 0.356), the replies were “yes-no” (Beta = 0.127), and the child did not answer the mother’s questions and did not ask questions.
The initiative manifested by the mother in interacting with the ASD child correlated with the following characteristics of her speech: emotional (0.56 Spearman correlation, p < 0.05), the mother spoke loudly (0.78), clearly (0.94), addressed the child (0.91), referred to the child by name (0.82), asked questions (0.71), encouraged the child (0.55), instructed the child (0.76), repeated the question or the same words (0.87), repeated the words for the child (0.62), specified what the child said (0.60), grammatically simplified (0.91), singled out individual words (0.70), stretched sounds in words (0.58), and paused between phrases (0.60). These features of MS, F (33.96) = 34.131, p < 0.0001, R2 = 0.921, correlated with the characteristics of the ASD child’s speech: the child’s speech was emotional (Beta = −0.157, p < 0.009), the child answered the mother’s questions (Beta = 0.390, p < 0.003), responded to the mother’s reply (Beta = 0.390, p < 0.003), used a replica of the word (Beta = 0.220, p < 0.03), and used a “yes-no” replica (Beta = 0.209). MS addressed to ASD children was characterized by the repetition of children’s words, the clarification of children’s utterances, the simplification of speech, the emphasis of words by the voice, the stretching of sounds in words, and the presence of long pauses between phrases. However, these features of MS do not always lead to progress in the development of the ASD child’s speech.
Direct correlations between the characteristics of MS and the features of DS child speech behavior were shown: if the mother spoke loudly, the child used a loud voice, F (1.58) = 10.101, p < 0.002 (R2 = 0.148; Beta = 0.385); the mother’s emotional speech correlated with emotional child speech, F (1.58) = 0.425, p < 0.02 (R2 = 0.086; Beta = 0.292); if the mother demonstrated joy, the child was happy, F (1.58) = 34.511, p < 0.0001 (R2 = 0.373; Beta = 0.610)—Regression analysis; if the mother was angry, the child was also angry, F (3.53) = 518, p < 0.0001 (R2 = 0.593; Beta = 0.660). However, a clear articulation of the MS words, F (3.56) = 45.607, p < 0.0001, R2 = 0.710, was correlated both with clear pronunciation of the words by the child (Beta = 0.653) and with fuzzy articulation of the child (Beta = 0.769).
Though the behavior and speech of the experimenter were standardized, if the adult highlighted certain words, F (1.48) = 6.654, p < 0.01 (R2 = 0.122; Beta = 0.349), or stretched sounds in the words, F (1.48) = 9.84, p < 0.01 (R2 = 0.17; Beta = 0.12), the number of DD children’s answers increased. The number of ID children’s answers increased if the adult used emotional speech, F (1.48) = 6.49, p < 0.01 (R2 = 0.119; Beta = 0.345).
3.2. Elements of Nonverbal Behavior of Mothers and Children in the Process of Interaction: Expert Analysis
The mothers of the TD children, children with ASD, and children with DS often looked at the child (94% of the answers of experts for the mothers of TD children, 99% for mothers of ASD and DS children) and rarely showed discontent (19%—DS, 7%—TD, and 3%—ASD). The mothers of all groups of children attracted the attention of the child with eyes (44%—TD, 43%—ASD, and 40%—DS) and used gestures (56%, 60%, and 56%). Mothers of children with ASD and with DS were more likely to touch children in the process of interaction (93%—ASD and 80%—DS) compared to mothers of TD children (30%) and to attract the child’s attention more through gestures (66%—ASD and 57%—DS) than mothers of TD children (41%). Mothers of DS children were more often dissatisfied with their children (19%) than mothers of ASD children (3%) and TD children (7%), smiled less (36%) than mothers of ASD children (50%) and TD children (51%), and looked around less often (14%) than mothers of ASD children (51%) and TD children (48%).
The nonverbal behavior of ASD children and children with DS differed from the behavior of TD children by a large number of behavioral elements: TD children smiled more often (56%) than ASD children (19%) and DS children (33%), others showed discontent (1%—TD children, 19%—ASD children, and 47%—children with DS), looked at the mother more often (71%), used “eye-to-eye” contact (56%), and looked at the object (85%). Children with ASD were more likely to look around (91%) than children with DS (76%) and TD children (62%) and to touch mothers more often (51%) than children with DS (44%) and TD children (16%).
Orphans with DD and ID smiled when they interacted with the experimenter (60%—DD children and 68%—ID children). Children with ID looked at the adult more often than children with DD (64%—DD children and 75%—ID children), but they rarely looked at the object compared to DD children.
3.3. Perception, Acoustic, and Phonetic Features of MS Addressed to the Child
3.3.1. Perception Data
MS directed to TD children and children with DS (38% and 37.2%) was aimed at attracting the attention of the child more than MS addressed to ASD children (
Figure 1).
Mothers of DS children commented more on the child’s utterances (20.9%) than mothers of ASD children (12.7%) and TD children (16%). MS addressed to ASD and DS children stimulated children to verbal response more (43.8% and 37.6% ASD and DS, respectively) than MS addressed to TD children (32%) with already well-formed speech. MS addressed to TD children and children with ASD contained equally few encouragements for the child (5% and 4.3%), whereas mothers of DS children encouraged their children more often (8.3%).
Determining the emotional state of the mothers, the listeners noted a calm state for mothers of TD, ASD, and DS children, but the number of listeners’ answers was minimal for mothers of DS children (32.2%—DS, 64%—TD, and 52.2%—ASD). According to the responses of listeners, the mothers of ASD children manifested a state of joy more frequently (19.7%) than mothers of DS children (17.9%) and TD children (15%). The listeners noted the states of anger (23.5%—DS, 10.5%—ASD, and 6%—TD), sadness (10.2%—DS, 7.8%—ASD, and 7%—TD), and aggression (10.1%—DS, 7.2%—ASD, and 2%—TD) for mothers of DS children more often than for mothers of ASD and TD children.
3.3.2. Acoustic Data
The mother’s utterances addressed to ASD children were longer (
p < 0.001—the Mann–Whitney criterion) than the utterances directed to TD children and children with DS, and contained shorter phrases (
p < 0.005). In MS addressed to children with DS, the duration of pauses between phrases in utterances was longer than the MS directed to TD children (
p < 0.005). The pauses between phrases (
p < 0.005) and between words (
p < 0.005) were longer in MS addressed to ASD than to TD children (
Figure 2A).
Pitch values were significantly higher (
p < 0.001—Mann–Whitney test) in MS addressed to ASD than MS addressed to TD children and children with DS in utterances, phrases, words, and stressed vowels (
Figure 2B).
The pitch values of the voices of ASD children’s mothers were higher if the child was older, F (1.74) = 4.531, p < 0.03 (R2 = 0,057; Beta = 0.240)—Regression analysis. Mothers’ voices addressed to ASD boys were higher than those directed to ASD girls, F (1.74) = 4.118, p < 0.04 (R2 = 0.054; Beta = 0.231). The lower the voice of the ASD child’s mother (the speech was less emotional) was, the higher the child’s CARS score, F (1.74) = 37.397, p < 0.0001 (R2 = 0.336; Beta = −0.579).
Discriminant analysis showed the difference in pitch values of MS as a function of the gender of the TD child, F (6.47) = 2.879, p < 0.01 Wilks’ Lambda—0.731 Wilks’ = 0.800. Mothers’ communications with TD boys were more emotional (the pitch values were higher) than with the girls, F (6.47) = 2.872, p < 0.01 (R2 = 0.268; Beta = −1.815).
For MS addressed to DS children, the correlation, F (1.21) = 5.790, p < 0.002 (R2 = 0.216; Beta = −0.465—Regression analysis), between the child’s age and the pitch values of the mother’s phrases was revealed. The mother’s voice was higher if the child was younger. The age and the gender of DS children were not correlated with the speech features of their mothers.
The maximal values of stressed vowel formant triangle areas (341,498 conv. units) and the minimal values of unstressed vowel formant triangle areas (135,067.8 conv. units) were revealed in MS addressed to children with DS. The values of stressed vowel formant triangle areas in MS addressed to children with ASD (180,670 conv. units) were higher than values in MS addressed to TD children (177,800.5 conv. units). The values of vowel articulation index (VAI) for stressed vowels in words from MS directed to children with DS (1.16) were higher than the values of VAI for stressed vowels for MS directed to children with ASD (0.95) and TD children (0.93).
The data on the values of the vowel formant triangle areas and the values of VAI indicated clearer articulation in MS addressed to children with speech disorders, i.e., ASD and DS, than to TD children. The articulation of the mothers of DS children was clearer than the articulation of the mothers of TD and ASD children.
3.3.3. Phonetic Data
MS addressed to ASD children contained fewer reduced vowel phonemes (up to six reduced phonemes in the utterance, at an average of 3.4) compared to MS directed to TD children (up to nine reduced phonemes, with a mean of 4.5). The results of the phonetic analysis indicated the use of a pronunciation standard for the Russian language by mothers of DS children. The number of reduced phonemes in the utterances of mothers of DS children was up to 28% of the total number of phonemes used in speech.
3.5. Phonetic Data of Child Speech
TD, ASD, DS children, and orphans used all the vowel phonemes of the Russian language. The consonant phonemes differed in TD, DS, and ASD children in the frequency of occurrence of the phoneme /r/. In ASD child speech, the phoneme /R/ occurred (a uvular trembling). For ASD child speech, there were replacements of phonemes: /s’/ to S/, /t’/ to /tS’/, and /j/ to /r’/, the non-formation of phonemes: /r/, /tS’/, /S/, and /Z/, and the use of vocalized /r’/ and aspirated /k/. In contrast to children with ASD and TD children, children with DS did not have the phonemes /b’/, /m’/, /f/, /z/, /z’/, /r’/, /S/, /S’/, and /r/. Thus, the lack of formation of the majority of consonant phonemes in DS children was shown, which leads to a lack of speech in children. The omissions of phoneme /r/ and the substitutions of phonemes /r/ to /R/ (a uvular trembling), /l/ to /v/, /Z/ to /z/, and /S/ to /s/ were specific characteristics of orphans’ speech.