The Phonetics of Speech Production and Medical Research

Fivela, Barbara Gili; Grimaldi, Mirko; Sigona, Francesco; d’Apolito, Sonia

doi:10.1285/i25327518v3i2p17

by Barbara Gili Fivela *, Mirko Grimaldi, Francesco Sigona and Sonia d’Apolito

Department of Humanities & CRIL-DReAM, University of Salento, Italy

* Author to whom correspondence should be addressed.

J. Interdiscip. Res. Appl. Med. 2019, 3(2), 17-26; https://doi.org/10.1285/i25327518v3i2p17

Published: 31 December 2019

Abstract

The production of speech requires the interplay of a number of cognitive and motoric activities, which make it an interesting object of study from both a linguistic and a medical point of view. In this paper, we discuss, first, the features and domain of application of the most used technologies in linguistic research on speech production, focusing on those that have been applied to medicine. Second, we offer an insight into the main results that have been obtained so far in studying dysarthria in Italian Parkinson’s Disease, as an example of the interdisciplinary, experimental research at the border between linguistics and medicine.

Keywords: 3D articulography; ultrasound; phonetics; phonology; medicine; dysarthria; Parkinson’s Disease

1.1. The analysis of speech production

Investigations of speech production may rely on acoustic analyses, which offer information on the issue but do not allow a direct and detailed observation of the production mechanism. Nowadays phoneticians can easily perform acoustic analyses, thanks to the wide availability of recording facilities and dedicated software. However, acoustic data represent only the starting point of studies that aim to investigate speech production in depth. Therefore, a typical speech production study involves the recording of both acoustic and articulatory material, and the subsequent analysis of both types of data. Crucially, data acquisition has to be synchronized (via hardware or post-processing) in order to shed light on the articulatory mechanism responsible for the production of linguistic sounds. The remainder of this section offers an overview of the main software currently used for acoustic investigation, as well as the main technologies exploited in most current studies on speech production.

Acoustic analysis, as well as acoustic recording, may be performed by means of PRAAT (Boersma and Weenink 2020). As for speech analysis, the software allows the user to segment the audio signal and to time-align multiple levels of text labels, such as those regarding consonant and vowel boundaries, as well as the presence of intonational events, words, phrases and larger constituents (Figure 1). The system also allows the user to semi-automatically perform a wide range of acoustic measurements with reference to the abovementioned labels, i.e., points in time. Crucial information on speech may already be collected by means of this type of material and analysis.
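
As a minimal sketch of the kind of label-based measurement described above, the following Python fragment derives segment durations from a list of time-aligned intervals. The interval times, the labels and the helper name `segment_durations` are illustrative assumptions; the fragment does not use PRAAT or read its file formats.

```python
from typing import List, Tuple

# Hypothetical interval tier: (start in s, end in s, label).
Interval = Tuple[float, float, str]


def segment_durations(tier: List[Interval]) -> List[Tuple[str, float]]:
    """Return (label, duration in ms) for every labelled interval."""
    out = []
    for start, end, label in tier:
        if end < start:
            raise ValueError(f"interval {label!r} ends before it starts")
        if label:  # skip unlabelled stretches such as silence
            out.append((label, round((end - start) * 1000.0, 3)))
    return out


# Invented boundaries for a sequence with a medial geminate consonant.
tier = [
    (0.000, 0.120, ""),    # leading silence, unlabelled
    (0.120, 0.190, "p"),
    (0.190, 0.270, "a"),
    (0.270, 0.450, "pp"),  # medial geminate
    (0.450, 0.560, "a"),
]
durations = segment_durations(tier)
```

Measurements of this kind (for instance, the duration of a consonant and of the preceding vowel) are the sort of values that labels make available at specific points in time.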

However, investigations on speech production nowadays often rely on the use of appropriate, in some cases purpose-built, technology.

1.2. Electromagnetic articulography

An example of purpose-built technology is the ElectroMagnetic Articulograph (EMA). Various systems are described in the literature (Kaburagi et al. 2005; Stella et al. 2012; Stella et al. 2013), which are used to track the movements of a set of sensors glued on the main speech articulators, such as tongue and lips (see Figure 2, right panel), as well as on more stable parts, such as the front teeth or the forehead, to compensate for head movement. For instance, the Carstens Medizinelektronik GmbH’s system AG501 (http://www.articulograph.de) is used to identify the Cartesian coordinates x, y and z, as well as azimuth and elevation of directionally sensitive magnetic field sensors (8 to 24) at a sampling frequency of up to 1250 Hz in real time. The measuring sensors are single axis coils. Nine reference coils, along three arms (blue arms in Figure 2, left panel), arranged to form a three-dimensional frame of reference, emit magnetic fields at different well known frequencies between 7500 and 13,750 Hz. During a recording session, the alternating currents induced in the sensors by the magnetic fields of the reference coils are separated by their frequencies, digitized and sent in real-time to the control unit. Dedicated software stores the current values, making them available for the spatial arrangement determination process.
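
The separation of the induced signal by frequency can be illustrated with a minimal sketch. It assumes a hypothetical digitization rate of 100 kHz, an 8 ms analysis window and three of the transmitter frequencies within the stated range; none of these values, nor the function name `amplitude_at`, comes from the AG501 documentation, and the actual processing performed by the system may well differ.

```python
import math
from typing import Dict, List


def amplitude_at(signal: List[float], freq_hz: float, rate_hz: float) -> float:
    """Amplitude of the sinusoidal component at freq_hz (single-bin DFT)."""
    n = len(signal)
    re = sum(x * math.cos(2 * math.pi * freq_hz * i / rate_hz)
             for i, x in enumerate(signal))
    im = sum(x * math.sin(2 * math.pi * freq_hz * i / rate_hz)
             for i, x in enumerate(signal))
    return 2.0 * math.hypot(re, im) / n


RATE_HZ = 100_000.0  # hypothetical digitization rate
N_SAMPLES = 800      # 8 ms: a whole number of cycles for each frequency below

# Hypothetical amplitudes induced in one sensor by three reference coils.
true_amps: Dict[float, float] = {7500.0: 0.8, 10000.0: 0.3, 13750.0: 0.5}

# The sensor signal is the sum of the contributions of all coils.
signal = [
    sum(a * math.sin(2 * math.pi * f * i / RATE_HZ) for f, a in true_amps.items())
    for i in range(N_SAMPLES)
]

# Each coil's contribution is recovered from the mixture by its frequency.
recovered = {f: amplitude_at(signal, f, RATE_HZ) for f in true_amps}
```

Because each coil emits at its own frequency, the strength of each contribution can be read off independently; these per-coil values are what a position-estimation step would take as input.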

1.3. Ultrasound tongue imaging

Articulatory studies of tongue motion are becoming popular in phonetics, thanks to the adoption of ultrasound systems, which were originally used for clinical purposes and have eventually been adapted to suit the investigation of speech production. Such systems, which are both non-invasive and non-obtrusive, are able to provide the profile of the tongue during speech production, although the image of the tongue apex and radix may often be occluded by the jaw and the hyoid bone, respectively. Ultrasound images are obtained thanks to high-frequency (2-14 MHz) sound waves emitted from an array of piezoelectric transducers (crystals), multiplexed in time: only one crystal emits sound waves in a given time interval, while all the remaining crystals are used to convert the received echoes to voltage values. The ultrasound wave travels through tissues and is reflected when it reaches an interface between tissues or materials with different impedance properties. Ultrasound images are reconstructions of such interfaces, obtained by processing the voltage values of the received echoes. In the case of tongue imaging, the probe is placed under the chin and the wave travels upward, through the tongue (Figure 3, left panel). When the wave reaches the upper surface of the tongue, where the impedance changes (think of the mucous membrane and the air in the oral cavity), it is reflected back to the probe and the surface of the tongue may be reconstructed (Figure 3, right panel). The tongue profile during speech production may thus be made available as a sequence of images that are time-aligned with the audio recording.
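
Two textbook relations underlie this description and can be sketched in a few lines of Python: the depth of a reflecting interface follows from the round-trip time of its echo, and the fraction of energy reflected at an interface grows with the impedance mismatch. The speed of sound and the impedance values below are conventional approximations introduced here as assumptions, not figures from the present study, and the function names are illustrative.

```python
SPEED_OF_SOUND_TISSUE_M_S = 1540.0  # conventional soft-tissue value (assumption)
Z_TISSUE_RAYL = 1.6e6               # approximate impedance of soft tissue (assumption)
Z_AIR_RAYL = 4.0e2                  # approximate impedance of air (assumption)


def echo_depth_mm(round_trip_s: float,
                  speed_m_s: float = SPEED_OF_SOUND_TISSUE_M_S) -> float:
    """Depth of a reflecting interface, given the echo's round-trip time."""
    # The wave travels to the interface and back, hence the division by two.
    return speed_m_s * round_trip_s / 2.0 * 1000.0


def reflected_fraction(z1: float, z2: float) -> float:
    """Fraction of incident intensity reflected at normal incidence."""
    return ((z2 - z1) / (z2 + z1)) ** 2


depth_mm = echo_depth_mm(80e-6)  # an echo received 80 microseconds after emission
fraction = reflected_fraction(Z_TISSUE_RAYL, Z_AIR_RAYL)
```

With these assumed values almost all of the energy is reflected at the tissue-air boundary, which is consistent with the tongue surface appearing as a clear contour while structures beyond the air gap remain invisible.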

Nowadays, increased sampling rates allow researchers to obtain a more adequate number of frames per second during speech, as producing the verbal chain requires quite a high-speed sequence of gestures. For instance, an Aplio XV machine, by Toshiba Medical System corp. (http://www.medical.toshiba.com), allowed us to collect the first ultrasound data related to speech in Italy (Grimaldi et al. 2008). At that time, during each recording session, the ultrasound pictures were exported as a continuous video stream (at 25 Hz), by means of a dedicated S-Video output; this stream was acquired synchronously with the audio signal, by means of an external a/v analog-to-digital acquisition card, and then recorded in real time on a dedicated PC. Nowadays, more compact and faster systems may be used, which do not require assembling a constellation of hardware and software tools to ensure the acquisition of all the information needed to analyze speech. Systems such as the “Micro Ultrasound system for speech research”, offered by Articulate Instruments Ltd. (http://www.articulateinstruments.com/alan-wrench/), for instance, may include hardware synchronization for audio capture and the AAA software for data analysis.
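
The relation between a 25 Hz video stream and a synchronously acquired audio signal can be sketched as follows. The audio sampling rate of 44,100 Hz, the assumption that both streams start at the same instant, and the function names are hypothetical; they are not details of the acquisition setup described above.

```python
from typing import Tuple


def frame_to_sample_range(frame_index: int, frame_rate_hz: float,
                          audio_rate_hz: int) -> Tuple[int, int]:
    """Half-open range [start, end) of audio samples spanned by one frame."""
    if frame_index < 0:
        raise ValueError("frame_index must be non-negative")
    start = round(frame_index * audio_rate_hz / frame_rate_hz)
    end = round((frame_index + 1) * audio_rate_hz / frame_rate_hz)
    return start, end


def sample_to_frame(sample_index: int, frame_rate_hz: float,
                    audio_rate_hz: int) -> int:
    """Index of the video frame that is on screen at a given audio sample."""
    return int(sample_index * frame_rate_hz // audio_rate_hz)


FRAME_RATE_HZ = 25.0    # video rate mentioned in the text
AUDIO_RATE_HZ = 44_100  # hypothetical audio sampling rate

first = frame_to_sample_range(0, FRAME_RATE_HZ, AUDIO_RATE_HZ)
tenth = frame_to_sample_range(10, FRAME_RATE_HZ, AUDIO_RATE_HZ)
frame = sample_to_frame(17_640, FRAME_RATE_HZ, AUDIO_RATE_HZ)
```

At 25 Hz each image covers 40 ms of speech, which illustrates why higher frame rates are desirable for fast articulatory gestures.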

Linguists, phoneticians, and laboratory phonologists in particular use the abovementioned technologies to test their hypotheses on the linguistic organization of speech production. The way speech gestures are realized and phased with respect to each other is investigated in various languages, as well as with respect to the changes in their organization when a second, or foreign, language is produced. However, the testing of the very same linguistic hypotheses, or a better understanding of speech production in general, may be of interest to medical research too (for an overview, see Gili Fivela, Zmarich 2013).

2. Articulatory studies and speech pathology

For many decades, the description, explanation and rehabilitation of various speech articulatory disorders have been based on data derived from phonetic transcriptions, which rely on the transcriber’s perception, and from acoustic analysis of pathological speech. However, these methods reveal limitations concerning the description, the explanation and the rehabilitation of speech pathologies. As for description, limits are due to the subjectivity of auditory perception and of the subsequent evaluation (Shriberg and Lof 1991), and to the lack of a two-way correspondence between acoustic and articulatory data (Sondhi 1979). As for the possible explanation, limits are identified, firstly, in the “opacity” introduced by the distance between the cause of the pathology and the measured acoustic or perceptual events, which originate at the periphery of the speech production system; secondly, limits concern the inadequacy of phonetic-phonological theories based on the study of perceptual or acoustic targets, and therefore not suitable to explain motor events of an intrinsically dynamic nature (Weismer, Tjaden, Kent 1995). Finally, concerning rehabilitation, a limit consists in the inability to provide adequate articulatory feedback, which can be only partially provided by auditory and, even less, by acoustic information.

A useful integration to these traditional methods is then represented by the information on the dynamics and kinematics of speech, that is the description of the movement (e.g., duration, extension) and the description of the physical conditions responsible for a given movement (which, in addition to the already mentioned descriptors, include, e.g., mass coefficients, rigidity, damping; Bingham 1988). Kinematic and dynamic data offer reliable qualitative and quantitative information about the movements of the articulatory organs (Gracco 1992); from an explanatory point of view, they can provide a precious alternative to traditional explanations, when framed within a suitable theoretical framework, such as a non-linear dynamic theory (see Port, van Gelder 1995). For instance, according to Articulatory Phonology, the main unit of the motoric control for linguistic purposes is the so-called articulatory gesture (see the review in Goldstein and Fowler 2003). The articulatory gesture is dimensioned with respect to the spatial coordinates that represent the vocal tract and with individual quantities that are proportional according to a gestural score that indicates the organization of gestures and the intervals of activation of the tract variables which are relevant to the production (e.g., Lip Aperture or Lip Protrusion, which according to Gestural Phonology—Browman and Goldstein 1986—are associated with a series of articulators). The impact of such theory on the investigation of speech pathologies is acknowledged (van Lieshout, Goldstein 2008), and is tightly linked with experimental investigations performed with the technologies described in the first section of this paper.
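
To make the notion of kinematic description concrete, the following sketch computes the duration, extension (amplitude) and peak speed of a single movement from a sampled position trajectory. The sampling rate, the synthetic trajectory and the function name `movement_measures` are hypothetical, introduced only for illustration; real articulatory data would normally be filtered before differentiation.

```python
import math
from typing import Dict, List


def movement_measures(position_mm: List[float], rate_hz: float) -> Dict[str, float]:
    """Duration, amplitude and peak speed of one sampled movement."""
    if len(position_mm) < 3:
        raise ValueError("need at least three samples")
    dt = 1.0 / rate_hz
    # Central differences approximate velocity at the interior samples.
    velocity = [
        (position_mm[i + 1] - position_mm[i - 1]) / (2.0 * dt)
        for i in range(1, len(position_mm) - 1)
    ]
    return {
        "duration_ms": (len(position_mm) - 1) * dt * 1000.0,
        "amplitude_mm": max(position_mm) - min(position_mm),
        "peak_speed_mm_s": max(abs(v) for v in velocity),
    }


RATE_HZ = 250.0  # hypothetical sampling rate
N = 51           # 51 samples at 250 Hz span 200 ms

# Synthetic smooth movement of 10 mm: half a cosine cycle from 0 to 10 mm.
trajectory = [5.0 * (1.0 - math.cos(math.pi * i / (N - 1))) for i in range(N)]
measures = movement_measures(trajectory, RATE_HZ)
```

Measures of this kind describe the movement itself; dynamic parameters such as mass, rigidity or damping concern the conditions that give rise to it and are not estimated by this sketch.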

In fact, the usefulness of technologies in studying speech articulation has been clear at least since the review by Thompson-Ward and Murdoch (1998), on the methods to verify the articulatory capacity in dysarthria, and the review by Barlow et al. (2009), on kinematic measures related to speech. However, as recalled in the latter, Sonoda (1974) already observed the usefulness of the real-time collection of kinematic data by means of orofacial magnetometry, because such data can be used in the rehabilitation of dysarthria, thanks to the visualization of the movement of the articulators. Not surprisingly, the first studies employing ultrasound in the investigation of pathological speech date back to the early 1980s. As Thompson-Ward and Murdoch (1998) recall, Shawker et al. (1984) already carried out a study on non-pathological and dysarthric speakers, and noted that ultrasound made it possible to observe significant differences in the articulation of vowels (/a/, /i/) and consonants (/k/), and could therefore represent a promising technique. The usefulness of electromagnetic articulography is also acknowledged and its use is quite widespread (Wong et al. 2010) in the investigation of various pathologies, such as dysarthria (Rong et al. 2012; Jaeger et al. 2000; McAuliffe et al. 2005; Wong et al. 2010a; Wong et al. 2011), apraxia of speech (Katz et al. 2003; Katz, McNeil 2010) and stuttering (van Lieshout et al. 1993, 2004; McClean et al. 2004; McClean, Runyan 2000; Max 2004). Further, several corpora of kinematic data have been collected for the investigation of apraxia, stuttering and dysarthria: for instance, van den Berg et al. (2006) for apraxia, van Lieshout et al. (1993) and Ward (1997) for stuttering, and the TORGO database, which includes video, audio and 3D electromagnetic articulography recordings (AG500), for dysarthria (Rudzicz et al. 2008).

Interestingly, a number of studies investigate the use of technologies, in particular electromagnetic articulography, in training during therapies for pathologies that cause articulatory problems in speech. For example, Bose et al. (2001) reported on a single-case study of an adult subject suffering from Broca’s aphasia and apraxia. They demonstrated the usefulness of the PROMPT system (Hayden 1984), originally developed for oral language “teaching” and involving the use of auditory, visual and tactile stimuli.

Further, a dynamic field of investigation and application concerns the use of articulatory information for the realization of rehabilitation and training protocols based on biofeedback. The visualization of ultrasound images, for instance, has been successfully used to provide articulatory feedback in the therapy of articulatory problems (Bacsfalvi et al. 2007; Bernhardt et al. 2005). Systems developed from articulographic data are also of considerable interest. In this context, the BALDI systems (Massaro 2004; for Italian, cf. BALDINI, Cosi et al. 2002, LUCIA, Cosi, Drioli 2008) and ARTUR (Eriksson et al. 2005; Engwall 2008) are worthy of note. They are computer training systems used for teaching pronunciation (not only in the case of speech/hearing problems, but also for foreign language learning): thanks to the use of “talking heads”, i.e., heads and faces animated by computers, these systems show users how to produce speech sounds, and help them to self-correct their speech gestures. In particular, ARTUR has been developed on the basis of acoustic, video and EMA data (or MOVETRACK; Branderud 1985) through the optimization of acoustic-articulatory inversion (Kjellström, Engwall 2009).

With respect to the use of biofeedback, the contribution of Katz and McNeil (2010), who studied the effect of feedback provided in real time in order to verify its usefulness for apraxic patients, is also particularly important. The study, carried out by means of EMA with sensors positioned on the subjects’ tongue, describes how biofeedback on the position of the tongue can be provided in real time (see also Schulz et al. 2006). The system shows subjects how to reach a target indicated on the computer monitor, and proved to be a useful aid in the improvement of articulatory problems due to apraxia.

As far as Italian is concerned, the usefulness of articulatory investigations has long been recognized and applied to the analysis of stuttering (Zmarich 1999a, 1999b, and following works), and, more recently, to the investigation of dysarthria in Parkinson’s Disease.

3. Dysarthric speech in Parkinson’s Disease: state of the art of investigations on Italian

Along with the development of Parkinson’s Disease, patients often suffer from hypokinetic dysarthria. They show a reduction in the amplitude and speed of movements (Ackermann, Ziegler 1991; Duffy 2005; Darley et al. 1975), which has an impact on speech production too. Besides these very general characteristics, a considerable intra-speaker variability concerning speech abnormalities is observed, which may also depend on various factors, such as the task subjects are asked to perform.

Nevertheless, some established characteristics have been identified. In early stages, there may be mild phonetic impairment, while at later stages articulation becomes less precise, and reading rate becomes slower, with an increased number and duration of pauses which may relate to the difficulties in initiating the articulator movement. Perceptually, hypokinetic dysarthria is characterized by monopitch, monoloudness, reduced stress, imprecise consonants, and inappropriate silences. From a kinematic point of view, a reduction is observed in the movement peak velocity and amplitude of lips and jaw. This is evident from reduced vowel formant transition extents, reduced vowel spaces and reduced consonant spectral distinctiveness (Tjaden 2008). Besides reduction, incoordination has also been observed, and different gesture coordination relations can imply changes in syllabic affiliation too.
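
The notion of a reduced vowel space can be made concrete with a minimal sketch that computes the area of the triangle spanned by the corner vowels in the F1-F2 plane. The formant values below are invented for illustration and are not data from the studies cited here; the function name `polygon_area` is likewise an assumption.

```python
from typing import List, Tuple


def polygon_area(points: List[Tuple[float, float]]) -> float:
    """Area of a simple polygon via the shoelace formula."""
    n = len(points)
    twice_area = sum(
        points[i][0] * points[(i + 1) % n][1]
        - points[(i + 1) % n][0] * points[i][1]
        for i in range(n)
    )
    return abs(twice_area) / 2.0


# Invented mean (F1, F2) values in Hz for the corner vowels /i/, /a/, /u/.
control = [(300.0, 2300.0), (750.0, 1300.0), (320.0, 800.0)]
reduced = [(350.0, 2000.0), (650.0, 1350.0), (370.0, 1000.0)]

control_area = polygon_area(control)  # in Hz squared
reduced_area = polygon_area(reduced)
ratio = reduced_area / control_area   # below 1 when the space is reduced
```

A ratio below 1 indicates that the corner vowels lie closer together, i.e., that they are acoustically less distinct.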

Research on speech production in Italian dysarthric speech by Parkinsonian subjects has been performed by adopting the Laboratory Phonology approach (Pierrehumbert et al. 2000). That is, the impact of the disease on speech production has been investigated with reference to its effect on phonological features, rather than by itself. Specifically, most of the analyses performed so far aimed at investigating phonological contrasts involving vowels and consonants, with specific attention to the realization of geminate vs. singleton differences, as produced by mild-to-severe Parkinson’s Disease patients (Gili Fivela et al. 2014; Iraci et al. 2016, 2017a, 2017b; Iraci 2017; Gili Fivela et al., submitted). Besides the obvious interest in the realization of single vowels and consonants, gemination was chosen in order to also obtain information on the realization of syllables, as the presence of a geminate rather than a singleton involves a change in syllabic affiliation and a change in the duration of the preceding vowel.

The main goals in investigating Italian dysarthric speech have been 1) to verify whether pathological speakers are able to produce the articulatory correlates of a given phonological contrast, and 2) to identify possible compensatory phenomena speakers may adopt in order to overcome the motor deficit.

Following the predictions of the Articulatory Phonology framework, the hypothesis behind 1) and 2) has been that distinctiveness is not threatened, since the vocal tract (which is seen as a dynamical system) would re-organise as a function of the linguistic contrasts to be maintained. In order to reach the abovementioned goals and check the main hypothesis, a corpus of acoustic and articulatory (AG501) data has been collected, by asking mild-to-severe Parkinson’s Disease patients who had developed hypokinetic dysarthria to read aloud highly controlled sentences. The sentences include minimal pairs differing in the medial consonant (singleton vs. geminate) and in the vocalic composition.

Results of the analyses performed so far (acoustic by means of PRAAT and articulatory by means of MAYDAY, Sigona et al. 2015) indicate that Parkinson’s dysarthric speakers show spatial alterations (amplitude of movements) that do not necessarily involve a reduction of the range of motion. Specifically, they even show gestures of systematically increased amplitude in the case of horizontal, antero-posterior displacement of the tongue (Gili Fivela et al. 2014; Iraci 2017). However, intra-speaker analyses showed that phonological distinctions are preserved as much as possible, through a re-organisation of the vocal tract movements, consisting in re-adjustments of surrounding/secondary articulatory gestures (Iraci 2017; Gili Fivela et al., submitted).

The analysis of the abovementioned corpus is still ongoing, and it will be extended thanks to a funded national project (PRIN 2017JNKCYZ), within which prosodic correlates will also be investigated.

References

Ackermann, H., and W. Ziegler. 1991. Articulatory deficits in Parkinsonian dysarthria: an acoustic analysis. Journal of Neurology, Neurosurgery, and Psychiatry 54: 1093–1098.
Barlow, S., D. Finan, R. Andreatta, and C. Boliek. 2009. Kinematic measurements of speech and early orofacial movements. In Clinical management of sensorimotor speech disorders. Edited by M. McNeil. Medical Publishers Inc.: pp. 80–99.
Bingham, G. P. 1988. A note on dynamics and kinematics, Haskins Laboratories, Status Rep. Speech Research SR-93/94. pp. 247–251.
Bacsfalvi, P., B. Bernhardt, and B. Gick. 2007. Electropalatography and ultrasound in vowel remediation for adolescents with hearing impairment. Advances in Speech-Language Pathology 9: 36–45.
Bernhardt, B., P. Bacsfalvi, B. Gick, B. Radanov, and R. Williams. 2005. Exploring the use of electropalatography and ultrasound in speech habilitation. Journal of Speech-Language Pathology and Audiology 29, 4: 169–182.
Bose, A., P. A. Square, R. Schlosser, and P. Van Lieshout. 2001. Effects of PROMPT therapy on speech motor function in a person with aphasia and apraxia of speech. Aphasiology 15, 8: 767–785.
Boersma, P., and D. Weenink. 2020. Praat: doing phonetics by computer. University of Amsterdam. Available online: http://www.fon.hum.uva.nl/.
Branderud, P. 1985. Movetrack: A movement tracking system, Proc. French–Swedish Symposium on Speech, Grenoble, 113–122.
Browman, C.P., and L. Goldstein. 1986. Towards an articulatory phonology. In Phonology Yearbook 3. Edited by C. Ewen and J. Anderson. Cambridge: CUP: pp. 219–252.
Cosi, P., M. Cohen, and D.W. Massaro. 2002. BALDINI: BALDI Speaks Italian! Eurospeech 2002, Denver, Colorado-U.S.A, September 16–20.
Cosi, P., and C. Drioli. 2008. LUCIA a new emotive/expressive Italian talking head. In Emotions in the Human Voice, Volume III, Chapter 9. Edited by K. Izdebski. Plural Publishing, Inc.: S.Diego CA, USA: pp. 153–176.
Darley, F. L., A. E. Aronson, and J. R. Brown. 1975. Motor speech disorders. W.B.Saunders and Co.
Duffy, J. R. 2005. Motor Speech Disorders: Substrates, Differential Diagnosis, and Management, 2nd ed. Elsevier Mosby.
Engwall, O. 2008. Can audio-visual instructions help learners improve their articulation? An ultrasound study of short term changes. Brisbane, Australia, Proceedings of Interspeech 2008: pp. 2631–2634.
Eriksson, E., O. Bälter, O. Engwall, and A. M. Öster. 2005. Design recommendations for a computer-based speech training system based on end-user interviews (ARTUR). Proceedings of the Tenth International Conference on Speech and Computers, SPECOM 2005, Patras, Greece, October 17–19; pp. 483–486.
Gili Fivela, B., and C. Zmarich. 2013. Le patologie del parlato e il ruolo dello studio strumentale dell’articolazione: una prima ricognizione. In “Multimodalità e Multilingualità: la Sfida più Avanzata della Comunicazione Orale”, Atti del 9° convegno AISV, 21–23 gennaio 2013. Edited by V. Galatà. Università Ca’ Foscari, Venezia. Bulzoni: Roma: pp. 185–202.
Gili Fivela, B., M. Iraci, V. Sallustio, M. Grimaldi, C. Zmarich, and D. Patrocinio. 2014. Italian Vowel and Consonant (co)articulation in Parkinson’s Disease: extreme or reduced articulatory variability? In Proceedings of the 10th International Seminar on Speech Production (ISSP). Edited by Susanne Fuchs, Martine Grice, Anne Hermes, Leonardo Lancia and Doris Mücke. Cologne, Germany, May 5–8, pp. 146–149.
Gili Fivela, B., M.M. Iraci, M. Grimaldi, and C. Zmarich. 2015. Consonanti scempie e geminate nel morbo di Parkinson: la produzione di bilabiali. In “Il farsi e il disfarsi del linguaggio. Acquisizione, mutamento e destrutturazione della struttura sonora del linguaggio/LANGUAGE ACQUISITION AND LANGUAGE LOSS. Acquisition, change and disorders of the language sound structure”. Edited by M. Vayra, C. Avesani and F. Tamburini. Milano: Officinaventuno: pp. 289–312.
Gili Fivela, B., S. d’Apolito, and G. Di Prizio. submitted. Labialization and Prosodic Modulation in Italian Dysarthric Speech by Parkinsonian Speakers: A Preliminary Investigation. Speech Prosody 2020, Tokyo, Japan, May 24–28 2020.
Goldstein, L., and C. Fowler. 2003. Articulatory phonology: a phonology for public language use. In Phonetics and Phonology in Language Comprehension and Production: Differences and Similarities. Edited by A. Meyer and N. Schiller. New York: Mouton: pp. 159–207.
Gracco, V.L. 1992. Analysis of speech movements: practical considerations and clinical application, Haskins Laboratories, Status Report on Speech Research SR-109/110. pp. 45–58.
Grimaldi, M., B. Gili Fivela, F. Sigona, M. Tavella, P. Fitzpatrick, L. Craighero, L. Fadiga, G. Sandini, and G. Metta. 2008. New technologies for simultaneous acquisition of speech articulatory data: ultrasound, 3D articulograph and electroglottograph, C. Delogu, M. Falconi (eds.), Proc. of LangTech, Feb. 08, Rome, FUB, 81–85.
Hayden, D. A. 1984. The PROMPT system of therapy: Theoretical framework and applications for developmental apraxia of speech. Seminars in Speech and Language 2, n.2: 139–155.
Iraci, M.M. 2017. Vowels, consonants and co-articulation in Parkinson’s Disease. Unpublished PhD Dissertation, 2017, University of Salento, Lecce, Italy.
Iraci, M., M. Grimaldi, and B. Gili Fivela. 2016. Phonology drives compensation: bridging linguistic and clinical evaluation for a classification of dysarthric speech. In La fonetica nell’apprendimento delle lingue, Phonetics and language learning. Edited by R. Savy and I. Alfano. Studi Aisv 2, Officinaventuno, Milano: pp. 359–379.
Iraci, M., V. Sallustio, M. Grimaldi, C. Zmarich, D. Patrocinio, and B. Gili Fivela. 2017a. Il parlato nel morbo di Parkinson: ampiezza dei gesti articolatori e distintività dei suoni linguistici. In Il linguaggio disturbato. Modelli, strumenti, dati empirici. Edited by P. Sorianello. Roma: Aracne Editrice: pp. 93–108.
Iraci, M.M., M. Grimaldi, and B. Gili Fivela. 2017b. Il contributo di Fonetica e Fonologia alla riabilitazione logopedica personalizzata di soggetti parkinsoniani disartrici. In Tra medici e linguisti. Lingua e patologia. Le frontiere interdisciplinari del linguaggio. Edited by F. Dovetto. Collana “Linguistica delle differenze”. Roma: Aracne: pp. 253–265.
Jaeger, M., I. Hertrich, U. Stattrop, P.-W. Schönle, and H. Ackermann. 2000. Speech disorders following severe traumatic brain injury: Kinematic analysis of syllable repetitions using electromagnetic articulography. Folia Phoniatrica et Logopaedica 52: 187–196.
Kaburagi, T., K. Wakamiya, and M. Honda. 2005. Three-dimensional electromagnetic articulography: A measurement principle. J. of Acoustical Society of America 118: 428–444.
Katz, W. F., J. S. Levitt, and G. C. Carter. 2003. Biofeedback treatment of buccofacial apraxia using EMA. Brain and Language 87: 175–176.
Katz, W.F., and M. McNeil. 2010. Studies of Articulatory Feedback Treatment for Apraxia of Speech Based on Electromagnetic Articulography. Perspectives on Neurophysiology and Neurogenic Speech and Language Disorders 20: 73–79.
Kjellström, H., and O. Engwall. 2009. Audiovisual-to-articulatory inversion. Speech Communication 51, 3: 195–209.
Massaro, D. 2004. Symbiotic Value of an Embodied Agent in Language Learning, (BALDI). In Proceedings of 37th Annual Hawaii International Conference on System Sciences (CD/ROM). Computer Society Press: CD Rom: pp. 1–10.
Max, L. 2004. Stuttering and internal models for sensorimotor control: A theoretical perspective to generate testable hypotheses. In Speech motor control in normal and disordered speech. Edited by B. Maassen, R.D. Kent, H.F.M. Peters, P.H.H.M. van Lieshout and W. Hulstijn. Oxford (UK): OUP: pp. 357–387.
McAuliffe, M. J., E. C. Ward, and B. E. Murdoch. 2005. Articulatory function in hypokinetic dysarthria: an electropalatographic examination of two cases. Journal of Medical Speech-Language Pathology 13, 2: 149–168.
McClean, M.D., S.M. Tasko, and C.M. Runyan. 2004. Orofacial movements associated with fluent speech in persons who stutter. J Speech Lang Hear Res. 47, 2: 294–303.
McClean, M.D., and C.M. Runyan. 2000. Variations in the relative speeds of orofacial structures with stuttering severity. J Speech Lang Hear Res. 43, 6: 1524–31.
Pierrehumbert, J., M. Beckman, and D.R. Ladd. 2000. Conceptual Foundations of Phonology as a Laboratory Science. In Phonological Knowledge: Conceptual and Empirical Issues. Edited by N. Burton-Roberts, P. Carr and G.J. Docherty. Oxford: OUP: pp. 273–303.
Port, R.F., and T. van Gelder, eds. 1995. Mind as motion. MIT Press: Cambridge, MA.
Rong, P. Y., T. M. Loucks, H. J. Kim, and M. Hasegawa-Johnson. 2012. Assessment of tongue-jaw coordination in spastic dysarthria using simultaneous EMA and EMG recordings. Clinical Linguistics and Phonetics 26, 9: 806–822.
Rudzicz, F., P. van Lieshout, G. Hirst, G. Penn, F. Shein, and T. Wolff. 2008. Towards a comparative database of dysarthric articulation, in Proc. 8th Int. Seminar Speech Production (ISSP’08), Strasbourg, France, Dec. pp. 285–288.
Schulz, G.M., J. Hahn, G. Jin, J. Kiraly, and B. e B. Carstens. 2006. Translation Of 3-D Articulatory Signals Acquired By Electromagnetic Articulography To A Visual Display Of Lingual Movements For Biofeedback: Preliminary Results, Presentation during. Motor speech conference.
Shawker, T.H., and B.C. Sonies. 1984. Tongue Movement During Speech: A Real-Time Ultrasound Evaluation. J. Clin. Ultrasound 12: 125–133.
Shriberg, L.D., and G.D. Lof. 1991. Reliability studies in broad and narrow phonetic transcription. Clinical and Linguistic Phonetics 5: 225–279. [Google Scholar] [CrossRef]
Sigona, F., Stella, M. A. Grimaldi, and B. Gili Fivela. 2015. MAYDAY: a software for multimodal articulatory data analysis. In A. Romano, M.Rivoira, I. Mean-dri (eds.) “Aspetti prosodici e testuali del raccontare: dalla letteratura orale al parlato dei media”, Atti del 10° convegno AISV, 22-24 gennaio 2014, Università di Torino, Torino: Edizioni dell’Orso, 173-184. [Google Scholar]
Sigona, F., M. Stella, A. Stella, P. Bernardini, B. Gili Fivela, and M. Grimaldi. 2018. Assessing the Position Tracking Reliability of Carstens’ AG500 and AG501 Electromagnetic Articulogrphy during Constrained Movements and Speech Tasks. Speech communication 104: 73–88. [Google Scholar] [CrossRef]
Sondhi, M.M. 1979. Estimation of vocal-tract areas: The need for acoustic measurements. IEEE Trans. Acoust. Speech Signal Process. ASSP-27, 3: 268–273. [Google Scholar] [CrossRef]
Sonoda, Y. 1974. Observation of tongue movement employing magnetometer sensor. IEEE Trans. Magn. MAG-10, 954–957. [Google Scholar] [CrossRef]
Stella, M., P. Bernardini, F. Sigona, A. Stella, M. Grimaldi, and B. Gili Fivela. 2012. Numerical instabilities and threedimensional electromagnetic articulograph. Journal of the Acoustical Society of America 132, 6: 3941–3949. [Google Scholar] [CrossRef] [PubMed]
Stella, M., P. Bernardini, F. Sigona, A. Stella, M. Grimaldi, and B. Gili Fivela. 2013. Electromagnetic Articulography with AG500 and AG501. 14th Annual Conference of the International Speech Communication Association (ISCA), Interspeech, Lyon (France); pp. 1316–1320. [Google Scholar]
Thompson-Ward, E.C., and B.E. Murdoch. 1998. Instrumental assessment of the speech mechanism. In Dysarthria: A Physiological Approach to Assessment and Treatment. Edited by B. E. Murdoch. Stanley Thornes (Publishers) Ltd.: pp. 68–101.
Tjaden, K. 2008. Speech and swallowing in Parkinson’s disease. Topics in Geriatric Rehabilitation 24, 2: 115–126.
van den Berg, R., L. Nijland, and B. Maassen. 2006. Kinematic measurements of diadochokinetic performances in children with DAS or PD. SMC conference, Nijmegen.
van Lieshout, P. H. H. M., and L. Goldstein. 2008. Articulatory phonology and speech impairment. In The Handbook of Clinical Linguistics. Edited by Ball et al. Oxford: Blackwell.
van Lieshout, P. H. H. M., P. J. Alfonso, W. Hulstijn, and H. F. M. Peters. 1993. Electromagnetic articulography (EMA) in stuttering research. Forschungsberichte des Instituts für Phonetik und Sprachliche Kommunikation der Universität München 31: 215–224.
van Lieshout, P. H. H. M., W. Hulstijn, and H. F. M. Peters. 2004. Searching for the weak link in the speech production chain of people who stutter: A motor skill approach. In Speech Motor Control in Normal and Disordered Speech. Edited by B. Maassen, R. Kent, H. Peters, P. H. H. M. van Lieshout and W. Hulstijn. Oxford: OUP, pp. 313–355.
Weismer, G., K. Tjaden, and R.D. Kent. 1995. Can articulatory behavior in motor speech disorders be accounted for by theories of normal speech production? J. Phonetics 23: 149–164.
Wong, M. N., B. E. Murdoch, and B.-M. Whelan. 2010. Kinematic analysis of lingual function in dysarthric speakers with Parkinson’s disease: an electromagnetic articulograph study. International Journal of Speech-Language Pathology 12, 5: 414–425.
Wong, M. N., B. E. Murdoch, and B.-M. Whelan. 2010a. Tongue function in nondysarthric speakers with Parkinson’s disease: an electromagnetic articulography investigation. Journal of Medical Speech-Language Pathology 18, 3: 24–33.
Wong, M., B. E. Murdoch, and B.-M. Whelan. 2011. Lingual Kinematics in Dysarthric and Nondysarthric Speakers with Parkinson’s Disease. Parkinson’s Disease 2011: 352838.
Zmarich, C. 1999a. Dinamiche articolatorie nella produzione verbale fluente di normoparlanti e balbuzienti. In Atti delle XI Giornate di Studio del G.F.S. (A.I.A.). Edited by R. Del Monte and A. Bristot. Venezia: pp. 217–230.
Zmarich, C. 1999b. L’importanza dell’analisi cinematica: esemplificazioni relative alla balbuzie. In Atti del 6° Convegno Nazionale Informatica, Didattica & Disabilità. Edited by A. Tronconi. Andria (Bari): pp. 101–106.

Figure 1. PRAAT—waveform, spectrogram with overlapping fundamental frequency track, and two levels of time-aligned labels.

Figure 2. Speech recording with AG501 (left) and sensors glued on the subject’s tongue and lips (right).

Figure 3. Ultrasound probe position during speech recording (left) and example of ultrasound tongue image (right).
