Next Article in Journal
Experimental Study on Mechanical Properties, Energy Dissipation Characteristics and Acoustic Emission Parameters of Compression Failure of Sandstone Specimens Containing En Echelon Flaws
Next Article in Special Issue
Inundation Risk Assessment of Underground Space Using Consequence-Probability Matrix
Previous Article in Journal
Potential of Workshop Measurement Positioning System to Measure Oscillation Frequencies of Rigid Structures
Previous Article in Special Issue
Investigation of an Explosion at a Styrene Plant with Alkylation Reactor Feed Furnace
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Web Service for Evaluating the Level of Speech in Korean

1
Department of Convergence Software, Hallym University, Chuncheon-si, Gangwon-do 24252, Korea
2
Bio-IT Center, Hallym University, Chuncheon-si, Gangwon-do 24252, Korea.
3
Division of Speech Pathology and Audiology, Hallym University, Chuncheon, Ganwon-do 24252, Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2019, 9(3), 594; https://doi.org/10.3390/app9030594
Submission received: 18 January 2019 / Revised: 7 February 2019 / Accepted: 8 February 2019 / Published: 11 February 2019
(This article belongs to the Special Issue Selected Papers from IMETI 2018)

Abstract

:
Speaking is a way for humans to communicate with others using language. The ability to speak according to the speaker is very diverse. In general, language skills improve as intelligence improves. Therefore, it is known that the analysis of a speaker’s utterances is a good tool to evaluate the intellectual maturity of the speaker. Until recently, these evaluations have been done manually based on the experience of a handful of experts, but this approach is not only time consuming and costly, but also highly subjective. In this paper, we propose a Korean automatic speech analysis system based on Natural Language Processing (NLP) and web service to solve this problem. For this study, we constructed a web service based on Django to respond to the requests of various users. When a user delivered a transcription file of utterances to the server via the web, the server analyzed the speech ability of the speaker based on various indicators. The server compared the transcription file with the language ability indicators of persons of the same age as the speaker and displayed the result immediately to the user. In this study, we used KoNLPy, a Korean language-processing tool. The automatic speech analysis service analyzed not only the overall language ability of the speaker but also individual domains such as sentence completion ability and vocabulary ability. In addition, a faster and immediate service was made possible without sacrificing accuracy as compared to human analysis.

1. Introduction

Humans use language to communicate with others. There are documentary, behavioral, and linguistic aspects of communicating [1,2]. One of the linguistic aspects is how to communicate with others through dialogue. Speaking is the most natural way to engage in social interaction or social participation. In addition, utterances vary widely depending on the speaker’s ability. In speech pathology, we also use test tools to classify speech disorders by analyzing utterances [3]. The reason why the use of utterances as an evaluation measure is preferred is the method’s advantage of understanding how language is actually used in a natural situation [4]. Language development does not stop even after physical growth stops. As adults, language skills increase or decrease.“ Whole-life language development”, according to which language development continues not only in infancy and childhood but also into adolescence, adulthood, and old age, is emphasized in the field of speech pathology [5,6,7]. However, research regarding language development analysis is mostly carried out with respect to subjects in infancy and childhood. Recently research about the language development of adults and seniors has grown. However, there is no research on the entire age range.
Two methods are used for analyzing utterances: the first involves experts who analyze the results by hand, whereas the second uses an automatic analysis system equipped with software technology such as Natural Language Processing (NLP). The first method has been used by a few experts [8], who analyze the utterances based on their experience and assess verbal skills. This type of analysis is not only time consuming and costly, but it is also difficult to be objective. This is because the results analyzed by the experts involve much personal experience and varied opinions as well as established rules. To overcome these drawbacks, a second method has recently been adopted more frequently [9].
An automated analysis system equipped with software technology is a method of analyzing human language based on natural language processing. In English-speaking countries, active research studies are being carried out on automatic analysis systems, but there are few research studies about Korean language. At first, Korean Computerized Language Analysis (KCLA) was developed as a Korean automatic analysis system, followed by the development of Korean Language Analysis (KLA). However, these systems are little different from manual analysis, except that they work on a computer [10,11].
To overcome these drawbacks, KSTARS was recently developed [12]. KSTARS provides automatic analysis of the word frequency and type of morphemes and words. Compared to KLA, KSTARS differs in that it performs morphological analysis automatically, but its analysis is still limited to subjects in infancy between 2 and 5 years old. Considering that language development continues through all ages, the limitations of KSTARS are clear. In speech pathology, the language of a speaker is generally compared to the language of persons from the same age group to determine a language disorder, based on whether it is within the normal range [13]. However, since KSTARS provides only the number of words or the frequency of morphemes, its disadvantage is that it is difficult to easily discriminate between language disorders or language development degree.
In this paper, we propose a Korean automatic spoken analysis system that overcomes the problems mentioned above. Instead of manual morphological analysis of transcription data, we automatically analyzed morphemes using natural language processing techniques. Thus, we significantly reduced the time and cost spent by professionals. Since automated morphological analysis determines the type of morpheme in accordance with established rules, the subjective opinions and experience of the expert are excluded, enabling an objective evaluation. In this study, the accuracy of morphological analysis was relatively low due the fact that the targets were dialogue sentences. However, we showed that the problem caused by morphological analysis did not affect the overall performance of the experimental element.
The system also showed the assessment results of the speaker’s language ability for the relative evaluation, along with the evaluation results of the same age group. We applied the web to the system so that users could access the system without time and space restrictions. This allowed users to receive results in real time whenever the Internet was accessible. The server received the file that the user wanted to analyze through the web and performed a morphological analysis. Upon completion of the morphological analysis, a speech analysis was applied and the result was immediately provided to the user along with the average data. In addition, the proposed system can be used for language analysis for all ages, not just for specific age groups. The immediate results and age independence are the most significant differences between our system and the previously developed system [14].
Section 2 describes the whole system including web client, web server, and databases. Section 3 shows how to use the NLP system and utterance analysis system to evaluate language age. The experiments and their results are shown in Section 4 and the conclusions are presented in Section 5.

2. Materials and Methods

We developed a system to automate the assessment of Korean language ability of all ages. For this purpose, we built a database that contained the analysis results for the age group. The automated analysis system was implemented on the web for immediate use by users. Therefore, the analysis results could be obtained in real time without being spatially or temporally restricted by a web browser, wherever the Internet was available.
Figure 1 shows the overall configuration of the system proposed in this paper. This system is largely divided into a client part and a server part. In the client part, the user inputs the file through the web browser and views the analysis results. The server part carries out natural language processing and language analysis with the file received from the user. In addition, the average value of the language ability of persons of the same age as the speaker is retrieved from the database. Thus, the user can easily recognize whether the speaker’s ability is within the normal range of the corresponding age group.

2.1. Client

The client part is the interface that is manipulated when the user uses the system. The system used for this paper has a web-based interface. The web page viewed by the user is composed of a function for uploading a file, a selection of the age of the speaker, and a button for starting the analysis. To evaluate the speaker’s ability to speak, a file containing a spoken word is required. These files are referred to as transcription files. Transcription files stored in the user’s computer can be analyzed by selecting the files through the file selection button. After selecting the transcription file to analyze, the user selects the age group of the speaker. A screen shot of the web page is shown in Figure 2.
The distinguishing feature of this paper is that this system makes it possible to evaluate the speaker’s linguistic ability by showing the evaluation results of the same age group at the same time. To compare and analyze the speaker’s ability, the user should add information about the speaker’s age. The speaker’s age group is one of six categories, divided into 5–7 years old, 8–13 years old, 14–19 years old, 20–39 years old, 40–59 years old, and over 60 years old. After selecting the age group corresponding to the speaker and clicking on the analysis button, the age group’s analysis result and the speaker’s analysis result are obtained at the same time.
The results of the analysis are given in numerical form according to each evaluation item. The mean value of the same age group as the speaker is also retrieved from the database according to the evaluation item. Thus, the ability of the speaker can be easily compared with the ability of persons from the same age group. The evaluation items provided to the user are the same as those for evaluating language ability in speech pathology. The items, called dependent measurements, are as follows: Total number of utterances, Mean Length of Utterances in morpheme (MLUm), Mean Length of Utterances in words (MLUw), Total number of words (TNW), Number of different words (NDW), and Type-Token Ratio (TTR) [15].

2.2. Server

2.2.1. Web Server

In this paper, we constructed a web server using Django, which is a free open-source web application framework built on the Python language [16]. Django works the same way as is shown in Figure 3. When a request is received from HTTP, the event is processed by the corresponding method through the URL. The methods are defined in Views.py. The output of the method is displayed to the user again by HMTL and the processed data is read and written to the database. The basic screen design that the user faces is stored in the Template. The Template and its URL are linked so that the screen is displayed differently according to the URL.
Django is efficient and systematic in that it can develop both client and server simultaneously. Django provides mySQL3 as an embedded database. The internal repository is easy to use, has fast access to the database, and can provide fast results to users.

2.2.2. Database

The database stores the results of the language ability evaluation for six groups. In order to build this database, many individual transcription files were required. All transcription files were converted into evaluation values through natural language processing and language analysis, and the average evaluation values of each age group were stored in the database. We saved the file to analyze the interview with the interviewee. This is because dialogue is mainly used when assessing language ability in speech pathology. The most natural language is obtained when communicating through dialogue.
Table 1 shows the results of analyzing one person’s transcription file. In addition to this, the present study also provides analysis results by part of speech, but these results are not presented because of the page limitation. In order to construct the database, we analyzed 120 transcription files, created by 20 persons per each age group. Each indicator is described in detail in Section 3.2. This system can provide faster results by constructing a database of analysis results for all ages in the internal repository of the web.
Using the analysis result data stored in the database, the system provides an average value of each index for the same age as the interviewee. Therefore, the user can make a relative evaluation by instantly comparing the same age with that of the interviewee. In speech pathology, we compare the index of the speaker’s analysis with the average index of the same age when discriminating speech disorders [17,18]. Although there is no absolute criterion for language impairment, existing Korean automatic analysis systems provide only the results of the speaker’s analysis. However, this study has solved this problem through database construction.

3. Analysis Modules

The analysis module consists of two main parts, namely, a natural language processing module and a dialogue analysis module. The server reads the sentences one by one in the transcription file and performs the preprocessing and morphological analysis using the natural language processing module. The dialogue analysis module then uses the dependent measures to provide the evaluation results in the same way as language pathology does.

3.1. Natural Language Processing Module

The role of the natural language processing module is data preprocessing and morphological analysis. The sentences read from the transcription file contain various meaningless symbols and characters. These symbols and characters are removed because they are not used for the evaluation. For example, ‘., ,, ?, /, *, !’, indicating admiration or the end of a sentence, are removed.
When preprocessing is completed, morphological analysis is performed on the sentence. A morpheme is the smallest unit of a word that has a meaning, and if it breaks down, it loses its meaning [19]. The process of segmenting a word into a morpheme and finding a part of speech from an individual morpheme is called morphological analysis [20]. In addition, a system that automatically performs morphological analysis using a computer is called a morphological analyzer [21]. For this paper, KoNLPy was used for Korean speech analysis. KoNLPy is an open source software for Korean language processing with a Python programming language [22]. The morphological analysis package provided by KoNLPy includes Hannanum, Kkma, Komoran, Mecab, and Twitter Class. Among them, this study used Kkma Class that provides a more detailed analysis.
Table 2 shows an example of analyzing sentences using a morphological analyzer. When a sentence is analyzed, the sentence is divided into morpheme units. Each morpheme acquires a corresponding tag, which is used to evaluate language ability. Kkma used in this study provides nine parts of speech. Korean has nouns, verbs, adjectives, adverbs, determiners, exclamations, josas, and eomies.

3.2. Conversation Analysis Module

The conversation analysis module calculates the individual evaluation index based on the morphological analysis results. For this study, the following dependent measures were used to calculate the evaluation index as used in speech pathology. These indices, extracted from [23,24,25,26] with the help of language pathology experts, are as follows: Total number of utterances; Mean Length of Utterances in morpheme (MLUm), Mean Length of Utterances in words (MLUw), Total number of words (TNW), Number of different words (NDW) and Type-Token Ratio (TTR). Among them, TNW, NDW, and TTR correspond to the semantics, and MLUm and MLUw correspond to the syntax and morphology. The evaluation of each item was made using the following calculation method.
Total number of utterances = The number of utterances of the speaker
Mean   Length   of   Utterances   in   morpheme   ( MLUm )   =   T h e   t o t a l   n u m b e r   o f   m o r p h e m e s   i n   u t t e r a n c e s T o t a l   n u m b e r   o f   u t t e r a n c e s
Mean   Length   of   Utterances   in   words   ( MLUw )   =   T h e   t o t a l   n u m b e r   o f   w o r d s   i n   u t t e r a n c e s T o t a l   n u m b e r   o f   u t t e r a n c e s
Total number of words (TNW) = The total number of words in all the utterances
Number of different words (NDW) = The total number of different words in all the utterances
Type - Token   Ratio   ( TTR )   =   The   total   number   of   words   in   utterances The   total   number   of   different   words   in   utterances
In language pathology, the more we talk, the higher the language ability. In addition, the more vocabulary that constitutes one sentence, the higher the language ability. Equations (1)–(3) reflect this. The ability to use various vocabulary words also helps to judge language ability. Equations (4)–(6) reflect this.

4. Implementation and Experiments

4.1. Transcription Utterances

The user inputs the transcription file in the same format as Table 3 to evaluate the speech ability of the speaker. Transcription files consist of turn, utterance, and utterance contents. The turn is the number of questions asked by the interviewer, and the utterance is the number of sentences answered by the interviewee. Therefore, if an interviewee answers a question with two sentences such as utterances 5 and 6, the turn does not change but the utterance is increased by one. The utterance contents refer to the contents of the interviewee’s answer.

4.2. Execution Results

Figure 4 shows the results of the analysis. The six indicators obtained from the results are as described in Section 3.2. From this screen, it is easy to evaluate the individual value of the speaker and the average value of the same age group at the same time, so that the relative evaluation is easy to obtain. Figure 4a shows the evaluation results of a speaker with a slightly higher language ability than the speakers of the same age. For all indicators, including the total number of utterances, the speaker has a slightly higher language level than the average for persons of the same age. On the other hand, the speaker in Figure 4b shows a slightly lower language level than speakers from the same age group. However, this difference does not necessarily mean that the speaker needs language therapy.

4.3. Comparison with Manual Analysis

The main difference between manual analysis and automatic analysis is whether the morphological analysis is done by a person or a machine. One of the main reasons why clinical researchers are reluctant to perform automatic analysis is that morphological analysis is less accurate. The accuracy of the morphological analysis of spoken sentences is known to remain at 70%. However, in this study, the level of language is analyzed through the relative comparison of individual indicators. Therefore, the consistency of the relative position of indicator values is more important than the accuracy of the morphological analysis itself. Table 4 shows the correlation coefficients between human-derived and machine-derived results for individual indicators. The correlation is calculated from 0.87 to 1. In other words, morphological analysis by a computer does not affect the quality of the whole analysis.
In addition, since the morphological analysis is performed by one algorithm, there is no inconsistency in the analysis result according to the analyst. Therefore, a more objective and consistent analysis is possible. Finally, the time required for analysis differs to such an extent that it cannot be compared. Therefore, the automatic analysis system proposed in this study shows that the efficiency is very high, while the accuracy is never lower than the manual analysis.

5. Conclusions

In this paper, we propose a system for automatically analyzing the language level of Korean utterances. For this purpose, we built a database of results for all ages. Therefore, by using this system, it is possible to evaluate not only the linguistic ability of the speaker but also the average ability of persons of the same age as the speaker. The results from the automatic computer analysis are almost the same as those from the human analysis. Therefore, the problems of cost, time, and inconsistency caused by human analysis have been solved.
Future research studies are considering the automatic construction of transcription files. Much has been automated using a morphological analyzer, but it is still a manual build phase for transcription files. This also requires a lot of time and money. Therefore, we intend to build an automatic transcription system using the speech recognition module that is currently developed and publicly available.

Author Contributions

Conceptualization, J.H.Y. and Y.-S.K.; Data curation, Y.-K.L.; Formal analysis, H.-J.S.; Funding acquisition, Y.-S.K.; Methodology, J.-D.K.; Project administration, Y.-S.K.; Resources, C.-Y.P.; Software, J.-E.C.; Supervision, Y.-S.K.; Validation, J.-E.C.; Writing – original draft, J.-E.C.; Writing – review and editing, Y.-S.K.

Funding

This work was funded by the National Research Foundation of Korea Grant funded by the Korean Government (NRF-2017M3C4A7068188) and by the Hallym Leading Research Group Support Program of 2017 (No. HRF-LGR-2017-0002).

Acknowledgments

Byoung-Doo Oh gave his technical support about Django programming and Ji-Su Kim helped this research by implementing language evaluation indices.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kim, J.-Y.; Yi, S.-H. Infants’ verbal, and nonverbal interaction with their teacher and peers in daycare center. J. Korean Psychol. Assoc. 1997, 10, 57–73. [Google Scholar]
  2. Crystal, D. Speaking of Writing and Writing of Speaking; Pearson: Harlow, UK, 2005. [Google Scholar]
  3. Kim, Y.-T. A Basic Study on Utterance Length of Korean Infants of 2-4 Years Olds. Commun. Sci. Disord. 1997, 2, 5–26. [Google Scholar]
  4. Lee, H.-J.; Kim, Y.-T. Measures of Utterance Length of Normal and Language-Delayed Children. Korean J. Communi. Disord. 1999, 4, 153–166. [Google Scholar]
  5. Kwon, M.-S.; Kim, H.-H.; Choi, S.-S.; Na, D.-L.; Lee, K.-H. A Study for Analyzing Spontaneous Speech of Korean Adults with CIU Scoring System. Korean J. Communi. Disord. 1998, 3, 35–49. [Google Scholar]
  6. Shin, J.-Y. Phoneme and Syllable Frequencies of Korean Based on the Analysis of Spontaneous Speech Data. Commun. Sci. Disord. 2008, 13, 193–215. [Google Scholar]
  7. Lee, M.-S.; Kim, H.-H. Cognitive-Pragmatic Language Assessment for Normal Aging: Study of Assessment Tools and Content Validity. J. Korea Contents Assoc. 2012, 12, 280–292. [Google Scholar] [CrossRef]
  8. Yang, S.-H.; Kim, Y.-S. A high-speed Korean morphological analysis method based on pre-analyzed partial words. J. KIISE Software Appl. 2000, 27, 290–301. [Google Scholar]
  9. Yoo, J.-H.; Lee, J.-H.; Lee, G.B. Post—Processing for Character Recognition Using Morphological Analysis and Linguistic Evaluation. J. KIISE 1995, 22, 880–891. [Google Scholar]
  10. Pae, S.-Y.; Kim, K.-S.; Sung, K.-H.; Sung, J.-A. Korean Computerized Language Analysis 1.0. Commun. Sci. Disord. 1998, 3, 123–138. [Google Scholar]
  11. Ha, S.-H.; Seol, A.-Y.; So, J.-M.; Pae, S.-Y. Speech and language development patterns of Korean two-year-old children from analysis of spontaneous utterances. Commun. Sci. Disord. 2016, 21, 47–59. [Google Scholar] [CrossRef]
  12. Yoon, M.-S.; Kim, S.-J.; Kim, J.-M.; Chang, M.-S.; Chac, J.-E. Reliable Sample Size for Mean Length of Utterance Analysis in Preschooler. Korean J. Communi. Disord. 2013, 18, 368–378. [Google Scholar] [CrossRef]
  13. Hwang, S.-S. A Study on Language Characteristics of Children according to ages in Culturally Diverse Family Environments. J. Spec. Educ.: Theor. Pract. 2009, 10, 265–281. [Google Scholar]
  14. Price, L.H.; Hendricks, S.; Cook, C. Incorporating Computer-Aided Language Sample Analysis into Clinical Practice. Lang. Speech Hear. Serv. Sch. 2010, 41, 206–222. [Google Scholar] [CrossRef]
  15. Kim, J.-S.; Kim, Y.-S. Language dependent value automatic analysis program for efficient language analysis. In Proceedings of the Korean International Statistical Society (KISS 2017), Seoul, Korea, 10–11 November 2017; pp. 1901–1903. [Google Scholar]
  16. Django. Available online: https://www.djangoproject.com/ (accessed on 20 September 2018).
  17. Kim, Y.-J.; Pae, S.-Y. Narrative Abilities of Korean Children with and without Specific Language Impairment. Korean J. Dev. Psychol. 2004, 17, 41–58. [Google Scholar]
  18. Ahn, J.-S.; Kim, Y.-T. The Effect of Syntactic Complexity on Sentence Repetition Performance and Intelligibility between Specific Language Impairment and Normal Children. Speech Sci. 2000, 7, 262–275. [Google Scholar]
  19. Yang, J.-H. The Concept of the Morpheme and the Description on the History of the Korean Language. Korean Culture 4th 2004, 12, 1–19. [Google Scholar]
  20. Lee, J.-S. Three-step probabilistic model for Korean morphological analysis. J. KIISE Software Appl. 2011, 38, 257–268. [Google Scholar]
  21. Kang, S.-S.; Zhang, B.-T. A General Morphological Analyzer and Spelling Checker for the Korean Language Using Syllable Characteristics. J. KIISE 1996, 23, 530–539. [Google Scholar]
  22. Park, E.-J.; Cho, S.-Z. KoNLPy: Korean natural language processing in Python. In Proceedings of the 26th Human & Cognitive Language Technology (HCLT 2014), Chuncheon, Korea, 11–14 October 2014; pp. 133–136. [Google Scholar]
  23. Kim, Y.-T. A Basic Study on Utterance Length of Korean 2–4 aged Children. Korean J. Communi. Disord. 1997, 2, 5–25. [Google Scholar]
  24. Owens, R.E. Language Disorders: A Functional Approach to Assessment and Intervention, 6th ed.; Pearson: Harlow, UK, 2013. [Google Scholar]
  25. Choi, J.; Lee, Y. Conversational Turn-Taking and Topic Manipulation Skills of Children with High-Functioning Autism Spectrum Disorders. Commun. Sci. Disord. 2013, 18, 12–23. [Google Scholar] [CrossRef]
  26. Choi, J.; Lee, Y. Contingency and Informativeness of Topic Maintenance in Children with High-Functioning Autism Spectrum Disorders. Commun. Sci. Disord. 2015, 20, 413–423. [Google Scholar] [CrossRef]
Figure 1. Configuration of the whole system.
Figure 1. Configuration of the whole system.
Applsci 09 00594 g001
Figure 2. Screen shot of the client part.
Figure 2. Screen shot of the client part.
Applsci 09 00594 g002
Figure 3. Django structure.
Figure 3. Django structure.
Applsci 09 00594 g003
Figure 4. Result screen of our system. (a) shows the result of an interviewee having a higher language ability than his/her age. However, (b) shows the opposite case.
Figure 4. Result screen of our system. (a) shows the result of an interviewee having a higher language ability than his/her age. However, (b) shows the opposite case.
Applsci 09 00594 g004
Table 1. Example of the analysis results from a single transcription file.
Table 1. Example of the analysis results from a single transcription file.
Evaluation ItemNumber
Total number of utterances57
Syntax/Morpheme
Mean Length of Utterances in morpheme (MLUm)
Mean Length of Utterances in words (MLUw)
14.05
5.6
Meaning
Total number of words (TNW)
Number of different words (NDW)
Type-Token Ratio (TTR)
801
290
2.76
Table 2. Results of analyzing sentences using a morphological analyzer.
Table 2. Results of analyzing sentences using a morphological analyzer.
InputOutput
‘아 뭐 이야기할걸 잘 모르겠네’(‘
I don’t know what to say.’)
[(‘뭐’, ‘NP’), (‘이야기’, ‘NNG’), (‘하’, ‘XSV’), (‘ㄹ’, ‘ETD’), (‘것’, ‘NNB’), (‘을’, ‘JKO’), (‘잘’, ‘MAG’), (‘모르’, ‘VV’), (‘겠’, ‘EPT’), (‘네’, ‘EFN’)]
‘아 우리 할아버지랑 나랑 우리 딸네 집에 있어’
(‘My grandfather and I are at my daughter’s house.’)
[(‘우리’, ‘NP’), (‘할아버지’, ‘NNG’), (‘랑’, ‘JC’), (‘나’, ‘NP’), (‘랑’, ‘JC’), (‘우리’, ‘NP’), (‘딸’, ‘NNG’), (‘네’, ‘XSN’), (‘집’, ‘NNG’), (‘에’, ‘JKM’), (‘있’, ‘VV’), (‘어’, ‘ECD’)]
‘아 우리 할아버지랑 아주 잘지내’
(‘I am very well with my grandfather.’)
[(‘우리’, ‘NP’), (‘할아버지’, ‘NNG’), (‘랑’, ‘JC’), (‘아주’, ‘MAG’), (‘잘’, ‘MAG’), (‘지내’, ‘VV’)
Table 3. Example of transcription file.
Table 3. Example of transcription file.
OOO (75 years old)
TurnUtteranceContents
11Ah. I do not know what to say.
22Oh, my grandfather and I are at my daughter’s house.
33Oh, I am very well with my grandfather.
44Oh, my grandchild works well in sk and got married to a woman who worked for Samsung and was doing very well.
5Oh, I am happy.
56Oh, I want to see my grandchildren and grandchildren’s daughter-in-law.
67Oh yeah, I want to see them every day.
78Oh, that is right.
89Ah, we usually go out to church.
910Oh, and I buy meat with my family and eat meat.
Table 4. Comparison between machine and human analysis.
Table 4. Comparison between machine and human analysis.
DivisionTotal Number of UtterancesMLUmMLUwTNWDNWTTR
All ages10.96659510.925290.954850.86812
5 to 7 years old10.99579910.995640.99340.94724
8 to 13 years old10.99859210.998630.993750.98357
14 to 19 years old10.99880910.994070.996020.83191
Adult (20s to 39s)10.99612510.99630.984760.89463
Adult (40s to 59s)10.99897510.998660.99340.98355
Adult (over 60s)10.9998110.999810.999490.99195

Share and Cite

MDPI and ACS Style

Song, H.-J.; Choi, J.-E.; Lee, Y.-K.; Yoon, J.H.; Kim, J.-D.; Park, C.-Y.; Kim, Y.-S. A Web Service for Evaluating the Level of Speech in Korean. Appl. Sci. 2019, 9, 594. https://doi.org/10.3390/app9030594

AMA Style

Song H-J, Choi J-E, Lee Y-K, Yoon JH, Kim J-D, Park C-Y, Kim Y-S. A Web Service for Evaluating the Level of Speech in Korean. Applied Sciences. 2019; 9(3):594. https://doi.org/10.3390/app9030594

Chicago/Turabian Style

Song, Hye-Jeong, Ji-Eun Choi, Yoon-Kyoung Lee, Ji Hye Yoon, Jong-Dae Kim, Chan-Young Park, and Yu-Seop Kim. 2019. "A Web Service for Evaluating the Level of Speech in Korean" Applied Sciences 9, no. 3: 594. https://doi.org/10.3390/app9030594

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop