Next Article in Journal
Financial Time Series Modelling Using Fractal Interpolation Functions
Previous Article in Journal
Numerical Study of Velocity and Mixture Fraction Fields in a Turbulent Non-Reacting Propane Jet Flow Issuing into Parallel Co-Flowing Air in Isothermal Condition through OpenFOAM
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Readability across Time and Languages: The Case of Matthew’s Gospel Translations

by
Emilio Matricciani
Dipartimento di Elettronica, Informazione e Bioingegneria (DEIB), Politecnico di Milano, 20133 Milan, Italy
AppliedMath 2023, 3(2), 497-509; https://doi.org/10.3390/appliedmath3020026
Submission received: 30 April 2023 / Revised: 2 June 2023 / Accepted: 13 June 2023 / Published: 14 June 2023

Abstract

:
We have studied how the readability of a text can change in translation by considering Matthew’s Gospel, written in Greek, translated into Latin and 35 modern languages. We have found that the deep-language parameters C P (characters per word), P F (words per sentence), I P (words per interpunctions), M F (interpunctions per sentence) and a universal readability index G U   of each translation are so diverse from language to language, and even within a given language for which there are many versions of Matthew—such as in English and Spanish—that the resulting texts mathematically seem to be diverse. The several tens of versions of Matthew’s Gospel studied appear to address very diverse audiences. If a reader could understand all of them well, he/she would have the impression of reading texts written by diverse authors, although all of them tell the same story.

1. A Common Ground for Measuring Readability across Time and Languages

Readability formulae [1,2,3,4,5,6,7,8,9] are applicable to any alphabetical language. Based on the length of words and sentences, they allow for the comparison of diverse texts automatically and objectively to assess the difficulty that readers may find in reading them. From the point of view of the writer, a readability formula allows one to better match readers and texts. Many readability formulae have been proposed for English [5], but only a few have been proposed for a very small number of other languages [9].
In response to the need to provide a readability formula applicable to any alphabetical language and the interest to compare the readability of texts written in diverse languages, including translations, in Reference [10], we have proposed a universal readability index which includes, for the first time, an estimate of the short-term memory (STM) processing capacity of readers. This capacity seems to be linked to the response of readers’ STM to the partial stimuli contained in a sentence, singled out by interpunctions, a process described by the so-called word interval I P , i.e., the number of words between two following interpunctions [11,12]. In fact, all other readability formulae neglect the empirical connection found between readers’ STM processing capacity (described by Miller’s 7 ± 2 law [13]) and the word interval I P , which appears, at least empirically, justified and natural [10,11,12,14,15,16].
The possibility of comparing the readability of texts written in any alphabetical language, including ancient languages such as Greek and Latin, can be very useful in cognitive psychology, theory of communication, information theory, phonics and linguistics. Before Reference [10], however, scholars had never considered I p , but the scatterplots of I p versus any readability index show that texts sharing the same readability index can have very diverse values of I p [11]. Now, it is unlikely that I P has no impact on reading difficulty; therefore, by introducing I P in the mathematical definition of a readability index, the readability index is better “fine-tuned” to readers and reveals their STM processing capacity.
In Reference [10], the global readability index G that was developed from Italian [12] was transformed into the universal readability index G U after showing that the popular Flesch’s Reading Ease Index [1] and the Automated Readability Index [2], largely used in assessing the readability of English texts, are connected to G .
As the title of this article claims, our purpose is to assess whether the translation of an ancient text written in Greek—namely, in our exercise, the Gospel according to Matthew—into Latin and 35 modern languages may reveal similarities or differences in the reading ability required to readers for understanding it across time and translation. The mathematical parameters we use for this study are the deep-language parameters recalled in Section 2 and the universal readability index recalled in Section 3. We have studied Matthew’s Gospel because of its great heritage value and its multiple translations [14].
After this introductory section, Section 2 deals with Matthew’s Gospel translations in Latin and in 35 modern languages, together with their representation in a vector plane; Section 3 deals with a universal readability index in translated texts; Section 4 deals with many diverse translations of Matthew within the same language (English and Spanish); finally, Section 5 summarizes some final remarks and proposes further developments.

2. Matthew’s Gospel Translations and the Vector Plane of Deep-Language Parameters

The defined deep-language parameters C P (number of characters per word), P F (number of words per sentence), I P (word interval, number of words per interpunctions) and M F (number of interpunctions per sentence) [11] are listed in Table 1 alongside their average values in the original Greek Matthew and its translation into Latin and 35 modern languages [14].
Notice that verses 1.1 to 1.17 (genealogy) have been deleted every time for not biasing the statistics of P F , I P and M F [14,16,17], as with all titles, notes and other characters not belonging to the pure text. This work was done manually with WinWord.
Similar to in References [14,15,16,17], the 28 chapters of Matthew were statistically weighted with the fraction of total words. For example, the original Greek Matthew contains 18 , 121 total words; therefore, each sample does not weigh 1 / 128 = 0.0078 but the number of its words divided by the total words: Chapter 5, for example, is made of 824 words; therefore, its weight is 824 / 18 , 121 = 0.0455 . This choice is mandatory to avoid a short chapter (or, in general, a short text) affecting the statistical results as though it were a long one.
The averages of the four deep-language parameters can be used to represent each text graphically in the first Cartesian quadrant [11], thereby allowing a first assessment of whether texts are mathematically similar or “close” according to these parameters.
Let us consider the following six vectors of the indicated components, made of the average values of Table 1 ,   R 1 = ( C P , P F ), R 2 = ( M F , P F ), R 3 = ( I P , P F ), R 4 = ( C P , M F ), R 5 = ( I P , M F ), R 6 = ( I P , C P ), and their resulting ending point:
R = k = 1 6 R k
Figure 1 shows the ending point of vector (1) of the original Greek text and its translations. We notice a very large spread, as discussed in Reference [14], because most translations are not verbatim, although they all maintain the conventional division in chapters and verses of the Greek text. The spread is so large to be almost identical to that shown by seven centuries of Italian literature [11]. In other words, the texts of Table 1, mathematically, could describe an entire culture’s literature, not a single text.
For two texts to be mathematically similar, the vicinity in this plane is only a necessary condition. Because based on averages only, the “vicinity” of two texts should be further studied and assessed with Communication Theory involving the “sentences channel”, the “interpunctions channel” and the “likeness index”, as shown in [12,14,15,16].
After showing this large spread, let us recall and apply the concept of a universal readability index proposed in Reference [10] to Matthew’s translations.

3. Universal Readability Index in Translations

As shown in Reference [10], the universal readability index is given by the following formula:
G U = G 6 I P 6
with G :
G = 89 10 k C P + 300 / P F
k = C P , I T A / C P
The symbol < > indicates average values, as with those reported in Table 1; C P , I T A refers to Italian. Notice that since, in the following, we calculate Equation (4) according to average values, hence k C P = C p , I T A = 4 .281 for all languages.
The rationale for this latter choice is that C P is a parameter typical of a language which, if not scaled before calculating Equation (2), would bias any readability index without really quantifying the change in reading difficulty experimented by readers used to reading, in their language, shorter or longer words than those found in Italian. This scaling, therefore, avoids changing G and G U   for the only reason that a language has, on average, words shorter or longer than their Italian counterparts. In any case, as shown in Reference [11], C p affects readability much less than P F .
Figure 2 shows the scatterplot of I P versus P F concerning the translations listed in Table 1. Since G is mostly affected by P F , the rationale for introducing I P in Equation (2) becomes clear [10,11]. For example, if P F 15 , I P can range from 4 to 8.5, practically by Miller’s range of 7 ± 2 .
In conclusion, a readability formula that does not consider I p assumes, unrealistically, that a text with a given readability index is readable, with the same effort, both by readers who display powerful STM processing capacity (large I p ) and by readers who do not (small I p ).
Equation (2) sets G U = G for I P = 6 —a center value for texts according to the equivalence between I P and Miller’s 7 ± 2 law, see [18,19,20,21,22,23,24,25,26,27]— G U < G for I P > 6 and G U > G for I P < 6 . In other words, if a text has a small word interval I P (therefore matched to readers gifted with average-to-weak STM processing capacity), then it should be read more easily than a text with the same G but larger I P (matched to readers gifted with an average-to-powerful STM processing capacity). These changes quantify and introduce important aspects of readers’ STM; therefore, G U should better “measure” relative reading difficulties, which are very likely linked to readers’ schooling.
In fact, G U can be related to the number of years of school attended in Italy’s school system [10,11]—assumed as a reference school system for any language if G U is adopted as a universal readability index—as shown in Figure 3. From it, we notice that the larger   G U , the more readable the text for a given number of school years. The continuous lines divide the quadrant into areas of the same reading performance, such as “almost unintelligible”, “very difficult”, etc. For example, the area labelled “easy” indicates all combinations of values of G U and school years required to declare a text “easy” to read. In all cases, as readers’ schooling increases, they can read more difficult texts as far as readability is concerned.
The rightmost column of Table 1 reports G U calculated with the average values of P F and I P listed there. Table 2 reports the overall average values and ranges of P F , I P and G U calculated from Table 1. We can notice a large range for each parameter. In other words, if there were a reader who could understand all languages equally well, he/she would experiment with quite diverse reading difficulties, not to mention he/she would read texts which would appear diverse, with diverse lengths of sentences and diverse distribution of interpunctions, although they all convey the same story.
For example, the Cebuano version (least readable) requires at least 10 years of schooling to be considered “easy” to read (Figure 3), while the Polish version (most readable) requires less than 5.
Figure 4 shows the scatterplot of G U versus P F and Figure 5 shows the scatterplot of G U versus I P . Figure 4 clearly shows, in agreement with Equation (2), that G U decreases as P F increases. Figure 5 enhances the linear decrease of G U as I P increases. In all cases, notice that translations show values that are quite diverse compared to that of the original Greek text.
The large spread found in translating Matthew’s Gospel into diverse languages is also found in translating it into the same language by diverse teams of translators—as is the case for the multiple versions in English and Spanish—discussed in the next section.

4. Many Diverse Translations within the Same Language

It may be curious to notice that within the same language, there are many diverse translations of the Bible. In most cases, the reason for this proliferation is not that the existing translation has become obsolete because of particularly convoluted sentences and/or old words no longer clearly understood. The reason, as it becomes clear in this section, is that each translation seems to be addressed to diverse contemporary readers. The most striking cases are English and Spanish, for which there are tens of translations, some of which we have studied.
Table 3 lists 40 English translations and their parameters, including the classical King James (1611), Table 4 lists 15 Spanish translations, including the classical Reina Valera Antigua. Both sets are freely available, along with other translations, on the website https://www.biblegateway.com/versions/ (accessed on 15 April 2023) The deep-language parameters and the universal readability indices calculated for each translation are extremely variable, especially in English.
Figure 6 shows the scatterplot of I P versus P F concerning the English (black circles, Table 3) and Spanish (green circles, Table 4) translations. Again, we notice I P can vary in the full Miller’s range 7 ± 2 and the necessity and usefulness of introducing I P in readability formulae.
Figure 7 shows the ending point of vector (1) of the original Greek text compared with the English and Spanish translations. The spread is similar to that found in Figure 1 concerning diverse languages.
Table 5 and Table 6 report the main statistics for these two sets. As noticed, the English translations are more varied than the Spanish ones, a find confirmed in Figure 8 and Figure 9 showing the scatterplot of G U versus P F and G U versus I P .
From Table 5 and Figure 3, in English, the least readable version (New American Standard Bible) requires more than 13 years of schooling for “easy” reading; the most readable version (New International Reader’s Version) only requires 6 years. The historical King James’ version (1611) requires 8 years.
Similarly, in Spanish (Table 6 and Figure 3), the least readable version (Palabra de Dios para todos) requires 9 years of schooling for “easy” reading, while the most readable version (Spanish Blue Red and Gold Letter Edition) requires about 7. The historical Reina Valera Antigua (1602) requires 7 years of schooling.
In conclusion, multiple translations within the same languages are likely addressed to readers with very different STM processing capacities and schooling.

5. Conclusions

We have studied how the readability of a text can change in translation. To arrive at a sound conclusion, we have considered the translation of an ancient text written in Greek—the Gospel according to Matthew (very important for its heritage value)—into Latin and 35 modern languages. We have found that the deep-language parameters C P , P F , I P , M F and a universal readability index G U   of each translation are so diverse from language to language, and even within a given language for which there are many versions of Matthew—such as in English and in Spanish—that the resulting texts mathematically seem to be diverse texts. All the analyses performed confirm this extremely marked scattering, which does not depend on the year of translation.
What is really striking is the large range of the readability index, which does not depend on the year of translation. This index “measures” how difficult a text is to read and how many years of schooling it requires for “easy” reading. The several tens of versions of Matthew’s Gospel studied here appear to address very diverse audiences. If a reader could understand all the 37 languages considered well, he/she, on the one hand, would be surely guided by the traditional division in chapters and verses of Matthew common to all versions, but on the other hand, would also have the impression of reading texts written by diverse authors, although all of them tell the same story.
Future work should study how readability changes in translations of modern novels, which tend to be more verbatim than that of Matthew and, in general, of the New Testament.

Funding

This research received no external funding.

Data Availability Statement

Data are available from the author upon request.

Acknowledgments

The author warmly thanks all those scholars who, with continuous great care and dedication, keep the texts of the Bible in many languages online for the benefit of everyone, specifically Bible Gateway, Perseus Digital Library and the Vatican.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Flesch, R. A New Readability Yardstick. J. Appl. Psychol. 1948, 32, 222–233. [Google Scholar] [CrossRef] [PubMed]
  2. Kincaid, J.P.; Fishburne, R.P.; Rogers, R.L.; Chissom, B.S. Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted Personnel; Research Branch Report 8-75; Chief of Naval Technical Training; Naval Air Station: Memphis, TN, USA, 1975. [Google Scholar]
  3. DuBay, W.H. The Principles of Readability; Impact Information: Costa Mesa, CA, USA, 2004. [Google Scholar]
  4. Bailin, A.; Graftstein, A. The linguistic assumptions underlying readability formulae: A critique. Lang. Commun. 2001, 21, 285–301. [Google Scholar] [CrossRef]
  5. DuBay, W.H. The Classic Readability Studies; Impact Information: Costa Mesa, CA, USA, 2006. [Google Scholar]
  6. Zamanian, M.; Heydari, P. Readability of Texts: State of the Art. Theory Pract. Lang. Stud. 2012, 2, 43–53. [Google Scholar] [CrossRef]
  7. Benjamin, R.G. Reconstructing Readability: Recent Developments and Recommendations in the Analysis of Text Difficulty. Educ. Psycol. Rev. 2012, 24, 63–88. [Google Scholar] [CrossRef]
  8. Collins-Thompson, K. Computational Assessment of Text Readability: A Survey of Past, in Present and Future Research, Recent Advances in Automatic Readability Assessment and Text Simplification. ITL Int. J. Appl. Linguist. 2014, 165, 97–135. [Google Scholar] [CrossRef]
  9. Kandel, L.; Moles, A. Application de l’indice de Flesch à la langue française. Cah. Etudes De Radio-Télév. 1958, 19, 253–274. [Google Scholar]
  10. Matricciani, E. Readability Indices Do Not Say It All on a Text Readability. Analytics 2023, 2, 296–314. [Google Scholar] [CrossRef]
  11. Matricciani, E. Deep Language Statistics of Italian throughout Seven Centuries of Literature and Empirical Connections with Miller’s 7 ∓ 2 Law and Short-Term Memory. Open J. Stat. 2019, 9, 373–406. [Google Scholar] [CrossRef] [Green Version]
  12. Matricciani, E. A Statistical Theory of Language Translation Based on Communication Theory. Open J. Stat. 2020, 10, 936–997. [Google Scholar] [CrossRef]
  13. Miller, G.A. The Magical Number Seven, Plus or Minus Two. Some Limits on Our Capacity for Processing Information. Psychol. Rev. 1955, 63, 343–352. [Google Scholar]
  14. Matricciani, E. Linguistic Mathematical Relationships Saved or Lost in Translating Texts: Extension of the Statistical Theory of Translation and Its Application to the New Testament. Information 2022, 13, 20. [Google Scholar] [CrossRef]
  15. Matricciani, E. Multiple Communication Channels in Literary Texts. Open J. Stat. 2022, 12, 486–520. [Google Scholar] [CrossRef]
  16. Matricciani, E. Capacity of Linguistic Communication Channels in Literary Texts: Application to Charles Dickens’ Novels. Information 2023, 14, 68. [Google Scholar] [CrossRef]
  17. Matricciani, E.; Caro, L.D. A Deep-Language Mathematical Analysis of Gospels, Acts and Revelation. Religions 2019, 10, 257. [Google Scholar] [CrossRef] [Green Version]
  18. Baddeley, A.D.; Thomson, N.; Buchanan, M. Word Length and the Structure of Short-Term Memory. J. Verbal Learn. Verbal Behav. 1975, 14, 575–589. [Google Scholar] [CrossRef]
  19. Cowan, N. The magical number 4 in short-term memory: A reconsideration of mental storage capacity. Behav. Brain Sci. 2000, 24, 87–114. [Google Scholar] [CrossRef] [Green Version]
  20. Pothos, E.M.; Joula, P. Linguistic structure and short-term memory. Behav. Brain Sci. 2000, 24, 138–139. [Google Scholar] [CrossRef]
  21. Jones, G.; Macken, B. Questioning short-term memory and its measurements: Why digit span measures long-term associative learning. Cognition 2015, 144, 1–13. [Google Scholar] [CrossRef] [Green Version]
  22. Saaty, T.L.; Ozdemir, M.S. Why the Magic Number Seven Plus or Minus Two. Math. Comput. Model. 2003, 38, 233–244. [Google Scholar] [CrossRef]
  23. Mathy, F.; Feldman, J. What’s magic about magic numbers? Chunking and data compression in short-term memory. Cognition 2012, 122, 346–362. [Google Scholar] [CrossRef]
  24. Chen, Z.; Cowan, N. Chunk Limits and Length Limits in Immediate Recall: A. Reconciliation, J. Exp. Psychol. Mem. Cogn. 2005, 31, 1235–1249. [Google Scholar] [CrossRef] [Green Version]
  25. Chekaf, M.; Cowan, N.; Mathy, F. Chunk formation in immediate memory and how it relates to data compression. Cognition 2016, 155, 96–107. [Google Scholar] [CrossRef] [Green Version]
  26. Barrouillest, P.; Camos, V. As Time Goes By: Temporal Constraints in Working Memory. Curr. Dir. Psychol. Sci. 2012, 21, 413–419. [Google Scholar] [CrossRef]
  27. Conway, A.R.A.; Cowan, N.; Michael FBunting, M.F.; Therriaulta, D.J.; Minkoff, S.R.B. A latent variable analysis of working memory capacity, short-term memory capacity, processing speed, and general fluid intelligence. Intelligence 2002, 30, 163–183. [Google Scholar] [CrossRef]
Figure 1. Scatter plot of vector (1) coordinates concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square.
Figure 1. Scatter plot of vector (1) coordinates concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square.
Appliedmath 03 00026 g001
Figure 2. Scatterplot of I P versus P F concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square. Notice that for a fixed P F , I P extends in the full Miller’s range 7 ± 2 .
Figure 2. Scatterplot of I P versus P F concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square. Notice that for a fixed P F , I P extends in the full Miller’s range 7 ± 2 .
Appliedmath 03 00026 g002
Figure 3. Universal readability index G U , as a function of the number of school years attended in Italy (see References [10,11] and the further citations there reported), as reference for all languages. The continuous lines divide the quadrant into areas of the same performance of texts. Elementary school lasts 5 years; Junior High School lasts 3 years; High School lasts 5 years. Kids stay at school until they are 19 years old.
Figure 3. Universal readability index G U , as a function of the number of school years attended in Italy (see References [10,11] and the further citations there reported), as reference for all languages. The continuous lines divide the quadrant into areas of the same performance of texts. Elementary school lasts 5 years; Junior High School lasts 3 years; High School lasts 5 years. Kids stay at school until they are 19 years old.
Appliedmath 03 00026 g003
Figure 4. Scatter plot of G U versus P F concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square.
Figure 4. Scatter plot of G U versus P F concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square.
Appliedmath 03 00026 g004
Figure 5. Scatter plot of G U versus I P concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square.
Figure 5. Scatter plot of G U versus I P concerning the translations in Table 1. The original Greek Matthew is indicated with the red square; Latin: blue square; Spanish: green square; English: black square.
Appliedmath 03 00026 g005
Figure 6. Scatter plot of I P versus P F concerning the English (black circles, Table 3) and Spanish (green circles, Table 5) translations. The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602).
Figure 6. Scatter plot of I P versus P F concerning the English (black circles, Table 3) and Spanish (green circles, Table 5) translations. The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602).
Appliedmath 03 00026 g006
Figure 7. Scatter plot of vector (1) coordinates concerning the English (black circles, Table 3) and Spanish translations (green circles, Table 5). The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602). The red square indicates the original Greek Matthew.
Figure 7. Scatter plot of vector (1) coordinates concerning the English (black circles, Table 3) and Spanish translations (green circles, Table 5). The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602). The red square indicates the original Greek Matthew.
Appliedmath 03 00026 g007
Figure 8. Scatter plot of G U versus P F concerning the English (black circles, Table 2) and Spanish (green circles, Table 3) translations. The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602).
Figure 8. Scatter plot of G U versus P F concerning the English (black circles, Table 2) and Spanish (green circles, Table 3) translations. The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602).
Appliedmath 03 00026 g008
Figure 9. Scatter plot of G U versus I P concerning the English (black circles, Table 3) and Spanish (green circles, Table 5) translations. The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602).
Figure 9. Scatter plot of G U versus I P concerning the English (black circles, Table 3) and Spanish (green circles, Table 5) translations. The cyan diamond indicates King James (1611); the blue diamond indicates Reina Valera Antigua (1602).
Appliedmath 03 00026 g009
Table 1. Average values of C P (characters per word), P F (words per sentence), I P (words per interpunction), M F (interpunctions, or number of I P , per sentence), G U (universal readability index, recalled and calculated in Section 3) in Matthew in the indicated languages and listed according to their belonging to a language family. The source of these texts is reported in Reference [14].
Table 1. Average values of C P (characters per word), P F (words per sentence), I P (words per interpunction), M F (interpunctions, or number of I P , per sentence), G U (universal readability index, recalled and calculated in Section 3) in Matthew in the indicated languages and listed according to their belonging to a language family. The source of these texts is reported in Reference [14].
LanguageLanguage Family C p P F I P M F G U
1. GreekHellenic4.89620.2707.1822.82553.90
2. LatinItalic5.13217.4505.0513.46569.08
3. EsperantoConstructed4.46819.5864.7984.08368.72
4. FrenchRomance4.17017.9177.4682.40454.13
5. Italian4.28117.5345.9822.93163.41
6. Portuguese4.44514.3025.2922.70971.41
7. Romanian4.28316.4956.4932.54261.42
8. Spanish4.24617.6056.0912.90362.68
9. DanishGermanic4.12314.2665.8382.44968.19
10. English4.03117.7746.9472.57857.39
11. Finnish5.84415.5094.7843.24872.83
12. German4.64616.4035.8052.83065.65
13. Icelandic4.38114.3635.3312.70671.09
14. Norwegian4.07315.3777.4742.06256.86
15. Swedish4.20215.0457.5132.01257.05
16. BulgarianBalto-Slavic4.37514.2845.5222.59070.06
17. Czech4.51912.3264.7552.60578.00
18. Croatian4.37913.6745.3072.58572.29
19. Polish5.08811.4864.5312.53681.12
20. Russian4.66518.1854.1074.42674.05
21. Serbian4.19613.9435.6142.48470.02
22. Slovak4.63512.3304.9122.51277.05
23. Ukrainian4.59013.8914.6962.96075.61
24. EstonianUralic4.84715.4115.1992.96970.46
25. Hungarian5.28715.3474.0133.83277.66
26. AlbanianAlbanian4.03221.0136.4343.28757.86
27. ArmenianArmenian4.48715.3165.6252.72668.03
28. WelshCeltic4.07121.8635.6033.91162.29
29. BasqueIsolate4.67115.7024.6723.36573.26
30. HebrewSemitic4.12511.2615.3592.10376.68
31. CebuanoAustronesian4.69515.4648.3511.86251.48
32. Tagalog4.77916.5847.6642.16854.30
33. ChichewaNiger-Congo6.08412.4646.0672.07169.86
34. Luganda6.23612.5435.5992.25472.51
35. SomaliAfro-Asiatic5.30817.1376.0032.87263.68
36. HaitianFrench Creole3.34314.3516.1812.32666.01
37. NahuatlUto-Aztecan6.69913.7256.1972.22866.87
Table 2. Average values and range (minimum–maximum) of P F (words per sentence), I P (words per interpunction) and G U (universal readability index) found in the texts listed in Table 1.
Table 2. Average values and range (minimum–maximum) of P F (words per sentence), I P (words per interpunction) and G U (universal readability index) found in the texts listed in Table 1.
P F I P G U
Minimum 11.26
Hebrew
4.01
Hungarian
(Short STM capacity)
51.48
Cebuano
(Least readable)
Average 15.63 5.80 67.18
Maximum 21.86
Welsh
8.35
Cebuano
(Long STM capacity)
81.12
Polish
(Most readable)
Table 3. Average values of C P (characters per word), P F (words per sentence), I P (words per interpunction), M F (interpunctions, or number of I P , per sentence) and G U (universal readability index) in Matthew in the indicated English translations, updated to the indicated year. Source: https://www.biblegateway.com/versions/ (last access, 15 April 2023).
Table 3. Average values of C P (characters per word), P F (words per sentence), I P (words per interpunction), M F (interpunctions, or number of I P , per sentence) and G U (universal readability index) in Matthew in the indicated English translations, updated to the indicated year. Source: https://www.biblegateway.com/versions/ (last access, 15 April 2023).
Translation C p P F I P M F G U
New American Bible (Vatican, 2002)4.0317.786.962.5857.39
King James (1611)4.2723.515.913.9859.51
21st Century King James (1994)4.2319.666.632.9657.64
American Standard Version (1881)4.2922.516.003.7559.51
Amplified Bible (1987)4.0525.736.284.1056.18
Amplified Bible, Classic Edition (1987)4.0624.806.194.0157.17
Blue Red & Gold Bible (2012)4.2522.505.853.8460.41
Christian Standard Bible (2017)4.3216.226.742.4060.24
Common English Bible (2011)4.3213.787.271.9060.35
Complete Jewish Bible (1998)4.2916.016.902.3259.54
Contemporary English Version (1995)4.2813.087.571.7359.72
Darby Translation (1890)4.2022.976.443.5756.62
Disciples’ Literal New Testament (2011)4.2916.706.652.5160.26
Douay-Rheims 1899 American Edition4.2119.755.993.3061.45
Easy-to-Read Version (2006)4.1611.647.451.5663.09
Evangelical Heritage Version (2019)4.2215.276.942.2060.22
English Standard Version (2016)4.1518.416.912.6657.00
English Standard Version Anglicised (1971)4.1718.937.112.6655.36
1599 Geneva Bible4.3121.705.663.8362.04
God’s Word Translation (2020)4.2912.466.731.8565.86
Good News Translation (1992)4.1516.107.542.1355.58
Holman Christian Standard Bible (2009)4.3215.536.682.3261.41
International Standard Version (2014)4.3116.137.022.3058.66
J.B. Phillips New Testament (1972)4.1917.388.152.1350.56
Jubilee Bible 20004.1922.607.103.1852.84
Legacy Standard Bible (2021)4.1721.436.923.1054.67
Lexham English Bible (2012)4.1919.457.612.5551.93
Living Bible (1971)4.2915.056.802.2161.32
The Message (2018)4.4212.746.242.0468.30
Modern English Version (2014)4.1716.686.732.4859.79
Names of God Bible (2011)4.3012.477.101.7663.64
New American Standard Bible 19954.1921.059.982.1136.54
New Catholic Bible (2019)4.2317.047.142.3956.96
New Century Version (2005)4.1514.537.511.9457.81
New International Reader’s Version (2014)4.138.996.801.3274.75
New Life Version (2003)3.9611.898.011.4959.37
New Living Translation (2015)4.1914.767.292.0358.80
Tree of Life Version (2015)4.2216.036.582.4361.41
Worldwide English New Testament (1998)3.9310.257.141.4468.63
Wycliffe Bible (2001)4.0623.255.883.9559.82
Young’s Literal Translation (1898)4.1933.755.885.7455.80
Table 4. Average values of C P (characters per word), P F (words per sentence), I P (words per interpunction), M F (interpunctions, or number of I P , per sentence), G U (universal readability index) in Matthew in the indicated English translations, updated to the indicated year. Source: https://www.biblegateway.com/versions/ (last access, 15 April 2023).
Table 4. Average values of C P (characters per word), P F (words per sentence), I P (words per interpunction), M F (interpunctions, or number of I P , per sentence), G U (universal readability index) in Matthew in the indicated English translations, updated to the indicated year. Source: https://www.biblegateway.com/versions/ (last access, 15 April 2023).
Translation C p P F I P M F G U
La Biblia (Vatican, traducción argentina, 1990)4.2517.6156.092.9062.68
La Biblia de las Américas (1997)4.4719.375.803.3462.85
Biblia del Jubileo (2020)4.4219.805.193.8266.22
Dios Habla Hoy (1996)4.3918.236.502.8059.64
Nueva Biblia de las Américas (2005)4.4519.095.993.1961.94
Nueva Traducción Viviente (2010)4.4515.107.072.1459.62
Nueva Versión Internacional (Castilian, 2015)4.4115.336.652.3061.84
Palabra de Dios para todos (2015)4.3815.377.562.0356.37
La Palabra (España, 2010)4.4817.636.682.6459.13
La Palabra (Hispanoamérica, 2010)4.4617.696.702.6458.97
Reina Valera 19954.4818.875.743.2963.63
Traducción en lenguaje actual (2000)4.4115.586.852.2760.33
Spanish Blue Red and Gold Letter Edition (2012)4.4419.945.123.8966.51
Reina Valera Antigua (1602)4.4419.935.133.8866.46
Reina Valera Actualizada (2015)4.4616.096.252.5863.35
Table 5. Average values and range (minimum–maximum) of P F (words per sentence), I P (words per interpunction) and G U (universal readability index) found in the English translations listed in Table 3.
Table 5. Average values and range (minimum–maximum) of P F (words per sentence), I P (words per interpunction) and G U (universal readability index) found in the English translations listed in Table 3.
P F I P G U
Minimum 8.99
New International Reader’s Version
5.66
1599 Geneva Bible
(Short STM capacity)
36.54
New American Standard Bible
(Least readable)
Average 17.82 6.89 58.98
Maximum 33.75
Young’s Literal Translation
9.98
New American Standard Bible
(Long STM capacity)
74.75
New International Reader’s Version
(Most readable)
Table 6. Average values and range (minimum–maximum) of P F (words per sentence), I P (words per interpunction) and G U (universal readability index) found in the Spanish translations listed in Table 5.
Table 6. Average values and range (minimum–maximum) of P F (words per sentence), I P (words per interpunction) and G U (universal readability index) found in the Spanish translations listed in Table 5.
P F I P G U
Minimum 15.10
Nueva Traducción Viviente
5.12
Spanish Blue Red and Gold Letter Edition
(Short STM capacity)
56.37
Palabra de Dios para todos
(Least readable)
Average 17.71 6.22 61.97
Maximum 19.94
Spanish Blue Red and Gold Letter Edition
7.56
Palabra de Dios para todos
(Long STM capacity)
66.51
Spanish Blue Red and Gold Letter Edition
(Most readable)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Matricciani, E. Readability across Time and Languages: The Case of Matthew’s Gospel Translations. AppliedMath 2023, 3, 497-509. https://doi.org/10.3390/appliedmath3020026

AMA Style

Matricciani E. Readability across Time and Languages: The Case of Matthew’s Gospel Translations. AppliedMath. 2023; 3(2):497-509. https://doi.org/10.3390/appliedmath3020026

Chicago/Turabian Style

Matricciani, Emilio. 2023. "Readability across Time and Languages: The Case of Matthew’s Gospel Translations" AppliedMath 3, no. 2: 497-509. https://doi.org/10.3390/appliedmath3020026

APA Style

Matricciani, E. (2023). Readability across Time and Languages: The Case of Matthew’s Gospel Translations. AppliedMath, 3(2), 497-509. https://doi.org/10.3390/appliedmath3020026

Article Metrics

Back to TopTop