Next Article in Journal
Option Pricing with Given Risk Constraints and Its Application to Life Insurance Contracts
Previous Article in Journal
The Role of Parenting Styles in Narcissism Development: A Systematic Review and Meta-Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Translation Can Distort the Linguistic Parameters of Source Texts Written in Inflected Language: Multidimensional Mathematical Analysis of “The Betrothed”, a Translation in English of “I Promessi Sposi” by A. Manzoni

by
Emilio Matricciani
Dipartimento di Elettronica, Informazione e Bioingegneria (DEIB), Politecnico di Milano, 20133 Milan, Italy
AppliedMath 2025, 5(1), 24; https://doi.org/10.3390/appliedmath5010024
Submission received: 10 January 2025 / Revised: 8 February 2025 / Accepted: 21 February 2025 / Published: 4 March 2025

Abstract

:
We compare, mathematically, the text of a famous Italian novel, I promessi sposi, written by Alessandro Manzoni (source text), to its most recent English translation, The Betrothed by Michael F. Moore (target text). The mathematical theory applied does not measure the efficacy and beauty of texts; only their mathematical underlying structure and similarity. The translation theory adopted by the translator is the “domestication” of the source text because English is not as economical in its use of subject pronouns as Italian. A domestication index measures the degree of domestication. The modification of the original mathematical structure produces several consequences on the short–term memory buffers required for the reader and on the theoretical number of patterns used to construct sentences. The geometrical representation of texts and the related probability of error indicate that the two texts are practically uncorrelated. A fine–tuning analysis shows that linguistic channels are very noisy, with very poor signal–to–noise ratios, except the channels related to characters and words. Readability indices are also diverse. In conclusion, a blind comparison of the linguistic parameters of the two texts would unlikely indicate they refer to the same novel.

1. Introduction

Translation is the replacement of a text in one language (source text) by an equivalent text in another language (target text). Most studies on translation report results not based on mathematical analysis of texts, as we do with our mathematical/statistical theory on alphabetical languages developed in a series of papers [1,2,3,4,5,6,7,8]. When a mathematical approach is used [9,10,11,12,13], scholars consider neither Shannon’s communication theory [14] nor the fundamental connection that linguistic variables have with reader’s reading skill and short–term memory, considered instead by our theory. Very often, they refer only to one linguistic variable. The theory we have developed can be applied and unifies the study of any alphabetical language.
Of all types of translation, literary translation is maybe the most demanding and difficult because the language of literature is different from ordinary or technical language and involves many challenges on the syntactic, lexical, semantic, and stylistic levels [15,16,17,18,19,20,21,22].
Often, in translation theory studies, especially of literary texts, scholars mention the concept of “equivalence”—or “sameness”—between the source text (to be translated) and the target text (translated text) [23,24,25]. This concept, however, is loosely and never mathematically defined. On the contrary, our theory can mathematically “measure” how much texts differ from each other, with a multidimensional analysis [26], and we also apply this in the present paper.
Today, it is agreed that the translator of literary texts is an active role–player and the intermediary between the source text and the target text [17,27].
In relation to the translator’s role, however, two main theories of translation are today discussed by scholars: “domestication” and “foreignization” [27]. Domestication is a translation method in which the translator tries to match the source text to the reader, mainly to reader’s reading skills. Foreignization is a method in which the translator tries to match the reader to the source text, essentially regardless of reader’s reading skills. In other words, domestication neglects the foreign reality, while foreignization retains the foreignness and cultural otherness of a foreign text.
In translation to English, the dominant practice is mainly domestication [27] because it is much more likely for the translation to make the target text more fluent. This is achieved, however, by erasing the linguistic and cultural differences in the foreign text; therefore, text fluency becomes the general criterion to judge a translation.
In this paper, as a study case, we compare, by means of a multidimensional mathematical analysis, the text of a famous Italian novel, I promessi sposi (1840), written by Alessandro Manzoni (1785–1873), to the text of the most recent English translation, The Betrothed by Michael F. Moore [28], a masterful translation, according, e.g., to the novelist Jhumba Lahiri (see Preface in [28]).
To avoid any misunderstanding, the mathematical theory, and its multidimensional application to texts, is not concerned with the efficacy and beauty of a text—in this case, the target text—merely with its mathematical underlying structure, compared, in this case, to the Italian original text. The discussion and conclusion we reach are meant to be valid only in this context. In other words, we try to measure the “equivalence” of the two texts mathematically, just like we have done with other, often unrelated, pairs of texts in any language, even between different languages [4,5,6,7,8].
Alessandro Manzoni was an Italian poet and novelist. He is famous for the novel The Betrothed (in Italian, I promessi sposi), ranked among the masterpieces of world literature. He started to write the first version in 1821 after reading Ivanhoe by Walter Scott (1771–1832) in a French translation, and revised it in its final version in the 1830s.
I promessi sposi is a symbol of the Italian Risorgimento—the Italian XIXth century political renaissance movement which unified Italy as a country—both because of its patriotic message and because the novel was a fundamental milestone in developing the modern Italian language which Manzoni stabilized and whose unity he ensured throughout Italy. The novel is still today a required reading in Italian High Schools.
Of the two translation theories, domestication and foreignization, Moore clearly has chosen domestication. In fact, in his “Notes on the Translation”, he writes [28],
“Within the framework of fidelity to I promessi sposi, I felt adjustments were needed due to fundamental stylistic and grammatical differences between English and Italian conventions. For example, Manzoni often adopts the periodic structure for his sentences, stringing together clause after clause with a series of semicolons. Italian enables and even encourages this style, since as an inflected language, Italian embeds the person and number of the subject—the I, you, or we—in the verb. Subject pronouns are used minimally, so a series of clauses with the same subject can result in a sentence the length of a page. The period, or “full stop”, is regarded with barely concealed horror, as if it truncates an otherwise beautiful sentence. English is not as economical in its use of subject pronouns, and actually requires them, so rather than mimic the periodic style at full length, I broke his sentences up into smaller parcels, sometimes (but not always) using periods where he used semicolons, feeling that they do not impose the same sense of finality in English as they do in Italian.”
In the following sections of the paper, we will show that the replacement of semicolons with periods very much impacted the underlying mathematical structure of the translation. However, this is not the only change; other linguistic parameters have been largely modified. The aim of the paper is to show what was mathematically modified, to what extent, and the impact it has on readers. The conclusion is that a blind comparison of the linguistic parameters, including the linguistic channels, of the two texts would unlikely indicate they refer to the same novel.
After this introductory section, Section 2 shows an example of the translation (the incipit of the novel), Section 3 reports some total statistics, Section 4 reports an exploratory data analysis of linguistic variables, Section 5 defines deep language variables, Section 6 defines a geometrical representation of texts, discusses the probability of confusing a text with another and defines a domestication index, Section 7 deals with the short–term memory of readers/writers, Section 8 deals with the fine–tuning analysis performed through linguistic channels, Section 9 deals with a universal readability index, and Section 10 summarizes the main results and draws a conclusion.

2. Example of Translation: Incipit

The beginning of I promessi sposi—the first sentence of Chapter 1, “Quel ramo del lago di Como…”—is the second most celebrated incipit in the Italian Literature, the first being Dante’s Inferno incipit—“Nel mezzo del cammin di nostra vita…”. Generations of Italian High School students have memorized both and still do.
It is illuminating to read how the beginning paragraph was translated (the Italian and English paragraphs, for reader’s benefit, are presented in Appendix A). As mentioned in the introduction, here—and in the whole paper—we are not concerned with the meaning and beauty of words, nor with grammar and so on; only with the statistics of the linguistic parameters and channels, suitably defined. For this purpose, let us study how this paragraph was translated by calculating some statistics. These arid numbers, however, highlight some important changes made by the translator, which deeply affected the entire English version of the novel. Table 1 reports them.
The beginning paragraph was split into five. In the Italian paragraph, Manzoni describes Lake Como and its eastern branch, from north to south and down to the main town, Lecco. He describes coves, inlets, hills, mountains, ravines, steep inclines and flat terraces, cultivated fields, vineyards, towns, villages, roads, and footpaths. Only when he introduces the first character, Don Abbondio walking along one of the footpaths, does Manzoni start a new paragraph, after 682 words and nine long sentences with nine semicolons within them (Table 1).
He seems to act like a modern movie director who sets the stage for the movie’s first events and ends this introductory paragraph—hence starting a new paragraph—when in the close–up on a particular footpath, the first character appears, Don Abbondio.
Since a paragraph develops a single point, we understand that, in Manzoni’s mind, the single point was the detailed geographical setting of the beginning of his story. When the description is, in his plan, complete, then he starts a new paragraph.
This is not reflected by the translator. By splitting Manzoni’s setting into five paragraphs, he abandons the original sequence/climax and breaks the flow and unity that Manzoni carefully planned in the entire novel [29,30,31,32,33]. By splitting the single paragraph into five, the translator automatically distinguishes and emphasizes five points.
All interpunctions were drastically changed. The most striking change is the almost complete replacement of semicolons and colons with periods. In fact, the sum 9 + 9 + 5 = 23 gives the number of periods in the translated version. The purpose of semicolons—to separate independent clauses while connecting them as related ideas, just like Manzoni amply does—seems no longer acceptable by modern writers/readers of English texts [34,35,36]. Of course, this agrees with the translation theory flaunted by the translator (domestication), explicitly stated in his introduction to the novel, as noted in Section 1. Consequently, the linguistic structure of the Italian original text is largely modified; therefore, we should expect significant changes in the deep language parameters.

3. Total Statistics

In this section, we report the overall total statistics of the linguistic parameters. To calculate them, we used the digital text (WinWord file) and counted, for each chapter, the number of characters, words, sentences, and interpunctions (punctuation marks). Before doing so, we deleted the titles, footnotes, and other extraneous material present in the digital texts. The count is very simple, though time–consuming. WinWord directly provides the number of characters and words. The number of sentences was calculated by using WinWord to replace every period with a period (full stop); of course, this action does not change the text, but it gives the number of these substitutions and therefore the number of periods. The same procedure was repeated for question marks and exclamation marks. The sum of the three totals gives the total number of sentences. The same procedure gives the total number of commas, colons, and semicolons. The sum of these latter values with the total number of sentences gives the total number of interpunctions.
Table 2 and Table 3 report these total statistics. In English, paragraphs, sentences, and words are increased. The characters are fewer than in Italian, but not because the translator decided to do this; because English words are, on average, shorter than Italian words.
The text indicated with Italian–E refers to the Italian text in which periods replaced semicolons to simulate a “reverse” domestication translation. In this case, the number of sentences is about the same as in the English text, 14,676 compared to 14,647; therefore, the translator replaced semicolons with periods in practically the entire novel, and this replacement makes the number of words per sentence very similar to that of the Italian–E text, 16.0 compared to 15.3. English–I refers to the translation in which semicolons replaced periods. In this latter case, the number of sentences is smaller than in Italian.
In the next section, we report an exploratory data analysis of the two texts with the aim of visualizing important relations between linguistic variables and further investigate the changes made by the translator.

4. Exploratory Data Analysis: Relationships Between Italian and English Linguistic Variables

This section compares the linguistic variables per chapter of the English and Italian texts, comprising 39 samples due to the introduction (the one written by Manzoni) and 38 chapters of the novel. These scatterplots are further studied in Section 8 according to the theory of linguistic channels.
Figure 1a shows the difference between the number of paragraphs per chapter in the two texts, n p a r , E n g n p a r , I t a , and the percentage change, 100 × ( n p a r , E n g n p a r , I t a ) / n p a r , I t a . We note that the translator split the Italian paragraphs, increasing them up to 60%. Figure 1b shows the scatterplot comparing the number of paragraphs; it clearly shows that the data are all above the 45° line, y = x . Table 4 reports the correlation coefficient and slope of the regression line for this scatterplot and also for the following ones.
Figure 2a shows the scatterplot comparing characters and Figure 2b shows that comparing words. Both variables are very much correlated. In English, the number of words is always larger than in Italian; therefore, the translator needs more words to convey the original meaning. Characters and words are the most correlated variables.
Figure 3a shows the scatterplot comparing periods. The replacement of most semicolons with periods is clearly evidenced by being all samples above the 45° line, y = x .
Figure 3b shows the scatterplot comparing sentences. The effect of replacing semicolons with periods is clearly evident in the blue circles and regression line. Since the Italian text contains many semicolons, sentences are quite long; therefore, the regression line is far from the y = x line. On the contrary, when the semicolons in Italian are counted as periods, then the agreement is very good, as the red circles and regression line show.
Figure 4a shows the scatterplot comparing question marks, and Figure 4b shows that comparing exclamation marks. It is surprising that neither matches the Italian text at all (i.e., the 45° line), particularly the number of questions marks, which in English is larger than in Italian. Appendix C presents a short example in which question and exclamation marks are modified.
Figure 5a shows the scatterplot comparing semicolons and Figure 5b shows that comparing colons. The almost total cancellation of both interpunctions in English is clearly evident.
Figure 6a shows the scatterplot comparing commas. We note that in English, there are many fewer commas than in Italian, and this reduction, together with that in colons, semicolons, and question and exclamation marks, makes the number of interpunctions quite a bit smaller than in Italian (Figure 6b).
In conclusion, the translator completely modified the underlying context of interpunctions by distributing them quite differently from the original Italian text, therefore changing reader’s pauses and the reader’s short–term memory load, as discussed in Section 7 below. A similar modification also occurs in the distribution of paragraphs.
Only characters (Figure 2a) follow the 45° line strictly, but this good agreement is not the translator’s choice; it is very likely due to two reasons: (a) both Italian and English are alphabetical languages with many similar words coming directly from Latin in Italian and, indirectly, through French in English, and (b) alphabetical languages have a very similar mean number of characters per word (see Table 1 of Ref. [37]).
In other terms, a blind comparison of the statistics concerning fundamental linguistic parameters would unlikely indicate that these two texts refer to the same novel.
In the next section, we present the deep language variables which allow for further studies.

5. Deep Language Variables

Let us detail four deep language variables [1,2] which are not consciously managed by writers and are therefore useful for assessing the similarity of texts beyond writers’ awareness. To avoid any possible misunderstanding, these variables refer to the “surface” structure of texts, not to the “deep” structure mentioned in cognitive theory.
Let n C , n W , n S   , and n I be, respectively, the number of characters, words, sentences, and interpunctions (punctuation marks) calculated in disjointed blocks of texts, such as chapters (in this paper) or any other subdivisions; then, we can define four deep language variables (Appendix B lists the mathematical symbols used in the present paper):
The number of characters per word, C P :
C P = n C n W
The number of words per sentence, P F :
P F = n W n S
The number of interpunctions per word, referred to as the word interval, I P :
I P = n I n W
The number of word intervals, n I P , per sentence, M F :
M F = n I P n S
Table 5 reports the mean and standard deviation of these variables. These values were calculated by weighing each text with its number of words to avoid shorter texts weighing statistically as much as long ones. In other words, the statistical weight of a chapter is determined by the ratio between the number of its words and the total number of words of the novel. Moreover, note that the average values of these variables calculated from the totals, as shown in Table 2, do not coincide with the arithmetic or statistical means (those reported in Table 5), as proven in Appendix D.
From these statistical mean values, we note the following:
(a)
< C p > is very similar in both languages, for the reason detailed in Section 4.
(b)
< P F > in Italian is, as expected, quite a bit larger than in English. It becomes smaller and very similar to English only if semicolons are replaced by periods (Italian–E).
(c)
< I P > is significantly smaller in Italian than in English, due to the large number of interpunctions present in Italian. This parameter does not depend on the type of interpunctions; therefore, it is the same also in Italian–E.
The mean values of Table 5—or their minimum values directly calculated from the totals, as discussed in Appendix D—are useful for a first assessment of the “similarity” of texts by defining linear combinations of deep language parameters and then modelling them as vectors, a representation briefly discussed in the next section.

6. Geometrical Representation of Alphabetical Texts

The mean values of Table 5 can be used to model texts as vectors, a representation discussed in detail in Refs. [1,2,3] and here briefly discussed. This geometrical representation also allows the calculation of the probability that a text/author can be confused with another one, an extension in two dimensions of the problem discussed in Ref. [6]. The conditional probability can be considered an index indicating who influenced who, as shown in Ref. [26]. In our case, it provides another indication of how much the translation differs from the original text.

6.1. Vector Representation of Texts

Let us consider the following six vectors of the indicated components of deep language variables: R 1 = ( < C P > , < P F > ), R 2 = ( < M F > , < P F > ), R 3 = ( < I P > , < P F > ), R 4 = ( < C P > , < M F > ), R 5 = ( < I P > , < M F > ), R 6 = ( < I P > , < C P > ) and their resulting vector sum:
R = k = 1 6 R k = x i + y j
The choice of which parameter represents the component in abscissa and ordinate axes is not important because, once the choice is made, the numerical results depend on it, but not the relative comparisons and general conclusions.
In the first quadrant of the Cartesian coordinate plane, two texts are likely mathematically connected—they show close ending points of vector (5)—if their relative Pythagorean distance is small.
To set the two texts here studied—especially the translated text—in the framework of the English Literature, we considered the Cartesian plane defined and discussed in Ref. [26], which reports the list of the English novels whose positions are reported in this figure.
By considering the vector components x and y of Equation (5), we obtain the scatterplot shown in Figure 7 where X and Y are normalized coordinates calculated by setting Lord of the Rings at the origin ( X = 0 ,   Y = 0 ) and Silmarillion at ( X = 1 ,   Y = 1 ) , according to the linear transformations:
X = x x L o r d x S i l m a x L o r d
X = y y L o r d y S i l m a y L o r d
In Figure 7, we report the position of the Italian text (green diamond), its translation (blue diamond), and the position of Ivanohe—a novel that inspired Manzoni in writing his novel—with a green × . Note that since < C P > in the two languages is very similar (see Table 5), the position depends mainly on the other mean values.
We note that the translation (blue diamond) is well within the range of other English novels; in particular, it almost coincides with The Jungle Book. The Italian text is significantly displaced from most English novels.
Figure 8 shows a zoomed–in view. We note that if periods are replaced by semicolons in the translation (English–I, cyan diamond), then this text is located very close to Ivanohe, which may indicate an indirect influence of Walter Scott on Manzoni’s writing. On the other hand, if semicolons are replaced by periods, Ivanohe is closer to English (red × ). The red diamond refers to Italian–E, which falls well within the range of English texts reported in Figure 7.
These remarks refer to the display of vectors whose ending point depends only on mean values. The standard deviation of the four deep language variables, reported in Table 5, do introduce data scattering; therefore, in the next sub–section, we study and discuss this issue by calculating the probability (called “error” probability) that a text may be mathematically confused with another one, as in Ref. [26].

6.2. Error Probability

Besides the vector R of Equation (5)—due to mean values—we can consider another vector ρ , due to the standard deviation of the four deep language variables, that adds to R [26]. In this case, the final random vector describing a text is given by
T = R + ρ
Now, to obtain some insight into this new description, we consider the area of a circle centred at the ending point of R .
We fix the magnitude (radius)   ρ as follows. First, we add the variances of the deep language variables that determine the components x and y of R —let them be σ x 2 , σ y 2 —then, we calculate the average value σ ρ 2 = 0.5 × ( σ x 2 + σ y 2 ) , and finally, we set
ρ = σ ρ
Now, since in calculating the coordinates x and y of R , a deep language variable can be summed twice or more, we add its standard deviation (referred to as sigma) twice or more times before squaring, as shown in [26].
Now, we can estimate the (conditional) probability that a text is confused with another by calculating ratios of areas. This procedure is correct if we assume that the bivariate density of the normalized coordinates ρ X ,     ρ Y , centred at R , is uniform. By assuming this hypothesis, we can calculate probabilities as ratios of areas [38,39].
As discussed in [26], the hypothesis of substantial uniformity around R should be justified within 1–sigma bounds by noting that the coordinates X ,   Y are likely distributed according to a log–normal bivariate density because the logarithm of the four deep language variables, which are combined in Equation (9) linearly, can be modelled as Gaussian [1]. For the central–limit theorem, we should expect approximately a Gaussian model on the linear values, but with a significantly larger standard deviation than that of the single variables. Therefore, in the area close to R , the bivariate density function should not be very peaked, hence the uniform density modelling.
Now, we can calculate the following probabilities. Let A be the common area of two 1−sigma circles (i.e., the area proportional to the joint probability of two texts), let A 1 be the area of the 1−sigma circle of text 1, and let A 2 be the area of the 1−sigma circle of text 2. Now, since probabilities are proportional to areas, we obtain the following relationships:
A A 1 = P ( A 1 , A 2 ) P ( A 1 ) = P ( A 2 / A 1 ) P ( A 1 ) P ( A 1 ) = P ( A 2 / A 1 )
A A 2 = P ( A 1 , A 2 ) P ( A 2 ) = P ( A 1 / A 2 ) P ( A 2 ) P ( A 2 ) = P ( A 1 / A 2 )
Therefore, A / A 1 gives the conditional probability P ( A 2 / A 1 ) that part of text 2 can be confused (or “contained”) in text 1; A / A 2 gives the conditional probability P ( A 1 / A 2 ) that part of text 1 can be confused with text 2. P ( A 2 / A 1 ) = 1 means that A = A 1 ; therefore, text 1 can be fully confused with text 2, and P ( A 1 / A 2 ) = 1 means that A = A 2 ; therefore, text 2 can be fully confused with text 1.
Now, we can present a synthetic parameter which highlights how much two texts can be erroneously confused with each other. The parameter is the average conditional probability
p e = P ( A 2 / A 1 ) P ( A 1 ) + P ( A 1 / A 2 ) P ( A 2 )
Now, since in comparing two texts we can assume P ( A 1 ) = P ( A 2 ) = 0.5 , we obtain
p e = 0.5 × [ P ( A 2 / A 1 ) + P ( A 1 / A 2 ) ]
If p e = 0 , there is no intersection between the two 1−sigma circles, so the two texts cannot be confused with each other; therefore, there is no mathematical connection involving the deep language parameters. If p e = 1 , the two texts can be totally confused, and the two 1−sigma circles coincide.
In our exercise, assuming the Italian text is text 1 and its translation is text 2, we obtain P ( A 2 / A 1 ) = 0.225 and P ( A 1 / A 2 ) = 0.339 ; therefore, p e = 0.282 . If we assume Italian–E as text 1, we get P ( A 2 / A 1 ) = 0.377 and P ( A 1 / A 2 ) = 0.243 ; therefore, p e = 0.310 . Since these values are very low, there is little chance that the two texts, in any case, can be confused with each other. In conclusion, as already remarked, a blind analysis of the two texts would unlikely reveal their real relationship.
Now, we can define a “domestication index” D , which takes care of both translation and retro–translation, given by
D = 1 p e
with the following meaning: if D = 1 , then p e = 0 , so domestication is total; if D = 0 , then p e = 1 , so foreignization is total. In our study case, D = 1 0.282 = 0.718 or D = 1 0.310 = 0.690 therefore, domestication is about 70%.
In the next section, we study and discuss how the Italian text and its translation engage readers’ short–term memory.

7. Short–Term Memory of Writers/Readers

The human short–term memory seems to be modelled by two practically independent variables, which apparently engage two short–term memory (STM) buffers in series [7,8]. The first buffer is modelled according to the number of words between two consecutive interpunctions—i.e., the variable I P , the word interval—which follows Miller’s 7 ± 2 law [40] (based on the extensive literature on this topic; see the studies and remarks reported in [41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63]). The second buffer is modelled according to the number of word intervals, the variable M F , contained in a sentence, referred to as the extended short–term memory (E–STM).
Regarding the first buffer, Figure 9 shows I P versus P F . Note that the replacement of semicolons with periods is clearly evident (red circles). The trends shown are typical of alphabetical languages [1,2,3,4,5,6,7,8,26] and confirm two fundamental aspects:
(a)
I P ranges approximately within Miller’s bounds.
(b)
As the number of words in a sentence, P F , increases, I P can increase but not linearly, because the first buffer cannot hold, approximately, a number of words larger than that empirically predicted by Miller’s Law; therefore, saturation must occur. Scatterplots like that shown in Figure 9 provide an insight into the short–term memory capacity engaged when reading/writing a text, because a writer is also a reader of their own text.
As for the second buffer, Figure 10 shows the scatterplot comparing M F and I P . The effect of replacing semicolons with periods is clearly evident (red circles). Note that I P is insensitive to this replacement because n I does not change.
More interestingly, in the translation, M F 2.5 = < M F > (see Table 5) for a large range, while in Italian, it varies more randomly. In other words, the effect of replacing semicolons with periods (English) has the effect of forcing the capacity of the second buffer to a constant value. Finally, Figure 10 shows that, for a large range, I P and M F are uncorrelated, therefore independent, according to the log–normal probability modelling discussed in Ref. [26]. In other words, the reading of these specific texts also seems to require an STM made of two independent buffers in series.
In [8], we studied the numerical patterns (due to combinations of I P and M F ) which provide the number of sentences that can theoretically be recorded in the two STM buffers, regardless of their meaning. This number was compared with that of the sentences actually found in novels of the Italian and English Literatures and found that most authors write for readers with small STM buffers and, consequently, are forced to reuse numerical patterns to convey multiple meanings.
Figure 11 shows the scatterplot comparing the theoretical number of sentences allowed in Italian and in English. In Italian, this number is about two orders of magnitude greater than in English, a result mainly due to the larger M F (see Figure 10). Only when semicolons are replaced by periods in the Italian text (red circles) do the two variables agree more. In any case, in Italian, the number of possible patterns is always larger than in English.
The ratio between the number of sentences in a novel and the number of sentences theoretically allowed by the two buffers concerning I P and M F , referrred to as multiplicity factor α , can be used to compare two texts [8]. In the Italian and English Literatures, we found that α > 1 is more likely than α < 1 , and often α 1 . In the latter case, writers reuse the same pattern of sentences many times.
In our present case, we study a text and its translation. Figure 12 shows α for the two texts. Differences are clearly evident and they depend, of course, on the findings reported in Figure 11, with very low values in Italian. Italian and English are clearly very different. In other words, Manzoni has many potential patterns to use, according mainly to M F , but he does not. These patterns are much reduced in English; only Italian–E (red circles) agrees more with English.
Another useful parameter is the mismatch index [8], which measures to what extent a writer uses the number of sentences theoretically available, defined by
I M = α 1 α + 1
I M = 0 if α = 1 ; in this case, experimental and theoretical patterns are perfectly matched. They are over–matched if I M > 0 ( α > 1 ) and under–matched if I M < 0 ( α < 1 ) .
Figure 13 shows the scatterplot of I M compared within the two texts. Again, there is a large difference between Italian and English (blue circles). Only Italian–E agrees with English (red circles).
In conclusion, in the translation, the reader’s STM buffers are so diverse that they make the relationship between the two texts hardly recognizable as one being the translation of the other. The two texts show a significant correlation only if semicolons are replaced by periods in the Italian text, which indicates, as already noted, that almost all semicolons were replaced with periods.
The next section details the theory of important linguistic channels.

8. Linguistic Channels

In this section, we note, first, important linguistic channels present in alphabetical texts, which represent a “fine–tuning” analysis of linguistic variables; second, the general theory of linear channels [3,6] concerning the joint processing of two experimental scatterplots; third, the theory applied to the particular case of processing a single scatterplot; and finally, the application of the theory to the two texts.
Note that since the theory deals with linear regressions, it can be applied to and be useful in any field in which linear relationships fit experimental data.

8.1. Four Linguistic Channels

In alphabetical texts, we can always define at least four linguistic linear channels [3,6], namely,
(a).
Sentence channel (S–channel);
(b).
Interpunctions channel (I–channel);
(c).
Word interval channel (WI–channel);
(d).
Characters channel (C–channel).
In S–channels, the number of sentences of two texts is compared for the same number of words. From the two scatterplots relating n S to n W in two texts, we can relate the n S , j of text Y j to n S , k of text Y k by setting n W , j = n W , k . These channels describe how many sentences the author of text j writes, compared to the writer of text k (reference text), by using the same number of words. Only the theory of linguistic channels allows this comparison. These channels are more linked to P F than to the other variables. They are very likely to reflect the style of the writer.
In I–channels, the number of word intervals, M F , of two texts is compared for the same number of sentences. These channels describe how many short texts between two contiguous punctuation marks (of length I P words) two authors use. Since M F is connected to the E–STM, I–channels are more related to the second buffer capacity than to the style of the writer.
In WI–channels, the number of words contained in a word interval, (i.e., I P ) is compared for the same number of interpunctions; therefore, WI–channels are more related to the first buffer of the STM than to the style of the writer.
In C–channels, the number of characters of two texts is compared for the same number of words. These channels are more related to the language used than to other variables.

8.2. General Theory of Linear Channels

In a scatterplot involving—in our case—linguistic variables, an independent (reference) variable x and a dependent variable y can be related using a regression line (slope m ) passing through the origin:
y = m x
However, Equation (16) does not detail the full relationship between x and y because it links only mean conditional values. For example, for two different texts, Y k and Y j , we can write Equation (16) for the same variables, x and y .
More general linear relationships can be written by also considering the scattering of the data—measured in Y k and Y j by the correlation coefficients r k and r j , not considered in Equation (12)—around the regression lines (slopes m k and m j ) in two scatterplots:
y k = m k x + n k
y j = m j x + n j
While Equation (16) connects the dependent variable y to the independent variable x only on average, Equation (17) introduces additive “noise”, n k and n j , with zero mean value. The noise, therefore, is due to the correlation coefficient r 1 .
This theory allows us to compare two dependent variables, y j and y k , by eliminating x . We can calculate the slope and correlation coefficient of y j versus y k without knowing their scatterplot. For example, we can compare the number of sentences in two texts for an equal number of words by considering not only the two regression lines that link sentences to words—which would provide only the slope of the linear relationship between y j and y k —but also the scattering of the data. In this particular case, we refer to this linguistic channel as the “sentences channel” and to the channel processing as “fine–tuning” because it deepens the analysis of the data and provides more insight into the relationship between two texts.
The mathematical theory is here briefly detailed (more details in Refs. [2,3]). Continuing with the example of sentences, we can study the “sentences channel” that relates the sentences of the input text Y k to those of the output text Y j by eliminating x (i.e., the variable words). From Equation (17), we obtain the linear noisy relationship between the sentences in text Y k (reference, independent, channel input text) and the sentences in text Y j (dependent, channel output text):
y j = m j m k y k m j m k n k + n j
In Equation (18), the slope of the regression line linking y j to k , m j k , is given by
m j k = m j m k
The “regression noise–to–signal ratio”, R m , due to m j k 1 , of the channel is given by
R m = ( m j k 1 ) 2
The unknown correlation coefficient r j k between y j and y k is given by [39]
r j k = c o s a r c o s ( r j ) a r c o s ( r k )
The “correlation noise–to–signal ratio”, R r , due to r j k < 1 , is given by
R r = 1 r j k 2 r j k 2 m j k 2
Because the two noise sources are disjointed, the total noise–to–signal ratio R of the channel connecting text Y k to text Y j is given by
R = ( m j k 1 ) 2 + 1 r j k 2 r j k 2 m j k 2
From Equation (23), we obtain the signal–to–noise ratio γ (natural units) and Γ (dB):
γ = 1 R
Γ = 10 × l o g 10 γ
Note that no channel can yield r j k = 1 and m j k = 1 (i.e., γ = ), a case referred to as the ideal channel, unless a text is compared with itself (self–comparison, self–channel). In practice, we always find that r j k < 1 and m j k 1 .
In summary, the slope m j k measures the multiplicative “bias” of the dependent variable compared to the independent variable; the correlation coefficient r j k measures how “precise” the linear best fit is. The slope m j k is the source of the “regression noise” of the channel, and the correlation coefficient r j k is the source of the “correlation noise”.
Finally, note that since the probability of jointly finding r = 1 and m = 1 is practically zero, all channels are always noisy.

8.3. The Channel with a Single Scatterplot: One–to–One Correspondence

To clarify what we mean by a single scatterplot and one–to–one correspondence, let us consider the translation of a text in which we draw a scatterplot for each linguistic variable. For example, we can display the number of words per chapter in the translated text versus that of the original text, as in Figure 2b, in which m = 1.047 and r = 0.9967 (Table 4). Now, if there were a perfect ideal translation, the translated text would have the same number of words per chapter as the original text—therefore, m = 1 and r = 1 —because the scatterplot would collapse to the deterministic 45 ° linear relationship between x and y , resulting in the following relationship:
y k = x
Let us show how the relationships in Section 8.2 change in this particular and ideal case. If m k = 1 and n k = 0 , then r k = 1 ; therefore, this is a special case of the general theory recalled in the previous sub–section, and we obtain the following expressions:
m j k = m j
R m = ( m j 1 ) 2
r j k = c o s a r c o s ( r j ) 0 ) = r j
R r = 1 r j 2 r j 2 m j 2
R = ( m j 1 ) 2 + 1 r j 2 r j 2 m j 2
γ = 1 R
In other words, by considering Equation (26), we study how a deterministic relationship, which describes a noiseless channel, is transformed into the experimental relationship of a noisy channel, m j 1 and r j k < 1 . In conclusion, we can study a single scatterplot with the tools of the general theory. Therefore, the signal–to–noise ratio γ is still a single index that synthetically describes the relationship between x and y .

8.4. Performance of Linguistic Channels in Italian and in English

In this section, we study the linguistic channels in Italian and in English. Table 6 reports the correlation and slope of the regression lines need to study the S–channel, I–channel, WI–channel, and C–channel. Table 7 reports the signal–to–noise ratio in the channels.
It is very clear that the S–channel, I–channel, and WI–channel between Italian (input) and English (output) are very noisy, with very low signal–to–noise ratios. Only the S–Channel between Italian–E (input) and English (output) has a significant large Γ 27 dB ( γ 500 ), even larger than the expected large Γ 24 dB of the C–channel.
To study the single scatterplots (i.e., the scatterplot comparing the same variables), we need the correlation and slope of the regression lines reported in Table 4. Table 8 reports the signal–to–noise ratio in the channels.
It is very clear that only the channels related to characters and words—both characters and words are very much correlated; see Table 6—show large signal–to–noise ratios, while the other channels (with the exception of the sentences in Italian–E) are very noisy.
The conclusion regarding the linguistic channels is the same as above: a blind comparison would unlikely indicate the two texts refer to the same novel.
In the next section, we consider a universal readability index which measures the relative reading difficulty of texts.

9. Universal Readability Index

The universal readability index G U —proposed in Ref. [37], see also Ref. [64]—measures the relative reading difficulty of texts. It is understood that readers are educated enough to understand the texts. It is given by
G U = 89 10 k C P + 300 / P F 6 I P 6
k = < C P , I T A > / < C P , L a n >
In the present paper, < C p , I T A > = 4.62 refers to Italian and < C p , L a n > = 4.36 refers to English (see Table 5). By using Equations (33) and (34), the mean value < k C P > is forced to be equal to that found in Italian, namely, 4.62 . The rationale for this choice is that C P is a parameter typical of a language/text which, if not scaled, would bias G U without really quantifying the reading difficulty of readers, who in their own language are, on average, used to reading shorter or longer words than in Italian. This scaling, therefore, avoids changing G U only because a language, on average, has shorter words than Italian (like English). In any case, C p affects Equation (33) much less than P F or I P .
Figure 14a shows the scatterplot of G U comparing Italian and English (blue circles) and Italian–E and English (red circles). We note two interesting facts:
(a)
For G U < 55 , the readability of English is lower than that of Italian; for G U 55 , the two values agree more, as they align more with the 45° line (in Italian, in this range, the lower values of I P in Equation (32) balance the larger values of P F ).
(b)
If semicolons are replaced by periods, the Italian–E text is much more readable than the English text, because P F is noticeably reduced while I P does not change.
These values could be decoded, for example, into the minimum number of school years, Y , necessary to assess that a text/author is “easy” to read, according to the modern Italian school system, assumed to be a common reference (Figure 14b) (see Refs. [1,64]).
This assumption does not mean, of course, that English readers should attend school for the same number of years as Italian readers; it is only a way to perform relative comparisons, becasue a compariosn is otherwise difficult to assess from the mere values of G U . In other words, we should consider Y to be an “equivalent” number of school years. We obtain, on average, Y 9 years for English, Y 8 years for Italian, and Y 7 years for Italian–E.

10. Summary and Conclusions

We have mathematically compared the text of a famous Italian novel, I promessi sposi, written by Alessandro Manzoni, to the text of a well–received masterful English translation, The Betrothed by Michael F. Moore. To avoid any misunderstanding, the mathematical theory and its multidimensional application to this translation does not measure or judge its efficacy and beauty; only its mathematical underlying structure. In other words, the discussion and conclusion we reach are meant to be valid only in this context. We measured the “equivalence” of the source text and target text mathematically, just like we did with other unrelated couples of texts in any language, even between different languages.
After the punctual discussion on the findings reported in each section, we can conclude that the translator largely modified the mathematical structure of the original Italian text. The translation theory he adopted was the “domestication” of the source text. We have found that domestication was largely, even “wildly”, used in translating the New Testament Greek Books into modern languages, as we have shown in [4,6,64].
The almost total replacement of semicolons with periods was justified by stating that English is not as economical in its use of subject pronouns as Italian is. This was not the only change. Question and exclamation marks, as well as commas, were also largely modified, with no clear justification of the alleged poor “economy” of English. These modifications produce several consequences.
The short–term memory buffers required for the reader are significantly modified. The theoretical number of sentences allowed in Italian, i.e., the patterns of sentences available to Manzoni, is about two orders of magnitude greater than in English; therefore, the translator reused the same numerical pattern of words many times. Only when semicolons are replaced by periods in the Italian text, to simulate a reverse “domestication” translation, do the two variables agree more.
The geometrical representation of the texts, the related probability of error and the domestication index all indicate that the two texts are practically “disconnected”. The fine–tuning analysis shows that the linguistic channels are very noisy, with very poor signal–to–noise ratios, except the channels related to characters and words.
In conclusion, a blind comparison of the parameters obtained through a multidimensional analysis for the two texts would unlikely indicate they refer to the same novel.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The author wishes to thank Lucia Matricciani for providing the digital versions of the texts here studied.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. Incipit Paragraph

Alessandro Manzoni, I promessi sposi, Capitolo 1.
“Quel ramo del lago di Como, che volge a mezzogiorno, tra due catene non interrotte di monti, tutto a seni e a golfi, a seconda dello sporgere e del rientrare di quelli, vien, quasi a un tratto, a ristringersi, e a prender corso e figura di fiume, tra un promontorio a destra, e un’ampia costiera dall’altra parte; e il ponte, che ivi congiunge le due rive, par che renda ancor più sensibile all’occhio questa trasformazione, e segni il punto in cui il lago cessa, e l’Adda rincomincia, per ripigliar poi nome di lago dove le rive, allontanandosi di nuovo, lascian l’acqua distendersi e rallentarsi in nuovi golfi e in nuovi seni. La costiera, formata dal deposito di tre grossi torrenti, scende appoggiata a due monti contigui, l’uno detto di san Martino, l’altro, con voce lombarda, il Resegone, dai molti suoi cocuzzoli in fila, che in vero lo fanno somigliare a una sega: talché non è chi, al primo vederlo, purché sia di fronte, come per esempio di su le mura di Milano che guardano a settentrione, non lo discerna tosto, a un tal contrassegno, in quella lunga e vasta giogaia, dagli altri monti di nome più oscuro e di forma più comune. Per un buon pezzo, la costa sale con un pendìo lento e continuo; poi si rompe in poggi e in valloncelli, in erte e in ispianate, secondo l’ossatura de’ due monti, e il lavoro dell’acque. Il lembo estremo, tagliato dalle foci de’ torrenti, è quasi tutto ghiaia e ciottoloni; il resto, campi e vigne, sparse di terre, di ville, di casali; in qualche parte boschi, che si prolungano su per la montagna. Lecco, la principale di quelle terre, e che dà nome al territorio, giace poco discosto dal ponte, alla riva del lago, anzi viene in parte a trovarsi nel lago stesso, quando questo ingrossa: un gran borgo al giorno d’oggi, e che s’incammina a diventar città. Ai tempi in cui accaddero i fatti che prendiamo a raccontare, quel borgo, già considerabile, era anche un castello, e aveva perciò l’onore d’alloggiare un comandante, e il vantaggio di possedere una stabile guarnigione di soldati spagnoli, che insegnavan la modestia alle fanciulle e alle donne del paese, accarezzavan di tempo in tempo le spalle a qualche marito, a qualche padre; e, sul finir dell’estate, non mancavan mai di spandersi nelle vigne, per diradar l’uve, e alleggerire a’ contadini le fatiche della vendemmia. Dall’una all’altra di quelle terre, dall’alture alla riva, da un poggio all’altro, correvano, e corrono tuttavia, strade e stradette, più o men ripide, o piane; ogni tanto affondate, sepolte tra due muri, donde, alzando lo sguardo, non iscoprite che un pezzo di cielo e qualche vetta di monte; ogni tanto elevate su terrapieni aperti: e da qui la vista spazia per prospetti più o meno estesi, ma ricchi sempre e sempre qualcosa nuovi, secondo che i diversi punti piglian più o meno della vasta scena circostante, e secondo che questa o quella parte campeggia o si scorcia, spunta o sparisce a vicenda. Dove un pezzo, dove un altro, dove una lunga distesa di quel vasto e variato specchio dell’acqua; di qua lago, chiuso all’estremità o piuttosto smarrito in un gruppo, in un andirivieni di montagne, e di mano in mano più allargato tra altri monti che si spiegano, a uno a uno, allo sguardo, e che l’acqua riflette capovolti, co’ paesetti posti sulle rive; di là braccio di fiume, poi lago, poi fiume ancora, che va a perdersi in lucido serpeggiamento pur tra’ monti che l’accompagnano, degradando via via, e perdendosi quasi anch’essi nell’orizzonte. Il luogo stesso da dove contemplate que’ vari spettacoli, vi fa spettacolo da ogni parte: il monte di cui passeggiate le falde, vi svolge, al di sopra, d’intorno, le sue cime e le balze, distinte, rilevate, mutabili quasi a ogni passo, aprendosi e contornandosi in gioghi ciò che v’era sembrato prima un sol giogo, e comparendo in vetta ciò che poco innanzi vi si rappresentava sulla costa: e l’ameno, il domestico di quelle falde tempera gradevolmente il selvaggio, e orna vie più il magnifico dell’altre vedute.
Per una di queste stradicciole, tornava bel bello dalla passeggiata verso casa, sulla sera del giorno 7 novembre dell’anno 1628, don Abbondio…”
Alessandro Manzoni, The Betrothed, Chapter 1. Translated by Michael F. Moore [28].
“The branch of Lake Como that turns south between two unbroken mountain chains, bordered by coves and inlets that echo the furrowed slopes, suddenly narrows to take the flow and shape of a river, between a promontory on the right and a wide shoreline on the opposite side. The bridge that joins the two sides at this point seems to make this transformation even more visible to the eye and mark the spot where the lake ends and the Adda begins again, to reclaim the name lake where the shores, newly distant, allow the water to spread and slowly pool into fresh inlets and coves. Formed from the sediment of three large streams, the shoreline lies at the foot of two neighboring mountains, the first called San Martino, the second, in Lombard dialect, the “Resegone”—the big saw—after the row of many small peaks that really do make it look like one. So clear is the resemblance that no one—provided they are directly facing it, from the northern walls of Milan, for example—could fail to immediately distinguish this summit from other mountains in that long and vast range with more obscure names and more common shapes. For a good stretch the shore rises upward into a smooth rolling slope. Then it breaks off into small hills and ravines, steep inclines and flat terraces, molded by the contours of the two mountains and the erosion of the waters. The water’s edge, cut by the outlets of the streams, is almost all pebbles and stones. The rest is fields and vineyards, dotted with towns, villages, and hamlets. Here and there a woods climbs up the side of the mountain.
Lecco, the capital that lends its name to the province, stands at a short remove from the bridge, and is on and indeed partly inside the lake when the water rises. Nowadays it is a large town well on its way to becoming a city. At the time of the events I am about to relate, this already good–sized village was also fortified, which conferred upon it the honor of a commander in residence, and the benefit of a permanent garrison of Spanish soldiers, who taught modesty to the girls and women of the town, gave an occasional tap on the back to a husband or father, and, at summer’s end, never failed to spread out into the vineyards to thin the grapes and relieve the peasants of the trouble of harvesting them.
Roads and footpaths used to run—and still do—from town to town, from summit to shore, from hill to hill. Some are more or less steep; some are level. In places they dip, sinking between two walls, and all you can see when you look up is a patch of sky and a mountaintop. In others they climb to open embankments, where the view encompasses a broader panorama, always rich, always new, depending on the vantage point and how much of the vast expanse can be seen, and on whether the landscape protrudes or recedes, stands out from or disappears into the horizon.
One piece, then another, then a long stretch of that vast and varied mirror of water. Over here, the lake—terminating at the far end or vanishing into a cluster, a procession of mountains—slowly growing wider between yet more mountains that unfold before the eyes, one by one, whose image, alongside that of the towns by the lake, is reflected upside down in the water. Over there, a bend in the river, then more lake, then river again disappearing into a shiny ribbon curling between the mountains on either side, and slowly descending to vanish on the horizon.
The place from which you contemplate these varied sights offers its own display on every side. The mountain along whose slopes you walk unfolds its peaks and crags above and around you, distinct, prominent, changing with every step, opening and then circling into ridges where a lone summit had at first appeared. The shapes that were reflected in the lake only minutes before now appear close to the summit. The tame, pleasant foothills temper the wild landscape and enhance the magnificence of the other vistas.
Along one of these footpaths, on the evening of the seventh day of November in 1628, Don Abbondio…”

Appendix B. Example of Texts

Example in which the number of question marks do not coincide; see the text in italics.
Alessandro Manzoni, I promessi sposi, Capitolo 21.
“Oh Signore! pretendere! Cosa posso pretendere io meschina, se non che lei mi usi misericordia? Dio perdona tante cose, per un’opera di misericordia! Mi lasci andare; per carità mi lasci andare! Non torna conto a uno che un giorno deve morire di far patir tanto una povera creatura. Oh! lei che può comandare, dica che mi lascino andare! M’hanno portata qui per forza. Mi mandi con questa donna a *** dov’è mia madre. Oh Vergine santissima! mia madre! mia madre, per carità, mia madre! Forse non è lontana di qui... ho veduto i miei monti! Perché lei mi fa patire? Mi faccia condurre in una chiesa. Pregherò per lei, tutta la mia vita. Cosa le costa dire una parola? Oh ecco! vedo che si move a compassione: dica una parola, la dica. Dio perdona tante cose, per un’opera di misericordia!”
Alessandro Manzoni, The Betrothed, Chapter 21.
“Oh, sir! What do I expect? What can a poor girl like me expect? Only your mercy. God will forgive many things for an act of mercy! Let me go, for the love of God, let me go! You, who will also have to face your maker one day: What do you have to gain from causing a poor creature so much suffering? You have the power to give orders: So tell them to let me go! They brought me here by force. Send me with this woman to ***, where my mother lives. Oh Blessed Virgin! My mother! My mother, for goodness’ sake, my mother! Perhaps it is not far from here. I saw my mountains! Why are you making me suffer? Tell them to take me to a church. I will pray for you for the rest of my life. What would it cost you to say one word? Ah, yes! I can see that you are moved to pity: Say the word, say it. God will forgive many things for an act of mercy!”

Appendix C. List of Mathematical Symbols and Meaning

SymbolDefinitions
CpCharacters per word
DDomestication index
GUUniversal readability index
IMMismatch index
IpWord interval
MFWord intervals per sentence
PFWords per sentence
RNoise–to–signal ratio
RmRegression noise–to–signal ratio
RrCorrelation noise–to–signal ratio
STotal number of sentences
WTotal number of words
nCNumber of characters
nWNumber of words
nSNumber of sentences
nINumber of interpunctions
n I P Number of word intervals
𝛾Signal–to–noise ratio
ΓSignal–to–noise ratio (dB)
mjkSlope of regression line of text j versus text k
rjkCorrelation coefficient between text j and text k

Appendix D. Inequalities

Let M be the number of samples (i.e., number of disjointed blocks of text, such as chapters or books); then, for example, the statistical mean value, < P F > , is given by
< P F > = k = 1 M P F , k × n W , k / W
where W = k = 1 M n W , k is the total number of words. Note that < P F > 1 M k = 1 M P F , k k = 1 M n W , k / k = 1 M n S , k = W / S , where S is the total number of sentences.
In general, the average values calculated from sample totals are always smaller than their statistical means; therefore, they give lower bounds, as I prove in the following.
Let us consider, for example, parameter P F . Because of Chebyshev’s inequality ([65], inequality 3.2.7), we can write Equation (A1) as
< P F > = k = 1 M n W k n S k n W k W 1 M k = 1 M n W k n S k k = 1 M n W k W = 1 M k = 1 M n W k n S k
Equation (A2) states that the mean calculated with samples weighted 1 / M (arithmetic mean) is smaller than (or equal to) the mean calculated with samples weighted n W k / W .
Now, again for Chebyshev’s inequality, we obtain the following:
k = 1 M n W k n S k 1 M k = 1 M n W k k = 1 M 1 n S k = W M k = 1 M 1 n S k
Further, for Cauchy–Schawarz’s inequality (or by the fact that the harmonic mean is less than, or equal to, the arithmetic mean), we obtain the following:
k = 1 M 1 n S k M 2 k = 1 M n S k
Finally, by inserting these inequalities into (6), we obtain the following:
< P F > W M 2 M 2 k = 1 M n S k = W S
Equation (A5) establishes that the statistical mean calculated with samples weighted n W k / W is greater than (or equal to) the average calculated with sample totals. The values given by these three methods of calculation coincide only if all texts are perfectly identical, i.e., with the same number of characters, words, sentences, and interpunctions, a case which is improbable.

References

  1. Matricciani, E. Deep Language Statistics of Italian throughout Seven Centuries of Literature and Empirical Connections with Miller’s 7 ∓ 2 Law and Short−Term Memory. Open J. Stat. 2019, 9, 373–406. [Google Scholar] [CrossRef]
  2. Matricciani, E. A Statistical Theory of Language Translation Based on Communication Theory. Open J. Stat. 2020, 10, 936–997. [Google Scholar] [CrossRef]
  3. Matricciani, E. Multiple Communication Channels in Literary Texts. Open J. Stat. 2022, 12, 486–520. [Google Scholar] [CrossRef]
  4. Matricciani, E. Linguistic Mathematical Relationships Saved or Lost in Translating Texts: Extension of the Statistical Theory of Translation and Its Application to the New Testament. Information 2022, 13, 20. [Google Scholar] [CrossRef]
  5. Matricciani, E. Capacity of Linguistic Communication Channels in Literary Texts: Application to Charles Dickens’ Novels. Information 2023, 14, 68. [Google Scholar] [CrossRef]
  6. Matricciani, E. Linguistic Communication Channels Reveal Connections between Texts: The New Testament and Greek Literature. Information 2023, 14, 405. [Google Scholar] [CrossRef]
  7. Matricciani, E. Is Short−Term Memory Made of Two Processing Units? Clues from Italian and English Literatures down Several Centuries. Information 2024, 15, 6. [Google Scholar] [CrossRef]
  8. Matricciani, E. A Mathematical Structure Underlying Sentences and Its Connection with Short–Term Memory. Appl. Math 2024, 4, 120–142. [Google Scholar] [CrossRef]
  9. Catford, J.C. A Linguistic Theory of Translation—An Essay in Applied Linguistics; Oxford Univeristy Press: Oxford, UK, 1965. [Google Scholar]
  10. Bell, T.R. Translation and Translating: Theory and Practice; Longman: New York, NY, USA, 1991. [Google Scholar]
  11. Proshina, Z. Theory of Translation; Far Eastern University Press: Manila, Philippines, 2008. [Google Scholar]
  12. Warren, R. (Ed.) The Art of Translation: Voices from the Field; North-eastern University Press: Boston, MA, USA, 1989. [Google Scholar]
  13. Wilss, W. Knowledge and Skills in Translator Behaviour; John Benjamins: Amsterdam, The Netherlands; Philadelphia, PA, USA, 1996. [Google Scholar]
  14. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  15. Hyde, G.M. Literary Translation. Hung. Stud. Engl. 1991, 22, 39–47. [Google Scholar]
  16. Yousef, T. Literary Translation: Old and New Challenges. Int. J. Arab.—Engl. Stud. 2012, 13, 49–64. [Google Scholar] [CrossRef]
  17. Nũñez, K.J. Literary translation as an act of mediation between author and reader. Estud. Traducción 2012, 2, 21–31. [Google Scholar]
  18. Bernaerts, L.; De Bleeker, L.; De Wilde, J. Narration and translation. Lang. Lit. 2014, 23, 203–212. [Google Scholar] [CrossRef]
  19. Ghazala, H.S. Literary Translation from a Stylistic Perspective. Stud. Engl. Lang. Teach. 2015, 3, 124–145. [Google Scholar] [CrossRef]
  20. Suo, X. A New Perspective on Literary Translation. Strategies Based on Skopos Theory. Theory Pract. Lang. Stud. 2015, 5, 176–183. [Google Scholar] [CrossRef]
  21. Munday, J. (Ed.) The Routledge Companion to Translation Studies; Routledge: New York, NY, USA, 2009. [Google Scholar]
  22. Munday, J. Introducing Translation Studies. Theories and Applications; Routledge: New York, NY, USA, 2016. [Google Scholar]
  23. Panou, D. Equivalence in Translation Theories: A Critical Evaluation. Theory Pract. Lang. Stud. 2013, 3, 1–6. [Google Scholar]
  24. Saule, B.; Aisulu, N. Problems of translation theory and practice: Original and translated text equivalence. Procedia—Soc. Behav. Sci. 2014, 136, 119–123. [Google Scholar] [CrossRef]
  25. Krein-Kühle, M. Translation and Equivalence. In Translation: A Multidisciplinary Approach; House, J., Ed.; Palgrave Advances in Language and Linguistics: London, UK, 2014. [Google Scholar]
  26. Matricciani, E. Multi–Dimensional Data Analysis of Deep Language in J.R.R. Tolkien and C.S. Lewis Reveals Tight Mathematical Connections. AppliedMath 2024, 4, 927–949. [Google Scholar] [CrossRef]
  27. Venuti, L. The Translator’s Invisibility: A History of Translation; Routledge: New York, NY, USA, 1995. [Google Scholar]
  28. Manzoni, A. The Betrothed; Moore, M.F., Translator; The Modern Library: New York, NY, USA, 2022. [Google Scholar]
  29. Cella, R. Storia dell’italiano; Società Editrice Il Mulino: Bologna, Italy, 2015. [Google Scholar]
  30. Frare, P. Una struttura in movimento: Sulla forma artistica dei «I promessi sposi». Italianist 1996, 16, 62–75. [Google Scholar] [CrossRef]
  31. Raboni, G. La scrittura purgata: Sulla cronologia della seconda minuta dei Promessi sposi. Filol. Ital. 2008, 5, 1000–1018. [Google Scholar]
  32. Frare, P. La scrittura Dell’inquietudine: Saggio su Alessandro Manzoni; L.S. Olschki: Firenze, Italy, 2006. [Google Scholar]
  33. Frare, P.; Salvioli, M. Prodigi di misericordia e forza del linguaggio. Sui capitoli XXI e XXIII dei «Promessi sposi». Munera 2016, 3, 109–119. [Google Scholar]
  34. Bruthiaux, P. The Rise and Fall of the Semicolon: English Punctuation Theory and English Teaching Practice. Appl. Linguist. 1995, 16, 1–14. [Google Scholar] [CrossRef]
  35. Duncan, M. Whatever Happened to the Paragraph? Coll. Engl. 2007, 69, 470–495. [Google Scholar] [CrossRef]
  36. Watson, C. Points of Contention: Rethinking the Past, Present, and Future of Punctuation. Crit. Inq. 2012, 38, 649–672. [Google Scholar] [CrossRef]
  37. Matricciani, E. Readability Indices Do Not Say It All on a Text Readability. Analytics 2023, 2, 296–314. [Google Scholar] [CrossRef]
  38. Papoulis, A. Probability & Statistics; Prentice Hall: Hoboken, NJ, USA, 1990. [Google Scholar]
  39. Lindgren, B.W. Statistical Theory, 2nd ed.; MacMillan Company: New York, NY, USA, 1968. [Google Scholar]
  40. Miller, G.A. The Magical Number Seven, Plus or Minus Two. Some Limits on Our Capacity for Processing Information. Psychol. Rev. 1955, 101, 343–352. [Google Scholar] [CrossRef]
  41. Crowder, R.G. Short–term memory: Where do we stand? Mem. Cogn. 1993, 21, 142–145. [Google Scholar] [CrossRef]
  42. Lisman, J.E.; Idiart, M.A.P. Storage of 7 ± 2 Short–Term Memories in Oscillatory Subcycles. Science 1995, 267, 1512–1515. [Google Scholar] [CrossRef]
  43. Cowan, N. The magical number 4 in short−term memory: A reconsideration of mental storage capacity. Behav. Brain Sci. 2000, 24, 87–114. [Google Scholar] [CrossRef]
  44. Bachelder, B.L. The Magical Number 7 ± 2: Span Theory on Capacity Limitations. Behav. Brain Sci. 2001, 24, 116–117. [Google Scholar] [CrossRef]
  45. Saaty, T.L.; Ozdemir, M.S. Why the Magic Number Seven Plus or Minus Two. Math. Comput. Model. 2003, 38, 233–244. [Google Scholar] [CrossRef]
  46. Burgess, N.; Hitch, G.J. A revised model of short–term memory and long–term learning of verbal sequences. J. Mem. Lang. 2006, 55, 627–652. [Google Scholar] [CrossRef]
  47. Richardson, J.T.E. Measures of Short–Term Memory: A Historical Review. Cortex 2007, 43, 635–650. [Google Scholar] [CrossRef] [PubMed]
  48. Mathy, F.; Feldman, J. What’s magic about magic numbers? Chunking and data compression in short−term memory. Cognition 2012, 122, 346–362. [Google Scholar] [CrossRef] [PubMed]
  49. Melton, A.W. Implications of Short–Term Memory for a General Theory of Memory. J. Verbal Learn. Verbal Behav. 1963, 2, 1–21. [Google Scholar] [CrossRef]
  50. Atkinson, R.C.; Shiffrin, R.M. The Control of Short–Term Memory. Sci. Am. 1971, 225, 82–91. [Google Scholar] [CrossRef]
  51. Murdock, B.B. Short–Term Memory. Psychol. Learn. Motiv. 1972, 5, 67–127. [Google Scholar]
  52. Baddeley, A.D.; Thomson, N.; Buchanan, M. Word Length and the Structure of Short−Term Memory. J. Verbal Learn. Verbal Behav. 1975, 14, 575–589. [Google Scholar] [CrossRef]
  53. Case, R.; Midian Kurland, D.; Goldberg, J. Operational efficiency and the growth of short–term memory span. J. Exp. Child Psychol. 1982, 33, 386–404. [Google Scholar] [CrossRef]
  54. Grondin, S. A temporal account of the limited processing capacity. Behav. Brain Sci. 2000, 24, 122–123. [Google Scholar] [CrossRef]
  55. Pothos, E.M.; Joula, P. Linguistic structure and short−term memory. Behav. Brain Sci. 2000, 124, 138–139. [Google Scholar]
  56. Conway, A.R.A.; Cowan, N.; Michael, F.; Bunting, M.F.; Therriaulta, D.J.; Minkoff, S.R.B. A latent variable analysis of working memory capacity, short−term memory capacity, processing speed, and general fluid intelligence. Intelligence 2002, 30, 163–183. [Google Scholar] [CrossRef]
  57. Jonides, J.; Lewis, R.L.; Nee, D.E.; Lustig, C.A.; Berman, M.G.; Moore, K.S. The Mind and Brain of Short–Term Memory. Annu. Rev. Psychol. 2008, 69, 193–224. [Google Scholar] [CrossRef] [PubMed]
  58. Potter, M.C. Conceptual short–term memory in perception and thought. Front. Psychol. 2012, 3, 113. [Google Scholar] [CrossRef]
  59. Jones, G.; Macken, B. Questioning short−term memory and its measurements: Why digit span measures long−term associative learning. Cognition 2015, 144, 1–13. [Google Scholar] [CrossRef]
  60. Chekaf, M.; Cowan, N.; Mathy, F. Chunk formation in immediate memory and how it relates to data compression. Cognition 2016, 155, 96–107. [Google Scholar] [CrossRef]
  61. Norris, D. Short–Term Memory and Long–Term Memory Are Still Different. Psychol. Bull. 2017, 143, 992–1009. [Google Scholar] [CrossRef]
  62. Houdt, G.V.; Mosquera, C.; Napoles, G. A review on the long short–term memory model. Artif. Intell. Rev. 2020, 53, 5929–5955. [Google Scholar] [CrossRef]
  63. Islam, M.; Sarkar, A.; Hossain, M.; Ahmed, M.; Ferdous, A. Prediction of Attention and Short–Term Memory Loss by EEG Workload Estimation. J. Biosci. Med. 2023, 11, 304–318. [Google Scholar] [CrossRef]
  64. Matricciani, E. Readability across Time and Languages: The Case of Matthew’s Gospel Translations. AppliedMath 2023, 3, 497–509. [Google Scholar] [CrossRef]
  65. Abramovitz, M.; Stegun, I.A. Handbook of Mathematical Formulas, 9th ed.; Dover publications: New York, NY, USA, 1972. [Google Scholar]
Figure 1. (a) Difference and percentage change in paragraphs per chapter in the two texts; (b) scatterplot comparing the two variables. The sample size is 39.
Figure 1. (a) Difference and percentage change in paragraphs per chapter in the two texts; (b) scatterplot comparing the two variables. The sample size is 39.
Appliedmath 05 00024 g001
Figure 2. (a) Scatterplot comparing characters per chapter; (b) scatterplot comparing words per chapter.
Figure 2. (a) Scatterplot comparing characters per chapter; (b) scatterplot comparing words per chapter.
Appliedmath 05 00024 g002
Figure 3. (a) Scatterplot comparing periods per chapter; (b) scatterplot comparing sentences per chapter. Blue indicates the comparison with the original Italian text; red indicates the comparison with the Italian text in which semicolons are replaced by periods.
Figure 3. (a) Scatterplot comparing periods per chapter; (b) scatterplot comparing sentences per chapter. Blue indicates the comparison with the original Italian text; red indicates the comparison with the Italian text in which semicolons are replaced by periods.
Appliedmath 05 00024 g003
Figure 4. (a) Scatterplot comparing question marks per chapter; (b) scatterplot comparing exclamation marks per chapter.
Figure 4. (a) Scatterplot comparing question marks per chapter; (b) scatterplot comparing exclamation marks per chapter.
Appliedmath 05 00024 g004
Figure 5. (a) Scatterplot comparing semicolons per chapter; (b) scatterplot comparing exclamation marks per chapter.
Figure 5. (a) Scatterplot comparing semicolons per chapter; (b) scatterplot comparing exclamation marks per chapter.
Appliedmath 05 00024 g005
Figure 6. (a) Scatterplot comparing commas per chapter; (b) scatterplot comparing total interpunctions.
Figure 6. (a) Scatterplot comparing commas per chapter; (b) scatterplot comparing total interpunctions.
Appliedmath 05 00024 g006
Figure 7. Normalized coordinates X and Y of the ending point of vector (5) such that The Lord of the Rings, blue square, is at (0,0) and The Silmarillion, blue triangle pointing left, is (1,1). The Chronicles of Narnia: red square; The Space Trilogy: red circle; The Hobbit: blue triangle pointing right; The Screwtape Letters: red triangle pointing upward; At the Back of the North Wind: cyan triangle pointing left; Lilith: A Romance: cyan triangle pointing downward; Phantastes: A Fairie Romance for Men and Women: cyan triangle pointing right; The Little Princess: cyan triangle pointing upward; Oliver Twist: blue circle; David Copperfield: green circle; A Tale of Two Cities: cyan circle; Bleak House: magenta circle; Our Mutual Friend: black circle; Pride and Prejudice: magenta triangle pointing right; Vanity Fair: magenta triangle pointing left; Moby Dick: magenta triangle pointing downward; The Mill On The Floss: magenta triangle pointing upward; Alice’s Adventures in Wonderland: yellow triangle pointing right; The Jungle Book: yellow triangle pointing downward; The War of the Worlds: yellow triangle pointing right; The Wonderful Wizard of Oz: green triangle pointing left; The Hound of The Baskervilles: green triangle pointing right; Peter Pan: green triangle pointing upward; Martin Eden: green square; Adventures of Huckleberry Finn: black triangle pointing right. See [26] for the full details on the novels. Texts considered in the present paper: I Promessi Sposi: green diamond; The Betrothed: blue diamond; Ivanohe: green “×”.
Figure 7. Normalized coordinates X and Y of the ending point of vector (5) such that The Lord of the Rings, blue square, is at (0,0) and The Silmarillion, blue triangle pointing left, is (1,1). The Chronicles of Narnia: red square; The Space Trilogy: red circle; The Hobbit: blue triangle pointing right; The Screwtape Letters: red triangle pointing upward; At the Back of the North Wind: cyan triangle pointing left; Lilith: A Romance: cyan triangle pointing downward; Phantastes: A Fairie Romance for Men and Women: cyan triangle pointing right; The Little Princess: cyan triangle pointing upward; Oliver Twist: blue circle; David Copperfield: green circle; A Tale of Two Cities: cyan circle; Bleak House: magenta circle; Our Mutual Friend: black circle; Pride and Prejudice: magenta triangle pointing right; Vanity Fair: magenta triangle pointing left; Moby Dick: magenta triangle pointing downward; The Mill On The Floss: magenta triangle pointing upward; Alice’s Adventures in Wonderland: yellow triangle pointing right; The Jungle Book: yellow triangle pointing downward; The War of the Worlds: yellow triangle pointing right; The Wonderful Wizard of Oz: green triangle pointing left; The Hound of The Baskervilles: green triangle pointing right; Peter Pan: green triangle pointing upward; Martin Eden: green square; Adventures of Huckleberry Finn: black triangle pointing right. See [26] for the full details on the novels. Texts considered in the present paper: I Promessi Sposi: green diamond; The Betrothed: blue diamond; Ivanohe: green “×”.
Appliedmath 05 00024 g007
Figure 8. Normalized coordinates X and Y of the ending point of vector (5) such that Lord of the Rings is at (0,0) and Silmarillion is at (1,1), zoomed in. Italian text: green diamond; its translation: blue diamond; Italian text with semicolons transformed in periods (Italian–E): red diamond; English text with periods transformed in semicolons (English–I): cyan diamond. The position of Ivanohe is reported with a green “×” (original text) and with a red “×” if semicolons are replaced by periods. The green, red and blue circles represent 1–sigma contour lines discussed in the main text.
Figure 8. Normalized coordinates X and Y of the ending point of vector (5) such that Lord of the Rings is at (0,0) and Silmarillion is at (1,1), zoomed in. Italian text: green diamond; its translation: blue diamond; Italian text with semicolons transformed in periods (Italian–E): red diamond; English text with periods transformed in semicolons (English–I): cyan diamond. The position of Ivanohe is reported with a green “×” (original text) and with a red “×” if semicolons are replaced by periods. The green, red and blue circles represent 1–sigma contour lines discussed in the main text.
Appliedmath 05 00024 g008
Figure 9. I P versus P F . Italian: green circles; English: blue circles; Italian–E: red circles. Miller’s range is between I P = 5 and I P = 9 (this latter limit is not explicitly drawn because it coincides with the largest ordinate value).
Figure 9. I P versus P F . Italian: green circles; English: blue circles; Italian–E: red circles. Miller’s range is between I P = 5 and I P = 9 (this latter limit is not explicitly drawn because it coincides with the largest ordinate value).
Appliedmath 05 00024 g009
Figure 10. M F versus I P . Italian: green circles; English: blue circles; Italian–E: red circles.
Figure 10. M F versus I P . Italian: green circles; English: blue circles; Italian–E: red circles.
Appliedmath 05 00024 g010
Figure 11. Scatterplot comparing the theoretical number of sentences per chapter allowed in Italian and that in English: blue circles. Scatterplot comparing Italian–E and English: red circles.
Figure 11. Scatterplot comparing the theoretical number of sentences per chapter allowed in Italian and that in English: blue circles. Scatterplot comparing Italian–E and English: red circles.
Appliedmath 05 00024 g011
Figure 12. Scatterplot comparing the multiplicity factor α in Italian and in English: blue circles. Scatterplot comparing Italian–E and in English: red circles.
Figure 12. Scatterplot comparing the multiplicity factor α in Italian and in English: blue circles. Scatterplot comparing Italian–E and in English: red circles.
Appliedmath 05 00024 g012
Figure 13. Scatterplot comparing the mismatch factor I M in Italian and in English: blue circles. Scatterplot comparing Italian–E and in English: red circles.
Figure 13. Scatterplot comparing the mismatch factor I M in Italian and in English: blue circles. Scatterplot comparing Italian–E and in English: red circles.
Appliedmath 05 00024 g013
Figure 14. (a) Scatterplot comparing G U in the two texts: blue indicates the comparison with Italian; red indicates the comparison with the Italian–E. (b) Chart to estimate the number of school years for a given G U , according to text reading difficulty.
Figure 14. (a) Scatterplot comparing G U in the two texts: blue indicates the comparison with Italian; red indicates the comparison with the Italian–E. (b) Chart to estimate the number of school years for a given G U , according to text reading difficulty.
Appliedmath 05 00024 g014
Table 1. Statistics of the translation of the beginning paragraph of I Promessi Sposi (The Betrothed).
Table 1. Statistics of the translation of the beginning paragraph of I Promessi Sposi (The Betrothed).
ParagraphsWordsPeriods
(Sentences)
CommasSemicolonsColons
Italian1682910395
English57012358 + 810
Table 2. Number of paragraphs, words per paragraph, characters contained in the words, words, and sentences. Italian refers to the original, Manzoni’s novel; English refers to Moore’s translation; Italian–E refers to the case in which all semicolons of the Italian novel are replaced with periods; English–I refers to the translation in which all periods are replaced by semicolons (see main text).
Table 2. Number of paragraphs, words per paragraph, characters contained in the words, words, and sentences. Italian refers to the original, Manzoni’s novel; English refers to Moore’s translation; Italian–E refers to the case in which all semicolons of the Italian novel are replaced with periods; English–I refers to the translation in which all periods are replaced by semicolons (see main text).
ParagraphsWords
per Paragraph
CharactersCharacters
per Word
WordsWords
per Sentence
Sentences
Italian273282.11,036,5604.62224,23421.110,627
Italian–E273282.11,036,5604.62224,23415.314,647
English302977.51,022,2394.36234,64616.014,676
English–I302977.51,022,2394.36234,64630.67672
Table 3. Number of periods, question marks, exclamation marks, commas, semicolons, colons, and total interpunctions. Italian refers to the original, Manzoni’s novel; English refers to Moore’s translation.
Table 3. Number of periods, question marks, exclamation marks, commas, semicolons, colons, and total interpunctions. Italian refers to the original, Manzoni’s novel; English refers to Moore’s translation.
PeriodsQuestion MarksExclamation MarksCommasSemicolonsColonsInterpunctions
Italian77951335149726,3164020263343,596
English11,6921558142620,00326354035,482
Table 4. Correlation coefficient and slope of the scatterplots comparing Italian and English variables (samples refer to chapters) for which a linear relationship fits the data.
Table 4. Correlation coefficient and slope of the scatterplots comparing Italian and English variables (samples refer to chapters) for which a linear relationship fits the data.
Linguistic VariableCorrelation CoefficientSlope
Characters0.99730.9861
Paragraphs0.98091.0897
Words0.99671.0470
Sentences0.97711.3704
Sentences (Italian–E)0.98681.0047
Question Marks0.98201.1581
Exclamation Marks0.92370.9215
Commas0.94770.7572
Interpunctions0.98260.8143
Table 5. Mean values and standard deviation of the indicated deep language variables. Italian refers to the original, Manzoni’s novel; English refers to Moore’s translation; Italian–E refers to the case in which all semicolons of the Italian novel are replaced by periods; English–I refers to the translation in which the periods are replaced by semicolons.
Table 5. Mean values and standard deviation of the indicated deep language variables. Italian refers to the original, Manzoni’s novel; English refers to Moore’s translation; Italian–E refers to the case in which all semicolons of the Italian novel are replaced by periods; English–I refers to the translation in which the periods are replaced by semicolons.
< C p > < P F > < I P > < M F >
Italian4.62
0.12
22.7
6.24
5.18
0.41
4.34
1.03
Italian–E4.62
0.12
16.01
3.57
5.18
0.41
3.07
0.50
English4.36
0.15
16.82
4.56
6.66
0.56
2.50
0.43
English–I4.36
0.15
30.30
––
6.66
0.56
4.53
––
Table 6. Correlation and slope of the regression line y = m x . Four digits are reported because some channels differ only in the third digit.
Table 6. Correlation and slope of the regression line y = m x . Four digits are reported because some channels differ only in the third digit.
TextS–Channel
Sentences vs. Words
I–Channel
Words vs. Interpunctions
WI–Channel
Word Intervals vs. Sentences
C–Channel
Characters vs. Words
Correlation CoefficientSlopeCorrelation CoefficientSlopeCorrelation CoefficientSlopeCorrelation CoefficientSlope
Italian0.59780.04700.93575.12200.78633.93740.99264.6263
Italian–E0.69800.06490.93575.12200.85732.90700.99264.6263
English0.71060.06230.93226.57850.89322.35870.98844.3560
Table 7. Signal–to–noise ratio Γ (dB) in the indicated channels.
Table 7. Signal–to–noise ratio Γ (dB) in the indicated channels.
S–ChannelI–ChannelWI–ChannelC–Channel
ItaIta–MEngItaIta–MEngItaIta–MEngItaIta–MEng
Italian10.6911.3513.098.112.5023.08
Italian–E7.4826.8113.0911.1312.0423.08
English8.3627.2210.9110.917.5614.0623.7123.71
Table 8. Correlation and slope of the regression line y = m x , English versus Italian and Italian–E. Signal–to–noise ratio Γ (dB) and γ (linear) of the indicated linguistic channels.
Table 8. Correlation and slope of the regression line y = m x , English versus Italian and Italian–E. Signal–to–noise ratio Γ (dB) and γ (linear) of the indicated linguistic channels.
Variable Γ (dB) γ
Characters22.62182.97
Paragraphs12.6218.27
Words20.23105.49
Sentences6.454.42
Sentences (Italian–E)15.6536.75
Question Marks11.2713.40
Exclamation Marks8.176.57
Commas9.078.07
Interpunctions12.3517.19
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Matricciani, E. Translation Can Distort the Linguistic Parameters of Source Texts Written in Inflected Language: Multidimensional Mathematical Analysis of “The Betrothed”, a Translation in English of “I Promessi Sposi” by A. Manzoni. AppliedMath 2025, 5, 24. https://doi.org/10.3390/appliedmath5010024

AMA Style

Matricciani E. Translation Can Distort the Linguistic Parameters of Source Texts Written in Inflected Language: Multidimensional Mathematical Analysis of “The Betrothed”, a Translation in English of “I Promessi Sposi” by A. Manzoni. AppliedMath. 2025; 5(1):24. https://doi.org/10.3390/appliedmath5010024

Chicago/Turabian Style

Matricciani, Emilio. 2025. "Translation Can Distort the Linguistic Parameters of Source Texts Written in Inflected Language: Multidimensional Mathematical Analysis of “The Betrothed”, a Translation in English of “I Promessi Sposi” by A. Manzoni" AppliedMath 5, no. 1: 24. https://doi.org/10.3390/appliedmath5010024

APA Style

Matricciani, E. (2025). Translation Can Distort the Linguistic Parameters of Source Texts Written in Inflected Language: Multidimensional Mathematical Analysis of “The Betrothed”, a Translation in English of “I Promessi Sposi” by A. Manzoni. AppliedMath, 5(1), 24. https://doi.org/10.3390/appliedmath5010024

Article Metrics

Back to TopTop