12 pages, 294 KB  
Article
The Amount of Data Required to Recognize a Writer’s Style Is Consistent Across Different Languages of the World
by Boris Ryabko, Nadezhda Savina, Yeshewas Getachew Lulu and Yunfei Han
Entropy 2025, 27(10), 1039; https://doi.org/10.3390/e27101039 - 4 Oct 2025
Abstract
In this paper, we apply an information-theoretic method proposed by Ryabko and Savina (hence called the RS-method), which is based on data compression, to recognize a writer's individual style in four languages from different language groups and families. The method was used to study fiction texts in Russian (East Slavic group of the Indo-European language family), Amharic (South Ethiosemitic group of the Semitic language family), Chinese (Sinitic group of the Sino-Tibetan language family) and English (West Germanic group of the Indo-European language family). It was found that the amount of data needed to recognize an author's style is almost the same for all four languages, i.e., the amount of data is invariant across different language groups. The results are of interest to computer science, literary studies, linguistics and, in particular, computational linguistics.
(This article belongs to the Section Information Theory, Probability and Statistics)
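The abstract says only that the RS-method is based on data compression; it does not give the procedure. As a rough illustration of the general idea behind compression-based author recognition, and not the RS-method itself, the sketch below attributes an unknown text to the candidate whose known sample lets a compressor encode it most cheaply. The function names, the choice of zlib, and the toy strings are all assumptions made for this example.

```python
import zlib


def compressed_size(text: str) -> int:
    """Size in bytes of the zlib-compressed UTF-8 encoding of text."""
    return len(zlib.compress(text.encode("utf-8"), 9))


def extra_bytes(known: str, unknown: str) -> int:
    """Additional compressed bytes needed to append unknown to known.

    A small value suggests the compressor could reuse patterns from the
    known sample when encoding the unknown text.
    """
    return compressed_size(known + " " + unknown) - compressed_size(known)


def attribute(samples: dict[str, str], unknown: str) -> str:
    """Return the candidate label whose sample yields the fewest extra bytes."""
    return min(samples, key=lambda name: extra_bytes(samples[name], unknown))


# Toy inputs, invented for illustration only (not data from the paper).
samples = {
    "author_a": (
        "The river ran slow beneath the old stone bridge, and the old men "
        "of the village watched it run, saying nothing, as they had "
        "watched it every evening of their long lives."
    ),
    "author_b": (
        "Quarterly throughput rose sharply after the firmware upgrade; "
        "packet loss dropped below one percent, jitter stabilized, and "
        "uptime figures exceeded every forecast."
    ),
}
# Deliberately contrived: the unknown fragment repeats a phrase from author_a.
unknown = "the old men of the village watched it run, saying nothing"

guess = attribute(samples, unknown)
print(guess)
```

With realistic texts the difference between candidates is much smaller than in this contrived case. How much text is needed before such a decision becomes reliable is the question the paper studies across the four languages.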