Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles
Department of Information Systems, Poznań University of Economics and Business, 61-875 Poznań, Poland
*
Author to whom correspondence should be addressed.
†
Current address: al. Niepodległości 10, 61-875 Poznań, Poland
‡
These authors contributed equally to this work.
Academic Editors: Mouzhi Ge and Vlastislav Dohnal
Informatics 2017, 4(4), 43; https://doi.org/10.3390/informatics4040043
Received: 21 September 2017 / Revised: 26 November 2017 / Accepted: 2 December 2017 / Published: 8 December 2017
(This article belongs to the Special Issue Quality Management in Big Data)
Despite the fact that Wikipedia is often criticized for its poor quality, it continues to be one of the most popular knowledge bases in the world. Articles in this free encyclopedia on various topics can be created and edited in about 300 different language versions independently. Our research has showed that in language sensitive topics, the quality of information can be relatively better in the relevant language versions. However, in most cases, it is difficult for the Wikipedia readers to determine the language affiliation of the described subject. Additionally, each language edition of Wikipedia can have own rules in the manual assessing of the content’s quality. There are also differences in grading schemes between language versions: some use a 6–8 grade system to assess articles, and some are limited to 2–3. This makes automatic quality comparison of articles between various languages a challenging task, particularly if we take into account a large number of unassessed articles; some of the Wikipedia language editions have over 99% of articles without a quality grade. The paper presents the results of a relative quality and popularity assessment of over 28 million articles in 44 selected language versions. Comparative analysis of the quality and the popularity of articles in popular topics was also conducted. Additionally, the correlation between quality and popularity of Wikipedia articles of selected topics in various languages was investigated. The proposed method allows us to find articles with information of better quality that can be used to automatically enrich other language editions of Wikipedia.
View Full-Text
Keywords:
Wikipedia; information quality; WikiRank; DBpedia
▼
Show Figures
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited
MDPI and ACS Style
Lewoniewski, W.; Węcel, K.; Abramowicz, W. Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles. Informatics 2017, 4, 43. https://doi.org/10.3390/informatics4040043
AMA Style
Lewoniewski W, Węcel K, Abramowicz W. Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles. Informatics. 2017; 4(4):43. https://doi.org/10.3390/informatics4040043
Chicago/Turabian StyleLewoniewski, Włodzimierz; Węcel, Krzysztof; Abramowicz, Witold. 2017. "Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles" Informatics 4, no. 4: 43. https://doi.org/10.3390/informatics4040043
Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.
Search more from Scilit