Next Article in Journal
Web-Based Scientific Exploration and Analysis of 3D Scanned Cuneiform Datasets for Collaborative Research
Next Article in Special Issue
A Data Quality Strategy to Enable FAIR, Programmatic Access across Large, Diverse Data Collections for High Performance Data Analysis
Previous Article in Journal
Requirements and Pitfalls in AAL Projects. Guide to Self-Criticism for Developers from Experience
Previous Article in Special Issue
Big Data in the Era of Health Information Exchanges: Challenges and Opportunities for Public Health
Article Menu

Export Article

Open AccessArticle
Informatics 2017, 4(4), 43; https://doi.org/10.3390/informatics4040043

Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles

Department of Information Systems, Poznań University of Economics and Business, 61-875 Poznań, Poland
Current address: al. Niepodległości 10, 61-875 Poznań, Poland
These authors contributed equally to this work.
*
Author to whom correspondence should be addressed.
Academic Editors: Mouzhi Ge and Vlastislav Dohnal
Received: 21 September 2017 / Revised: 26 November 2017 / Accepted: 2 December 2017 / Published: 8 December 2017
(This article belongs to the Special Issue Quality Management in Big Data)
Full-Text   |   PDF [1857 KB, uploaded 9 December 2017]   |  

Abstract

Despite the fact that Wikipedia is often criticized for its poor quality, it continues to be one of the most popular knowledge bases in the world. Articles in this free encyclopedia on various topics can be created and edited in about 300 different language versions independently. Our research has showed that in language sensitive topics, the quality of information can be relatively better in the relevant language versions. However, in most cases, it is difficult for the Wikipedia readers to determine the language affiliation of the described subject. Additionally, each language edition of Wikipedia can have own rules in the manual assessing of the content’s quality. There are also differences in grading schemes between language versions: some use a 6–8 grade system to assess articles, and some are limited to 2–3. This makes automatic quality comparison of articles between various languages a challenging task, particularly if we take into account a large number of unassessed articles; some of the Wikipedia language editions have over 99% of articles without a quality grade. The paper presents the results of a relative quality and popularity assessment of over 28 million articles in 44 selected language versions. Comparative analysis of the quality and the popularity of articles in popular topics was also conducted. Additionally, the correlation between quality and popularity of Wikipedia articles of selected topics in various languages was investigated. The proposed method allows us to find articles with information of better quality that can be used to automatically enrich other language editions of Wikipedia. View Full-Text
Keywords: Wikipedia; information quality; WikiRank; DBpedia Wikipedia; information quality; WikiRank; DBpedia
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).
SciFeed

Share & Cite This Article

MDPI and ACS Style

Lewoniewski, W.; Węcel, K.; Abramowicz, W. Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles. Informatics 2017, 4, 43.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Informatics EISSN 2227-9709 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top