Open Access: This article is freely available.
Readability and the Web
Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz 55128, Germany
Institute for Web Science and Technologies, Universität Koblenz-Landau, Koblenz 56070, Germany
* Author to whom correspondence should be addressed.
Received: 20 December 2011; in revised form: 7 February 2012 / Accepted: 5 March 2012 / Published: 12 March 2012
Abstract: Readability indices measure how easy or difficult it is to read and comprehend a text. In this paper, we examine the relation between readability indices and web documents from two perspectives. First, we analyse how to measure the readability of web documents reliably by applying content extraction techniques and incorporating a bias correction. Second, we investigate how web-based corpus statistics can be used to measure readability in a novel, language-independent way.
Keywords: web document readability; content extraction; corpus statistics
Cite This Article
MDPI and ACS Style
Martin, L.; Gottron, T. Readability and the Web. Future Internet 2012, 4, 238-252.
Martin L, Gottron T. Readability and the Web. Future Internet. 2012; 4(1):238-252.
Martin, Ludger; Gottron, Thomas. 2012. "Readability and the Web." Future Internet 4, no. 1: 238-252.