Next Article in Journal / Special Issue
Secure Military Social Networking and Rapid Sensemaking in Domain Specific Concept Systems: Research Issues and Future Solutions
Previous Article in Journal
Using Web 2.0 Technologies for Collaborative Learning in Distance Education—Case Studies from an Australian University
Future Internet 2012, 4(1), 238-252; doi:10.3390/fi4010238

Readability and the Web

1,*  and 2
1 Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz 55128, Germany 2 Institute for Web Science and Technologies, Universität Koblenz-Landau, Koblenz 56070, Germany
* Author to whom correspondence should be addressed.
Received: 20 December 2011 / Revised: 7 February 2012 / Accepted: 5 March 2012 / Published: 12 March 2012
(This article belongs to the Special Issue Selected Papers from ITA 11)
View Full-Text   |   Download PDF [268 KB, uploaded 12 March 2012]   |  


Readability indices measure how easy or difficult it is to read and comprehend a text. In this paper we look at the relation between readability indices and web documents from two different perspectives. On the one hand we analyse how to reliably measure the readability of web documents by applying content extraction techniques and incorporating a bias correction. On the other hand we investigate how web based corpus statistics can be used to measure readability in a novel and language independent way.
Keywords: web document readability; content extraction; corpus statistics web document readability; content extraction; corpus statistics
This is an open access article distributed under the Creative Commons Attribution License (CC BY 3.0).

Share & Cite This Article

Further Mendeley | CiteULike
Export to BibTeX |
EndNote |
MDPI and ACS Style

Martin, L.; Gottron, T. Readability and the Web. Future Internet 2012, 4, 238-252.

View more citation formats

Article Metrics

For more information on the journal, click here


[Return to top]
Future Internet EISSN 1999-5903 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert