Future Internet 2012, 4(1), 238-252; doi:10.3390/fi4010238
Article

Readability and the Web

Ludger Martin and Thomas Gottron
Received: 20 December 2011; in revised form: 7 February 2012 / Accepted: 5 March 2012 / Published: 12 March 2012
(This article belongs to the Special Issue Selected Papers from ITA 11)
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract: Readability indices measure how easy or difficult it is to read and comprehend a text. In this paper, we look at the relation between readability indices and web documents from two different perspectives. On the one hand, we analyse how to reliably measure the readability of web documents by applying content extraction techniques and incorporating a bias correction. On the other hand, we investigate how web-based corpus statistics can be used to measure readability in a novel and language-independent way.
Keywords: web document readability; content extraction; corpus statistics

MDPI and ACS Style

Martin, L.; Gottron, T. Readability and the Web. Future Internet 2012, 4, 238-252.

AMA Style

Martin L, Gottron T. Readability and the Web. Future Internet. 2012; 4(1):238-252.

Chicago/Turabian Style

Martin, Ludger; Gottron, Thomas. 2012. "Readability and the Web." Future Internet 4, no. 1: 238-252.

Future Internet, EISSN 1999-5903. Published by MDPI AG, Basel, Switzerland.