Sensors 2010, 10(5), 5263-5279; doi:10.3390/s100505263
Article

Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction

1,* email, 2email and 3email
Received: 9 April 2010; in revised form: 13 May 2010 / Accepted: 14 May 2010 / Published: 25 May 2010
(This article belongs to the Section Physical Sensors)
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract: Text line segmentation is an essential stage in off-line optical character recognition (OCR) systems. It is a key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting. Hence, text line segmentation is a leading challenge in handwritten document image processing. Due to inconsistencies in measurement and evaluation of text segmentation algorithm quality, some basic set of measurement methods is required. Currently, there is no commonly accepted one and all algorithm evaluation is custom oriented. In this paper, a basic test framework for the evaluation of text feature extraction algorithms is proposed. This test framework consists of a few experiments primarily linked to text line segmentation, skew rate and reference text line evaluation. Although they are mutually independent, the results obtained are strongly cross linked. In the end, its suitability for different types of letters and languages as well as its adaptability are its main advantages. Thus, the paper presents an efficient evaluation method for text analysis algorithms.
Keywords: OCR; document engineering; text line segmentation; text features; testing
PDF Full-text Download PDF Full-Text [360 KB, uploaded 21 June 2014 03:02 CEST]

Export to BibTeX |
EndNote


MDPI and ACS Style

Brodić, D.; Milivojević, D.R.; Milivojević, Z. Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction. Sensors 2010, 10, 5263-5279.

AMA Style

Brodić D, Milivojević DR, Milivojević Z. Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction. Sensors. 2010; 10(5):5263-5279.

Chicago/Turabian Style

Brodić, Darko; Milivojević, Dragan R.; Milivojević, Zoran. 2010. "Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction." Sensors 10, no. 5: 5263-5279.

Sensors EISSN 1424-8220 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert