Next Article in Journal
Exploiting Multiple Detections for Person Re-Identification
Next Article in Special Issue
Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames
Previous Article in Journal
Studies of Ancient Russian Cultural Objects Using the Neutron Tomography Method
Previous Article in Special Issue
Transcription of Spanish Historical Handwritten Documents with Deep Neural Networks
Open AccessArticle

A New Binarization Algorithm for Historical Documents

1
Departamento de Eletrônica e Sistemas, Centro de Tecnologia, Universidade Federal de Pernambuco, Recife-PE 50670-901, Brazil
2
Centro de Informática, Universidade Federal de Pernambuco, Recife-PE 50740-560, Brazil
3
Departamento de Estatística e Informática, Universidade Federal Rural de Pernambuco, Recife-PE 52171-900, Brazil
4
Programa de Pós-Graduação em Engenharia Elétrica, Universidade Federal de Pernambuco, Recife-PE 50670-901, Brazil
*
Author to whom correspondence should be addressed.
J. Imaging 2018, 4(2), 27; https://doi.org/10.3390/jimaging4020027
Received: 31 October 2017 / Revised: 16 January 2018 / Accepted: 16 January 2018 / Published: 23 January 2018
(This article belongs to the Special Issue Document Image Processing)
Monochromatic documents claim for much less computer bandwidth for network transmission and storage space than their color or even grayscale equivalent. The binarization of historical documents is far more complex than recent ones as paper aging, color, texture, translucidity, stains, back-to-front interference, kind and color of ink used in handwriting, printing process, digitalization process, etc. are some of the factors that affect binarization. This article presents a new binarization algorithm for historical documents. The new global filter proposed is performed in four steps: filtering the image using a bilateral filter, splitting image into the RGB components, decision-making for each RGB channel based on an adaptive binarization method inspired by Otsu’s method with a choice of the threshold level, and classification of the binarized images to decide which of the RGB components best preserved the document information in the foreground. The quantitative and qualitative assessment made with 23 binarization algorithms in three sets of “real world” documents showed very good results. View Full-Text
Keywords: documents; binarization; back-to-front interference; bleeding documents; binarization; back-to-front interference; bleeding
Show Figures

Figure 1

MDPI and ACS Style

Almeida, M.; Lins, R.D.; Bernardino, R.; Jesus, D.; Lima, B. A New Binarization Algorithm for Historical Documents. J. Imaging 2018, 4, 27.

Show more citation formats Show less citations formats
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Search more from Scilit
 
Search
Back to TopTop