CPL - Chalmers Publication Library
| Utbildning | Forskning | Styrkeområden | Om Chalmers | In English In English Ej inloggad.

A method for document image binarization based on histogram matching and repeated contrast enhancement

Mattias Wahde (Institutionen för tillämpad mekanik, Fordonsteknik och autonoma system)
ICAART 2014 - Proceedings of the 6th International Conference on Agents and Artificial Intelligence Angers, Loire Valley; France; 6 March 2014 through 8 March 2014 Vol. 1 (2014), p. 34-41.
[Konferensbidrag, refereegranskat]

In this paper, a new method for binarization of document images is introduced. During training, the method stores histograms from training images (divided into small tiles), along with the optimal binarization threshold. Training image tiles are presented in pairs, one noisy version and one clean binarized version, where the latter is used for finding the optimal binarization threshold. During use, the method considers the tiles of an image one by one. It matches the stored histograms to the histogram for the tile that is to be binarized. If a sufficiently close match is found, the tile is binarized using the corresponding threshold associated with the stored histogram. If no match is found, the contrast of the tile is slightly enhanced, and a new attempt is made. This sequence is repeated until either a match is found, or a (rare) timeout is reached. The method has been applied to a set of test images, and has been shown to outperform several comparable methods.

Nyckelord: Document image binarization; Image processing

Denna post skapades 2014-07-02.
CPL Pubid: 200035


Institutioner (Chalmers)

Institutionen för tillämpad mekanik, Fordonsteknik och autonoma system


Teknisk mekanik

Chalmers infrastruktur