2002
Conference article  Restricted

An integrated system for the analysis and the recognition of characters in ancient documents

Vezzosi S, Bedini L, Tonazzini A

Characters recognition 

This paper describes an integrated system for processing and analyzing highly degraded ancient printed documents. For each page, the system reduces noise by wavelet-based filtering, extracts and segments the text lines into characters by a fast adaptive thresholding, and performs OCR by a feed-forward back-propagation multilayer neural network. The probability recognition is used as a discriminant parameter for determining the automatic activation of a feed-back process, leading back to a block for refining segmentation. This block acts only on the small portions of the text where the recognition was not trustable, and makes use of blind deconvolution and MRF-based segmentation techniques. The experimental results highlight the good performance of the whole system in the analysis of even strongly degraded texts.



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:91532,
	title = {An integrated system for the analysis and the recognition of characters in ancient documents},
	author = {Vezzosi S and Bedini L and Tonazzini A},
	year = {2002}
}