Vezzosi S, Bedini L, Tonazzini A
Characters recognition
This paper describes an integrated system for processing and analyzing highly degraded ancient printed documents. For each page, the system reduces noise by wavelet-based filtering, extracts and segments the text lines into characters by a fast adaptive thresholding, and performs OCR by a feed-forward back-propagation multilayer neural network. The probability recognition is used as a discriminant parameter for determining the automatic activation of a feed-back process, leading back to a block for refining segmentation. This block acts only on the small portions of the text where the recognition was not trustable, and makes use of blind deconvolution and MRF-based segmentation techniques. The experimental results highlight the good performance of the whole system in the analysis of even strongly degraded texts.
@inproceedings{oai:it.cnr:prodotti:91532, title = {An integrated system for the analysis and the recognition of characters in ancient documents}, author = {Vezzosi S and Bedini L and Tonazzini A}, year = {2002} }