2007
Journal article  Closed Access

Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique

Tonazzini A., Salerno E., Bedini L.

Computer Science Applications  Computer Vision and Pattern Recognition  Blind source separation  Software  I.4.4 Restoration 

Ancient documents are usually degraded by the presence of strong background artifacts. These are often caused by the so-called bleed-through effect, a pattern that interferes with the main text due to seeping of ink from the reverse side. A similar effect, called show-through and due to the nonperfect opacity of the paper, may appear in scans of even modern, well-preserved documents. These degradations must be removed to improve human or automatic readability. For this purpose, when a color scan of the document is available, we have shown that a simplified linear pattern overlapping model allows us to use very fast blind source separation techniques. This approach, however, cannot be applied to grayscale scans. This is a serious limitation, since many collections in our libraries and archives are now only available as grayscale scans or microfilms. We propose here a new model for bleed-through in grayscale document images, based on the availability of the recto and verso pages, and show that blind source separation can be successfully applied in this case too. Some experiments with real-ancient documents are presented and described.

Source: International journal on document analysis and recognition (Internet) 10 (2007): 17–25. doi:10.1007/s10032-006-0015-z

Publisher: Springer., Heidelberg, Germania


Metrics



Back to previous page
BibTeX entry
@article{oai:it.cnr:prodotti:68351,
	title = {Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique},
	author = {Tonazzini A. and Salerno E. and Bedini L.},
	publisher = {Springer., Heidelberg, Germania},
	doi = {10.1007/s10032-006-0015-z},
	journal = {International journal on document analysis and recognition (Internet)},
	volume = {10},
	pages = {17–25},
	year = {2007}
}