Debole F., Hanif M., Salerno E., Savino P., Tonazzini A.
Recto-verso registration Ancient manuscript restoration Bleed-through removal Blind source separation Sparse representation inpainting
Digitization of the documental heritage conserved in libraries and archives is a common practice, in order to ensure the preservation and fruition of this extended part of the human cultural and historical patrimony. For the most precious, fragile and difficult to read and decipher manuscripts, specialized though portable digitization equipment, such as high resolution multispectral/hyperspectral cameras, is nowadays available. Digitization made it possible the increasingly extensive use of digital image processing techniques, to perform a number of virtual restoration tasks, which constitute a first, often necessary step prior subsequent automatic analysis of the writing contents, with the ultimate goal to perform automatic transcription and/or natural language processing tasks. Here we report our experience in this field, referring, as a case study, to the problem of removing one of the most frequent and impairing degradation affecting many ancient manuscripts, i.e., the bleed-through distortion. In this case, virtual restoration gives also the immediate benefit to facilitate the work of philologists and paleographers interested in examining and transcribing the manuscript in a traditional way.
Source: CiST 2018 - IEEE 5th International Congress on Information Science and Technology, pp. 188–193, Marrakech, Marocco, 21-27 October 2018
Publisher: The Institute of Electrical and Electronics Engineers (IEEE), Piscataway, USA
@inproceedings{oai:it.cnr:prodotti:397565, title = {A first step towards NLP from digitized manuscripts: virtual restoration}, author = {Debole F. and Hanif M. and Salerno E. and Savino P. and Tonazzini A.}, publisher = {The Institute of Electrical and Electronics Engineers (IEEE), Piscataway, USA}, doi = {10.1109/cist.2018.8596494}, booktitle = {CiST 2018 - IEEE 5th International Congress on Information Science and Technology, pp. 188–193, Marrakech, Marocco, 21-27 October 2018}, year = {2018} }
Debole, Franca
0000-0002-0369-6045
Hanif, Muhammad
0000-0002-9236-5263
Salerno, Emanuele
0000-0002-3433-3634
Savino, Pasquale
0000-0002-8841-5440
Tonazzini, Anna
0000-0001-6970-4725
Networked Multimedia Information System (2002-2020)
Signals and Images (2002-ongoing)
Servizio Infrastruttura Informatica ISTI e Supporto ai Servizi (2018-ongoing)