Kuruoglu E. E., Vern Tan T.
Image similarity retrieval Index generation Multimedia information systems Document capture (scanning document analysis)
We propose a technique for efficient document retrieval from digital libraries containing document images which are compressed with token based compression. The technique we propose uses the layout information supplied by the relative positions of the character tokens on the page of a 'query' paper document to retrieve the original document in the image database. The query image is captured from a paper document by a multimedia system composed of a PC and a video scanning tool. This technique avoids OCRing the query document and the documents in the database; moreover avoidsdecompressing the documents in the database compressed with token based compression, therefore achieving important time and computational gains. The technique provides one with the capability of retrieving the original document stored in a digital library using part of a previously produced paper copy
Source: ISTI Technical reports, pp.1–9, 2001
@techreport{oai:it.cnr:prodotti:160467, title = {Document image retrieval without OCRing using a video scanning system}, author = {Kuruoglu E. E. and Vern Tan T.}, institution = {ISTI Technical reports, pp.1–9, 2001}, year = {2001} }