2001
Report  Unknown

Document image retrieval without OCRing using a video scanning system

Kuruoglu E. E., Vern Tan T.

Image similarity retrieval  Index generation  Multimedia information systems  Document capture (scanning  document analysis) 

We propose a technique for efficient document retrieval from digital libraries containing document images which are compressed with token based compression. The technique we propose uses the layout information supplied by the relative positions of the character tokens on the page of a 'query' paper document to retrieve the original document in the image database. The query image is captured from a paper document by a multimedia system composed of a PC and a video scanning tool. This technique avoids OCRing the query document and the documents in the database; moreover avoidsdecompressing the documents in the database compressed with token based compression, therefore achieving important time and computational gains. The technique provides one with the capability of retrieving the original document stored in a digital library using part of a previously produced paper copy

Source: ISTI Technical reports, pp.1–9, 2001



Back to previous page
BibTeX entry
@techreport{oai:it.cnr:prodotti:160467,
	title = {Document image retrieval without OCRing using a video scanning system},
	author = {Kuruoglu E. E. and Vern Tan T.},
	institution = {ISTI Technical reports, pp.1–9, 2001},
	year = {2001}
}