Document - PP-Index: using permutation prefixes for efficient and scalable approximate similarity search

2009

Conference article Unknown

PP-Index: using permutation prefixes for efficient and scalable approximate similarity search

Esuli A.

Information Search and Retrieval Approximate Similarity Search Access Methods

We present the Permutation Prefix Index (PP-Index), an index data structure that allows to perform efficient approximate similarity search. The PP-Index belongs to the family of the permutation-based indexes, which are based on representing any indexed object with "its view of the surrounding world", i.e., a list of the elements of a set of reference objects sorted by their distance order with respect to the indexed object. In its basic formulation, the PP-Index is strongly biased toward efficiency, treating effectiveness as a secondary aspect. We show how the effectiveness can easily reach optimal levels just by adopting two "boosting" strategies: multiple index search and multiple query search. Such strategies have nice parallelization properties that allow to distribute the search process in order to keep high efficiency levels. We study both the efficiency and the effectiveness properties of the PP-Index. We report experiments on collections of sizes up to one hundred million images, represented in a very high-dimensional similarity space based on the combination of ve MPEG-7 visual descriptors.

Source: 7th Workshop on Large-Scale Distributed Systems for Information Retrieval, pp. 17–24, Boston, USA, 23 luglio 2009

Publisher: M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen., Aachen, Germania

Back to previous page

Cite as

BibTeX entry

@inproceedings{oai:it.cnr:prodotti:91930,
	title = {PP-Index: using permutation prefixes for efficient and scalable approximate similarity search},
	author = {Esuli A.},
	publisher = {M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen., Aachen, Germania},
	booktitle = {7th Workshop on Large-Scale Distributed Systems for Information Retrieval, pp. 17–24, Boston, USA, 23 luglio 2009},
	year = {2009}
}

CNR authors and affiliations

CNR authors

Esuli, Andrea
0000-0002-5725-4322

Laboratories

Networked Multimedia Information System (2002-2020)

Download

CNR ExploRA

Bibliographic record

PP-Index: using permutation prefixes for efficient and scalable approximate similarity search

Share

Cite as

CNR authors and affiliations

Download