Document - Use of permutation prefixes for efficient and scalable approximate similarity search

2010

Other Restricted

Use of permutation prefixes for efficient and scalable approximate similarity search

Esuli A

Content Analysis and Indexing Information Search and Retrieval Approximate similarity search Metric spaces Scalability

We present the Permutation Prefix Index (PP-Index), an index data structure that allows to perform efficient approximate similarity search. The PP-Index belongs to the family of the permutation-based indexes, which are based on representing any indexed object with ``its view of the surrounding world'', i.e., a list of the elements of a set of reference objects sorted by their distance order with respect to the indexed object. In its basic formulation, the PP-Index is strongly biased toward efficiency. We show how the effectiveness can easily reach optimal levels just by adopting two ``boosting'' strategies: multiple index search and multiple query search, which both have nice parallelization properties. We study both the efficiency and the effectiveness properties of the PP-Index, experimenting with collections of sizes up to one hundred million objects, represented in a very high-dimensional similarity space.

Back to previous page

Cite as

BibTeX entry

@misc{oai:it.cnr:prodotti:161210,
	title = {Use of permutation prefixes for efficient and scalable approximate similarity search},
	author = {Esuli A},
	year = {2010}
}

CNR authors and affiliations

CNR authors

Esuli, Andrea
0000-0002-5725-4322

Laboratories

Networked Multimedia Information System (2002-2020)

Download

CNR IRIS

Bibliographic record
Deposited version

Use of permutation prefixes for efficient and scalable approximate similarity search

Share

Cite as

CNR authors and affiliations

Download