2005
Journal article  Restricted

Similarity Grid for searching in metric spaces

Batko M, Gennaro C, Zezula P

H.3.3 Information Search and Retrieval  H.3.4 Systems and Software 

Similarity search in metric spaces represents an important paradigm for content-based retrieval of many applications. Existing centralized search structures can speed-up retrieval, but they do not scale up to large volume of data because the response time is linearly increasing with the size of the searched file. The proposed GHT* index is a scalable and distributed structure. By exploiting parallelism in a dynamic network of computers, the GHT* achieves practically constant search time for similarity range queries in data-sets of arbitrary size. The structure also scales well with respect to the growing volume of retrieved data. Moreover, a small amount of replicated routing information on each server increases logarithmically. At the same time, the potential for interquery parallelism is increasing with the growing data-sets because the relative number of servers utilized by individual queries is decreasing. All these properties are verified by experiments on a prototype system using real-life data-sets.


Metrics



Back to previous page
BibTeX entry
@article{oai:it.cnr:prodotti:44130,
	title = {Similarity Grid for searching in metric spaces},
	author = {Batko M and Gennaro C and Zezula P},
	doi = {10.1007/11549819_3},
	year = {2005}
}