2024
Conference article  Open Access

Early exit strategies for approximate k-NN search in dense retrieval

Busolin F., Lucchese C., Nardini F. M., Orlando S., Perego R., Trani S.

Neural IR  efficiency/effectiveness trade-offs  Information Retrieval (cs.IR)  neural IR  FOS: Computer and information sciences  Dense Retrieval  Computer Science - Information Retrieval  dense retrieval  Efficiency/Effectiveness Trade-offs 

Learned dense representations are a popular family of techniques for encoding queries and documents using high-dimensional embeddings, which enable retrieval by performing approximate k nearest-neighbors search (A-kNN). A popular technique for making A-kNN search efficient is based on a two-level index, where the embeddings of documents are clustered offline and, at query processing, a fixed number N of clusters closest to the query is visited exhaustively to compute the result set. In this paper, we build upon state-of-the-art for early exit A-kNN and propose an unsupervised method based on the notion of patience, which can reach competitive effectiveness with large efficiency gains. Moreover, we discuss a cascade approach where we first identify queries that find their nearest neighbor within the closest τ << N clusters, and then we decide how many more to visit based on our patience approach or other state-of-the-art strategies. Reproducible experiments employing state-of-the-art dense retrieval models and publicly available resources show that our techniques improve the A-kNN efficiency with up to 5× speedups while achieving negligible effectiveness losses. All the code used is available at https://github.com/francescobusolin/faiss_pEE

Publisher: ACM


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:iris.cnr.it:20.500.14243/516293,
	title = {Early exit strategies for approximate k-NN search in dense retrieval},
	author = {Busolin F. and Lucchese C. and Nardini F.  M. and Orlando S. and Perego R. and Trani S.},
	publisher = {ACM},
	doi = {10.1145/3627673.3679903 and 10.48550/arxiv.2408.04981},
	year = {2024}
}

EFRA
Extreme Food Risk Analytics


OpenAIRE