Frieder O., Mele I., Muntean C., Nardini F. M., Perego R., Tonellotto N.
Similarity search Caching Dense retrieval Conversational search
A method and system are described for improving the speed and efficiency of obtaining conversational search results. A user may speak a phrase to perform a conversational search or a series of phrases to perform a series of searches. These spoken phrases may be enriched by context and then converted into a query embedding. A similarity between the query embedding and document embeddings is used to determine the search results including a query cutoff number of documents and a cache cutoff number of documents. A second search phrase may use the cache of documents along with comparisons of the returned documents and the first query embedding to determine the quality of the cache for responding to the second search query. If the results are high-quality then the search may proceed much more rapidly by applying the second query only to the cached documents rather than to the server.
@misc{oai:iris.cnr.it:20.500.14243/504603, title = {Caching historical embeddings in conversational search}, author = {Frieder O. and Mele I. and Muntean C. and Nardini F. M. and Perego R. and Tonellotto N.}, year = {2024} }