Ferragina P, Venturini R
Indexing data structure H.3 INFORMATION STORAGE AND RETRIEVAL Data compression Mathematics (miscellaneous) Compressed index Pattern matching Indexing String dictionary
In this paper we address few variants of the well-known prefix-search problem in strings, and provide solutions for the cache-oblivious model which improve the best known results. In particular we close (asymptotically) the classic problem which asks for searching the set of strings sharing the longest common prefix with a queried pattern. Indeed, we provide an I/O-optimal solution matching the space lower bound for tries up to a constant multiplicative factor $(1+epsilon)$, for $epsilon>0$. Our solutions hinge upon few technicalities and, more interestingly, on a novel compressed storage scheme which adds to the elegant Locality Preserving Front-Coding (Bender {em et al.} 2006) the ability to decompress prefixes of the stored strings I/O-optimally still preserving its space bounds.
@inproceedings{oai:it.cnr:prodotti:277747, title = {Compressed cache-oblivious string B-tree}, author = {Ferragina P and Venturini R}, doi = {10.1007/978-3-642-40450-4_40 and 10.1145/2903141}, year = {2013} }
Bibliographic record
Deposited version
Deposited version
10.1007/978-3-642-40450-4_40
10.1145/2903141