Grossi R., Ottaviano G.
Monotone perfect hash functions String dictionaries Succinct data structures FOS: Computer and information sciences Computer Science - Data Structures and Algorithms Theoretical Computer Science Tries Data Structures and Algorithms (cs.DS)
Tries are popular data structures for storing a set of strings, where common prefixes are represented by common root-to-node paths. More than 50 years of usage have produced many variants and implementations to overcome some of their limitations.We explore new succinct representations of path-decomposed tries and experimentally evaluate the corresponding reduction in space usage and memory latency, comparing with the state of the art.We study the following applications: compressed string dictionary and monotone minimal perfect hash for strings. In compressed string dictionary, we obtain data structures that outperform other state-of-the-art compressed dictionaries in space efficiency while obtaining predictable query times that are competitive with data structures preferred by the practitioners. On real-world datasets, our compressed tries obtain the smallest space (except for one case) and have the fastest lookup times, whereas access times are within 20% slower than the best-known solutions. In monotone minimal perfect hash for strings, our compressed tries perform several times faster than other trie-based monotone perfect hash functions while occupying nearly the same space. On real-world datasets, our tries are approximately 2 to 5 times faster than previous solutions, with a space occupancy less than 10% larger. Categories and Subject Descriptors: H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval; E.1 [Data Structures]: Trees General Terms: Algorithms, Experimentation, Performance.
Source: ACM journal of experimental algorithmics 19 (2014). doi:10.1145/2656332
Publisher: ACM, New York , Stati Uniti d'America
@article{oai:it.cnr:prodotti:303200, title = {Fast compressed tries through path decompositions}, author = {Grossi R. and Ottaviano G.}, publisher = {ACM, New York , Stati Uniti d'America}, doi = {10.1145/2656332 and 10.48550/arxiv.1111.5220}, journal = {ACM journal of experimental algorithmics}, volume = {19}, year = {2014} }