2024
Conference article  Open Access

From text to locations: repurposing language models for spatial trajectory similarity assessment

De Melo Wilken C. D., Cruz L. A., Lettich F., Coelho Da Silva T. L., Magalhães R. P.

Spatial Trajectory Similarity  Language Models  Trajectory Embeddings  Natural Language Processing 

The proliferation of electronic devices with geopositioning capabilities has significantly increased trajectory data generation, thus opening up novel opportunities in mobility analysis. Our work considers the problem of assessing spatial similarity between trajectories, and focus on deep learning-based approaches that discretize trajectories using a uniform grid to generate their embeddings. In this context, t2vec is the reference approach. Large Language Models (LLMs) show promise in capturing patterns in mobility data. In this paper, we investigate whether an LLM can be repurposed to generate high-quality trajectory embeddings for the considered task. Using two real-world trajectory datasets, we consider repurposing three language models: Word2Vec, Doc2Vec, and BERT. Our results show that BERT, trained on dense trajectory datasets, can generate high-quality embeddings, thus highlighting the potential of LLMs.


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:iris.cnr.it:20.500.14243/510984,
	title = {From text to locations: repurposing language models for spatial trajectory similarity assessment},
	author = {De Melo Wilken C.  D. and Cruz L.  A. and Lettich F. and Coelho Da Silva T.  L. and Magalhães R.  P.},
	doi = {10.5753/sbbd.2024.240212},
	year = {2024}
}

Spoke 1 ”Human-centered AI” of the M4C2 - Investimento 1.3, Partenariato Esteso PE00000013 - ”FAIR - Future Artificial Intelligence Research”
Spoke 1 ”Human-centered AI” of the M4C2 - Investimento 1.3, Partenariato Esteso PE00000013 - ”FAIR - Future Artificial Intelligence Research”