2023
Journal article  Open Access

Trajectory test-train overlap in next-location prediction datasets

Luca M., Pappalardo L., Lepri B., Barlacchi G.

Machine learning  Human mobility 

Next-location prediction, consisting of forecasting a user's location given their historical trajectories, has important implications in several fields, such as urban planning, geo-marketing, and disease spreading. Several predictors have been proposed in the last few years to address it, including last-generation ones based on deep learning. This paper tests the generalization capability of these predictors on public mobility datasets, stratifying the datasets by whether the trajectories in the test set also appear fully or partially in the training set. We consistently discover a severe problem of trajectory overlapping in all analyzed datasets, highlighting that predictors memorize trajectories while having limited generalization capacities. We thus propose a methodology to rerank the outputs of the next-location predictors based on spatial mobility patterns. With these techniques, we significantly improve the predictors' generalization capability, with a relative improvement in the accuracy up to 96.15% on the trajectories that cannot be memorized (i.e., low overlap with the training set).

Source: Machine learning (2023). doi:10.1007/s10994-023-06386-x

Publisher: Kluwer Academic Publishers,, Boston/U.S.A. , Stati Uniti d'America


Metrics



Back to previous page
BibTeX entry
@article{oai:it.cnr:prodotti:486587,
	title = {Trajectory test-train overlap in next-location prediction datasets},
	author = {Luca M. and Pappalardo L. and Lepri B. and Barlacchi G.},
	publisher = {Kluwer Academic Publishers,, Boston/U.S.A. , Stati Uniti d'America},
	doi = {10.1007/s10994-023-06386-x},
	journal = {Machine learning},
	year = {2023}
}

SoBigData-PlusPlus
SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics


OpenAIRE