2022
Journal article  Open Access

Scikit-mobility: a python library for the analysis, generation, and risk assessment of mobility data

Pappalardo L., Simini F., Barlacchi G., Pellungrini R.

Data science  Data mining  Human mobility  Network science  Statistics  Privacy  Mobility analysis  Spatio-temporal analysis  Big data  Mathematical modeling  Statistics and Probability  Software  Probability and Uncertainty  Python  Migration models 

The last decade has witnessed the emergence of massive mobility datasets, such as tracks generated by GPS devices, call detail records, and geo-tagged posts from social media platforms. These datasets have fostered a vast scientific production on various applications of mobility analysis, ranging from computational epidemiology to urban planning and transportation engineering. A strand of literature addresses data cleaning issues related to raw spatiotemporal trajectories, while the second line of research focuses on discovering the statistical "laws" that govern human movements. A significant effort has also been put on designing algorithms to generate synthetic trajectories able to reproduce, realistically, the laws of human mobility. Last but not least, a line of research addresses the crucial problem of privacy, proposing techniques to perform the re-identification of individuals in a database. A view on state-of-the-art cannot avoid noticing that there is no statistical software that can support scientists and practitioners with all the aspects mentioned above of mobility data analysis. In this paper, we propose scikit-mobility, a Python library that has the ambition of providing an environment to reproduce existing research, analyze mobility data, and simulate human mobility habits. scikit-mobility is efficient and easy to use as it extends pandas, a popular Python library for data analysis. Moreover, scikit-mobility provides the user with many functionalities, from visualizing trajectories to generating synthetic data, from analyzing statistical patterns to assessing the privacy risk related to the analysis of mobility datasets.

Source: Journal of statistical software (2022). doi:10.18637/jss.v103.i04

Publisher: UCLA Statistics],, [Los Angeles, Calif. , Stati Uniti d'America


Metrics



Back to previous page
BibTeX entry
@article{oai:it.cnr:prodotti:471899,
	title = {Scikit-mobility: a python library for the analysis, generation, and risk assessment of mobility data},
	author = {Pappalardo L. and Simini F. and Barlacchi G. and Pellungrini R.},
	publisher = {UCLA Statistics],, [Los Angeles, Calif. , Stati Uniti d'America},
	doi = {10.18637/jss.v103.i04},
	journal = {Journal of statistical software},
	year = {2022}
}

Track and Know
Big Data for Mobility Tracking Knowledge Extraction in Urban Areas

SoBigData-PlusPlus
SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics


OpenAIRE