2022
Conference article  Open Access

A spatio-temporal attentive network for video-based crowd counting

Avvenuti M., Bongiovanni M., Ciampi L., Falchi F., Gennaro C., Messina N.

Crowd counting  Deep learning  Visual counting  Smart cities 

Automatic people counting from images has recently drawn attention for urban monitoring in modern Smart Cities due to the ubiquity of surveillance camera networks. Current computer vision techniques rely on deep learning-based algorithms that estimate pedestrian densities in still, individual images. Only a bunch of works take advantage of temporal consistency in video sequences. In this work, we propose a spatio-temporal attentive neural network to estimate the number of pedestrians from surveillance videos. By taking advantage of the temporal correlation between consecutive frames, we lowered state-of-the-art count error by 5% and localization error by 7.5% on the widely-used FDST benchmark.

Source: ISCC 2022 - 27th IEEE Symposium on Computers and Communications, Rhodes Island, Greece, 30/06/2022-03/07/2022


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:470912,
	title = {A spatio-temporal attentive network for video-based crowd counting},
	author = {Avvenuti M. and Bongiovanni M. and Ciampi L. and Falchi F. and Gennaro C. and Messina N.},
	doi = {10.1109/iscc55528.2022.9913019},
	booktitle = {ISCC 2022 - 27th IEEE Symposium on Computers and Communications, Rhodes Island, Greece, 30/06/2022-03/07/2022},
	year = {2022}
}

AI4Media
A European Excellence Centre for Media, Society and Democracy


OpenAIRE