Ciampi L., Messina N., Pierucci M., Amato G., Avvenuti M., Falchi F.
Computer Vision and Pattern Recognition (cs.CV) FOS: Computer and information sciences Benchmark Computer Vision Prompt-Based Counting Class-Agnostic Computer Science - Computer Vision and Pattern Recognition
Recently, object counting has shifted towards classagnostic counting (CAC), which counts instances of arbitrary object classes never seen during model training. With advancements in robust vision-and-language foundation models, there is a growing interest in prompt-based CAC, where object categories are specified using natural language. However, we identify significant limitations in current benchmarks for evaluating this task, which hinder both accurate assessment and the development of more effective solutions. Specifically, we argue that the current evaluation protocols do not measure the ability of the model to understand which object has to be counted. This is due to two main factors: (i) the shortcomings of CAC datasets, which primarily consist of images containing objects from a single class, and (ii) the limitations of current counting performance evaluators, which are based on traditional class-specific counting and focus solely on counting errors. To fill this gap, we introduce the Prompt-Aware Counting (PrACo) benchmark. It comprises two targeted tests coupled with evaluation metrics specifically designed to quantitatively measure the robustness and trustworthiness of existing prompt-based CAC models. We evaluate state-of-the-art methods and demonstrate that, although some achieve impressive results on standard class-specific counting metrics, they exhibit a significant deficiency in understanding the input prompt, indicating the need for more careful training procedures or revised designs. The code for reproducing our results is available at https://github.com/ciampluca/PrACo.
Publisher: Institute of Electrical and Electronics Engineers Inc.
@inproceedings{oai:iris.cnr.it:20.500.14243/552089,
title = {Mind the prompt: a novel benchmark for prompt-based class-agnostic counting},
author = {Ciampi L. and Messina N. and Pierucci M. and Amato G. and Avvenuti M. and Falchi F.},
publisher = {Institute of Electrical and Electronics Engineers Inc.},
doi = {10.1109/wacv61041.2025.00774 and 10.48550/arxiv.2409.15953},
year = {2025}
}