2020
Conference article  Closed Access

Re-implementing and Extending Relation Network for R-CBIR

Messina N., Amato G., Falchi F.

Image retrieval  Visual features  Relation Network  Deep Learning 

Relational reasoning is an emerging theme in Machine Learning in general and in Computer Vision in particular. Deep Mind has recently proposed a module called Relation Network (RN) that has shown impressive results on visual question answering tasks. Unfortunately, the implementation of the proposed approach was not public. To reproduce their experiments and extend their approach in the context of Information Retrieval, we had to re-implement everything, testing many parameters and conducting many experiments. Our implementation is now public on GitHub and it is already used by a large community of researchers. Furthermore, we recently presented a variant of the relation network module that we called Aggregated Visual Features RN (AVF-RN). This network can produce and aggregate at inference time compact visual relationship-aware features for the Relational-CBIR (R-CBIR) task. R-CBIR consists in retrieving images with given relationships among objects. In this paper, we discuss the details of our Relation Network implementation and more experimental results than the original paper. Relational reasoning is a very promising topic for better understanding and retrieving inter-object relationships, especially in digital libraries.

Source: 16th Italian Research Conference on Digital Libraries, IRCDL 2020, pp. 82–92, Bari, Italy, 30-31/01/2020


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:424547,
	title = {Re-implementing and Extending Relation Network for R-CBIR},
	author = {Messina N. and Amato G. and Falchi F.},
	doi = {10.1007/978-3-030-39905-4_9},
	booktitle = {16th Italian Research Conference on Digital Libraries, IRCDL 2020, pp. 82–92, Bari, Italy, 30-31/01/2020},
	year = {2020}
}