TITLE : Fine-grained visual textual alignment for cross-modal retrieval using transformer encoders AUTHOR(S) : Messina N, Amato G, Esuli A, Falchi F, Gennaro C, Marchandmaillet S TYPE : Journal article YEAR : 2021 CODE : 457546 *** DO NOT EDIT THIS FILE ***