Puccetti G., Esuli A.
Text classification, Machine Learning, Deep Learning
This report describes our contribution to the EVALITA 2023 shared task MULTI-Fake-DetectIVE, which involves the classification of news that includes textual and visual components. For this task we focus on textual data augmentation, extending the Italian text and the images available in the training set using machine translation and image captioning models. To train on different sets of input features, we use a different transformer encoder for each variant of text (Italian, English) and modality (image). For Task 1, among the models we test, we find that using the Italian text together with its translation improves model performance, while the captions do not provide any improvement. We also test the same architecture on Task 2, although in this case we achieve less satisfactory results.
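As a rough illustration of the setup described in the abstract, the sketch below builds one classifier per set of input features, with a separate encoder for each input. It is not the authors' implementation: the abstract does not say how the encoder outputs are combined, so concatenation followed by a linear layer is an assumption. The names `make_stub_encoder`, `FusionClassifier`, `build_variant` and `NUM_CLASSES` are hypothetical, the encoders are deterministic stand-ins rather than real transformers, and the image modality is represented only by a caption string.

```python
import random
from typing import Callable, Dict, List

Vector = List[float]
Encoder = Callable[[str], Vector]


def make_stub_encoder(dim: int, seed: int) -> Encoder:
    """Hypothetical stand-in for a transformer encoder.

    Maps a string to a deterministic pseudo-random vector so the sketch runs
    without model weights. A real system would return a pooled embedding.
    """
    def encode(text: str) -> Vector:
        rng = random.Random(f"{seed}:{text}")
        return [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    return encode


class FusionClassifier:
    """One encoder per input; embeddings are concatenated (an assumption)
    and passed through an untrained linear layer."""

    def __init__(self, encoders: Dict[str, Encoder], num_classes: int,
                 seed: int = 0) -> None:
        self.encoders = encoders
        self.names = sorted(encoders)  # fixed order for concatenation
        # Probe each encoder once to learn its output size.
        in_dim = sum(len(encoders[name]("")) for name in self.names)
        rng = random.Random(seed)
        self.weights = [[rng.gauss(0.0, 0.1) for _ in range(in_dim)]
                        for _ in range(num_classes)]
        self.bias = [0.0] * num_classes

    def features(self, inputs: Dict[str, str]) -> Vector:
        fused: Vector = []
        for name in self.names:
            fused.extend(self.encoders[name](inputs[name]))
        return fused

    def logits(self, inputs: Dict[str, str]) -> Vector:
        x = self.features(inputs)
        return [sum(w * v for w, v in zip(row, x)) + b
                for row, b in zip(self.weights, self.bias)]

    def predict(self, inputs: Dict[str, str]) -> int:
        scores = self.logits(inputs)
        return max(range(len(scores)), key=scores.__getitem__)


# One stub encoder per text variant and for the image caption.
ALL_ENCODERS: Dict[str, Encoder] = {
    "italian": make_stub_encoder(dim=8, seed=1),
    "english": make_stub_encoder(dim=8, seed=2),
    "caption": make_stub_encoder(dim=8, seed=3),
}


def build_variant(names: List[str], num_classes: int) -> FusionClassifier:
    """Build a classifier that uses only the selected input features."""
    return FusionClassifier({n: ALL_ENCODERS[n] for n in names}, num_classes)


NUM_CLASSES = 4  # placeholder value, not taken from the source

# Placeholder strings, not real task data.
example = {
    "italian": "testo della notizia",
    "english": "text of the news item",
    "caption": "a caption of the attached image",
}

for names in (["italian"], ["italian", "english"],
              ["italian", "english", "caption"]):
    model = build_variant(names, NUM_CLASSES)
    print(names, len(model.features(example)), model.predict(example))
```

Because the weights are untrained, the predictions carry no meaning. The point is only that each variant differs in which encoders feed the classifier, which mirrors the comparison between Italian only, Italian plus translation, and Italian plus translation plus captions.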
Source: EVALITA 2023 - Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian, Parma, Italy, 7-9/09/2023
@inproceedings{oai:it.cnr:prodotti:486589, title = {AIMH at MULTI-Fake-DetectIVE: system report}, author = {Puccetti G. and Esuli A.}, booktitle = {EVALITA 2023 - Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian, Parma, Italy, 7-9/09/2023}, year = {2023} }