Esuli A., Moreo Fernandez A. D., Sebastiani F.
Sentiment Classification Cross-Lingual Quantification Prevalence Estimation
We discuss Cross-Lingual Text Quantification (CLTQ), the task of performing text quantification (i.e., estimating the relative frequency pc(D) of all classes c?C in a set D of unlabelled documents) when training documents are available for a source language S but not for the target language T for which quantification needs to be performed. CLTQ has never been discussed before in the literature; we establish baseline results for the binary case by combining state-of-the-art quantification methods with methods capable of generating cross-lingual vectorial representations of the source and target documents involved. We present experimental results obtained on publicly available datasets for cross-lingual sentiment classification; the results show that the presented methods can perform CLTQ with a surprising level of accuracy.
Source: Research report, 2019
@techreport{oai:it.cnr:prodotti:415585, title = {Cross-Lingual Sentiment Quantification}, author = {Esuli A. and Moreo Fernandez A. D. and Sebastiani F.}, institution = {Research report, 2019}, year = {2019} }