2020
Journal article  Open Access

Cross-Lingual Sentiment Quantification

Esuli A., Moreo A., Sebastiani F.

Computer Science - Machine Learning  Machine Learning (stat.ML)  Statistics - Machine Learning  quantification  sentiment analysis  Artificial Intelligence  Computer Networks and Communications  cross-lingual  Information Retrieval (cs.IR)  quanet  FOS: Computer and information sciences  Computer Science - Information Retrieval  Machine Learning (cs.LG) 

Sentiment Quantification is the task of estimating the relative frequency of sentiment-related classes-such as Positive and Negative-in a set of unlabeled documents. It is an important topic in sentiment analysis, as the study of sentiment-related quantities and trends across a population is often of higher interest than the analysis of individual instances. In this article, we propose a method for cross-lingual sentiment quantification, the task of performing sentiment quantification when training documents are available for a source language S, but not for the target language T, for which sentiment quantification needs to be performed. Cross-lingual sentiment quantification (and cross-lingual text quantification in general) has never been discussed before in the literature; we establish baseline results for the binary case by combining state-of-the-art quantification methods with methods capable of generating cross-lingual vectorial representations of the source and target documents involved. Experiments on publicly available datasets for crosslingual sentiment classification show that the presented method performs cross-lingual sentiment quantification with high accuracy.

Source: IEEE intelligent systems 35 (2020): 106–113. doi:10.1109/MIS.2020.2979203

Publisher: IEEE Computer Society,, Los Alamitos, CA , Stati Uniti d'America


Antonio Bella, Ce`sar Ferri, Jose´ Herna´ndez-Orallo, and Mar´ıa Jose´ Ram´ırez-Quintana. 2010. Quantification via Probability Estimators. In Proceedings of the 11th IEEE International Conference on Data Mining (ICDM 2010). Sydney, AU, 737-742. https://doi.org/10.1109/icdm.2010.75
John Blitzer, Mark Dredze, and Fernando Pereira. 2007. Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007). Prague, CZ, 440-447.
Andrea Esuli, Alejandro Moreo, and Fabrizio Sebastiani. 2018. A Recurrent Neural Network for Sentiment Quantification. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM 2018). Torino, IT, 1775-1778. https://doi.org/10.1145/3269206.3269287
George Forman. 2008. Quantifying counts and costs via classification. Data Mining and Knowledge Discovery 17, 2 (2008), 164-206. https://doi.org/10.1007/s10618-008-0097-y
Pablo Gonza´lez, Alberto Castan˜o, Nitesh V. Chawla, and Juan Jose´ del Coz. 2017. A Review on Quantification Learning. Comput. Surveys 50, 5 (2017), 74:1-74:40. https://doi.org/10.1145/3117807
Alejandro Moreo, Andrea Esuli, and Fabrizio Sebastiani. 2016. Distributional Correspondence Indexing for CrossLingual and Cross-Domain Sentiment Classification. Journal of Artificial Intelligence Research 55 (2016), 131-163. https://doi.org/10.1613/jair.4762
Alejandro Moreo, Andrea Esuli, and Fabrizio Sebastiani. 2018. Revisiting Distributional Correspondence Indexing: A Python Reimplementation and New Experiments. arXiv:1810.09311 [cs.CL].
Peter Prettenhofer and Benno Stein. 2011. Cross-lingual adaptation using structural correspondence learning. ACM Transactions on Intelligent Systems and Technology 3, 1 (2011), Article 13. https://doi.org/10.1145/2036264.2036277
Fabrizio Sebastiani. 2018. arXiv:1809.01991.
Evaluation Measures for Quantification: An Axiomatic Approach. (2018).
Hui Zou and Trevor Hastie. 2005. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society, Series B 67, 2 (2005), 301-320. https://doi.org/10.1111/j.1467-9868.2005.00503.x

Metrics



Back to previous page
BibTeX entry
@article{oai:it.cnr:prodotti:438786,
	title = {Cross-Lingual Sentiment Quantification},
	author = {Esuli A. and Moreo A. and Sebastiani F.},
	publisher = {IEEE Computer Society,, Los Alamitos, CA , Stati Uniti d'America},
	doi = {10.1109/mis.2020.2979203 and 10.48550/arxiv.1904.07965},
	journal = {IEEE intelligent systems},
	volume = {35},
	pages = {106–113},
	year = {2020}
}

SoBigData-PlusPlus
SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics


OpenAIRE