Document - Distributional correspondence indexing for cross-language text categorization

2015

Conference article Restricted

Distributional correspondence indexing for cross-language text categorization

Esuli A, Fernandez Am

Cross-Language Text Categorization Distributional Semantics Sentiment Analysis

Cross-Language Text Categorization (CLTC) aims at producing a classifier for a target language when the only available training examples belong to a different source language. Existing CLTC methods are usually affected by high computational costs, require external linguistic resources, or demand a considerable human annotation effort. This paper presents a simple, yet effective, CLTC method based on projecting features from both source and target languages into a common vector space, by using a computationally lightweight distributional correspondence profile with respect to a small set of pivot terms. Experiments on a popular sentiment classification dataset show that our method performs favorably to state-of-the-art methods, requiring a significantly reduced computational cost and minimal human intervention.

Back to previous page

Cite as

BibTeX entry

@inproceedings{oai:it.cnr:prodotti:329758,
	title = {Distributional correspondence indexing for cross-language text categorization},
	author = {Esuli A and Fernandez Am},
	year = {2015}
}

CNR authors and affiliations

CNR authors

Esuli, Andrea
0000-0002-5725-4322
Moreo Fernandez, Alejandro David
0000-0002-0377-1025

Laboratories

Networked Multimedia Information System (2002-2020)

Download

CNR IRIS

Bibliographic record
Bibliographic record

Also available from

link.springer.com

Distributional correspondence indexing for cross-language text categorization

Share

Cite as

CNR authors and affiliations

Download