2009
Conference article  Open Access

Active learning strategies for multi-label text classification

Esuli A., Sebastiani F.

Learning  Active learning  Information Storage and Retrieval  Classifier design and evaluation  Text classification 

Active learning refers to the task of devising a ranking function that, given a classifier trained from relatively few training examples, ranks a set of additional unlabeled examples in terms of how much further information they would carry, once manually labeled, for retraining a (hopefully) better classifier. Research on active learning in text classification has so far concentrated on single-label classification; active learning for multi-label classification, instead, has either been tackled in a simulated (and, we contend, non-realistic) way, or neglected tout court. In this paper we aim to fill this gap by examining a number of realistic strategies for tackling active learning for multi-label classification. Each such strategy consists of a rule for combining the outputs returned by the individual binary classifiers as a result of classifying a given unlabeled document. We present the results of extensive experiments in which we test these strategies on two standard text classification datasets.

Source: ECIR'09 - 31st European Conference on Information Retrieval, pp. 102–113, Toulouse, France, 7-9/04/2009


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:44251,
	title = {Active learning strategies for multi-label text classification},
	author = {Esuli A. and Sebastiani F.},
	doi = {10.1007/978-3-642-00958-7_12},
	booktitle = {ECIR'09 - 31st European Conference on Information Retrieval, pp. 102–113, Toulouse, France, 7-9/04/2009},
	year = {2009}
}