Explaining short text classification with diverse synthetic exemplars and counter-exemplars

2022

Journal article Open Access

Lampridis O., State L., Guidotti R., Ruggieri S.

Explainable AI, Short text classification, Synthetic exemplars, Counterfactuals, Model-agnostic explanation

We present XSPELLS, a model-agnostic local approach for explaining the decisions of black box models in the classification of short texts. The explanations consist of a set of exemplar sentences and a set of counter-exemplar sentences. The former are examples that the black box classifies with the same label as the text to explain. The latter are examples classified with a different label (a form of counterfactuals). Both are close in meaning to the text to explain, and both are meaningful sentences, albeit synthetically generated. XSPELLS generates neighbors of the text to explain in a latent space, using Variational Autoencoders to encode text and decode latent instances. A decision tree is learned from randomly generated neighbors and used to drive the selection of the exemplars and counter-exemplars. Moreover, diversity of counter-exemplars is modeled as an optimization problem, solved by a greedy algorithm with a theoretical guarantee. We report experiments on three datasets showing that XSPELLS outperforms the well-known LIME method in terms of quality of explanations, fidelity, diversity, and usefulness, and that it is comparable to LIME in terms of stability.
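
To make the selection step in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes the text has already been encoded into a latent vector, replaces the black box with a hypothetical toy labeling function, and omits the Variational Autoencoder and the decision tree entirely. The function names (`toy_black_box`, `sample_neighbors`, `greedy_diverse`), the trade-off weight `lam`, and the scoring rule are illustrative assumptions. The paper's actual objective and its theoretical guarantee are not reproduced here.

```python
import math
import random


def toy_black_box(z):
    """Hypothetical stand-in for the black box: label by the sign of the coordinate sum."""
    return 1 if sum(z) > 0 else 0


def sample_neighbors(z, n, sigma, rng):
    """Draw n Gaussian perturbations of the latent point z."""
    return [tuple(c + rng.gauss(0.0, sigma) for c in z) for _ in range(n)]


def greedy_diverse(candidates, query, k, lam=0.5):
    """Greedily pick up to k candidates that are close to the query yet far from each other.

    Each step adds the candidate maximizing
        (distance to the nearest already-selected point) - lam * (distance to the query).
    For the first pick the diversity term is 0, so the candidate closest to the query wins.
    """
    selected = []
    remaining = list(candidates)
    while remaining and len(selected) < k:
        def gain(c):
            spread = min((math.dist(c, s) for s in selected), default=0.0)
            return spread - lam * math.dist(c, query)

        best = max(remaining, key=gain)
        selected.append(best)
        remaining.remove(best)
    return selected


rng = random.Random(0)
z = (0.2, 0.1)  # latent encoding of the text to explain (assumed to be given)
neighbors = sample_neighbors(z, n=200, sigma=1.0, rng=rng)
label = toy_black_box(z)

# Exemplars share the black box label of z; counter-exemplars do not.
exemplars = [p for p in neighbors if toy_black_box(p) == label]
counter = [p for p in neighbors if toy_black_box(p) != label]

# Keep a small, diverse set of counter-exemplars near z.
diverse_counter = greedy_diverse(counter, z, k=3)
```

In a full pipeline, the selected latent points would be decoded back into sentences. Here they remain plain vectors, so the sketch only illustrates the split into exemplars and counter-exemplars and the greedy trade-off between proximity and diversity.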

Source: Machine Learning (2022). doi:10.1007/s10994-022-06150-7

Publisher: Kluwer Academic Publishers, Boston, U.S.A.

Cite as

BibTeX entry

@article{oai:it.cnr:prodotti:468789,
	title = {Explaining short text classification with diverse synthetic exemplars and counter-exemplars},
	author = {Lampridis, O. and State, L. and Guidotti, R. and Ruggieri, S.},
	publisher = {Kluwer Academic Publishers, Boston, U.S.A.},
	doi = {10.1007/s10994-022-06150-7},
	journal = {Machine Learning},
	year = {2022}
}

CNR authors and affiliations

CNR authors

Guidotti, Riccardo (ORCID: 0000-0002-2827-7613)
Ruggieri, Salvatore (ORCID: 0000-0002-1917-6087)

Laboratories

Knowledge Discovery and Data Mining (2002-ongoing)

DOI: 10.1007/s10994-022-06150-7

Also available from: link.springer.com

Projects (via OpenAIRE)

NoBIAS: Artificial Intelligence without Bias
SoBigData-PlusPlus (SoBigData++): European Integrated Infrastructure for Social Mining and Big Data Analytics