2022
Journal article  Open Access

ICS: total freedom in manual text classification supported by unobtrusive machine learning

Esuli A.

Electrical and Electronic Engineering  Automatic text classification  Active learning  Machine learning  General Computer Science  General Materials Science  Online learning  General Engineering 

We present the Interactive Classification System (ICS), a web-based application that supports the activity of manual text classification. The application uses machine learning to continuously fit automatic classification models that are in turn used to actively support its users with classification suggestions. The key requirement we have established for the development of ICS is to give its users total freedom of action: they can at any time modify any classification schema and any label assignment, possibly reusing any relevant information from previous activities. We investigate how this requirement challenges the typical scenarios faced in machine learning research, which instead give no active role to humans or place them into very constrained roles, e.g., on-demand labeling in active learning processes, and always assume some degree of batch processing of data. We satisfy the "total freedom" requirement by designing an unobtrusive machine learning model, i.e., the machine learning component of ICS as an unobtrusive observer of the users, that never interrupts them, continuously adapts and updates its models in response to their actions, and it is always available to perform automatic classifications. Our efficient implementation of the unobtrusive machine learning model combines various machine learning methods and technologies, such as hash-based feature mapping, random indexing, online learning, active learning, and asynchronous processing.

Source: IEEE access 10 (2022): 64741–64760. doi:10.1109/ACCESS.2022.3184009

Publisher: Institute of Electrical and Electronics Engineers, Piscataway, NJ, Stati Uniti d'America


Metrics



Back to previous page
BibTeX entry
@article{oai:it.cnr:prodotti:469105,
	title = {ICS: total freedom in manual text classification supported by unobtrusive machine learning},
	author = {Esuli A.},
	publisher = {Institute of Electrical and Electronics Engineers, Piscataway, NJ, Stati Uniti d'America},
	doi = {10.1109/access.2022.3184009},
	journal = {IEEE access},
	volume = {10},
	pages = {64741–64760},
	year = {2022}
}

AI4Media
A European Excellence Centre for Media, Society and Democracy

ARIADNEplus
Advanced Research Infrastructure for Archaeological Data Networking in Europe - plus

SoBigData-PlusPlus
SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics


OpenAIRE