Esuli A.
Electrical and Electronic Engineering Automatic text classification Active learning Machine learning General Computer Science General Materials Science Online learning General Engineering
We present the Interactive Classification System (ICS), a web-based application that supports the activity of manual text classification. The application uses machine learning to continuously fit automatic classification models that are in turn used to actively support its users with classification suggestions. The key requirement we have established for the development of ICS is to give its users total freedom of action: they can at any time modify any classification schema and any label assignment, possibly reusing any relevant information from previous activities. We investigate how this requirement challenges the typical scenarios faced in machine learning research, which instead give no active role to humans or place them into very constrained roles, e.g., on-demand labeling in active learning processes, and always assume some degree of batch processing of data. We satisfy the "total freedom" requirement by designing an unobtrusive machine learning model, i.e., the machine learning component of ICS as an unobtrusive observer of the users, that never interrupts them, continuously adapts and updates its models in response to their actions, and it is always available to perform automatic classifications. Our efficient implementation of the unobtrusive machine learning model combines various machine learning methods and technologies, such as hash-based feature mapping, random indexing, online learning, active learning, and asynchronous processing.
Source: IEEE access 10 (2022): 64741–64760. doi:10.1109/ACCESS.2022.3184009
Publisher: Institute of Electrical and Electronics Engineers, Piscataway, NJ, Stati Uniti d'America
@article{oai:it.cnr:prodotti:469105, title = {ICS: total freedom in manual text classification supported by unobtrusive machine learning}, author = {Esuli A.}, publisher = {Institute of Electrical and Electronics Engineers, Piscataway, NJ, Stati Uniti d'America}, doi = {10.1109/access.2022.3184009}, journal = {IEEE access}, volume = {10}, pages = {64741–64760}, year = {2022} }
AI4Media
A European Excellence Centre for Media, Society and Democracy
ARIADNEplus
Advanced Research Infrastructure for Archaeological Data Networking in Europe - plus
SoBigData-PlusPlus
SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics