2016
Software (Metadata Only Access)

Java Text Categorization System

Esuli A, Fagni T, Moreo Fernandez A

Text categorization, Machine learning, Natural language processing, Learning

JaTeCS is an open-source Java library focused on Automatic Text Categorization (ATC). It covers all the steps of an experimental activity, from reading the corpus to evaluating the experimental results. JaTeCS treats text as the central input, and its code is optimized for this type of data. Like many other machine learning (ML) frameworks, it provides data readers for many formats and well-known corpora, NLP tools, feature selection and weighting methods, and implementations of many ML algorithms, as well as wrappers for well-known external software (e.g., libSVM, SVM_light). JaTeCS also implements ATC-related methods that are rarely, if ever, provided by other ML frameworks (e.g., active learning, quantification, transfer learning).



BibTeX entry
@misc{oai:it.cnr:prodotti:354436,
	title = {Java Text Categorization System},
	author = {Esuli A and Fagni T and Moreo Fernandez A},
	year = {2016}
}
CNR IRIS

Bibliographic record

Also available from

github.com (Metadata Only Access)