2019
Conference article  Open Access

An Open Science System for Text Mining

Coro G., Panichi G., Pagano P.

Text mining, e-Infrastructures, Named Entity Recognition, Natural Language Processing, Cloud Computing

Text mining (TM) techniques can extract high-quality information from big data through complex system architectures. However, these techniques are usually difficult to discover, install, and combine. Further, modern approaches to science (e.g. Open Science) introduce new requirements to guarantee the reproducibility, repeatability, and re-usability of methods and results, as well as their longevity and sustainability. In this paper, we present a distributed system (NLPHub) that publishes and combines several state-of-the-art text mining services for named entity, event, and keyword recognition. NLPHub makes the integrated methods compliant with Open Science requirements and manages heterogeneous access policies to the methods. We assess the benefits and the performance of NLPHub on the I-CAB corpus.

Source: CLiC-it 2019 Italian Conference on Computational Linguistics, pp. 1–7, Bari, Italy, 13-15/11/2019



BibTeX entry
@inproceedings{oai:it.cnr:prodotti:407862,
	title = {An Open Science System for Text Mining},
	author = {Coro G. and Panichi G. and Pagano P.},
	booktitle = {CLiC-it 2019 Italian Conference on Computational Linguistics},
	pages = {1--7},
	address = {Bari, Italy},
	note = {13-15 November 2019},
	year = {2019}
}
CNR ExploRA

Bibliographic record

ISTI Repository

Published version Open Access

Also available from

disi.unitn.it (Open Access)

PARTHENOS
Pooling Activities, Resources and Tools for Heritage E-research Networking, Optimization and Synergies


OpenAIRE