2017
Conference article  Restricted

Detecting domain-specific ambiguities: An NLP approach based on wikipedia crawling and word embeddings

Ferrari A., Donati B., Gnesi S.

pragmatic ambiguity  requirements engineering  ambiguity  NLP  natural language processing  natural language 

In the software process, unresolved natural language (NL) ambiguities in the early requirements phases may cause problems in later stages of development. Although methods exist to detect domain-independent ambiguities, ambiguities are also influenced by the domain-specific background of the stakeholders involved in the requirements process. In this paper, we aim to estimate the degree of ambiguity of typical computer science words (e.g., system, database, interface) when used in different application domains. To this end, we apply a natural language processing (NLP) approach based on Wikipedia crawling and word embeddings, a novel technique to represent the meaning of words through compact numerical vectors. Our preliminary experiments, performed on five different domains, show promising results. The approach allows an estimate of the variation of meaning of the computer science words when used in different domains. Further validation of the method will indicate the words that need to be carefully defined in advance by the requirements analyst to avoid misunderstandings when editing documents and dealing with experts in the considered domains.

Source: Requirements Engineering Conference Workshops (REW), 2017 IEEE 25th International, pp. 393–399, 04/


Metrics



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:382471,
	title = {Detecting domain-specific ambiguities: An NLP approach based on wikipedia crawling and word embeddings},
	author = {Ferrari A. and Donati B. and Gnesi S.},
	doi = {10.1109/rew.2017.20},
	booktitle = {Requirements Engineering Conference Workshops (REW), 2017 IEEE 25th International, pp. 393–399, 04/},
	year = {2017}
}