2017
Conference article  Open Access

The impact of negative samples on learning to rank

Lucchese C., Nardini F. M., Perego R., Trani S.

learning systems  Document selection  Learning to rank  Negative samples  Reducing noise  Relevance ranking  Relevant query  Sampling strategies  State of the art 

Learning-to-Rank (LtR) techniques leverage machine learning algorithms and large amounts of training data to induce high-quality ranking functions. Given a set of documents and a user query, these functions are able to predict a score for each of the documents that is in turn exploited to induce a relevance ranking. The effectiveness of these learned functions has been proved to be significantly affected by the data used to learn them. Several analysis and document selection strategies have been proposed in the past to deal with this aspect. In this paper we review the state-of-the-art proposals and we report the results of a preliminary investigation of a new sampling strategy aimed at reducing the number of not relevant query-document pairs, so to significantly decrease the training time of the learning algorithm and to increase the final effectiveness of the model by reducing noise and redundancy in the training set.

Source: 1st International Workshop on LEARning Next GEneration Rankers, LEARNER 2017, Amsterdam, Netherlands, 1 October, 2017



Back to previous page
BibTeX entry
@inproceedings{oai:it.cnr:prodotti:424422,
	title = {The impact of negative samples on learning to rank},
	author = {Lucchese C. and Nardini F. M. and Perego R. and Trani S.},
	booktitle = {1st International Workshop on LEARning Next GEneration Rankers, LEARNER 2017, Amsterdam, Netherlands, 1 October, 2017},
	year = {2017}
}
CNR ExploRA

Bibliographic record

ISTI Repository

Published version Open Access

Also available from

ceur-ws.orgOpen Access