Document - The Istella22 dataset: bridging traditional and neural learning to rank evaluation

2022

Conference article Open Access

The Istella22 dataset: bridging traditional and neural learning to rank evaluation

Dato D., Macavaney S., Nardini F. M., Perego R., Tonellotto N.

Learning to rank Neural models

Neural approaches that use pre-trained language models are effective at various ranking tasks, such as question answering and ad-hoc document ranking. However, their effectiveness compared to feature-based Learning-to-Rank (LtR) methods has not yet been well-established. A major reason for this is because present LtR benchmarks that contain query-document feature vectors do not contain the raw query and document text needed for neural models. On the other hand, the benchmarks often used for evaluating neural models, e.g., MS MARCO, TREC Robust, etc., provide text but do not provide query-document feature vectors. In this paper, we present Istella22, a new dataset that enables such comparisons by providing both query/document text and strong query-document feature vectors used by an industrial search engine. The dataset consists of a comprehensive corpus of 8.4M web documents, a collection of query-document pairs including 220 hand-crafted features, relevance judgments on a 5-graded scale, and a set of 2,198 textual queries used for testing purposes. Istella22 enables a fair evaluation of traditional learning-to-rank and transfer ranking techniques on the same data. LtR models exploit the feature-based representations of training samples while pre-trained transformer-based neural rankers can be evaluated on the corresponding textual content of queries and documents. Through preliminary experiments on Istella22, we find that neural re-ranking approaches lag behind LtR models in terms of effectiveness. However, LtR models identify the scores from neural models as strong signals.

Source: SIGIR '22 - 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3099–3107, Madrid, Spain, 11-15/07/2022

Publisher: ACM Press, New York, USA

Metrics

Back to previous page

Cite as

BibTeX entry

@inproceedings{oai:it.cnr:prodotti:469573,
	title = {The Istella22 dataset: bridging traditional and neural learning to rank evaluation},
	author = {Dato D. and Macavaney S. and Nardini F. M. and Perego R. and Tonellotto N.},
	publisher = {ACM Press, New York, USA},
	doi = {10.1145/3477495.3531740},
	booktitle = {SIGIR '22 - 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3099–3107, Madrid, Spain, 11-15/07/2022},
	year = {2022}
}