139 result(s)
Page Size: 10, 20, 50
Export: bibtex, xml, json, csv
Order by:

CNR Author operator: and / or
more
Typology operator: and / or
Language operator: and / or
Date operator: and / or
more
Rights operator: and / or
2012 Contribution to conference Open Access OPEN
Studying search shortcuts in a query log to improve retrieval over query sessions
Albakour M., Nardini F. M., Adeyanju I., Kruschwitz U.
In this paper, we study a state-of-the-art model to derive query suggestions from the search logs of a Web search engine in the context of its application for the session retrieval problem. Using the session track 2011 as an evaluation platform and the MSN 2006 logs, we analyze the characteristics of this model and how it can be optimized to better represent user information needs throughout the session.Source: International Workshop on Information Retrieval Over Query Sessions (co-located with the 34th European Conference on Information Retrieval (ECIR 2012)), Barcellona, Spagna, 1-5 April 2012
Project(s): S-CUBE via OpenAIRE

See at: ISTI Repository Open Access | CNR ExploRA


2012 Journal article Open Access OPEN
RICH: Research and Innovation for Cultural Heritage
Barone V., Licari D., Nardini F. M.
This paper describes RICH: a new architecture conceived and developed at the Scuola Normale Superiore, for collecting, promoting and sharing cultural heritage data. Starting with the observation that cultural heritage is a cross-cutting field of research where needs are often poorly integrated with each other, a new architecture is required, aimed at solving this integration issue. RICH provides a first step in this direction by addressing several needs in this field through state-of-the-art technologies such as 3D vision and virtualization. The paper outlines the principal building blocks of the architecture by planning and explaining each step in terms of functionality. We also report some preliminary experiences that are being carried out using this architecture.Source: Conservation Science in Cultural Heritage (Online) 12 (2012): 109–133. doi:10.6092/issn.1973-9494/3385
DOI: 10.6092/issn.1973-9494/3385
Metrics:


See at: Conservation Science in Cultural Heritage Open Access | conservation-science.unibo.it Restricted | CNR ExploRA


2012 Conference article Restricted
RGU-ISTI-Essex at TREC 2011 session track
Adeyanju I., Nardini F. M., Albakour M., Song D., Kruschwitz U.
Mining query recommendation from query logs has attracted a lot of attention in recent years. We propose to use query recommenda- tions extracted from the logs of a web search engine to solve the session track tasks. The runs are obtained by using the Search Shortcuts recom- mender system. The Search Shortcuts technique uses an inverted index and the concept of " successful sessions" present in a web search engine's query log to produce effective recommendations for both frequent and rare/unseen queries. We adapt the above technique as a query expan- sion tool and use it to expand the given queries for Session Track at TREC 2011. The expansion is generated by using a method which aims to consider all past queries in the session. The expansion terms obtained are then used to build a global, uniformly weighted, representation of the user session (RL2). Furthermore, the expansion terms are then combined with a ranked list of results in order to boost terms appearing more fre- quently in the final results lists (RL3). Finally, we also integrate dwell times and the weighting method obtained taking both result lists and clicks into account for assigning weights to the terms to expand the final query of the session. In addition to that, we submitted a baseline run. It is based on the observation that using the term " wikipedia" to expand the query resulted in a better retrieval performance for the tasks at last year's session track at TREC 2010.Source: International Text REtrieval Conference, pp. 1–12, Gaithersburg, Md. USA, 15-18 November 2011

See at: trec.nist.gov Restricted | CNR ExploRA


2016 Contribution to conference Open Access OPEN
Proceedings of the 7th Italian Information Retrieval Workshop
Di Nunzio G., Nardini F. M., Orlando S.
This volume contains the papers presented at IIR 2016 (the 7th Italian Infor- mation Retrieval Workshop) held on 30-31 May, 2016, in Venice, Italy. The purpose of the Italian Information Retrieval Workshop (IIR) is to provide a forum for stimulating and disseminating research in Information Retrieval, where Italian researchers (especially young ones) and researchers affiliated with Italian institutions can network and discuss their research results in an informal way. The general chair of IIR 2016 was Salvatore Orlando (Universit`a Ca' Fos- cari Venezia), and the co-chairs of the program committee were Giorgio Maria Di Nunzio (Universit`a di Padova) and Franco Maria Nardini (ISTI-CNR, Pisa). For this edition, the program committee accepted 26 papers for oral presen- tation and the final program also included four invited talks by Stefano Ceri (Politecnico di Milano), Alessandro Moschitti (Qatar Computing Research In- stitute, HBKU), Giorgio Satta (Universit`a di Padova), and Rossano Venturini (Universit`a di Pisa). The program co-chairs wish to thank all the members of the Program Com- mittee (see below) as well as the local organizers Francesco Lettich (Universit`a Ca' Foscari Venezia) and Claudio Silvestri (Universit`a Ca' Foscari Venezia), the webmaster Salvatore Trani (ISTI-CNR, Pisa), and the publicity chair, Cristina Muntean, (ISTI-CNR, Pisa).

See at: ceur-ws.org Open Access | CNR ExploRA


2019 Conference article Open Access OPEN
Learning to Rank in Theory and Practice: From Gradient Boosting to Neural Networks and Unbiased Learning
Lucchese C., Nardini F. M., Pasumarthi R. K., Bruch S., Bendersky M., Wang X., Oosterhuis H., Jagerman R., De Rijke M.
This tutorial aims to weave together diverse strands of modern Learning to Rank (LtR) research, and present them in a unified full-day tutorial. First, we will introduce the fundamentals of LtR, and an overview of its various sub-fields. Then, we will discuss some recent advances in gradient boosting methods such as LambdaMART by focusing on their efficiency/effectiveness trade-offs and optimizations. Subsequently, we will then present TF-Ranking, a new open source TensorFlow package for neural LtR models, and how it can be used for modeling sparse textual features. Finally, we will conclude the tutorial by covering unbiased LtR -- a new research field aiming at learning from biased implicit user feedback. The tutorial will consist of three two-hour sessions, each focusing on one of the topics described above. It will provide a mix of theoretical and hands-on sessions, and should benefit both academics interested in learning more about the current state-of-the-art in LtR, as well as practitioners who want to use LtR techniques in their applications.Source: 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1419–1420, Parigi, Francia, 21/07/2019, 25/07/2019
DOI: 10.1145/3331184.3334824
Metrics:


See at: ISTI Repository Open Access | NARCIS Restricted | dl.acm.org Restricted | doi.org Restricted | NARCIS Restricted | CNR ExploRA


2011 Doctoral thesis Open Access OPEN
Query Log Mining to Enhance User Experience in Search Engines
Nardini F. M.
The Web is the biggest repository of documents humans have ever built. Even more, it is increasingly growing in size every day. Users rely on Web search engines (WSEs) for finding information on the Web. By submitting a textual query expressing their information need, WSE users obtain a list of documents that are highly relevant to the query. Moreover, WSEs tend to store such huge amount of users activities in query logs. Query log mining is the set of techniques aiming at extracting valuable knowledge from query logs. This knowledge represents one of the most used ways of enhancing the users search experience. According to this vision, in this thesis we firstly prove that the knowledge extracted from query logs suffer aging effects and we thus propose a solution to this phenomenon. Secondly, we propose new algorithms for query recommendation that overcome the aging problem. Moreover, we study new query recommendation techniques for efficiently producing recommendations for rare queries. Finally, we study the problem of diversifying Web search engine results. We define a methodology based on the knowledge derived from query logs for detecting when and how query results need to be diversified and we develop an efficient algorithm for diversifying search results.

See at: ISTI Repository Open Access | CNR ExploRA


2020 Conference article Open Access OPEN
Predicting and explaining privacy risk exposure in mobility data
Naretto F., Pellungrini R., Monreale A., Nardini F. M., Musolesi M.
Mobility data is a proxy of different social dynamics and its analysis enables a wide range of user services. Unfortunately, mobility data are very sensitive because the sharing of people's whereabouts may arise serious privacy concerns. Existing frameworks for privacy risk assessment provide tools to identify and measure privacy risks, but they often (i) have high computational complexity; and (ii) are not able to provide users with a justification of the reported risks. In this paper, we propose expert, a new framework for the prediction and explanation of privacy risk on mobility data. We empirically evaluate privacy risk on real data, simulating a privacy attack with a state-of-the-art privacy risk assessment framework. We then extract individual mobility profiles from the data for predicting their risk. We compare the performance of several machine learning algorithms in order to identify the best approach for our task. Finally, we show how it is possible to explain privacy risk prediction on real data, using two algorithms: Shap, a feature importance-based method and Lore, a rule-based method. Overall, expert is able to provide a user with the privacy risk and an explanation of the risk itself. The experiments show excellent performance for the prediction task.Source: DS 2020 - International Conference on Discovery Science, pp. 403–418, Thessaloniki, Greece, October 19-21, 2020
DOI: 10.1007/978-3-030-61527-7_27
Project(s): XAI via OpenAIRE, SoBigData-PlusPlus via OpenAIRE
Metrics:


See at: discovery.ucl.ac.uk Open Access | ISTI Repository Open Access | Lecture Notes in Computer Science Restricted | Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna Restricted | link.springer.com Restricted | CNR ExploRA


2022 Conference article Open Access OPEN
ReNeuIR: Reaching Efficiency in Neural Information Retrieval
Bruch S., Lucchese C., Nardini F. M.
Perhaps the applied nature of information retrieval research goes some way to explain the community's rich history of evaluating machine learning models holistically, understanding that efficacy matters but so does the computational cost incurred to achieve it. This is evidenced, for example, by more than a decade of research on efficient training and inference of large decision forest models in learning-to-rank. As the community adopts even more complex, neural network-based models in a wide range of applications, questions on efficiency have once again become relevant. We propose this workshop as a forum for a critical discussion of efficiency in the era of neural information retrieval, to encourage debate on the current state and future directions of research in this space, and to promote more sustainable research by identifying best practices in the development and evaluation of neural models for information retrieval.Source: SIGIR 2022 - The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3462–3465, Madrid, Spain, 11-15/07/2022
DOI: 10.1145/3477495.3531704
Metrics:


See at: ISTI Repository Open Access | dl.acm.org Restricted | doi.org Restricted | CNR ExploRA


2023 Journal article Open Access OPEN
Efficient and effective tree-based and neural learning to rank
Bruch S., Lucchese C., Nardini F. M.
As information retrieval researchers, we not only develop algorithmic solutions to hard problems, but we also insist on a proper, multifaceted evaluation of ideas. The literature on the fundamental topic of retrieval and ranking, for instance, has a rich history of studying the effectiveness of indexes, retrieval algorithms, and complex machine learning rankers, while at the same time quantifying their computational costs, from creation and training to application and inference. This is evidenced, for example, by more than a decade of research on efficient training and inference of large decision forest models in Learning to Rank (LtR). As we move towards even more complex, deep learning models in a wide range of applications, questions on efficiency have once again resurfaced with renewed urgency. Indeed, efficiency is no longer limited to time and space; instead it has found new, challenging dimensions that stretch to resource-, sample- and energy-efficiency with ramifications for researchers, users, and the environment. This monograph takes a step towards promoting the study of efficiency in the era of neural information retrieval by offering a comprehensive survey of the literature on efficiency and effectiveness in ranking, and to a limited extent, retrieval. This monograph was inspired by the parallels that exist between the challenges in neural network-based ranking solutions and their predecessors, decision forest-based LtR models, as well as the connections between the solutions the literature to date has to offer. We believe that by understanding the fundamentals underpinning these algorithmic and data structure solutions for containing the contentious relationship between efficiency and effectiveness, one can better identify future directions and more efficiently determine the merits of ideas. We also present what we believe to be important research directions in the forefront of efficiency and effectiveness in retrieval and ranking.Source: Foundations and trends in information retrieval 17 (2023): 1–123. doi:10.1561/1500000071
DOI: 10.1561/1500000071
DOI: 10.48550/arxiv.2305.08680
Metrics:


See at: ISTI Repository Open Access | Foundations and Trends® in Information Retrieval Restricted | doi.org Restricted | CNR ExploRA


2023 Conference article Open Access OPEN
ReNeuIR at SIGIR 2023: The Second Workshop on Reaching Efficiency in Neural Information Retrieval
Bruch S., Mackenzie J., Maistro M., Nardini F. M.
Multifaceted, empirical evaluation of algorithmic ideas is one of the central pillars of Information Retrieval (IR) research. The IR community has a rich history of studying the effectiveness of indexes, retrieval algorithms, and complex machine learning rankers and, at the same time, quantifying their computational costs, from creation and training to application and inference. As the community moves towards even more complex deep learning models, questions on efficiency have once again become relevant with renewed urgency. Indeed, efficiency is no longer limited to time and space; instead it has found new, challenging dimensions that stretch to resource-, sample- and energy-efficiency with ramifications for researchers, users, and the environment alike. Examining algorithms and models through the lens of holistic efficiency requires the establishment of standards and principles, from defining relevant concepts, to designing metrics, to creating guidelines for making sense of the significance of new findings. The second iteration of the ReNeuIR workshop aims to bring the community together to debate these questions, with the express purpose of moving towards a common benchmarking framework for efficiency.Source: SIGIR '23: 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3456–3459, Taipei, Taiwan, China, 23-27/07/2023
DOI: 10.1145/3539618.3591922
Metrics:


See at: ISTI Repository Open Access | doi.org Restricted | CNR ExploRA


2023 Journal article Open Access OPEN
An approximate algorithm for maximum inner product search over streaming sparse vectors
Bruch S., Nardini F. M., Ingber A., Liberty E.
Maximum Inner Product Search or top-k retrieval on sparse vectors is well-understood in information retrieval, with a number of mature algorithms that solve it exactly. However, all existing algorithms are tailored to text and frequency-based similarity measures. To achieve optimal memory footprint and query latency, they rely on the near stationarity of documents and on laws governing natural languages. We consider, instead, a setup in which collections are streaming--necessitating dynamic indexing--and where indexing and retrieval must work with arbitrarily distributed real-valued vectors. As we show, existing algorithms are no longer competitive in this setup, even against na"ive solutions. We investigate this gap and present a novel approximate solution, called Sinnamon, that can efficiently retrieve the top-k results for sparse real valued vectors drawn from arbitrary distributions. Notably, Sinnamon offers levers to trade-off memory consumption, latency, and accuracy, making the algorithm suitable for constrained applications and systems. We give theoretical results on the error introduced by the approximate nature of the algorithm, and present an empirical evaluation of its performance on two hardware platforms and synthetic and real-valued datasets. We conclude by laying out concrete directions for future research on this general top-k retrieval problem over sparse vectors.Source: ACM transactions on information systems (2023). doi:10.1145/3609797
DOI: 10.1145/3609797
Metrics:


See at: dl.acm.org Open Access | ISTI Repository Open Access | ACM Transactions on Information Systems Restricted | CNR ExploRA


2023 Contribution to conference Open Access OPEN
String processing and information retrieval - SPIRE 2023 - Proceedings of the 30th International Symposium, Pisa, Italy, September 26-28, 2023
Nardini F. M., Pisanti N., Venturini R.
The 30th International Symposium on String Processing and Information Retrieval (SPIRE) was held on September 26-28, 2023, in Pisa (Italy), followed by the 18th Workshop on Compression, Text, and Algorithms (WCTA) held on September 29, 2023. SPIRE started in 1993 as the South American Workshop on String Processing. It was held in Latin America until 2000. Then, SPIRE moved to Europe, and from then on, it has been held in Australia, Japan, the UK, Spain, Italy, Finland, Portugal, Israel, Brazil, Chile, Colombia, Mexico, Argentina, Bolivia, Peru, the USA, and France. SPIRE continues the long and well-established tradition of encouraging high-quality research at the broad nexus of string processing, information retrieval, and computational biology. This volume contains the accepted papers presented at SPIRE 2023. SPIRE 2023 received a total of 47 submissions. Each submission received at least three reviews. After an intensive discussion phase, the Scientific Program Committee accepted 31 papers. We thank all the authors for their valuable contributions and presentations at the conference and the Program Committee members for their valuable work during the review and discussion phases. We thank Springer for publishing the proceedings of SPIRE 2023 in the LNCS series and ACM SIGIR for sponsoring the conference. The scientific program of SPIRE 2023 includes invited talks by three eminent researchers in the field: Sebastian Bruch (Pinecone, USA), Inge Li Gørtz (Technical University of Denmark, Denmark), and Jakub Radoszewski (University of Warsaw, Poland). SPIRE 2023 had a Best Paper Award, sponsored by Springer. The award was announced during the conference. Finally, we thank the Local Organizing Committee members for making the conference successful.Source: New York: Springer, 2023
DOI: 10.1007/978-3-031-43980-3
Metrics:


See at: ISTI Repository Open Access | CNR ExploRA


2023 Contribution to conference Open Access OPEN
Proceedings of the 13th Italian Information Retrieval Workshop (IIR 2023), Pisa, Italy, June 8-9, 2023
Nardini F. M., Tonellotto N., Faggioli G., Ferrara A.
There were 33 papers submitted this workshop. Out of these, 24 were accepted for this volume, 1 as regular papers and 23 as short papers.Source: Aachen: CEUR-WS.org, 2023

See at: ceur-ws.org Open Access | ISTI Repository Open Access | CNR ExploRA


2022 Journal article Open Access OPEN
Report on the 1st Workshop on Reaching Efficiency in Neural Information Retrieval (ReNeuIR 2022) at SIGIR 2022
Bruch S., Lucchese C., Nardini F. M.
As Information Retrieval (IR) researchers, we not only develop algorithmic solutions to hard problems, but we also insist on a proper, multifaceted evaluation of ideas. The IR literature on the fundamental topic of retrieval and ranking, for instance, has a rich history of studying the effectiveness of indexes, retrieval algorithms, and complex machine learning rankers and, at the same time, quantifying their computational costs, from creation and training to application and inference. This is evidenced, for example, by more than a decade of research on efficient training and inference of large decision forest models in Learning to Rank (LtR). As we move towards even more complex, deep learning models in a wide range of applications, questions on efficiency have once again become relevant with renewed urgency. Indeed, efficiency is no longer limited to time- and space-efficiency; instead it has found new, challenging dimensions that stretch to resource-, sample- and energy-efficiency with ramifications for researchers, users, and the environment. As a step towards bringing together experts from industry and academia and creating a forum for a critical discussion and the promotion of efficiency in the era of Neural Information Retrieval (NIR), we held the ReNeuIR workshop on July 15th, 2022 as a hybrid event---in person in Madrid, Spain along with online attendees---in conjunction with ACM SIGIR 2022. Recognizing the importance of this topic, approximately 80 participants answered our call and attended the workshop over three sessions. The event included a total of two keynotes and eight paper presentations, and concluded with a lively discussion where participants helped identify gaps in existing research and brainstormed future research directions. We had consensus in recognizing that efficiency is not simply latency, that a holistic, concrete definition of efficiency is needed to guide researchers and reviewers alike, and that more research is necessary in developing efficiency-centered evaluation metrics and standard benchmark datasets, platforms, and tools.Source: SIGIR forum 56 (2022): 1–14. doi:10.1145/3582900.3582916
DOI: 10.1145/3582900.3582916
Metrics:


See at: ISTI Repository Open Access | dl.acm.org Restricted | ACM SIGIR Forum Restricted | Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari Restricted | CNR ExploRA


2015 Conference article Restricted
MUSETS: Diversity-aware web query suggestions for shortening user sessions
Sydow M., Muntean C. I., Nardini F. M., Matwin S., Silvestri F.
We propose MUSETS (multi-session total shortening) - a novel formulation of the query suggestion task, specified as an optimization problem. Given an ambiguous user query, the goal is to propose the user a set of query suggestions that optimizes a diversity-aware objective function. The function models the expected number of query reformulations that a user would save until reaching a satisfactory query formulation. The function is diversity-aware, as it naturally enforces high coverage of different alternative continuations of the user session. For modeling the topics covered by the queries, we also use an extended query representation based on entities extracted from Wikipedia. We apply a machine learning approach to learn the model on a set of user sessions to be subsequently used for queries that are under-represented in historical query logs and present an evaluation of the approach.Source: Foundations of Intelligent Systems. 22nd International Symposium, pp. 237–247, Lyon, France, 21-23/10/2015
DOI: 10.1007/978-3-319-25252-0_26
Metrics:


See at: doi.org Restricted | link.springer.com Restricted | CNR ExploRA


2015 Conference article Open Access OPEN
Gamification in information retrieval: State of the art, challenges and opportunities
Muntean C. I., Nardini F. M.
Gamification aims at applying game design principles and elements, such as points, badges, feedbacks or leader boards in non- gaming environments. An interesting goal of gamification is to combine and exploit the fun factor for targeting other aspects like achieving more accurate work, more cost effective solutions and better retention rates. The application of gamification techniques to IR tasks poses interesting research challenges. In this paper, we propose an analysis of the state of the art in this field and we summarize interesting challenges and oppor- tunities for the near future.Source: 6th Italian Information Retrieval Workshop, Cagliari, Italy, 25-26/05/2015

See at: ceur-ws.org Open Access | CNR ExploRA


2016 Contribution to conference Open Access OPEN
Understanding human mobility during events in foursquare
Muntean C. I., Nardini F. M., Noulas A.
Social events can generate high influxes of people transitioning various locations in a city. They can be considered to have a considerable impact on the local economy, whether they are sport events, concerts or festivals. These events are capable of generating sudden changes in the activity landscape of a city, with the neighborhoods that host events becoming unusually busy and active compared to times of regular citizen activity. While event and anomaly detection more generally has been a topic of study in recent years, as also has been event recommendation for mobile users, progress has been slower towards building systems that are able to capture the sudden shift appropriately in this setting. In this work we exploit data from the location-based service Foursquare to study mobility during events in Chicago, and later expand our study to other cities as well. Our aim is to identify what differences emerge in terms of user mobility during events versus regular periods of human activity.Source: 7th Italian Information Retrieval Workshop, Venezia, Italy, 30-31 May 2016

See at: ceur-ws.org Open Access | CNR ExploRA


2016 Conference article Closed Access
Learning to rank user queries to detect search tasks
Lucchese C., Nardini F. M., Orlando S., Tolomei G.
We present a framework for discovering sets of web queries having similar latent needs, called search tasks, from user queries stored in a search engine log. The framework is made of two main modules: Query Similarity Learning (QSL) and Graph-based Query Clustering (GQC). The former is devoted to learning a query similarity function from a ground truth of manually-labeled search tasks. The latter represents each user search log as a graph whose nodes are queries, and uses the learned similarity function to weight edges between query pairs. Finally, search tasks are detected by clustering those queries in the graph which are connected by the strongest links, in fact by detecting the strongest connected components of the graph. To discriminate between "strong" and "weak" links also the GQC module entails a learning phase whose goal is to estimate the best threshold for pruning the edges of the graph. We discuss how the QSL module can be effectively implemented using Learning to Rank (L2R) techniques. Experiments on a real-world search engine log show that query similarity functions learned using L2R lead to better performing GQC implementations when compared to similarity functions induced by other state-of the-art machine learning solutions, such as logistic regression and decision trees.Source: ACM International Conference on the Theory of Information Retrieval, pp. 157–166, Newark, DE, USA, 12-16 September 2016
DOI: 10.1145/2970398.2970407
Project(s): SoBigData via OpenAIRE
Metrics:


See at: doi.org Restricted | ieeexplore.ieee.org Restricted | CNR ExploRA


2017 Conference article Restricted
Efficiency/Effectiveness trade-offs in learning to rank
Lucchese C., Nardini F. M.
In the last years, Learning to Rank (LtR) had a significant influence on several tasks in the Information Retrieval field, with large research efforts coming both from the academia and the industry. Indeed, efficiency requirements must be fulfilled in order to make an effective research product deployable within an industrial environment. The evaluation of a model can be too expensive due to its size, the features used and several other factors. This tutorial discusses the recent solutions that allow to build an effective ranking model that satisfies temporal budget constrains at evaluation time.Source: ICTIR 2017 - 3rd ACM International Conference on the Theory of Information Retrieval, pp. 329–330, Amsterdam, The Netherlands, 1-4 October, 2017
DOI: 10.1145/3121050.3121109
Metrics:


See at: doi.acm.org Restricted | doi.org Restricted | CNR ExploRA


2019 Conference article Open Access OPEN
An Optimal Algorithm to Find Champions of Tournament Graphs
Beretta L., Nardini F. M., Trani R., Venturini R.
A tournament graph T=(V,E) is an oriented complete graph, which can be used to model a round-robin tournament between n players. In this short paper, we address the problem of finding a champion of the tournament, also known as Copeland winner, which is a player that wins the highest number of matches. Our goal is to solve the problem by minimizing the number of arc lookups, i.e., the number of matches played. We prove that finding a champion requires ?(ln) comparisons, where l is the number of matches lost by the champion, and we present a deterministic algorithm matching this lower bound without knowing l . Solving this problem has important implications on several Information Retrieval applications including Web search, conversational AI, machine translation, question answering, recommender systems, etc.Source: SPIRE 2019: International Symposium on String Processing and Information Retrieval, pp. 267–273, Segovia, Spagna, 07/10/2019, 09/10/2019
DOI: 10.1007/978-3-030-32686-9_19
Project(s): BigDataGrapes via OpenAIRE
Metrics:


See at: ISTI Repository Open Access | Lecture Notes in Computer Science Restricted | link.springer.com Restricted | CNR ExploRA