Query Driven Algorithm Selection in Early Stage Retrieval

Type: Preprint

Publication Date: 2018-02-02

Citations: 43

DOI: https://doi.org/10.1145/3159652.3159676

Download PDF

Abstract

Scalable web search systems typically employ multi-stage retrieval architectures, where an initial stage generates a set of candidate documents that are then pruned and re-ranked. Since subsequent stages typically exploit a multitude of features of varying costs using machine-learned models, reducing the number of documents that are considered at each stage improves latency. In this work, we propose and validate a unified framework that can be used to predict a wide range of performance-sensitive parameters which minimize effectiveness loss, while simultaneously minimizing query latency, across all stages of a multi-stage search architecture. Furthermore, our framework can be easily applied in large-scale IR systems, can be trained without explicitly requiring relevance judgments, and can target a variety of different efficiency-effectiveness trade-offs, making it well suited to a wide range of search scenarios. Our results show that we can reliably predict a number of different parameters on a per-query basis, while simultaneously detecting and minimizing the likelihood of tail-latency queries that exceed a pre-specified performance budget. As a proof of concept, we use the prediction framework to help alleviate the problem of tail-latency queries in early stage retrieval. On the standard ClueWeb09B collection and 31k queries, we show that our new hybrid system can reliably achieve a maximum query time of 200 ms with a 99.99% response time guarantee without a significant loss in overall effectiveness. The solutions presented are practical, and can easily be used in large-scale distributed search engine deployments with a small amount of additional overhead.

Locations

  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ Dynamic Trade-Off Prediction in Multi-Stage Retrieval Systems 2016 J. Shane Culpepper
Charles L. A. Clarke
Jimmy Lin
+ Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments 2015 Charles L. A. Clarke
J. Shane Culpepper
Alistair Moffat
+ Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments 2015 Charles L. A. Clarke
J. Shane Culpepper
Alistair Moffat
+ PDF Chat Forward and backward feature selection for query performance prediction 2020 Sébastien Dejean
Radu Tudor Ionescu
Josiane Mothe
Md Zia Ullah
+ Forward and Backward Feature Selection for Query Performance Prediction 2019 Sébastien Dejean
Radu Tudor Ionescu
Josiane Mothe
Md Zia Ullah
+ PDF Chat Boosting Search Performance Using Query Variations 2019 Rodger Benham
Joel Mackenzie
Alistair Moffat
J. Shane Culpepper
+ Query-level Early Exit for Additive Learning-to-Rank Ensembles 2020 Claudio Lucchese
Franco Maria Nardini
Salvatore Orlando
Raffaele Perego
Salvatore Trani
+ Query-level Early Exit for Additive Learning-to-Rank Ensembles 2020 Claudio Lucchese
Franco Maria Nardini
Salvatore Orlando
Raffaele Perego
Salvatore Trani
+ Learning Early Exit Strategies for Additive Ranking Ensembles 2021 Francesco Busolin
Claudio Lucchese
Franco Maria Nardini
Salvatore Orlando
Raffaele Perego
Salvatore Trani
+ PDF Chat Learning Early Exit Strategies for Additive Ranking Ensembles 2021 Francesco Busolin
Claudio Lucchese
Franco Maria Nardini
Salvatore Orlando
Raffaele Perego
Salvatore Trani
+ PDF Chat Query-level Early Exit for Additive Learning-to-Rank Ensembles 2020 Claudio Lucchese
Franco Maria Nardini
Salvatore Orlando
Raffaele Perego
Salvatore Trani
+ PDF Chat Cascade Ranking for Operational E-commerce Search 2017 Shichen Liu
Fei Xiao
Wenwu Ou
Luo Si
+ Refining Recency Search Results with User Click Feedback 2011 Taesup Moon
Wei Chu
Lihong Li
Zhaohui Zheng
Yi Chang
+ Effectiveness and Efficiency Trade-off in Selective Query Processing 2023 Josiane Mothe
Md Zia Ullah
+ Selective Query Processing: a Risk-Sensitive Selection of System Configurations 2023 Josiane Mothe
Md Zia Ullah
+ Selective Query Processing: A Risk-Sensitive Selection of Search Configurations 2023 Josiane Mothe
Md Zia Ullah
+ PDF Chat A Comprehensive Survey on Retrieval Methods in Recommender Systems 2024 Junjie Huang
Jizheng Chen
Jianghao Lin
Jiarui Qin
Ziming Feng
Weinan Zhang
Yong Yu
+ PDF Chat Assessing efficiency–effectiveness tradeoffs in multi-stage retrieval systems without using relevance judgments 2016 Charles L. A. Clarke
J. Shane Culpepper
Alistair Moffat
+ Unsupervised Search Algorithm Configuration using Query Performance Prediction 2024 Haggai Roitman
+ PDF Chat Anytime Ranking on Document-Ordered Indexes 2021 Joel Mackenzie
Matthias Petri
Alistair Moffat