Query Processing in Highly-Loaded Search Engines

Broccolo, Daniele; Macdonald, Craig; Orlando, Salvatore; Ounis, Iadh; Perego, Raffaele; Silvestri, Fabrizio; Tonellotto, Nicola

doi:10.1007/978-3-319-02432-5_9

Daniele Broccolo^19,20,
Craig Macdonald²¹,
Salvatore Orlando^19,20,
Iadh Ounis²¹,
Raffaele Perego²⁰,
Fabrizio Silvestri²⁰ &
…
Nicola Tonellotto²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8214))

Included in the following conference series:

International Symposium on String Processing and Information Retrieval

1178 Accesses
1 Citations

Abstract

While Web search engines are built to cope with a large number of queries, query traffic can exceed the maximum query rate supported by the underlying computing infrastructure. We study how response times and results vary when, in presence of high loads, some queries are either interrupted after a fixed time threshold elapses or dropped completely. Moreover, we introduce a novel dropping strategy, based on machine learned performance predictors to select the queries to drop in order to sustain the largest possible query rate with a relative degradation in effectiveness.

The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-3-319-02432-5_33

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Anh, V.N., de Kretser, O., Moffat, A.: Vector-space ranking with effective early termination. In: Proceedings of SIGIR, pp. 35–42 (2001)
Google Scholar
Barroso, L.A., Dean, J., Holzle, U.: Web search for a planet: The Google cluster architecture. IEEE Micro 23(2), 22–28 (2003)
Article Google Scholar
Broder, A.Z., Carmel, D., Herscovici, M., Soffer, A., Zien, J.: Efficient query evaluation using a two-level retrieval process. In: Proceedings of CIKM, pp. 426–434 (2003)
Google Scholar
Moffat, A., Zobel, J.: Self-indexing inverted files for fast text retrieval. ACM Trans. Inf. Syst. 14(4), 349–379 (1996)
Article Google Scholar
Tonellotto, N., Macdonald, C., Ounis, I.: Efficient and Effective Retrieval using Selective Pruning. In: Proceedings of WSDM (2013)
Google Scholar
Macdonald, C., Tonellotto, N., Ounis, I.: Learning to Predict Response Times for Online Query Scheduling. In: Proceedings of SIGIR, pp. 621–630 (2012)
Google Scholar
Carterette, B., Pavlu, V., Fang, H., Kanoulas, E.: Million Query Track 2009 Overview. In: Proceedings of TREC (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Università Ca’Foscari of Venice, Italy
Daniele Broccolo & Salvatore Orlando
ISTI-CNR of Pisa, Italy
Daniele Broccolo, Salvatore Orlando, Raffaele Perego, Fabrizio Silvestri & Nicola Tonellotto
University of Glasgow, UK
Craig Macdonald & Iadh Ounis

Authors

Daniele Broccolo
View author publications
You can also search for this author in PubMed Google Scholar
Craig Macdonald
View author publications
You can also search for this author in PubMed Google Scholar
Salvatore Orlando
View author publications
You can also search for this author in PubMed Google Scholar
Iadh Ounis
View author publications
You can also search for this author in PubMed Google Scholar
Raffaele Perego
View author publications
You can also search for this author in PubMed Google Scholar
Fabrizio Silvestri
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Tonellotto
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Industrial Engineering and Management Technion, Technion Institute of Technology, Bloomfield Hall 308, 32000, Haifa, Israel
Oren Kurland
Bar-Ilan University, Israel
Moshe Lewenstein
Department of Computer Science, Bar-Ilan University, 52900, Ramat-Gan, Israel
Ely Porat

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Broccolo, D. et al. (2013). Query Processing in Highly-Loaded Search Engines. In: Kurland, O., Lewenstein, M., Porat, E. (eds) String Processing and Information Retrieval. SPIRE 2013. Lecture Notes in Computer Science, vol 8214. Springer, Cham. https://doi.org/10.1007/978-3-319-02432-5_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-02432-5_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02431-8
Online ISBN: 978-3-319-02432-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics