Skip to main content

A New Static Web Caching Mechanism Based on Mutual Dependency Between Result Cache and Posting List Cache

  • Conference paper
  • First Online:
Web Information Systems Engineering – WISE 2017 (WISE 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10570))

Included in the following conference series:

  • 1436 Accesses

Abstract

Caching is an important optimization technique in search engine architectures. There exist various types of caches, such as result cache, posting list cache, intersection cache, snippet cache, and document cache. However, these caching techniques are studied separately. Although several multiple level caches that integrate different types of caches have been proposed, the relationships among different caches are ignored. In this paper, we study the mutual dependency between the result cache and the posting list cache via empirical experiments and observe duplicate hits in the two types of caches. In order to better utilize the cache space and increase the hit ratio, three algorithms are proposed to implement a static cache mechanism based on the mutual dependency between the result cache and the posting list cache. A series of experiments were conducted on a real data set and the results have demonstrated the improvement of the hit ratio of our proposals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Notes

  1. 1.

    https://dumps.wikimedia.org/enwiki/20161201.

  2. 2.

    https://lucene.apache.org.

References

  1. Altingovde, I.S., Ozcan, R., Ulusoy, Ö.: A cost-aware strategy for query result caching in web search engines. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 628–636. Springer, Heidelberg (2009). doi:10.1007/978-3-642-00958-7_59

    Chapter  Google Scholar 

  2. Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: The impact of caching on search engines. In: SIGIR, pp. 183–190 (2007)

    Google Scholar 

  3. Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: Design trade-offs for search engine caching. TWEB 2(4), 20:1–20:28 (2008)

    Article  Google Scholar 

  4. Baeza-Yates, R., Jonassen, S.: Modeling static caching in web search engines. In: Baeza-Yates, R., Vries, A.P., Zaragoza, H., Cambazoglu, B.B., Murdock, V., Lempel, R., Silvestri, F. (eds.) ECIR 2012. LNCS, vol. 7224, pp. 436–446. Springer, Heidelberg (2012). doi:10.1007/978-3-642-28997-2_37

    Chapter  Google Scholar 

  5. Ceccarelli, D., Lucchese, C., Orlando, S., Perego, R., Silvestri, F.: Caching query-biased snippets for efficient retrieval. In: EDBT, pp. 93–104 (2011)

    Google Scholar 

  6. Fagni, T., Perego, R., Silvestri, F., Orlando, S.: Boosting the performance of web search engines: caching and prefetching query results by exploiting historical usage data. ACM Trans. Inf. Syst. 24(1), 51–78 (2006)

    Article  Google Scholar 

  7. Gan, Q., Suel, T.: Improved techniques for result caching in web search engines. In: WWW, pp. 431–440 (2009)

    Google Scholar 

  8. Long, X., Suel, T.: Three-level caching for efficient query processing in large web search engines. World Wide Web 9(4), 369–395 (2006)

    Article  Google Scholar 

  9. Marín, M., Costa, V.G., Gómez-Pantoja, C.: New caching techniques for web search engines. In: HPDC, pp. 215–226 (2010)

    Google Scholar 

  10. Markatos, E.P.: On caching search engine query results. Comput. Commun. 24(2), 137–143 (2001)

    Article  Google Scholar 

  11. Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Junqueira, F.P., Ulusoy, Ö.: A five-level static cache architecture for web search engines. Inf. Process. Manage. 48(5), 828–840 (2012)

    Article  Google Scholar 

  12. Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Ulusoy, Ö.: Second chance: a hybrid approach for dynamic result caching and prefetching in search engines. TWEB 8(1), 3:1–3:22 (2013)

    Article  Google Scholar 

  13. Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Static query result caching revisited. In: WWW, pp. 1169–1170 (2008)

    Google Scholar 

  14. Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Cost-aware strategies for query result caching in web search engines. TWEB 5(2), 9:1–9:25 (2011)

    Article  Google Scholar 

  15. Podlipnig, S., Böszörményi, L.: A survey of web cache replacement strategies. ACM Comput. Surv. 35(4), 374–398 (2003)

    Article  Google Scholar 

  16. Saraiva, P.C., de Moura, E.S., Fonseca, R.C., Meira, W.Jr., Ribeiro-Neto, B.A., Ziviani, N.: Rank-preserving two-level caching for scalable search engines. In: SIGIR, pp. 51–58 (2001)

    Google Scholar 

  17. Tolosa, G., Becchetti, L., Feuerstein, E., Marchetti-Spaccamela, A.: Performance improvements for search systems using an integrated cache of Lists+Intersections. In: Moura, E., Crochemore, M. (eds.) SPIRE 2014. LNCS, vol. 8799, pp. 227–235. Springer, Cham (2014). doi:10.1007/978-3-319-11918-2_22

    Chapter  Google Scholar 

  18. Zhang, J., Long, X., Suel, T.: Performance of compressed inverted list caching in search engines. In: WWW, pp. 387–396 (2008)

    Google Scholar 

  19. Zhou, W., Li, R., Dong, X., Xu, Z., Xiao, W.: An intersection cache based on frequent itemset mining in large scale search engines. In: IEEE Workshop on Hot Topics in Web Systems and Technologies, pp. 19–24 (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dingming Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Trinh, T., Wu, D., Huang, J.Z. (2017). A New Static Web Caching Mechanism Based on Mutual Dependency Between Result Cache and Posting List Cache. In: Bouguettaya, A., et al. Web Information Systems Engineering – WISE 2017. WISE 2017. Lecture Notes in Computer Science(), vol 10570. Springer, Cham. https://doi.org/10.1007/978-3-319-68786-5_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-68786-5_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-68785-8

  • Online ISBN: 978-3-319-68786-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics