Skip to main content

Query-Based Versus Resource-Based Cache Strategies in Tag-Based Browsing Systems

  • Conference paper
  • First Online:
Maturity and Innovation in Digital Libraries (ICADL 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11279))

Included in the following conference series:

Abstract

Tag-based browsing is a popular interaction model for navigating digital libraries. According to this model, users select descriptive tags to filter resources in the collections. Typical implementations of the model are based on inverted indexes. However, these implementations can require a considerable amount of set operations to update the browsing state. To palliate this inconvenience, it is possible to adopt suitable cache strategies. In this paper we describe and compare two of these strategies: (i) a query-based strategy, according to which previously computed browsing states are indexed by sets of selected tags; and (ii) a resource-based strategy, according to which browsing states are indexed by sets of filtered resources. Our comparison focused on runtime performance, and was carried out empirically, using a real-world web-based collection in the field of digital humanities. The results obtained show that the resource-based strategy clearly outperforms the query-based one.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://oda-fec.org/ucm-chasqui.

  2. 2.

    All the measures reported were taken on a machine with an Intel® Core™ i5-4660S 2.9 GHz processor, RAM 16 GB and Windows 10 OS. Browsing software was programmed in Java. The browsing cache was maintained in memory using Java’s HashMap s. Sets were managed using roaring bitmaps [4].

References

  1. Bloehdorn, S., Görlitz, O., Schenk, S., Völkel, M.: TagFS - tag semantics for hierarchical file systems. In: Proceedings of the 6th International Conference on Knowledge Management (I-KNOW 2006) (2006)

    Google Scholar 

  2. Buendía, F., Gayoso-Cabada, J., Sierra, J.-L.: Using digital medical collections to support radiology training in e-learning platforms. In: Pammer-Schindler, V., Pérez-Sanagustín, M., Drachsler, H., Elferink, R., Scheffel, M. (eds.) EC-TEL 2018. LNCS, vol. 11082, pp. 566–569. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98572-5_46

    Chapter  Google Scholar 

  3. Carpineto, C., Romano, G.: Concept Data Analysis: Theory and Applications. Wiley, Hoboken (2004)

    Book  Google Scholar 

  4. Chambi, S., Lemire, D., Kaser, O., Godin, R.: Better bitmap performance with Roaring bitmaps. Softw.-Pract. Exp. 46(5), 709–719 (2016)

    Article  Google Scholar 

  5. Culpepper, J.-S.; Moffat, A.: Efficient set intersection for inverted indexing. ACM Trans. Inf. Syst. 29(1), Article no. 1 (2010)

    Article  Google Scholar 

  6. Dimitrov, D., Helic, D., Strohmaier, M.: Tag-based navigation and visualization. In: Brusilovsky, P., He, D. (eds.) Social Information Access. LNCS, vol. 10100, pp. 181–212. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-90092-6_6

    Chapter  Google Scholar 

  7. Eck, O., Schaefer, D.: A semantic file system for integrated product data management. Adv. Eng. Inform. 25(2), 177–184 (2011)

    Article  Google Scholar 

  8. Gayoso-Cabada, J., Gómez-Albarrán, M., Sierra, J.-L.: Tag-based browsing of digital collections with inverted indexes and browsing cache. In: Proceedings of the 6th Edition of the Technological Ecosystems for Enhancing Multiculturality Conference (TEEM 2018) (2018)

    Google Scholar 

  9. Gayoso-Cabada, J., Rodríguez-Cerezo, D., Sierra, J.-L.: Multilevel browsing of folksonomy-based digital collections. In: Cellary, W., Mokbel, Mohamed F., Wang, J., Wang, H., Zhou, R., Zhang, Y. (eds.) WISE 2016. LNCS, vol. 10042, pp. 43–51. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-48743-4_4

    Chapter  Google Scholar 

  10. Gayoso-Cabada, J., Rodríguez-Cerezo, D., Sierra, J.-L.: Browsing digital collections with reconfigurable faceted thesauri. In: 25th International Conference on Information Systems Development (ISD), Katowize, Poland (2016)

    Google Scholar 

  11. Gayoso-Cabada, J., Rodríguez-Cerezo, D., Sierra, J.-L.: Browsing digital collections with reconfigurable faceted thesauri. In: Gołuchowski, J., Pańkowska, M., Linger, H., Barry, C., Lang, M., Schneider, C. (eds.) Complexity in Information Systems Development. LNISO, vol. 22, pp. 69–86. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-52593-8_5

    Chapter  Google Scholar 

  12. Gifford, D.K., Jouvelot, P., Sheldon, M.A., O’Toole, J.W.: Semantic file systems. SIGOPS Oper. Syst. Rev. 25(5), 16–25 (1991)

    Article  Google Scholar 

  13. Greene, G.-J., Dunaiski, M., Fischer, B.: Browsing publication data using tag clouds over concept lattices constructed by key-phrase extraction. In: Proceedings of Russian and South African Workshop on Knowledge Discovery Techniques Based on Formal Concept Analysis (RuZA 2015) (2015)

    Google Scholar 

  14. Helic, D., Trattner, C., Strohmaier, M., Andrews, K.: On the navigability of social tagging systems. In: 2010 IEEE Second International Conference on Social Computing (SocialCom 2010), pp. 161–168 (2010)

    Google Scholar 

  15. Hernandez, M.-E., Falconer, S.-M., Storey, M.-A., Carini, S., Sim, I.: Synchronized tag clouds for exploring semi-structured clinical trial data. In: Proceedings of the 2008 Conference of the Center for Advanced Studies on Collaborative Research: Meeting of Minds (CASCON 2008) (2008)

    Google Scholar 

  16. Kammerer, Y., Nairn, R., Pirolli, P., Chi, E.H.: Signpost from the masses: learning effects in an exploratory social tag search browser. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI 2009) (2009)

    Google Scholar 

  17. Kleinberg, J.: Navigation in a small world. Nature 406(6798), 845 (2000)

    Article  Google Scholar 

  18. Koutrika, G., Zadeh, Z.-M., Garcia-Molina, H.: CourseCloud: summarizing and refining keyword searches over structured data. In: Proceedings of the 12th International Conference on Extending Database Technology (EDBT), pp. 1132–1135 (2009)

    Google Scholar 

  19. Leone, S., Geel, M., Müller, C., Norrie, M.C.: Exploiting tag clouds for database browsing and querying. In: Soffer, P., Proper, E. (eds.) CAiSE Forum 2010. LNBIP, vol. 72, pp. 15–28. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-17722-4_2

    Chapter  Google Scholar 

  20. Lin, Y.-L., Brusilovsky, P., He, D.: Finding cultural heritage images through a Dual-Perspective Navigation Framework. Inf. Proc. Manag. 52(5), 820–839 (2016)

    Article  Google Scholar 

  21. Redden, C.S.: Social bookmarking in academic libraries: trends and applications. J. Acad. Librariansh. 36(3), 219–227 (2010)

    Article  Google Scholar 

  22. Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1986)

    MATH  Google Scholar 

  23. Schwartz, B., Tkachenko, V., Zaitsev, P.: High Performance MySQL, 3rd edn. O’Reilly Media, Sebastopol (2012)

    Google Scholar 

  24. Seltzer, M., Murphy, N.: Hierarchical file systems are dead. In: Proceedings of the 12th Conference on Hot Topics in Operating Systems (HotOS 2009) (2009)

    Google Scholar 

  25. Sierra, J.-L., Fernández-Valmayor, A., Guinea, M., Hernanz, H.: From research resources to learning objects: process model and virtualization experiences. Educ. Technol. Soc. 9(3), 56–68 (2006)

    Google Scholar 

  26. Sim, H., Kim, Y., Vazhkudai, S.S., Vallée, G.R., Lim, S.-H., Butt, A.R.: Tagit: an integrated indexing and search service for file systems. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2017) (2017)

    Google Scholar 

  27. Trattner, C., Lin, Y., Parra, D., Yue, Z., Real, W., Brusilovsky, P.: Evaluating tag-based information access in image collections. In: Proceedings of the 23rd ACM Conference on Hypertext and Social Media (HT 2012), pp. 113–122 (2012)

    Google Scholar 

  28. Watson, R., Dekeyser, S., Albadri, N.: Exploring the design space of metadata-focused file management systems. In: Proceedings of the Australasian Computer Science Week Multiconference (ACSW 2017) (2017)

    Google Scholar 

  29. Way, T., Eklund, P.: Social tagging for digital libraries using formal concept analysis. In: Proceedings of the 17th International Conference on Concept Lattices and Their Applications (CLA 2010) (2010)

    Google Scholar 

  30. Zobel, J., Moffat, A.: Inverted files for text search engines. ACM Comput. Surv. 33(2), Article 6 (2006)

    Article  Google Scholar 

Download references

Acknowledgements

This research is supported by the research projects TIN2014-52010-R and TIN2017-88092-R. Also, we would like to thank Mercedes Guinea and Alfredo Fernández-Valmayor (El Caño Foundation, Panamá), for their work on Chasqui.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to José-Luis Sierra .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gayoso-Cabada, J., Gómez-Albarrán, M., Sierra, JL. (2018). Query-Based Versus Resource-Based Cache Strategies in Tag-Based Browsing Systems. In: Dobreva, M., Hinze, A., Žumer, M. (eds) Maturity and Innovation in Digital Libraries. ICADL 2018. Lecture Notes in Computer Science(), vol 11279. Springer, Cham. https://doi.org/10.1007/978-3-030-04257-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-04257-8_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-04256-1

  • Online ISBN: 978-3-030-04257-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics