Hybrid Index Structures for Temporal-Textual Web Search

  • Conference paper
Web Technologies and Applications (APWeb 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6612))

Abstract

Most Web pages contain temporal information. However, most previous studies consider only the update time of Web pages and do not fully exploit the other temporal features of the Web. In this paper, we propose a novel approach that fuses different temporal features of Web pages to build an efficient index structure for temporal-textual Web search. Specifically, we focus on update time and content time, and propose a hybrid index structure that organizes textual keywords, update time, and content time. In particular, we study three mechanisms for implementing such a hybrid index structure: (1) an inverted file followed by a MAP21-tree and a B+-tree, (2) an inverted file followed by a MAP21-tree, and (3) an expanded inverted file. We conduct experiments on a real dataset to evaluate the performance of these hybrid index structures. The experimental results show that the inverted file followed by a MAP21-tree has the best query performance.
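
The general idea of a keyword-first hybrid index can be illustrated with a minimal sketch. It is not the paper's implementation: a sorted Python list searched with `bisect` stands in for the temporal tree structures, and the class name `HybridIndex`, its methods, and the integer `YYYYMMDD` time encoding are all assumptions made for illustration. The inverted file is consulted first, then the postings are narrowed by update time, and finally filtered by overlap with the content-time interval.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


class HybridIndex:
    """Hypothetical keyword-first temporal-textual index (illustrative only)."""

    def __init__(self):
        # keyword -> list of (update_time, doc_id, content_start, content_end)
        self._postings = defaultdict(list)
        self._dirty = False

    def add(self, doc_id, keywords, update_time, content_interval):
        """Index one page under each distinct keyword."""
        start, end = content_interval
        for kw in set(keywords):
            self._postings[kw].append((update_time, doc_id, start, end))
        self._dirty = True

    def _sort_postings(self):
        # Keep each posting list ordered by update time so that a
        # binary search can stand in for a tree range scan.
        if self._dirty:
            for entries in self._postings.values():
                entries.sort(key=lambda e: e[0])
            self._dirty = False

    def search(self, keyword, update_range, content_range):
        """Return ids of pages containing `keyword`, updated within
        `update_range`, whose content time overlaps `content_range`."""
        self._sort_postings()
        entries = self._postings.get(keyword, [])
        times = [e[0] for e in entries]  # rebuilt per query for brevity
        lo = bisect_left(times, update_range[0])
        hi = bisect_right(times, update_range[1])
        c_lo, c_hi = content_range
        return [
            doc_id
            for _, doc_id, start, end in entries[lo:hi]
            if start <= c_hi and end >= c_lo  # closed-interval overlap
        ]


index = HybridIndex()
index.add(1, ["olympics", "beijing"], 20080810, (20080808, 20080824))
index.add(2, ["olympics"], 20120801, (20120727, 20120812))
index.add(3, ["olympics"], 20120105, (20080808, 20080824))

# Pages about the 2008 period that were updated during 2012.
hits = index.search("olympics", (20120101, 20121231), (20080101, 20081231))
```

A production structure would replace the per-query list scan with a persistent tree over the temporal attributes; the sketch only shows the order in which the textual and temporal predicates are applied.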

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jin, P., Chen, H., Lin, S., Zhao, X., Yue, L. (2011). Hybrid Index Structures for Temporal-Textual Web Search. In: Du, X., Fan, W., Wang, J., Peng, Z., Sharaf, M.A. (eds) Web Technologies and Applications. APWeb 2011. Lecture Notes in Computer Science, vol 6612. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20291-9_28

  • DOI: https://doi.org/10.1007/978-3-642-20291-9_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20290-2

  • Online ISBN: 978-3-642-20291-9

  • eBook Packages: Computer Science, Computer Science (R0)
