Skip to main content

HFilter: Hybrid Finite Automaton Based Stream Filtering for Deep and Recursive XML Data

  • Conference paper
Database and Expert Systems Applications (DEXA 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5181))

Included in the following conference series:

Abstract

XML filtering applications are gaining increasing popularity recently. Automata are generally adopted to construct query indexes for evaluating large numbers of XPath queries over XML streams. Usually only shallow data are observed in existing approaches. How to process deep and recursive XML data with low memory limitation efficiently is still a challenging issue. In this paper, we propose HFilter, a Hybrid Finite Automaton (HFA) based stream filtering approach, to solve this problem. We introduce the basic two-tier HFA (lazy DFA tier and NFA tier) first, which realizes data prefix sharing and memory overflow control to improve the filtering throughput. Then an optimized three-tier HFA with an extra pre-expanded DFA tier is put forward, which significantly reduces the restarting cost of HFA after memory overflow. Experiments show that our approaches work more efficiently than existing ones.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Altinel, M., Franklin, M.: Efficient filtering of XML documents for selective dissemination of information. In: VLDB (2000)

    Google Scholar 

  2. Diao, Y., Fischer, P., Franklin, M., To, R.: YFilter: Efficient and Scalable Filtering of XML Documents. In: ICDE (2002)

    Google Scholar 

  3. Chan, C., Felber, P., Garofalakis, M.N., Rastogi, R.: Efficient filtering of XML documents with XPath expressions. In: ICDE (2002)

    Google Scholar 

  4. Diao, Y., Altinel, M., Franklin, M.J., Zhang, H., Fischer, P.: Path sharing and predicate evaluation for high-performance XML filtering. ACM Trans. on Database Systems (TODS) 28(4) (2003)

    Google Scholar 

  5. Green, T., Gupta, A., Miklau, G., Onizuka, M., Suciu, D.: Processing XML streams with deterministic automata and stream index. ACM Trans. on Database Systems (TODS) 29(4) (2004)

    Google Scholar 

  6. Gupta, A., Suciu, D.: Stream processing of XPath queries with predicates. In: SIGMOD (2003)

    Google Scholar 

  7. Onizuka, M.: Light-weight XPath processing of XML stream with deterministic automata. In: CIKM 2003 (2003)

    Google Scholar 

  8. Chen, D., Wong, R.: Optimizing The lazy DFA approach for XML stream processing. In: The Fifteenth Australasian Database Conference (ADC) (2004)

    Google Scholar 

  9. Florescu, D., Hillery, C., Kossmann, D., Lucas, P.: The BEA/XQRL streaming XQuery processor. In: VLDB 2003 (2003)

    Google Scholar 

  10. Ives, Z., Halevy, A., Weld, D.: An XML query engine for network-bound data. VLDB Journal 11(4) (2002)

    Google Scholar 

  11. Diaz, A.L., Lovell, D.: XML Generator, http://www.alphaworks.ibm.com/tech/xmlgenerator

    Google Scholar 

  12. He, B., Luo, Q., Choi, B.: Cache-conscious automata for XML filtering. In: ICDE 2005 (2005)

    Google Scholar 

  13. Zhang, X., Yang, L., Lee, M., Hsu, W.: Scaling SDI systems via query clustering and aggregation. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 208–219. Springer, Heidelberg (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Sourav S. Bhowmick Josef Küng Roland Wagner

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sun, W., Qin, Y., Yu, P., Zhang, Z., He, Z. (2008). HFilter: Hybrid Finite Automaton Based Stream Filtering for Deep and Recursive XML Data. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2008. Lecture Notes in Computer Science, vol 5181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85654-2_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85654-2_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85653-5

  • Online ISBN: 978-3-540-85654-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics