Skip to main content

Web Pages Reordering and Clustering Based on Web Patterns

  • Conference paper
SOFSEM 2008: Theory and Practice of Computer Science (SOFSEM 2008)

Abstract

In this paper was proposed a method for the description of web pages using web patterns. We will explain what we mean by the term ”web pattern”. We will present a taxonomy web patterns and a description of some their types. In the description of web patterns we will focus on properties which are useful for automatic detection on web pages. As a result of the detection we get a description of a web page using found web patterns. The description can be used for reordering and clustering of a web page set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alexander, Ch.: A Pattern Language: Towns, Buildings, Construction. Oxford University Press, New York (1977)

    Google Scholar 

  2. Chakrabarti, S.: Mining the Web: Discovering Knowledge from Hypertext Data. Morgan Kaufmann Publishers, San Francisco (2003)

    Google Scholar 

  3. Chang, Ch.H., Kayed, M., Girgis, M.R., Shaalan, K.F.: A Survey of Web Information Extraction Systems. IEEE Transactions on Knowledge and Data Engineering 18(10), 1411–1428 (2006)

    Article  Google Scholar 

  4. Dearden, Finlay, J.: Pattern Languages in HCI: A critical review. Human-Computer Interaction 21(1), 49–102 (2006)

    Article  Google Scholar 

  5. Dong, J., Zhao, Y.: xperiments on Design Pattern Discovery. In: PROMISE 2007. Third International Workshop on Predictor Models in Software Engineering, p. 12 (2007)

    Google Scholar 

  6. Van Duyne, D.K., Landay, J.A., Hong, J.I.: The Design of Sites: Patterns, Principles, and Processes for Crafting a Customer-Centered Web Experience. Pearson Education (2002)

    Google Scholar 

  7. Ivory, M.Y., Megraw, R.: Evolution of Web Site Design Patterns. ACM Transactions on Information Systems 23(4), 463–497 (2005)

    Article  Google Scholar 

  8. Kiyavitskaya, N., Zeni, N., Cordy, J.R., Mich, L., Mylopoulos, J.: Text Mining Through Semi Automatic Semantic Annotation. In: PAKM 2006, pp. 143–154 (2006)

    Google Scholar 

  9. Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2006)

    Google Scholar 

  10. Kudělka, M., Snášel, V., Lehečka, O., El-Qawasmeh, E.: Semantic Analysis of Web Pages Using Web Patterns. In: WI 2006. International Conference on Web Intelligence, Hong Kong, pp. 329–333 (2006)

    Google Scholar 

  11. Kočibova, J., Klos, K., Lehečka, O., Kudělka, M., Snášel, V.: Web Page Analysis: Experiments Based On Discussion and Purchase Web Patterns. In: WI 2006. International Conference on Web Intelligence, Silicon Valley, CA, USA, pp. 221–225 (2007)

    Google Scholar 

  12. Nie, Z., Wen, J-R., Ma, W-Y.: Object-level Vertical Search. In: CIDR 2007, Asilomar, CA, USA, pp. 235–246 (2007)

    Google Scholar 

  13. Nie, Z., Ma, Y., Shi, S., Wen, J-R., Ma, W-Y.: Web Object Retrieval. In: WWW 2007, pp. 81–90 (2007)

    Google Scholar 

  14. Pivk, A.: Automatic Ontology Generation from Web Tabular Structures, PhD thesis, University of Maribor (2005)

    Google Scholar 

  15. Reis, D.C., Golgher, P.B., Silva, A.S., Laender, A.F.: Automatic web news extraction using tree edit distance. In: WWW 2004, pp. 502–511. ACM Press, New York (2004)

    Chapter  Google Scholar 

  16. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18(11), 613–620 (1975)

    Article  MATH  Google Scholar 

  17. Snášel, V., Řezanková, H., Húsek, D., Kudělka, M., Lehečka, O.: Semantic Analysis of Web Pages Using Cluster Analysis and Nonnegative Matrix Factorization. In: AWIC 2007, Fontainebleau, France, pp. 328–336. Springer, Heidelberg (2007)

    Google Scholar 

  18. Snášel, V.: GUI Patterns and Web Semantics. In: CISIM 2007, pp. 14–19. IEEE, Elk, Poland (2007)

    Google Scholar 

  19. Tidwell, J.: Designing Interfaces: Patterns for Effective Interaction Design. O’Reilly Media, Inc. (2006)

    Google Scholar 

  20. Tsantalis, N., Chatzigeorgiou, A., Stephanides, G., Halkidis, S.T.: Design Pattern Detection Using Similarity Scoring. IEEE Transactions on Software Engineering 32(11), 896–909 (2006)

    Article  Google Scholar 

  21. Yu, S., Cai, D., Wen, J-R., Ma, W-Y.: Improving Pseudo-Relevance Feedback in Web Information retrieval Using Web Page Segmentation. In: World Wide Web conference (WWW 2003), Hungary, pp. 203–211 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Viliam Geffert Juhani Karhumäki Alberto Bertoni Bart Preneel Pavol Návrat Mária Bieliková

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kudělka, M., Snášel, V., Lehečka, O., El-Qawasmeh, E., Pokorný, J. (2008). Web Pages Reordering and Clustering Based on Web Patterns. In: Geffert, V., Karhumäki, J., Bertoni, A., Preneel, B., Návrat, P., Bieliková, M. (eds) SOFSEM 2008: Theory and Practice of Computer Science. SOFSEM 2008. Lecture Notes in Computer Science, vol 4910. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77566-9_63

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77566-9_63

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77565-2

  • Online ISBN: 978-3-540-77566-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics