Improving Performance in Constructing specific Web Directory using Focused Crawler: An Experiment on Botany Domain

Khalilian, Madjid; Boroujeni, Farsad Zamani; Mustapha, Norwati

doi:10.1007/978-90-481-3660-5_79

Abstract

The growth of the web has made it difficult to search and browse for useful information, especially in specific domains. At the same time, some portions of the web remain largely underdeveloped, as shown by a lack of high-quality content. An example is the botany domain, in which the lack of well-structured web directories has limited users' ability to browse for the information they need. In this research we propose an improved framework for constructing a domain-specific web directory. The framework uses an anchor directory as the foundation for a primary web directory, which is then completed with information gathered by an automatic component and filtered by experts. We conduct an experiment to evaluate effectiveness, efficiency and satisfaction.

Author information

Authors and Affiliations

Faculty of Computer Science and Information Technology (FSKTM), Universiti Putra Malaysia (UPM)
Madjid Khalilian, Farsad Zamani Boroujeni & Norwati Mustapha

Editor information

Editors and Affiliations

School of Engineering, University of Bridgeport, University Avenue 221, Bridgeport, 06604, U.S.A.
Khaled Elleithy

About this paper

Cite this paper

Khalilian, M., Boroujeni, F.Z., Mustapha, N. (2010). Improving Performance in Constructing specific Web Directory using Focused Crawler: An Experiment on Botany Domain. In: Elleithy, K. (eds) Advanced Techniques in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-3660-5_79

DOI: https://doi.org/10.1007/978-90-481-3660-5_79
Published: 15 December 2009
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-3659-9
Online ISBN: 978-90-481-3660-5
eBook Packages: Computer Science, Computer Science (R0)
