Abstract
Because Web documents grow rapidly in number and change frequently, maintaining an up-to-date index for search engines is becoming a challenge. Traditional crawling methods can no longer keep up with the constantly updating and growing Web. To address this problem, in this paper we propose an alternative distributed crawling method based on mobile agents. Our goal is a scalable crawling scheme that minimizes network utilization, keeps up with document changes, employs time realization, and is easily upgradeable.
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Papapetrou, O., Papastavrou, S., Samaras, G. (2003). UCYMICRA: Distributed Indexing of the Web Using Migrating Crawlers. In: Kalinichenko, L., Manthey, R., Thalheim, B., Wloka, U. (eds) Advances in Databases and Information Systems. ADBIS 2003. Lecture Notes in Computer Science, vol 2798. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39403-7_12
DOI: https://doi.org/10.1007/978-3-540-39403-7_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20047-5
Online ISBN: 978-3-540-39403-7
eBook Packages: Springer Book Archive