Skip to main content

Parallel Information Retrieval on an SCI-Based PC-NOW

  • Conference paper
  • First Online:
Parallel and Distributed Processing (IPDPS 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1800))

Included in the following conference series:

Abstract

This paper presents an efficient parallel information retrieval (IR) system which provides fast information service for the Internet users on low-cost high-performance PC-NOW environment. The IR system is implemented on a PC cluster based on the Scalable Coherent Interface (SCI), a powerful interconnecting mechanism for both shared memory models and message passing models. In the IR system, the inverted-index file (IIF) is partitioned into pieces using a greedy declustering algorithm and distributed to the cluster nodes to be stored on each node’s hard disk. For each incoming user’s query with multiple terms, terms are sent to the corresponding nodes which contain the relevant pieces of the IIF to be evaluated in parallel. According to the experiments, the IR system outperforms an MPI-based IR system using Fast Ethernet as an interconnect. Speed- up of up to 4.0 was obtained with an 8-node cluster in processing each query on a 500,000-document IIF.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. IEEE, “MYRINET: A GIGABIT PER SECOND LOCAL AREA NETWORK”, IEEE-Micro, Vol.15, No.1, February 1995, pp.29–36.

    Article  Google Scholar 

  2. “Active Messages: a Mechanism for Integrated Communication and Computation”, Thorsten von Eicken and David Culler, et. al., 1992.

    Google Scholar 

  3. “Fast Messages (FM): Efficient, Portable Communication for Workstation Clusters and Massively-Parallel Processors”, IEEE Concurrency, vol. 5, No. 2, April-June 1997, pp. 60–73. (Pakin, Karamcheti & Chien)

    Google Scholar 

  4. “U-Net: A User-Level Network Interface for Parallel and Distributed Computing”, Anindya Basu, Vineet Buch, Werner Vogels, Thorsten von Eicken, Proceedings of the 15th ACM Symposium on Operating Systems Principles (SOSP), Copper Mountain, Colorado, December 3–6, 1995.

    Google Scholar 

  5. http://www.myri.com/GM/doc/gm_toc.html

  6. “NUMA-Q: An SCI based Enterprise Server”, http://www.sequent.com/products/highend_srv/sci_wp1.html

  7. “SCI Interconnect Chipset and Adapter: Building Large Scale Enterprise Servers with Pentium Pro SHV Nodes”, http://www.dg.com/about/html/sci_interconnect_chipset_and_a.html

  8. S.H. Park, H.C. Kwon, “An Improved Relevance Feedback for Korean Information Retrieval System”, Proc. of the 16th IASTED International Conf. Applied Informatics, IASTED/ACTA Press, pp.65–68, Garmisch-Partenkirchen, Germany, February 23–25, 1998

    Google Scholar 

  9. Salton, G. and Buckley, C., “Improving retrieval performance by relevance feedback”, American Society for Information Science, 41,4, pp. 288–297, 1990.

    Article  Google Scholar 

  10. http://www.dolphinics.no/customer/software/linux/index.html

  11. “A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard”, http://www-unix.mcs.anl.gov/mpi/mpich/docs.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chung, SH., Kwon, HC., Ryu, K.R., Jang, HK., Kim, JH., Choi, CA. (2000). Parallel Information Retrieval on an SCI-Based PC-NOW. In: Rolim, J. (eds) Parallel and Distributed Processing. IPDPS 2000. Lecture Notes in Computer Science, vol 1800. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45591-4_9

Download citation

  • DOI: https://doi.org/10.1007/3-540-45591-4_9

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67442-9

  • Online ISBN: 978-3-540-45591-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics