KNOWLEDGE GRID: High Performance Knowledge Discovery Services on the Grid

Cannataro, Mario; Talia, Domenico; Trunfio, Paolo

doi:10.1007/3-540-45644-9_5

Mario Cannataro²,
Domenico Talia³ &
Paolo Trunfio²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2242))

Included in the following conference series:

International Workshop on Grid Computing

332 Accesses
19 Citations

Abstract

Knowledge discovery tools and techniques are used in an increasing number of scientific and commercial areas for the analysis of large data sets. When large data repositories are coupled with geographic distribution of data, users and systems, it is necessary to combine different technologies for implementing high-performance distributed knowledge discovery systems. On the other hand, computational grid is emerging as a very promising infrastructure for high-performance distributed computing. In this paper we introduce a software architecture for parallel and distributed knowledge discovery (PDKD) systems that is built on top of computational grid services that provide dependable, consistent, and pervasive access to high-end computational resources. The proposed architecture uses the grid services and defines a set of additional layers to implement the services of distributed knowledge discovery process on grid-connected sequential or parallel computers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chattratichat J., Darlington J., Guo Y., Hedvall S., Koler M. and Syed J., An architecture for distributed enterprise data mining. HPCN Europe 1999, Lecture Notes in Computer Science, 1593, 1999, pp. 573–582.
Google Scholar
Chervenak A., Foster I., Kesselman C, Salisbury C. and Tuecke S., The Data Grid: towards an architecture for the distributed management and analysis of large scientific data sets. Journal of Network and Computer Appls, 2001.
Google Scholar
Fayyad U.M. and Uthurusamy R. (eds.), Data mining and knowledge discovery in databases. Communications of the ACM 39, 1997.
Google Scholar
Foster I. and Kesselman C, Globus: a metacomputing infrastructure toolkit. International Journal of Supercomputing Applications11, 1997, pp. 115–128.
Google Scholar
Freitas A.A. and Lavington S.H., Mining Very Large Databases with Parallel Processing, Kluwer Academic Publishers, 1998.
Google Scholar
Grimshaw A.S., Ferrari A., Knabe F., and Humphrey M., Wide-area computing: resource sharing on a large scale. Computer32, 1999, pp. 29–37.
Article Google Scholar
Grossman R., Bailey S., Kasif S., Mon D., Ramu A. and Malhi B., The preliminary design of papyrus: a system for high performance, distributed data mining over clusters, meta-clusters and super-clusters. International KDD’98 Conference, 1998, pp. 37–43.
Google Scholar
Kargupta H., Park B., Hershberger, D. and Johnson, E., Collective data mining: a new perspective toward distributed data mining. In H. Kargupta and P. Chan (eds.) Advances in Distributed and Parallel Knowledge Discovery, AAAI Press 1999.
Google Scholar
Kimm H. and Ryu T.-W., A framework for distributed knowledge discovery system over heterogeneous networks using CORBA. KDD2000 Workshop on Distributed and Parallel Knowledge Discovery, 2000.
Google Scholar
D. Foti, D. Lipari, C. Pizzuti, D. Talia, “Scalable Parallel Clustering for Data Mining on Multicomputers”, Proc. of the 3rd Int. Workshop on High Performance Data Mining HPDM00-1PDPS, LNCS, Springer-Verlag, Cancun, Mexico, May 2000.
Book Google Scholar
Moore R., Baru C, Marciano R., Rajasekar A. and Wan M., Data-intensive computing. In I. Foster and C. Kesselman (eds.) The Grid: Blueprint for a Future Computing Inf., Morgan Kaufmann Publishers, 1999, pp. 105–129.
Google Scholar
Rana O.F., Walker D.W., Li M., Lynden S. and Ward M., PaDDMAS: parallel and distributed data mining application suite. Proc. International Parallel and Distributed Processing Symposium (IPDPS/SPDP), IEEE Computer Society Press, 2000, pp. 387–392.
Google Scholar
Stolfo S.J., Prodromidis A.L., Tselepis S., Lee W., Fan D.W., Chan P.K., JAM: Java agents for meta-learning over distributed databases. International KDD’97 Conference, 1997, pp. 74–81.
Google Scholar

Download references

Author information

Authors and Affiliations

ISI-CNR, Via P. Bucci, cubo 41-C, 87036, Rende (CS), Italy
Mario Cannataro & Paolo Trunfio
DEIS, Universita della Calabria, Via P. Bucci, cubo 41-C, 87036, Rende (CS), Italy
Domenico Talia

Authors

Mario Cannataro
View author publications
You can also search for this author in PubMed Google Scholar
Domenico Talia
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Trunfio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

High Performance Computing, The Aerospace Corp., 2350 East El Segundo Blvd., 90245, El Segundo, CA, USA
Craig A. Lee

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cannataro, M., Talia, D., Trunfio, P. (2001). KNOWLEDGE GRID: High Performance Knowledge Discovery Services on the Grid. In: Lee, C.A. (eds) Grid Computing — GRID 2001. GRID 2001. Lecture Notes in Computer Science, vol 2242. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45644-9_5

Download citation

DOI: https://doi.org/10.1007/3-540-45644-9_5
Published: 14 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42949-4
Online ISBN: 978-3-540-45644-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics