Skip to main content

Distributed Data Mining by Means of SQL Enhancement

  • Conference paper
On the Move to Meaningful Internet Systems: OTM 2008 Workshops (OTM 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5333))

Abstract

An analysis of a huge amount of information is feasible only if information systems are used. First, information needs to be accumulated and stored in a persistent structure enabling effective data access and management. The main aspects of nowadays data processing are: storing data in (mostly relational) databases, improving data processing efficiency by parallel analysis [1], distributed processing (necessary for institution consisting of autonomous, geographically distributed departments), query languages (SQL) remain a fundamental way to access data in databases, data analysis often includes data mining (building data models describing data characteristics or predicting some features) [2].

Regarding the above mentioned circumstances authors propose an enhancement of SQL for data mining of a distributed data structure. Basic assumption is a complete, horizontal data fragmentation and an explicit model format. Building global data model consists of two stages. In the first one, local models are built in a parallel manner. Second one consists of combining these models into a global data picture. Detailed description of combining methods regarding global classification models authors presented in [3].

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ullman, J.D., Widom, J.: A First Course in Database Systems. Prentice-Hall, Inc., Englewood Cliffs (1997)

    Google Scholar 

  2. Hand, D., Mannila, H., Smyth, P.: Principles of Data Mining. The MIT Press, Cambridge (2001)

    Google Scholar 

  3. Gorawski, M., Pluciennik, E.: Analytical Models Combining Methodology with Classification Model Example. In: 1st IEEE International Conference on Information Technology, Gdansk, Poland (2008)

    Google Scholar 

  4. International Organization for Standardization (ISO). Information Technology, Database Language, SQL Multimedia and Application Packages, Part 6: Data Mining Draft Standard No. ISO/IEC 13249-6 (2003)

    Google Scholar 

  5. Han, J., Fu, Y., Wang, W., Koperski, K., Zaiane, O.: DMQL: A Data Mining Query Language for Relational Database. In: Proc. Of SIGMOD Workshop DMKD, Montreal, Canada (1996)

    Google Scholar 

  6. Imieliński, T., Virmani, A.: MSQL: A Query Language for Database Mining. Data Mining and Knowledge Discovery (1999)

    Google Scholar 

  7. Meo, R., Psaila, G., Ceri, S.: An Extention to SQL for Mining Association Rules. Data Mining and Knowledge Discovery (1998)

    Google Scholar 

  8. Morzy, T., Zakrzewicz, M.: SQL-like language for database mining. In: Proc. of the First East-European, Symposium on Advances in Databases and Information Systems - ADBIS, St. Petersburg (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gorawski, M., Pluciennik, E. (2008). Distributed Data Mining by Means of SQL Enhancement. In: Meersman, R., Tari, Z., Herrero, P. (eds) On the Move to Meaningful Internet Systems: OTM 2008 Workshops. OTM 2008. Lecture Notes in Computer Science, vol 5333. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88875-8_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-88875-8_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88874-1

  • Online ISBN: 978-3-540-88875-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics