Skip to main content

Dynamic Source Selection in Large Scale Mediation Systems

  • Conference paper
Data Management in Grid and Peer-to-Peer Systems (Globe 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5187))

Included in the following conference series:

Abstract

This paper proposes ORS, an original strategy to reduce the number of data sources to access during query evaluation in large scale mediation systems. ORS proceeds first selecting sources using extensional (data) information to discard useless sources and then validates the intentional (schema) information that each one is able to provide. The first step is based on location queries on some ”well chosen” data sources, that previously had made a consolidation integration effort. This paper proposes ORS to improve querying semantic virtual objects whose instances are distributed across numerous data sources. Cost analysis and implementation in a grid context are also presented.

This research is supported by Ecos-Colciencias C06M02/C07M02.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the grid: Enabling scalable virtual organizations. Int. J. High Perform. Comput. Appl. 15, 200–222 (2001)

    Article  Google Scholar 

  2. Roth, M., Schwarz, P.: A wrapper architecture for legacy data sources. In: Proc. vldb conference (1997)

    Google Scholar 

  3. Tomasic, A., Raschid, L., Valduriez, P.: Scaling access to heterogeneous data sources with DISCO. Knowledge and Data Engineering 10, 808–823 (1998)

    Article  Google Scholar 

  4. Kossmann, D.: The state of the art in distributed query processing. ACM Comput. Surv. 32, 422–469 (2000)

    Article  Google Scholar 

  5. Melnik, S., et al.: A mediation infrastructure for digital library services. In: DL 2000: Fifth ACM conference on Digital libraries, pp. 123–132. ACM Press, New York (2000)

    Chapter  Google Scholar 

  6. Bruno, G., Collet, C., Vargas-Solar, G.: Configuring intelligent mediators using ontologies. In: EDBT Workshops, pp. 554–572 (2006)

    Google Scholar 

  7. Bleiholder, J., et al.: Query planning in the presence of overlapping sources. In: Grust, T., Höpfner, H., Illarramendi, A., Jablonski, S., Mesiti, M., Müller, S., Patranjan, P.-L., Sattler, K.-U., Spiliopoulou, M., Wijsen, J. (eds.) EDBT 2006. LNCS, vol. 4254, pp. 811–828. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  8. Yu, B., et al.: Effective keyword-based selection of relational databases. In: SIGMOD 2007, pp. 139–150. ACM, New York (2007)

    Chapter  Google Scholar 

  9. Wiederhold, G.: Mediators in the architecture of future information systems. Computer 25, 38–49 (1992)

    Article  Google Scholar 

  10. Garcia-Molina, H., et al.: The tsimmis approach to mediation: Data models and languages. Journal of Intelligent Information Systems 8, 117–132 (1997)

    Article  Google Scholar 

  11. Stevens, R., et al.: Tambis: Transparent access to multiple bioinformatics information sources. Bioinformatics 16, 184–186 (2000)

    Article  Google Scholar 

  12. Doan, A., et al.: Reconciling schemas of disparate data sources: a machine-learning approach. In: SIGMOD 2001, pp. 509–520. ACM Press, New York (2001)

    Chapter  Google Scholar 

  13. Gounaris, A., et al.: A service-oriented system to support data integration on data grids. In: CCGRID 2007, pp. 627–635. IEEE Computer Society, Los Alamitos (2007)

    Google Scholar 

  14. Wang, Q., Chen, J., Gao, X., Zhou, W., Yan, B.: A new architecture of data access middleware under grid environment. apscc 0, 384–391 (2006)

    Google Scholar 

  15. Liu, J., Wu, Y., Zheng, W.: Grid enabled data integration framework for bioinformatics research. In: GCC Workshops, pp. 401–406 (2006)

    Google Scholar 

  16. Wöhrer, A., Brezany, P., Tjoa, A.M.: Novel mediator architectures for grid information systems. Future Gener. Comput. Syst. 21, 107–114 (2005)

    Article  Google Scholar 

  17. Wenlong, H., et al.: Data model and virtual database engine for grid environment. In: GCC 2007, pp. 823–829. IEEE Computer Society, Los Alamitos (2007)

    Google Scholar 

  18. Apers, P.M.G., Hevner, A.R., Yao, S.B.: Optimization algorithms for distributed queries, 262–273 (1986)

    Google Scholar 

  19. Haraty, R.A., Fany, R.C.: Query acceleration in distributed database systems. Revista Comlombiana de Computación 2, 19–34 (2001)

    Google Scholar 

  20. Levy, A.Y., et al.: Querying heterogeneous information sources using source descriptions. In: VLDB, Bombay, India, VLDB Endowment, pp. 251–262 (1996)

    Google Scholar 

  21. Duschka, O.M., Genesereth, M.R.: Query planning in infomaster. In: Selected Areas in Cryptography, pp. 109–111 (1997)

    Google Scholar 

  22. Nie, T., et al.: Sla-based data integration on database grids. In: COMPSAC (2), pp. 613–618 (2007)

    Google Scholar 

  23. Liu, C., Chen, H.: A hash partition strategy for distributed query processing. In: Extending Database Technology, pp. 373–387 (1996)

    Google Scholar 

  24. Gounaris, A., et al.: A novel approach to resource scheduling for parallel query processing on computational grids. Distrib. Parallel Databases 19, 87–106 (2006)

    Article  Google Scholar 

  25. Eric Prud’hommeaux, A.S.: Sparql query language for rdf (2007), http://www.w3.org/tr/rdf-sparql-query/

  26. Verdier, C.: Health information systems: from local to pervasive medical data. Santé et Systémique 9, 87–108 (2006)

    Google Scholar 

  27. Berlin, F.U.: The d2rq plattform (2007), http://sites.wiwiss.fu-berlin.de/suhl/bizer/d2rq/

  28. d’Orazio, L., et al.: Distributed semantic caching in grid middleware. In: Wagner, R., Revell, N., Pernul, G. (eds.) DEXA 2007. LNCS, vol. 4653, pp. 162–171. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Abdelkader Hameurlain

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pomares, A., Roncancio, C., Abásolo, J., Villamil, P. (2008). Dynamic Source Selection in Large Scale Mediation Systems. In: Hameurlain, A. (eds) Data Management in Grid and Peer-to-Peer Systems. Globe 2008. Lecture Notes in Computer Science, vol 5187. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85176-9_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85176-9_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85175-2

  • Online ISBN: 978-3-540-85176-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics