Instances Navigation for Querying Integrated Data from Web-Sites

  • Conference paper
Web Information Systems and Technologies

Abstract

Research on data integration has produced a rich set of well-understood schema mediation languages and systems. These provide a meta-data representation of the modeled real world but, in general, do not deal with data instances.

Such meta-data are necessary for querying the classes that result from an integration process: the end user typically does not know the contents of these classes and simply defines queries on the basis of class and attribute names.

In this paper we introduce an approach that enriches the description of selected attributes by specifying, as meta-data, a list of the “relevant values” for those attributes. Furthermore, relevant values may be organized hierarchically in a taxonomy. In this way, the user may exploit the new meta-data in the interactive process of creating or refining a query. The same meta-data are also exploited by the system in the query rewriting/unfolding process in order to filter the results shown to the user.
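To make the idea concrete, the following is a minimal illustrative sketch, not the method implemented in the paper. It assumes a taxonomy of relevant values stored as a small tree, and shows how choosing one node could expand into the set of values used to filter query results. The names `ValueNode`, `values_under`, `filter_rows`, the `sector` attribute, and the sample data are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ValueNode:
    """A relevant value; children are more specific relevant values (assumed model)."""
    name: str
    children: list["ValueNode"] = field(default_factory=list)


def find(node, name):
    """Return the node called `name` in the subtree rooted at `node`, or None."""
    if node.name == name:
        return node
    for child in node.children:
        hit = find(child, name)
        if hit is not None:
            return hit
    return None


def values_under(node):
    """Collect the node's own value and every value below it in the taxonomy."""
    values = {node.name}
    for child in node.children:
        values |= values_under(child)
    return values


def filter_rows(rows, attribute, taxonomy, chosen):
    """Keep rows whose `attribute` equals the chosen value or one of its descendants."""
    node = find(taxonomy, chosen)
    if node is None:
        return []
    allowed = values_under(node)
    return [row for row in rows if row.get(attribute) in allowed]


# Hypothetical taxonomy of relevant values for a "sector" attribute.
taxonomy = ValueNode("any", [
    ValueNode("textile", [ValueNode("cotton"), ValueNode("wool")]),
    ValueNode("metal", [ValueNode("steel")]),
])

# Hypothetical instances of an integrated class.
rows = [
    {"company": "A", "sector": "cotton"},
    {"company": "B", "sector": "steel"},
    {"company": "C", "sector": "wool"},
]

selected = filter_rows(rows, "sector", taxonomy, "textile")
print([row["company"] for row in selected])
```

In this sketch, selecting the broader value `textile` retrieves the instances tagged with the more specific values `cotton` and `wool`, which is one plausible way a taxonomy of relevant values could support query refinement.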

We evaluated the strategy in an e-business context within the EU-IST SEWASIE project. The evaluation showed that the approach is practicable for large value instances.

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Beneventano, D., Bergamaschi, S., Bruschi, S., Guerra, F., Orsini, M., Vincini, M. (2007). Instances Navigation for Querying Integrated Data from Web-Sites. In: Filipe, J., Cordeiro, J., Pedrosa, V. (eds) Web Information Systems and Technologies. Lecture Notes in Business Information Processing, vol 1. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74063-6_10

  • DOI: https://doi.org/10.1007/978-3-540-74063-6_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74062-9

  • Online ISBN: 978-3-540-74063-6

  • eBook Packages: Computer Science, Computer Science (R0)
