Abstract
A Data Warehouse (DW) is a database that collects and stores data from multiple remote and heterogeneous information sources. When a query is posed, it is evaluated locally, without accessing the original information sources. In this paper we deal with the issue of designing a DW, in the context of the relational model, by selecting a set of views to materialize in it. Views allow to compute both the schema and the instance of the DW from the schemas and the instances of the source relations.
We briefly present a theoretical framework for the DW design problem, which concerns the selection of a set of views that (a) fits in the space allocated to the DW, (b) answers all the queries of interest, and (c) minimizes the total query evaluation and view maintenance cost. We then formalize it as a state space search problem by taking into account multiquery optimization over the maintenance queries (i.e. queries that compute changes to the materialized views) and the use of auxiliary views for reducing the view maintenance cost. Finally, incremental algorithms and heuristics for pruning the search space are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ceri, S., Widom, J.: Deriving production rules for incremental view maintenance. In: Proc. of the 20th Intl. Conf. on Very Large Data Bases, pp. 577–589 (1991)
Chaudhuri, S., Dayal, U.: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26(1), 65–74 (1997)
Finkelstein, S.: Common Expression Analysis in Database Applications. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 235–245 (1982)
Gupta, H.: Selection of Views to Materialize in a Data Warehouse. In: Intl. Conf. on Database Theory, pp. 98–112 (1997)
Gupta, H., Harinarayan, V., Rajaraman, A., Ullman, J.D.: Index Selection for OLAP. In: Proc. of the 13th Intl. Conf. on Data Engineering, pp. 208–219 (1997)
Harinarayan, V., Rajaraman, A., Ullman, J.D.: Implementing Data Cubes Efficiently. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data (1996)
Immon, W., Kelley, C.: Rdb/VMS: Developing the Data warehouse. QED Publishing Group, Boston (1993)
Labio, W., Quass, D., Adelberg, B.: Physical Database Design for Data Warehousing. In: Proc. of the 13th Intl. Conf. on Data Engineering (1997)
Levy, A., Mendelson, A.O., Sagiv, Y., Srivastava, D.: Answering Queries using Views. In: Proc. of the ACM Symp. on Principles of Database Systems, pp. 95–104 (1995)
Quass, D., Gupta, A., Mumick, I.S., Widom, J.: Making Views Self Maintainable for Data Warehousing. In: PDIS (1996)
Ross, K.A., Srivastava, D., Sudarshan, S.: Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 447–458 (1996)
Roussopoulos, N.: View Indexing in Relational Databases. ACM Transactions on Database Systems 7(2), 258–290 (1982)
Sellis, T.K.: Multiple Query Optimization. ACM Transactions on Database Systems 13(1), 23–52 (1988)
Shim, K., Sellis, T.K., Nau, D.: Improvements on a heuristic algorithm for multiple-query optimization. Data & Knowledge Engineering 12, 197–222 (1994)
Theodoratos, D., Sellis, T.: Data Warehouse Configuration. In: Proc. of the 23nd Intl. Conf. on Very Large Data Bases, pp. 126–135 (1997)
Theodoratos, D., Sellis, T.: Designing Data Warehouses. Technical Report, Knowledge and data Base Systems Laboratory, Electrical and Computer Engineering Dept., National Technical University of Athens, pp. 1–29 (1997)
Widom, J. (ed.): Data Engineering, Special Issue on Materialized Views and Data Warehousing, vol. 18(2). IEEE, Los Alamitos (1995)
Yang, J., Karlapalem, K., Li, Q.: Algorithms for Materilaized View Design in Data Warehousing Environment. In: Proc. of the 23nd Intl. Conf. on Very Large Data Bases, pp. 136–145 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Theodoratos, D., Sellis, T. (1998). Data Warehouse Schema and Instance Design. In: Ling, TW., Ram, S., Li Lee, M. (eds) Conceptual Modeling – ER ’98. ER 1998. Lecture Notes in Computer Science, vol 1507. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-49524-6_29
Download citation
DOI: https://doi.org/10.1007/978-3-540-49524-6_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65189-5
Online ISBN: 978-3-540-49524-6
eBook Packages: Springer Book Archive