Abstract
Statistical databases have traditionally been stored as flat files approximating relations. We propose that by storing statistical data in an object oriented type database, enhanced with knowledge of statistical theory, a more natural and powerful interface to statistical data can be created.
A formalism is proposed for dealing with and combining data that have random components by makingstatistics first class citizens in the database world. Entities in the databases are classified according to whether they areobservations, orstatistics.Estimates are a special type ofstatistics which aremoored toobservation entities.Statistics entities are classified by their statistical properties. A hierarchical structure of random features is provided, with distributions at its leaves. This structure is a DAG (Directed Acyclic Graph), which may be extended or redefined for different applications and contains information used to compare and manipulatestatistics.
Similar content being viewed by others
References
SAS Institute Inc.,SAS Users Guide (Raleigh NC, 1984).
R.A. Becker, J.M. Chambers and A.R. Wilks,The New S Language (Wadsworth and Brooks/Cole, 1988).
H. Boral, D.J. De Witt and D. Bates, A framework for research in database management for statistical analysis, Computer Science Technical Report no. 465, Madison, University of Wisconsin CS Dept. (Feb. 1982).
A. Borgida, Features of languages for the development of information systems at the conceptual level, Rutgers University Technical Report, LCSR-TR-52 (1983).
M.L. Brodie and J. Mylopoulos, AI and databases: semantic vs computational theories of information, in:New Directions for Database Systems (Ablex Publ. Co., 1985).
N.C. Rowe, Rule based statistical calculations on a database abstract, in:Proc. 1st LBL Workshop on Statistical Database Management, LBL, ed. H.T.K. Wong (March 1982).
D. Lubinsky and D. Pregibon, Data analysis as search, J. Econometrics 38 (1988) 247–268.
W.A. Gale, Student Phase 1 — A report on work in progress, in:Artificial Intelligence and Statistics, ed. W.A. Gale (Addison-Wesley, 1987).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Lubinsky, D.J. Integrating statistical theory with statistical databases. Ann Math Artif Intell 2, 245–259 (1990). https://doi.org/10.1007/BF01531010
Issue Date:
DOI: https://doi.org/10.1007/BF01531010