Abstract
Assuming an adaptation of Suppes’s analysis of causality, we show that multiple regression methods are fundamentally incorrect procedures for identifying causes. This is because when regressors are correlated the existence of an unmeasured common cause of regressor X i and outcome variable Y may bias estimates of the influence of other regressors X k;, variables having no influence on Y whatsoever may thereby be given significant regression coefficients. The bias may be quite large. Simulation studies show that standard regression model specification procedures make the same error. The strategy of regressing on a larger set of variables and checking stability may compound rather than remedy the problem. A similar difficulty in the estimation of the influence of other regressors arises if some X i is an effect rather than a cause of Y. The problem appears endemic in uses of multiple regression on uncontrolled variables, and unless somehow corrected appears to invalidate many scientific uses of regression methods. We describe an implementation in the TETRAD II program of a model specification algorithm that avoids these and certain other errors in large samples. We illustrate the TETRAD II algorithm by applying it to a number of real and simulated data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cooper, G. and Herskovits, E.: 1992, ‘A Bayesian Method for the Induction of Probabilistic Networks from Data’, Machine Learning (to appear).
Fox, J.: 1984, Linear Statistical Models and Related Methods, Wiley, New York.
Glymour, C., Spirtes P., and Scheines, R.: 1991b, ‘From Probability to Causality’, Philosophical Studies, 64(1), 1–36.
Linthurst, R. A.: 1979, ‘Aeration, Nitrogen, pH and Salinity as Factors Affecting Spartina Alterniflora Growth and Dieback’, Ph.D. thesis, North Carolina State University.
Mosteller, F. and Tukey, J.: 1977, Data Analysis and Regression, A Second Course in Regression, Addison-Wesley, Massachusetts.
Pratt, J. and Schlaifer, R.: 1988, ‘On the Interpretation and Observation of Laws’, Journal of Econometrics, 39, 23–52.
Rawlings, J.: 1988, Applied Regression Analysis, Wadsworth, Belmont, CA.
Spirtes, P.: 1992, ‘Building Causal Graphs from Statistical Data in the Presence of Latent Variables’, forthcoming in: B. Skyrms (Ed.), Proceedings of the IX International Congress on Logic, Methodology, and the Philosophy of Science, Uppsala, Sweden, 1991.
Spirtes, P., Glymour, C., Scheines, R., and Sorensen, S.: 1990, ‘TETRAD Studies of Data for Naval Air Traffic Controller Trainees’, Report to the Navy Personnel Research Development Center, San Diego, CA.
Spirtes, P., Glymour, C., and Scheines, R.: 1990, ‘Causality from Probability’, in: J. Tiles et al. (Eds.), Evolving Knowledge in Natural Science and Artificial Intelligence, Pitman, London, pp. 181–199.
Spirtes, P., Glymour, C., and Scheines, R.: 1991, ‘An Algorithm for Fast Recovery of Sparse Causal Graphs’, Social Science Computer Review, 9, 62–72.
Spirtes, P., Glymour, C., and Scheines, R.: 1993, Causation, Prediction and Search, Springer, New York.
Suppes, P.: 1970, A Probabilistic Theory of Causality, North-Holland, Amsterdam.
Timberlake, M. and Williams, K.: 1984, ‘Dependence, Political Exclusion, and Government Repression: Some Cross-National Evidence’, American Sociological Review, 49, 141–146.
Verma, T. and Pearl, J.: 1990b, ‘Equivalence and Synthesis of Causal Models’, Proc. Sixth Conference on Uncertainty in AI, Association for Uncertainty in AI, Inc., Mountain View, CA, pp. 220–227.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1994 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Glymour, C., Spirtes, P., Scheines, R. (1994). In Place of Regression. In: Humphreys, P. (eds) Patrick Suppes: Scientific Philosopher. Synthese Library, vol 234. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-0774-7_13
Download citation
DOI: https://doi.org/10.1007/978-94-011-0774-7_13
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-4331-1
Online ISBN: 978-94-011-0774-7
eBook Packages: Springer Book Archive