Automatic smoothing parameter selection: A survey

Marron, J. S.

doi:10.1007/BF01972448

Automatic smoothing parameter selection: A survey

Published: September 1988

Volume 13, pages 187–208, (1988)
Cite this article

Empirical Economics Aims and scope Submit manuscript

J. S. Marron¹

352 Accesses
110 Citations
Explore all metrics

Abstract

This is a survey of recent developments in smoothing parameter selection for curve estimation. The first goal of this paper is to provide an introduction to the methods available, with discussion at both a practical and also a nontechnical level, including comparison of methods. The second goal is to provide access to the literature, especially on smoothing parameter selection, but also on curve estimation in general. The two main settings considered here are nonparametric regression and probability density estimation, although the points made apply to other settings as well. These points also apply to many different estimators, although the focus is on kernel estimators, because they are the most easily understood and motivated, and have been at the heart of the development in the field.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bean SJ Tsokos CP (1980) Developments in nonparametric density estimation. International Statistical Review 48:267–287
Google Scholar
Bhattacharya PK, Mack KP (1987) Weak convergence ofk-NN density and regression estimators with varyingk and applications. Annals of Statistics 15:976–994
Google Scholar
Bierens HJ (1987) Kernel estimation of regression function. In: Bewley TF (ed) Advances in econometrics. Canbridge University Press, New York; pp 99–144
Google Scholar
Bowman A (1984) An alternative method of cross-validation for the smoothing of density estimates. Biometrika 65:521–528
Google Scholar
Burman P (1985) A data dependent approach to density estimation. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 69:609–628
Google Scholar
Burman P (1988) Estimation of the optimal transformations usingv-fold cross-validation and repeated learning testing methods. Unpublished manuscript
Chow YS, Geman S, Wu LD (1983) Consistent cross-validated density estimation. Annals of Statistics 11:25–38
Google Scholar
Clark RM (1975) A calibration curve for radio carbon dates. Antiquity 49:251–266
Google Scholar
Collomb G (1981) Estimation non parametrique de la regression: revue. International Statistical Review 49:75–93
Google Scholar
Collomb G (1985) Nonparametric regressio: an up-to-date bibliography. Statistics 16:309–324
Google Scholar
Craven P, Wahba G (1979) Smoothing noisy data with spline functions. Numerische Mathematik 31:377–403
Google Scholar
Devroye L, Györfi L (1984) Nonparametric density estimation: TheL ₁ view. Wiley, New York
Google Scholar
Devroye L (1987) A course in density estimation. Birkhauser, Boston
Google Scholar
Dodge Y (1986) Some difficulties involving nonparametric estimation of a density function. Journal of Official Statistics 2:193–202
Google Scholar
Duin RPW (1976) On the choice of smoothing parameters of Parzen estimators of probability density functions. IEEE Transactions on Computers C-25:1175–1179
Google Scholar
Eagleson GK, Buckley MJ (1987) Estimating the variance in nonparametric regression. Unpublished manuscript
Eubank R (1988) Spline smoothing and nonparametric regression. Wiley, New York
Google Scholar
Ferguson TS (1967) Mathematical statistics, a decision theoretic approach. Academic Press, New York
Google Scholar
Fryer MJ (1977) A review of some non-parametric methods of density estimation. Journal of the Instiute of Mathematics and its Applications 20:335–354
Google Scholar
Gasser T, Sroka L, Jennen C (1986) Residuals variance and residual pattern in nonlinear regression. Biometrika 73:625–633
Google Scholar
Habbema JDF, Hermans J, van den Broek K (1984) A stepwise discrimination analysis program using density estimation. Compstat 1974: Proceedings in Computational Statistics. Physica Verlag, Vienna, pp 101–110
Google Scholar
Härdle W (1988) Applied nonparametric regression
Härdle W, Marron JS (1985a) Optimal bandwidth selection in nonparametric regression function estimation. Annals of Statistics 12:1465–1481
Google Scholar
Härdle W, Marron JS (1985b) Asymptotic nonequivalence of some bandwidth selectors in nonparametric regression. Biometrika 72:481–484
Google Scholar
Härdle W, Hall P, Marron JS (1988) How far are automatically chosen regression smoothers from their optimum? Journal of the American Statistical Association 83:86–101, with discussion
Google Scholar
Härdle W, Marron JS, Wand MP (1988) Bandwidth choice for density derivatives. Unpublished manuscript
Hall P (1982) Cross-validation in density estimation. Biometrika 69:383–390
Google Scholar
Hall P (1983) Large sample optimality of least square cross-validation in density estimation, Annals of Statistics 11:1156–1174
Google Scholar
Hall P (1985) Asymptotic theory of minimum integrated square error for multivariate density estimation. Proceedings of the Sixth International Symposium on Multivariate Analysis at Pittsburgh, 25–29
Hall P (1987a) On the estimation of probability densities using compactly supported kernels. Journal of Multivariate Analysis 23:131–158
Google Scholar
Hall P (1987b) On Kullback-Leibler loss and density estimation. Annals of Statistics 15:1491–1519
Google Scholar
Hall P, Marron JS (1987a) Extent to which least-squares cross-validation minimises integrated square error in nonparametric density estimation. Probability Theory and Related Fields 74:567–581
Google Scholar
Hall P, Marron JS (1987b) On the amount of noise inherent in bandwidth selection for a kernel density estimator. Annals of Statistics 15:163–181
Google Scholar
Hall P, Marron JS (1987c) Estimation of integrated squared density derivatives. Statistics and Probability Letters 6:109–115
Google Scholar
Hall P, Wand M (1988) On the minimization of absolute distance in kernel density estimation. To appear in Statistics and Probability Letters
Hall P, Wand M (1989) MinimizingL1 distance in nonparametric density estimation. To appear in Journal of Multivariate Analysis
Kappenman RF (1987) A nonparametric data based univariate function estimate. Computational Statistics and Data Analysis 5:1–7
Google Scholar
Kendall MS (1976) Time Series. Griffin, London
Google Scholar
Krieger AM, Pickands J (1981) Weak convergence and efficient density estimation at a point. Annals of Statistics 9:1066–1078
Google Scholar
Li KC, Hwang J (1984) The data smoothing aspects of Stein estimates. Annals of Statistics 12: 887–897
Google Scholar
Li KC (1985) From Stein's unbiased risk estimates to the method of generalized cross-validation. Annals of Statistics 13:1352–1377
Google Scholar
Li KC (1987) Asymptotic optimality forC _p, C_L, cross-validation and generalized cross-validation: discrete index set. Annals of Statistics 15:958–975
Google Scholar
Mallows CL (1973) Some comments onC _p, Technometrics 15:661–675
Google Scholar
Marron JS (1985) An asymptotically efficient solution to the bandwidth problem of kernel density estimation. Annals of Statistics 13:1011–1023
Google Scholar
Marron JS (1986) Will the art of smoothing ever become a science? Marron JS (ed) Function estimates. American Mathematical Society Series: Contemporary Mathematics 9:169–178
Marron JS (1987a) A comparison of cross-validation techniques in density estimation. Annals of Statistics 15:152–162
Google Scholar
Marron JS (1987b) What does optimal bandwidth selection mean for nonparametric regression estimation. Dodge Y (ed) Statistical data analysis based on theL ¹ norm and related methods. North Holland, Amsterdam
Google Scholar
Marron JS (1987c) Partitioned cross-validation. North Carolina Institute of Statistics, Mimeo Series #1721
Marron JS (1988) Improvement of a data based bandwidth selector. Unpublished manuscript
Marron JS, Padgett WJ (1987) Asymptotically optimal bandwidth selection for kernel density estimators from randomly right-censored samples. Annals of Statistics 15:1520–1535
Google Scholar
Mielniczuk J, Vieu P (1988) Asymptotic suboptimality of one method of cross-validatory bandwidth choice in density estimation. Unpublished manuscript
Müller HG (1985) Empirical bandwidth choice for nonparametric kernel regression by means of pilot estimators. Statistics and Decisions, Supplement Issue No. 2:193–206
Müller HG, Stadtmüller U (1987) Variable bandwidth kernel estimators of regression curves. Annals of Statistics 15:182–201
Google Scholar
Müller HG, Stadtmüller U, Schmitt T (1987) Bandwidth choice and confidence intervals for derivatives of noisy data. Biometrika 74:743–749
Google Scholar
Nadaraya EA (1964) On estimating regression. Theory of Probability and its Application 9:141–142
Google Scholar
Nolan D, Pollard D (1987) U-processes: rates of convergence. Annals of Statistics 15:780–799
Google Scholar
Parzen E (1962) On estimation of a probability density function and mode. Annals of Mathematical Statistics 33:1065–1076
Google Scholar
Prakasa Rao BLS (1983) Nonparametric functional estimation. Academic Press, New York
Google Scholar
Rice J (1984) Bandwidth choice for nonparametric regression. Annals of Statistics 12:1215–1230
Google Scholar
Rice J (1986) Bandwidth choice for differentiation. Journal of Multivariate Analysis 19:251–264
Google Scholar
Rosenblatt M (1956) Remarks on some non-parametric estimates of a density function. Annals of Mathematical Statistics 27:832–837
Google Scholar
Rosenblatt M (1971) Curve estimates. Annals of Mathematical Statistics 42:1815–1842
Google Scholar
Rudemo M (1982a) Empirical choice of histograms and kernel density estimators. Scandanavian Journal of Statistics 9:65–78
Google Scholar
Rudemo M (1982b) Consistent choice of linear smoothing methods. Report 82-1, Department of Mathematics, Royal Danish Agricultural and Veterinary University, Copenhagen
Google Scholar
Schuster EA, Gregory CG (1981) On the nonconsistency of maximum likelihood nonparametric density estimators. Eddy WF (ed) Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface. Springer, New York, pp 295–298
Google Scholar
Scott DW (1985) Handouts for ASA short course in density estimation. Rice University Technical Report 776-331-86-2
Scott DW (1986) Choosing smoothing parameters for density estimators. In: Allen DM (ed) Computer Science and Statistics: The Interface, pp 225–229
Scott DW (1988) Discussion of Hardie W, Hall P, Marron JS, How far are automatically chosen regression smoothers from their optimum? To appear Journal of the American Statistical Association
Scott DW, Factor LE (1981) Monte Carlo study of three data-based nonparametric probability density estimators. Journal of the American Statistical Association 76:9–15
Google Scholar
Scott DW, Tapia RA, Thompson JR (1977) Kernel density estimation revisited. Nonlinear Analysis, Theory, Methods and Applications 1:339–372
Google Scholar
Scott DW, Terrell GR (1987) Biased and unbiased cross-validation in density estimation. Journal of the American Statistical Association 82:1131–1146
Google Scholar
Sheather SJ (1983) A data-based algorithm for choosing the window width when estimating the density at a point. Computational Statistics and Data Analysis 1:229–238
Google Scholar
Sheather SJ (1986) An improved data-based algorithm for choosing the window width when estimating the density at a point Computational Statistics and Data Analysis 4:61–65
Google Scholar
Silverman BW (1985) Some aspects of the spline smoothing approach to nonparametric regression curve fitting (with discussion). Journal of the Royal Statistical Society, Series B 46:1–52
Google Scholar
Silverman BW (1986) Density estimation for statistics and data analysis. Chapman and Hall, New York
Google Scholar
Stone CJ (1984) An asymptotically optimal window selection rule for kernel density estimates. Annals of Statistics 12:1285–1297
Google Scholar
Stone CJ (1985) An asymptotically optimal histogram selection rule. Proceedings of the Berkeley Symposium in Honor of Jerzy Neyman and Jack Keifer
Stone M (1974) Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society, Series B 36:111–147
Google Scholar
Tapia RA, Thompson JR (1978) Nonparametric probability density estimation. The Johns Hopkins University Press, Baltimore
Google Scholar
Tarter ME, Kronmal RA (1976) An introduction to the implementation and theory of nonparametric density estimation. The American Statistician 30:105–112
Google Scholar
Terrell GR, Scott DW (1985) Oversmoothed density estimates. Journal of the American Statistical Association 80:209–214
Google Scholar
Tsybakov AB (1987) On the choice of the bandwidth in kernel nonparametric regression. Theory of Probability and Its Applications 32:142–148
Google Scholar
Ullah A (1987) Nonparametric estimation of econometric functional. Research Report 18, University of Western Ontario, to appear in Canadian Journal of Economics
Wahba G, Wold S (1975) A completely automatic fench curve: fitting spline functions by cross-validation. Communications in Statistics 4:1–17
Google Scholar
Watson GS (1964) Smooth regression analysis. Sankhya, series A 26:359–372
Google Scholar
Wegman EJ (1972) Nonparametric probability density estimation: I. a summary of the available methods. Technometrics 14:533–546
Google Scholar
Wertz W (1978) Statistical density estimation: a survey. Angewandte Statistique und Okonometrie 13, Vandenhoeck und Ruprecht
Wertz W, Schneider B (1979) Statistical density estimation: a bibliography. International Statistical Review 49:75–93
Google Scholar
Woodroofe M (1970) On choosing a delta sequence. Annals of Mathematical Statistics 41:1665–1671
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Statistics, University of North Carolina, USA
J. S. Marron

Authors

J. S. Marron
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Research partially supported by NSF Grant DMS-8701201.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Marron, J.S. Automatic smoothing parameter selection: A survey. Empirical Economics 13, 187–208 (1988). https://doi.org/10.1007/BF01972448

Download citation

Issue Date: September 1988
DOI: https://doi.org/10.1007/BF01972448

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic smoothing parameter selection: A survey

Abstract

Access this article

Similar content being viewed by others

Estimator Selection: a New Method with Applications to Kernel Density Estimation

Kader—An R Package for Nonparametric Kernel Adjusted Density Estimation and Regression

Semi-supervised Smoothing for Large Data Problems

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Automatic smoothing parameter selection: A survey

Abstract

Access this article

Similar content being viewed by others

Estimator Selection: a New Method with Applications to Kernel Density Estimation

Kader—An R Package for Nonparametric Kernel Adjusted Density Estimation and Regression

Semi-supervised Smoothing for Large Data Problems

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation