Skip to main content
Log in

A pretest for choosing between logrank and wilcoxon tests in the two-sample problem

  • Published:
METRON Aims and scope Submit manuscript

Summary

Two commonly used tests for comparison of survival curves are the generalized Wilcoxon procedure of Gehan (1965) and Breslow (1970) and the logrank test proposed by Mantel (1966) and Cox (1972). In applications, the logrank test is often used after checking for validity of the proportional hazards (PH) assumption, with Wilcoxon being the fallback method when the PH assumption fails.

However, the relative performance of the two procedures depend not just on the PH assumption but also on the pattern of differences between the two survival curves. We show that the crucial factor is whether the differences tend to occur early or late in time. We propose diagnostics to measure early-or-late differences between two survival curves. We propose a pretest that will help the user choose the more efficient test under various patterns of treatment differences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Andersen, P. K., Borgan, O., Gill, R. D. and Keiding, N. (1993) Statistical Models Based on Counting Processes, Springer-Verlag, New York.

    Book  MATH  Google Scholar 

  • Bagdonavicius, V. B., Levuliene, R. J., Nikulin, M. S. and Zdorova-Cheminade, O. (2004) Tests for equality of survival distributions against non-location alternatives, Lifetime Data Analysis, 10(4), 445–460.

    Article  MathSciNet  MATH  Google Scholar 

  • Bagdonavicius, V. B. and Nikulin, M. (2006) On goodness-of-fit tests for homogeneity and proportional hazards, Applied Stochastic Models in Business and Industry, 22, 607–619.

    Article  MathSciNet  MATH  Google Scholar 

  • Bagdonavicius, V. B., Levuliene, R. J. and Nikulin, M. (2009) Testing absence of hazard rates crossing, Comptes Rendus de l’Academie des Sciences de Paris, 346(7—8), 445–450.

    Google Scholar 

  • Beltangady, M. S. and Frankowski, R. F. (1989) Effect of unequal censoring on the size and power of the logrank and Wilcoxon types of tests for survival data, Statistics in Medicine, 8(8), 937–945.

    Article  Google Scholar 

  • Breslow, N. (1970) A generalized Kruskal-Wallis test for comparing K samples subject to unequal patterns of censorship, Biometrika, 57(3), 579–594.

    Article  MATH  Google Scholar 

  • Cox, D. R. (1972) Regression Models and Life-tables (with discussion), Journal of the Royal Statistical Society, 34(2), 187–220.

    MATH  Google Scholar 

  • Gehan, E. A. (1965) A Generalized Wilcoxon Test for Comparing Arbitrarily Singly-Censored Samples, Biometrika, 52(1/2), 203–223.

    Article  MathSciNet  MATH  Google Scholar 

  • Hess, K. (1995) Graphical Methods for Assessing Violations of the Proportional Hazards Assumption in Cox Regression, Statistics in Medicine, 14, 1707–1723.

    Article  Google Scholar 

  • Hogg, R. V., Fisher, D. M. and Randles, R. H. (1975) A Two-Sample Adaptive Distribution-Free Test, Journal of the American Statistical Association, 70(351), 656–661.

    Article  MATH  Google Scholar 

  • Kalbfleisch, J. D. and Prentice, R. L. (1980) The Statistical Analysis of Failure Time Data, John Wiley and Sons Inc., New York.

    MATH  Google Scholar 

  • Klein, J. P. and Moeschberger, M. L. (1997) Survival Analysis, Springer, New York.

    MATH  Google Scholar 

  • Lee, E. T., Desu, M. M. and Gehan, E. A. (1975) A Monte Carlo Study of the Power of Some Two-Sample Tests, Biometrika, 62(2), 425–432.

    Article  MATH  Google Scholar 

  • Lehmann, E. L. (1953) The power of rank tests, Ann. Math. Statist., 24, 23–43.

    Article  MATH  Google Scholar 

  • Leton, E. and Zuluaga P. (2001) Equivalence between score and weighted tests for survival curves, Communications in Statistics: Theory and Methods, 30, 591–608.

    Article  MathSciNet  MATH  Google Scholar 

  • Leton, E. and Zuluaga P. (2005) Relationships among tests for censored data, Biometrical Journal, 47, 377–387.

    Article  MathSciNet  MATH  Google Scholar 

  • Mantel, N. (1967) Evaluation of survival data and two new rank order statistics arising in its consideration, Cancer Chemotherapy Rep., 50, 163–170.

    Google Scholar 

  • Peto, R. (1972) Rank tests of maximum power against Lehmann type alternatives, Biometrika, 59, 472–474.

    Article  MathSciNet  MATH  Google Scholar 

  • Peto, R. and Peto, J. (1972) Asymptotically efficient rank invariant test procedures, Journal of the Royal Statistical Society, Series A (General) 135(2), 185–207.

    Article  Google Scholar 

  • Prentice, R. L. and Marek, P. (1979) A qualitative discrepancy between censored data rank tests, Biometrics, 35, 861–867.

    Article  Google Scholar 

  • Shen, W. and Le, C. T. (2000) Linear rank tests for censored survival data, Communication in Statistics-Simulation and Computation, 29(1), 21–36.

    Article  MATH  Google Scholar 

  • Stablein, D. M. and Koutrouvelis, I. A. (1985) A two sample test sensitive to crossing hazards in uncensored and singly censored survival data, Biometrics, 41, 643–652.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Martinez, R.L.M.C., Naranjo, J.D. A pretest for choosing between logrank and wilcoxon tests in the two-sample problem. METRON 68, 111–125 (2010). https://doi.org/10.1007/BF03263529

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF03263529

Keywords

Navigation