An Updated Algorithm for Moderate Censoring in Time-to-Event Data Using Rank-based Regression

Main Article Content

Milind A. Phadnis


Aim: To propose an updated algorithm with an extra step added to the Newton-type algorithm used in robust rank based non-parametric regression for minimizing the dispersion function associated with Wilcoxon scores in order to account for the effect of covariates.

Methodology: The proposed accelerated failure time approach is aimed at incorporating right random censoring in survival data sets for low to moderate levels of censoring. The existing Newton algorithm is modified to account for the effect of one or more covariates. This is done by first applying Mantel scores to residuals obtained from a regression model, and second by minimizing the dispersion function of these scored residuals. Diagnostic check of the model fit is performed by observing the distribution of the residuals and suitable Bent scores are considered in the case of skewed residuals. To demonstrate the efficacy of this method, a simulation study is conducted to compare the power of this method under three different scenarios: non-proportional hazard, proportional and constant hazard, and proportional but non-constant hazard.

Results: In most situations, this method yielded reasonable estimates of power for detecting an association of the covariate with the response as compared to popular parametric and semi-parametric approaches. The estimates of the regression coefficient obtained from this method were evaluated and were found to have low bias, low mean square error, and adequate coverage. In a real-life example pertaining to pancreatic cancer study, the proposed method performed admirably well and provided a more realistic interpretation about the effect of covariates (age and Karnofsky score) compared to a standard parametric (lognormal) model.

Conclusion: In situations where there is no clear best parametric fit for time-to-event data with moderate level of censoring, the proposed method provides a robust alternative to obtain regression coefficients (both adjusted and unadjusted) with a performance comparable to that of a proportional hazards model.

Bent scores, mantel scoring, Newton algorithm, rank regression, Wilcoxon scores.

Article Details

How to Cite
Phadnis, M. (2019). An Updated Algorithm for Moderate Censoring in Time-to-Event Data Using Rank-based Regression. Asian Journal of Probability and Statistics, 5(1), 1-18.
Original Research Article


Conover WJ, Iman RL. Rank transformations as a bridge between parametric and nonparametric statistics. Am Stat. 1981;35:124-134.

Howard G, Koch GG. The GLM logrank test: general linear modeling of logrank scores as a method of analysis for survival data. Commun Stat Simul Comput. 1990;19:903-917.

Hougaard P. Analysis of multivariate survival data, New York, NY: Springer; 2001.

Buckley J, James I. Linear regression with censored data. Biometrika. 1979;66:429-436.

Prentice RL. Linear rank tests with censored data. Biometrika. 1978;65:167-179.

Tsiatis AA. Estimating regression parameters using linear rank tests for censored data. Ann Stat. 1990;18:354-372.

Ritov Y. Estimation in a linear regression model with censored data. Ann Stat. 1990;18:303-328.

Lai TL, Ying Z. Large sample theory of a modified Buckley-James estimator for regression analysis with censored data. Ann Stat. 1991;19:1370-1402.

Lai TL, Ying Z. Rank regression methods for left-truncated and right-censored data. Ann Stat. 1991;19:531-556.

Ying Z. A large sample study of rank estimation for censored regression data. Ann Stat. 1993;21:76-99.

Jin Z, Lin DY, Wei LJ, Ying Z. Rank-based inference for the accelerated failure time model. Biometrika. 2003;90(2):341-353.

Wei LJ, Ying Z, Lin DY. Linear regression analysis of censored survival data based on rank tests. Biometrika. 1990;77:845-851.

Lin DY, Greyer CJ. Computational methods for semiparametric linear regression with censored data. J Comput Graph Stat. 1992;1:77-90.

Gehan EA. A generalized Wilcoxon test for comparing arbitrarily single censored samples. Biometrika. 1965;52:203-223.

Hettmansperger TP, McKean JW. Kendalls library of statistics 5: Robust nonparametric statistical methods, New York, NY: Arnold; 1998.

McKean JW, Sievers GL. Rank scores suitable for analyses of linear models under asymmetric error distributions. Technometrics. 1989;31:207-218.

McKean JW, Terpstra JT, Kloke JD. Computational rank-based statistics. Comput Stat. 2009;1:132-140.

Terpstra JT, McKean JW. Rank-based analysis of linear models in R. J Stat Softw. 2005;14:1-26.

Kapenga JA, McKean JW, Vidmar TJ. Short course on robust statistical procedures for the analysis of linear and nonlinear models. RGLM: Users Manual, American Statistics Association, New Orleans; 1988.

Mantel N. Calculation of scores for a Wilcoxon generalization applicable to data subject to arbitrary right censorship. Am Stat. 1981;35:244-247.

Policello GE, Hettsmanperger TP. Adaptive robust procedures for the one-sample location problem. J Am Stat Assoc. 1976;71:624-633.

McKean JW, Vidmar TJ, Sievers GL. A robust two-stage multiple comparison procedure with application to a random drug screen. Biometrics. 1989;45:1281-1297.

Burton A, Altman DG, Royston P, Holder RL. The design of simulation studies in medical statistics. Stat Med. 2006;25:4279-4292.

Kloke JD, McKean JW. Rfit: Rank based estimation for linear models. R J. 2012;4:57-64.

Royston P. The lognormal distribution as a model for survival time in cancer, with an emphasis on prognostic factors. Stat Neerl. 2001;55:89-104.

Wilcox CM, Phadnis MA, Seay T, Varadarajulu S. MO1428 Prospective Evaluation of the Karnofsky Score As a Guide to Placing a Plastic Versus Metal Stent for Pancreaticobiliary Malignancy (4th ed., vol. 73, pp. AB342). Elsevier; 2011.