Modelling interval over-dispersed censored survival data
DOI:
https://doi.org/10.32782/2786-7684/2024-2-27Keywords:
over-dispersion, interval censoring, survival data, NB2 regressionAbstract
Introduction. Unknown exact timing of events and quite possible heterogeneity of counts distributions across time intervals together with profile insufficiencies pose challenges to data modelling. it calls either for mixture of latent variables distributions or generalised distributions that accommodate heterogeneity. Our goal was to examine capacity of Negative Binomial Type 2 (NB2) regression to deal with presented challenges. Methods and data. We suggested application of NB2 regression with support of power analysis implemented in R package «ltable». We examined efficacy by simulated example to demonstrate the capacity to unveil the data generation mechanism. Results confirmed that counts strongly over-dispersed which is usual situation. Covariance matrix of model parameters was not stable enough and sensitive due to the paucity of profiles that is also typical. Fit to the data was good as chi-square test is less than 1 per degree of freedom with actual value of 0.09. That was supported by residuals report. Regression effects were of expected directions and magnitudes and revealed data generation mechanism. Power analysis validated the output and substantiated true data generation mechanism of interval censored survival data. We suggest that tentative example supports the effectiveness of modelling. In its turn power analysis helps to validate the output and to reveal and substantiate true data generation mechanism of interval censored survival data. Conclusions. Modelling interval over-dispersed censored survival data is still a challenge due to heterogeneity and profile shortage that underpowers hypotheses tests. We suggest NB2 based regression to process such data with support of power analysis. Simulated data analysis supports the effectiveness of modelling.
References
Klein JP, Moeschberger M. Survival Analysis. New York: Springer Verlag, 1997.
CRAN R package ltable. https://cran.r-project.org/package=ltable
Congdon P. Bayesian models for categorical data. 2005. John Wiley & Sons Ltd, England. 415 p.
Agresti A. Categorical Data Analysis. 3rd ed. (Wiley series in prob. and stat.; 792). 2013.
Consul PC., Famoye F. Generalized poisson regression model. Communications in Statistics – Theory and Methods. 1992;21(1);89-109 https://doi.org/10.1080/03610929208830766
Alan Huan. Mean-parametrized Conway–Maxwell–Poisson regression models for dispersed counts. Statistical Modelling. 2017;17(6):359–380. https://doi.org/10.1177/1471082X176977
Ocheredko OM. MCMC Bootstrap Based Approach to Power and Sample Size Evaluation. /New Developments in Data Science and Analytics. Proceedings of the 2019 Meeting of International Society for Data Science and Analytics. Zhiyong Zhang Ke-Hai Yuan Yong Wen Jiashan Tang. ISDSA Press · Granger, IN. p.67-87
Ocheredko Oleksandr. Library «ltable». The Comprehensive R Archive Network. https://cran.r-project.org/package=ltable