Petoussis, Kos and Gill, Richard and Zeelenberg, Kees (1997): Statistical analysis of heaped duration data.

PDF
MPRA_paper_89263.pdf Download (248kB)  Preview 
Abstract
This paper shows how heaping of duration data, e.g. caused by rounding due to memory effects, can be analyzed. If the data are heaped Cox's partial likelihood approach, which is often used in survival analysis, is no longer appropriate. We show how this problem can be overcome by considering the problem as a missing data problem. A variant of Cox's Proportional Hazard Model is constructed that takes heaping into account, and is estimated by maximum likelihood using the EM algorithm, with many nuisance parameters, simultaneously for all parameters. Ingredients of our method are application of the EM algorithm, Cox regression and nonparametric maximum likelihood calculation with `predicted' data in each M step. An example from practice, where jackknife is used to estimate the variances, illustrates the power of the new methodology.
Item Type:  MPRA Paper 

Original Title:  Statistical analysis of heaped duration data 
English Title:  Statistical analysis of heaped duration data 
Language:  English 
Keywords:  heaping; duration data; survival analysis; Proportional Hazard Model; profile likelihood; EM algorithm 
Subjects:  C  Mathematical and Quantitative Methods > C2  Single Equation Models ; Single Variables > C23  Panel Data Models ; Spatiotemporal Models C  Mathematical and Quantitative Methods > C4  Econometric and Statistical Methods: Special Topics > C41  Duration Analysis ; Optimal Timing Strategies J  Labor and Demographic Economics > J6  Mobility, Unemployment, Vacancies, and Immigrant Workers > J64  Unemployment: Models, Duration, Incidence, and Job Search 
Item ID:  89263 
Depositing User:  Kees Zeelenberg 
Date Deposited:  03 Oct 2018 16:22 
Last Modified:  03 Oct 2018 16:24 
References:  CBS, 1991. Socioeconomic panel survey, contents, concept and organization, 1991, SDU/Publishers, the Hague. Cox, D.R. and D.V. Hinkley, 1978. Problems and solutions in theoretical statistics (Chapman & Hall). Cox, D.R., 1975. Partial likelihood. Biometrika 62 p. 269. Dempster, A.P., N.M. Laird and D.B. Rubin, 1976. Maximum Likelihood from Incomplete Data via the EM algorithm. Journal of Royal Statistical Society B 19, pp.138. Gorter D. and E. Hoogteijling, 1990a. Determinanten van werkloosheidsduur, 19851987. Internal report (Department of Statistical Methods, Statistics Netherlands, Voorburg). Gorter D. and E. Hoogteijling, 1990b. Duur van het zoeken naar werk. Report (Department of Statistical Methods, Statistics Netherlands, Voorburg). Heijtjan, D.F. and D.B. Rubin, 1991. Ignorability and coarse data. Annals of Statistics 19, pp.224453. Heijtjan, D.F. and D.B. Rubin, 1990. Inference from coarse data via multiple imputation with application to age heaping. Journal of the American Statistics Association 85, pp.30414. Johansen, S., 1983. An extension of Cox's regression model. International Statistical Review, 51, pp. 165174. Kalbfleisch, J.G., and R.L. Prentice, 1980. The statistical analysis of failure time data, John Wiley, New York. Louis, T.A., 1982. Finding the observed information matrix when using the EM algorithm. J. R. Statist. Soc. B 44, No. 2, pp.226233. Meilijson, I., 1989. A fast improvement to the EM algorithm on its own terms. Journal of the Royal Statistical Society B 51, pp.12738. Torelli, N. and U. Trivellato, 1993. Modelling inaccuracies in jobsearch duration data. Journal of Econometrics 59, pp.185211. Wolter, K.M., 1985. Introduction to variance estimation. SpringerVerlag, New York Inc. Zwemmer, W., 1995. Parallel computing in statistics. Report (Department of Statistical Methods, Statistics Netherlands, Voorburg). 
URI:  https://mpra.ub.unimuenchen.de/id/eprint/89263 