Juarez, Miguel A. and Steel, Mark F. J. (2006): Modelbased Clustering of nonGaussian Panel Data.

PDF
MPRA_paper_880.pdf Download (1MB)  Preview 
Abstract
In this paper we propose a modelbased method to cluster units within a panel. The underlying model is autoregressive and nonGaussian, allowing for both skewness and fat tails, and the units are clustered according to their dynamic behaviour and equilibrium level. Inference is addressed from a Bayesian perspective and model comparison is conducted using the formal tool of Bayes factors. Particular attention is paid to prior elicitation and posterior propriety. We suggest priors that require little subjective input from the user and possess hierarchical structures that enhance the robustness of the inference. Two examples illustrate the methodology: one analyses economic growth of OECD countries and the second one investigates employment growth of Spanish manufacturing firms
Item Type:  MPRA Paper 

Institution:  University of Warwick 
Original Title:  Modelbased Clustering of nonGaussian Panel Data 
Language:  English 
Keywords:  autoregressive modelling; employment growth; GDP growth convergence; hierarchical prior; model comparison; posterior propriety; skewness 
Subjects:  C  Mathematical and Quantitative Methods > C2  Single Equation Models ; Single Variables > C23  Panel Data Models ; Spatiotemporal Models C  Mathematical and Quantitative Methods > C1  Econometric and Statistical Methods and Methodology: General > C11  Bayesian Analysis: General 
Item ID:  880 
Depositing User:  Miguel A. Juarez 
Date Deposited:  21. Nov 2006 
Last Modified:  13. Mar 2015 03:49 
References:  AlonsoBorrego, C. and Arellano, M. (1999). Symmetrically normalised intrumental variable estimation using panel data, Journal of Business & Economic Statistics, 17, 36–49. Arellano, M. (2003). Panel Data Econometrics, Oxford: University Press. Baltagi, B. (2001). Econometric Analysis of Panel Data, Chichester: Wiley, second ed. Banfield, J. D. and Raftery, A. E. (1993). Modelbased Gaussian and nonGaussian clustering, Biometrics, 49, 803–821. Bauwens, L. and Rombouts, J.V.K. (2006). Bayesian clustering of many GARCH models, Econometric Reviews, forthcoming. Bensmail, H., Celeux, G., Raftery, A. E. and Robert, C. P. (1997). Inference in modelbased cluster analysis, Statistics and Computing, 7, 1–10. Berger, J.O. and Bernardo, J.M. (1992). Ordered group reference priors with application to the multinomial problem, Biometrika, 79, 25–37. Canova, F. (2004). Testing for convergence clubs in income per capita: A predictive density approach, International Economic Review, 45, 49–77. Casella, G., Mengersen, K. L., Robert, C. P. and Titterington, D. M. (2002). Perfect samplers for mixtures of distributions, J. Roy. Statist. Soc. B, 64, 777–790. Casella, G., Robert, C. P. and Wells, M. T. (2004). Mixture models, latent variables and partitioned important sampling, Statistical Methodology, 1, 1–18. Celeux, G., Hurn, M. and Robert, C. P. (2000). Computational and inferential diculties with mixture posterior distributions, J. Amer. Statist. Assoc., 95, 957–970. Chen, M. H., Shao, Q. M. and Igrahim, J. G. (2000). Monte Carlo Methods in Bayesian Computation, New York: Springer. Chib, S. (1995). Marginal likelihood from the Gibbs output, J. Amer. Statist. Assoc., 90, 1313–1321. DiCiccio, J., Kass, R. E., Raftery, A. E. and Wasserman, L. (1997). Computing Bayes factors by combining simulations and asymptotic approximations, J. Amer. Statist. Assoc., 92, 903–915. Diebolt, J. and Robert, C. P. (1994). Estimation of finite mixture distributions through Bayesian sampling, J. Roy. Statist. Soc. B, 56, 363–375. Diggle, P. J., Heagerty, P., Liand, K. Y. and Zeger, S. L. (2002). Analysis of longitudinal data, Oxford: University Press, second ed. Durlauf, S. N. and Johnson, P. A. (1995). Multiple regimes and crosscountry growth behaviour, Journal of Applied Econometrics, 10, 365–384. Durlauf, S. N. and Quah, D. T. (1999). The new empirics of economic growth, Handbook of Macroeconomics, vol. 1 (J. B. Taylor and M. Woodford, eds.), Amsterdam: Elsevier, pp. 235–308. Fernández, C. and Steel, M. F. J. (1998). On Bayesian modeling of fat tails and skewness, J. Amer. Statist. Assoc., 93, 359–371. Fernández, C. and Steel, M. F. J. (2000). Bayesian regression analysis with scale mixtures of normals, Econometric Theory, 16, 80–101. Fraley, C. and Raftery, A. E. (2002). Modelbased clustering, discriminant analysis, and density estimation, J. Amer. Statist. Assoc., 97, 611–631. FrühwirthSchnatter, S. (2004). Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques, Econometrics Journal, 7, 143–167. FrühwirthSchnatter, S. and Kaufmann, S. (2004). Modelbased clustering of multiple time series, mimeo, Johannes Kepler Universität Linz. Gelman, A. (2006). Prior distributions for variance parameters in hierarchical models, Bayesian Analysis, 1, 1–19. Green, P. J. (1995). Reversible jump Markov chain Monte Carlo computation and Bayesian model determination, Biometrika, 82, 711–732. Heston, A., Summers, R. and Aten, B. (2002). Penn world table version 6.1, http://pwt.econ.upenn.edu/php_site/pwt_index.php. Center for International Comparisons at the University of Pennsylvania (CICUP). Hirano, K. (2002). Semiparametric Bayesian inference in autoregressive panel data models, Econometrica, 70, 781–799. Hoogstrate, A. J., Palm, F. C. and Pfann, G. A. (2000). Pooling in dynamic paneldata models: An application to forecasting GDP growth rates., Journal of Business & Economic Statistics, 18, 274– 283. Hsiao, C. (2003). Analysis of Panel Data, 2nd ed., Cambridge: Cambridge Univ. Press. Ishwaran, H., James, L. F. and Sun, J. (2001). Bayesian model selection in finite mixtures by marginaldensity decompositions, J. Amer. Statist. Assoc., 96, 1316–1322. Jasra, A., Holmes, C. C. and Stephens, D. A. (2005). Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modelling, Statistical Science, 20, 50–67. Juárez, M. A. and Steel, M. F. J. (2006). NonGaussian dynamic Bayesian modelling for panel data, Working Paper 0605, CRiSM, University of Warwick. Liu, M. C. and Tiao, G. C. (1980). Random coecient firstorder autoregressive models, Journal of Econometrics, 13, 305–325. Marin, J. M., Mengersen, K. and Robert, C. P. (2005). Bayesian modelling and inference on mixtures of distributions, Handbook of Statistics, vol. 25 (D. Dey and C. R. Rao, eds.), Amsterdam: NorthHolland, pp. 459–207. McLachlan, G. J. and Peel, D. (2000). Finite Mixture Models, New York: Wiley. Meng, X. and Wong, W. H. (1996). Simulating ratios of normalising constants via a simple identity: A theoretical exploration, Statistica Sinica, 6, 831–860. Nandram, B. and Petruccelli, J. D. (1997). A Bayesian analysis of autoregressive time series panel data, Journal of Business and Economic Statistics, 15, 328–334. Nerlove, M. (2002). Essays in Panel Data Econometrics, Cambridge: University Press. Newton, M. A. and Raftery, A. E. (1994). Approximate Bayesian inference with the weighted likelihood bootstrap, J. Roy. Statist. Soc. B, 56, 3–48. Phillips, D. B. and Smith, A. F. M. (1996). Bayesian model comparison via jump diusions, Markov chain Monte Carlo in practice (W. R. Gilks, S. Richardson and S. J. Spiegelhalter, eds.), Boca Raton: Chapman & Hall, pp. 215–240. Quah, D. T. (1997). Empirics for growth distribution: stratification, polarization and convergence clubs, Journal of Economic Growth, 2, 27–59. Raftery, A. E. (1996). Hypothesis testing and model selection, Markov chain Monte Carlo in practice (W. R. Gilks, S. Richardson and S. J. Spiegelhalter, eds.), Boca Raton: Chapman & Hall, pp. 163–188. Richardson, S. and Green, P. J. (1997). On Bayesian analysis of mixtures with an unknown number of components, J. Roy. Statist. Soc. B, 59, 731–792. (with discussion). Steele, R. S., Raftery, A. E. and Emond, M. J. (2003). Computing normalizing constants for finite mixture models via incremental mixture important sampling, Tech. Report 436, Department of Statistics, University of Washington. Stephens, M. (2000a). Bayesian analysis of mixtures with an unknown number of components – An alternative to reversible jump methods, Annals of Statistics, 28, 40–74. Stephens, M. (2000b). Dealing with label switching in mixture models, J. Roy. Statist. Soc. B, 62, 795–809. Temple, J. (1999). The new growth evidence, Journal of Economic Literature, 37, 112–156. Titterington, D. M., Smith, A. F. M. and Makov, U. E. (1985). Statistical analysis of finite mixture distributions, Chichester: Wiley. Verdinelli, I., and Wasserman, L. 1995. Computing Bayes Factors using a generalization of the SavageDickey density ratio, Journal of the American Statistical Association, 90, 614618. Weiss, R. E. (2005). Modeling longitudinal data, New York: Springer. 
URI:  http://mpra.ub.unimuenchen.de/id/eprint/880 