Bai, Jushan and Ando, Tomohiro (2013): Panel data models with grouped factor structure under unknown group membership.
Preview |
PDF
MPRA_paper_52782.pdf Download (398kB) | Preview |
Abstract
This paper studies panel data models with unobserved group factor structures. The group membership of each unit and the number of groups are left unspecified. The number of explanatory variables can be large. We estimate the model by minimizing the sum of least squared errors with a shrinkage penalty. The regressions coefficients can be homogeneous or group specific. The consistency and asymptotic normality of the estimator are established. We also introduce new $C_p$-type criteria for selecting the number of groups, the numbers of group-specific common factors and relevant regressors. Monte Carlo results show that the proposed method works well. We apply the method to the study of US mutual fund returns under homogeneous regression coefficients, and the China mainland stock market under group-specific regression coefficients.
Item Type: | MPRA Paper |
---|---|
Original Title: | Panel data models with grouped factor structure under unknown group membership |
Language: | English |
Keywords: | Clustering, penalized method, lasso, SCAD, serial and cross-sectional error correlations, factor structure |
Subjects: | C - Mathematical and Quantitative Methods > C2 - Single Equation Models ; Single Variables > C23 - Panel Data Models ; Spatio-temporal Models C - Mathematical and Quantitative Methods > C5 - Econometric Modeling > C52 - Model Evaluation, Validation, and Selection |
Item ID: | 52782 |
Depositing User: | Jushan Bai |
Date Deposited: | 09 Jan 2014 05:46 |
Last Modified: | 28 Sep 2019 13:07 |
References: | Amengual, D., Watson, M. W. (2007). Consistent estimation of the number of dynamic factors in a large $N$ and $T$ panel. {\it Journal of Business and Economic Statistics 25}, 91--96. Ando, T. and Bai, J. (2013). Multifactor asset pricing with a large number of observable risk factors and unobservable common and group-specific factors. Working paper. Available at http://papers.ssrn.com/sol3/papers.cfm?abstract\_id=2289201 Ando, T. and Tsay, R. (2013). On model selection for large panel data with interactive effects. Working paper. Booth School of Business, University of Chicago. Arellano, M. (2003). Panel Data Econometrics. Oxford University Press. Arellano, M., and Hahn, J. (2005). Understanding Bias in Nonlinear Panel Models: Some Recent Developments. Invited Lecture, Econometric Society World Congress, London. Bai, J. (2009). Panel data models with interactive fixed effects. {\it Econometrica} 77, 1229--1279. Bai, J. and Ng, S. (2002). Determining the number of factors in approximate factor models. {\it Econometrica}, 70, 191--221. Baltagi, B.H. (2008). Econometric Analysis of Panel Data Wiley, John \& Sons. Bester, A. and Hansen, C. (2012). Grouped effects estimators in fixed effects models. {\it Journal of Econometrics}, Forthcoming. Bonhomme, S. and Manresa, E. (2012). Grouped patterns of heterogeneity in panel data. CEMFI, Working paper, 2012-1208. Chamberlain, G. and Rothschild, M. (1983). Arbitrage, factor structure and mean-variance analysis in large asset markets. {\it Econometrica} 51, 1305--1324. Chen, N. F., Roll, R., and Ross, S. (1986). Economic forces and the stock market. {\it Journal of Business}, 59, 383--403. Connor, G. and Korajzcyk, R. (1986). Performance measurement with the arbitrage pricing theory: a new framework for analysis. {\it Journal of Financial Economics}, 15, 373--394. Diebold, F., Li, C. and Yue, V. (2008). Global yield curve dynamics and interactions: a dynamic Nelson-Siegel approach. {\it Journal of Econometrics}, 146, 315--363. Fama, E. F. (1981). Stock prices, real activity, inflation and money. {\it American Economic Review}, 71, 545--65. Fama, E. F. and French, K. R. (1989). Business conditions and expected returns on stocks and bonds. {\it Journal of Financial Economics}, 5, 23--49. Fama, E. F. and French, K. R. (1993). Common risk factors in the returns on stocks and bonds. {\it Journal of Financial Economics}, 33, 3--56. Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. {\it J. of the American Statistical Association} 96, 1348--1361. Forgy, E. W. (1965). Cluster analysis of multivariate data: Efficiency vs. Interpretability of classifications. {\it Biometrics}, 21, 768--769. Forni, M., Hallin, M., Lippi, M. and Reichlin, L. (2000). The generalized dynamic factor model: identification and estimation. {\it Review of Economics and Statistics}, 82, 540--554. Forni, M. and Lippi, M. (2001). The generalized factor model: representation theory. {\it Econometric Theory}, 17, 1113--1141. Geweke, J. (1977). The dynamic factor analysis of economic time series. In: Aigner, D. J., Goldberger, A. S. (eds), Latent Variables in Socio-Economic Models. Amsterdam: North-Holland, pp. 365--383. Hahn, J. and Kuersteiner, G. M. (2002). Asymptotically unbiased inference for a dynamic panel model with fixed effects when both $N$ and $T$ are large. {\it Econometrica}, 70, 1639--1657. Hahn, J. and Newey, W. K. (2004). Jackknife and analytical bias reduction for nonlinear panel models. {\it Econometrica}, 72, 1295--1319. Hallin, M., Liska R., (2007). The generalized dynamic factor model: determining the number of factors. {\it Journal of the American Statistical Association}, 102, 603--617. Hsiao, C., and A. K. Tahmiscioglu (1997). A panel analysis of liquidity constraints and firm investment. {\it Journal of the American Statistical Association}, 92, 455--465. Hsiao, C. (2003). Analysis of Panel Data, 2nd edition. Cambridge University Press Kapetanios, G., Pesaran, M.H. and Yamagata, T. (2011). Panels with non-stationary multifactor error structures. {\it Journal of Econometrics}, 160, 326--348. Kose, A., Otrok, C., Whiteman, C. (2008). Understanding the evolution of world business cycles. {\it International Economic Review}, 75, 110--130. Lin, C. and Ng. S. (2012). Estimation of Panel Data Models with Parameter Heterogeneity When Group Membership is Unknown. {\it Journal of Econometric Methods}, 1, 42--55. Mallows, C. L. (1973). Some comments on $Cp$. {\it Technometrics} 15, 661--675. Moench, E. and Ng, S. (2011). A Factor Analysis of Housing Market Dynamics in the U.S. and the Regions. {\it Econometrics Journal}, 14, 1--24. Moench, E., Ng, S., Potter, S. (2012). Dynamic hierarchical factor models. {\it Review of Economics and Statistics}, forthcoming. Available at Staff Reports 412, Federal Reserve Bank of New York. Moon, H. R. and Weidner, M. (2009). Likelihood expansion for panel regression models with factors. Working Paper. Department of Economics, University of Southern California. Pesaran, M. H. (2006). Estimation and inference in large heterogeneous panels with a multifactor error structure. {\it Econometrica}, 74, 967--1012. Pesaran, M. H. and Tosetti, E. (2011). Large panels with common factors and spatial correlation {\it Journal of Econometrics}, 161, 182--202. Sargent, T. J., Sims, C. A. (1977). Business cycle modeling without pretending to have too much a priori economic theory. In: Sims C. et al. (eds), New Methods in Business Cycle Research. Federal Reserve Bank of Minneapolis, Minneapolis. Stock, J. H. and Watson, M. W. (2002). Forecasting using principal components from a large number of predictors. {\it Journal of the American Statistical Association}, 97, 1167--1179. Sun, Y. X. (2005). Estimation and inference in panel structure models. Working paper, Department of Economics, University of California, San Diego. Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. {\it Journal of the Royal Statistical Society}, B58, 267--288. Wang, P. (2010). Large dimensional factor models with a multi-level factor structure. Working paper, Department of Economics, HKUST. Wooldridge, J.M. (2010) Econometric analysis of cross section and panel data, second edition. MIT Press. Zhang, C. H. (2010). Nearly unbiased variable selection under minimax concave penalty. {\it Annals of Statistics}, 38, 894--942. |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/52782 |