Antonicelli, Margareth and Drago, Carlo and Costantiello, Alberto and Leogrande, Angelo (2025): Analyzing Income Inequalities across Italian regions: Instrumental Variable Panel Data, K-Means Clustering and Machine Learning Algorithms.
![]() |
PDF
MPRA_paper_124910.pdf Download (2MB) |
Abstract
This study examines income inequality across Italian regions by integrating instrumental variable panel data models, k-means clustering, and machine learning algorithms. Using econometric techniques, we address endogeneity and identify causal relationships influencing regional disparities. K-means clustering, optimized with the elbow method, classifies Italian regions based on income inequality patterns, while machine-learning models, including random forest, support vector machines, and decision tree regression, predict inequality trends and key determinants. Informal employment, temporary employment, and overeducation also play a major role in influencing inequality. Clustering results confirm a permanent North-South economic divide and the most disadvantaged regions are Campania, Calabria, and Sicily. Among the machine learning models, the highest income disparities prediction accuracy comes with the use of Random Forest Regression. The findings emphasize the necessity of education-focused and digitally based policies and reforms of the labor market in an effort to enhance economic convergence. The study portrays the use of a combination of econometric and machine learning methods in the analysis of regional disparities and proposes a solid framework of policy-making with the intention of curbing economic disparities in Italy.
Item Type: | MPRA Paper |
---|---|
Original Title: | Analyzing Income Inequalities across Italian regions: Instrumental Variable Panel Data, K-Means Clustering and Machine Learning Algorithms |
English Title: | Analyzing Income Inequalities across Italian regions: Instrumental Variable Panel Data, K-Means Clustering and Machine Learning Algorithms |
Language: | English |
Keywords: | Income Inequality, Regional Disparities, Machine Learning, Labor Market, Digital Divide. |
Subjects: | C - Mathematical and Quantitative Methods > C2 - Single Equation Models ; Single Variables > C23 - Panel Data Models ; Spatio-temporal Models C - Mathematical and Quantitative Methods > C3 - Multiple or Simultaneous Equation Models ; Multiple Variables > C38 - Classification Methods ; Cluster Analysis ; Principal Components ; Factor Models C - Mathematical and Quantitative Methods > C4 - Econometric and Statistical Methods: Special Topics > C45 - Neural Networks and Related Topics O - Economic Development, Innovation, Technological Change, and Growth > O1 - Economic Development > O15 - Human Resources ; Human Development ; Income Distribution ; Migration R - Urban, Rural, Regional, Real Estate, and Transportation Economics > R1 - General Regional Economics > R11 - Regional Economic Activity: Growth, Development, Environmental Issues, and Changes R - Urban, Rural, Regional, Real Estate, and Transportation Economics > R5 - Regional Government Analysis > R58 - Regional Development Planning and Policy |
Item ID: | 124910 |
Depositing User: | Dr Angelo Leogrande |
Date Deposited: | 01 Jun 2025 11:26 |
Last Modified: | 01 Jun 2025 11:26 |
References: | Acheampong, A. O., Dzator, J., & Shahbaz, M. (2021). Empowering the powerless: does access to energy improve income inequality?. Energy Economics, 99, 105288. Acheampong, A. O., Shahbaz, M., Dzator, J., & Jiao, Z. (2022). Effects of income inequality and governance on energy poverty alleviation: Implications for sustainable development policy. Utilities Policy, 78, 101403. Aginta, H., Gunawan, A. B., & Mendez, C. (2023). Regional income disparities and convergence clubs in Indonesia: new district-level evidence. Journal of the Asia Pacific Economy, 28(1), 101-132. Akita, T., & Alisjahbana, A. S. (2023). The Initial Impacts of the COVID-19 Pandemic on Regional Economies in Indonesia: Structural Changes and Regional Income Inequality. Sustainability, 15(18), 13709. Aresu, F., Marrocu, E., & Paci, R. (2023). Public capital and institutions' quality in the Italian regions. Journal of Regional Science, 63(5), 1284-1308. Asongu, S. A., & Odhiambo, N. M. (2021). Inequality, finance and renewable energy consumption in Sub-Saharan Africa. Renewable Energy, 165, 678-688. Asso, P. F. (2023). New perspectives on old inequalities: Italy's north–south divide. In Inequalities, territorial politics, Nationalism (pp. 22-40). Routledge. Baek, I., Noh, S., & Ahn, J. (2024). Does income inequality move together across the world?. Applied Economics Letters, 31(13), 1195-1200. Basile, R., Ciccarelli, C., & Groote, P. (2022). The legacy of literacy: evidence from Italian regions. Regional Studies, 56(5), 794-807. Berton, F., Pacelli, L., Quaranta, R., & Trentini, F. (2023). Patterns of labor market reforms: a regional approach to the Italian ‘Jobs Act’. Sinappsi, 13, 50-67. Bollani, L., Di Zio, S., & Fabbris, L. (2023). Chapter Remote working in Italy: Just a pandemic accident or a lesson for the future?. In ASA 2022 Data-Driven Decision Making. Firenze University Press, Genova University Press. Bonacini, L., Gallo, G., & Scicchitano, S. (2020). All that glitters is not gold. Influence of working from home on income inequality at the time of Covid-19. Bruns-Smith, D., Feller, A., & Nakamura, E. (2023, June). Using Supervised Learning to Estimate Inequality in the Size and Persistence of Income Shocks. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency (pp. 1747-1756). Byrum, G. (2023). Opening the Broadband Access Paradox. JCMS: Journal of Cinema and Media Studies, 62(4), 193-199. Caragliu, A., & Del Bo, C. F. (2022). Smart cities and urban inequality. Regional Studies, 56(7), 1097-1112. Carta, F., & De Philippis, M. (2021). The impact of the COVID-19 shock on labour income inequality: Evidence from Italy. Bank of Italy Occasional Paper, (606). Castelló-Climent, A., & Doménech, R. (2021). Human capital and income inequality revisited. Education Economics, 29(2), 194-212. Checchi, D., Jappelli, T., Marino, I., & Scognamiglio, A. (2024). Inequality trends in a slow‐growing economy: Italy, 1990–2020. Fiscal Studies, 45(3), 377-392. Chen, T., Gozgor, G., & Koo, C. K. (2021). Pandemics and income inequality: what do the data tell for the globalization era?. Frontiers in Public Health, 9, 674729. Chi, Z., Lun, H., Ma, J., & Zhou, Y. (2024). Income inequality and healthcare utilization of the older adults-based on a study in three provinces and six cities in China. Frontiers in Public Health, 12, 1435162. Condino, F. (2023). Share density‐based clustering of income data. Statistical Analysis and Data Mining: The ASA Data Science Journal, 16(4), 336-347. Culotta, F. (2021). Life expectancy heterogeneity and pension fairness: An Italian north-south divide. Risks, 9(3), 57. Daniele, V. (2021). Territorial disparities in labour productivity, wages and prices in Italy: What does the data show?. European Urban and Regional Studies, 28(4), 431-449. Davidescu, A. A., Lobonţ, O. R., & Nae, T. M. (2024). The Fabric of Transition: Unraveling the Weave of Labor Dynamics, Economic Structures, and Innovation on Income Disparities in Central and Eastern Europe Nations. Economies, 12(3), 68. Dawat, E. R. R. (2023). Predictive Modeling of PowerSchool Usage: Comparative Analysis of Linear Regression and Data Mining Techniques Using Student Attributes. International Journal of Research and Innovation in Social Science, 7(11), 75-85. Dong, K., Dou, Y., & Jiang, Q. (2022). Income inequality, energy poverty, and energy efficiency: who cause who and how?. Technological Forecasting and Social Change, 179, 121622. Fauser, S., & Gebel, M. (2023). Labor market dualism and the heterogeneous wage gap for temporary employment: a multilevel study across 30 countries. Socio-Economic Review, 21(4), 2069-2091. Feldman, M., Guy, F., & Iammarino, S. (2021). Regional income disparities, monopoly and finance. Cambridge Journal of Regions, Economy and Society, 14(1), 25-49. Ferto, I., & Bojnec, S. (2023). Subsidies and the income inequality in the Hungarian wine sector. Wine Economics and Policy, 12(2), 3-14. Futagami, R. (2022). Regional Income Inequality and Allocation of Public Investment: The Japanese Experience, 1958-1986. Adjustments of economics and enterprises in a changing world, 56, 101. Gao, J., Liu, Y., Chen, J., & Cai, Y. (2022). Demystifying the geography of income inequality in rural China: A transitional framework. Journal of Rural Studies, 93, 398-407. Gaubert, C., Kline, P., Vergara, D., & Yagan, D. (2021, May). Trends in US spatial inequality: concentrating affluence and a democratization of poverty. In AEA Papers and proceedings (Vol. 111, pp. 520-525). 2014 Broadway, Suite 305, Nashville, TN 37203: American Economic Association. Ghazouani, T., & Beldi, L. (2022). The impact of income inequality on carbon emissions in Asian countries: Non-parametric panel data analysis. Environmental Modeling & Assessment, 27(3), 441-459. Giannoni, P., Palumbo, M., Pandolfini, V., & Torrigiani, C. (2024). Territorial Disparities in the Governance of Policies Promoting the School-to-Work Transition: An Analysis of the Italian Case. Education Sciences, 14(3), 260. Ginsburgh, V., Magerman, G., & Natali, I. (2021). COVID-19 and the role of inequality in French regional departments. The European Journal of Health Economics, 22, 311-327. Gooljar, S., Manohar, K., & Hosein, P. (2023). Performance evaluation and comparison of a new regression algorithm. arXiv preprint arXiv:2306.09105. Guzzardi, D., Palagi, E., Roventini, A., & Santoro, A. (2024). Reconstructing income inequality in italy: New evidence and tax system implications from distributional national accounts. Journal of the European Economic Association, 22(5), 2180-2224. Hoffmann, E. B., Malacrino, D., & Pistaferri, L. (2021). Labor market reforms and earnings dynamics: the Italian case. International Monetary Fund. Hollman, A. K., Obermier, T. R., & Burger, P. R. (2021). Rural measures: A quantitative study of the rural digital divide. Journal of Information Policy, 11, 176-201. Huang, X. (2024). Predictive models: regression, decision trees, and clustering. Applied and Computational Engineering, 79, 124-133. Hyee, R., Immervoll, H., Lee, J., & Fernandez, R. (2020). How reliable are social safety nets? Value and accessibility in situations of acute economic need. Hyun-Chool, L. E. E., & Repkine, A. (2023). Ideological Polarization and Income Inequality in the Korean Regions. Korea Journal, 63(1), 118-149. Kashwan, K. R., & Velu, C. M. (2013). Customer segmentation using clustering and data mining techniques. International Journal of Computer Theory and Engineering, 5(6), 856. Khan, S., & Yahong, W. (2022). Income inequality, ecological footprint, and carbon dioxide emissions in Asian developing economies: what effects what and how?. Environmental Science and Pollution Research, 29(17), 24660-24671. Khan, S., Yahong, W., & Zeeshan, A. (2022). Impact of poverty and income inequality on the ecological footprint in Asian developing economies: Assessment of Sustainable Development Goals. Energy Reports, 8, 670-679. Kim, Y., Sommet, N., Na, J., & Spini, D. (2022). Social class—not income inequality—predicts social and institutional trust. Social Psychological and Personality Science, 13(1), 186-198. Kocurová, T., & Hampel, D. (2020). Inequality in the Income of the Population as a Determinant of the Country's Economic Growth. Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis. Komornicki, T., & Goliszek, S. (2023). New transport infrastructure and regional development of Central and Eastern Europe. Sustainability, 15(6), 5263. Korotaj, T., Kurnoga, N., & Šimurina, N. (2023). Multivariate analysis of post-transition OECD countries in the context of inequality measures. Croatian operational research review, 14(1), 53-64. Leogrande, A., Costantiello, A., & Leogrande, D. (2023). The Socio-Economic Determinants of the Number of Physicians in Italian Regions. Li, G., Zhang, R., Feng, S., & Wang, Y. (2022). Digital finance and sustainable development: Evidence from environmental inequality in China. Business Strategy and the Environment, 31(7), 3574-3594. Lipps, J., & Schraff, D. (2021). Regional inequality and institutional trust in Europe. European Journal of Political Research, 60(4), 892-913. Ma, H. Can Innovation Input Help Reduce the Income Inequality in China?—An Analysis Based on Panel Data of 21 Provincial Regions. Maia, M., Azevedo, A. R., & Ara, A. (2021). Predictive comparison between random machines and random forests. Journal of Data Science, 19(4), 593-614. Marino, A., Pariso, P., & Picariello, M. (2023). Exploring the Economic Recovery of Italy’s Regions Post-COVID-19: A focus on Energy, Services, ICT Opportunities, and the Digital Divide. International Journal of Energy Economics and Policy, 13(5), 271-280. Mazzeo Rinaldi, F., & Leone, L. (2023). Conditional cash transfers in OECD countries: a realist synthesis. Frontiers in Sociology, 8, 1202430. Mdingi, K., & Ho, S. Y. (2021). Literature review on income inequality and economic growth. MethodsX, 8, 101402. Mercado, R. V., Park, C. Y., & Zhuang, J. (2023). Trends and drivers of income inequality in the Philippines, Thailand, and Viet Nam: A decomposition analysis (No. 692). ADB Economics Working Paper Series. Munandar, T. A. (2023). K-Means Cluster Algorithm for Grouping Inequality in Regional Development. Mussida, C., & Parisi, M. L. (2020). Features of personal income inequality before and during the crisis: An analysis of Italian regions. Regional studies. Nassif Pires, L., Carvalho, L. B. D., & Lederman Rawet, E. (2021). Multi-dimensional inequality and COVID-19 in Brazil. Investigación económica, 80(315), 33-58. Palagi, E., Coronese, M., Lamperti, F., & Roventini, A. (2022). Climate change and the nonlinear impact of precipitation anomalies on income inequality. Proceedings of the National Academy of Sciences, 119(43), e2203595119. Panzera, D., & Postiglione, P. (2022). The impact of regional inequality on economic growth: a spatial econometric approach. Regional Studies, 56(5), 687-702. Rosik, P., & Wójcik, J. (2022). Transport infrastructure and regional development: A survey of literature on wider economic and spatial impacts. Sustainability, 15(1), 548. Rossi, R., Di Lorenzo, G., Jannini, T. B., Ossola, P., Belvederi Murri, M., Siracusano, A., & Rossi, A. (2024). The role of income inequality as an ecological determinant of mental health: A nation-wide multilevel analysis on an Italian sample. International Journal of Social Psychiatry, 70(5), 999-1003. Saeed, S., & Siraj, T. (2024). Global Renewable Energy Infrastructure:: Pathways to Carbon Neutrality and Sustainability. Solar Energy and Sustainable Development Journal, 13(2), 183-203. Santamato, V., Tricase, C., Faccilongo, N., Iacoviello, M., Pange, J., & Marengo, A. (2024). Machine learning for evaluating hospital mobility: an Italian case study. Applied Sciences, 14(14), 6016. Sbardella, A., Andrea, Z., Luciano, P., & Scaramozzino, P. (2021). Behind the Italian regional divide: An economic fitness and complexity perspective. SINAPPSI, 11(2), 50-73. Shudanko, P. (2024). How to Maintain Business Sustainability and Performance in Dynamic Global Market. Journal of Current Research in Business and Economics, 3(1), 1174-1220. Sieck, C. J., Sheon, A., Ancker, J. S., Castek, J., Callahan, B., & Siefer, A. (2021). Digital inclusion as a social determinant of health. NPJ digital medicine, 4(1), 52. Sotomayor, O. J. (2021). Can the minimum wage reduce poverty and inequality in the developing world? Evidence from Brazil. World Development, 138, 105182. Stella, G. P., Filotto, U., & Cervellati, E. M. (2020). Could Financial Literacy Become a Key Variable to Examine Social and Economic Inequalities? A Study on Italian Regions. International Journal of Trade, Economics and Finance, 11(1). Subbotin, S. (2020). Radial-basis function neural network synthesis on the basis of decision tree. Optical Memory and Neural Networks, 29(1), 7-18. Sugiharti, L., Purwono, R., Esquivias, M. A., & Rohmawati, H. (2023). The nexus between crime rates, poverty, and income inequality: A case study of Indonesia. Economies, 11(2), 62. Sun, Y. Y., Li, M., Lenzen, M., Malik, A., & Pomponi, F. (2022). Tourism, job vulnerability and income inequality during the COVID-19 pandemic: A global perspective. Annals of Tourism Research Empirical Insights, 3(1), 100046. Sung, J., Qiu, Q., & Marton, J. (2021). Income Inequality and Health: New Methodology and an Application. Economics Bulletin, 41(4), 2676-2689. Suratman, E., & Mayudi, G. (2022). IMPACT OF INFLATION AND EXCHANGE RATE ON ASEAN INCOME INEQUALITY. International Journal of Business & Society, 23(1). Tan, Y., & Uprasen, U. (2021). Carbon neutrality potential of the ASEAN-5 countries: Implications from asymmetric effects of income inequality on renewable energy consumption. Journal of Environmental Management, 299, 113635. Thohari, A. N. A., & Ramadhani, R. D. (2022). Performance Comparison Supervised Machine Learning Models to Predict Customer Transaction Through Social Media Ads. J. Comput. Networks, Archit. High Perform. Comput, 4(2), 116. Tubadji, A., Gheasi, M., Crociata, A., & Odoardi, I. (2022). Cultural capital and income inequality across Italian regions. Regional Studies, 56(3), 459-475. Ullah, A., Kui, Z., Ullah, S., Pinglu, C., & Khan, S. (2021). Sustainable utilization of financial and institutional resources in reducing income inequality and poverty. Sustainability, 13(3), 1038. Van Ham, M., Tammaru, T., Ubarevičienė, R., & Janssen, H. (2021). Urban socio-economic segregation and income inequality: A global perspective (p. 523). Springer Nature. Wan, G., Wang, C., Wang, J., & Zhang, X. (2022). The income inequality-CO2 emissions nexus: Transmission mechanisms. Ecological Economics, 195, 107360. Wang, P., Li, Z., Wang, Y., & Wang, F. (2024). Unveiling the Dynamics of Educational Equity: Exploring the Third Type of Digital Divide for Primary and Secondary Schools in China. Sustainability, 16(11), 4868. Wen, C., Xiao, Y., & Hu, B. (2024). Digital financial inclusion, industrial structure and urban–Rural income disparity: Evidence from Zhejiang Province, China. Plos one, 19(6), e0303666. Xu, C., Han, M., Dossou, T. A. M., & Bekun, F. V. (2021). Trade openness, FDI, and income inequality: Evidence from sub‐Saharan Africa. African Development Review, 33(1), 193-203. Xu, G., Feng, L., Wang, W., & Liang, Q. (2024). Digital Financial Literacy and Rural Income Inequality. SAGE Open, 14(3), 21582440241275642. Xu, Q., & Zhong, M. (2023). The impact of income inequity on energy consumption: The moderating role of digitalization. Journal of Environmental Management, 325, 116464. Yang, F., Zhang, S., & Sun, C. (2020). Energy infrastructure investment and regional inequality: Evidence from China's power grid. Science of the Total Environment, 749, 142384. Yin, Z. H., & Choi, C. H. (2023). Does digitalization contribute to lesser income inequality? Evidence from G20 countries. Information Technology for Development, 29(1), 61-82. Zhang, C., Liu, B., & Yang, Y. (2024). Digital economy and urban innovation level: A quasi-natural experiment from the strategy of “Digital China”. Humanities and Social Sciences Communications, 11(1), 1-12. Zhang, J. (2021). A survey on income inequality in China. Journal of Economic Literature, 59(4), 1191-1239. |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/124910 |