Gorjian, Mahshid (2025): Integrating Machine Learning and Hedonic Regression for Housing Price Prediction: A Systematic International Review of Model Performance and Interpretability.
![]() |
PDF
MPRA_paper_125676.pdf Download (188kB) |
Abstract
It is becoming increasingly important to predict property prices to mitigate investment risk, establish policies, and preserve market stability. To determine the practical utility and anticipated efficacy of the sophisticated statistical and machine learning models that have emerged, a comparative analysis is required. The purpose of this systematic study is to assess the predictive effectiveness and interpretability of hedonic regression and complex machine learning models in the estimation of housing prices in a wide range of foreign scenarios. In May 2024, a thorough search was conducted in Scopus, Google Scholar, and Web of Science. The search terms included "hedonic pricing models," "machine learning," and "housing price prediction," in addition to others. The inclusion criteria required the utilization of empirical research published after 2000, a comparison of at least two predictive models, and reliable transaction data. Research that utilized non-empirical methodologies or web- scraped prices was excluded. Twenty-three investigations met the eligibility criteria. The evaluation was conducted in accordance with the reporting criteria of PRISMA 2020. Random Forest was the most frequently employed and consistently high-performing model, being selected in 14 of 23 studies and regarded as exceptional in five. Despite their lack of precision, hedonic regression models provided critical explanatory insights into critical variables, such as proximity to urban centers, property characteristics, and location. The integration of hedonic and machine learning models improved the interpretability and accuracy of the predicted results. Many of the studies included in this review were longitudinal, covered a diverse range of international contexts (specifically, Asia, Europe, America, and Australia), and demonstrated a rise in research output beyond 2020. Even though hedonic models retain a significant amount of explanatory power, the precision of home price predictions is improved by machine learning, particularly Random Forest and neural networks. The optimal results for researchers, real estate professionals, and policymakers who aim to improve market transparency and enlighten effective policy decisions are achieved through the seamless integration of these techniques.
Item Type: | MPRA Paper |
---|---|
Original Title: | Integrating Machine Learning and Hedonic Regression for Housing Price Prediction: A Systematic International Review of Model Performance and Interpretability |
Language: | English |
Keywords: | housing price prediction; machine learning; hedonic price model; Random Forest; real estate valuation; artificial neural networks; systematic review; property market analysis |
Subjects: | C - Mathematical and Quantitative Methods > C0 - General > C00 - General C - Mathematical and Quantitative Methods > C0 - General > C01 - Econometrics C - Mathematical and Quantitative Methods > C1 - Econometric and Statistical Methods and Methodology: General > C10 - General |
Item ID: | 125676 |
Depositing User: | Mahshid Gorjian |
Date Deposited: | 27 Aug 2025 08:29 |
Last Modified: | 27 Aug 2025 08:29 |
References: | Bartholomew, K., & Ewing, R. (2011). Hedonic price effects of pedestrian- and transit- oriented development. Journal of Planning Literature, 26(1), 18– 34. https://doi.org/10.1177/0885412210386540 Begum, A., Samad, M., & Chowdhury, S. (2022). Comparative analysis of predictive models for housing price estimation. International Journal of Housing Markets and Analysis, 15(3), 423–441. https://doi.org/10.1108/IJHMA-09-2021-0117 Chen, Z., Ye, L., Zhang, X., & Wu, J. (2022). Machine learning-based housing price prediction: A survey. Journal of Real Estate Research, 44(2), 241– 267. https://doi.org/10.1080/08965803.2022.2029673 Chou, J. S., Hsu, S. C., & Ho, C. C. (2022). Integrating ensemble learning and hedonic regression for real estate appraisal. Expert Systems with Applications, 188, 116026. https://doi.org/10.1016/j.eswa.2021.116026 Diewert, W. E., Heravi, S. M., & Silver, M. (2011). Hedonic imputation indexes versus time dummy hedonic indexes. In W. E. Diewert, J. Greenlees, & C. Hulten (Eds.), Price index concepts and measurement (pp. 323–352). University of Chicago Press. Glaeser, E. L., Gyourko, J., & Saks, R. E. (2016). Urban growth and housing supply. Journal of Economic Geography, 6(1), 71–89. https://doi.org/10.1093/jeg/lbj004 Goodman, A. C. (1998). Andrew Court and the invention of hedonic price analysis. Journal of Urban Economics, 44(2), 291–298. https://doi.org/10.1006/juec.1997.2050 Gorjian, M. (2024). A deep learning-based methodology to re-construct optimized re- structured mesh from architectural presentations (Doctoral dissertation, Texas A&M University). Texas A&M University. https://oaktrust.library.tamu.edu/items/0efc414a-f1a9-4ec3-bd19- f99d2a6e3392 Gorjian, M. (2025). Green gentrification and community health in urban landscape: A scoping review of urban greening’s social impacts (Version 1) [Preprint]. Research Square. https://doi.org/10.21203/rs.3.rs-7225794/v1 Gorjian, M. (2025). Green schoolyard investments and urban equity: A systematic review of economic and social impacts using spatial-statistical methods [Preprint]. Research Square. https://doi.org/10.21203/rs.3.rs-7213563/v1 Gorjian, M. (2025). Green schoolyard investments influence local-level economic and equity outcomes through spatial-statistical modeling and geospatial analysis in urban contexts. arXiv. https://doi.org/10.48550/arXiv.2507.14232 Gorjian, M. (2025). Schoolyard greening, child health, and neighborhood change: A comparative study of urban U.S. cities (arXiv:2507.08899) [Preprint]. arXiv. https://doi.org/10.48550/arXiv.2507.08899 Gorjian, M. (2025). The impact of greening schoolyards on surrounding residential property values: A systematic review (Version 1) [Preprint]. Research Square. https://doi.org/10.21203/rs.3.rs-7235811/v1 Gorjian, M. (2025, July 10). Greening schoolyards and the spatial distribution of property values in Denver, Colorado [Preprint]. arXiv. https://doi.org/10.48550/arXiv.2507.08894 Gorjian, M. (2025, July 11). The impact of greening schoolyards on residential property values [Working paper]. SSRN. https://doi.org/10.2139/ssrn.5348810 Gorjian, M. (2025, July 15). Analyzing the relationship between urban greening and gentrification: Empirical findings from Denver, Colorado. SSRN. https://doi.org/10.2139/ssrn.5353201 Gorjian, M. (2025, July 26). Greening schoolyards and urban property values: A systematic review of geospatial and statistical evidence [Preprint]. arXiv. https://doi.org/10.48550/arXiv.2507.19934 Gorjian, M. (2025, July 29). Urban schoolyard greening: A systematic review of child health and neighborhood change [Preprint]. Research Square. https://doi.org/10.21203/rs.3.rs-7232642/v1 Gorjian, M., & Quek, F. (2024). Enhancing consistency in sensible mixed reality systems: A calibration approach integrating haptic and tracking systems [Preprint]. EasyChair. https://easychair.org/publications/preprint/KVSZ Gorjian, M., Caffey, S. M., & Luhan, G. A. (2024). Exploring architectural design 3D reconstruction approaches through deep learning methods: A comprehensive survey. Athens Journal of Sciences, 11(2), 1– 29. https://www.athensjournals.gr/sciences/2024-6026-AJS-Gorjian-02.pdf Gorjian, M., Caffey, S. M., & Luhan, G. A. (2025). Analysis of design algorithms and fabrication of a graph-based double-curvature structure with planar hexagonal panels. arXiv. https://doi.org/10.48550/arXiv.2507.16171 Gorjian, M., Caffey, S. M., & Luhan, G. A. (2025). Exploring architectural design 3D reconstruction approaches through deep learning methods: A comprehensive survey. Athens Journal of Sciences, 12, 1–29. https://doi.org/10.30958/ajs.X-Y-Z Gorjian, M., Luhan, G. A., & Caffey, S. M. (2025). Analysis of design algorithms and fabrication of a graph-based double-curvature structure with planar hexagonal panels. arXiv. https://doi.org/10.48550/arXiv.2507.16171 Hong, S. H., Lee, D., & Kim, T. (2020). Comparison of machine learning models for housing price prediction. Sustainability, 12(24), 10348. https://doi.org/10.3390/su122410348 Howard, G., & Liebersohn, J. (2023). Regional effects on real estate pricing: A review. Regional Studies, 57(2), 283– 298. https://doi.org/10.1080/00343404.2022.2073381 Hoxha, E. (2024). Predicting housing prices with decision trees: Evidence from emerging markets. Journal of Property Research, 41(1), 47– 68. https://doi.org/10.1080/09599916.2023.2179073 Hwang, M., & Quigley, J. M. (2006). Economic fundamentals in local housing markets: Evidence from US metropolitan regions. Regional Science and Urban Economics, 36(2), 183–206. https://doi.org/10.1016/j.regsciurbeco.2005.03.003 Jayantha, W. M., & Oladinrin, T. O. (2019). Artificial intelligence and real estate valuation: A systematic review. Journal of Property Investment & Finance, 37(3), 223– 240. https://doi.org/10.1108/JPIF-09-2018-0068 Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30, 4765– 4774. https://doi.org/10.48550/arXiv.1705.07874 Mark, J., & Kim, M. (2007). The impact of demographic changes on housing prices: An empirical analysis. Journal of Housing Economics, 16(2), 125– 144. https://doi.org/10.1016/j.jhe.2007.06.002 Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., ... & Moher, D. (2021). The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ, 372, n71. https://doi.org/10.1136/bmj.n71 Raina, A. S., Mone, V., Gorjian, M., Quek, F., Sueda, S., & Krishnamurthy, V. R. (2024). Blended physical-digital kinesthetic feedback for mixed reality-based conceptual design-in-context. In Proceedings of the 50th Graphics Interface Conference (Article 6, pp. 1–16). ACM. https://doi.org/10.1145/3670947.3670967 Rico-Juan, J. R., & Taltavull, P. (2021). Hedonic and machine learning models for real estate valuation: A critical review. Urban Science, 5(2), 32. https://doi.org/10.3390/urbansci5020032 Rigatti, S. J. (2017). Random forest. Journal of Insurance Medicine, 49(4), 391– 395. https://doi.org/10.17849/insm-49-04-391-395.1 Rosen, S. (1974). Hedonic prices and implicit markets: Product differentiation in pure competition. Journal of Political Economy, 82(1), 34– 55. https://doi.org/10.1086/260169 Rouhiaine, N. (2018). Machine learning approaches to housing price prediction. Computers, Environment and Urban Systems, 68, 36– 43. https://doi.org/10.1016/j.compenvurbsys.2017.10.003 Schläpfer, F., Waltert, F., Segura, L., Kienast, F., & Bürgi, M. (2015). Impact of land-use and landscape pattern on real estate prices in the Swiss Alps. Ecological Economics, 112, 372–382. https://doi.org/10.1016/j.ecolecon.2015.02.017 Selim, H. (2009). Determinants of house prices in Turkey: Hedonic regression versus artificial neural network. Expert Systems with Applications, 36(2), 2843– 2852. https://doi.org/10.1016/j.eswa.2008.01.044 Shmueli, G. (2010). To explain or to predict? Statistical Science, 25(3), 289– 310. https://doi.org/10.1214/10-STS330 Soltani, A., Chua, M., & Perera, R. (2022). A comparative study of machine learning models for house price prediction in Australia. Property Management, 40(2), 209– 225. https://doi.org/10.1108/PM-08-2020-0056 Wang, T. (2023). Housing price prediction using image data and machine learning. Applied Artificial Intelligence, 37(1), 62–81. https://doi.org/10.1080/08839514.2022.2083980 Wang, Y., & Li, H. (2019). A review of mass appraisal models for real estate. Land Use Policy, 81, 263–273. https://doi.org/10.1016/j.landusepol.2018.10.035 Xiao, Q. (2017). Hedonic price modeling and housing market segmentation. Habitat International, 64, 110–118. https://doi.org/10.1016/j.habitatint.2017.05.012 |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/125676 |