Munich Personal RePEc Archive

Predicting the price of second-hand vehicles using data mining techniques

Jafari Kang, Masood and Zohoori, Sepideh and Abbasi, Elahe and Li, Yueqing and Hamidi, Maryam (2019): Predicting the price of second-hand vehicles using data mining techniques. Published in: IISE Annual Conference Proceedings (2 November 2020)


Download (373kB) | Preview


The electronic commerce, known as “E-commerce”, has been boosted rapidly in recent years, and makes it possible to record all information such as price, location, customer’s review, search history, discount options, competitor’s price, and so on. Accessing to such rich source of data, companies can analyze their users’ behavior to improve the customer satisfaction as well as the revenue. This study aims to estimate the price of used light vehicles in a commercial website, Divar, which is a popular website in Iran for trading second-handed goods. At first, highlighted features were extracted from the description column using the three methods of Bag of Words (BOW), Latent Dirichlet Allocation (LDA), and Hierarchical Dirichlet Process (HDP). Second, a multiple linear regression model was fit to predict the product price based on its attributes and the highlighted features. The accuracy index of Actuals-Predictions Correlation, the min-max index, and MAPE methods were used to validate the proposed methods. Results showed that the BOW model is the best model with an Adjusted R-square of 0.7841.

MPRA is a RePEc service hosted by
the Munich University Library in Germany.