Munich Personal RePEc Archive

Evaluation of technology clubs by clustering: A cautionary note

Andres, Antonio Rodriguez and Otero, Abraham and Amavilah, Voxi Heinrich (2021): Evaluation of technology clubs by clustering: A cautionary note. Forthcoming in: Applied Economics (2021)

[img]
Preview
PDF
MPRA_paper_109138.pdf

Download (727kB) | Preview

Abstract

Applications of machine learning techniques to economic problems are increasing. These are powerful techniques with great potential to extract insights from economic data. However, care must be taken to apply them correctly, or the wrong conclusions may be drawn. In the technology clubs literature, after applying a clustering algorithm, some authors train a supervised machine learning technique, such as a decision tree or a neural network, to predict the label of the clusters. Then, they use some performance metric (typically, accuracy) of that prediction as a measure of the quality of the clustering configuration they have found. This is an error with potential negative implications for policy, because obtaining a high accuracy in such a prediction does not mean that the clustering configuration found is correct. This paper explains in detail why this modus operandi is not sound from theoretical point of view and uses computer simulations to demonstrate it. We caution policy and indicate the direction for future investigations.

Logo of the University Library LMU Munich
MPRA is a RePEc service hosted by
the University Library LMU Munich in Germany.