Manheim, David (2018): Building Less Flawed Metrics: Dodging Goodhart and Campbell's Laws. Forthcoming in:
PDF
MPRA_paper_90649.pdf Download (190kB) |
|
Preview |
PDF
MPRA_paper_98288.pdf Download (242kB) | Preview |
Abstract
Metrics are useful for measuring systems and motivating behaviors. Unfortunately, naive application of metrics to a system can distort the system in ways that undermine the original goal. The problem was noted independently by first Campbell, then Goodhart, and in some forms it is not only common, but unavoidable due to the nature of metrics. There are two distinct but interrelated problems that must be overcome in building better metrics; first, specifying metrics more closely related to the true goals, and second, preventing the recipients from gaming the difference between the reward system and the true goal. This paper describes several approaches to designing metrics, beginning with design considerations and processes, then discussing specific strategies including secrecy, randomization, diversification, and post-hoc specification. The discussion will then address important desiderata and the trade-offs involved in each approach, and examples of how they differ, and how the issues can be addressed. Finally, the paper outlines a process for metric design for practitioners who need to design metrics, and as a basis for further elaboration in specific domains.
Item Type: | MPRA Paper |
---|---|
Original Title: | Building Less Flawed Metrics: Dodging Goodhart and Campbell's Laws |
Language: | English |
Keywords: | Metrics, Measurement, Complex Systems, Control Theory, Perverse Incentives, Cobra Effect, Goodhart's Law, Campbell's Law |
Subjects: | D - Microeconomics > D8 - Information, Knowledge, and Uncertainty D - Microeconomics > D8 - Information, Knowledge, and Uncertainty > D80 - General I - Health, Education, and Welfare > I2 - Education and Research Institutions > I26 - Returns to Education I - Health, Education, and Welfare > I2 - Education and Research Institutions > I28 - Government Policy J - Labor and Demographic Economics > J4 - Particular Labor Markets > J48 - Public Policy Z - Other Special Topics > Z1 - Cultural Economics ; Economic Sociology ; Economic Anthropology > Z18 - Public Policy |
Item ID: | 98288 |
Depositing User: | David Manheim |
Date Deposited: | 25 Jan 2020 02:21 |
Last Modified: | 25 Jan 2020 02:21 |
References: | APA (American Psychiatric Association). (2013). Diagnostic and statistical manual of mental disorders. BMC Med, 17, 133–137. Atkins, A., Wanick, V., & Wills, G. (2017). Metrics Feedback Cycle: measuring and improving user engagement in gamified eLearning systems. International Journal of Serious Games, 4(4), 3–19. Berry, L. M., & Houston, J. P. (1993). Psychology at work: An introduction to industrial and organizational psychology. Brown & Benchmark/Wm. C. Brown Publ. Blanchard, B. S., & Fabrycky, W. J. (1990). Systems engineering and analysis (4th ed.). Prentice Hall Englewood Cliffs, NJ. Borsboom, D., Mellenbergh, G. J., & van Heerden, J. (2004). The concept of validity. Psychological review, 111(4), 1061. Bradbury, A. (2014, sep). ‘Slimmed down’ assessment or increased accountability? Teachers, elections and UK government assessment policy. Oxford Review of Education, 40(5), 610–627. Retrieved from https://doi.org/10.1080/03054985.2014.963038 doi: 10.1080/03054985.2014.963038 Cames, M., Harthan, R. O., Fu¨ssler, J., Lazarus, M., Lee, C., Erickson, P., & SpaldingFecher, R. (2016). How additional is the clean development mechanism. Analysis of application of current tools and proposed alternatives. Oeko-Institut EV CLlMA. B, 3. Campbell, D. T. (1979). Assessing the impact of planned social change. Evaluation and program planning, 2(1), 67–90. Caplan, B. (2018). The case against education. Why the education system is a waste of time and money. Princeton University Press. Caudill, H. L., & Porter, C. D. (2014, dec). An Historical Perspective of Reward Systems: Lessons Learned from the Scientific Management Era. International Journal of Human Resource Studies; Vol 4, No 4 (2014)DO - 10.5296/ijhrs.v4i4.6605. Retrieved from http://www.macrothink.org/journal/index.php/ijhrs/article/view/6605 Choi, J., Hecht, G. W., & Tayler, W. B. (2012). Lost in translation: The effects of incentive compensation on strategy surrogation. The Accounting Review, 87(4), 1135–1163. Clifton, P. M., & Keogh, J. B. (2017). A systematic review of the effect of dietary saturated and polyunsaturated fat on heart disease. Nutrition, Metabolism and Cardiovascular Diseases, 27(12), 1060–1080. Dai, H., Dietvorst, B. J., Tuckfield, B., Milkman, K. L., & Schweitzer, M. E. (2017, aug). Quitting When the Going Gets Tough: A Downside of High Performance Expectations. Academy of Management Journal, 61(5), 1667–1691. Retrieved from https://doi.org/10.5465/amj.2014.1045 doi: 10.5465/amj.2014.1045 Deresiewicz, W. (2015). Excellent sheep: The miseducation of the American elite and the way to a meaningful life. Free Press. Duff, F. J., Mengoni, S. E., Bailey, A. M., & Snowling, M. J. (2015). Validity and sensitivity of the phonics screening check: implications for practice. Journal of Research in Reading, 38(2), 109–123. Faeh, D., Paccaud, F., Cornuz, J., & Chiolero, A. (2008, apr). Consequences of smoking for body weight, body fat distribution, and insulin resistance. The American Journal of Clinical Nutrition, 87(4), 801–809. Retrieved from https://dx.doi .org/10.1093/ajcn/87.4.801 doi: 10.1093/ajcn/87.4.801 Flacker, J. M., & Kiely, D. K. (2003). Mortality-related factors and 1-year survival in nursing home residents. Journal of the American Geriatrics Society, 51(2), 213–221. Fraade-Blanar, L., Blumenthal, M. S., Anderson, J. M., & Kalra, N. (2018). Measuring Automated Vehicle Safety. Frances, A. (2017). Trump isn’t crazy. Psychology Today. Retrieved from https://www. psychologytoday.com/blog/saving-normal/201701/trump-isnt-crazy. Gelman, A. (2010). Causality and Statistical Learning. American Journal of Sociology, 117(3), 955–966. Retrieved from http://arxiv.org/abs/1003.2619 doi: 10 .1086/662659 Goodhart, C. A. E. (1975). Problems of monetary management: the UK experience. In Papers in monetary economics. Reserve Bank of Australia. Herzberg, F. (1968). One more time: How do you motivate employees. Harvard Business Review Boston, MA. Hess, F. (2018, sep). Straight Up Conversation: Scholar Jay Greene on the Importance of field Trips. Education Week. Holmstrom, B., & Milgrom, P. (1991). Multitask principal-agent analyses: Incentive contracts, asset ownership, and job design. JL Econ. & Org., 7, 24. Hubbard, D. W. (2007). How to Measure Anything: finding the Value of Intangibles in Business (Second ed.). doi: 10.1002/9781118983836 Kalra, N., Hallegatte, S., Lempert, R., Brown, C., Fozzard, A., Gill, S., & Shah, A. (2014). Agreeing on Robust Decisions New Processes for Decision Making Under Deep Uncertainty. World Bank Policy Research Working Paper, No. 6906(June). doi: doi:10.1596/1813-9450-6906 Kenny, G. (2014). five questions to identify key stakeholders. HBR Harvard Business Review. Klein, G. (2007). Performing a project premortem. Harvard Business Review, 85(9), 18–19. Klein, G., Sonkin, P. D., & Johnson, P. (2019). Rendering a Powerful Tool Flaccid: The Misuse of Premortems on Wall Street. Lempert, R. J., Groves, D. G., Popper, S. W., & Bankes, S. C. (2006). A General, Analytic Method for Generating Robust Strategies and Narrative Scenarios. Management Science, 52(4), 514–528. Retrieved from http://pubsonline.informs.org/ doi/abs/10.1287/mnsc.1050.0472 doi: 10.1287/mnsc.1050.0472 Liebowitz, S., & Kelly, M. L. (2018, nov). Everything You Know About State Education Rankings Is Wrong: Minds and dollars are a terrible thing to waste. Reason. Retrieved from https://reason.com/archives/2018/10/07/everything-you-know-about-stat Liska, D. J., Cook, C. M., Wang, D. D., Gaine, P. C., & Baer, D. J. (2016). Trans fatty acids and cholesterol levels: An evidence map of the available science. Food and Chemical Toxicology, 98, 269–281. Manheim, D. (2016). Overpowered Metrics Eat Underspecified Goals (Vol. 2016). Retrieved from https://www.ribbonfarm.com/2016/09/29/soft-bias -of-underspecified-goals/ Manheim, D. (2018). Value of Information for Policy Analysis (Doctoral dissertation, Pardee RAND). Manheim, D., & Garrabrant, S. (2018). Categorizing Variants of Goodhart’s Law. , 1–10. Mika, E., & Lee, B. (2017). Who Goes Trump? Tyranny as a Triumph of Narcissism. St. Martin’s Press. Mitchell, D. J., Edward Russo, J., & Pennington, N. (1989). Back to the future: Temporal perspective in the explanation of events. Journal of Behavioral Decision Making, 2(1), 25–38. Retrieved from https://doi.org/10.1002/bdm.3960020103 doi: 10.1002/bdm.3960020103 Muller, J. Z. (2018). The tyranny of metrics. Princeton University Press. O’Keefe, C., Cihon, P., Garfinkel, B., Flynn, C., Leung, J., & Dafoe, A. (2019). The Windfall Clause: Distributing the Benefits of AI for the Common Good. arXiv preprint arXiv:1912.11595. Poulis, K., & Poulis, E. (2016). Problematizing fit and survival: transforming the law of requisite variety through complexity misalignment. Academy of Management Review, 41(3), 503–527. Rasul, I., & Rogger, D. (2017). Management of bureaucrats and public service delivery: Evidence from the nigerian civil service. The Economic Journal, 128(608), 413– 446. Rasul, I., Rogger, D., & Williams, M. (2017). Management and bureaucratic effectiveness: A scientific replication. Rasul, I., Rogger, D., & Williams, M. J. (2018). Autonomy, incentives, and the effectiveness of bureaucrats. VoxDev. Retrieved from https://voxdev.org/topic/public-economics/autonomy-incentives-and-effectiveness-bureaucrats Rodamar, J. (2017). There ought to be a law! Campbell v. Goodhart. Rogers, P. J., Petrosino, A., Huebner, T. A., & Hacsi, T. A. (2000). Program theory evaluation: Practice, promise, and problems. New directions for evaluation, 2000(87), 5–13. 23 Rosenhead, J., & Mingers, J. (2001). Rational analysis for a problematic world revisited (No. 2nd). John Wiley and Sons. Ruch, W. A. (1994). Measuring and managing individual productivity. Organizational linkages: Understanding the productivity paradox, 105–130. Saltelli, A. (2020). Ethics of quantification or quantification of ethics? Futures, 116, 102509. Retrieved from http://www.sciencedirect.com/science/article/ pii/S0016328719303714 doi: https://doi.org/10.1016/j.futures.2019.102509 Schoeller, D. A. (1990). How accurate is self-reported dietary energy intake? Nutrition reviews, 48(10), 373–379. Shorrock, S. (2019, may). Shorrock’s Law of Limits. Blog Post. Retrieved from https:// humanisticsystems.com/2019/10/24/shorrocks-law-of-limits/ Simon, H. A. (1947). Administrative behavior; a study of decision-making processes in administrative organization. Macmillan. Simon, H. A. (1956). Rational choice and the structure of the environment. Psychological review, 63(2), 129. Soares, N. (2015). Half-assing it with everything you’ve got. Retrieved 2019-07-22, from http://mindingourway.com/half-assing-it-with-everything-youve-got/ Strathern, M. (1997). ’Improving ratings’: audit in the British University system. European Review. doi: 10.1002/(SICI)1234-981X(199707)5:33.0.CO;2-4 Sturla, K., Shah, B., & McManus, J. (2018). The Great DIB-ate: Measurement for Development Impact Bonds. Stanford Social Innovation Review. Szajewska, H., & Szajewski, T. (2016). Saturated fat controversy: importance of systematic reviews and meta-analyses. Critical reviews in food science and nutrition, 56(12), 1947–1951. Taplin, D. H., & Clark, H. (2012). Theory of change basics: A primer on theory of change. van Gelder, T., Vodicka, R., & Armstrong, N. (2016, sep). Augmenting Expert Elicitation with Structured Visual Deliberation. Asia & the Pacific Policy Studies, 3(3), 378–388. Retrieved from https://doi.org/10.1002/app5.145 doi: 10.1002/app5.145 Wigert, B., & Harter, J. (2017). Re-engineering performance management. Gallup. com. Viewed: March, 6, 2019. |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/98288 |
Available Versions of this Item
-
Building Less Flawed Metrics. (deposited 21 Dec 2018 14:40)
- Building Less Flawed Metrics: Dodging Goodhart and Campbell's Laws. (deposited 25 Jan 2020 02:21) [Currently Displayed]