Logo
Munich Personal RePEc Archive

Applying CHAID for logistic regression diagnostics and classification accuracy improvement

Antipov, Evgeny and Pokryshevskaya, Elena (2009): Applying CHAID for logistic regression diagnostics and classification accuracy improvement.

[thumbnail of MPRA_paper_21499.pdf]
Preview
PDF
MPRA_paper_21499.pdf

Download (544kB) | Preview

Abstract

In this study a CHAID-based approach to detecting classification accuracy heterogeneity across segments of observations is proposed. This helps to solve some important problems, facing a model-builder: 1. How to automatically detect segments in which the model significantly underperforms? 2. How to incorporate the knowledge about classification accuracy heterogeneity across segments to partition observations in order to achieve better predictive accuracy? The approach was applied to churn data from the UCI Repository of Machine Learning Databases. By splitting the dataset into 4 parts, which are based on the decision tree, and building a separate logistic regression scoring model for each segment we increased the accuracy by more than 7 percentage points on the test sample. Significant increase in recall and precision was also observed. It was shown that different segments may have absolutely different churn predictors. Therefore such a partitioning gives a better insight into factors influencing customer behavior.

Atom RSS 1.0 RSS 2.0

Contact us: mpra@ub.uni-muenchen.de

This repository has been built using EPrints software.

MPRA is a RePEc service hosted by Logo of the University Library LMU Munich.