TY - JOUR
T1 - Improving direct mail targeting through customer response modeling
AU - Coussement, K.
AU - Harrigan, Paul
AU - Benoit, D.F.
PY - 2015
Y1 - 2015
N2 - Direct marketing is an important tool in the promotion mix of companies, amongst which direct mailing is crucial. One approach to improve direct mail targeting is response modeling, i.e. a predictive modeling approach that assigns future response probabilities to customers based on their history with the company. The contributions to the response modeling literature are three-fold. First, we introduce well-known statistical and data-mining classification techniques (logistic regression, linear and quadratic discriminant analysis, naïve Bayes, neural networks, decision trees, including CHAID, CART and C4.5, and the k-NN algorithm) to the direct marketing community. Second, we run a predictive benchmarking study using the above classifiers on four real-life direct marketing datasets. The 10-fold cross-validated area under the receiver operating characteristics curve is used as evaluation metric. Third, we give managerial insights that facilitate the classifier choice based on the trade-off between interpretability and predictive performance of the classifier. The findings of the benchmark study show that data-mining algorithms (CHAID, CART and neural networks) perform well on this test bed, followed by simplistic statistical classifiers like logistic regression and linear discriminant analysis. It is shown that quadratic discriminant analysis, naïve Bayes, C4.5 and the k-NN algorithm yield poor performance.
AB - Direct marketing is an important tool in the promotion mix of companies, amongst which direct mailing is crucial. One approach to improve direct mail targeting is response modeling, i.e. a predictive modeling approach that assigns future response probabilities to customers based on their history with the company. The contributions to the response modeling literature are three-fold. First, we introduce well-known statistical and data-mining classification techniques (logistic regression, linear and quadratic discriminant analysis, naïve Bayes, neural networks, decision trees, including CHAID, CART and C4.5, and the k-NN algorithm) to the direct marketing community. Second, we run a predictive benchmarking study using the above classifiers on four real-life direct marketing datasets. The 10-fold cross-validated area under the receiver operating characteristics curve is used as evaluation metric. Third, we give managerial insights that facilitate the classifier choice based on the trade-off between interpretability and predictive performance of the classifier. The findings of the benchmark study show that data-mining algorithms (CHAID, CART and neural networks) perform well on this test bed, followed by simplistic statistical classifiers like logistic regression and linear discriminant analysis. It is shown that quadratic discriminant analysis, naïve Bayes, C4.5 and the k-NN algorithm yield poor performance.
U2 - 10.1016/j.eswa.2015.06.054
DO - 10.1016/j.eswa.2015.06.054
M3 - Article
SN - 0957-4174
VL - 42
SP - 8403
EP - 8412
JO - Expert Systems with Applications
JF - Expert Systems with Applications
IS - 22
ER -