Abstract
Churn prediction is vital to companies as to identify potential churners and prevent losses in advance. Although it has been addressed as a classification task and a variety of models have been employed in practice, fund management services have presented several special challenges. One is that financial data is extremely imbalanced since only a tiny proportion of customers leave every year. Another is a unique cost-sensitive learning problem, i.e., costs of wrong predictions for churners should be related to their account balances, while costs of wrong predictions for non-churners should be the same. To address these issues, this paper proposes a new churn prediction model based on ensemble learning. In our model, multiple classifiers are built using sampled datasets to tackle the imbalanced data issue while exploiting data fully. Moreover, a novel sampling strategy is proposed to deal with the unique cost-sensitive issue. This model has been deployed in one of the leading fund management institutions in Australia, and its effectiveness has been fully validated in real applications. Copyright © 2018 Springer International Publishing AG, part of Springer Nature.
Original language | English |
---|---|
Title of host publication | Database systems for advanced applications: 23rd International Conference, DASFAA 2018, proceedings, part II |
Editors | Jian PEI, Yannis MANOLOPOULOS, Shazia SADIQ, Jianxin LI |
Publisher | Springer |
Pages | 776-788 |
ISBN (Print) | 9783319914572 |
DOIs | |
Publication status | Published - 2018 |