Multi-view GCN for loan default risk prediction

Zihao LI, Yakun CHEN, Xianzhi WANG, Lina YAO, Guandong XU

Research output: Contribution to journalArticlespeer-review

Abstract

As a significant application of machine learning in financial scenarios, loan default risk prediction aims to evaluate the client’s default probability. However, most existing deep learning solutions treat each application as an independent individual, neglecting the explicit connections among different application records. Besides, these attempts suffer from the problem of missing data and imbalanced distribution (i.e., the default records are small samples against all the applications). We believe similar records could provide some auxiliary signals, which are of critical importance to alleviate the data missing issue and facilitate data argumentation. To this end, we propose multi-view loan application graphs, dubbed MLAGs. By evaluating the similarity between the records, a loan application graph can be constructed. Furthermore, we arrange different similarity thresholds to organize various graph structures for multi-graph constructions; thus, a variety of representations can be generated via information propagation and aggregation for small sample argumentation. Consequently, the imbalanced data distribution and missing values issues can be alleviated effectively. We conduct experiments on three public datasets from real-world home credit and P2P lending platforms, which show that MGCN outperforms both conventional and deep learning models. Ablation studies also illustrated the validity of each module design. Copyright © 2024 The Author(s).

Original languageEnglish
Pages (from-to)12149-12162
JournalNeural Computing and Applications
Volume36
Early online dateApr 2024
DOIs
Publication statusPublished - Jul 2024

Citation

Li, Z., Chen, Y., Wang, X., Yao, L., & Xu, G. (2024). Multi-view GCN for loan default risk prediction. Neural Computing and Applications, 36, 12149-12162. https://doi.org/10.1007/s00521-024-09695-x

Keywords

  • Loan default prediction
  • Graph neural network
  • Multi-view graphs
  • Heterogeneous and unbalanced data

Fingerprint

Dive into the research topics of 'Multi-view GCN for loan default risk prediction'. Together they form a unique fingerprint.