Abstract (EN):
Non-traditional data like the applicant¿s bank statement is a significant source for decision-making when granting loans. We find that we can use methods from network science on the applicant¿s bank statements to convert inherent cash flow characteristics to predictors for default prediction in a credit scoring or credit risk assessment model. First, the credit cash flow is extracted from a bank statement and later converted into a visibility graph or network. Afterwards, we use this visibility network to find features that predict the borrowers¿ repayment behaviour. We see that feature selection methods select all the five extracted features. Finally, SMOTE is used to balance the training data. The model using the features from the network and the standard features together is shown having superior performance compared to the model that uses only the standard features, indicating the network features¿ predictive power. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Language:
English
Type (Professor's evaluation):
Scientific