A Boosting Framework for Financial Distress Prediction Based on Imbalanced Data
Authors:
Zhao Dan
Conference:
ICIC 2024 Posters, Tianjin, China, August 5-8, 2024
Pages:
-
Keywords:
Boosting framework , Financial distress prediction, Imbalanced data ,Clustering validation measure
Abstract
This study introduces a boosting framework for financial distress prediction, specifically designed for imbalanced data, and incorporates robust business logic to enhance interpretability. The framework employs a clustering algorithm to group data samples based on corporate governance features, and then determines the optimal number of clusters using a unique validation measure. An oversampling method is applied post-clustering, followed by a base prediction algorithm to predict financial distress for each cluster. The empirical analysis is conducted using imbalanced sample data from Chinese listed companies, with feature data at time $t-m$ where $m=1,2,3$ and the Special Treatment ST status at time $t$ used to train the model. The aim is to predict the occurrence of financial distress $m$ years into the future. The results demonstrate that the proposed boosting framework outperforms the base model in terms of prediction accuracy on imbalanced data.
BibTeX Citation:
@inproceedings{ICIC2024,
author = {Zhao Dan},
title = {A Boosting Framework for Financial Distress Prediction Based on Imbalanced Data},
booktitle = {Proceedings of the 20th International Conference on Intelligent Computing (ICIC 2024)},
month = {August},
date = {5-8},
year = {2024},
address = {Tianjin, China},
pages = {-},
}