A Boosting Framework for Financial Distress Prediction Based on Imbalanced Data

Authors: Zhao Dan

Conference: ICIC 2024 Posters, Tianjin, China, August 5-8, 2024

Pages: -

Keywords: Boosting framework , Financial distress prediction, Imbalanced data ,Clustering validation measure

Abstract

This study introduces a boosting framework for financial distress prediction, specifically designed for imbalanced data, and incorporates robust business logic to enhance interpretability. The framework employs a clustering algorithm to group data samples based on corporate governance features, and then determines the optimal number of clusters using a unique validation measure. An oversampling method is applied post-clustering, followed by a base prediction algorithm to predict financial distress for each cluster. The empirical analysis is conducted using imbalanced sample data from Chinese listed companies, with feature data at time $t-m$ where $m=1,2,3$ and the Special Treatment ST status at time $t$ used to train the model. The aim is to predict the occurrence of financial distress $m$ years into the future. The results demonstrate that the proposed boosting framework outperforms the base model in terms of prediction accuracy on imbalanced data.

BibTeX Citation:

@inproceedings{ICIC2024,
    author = {Zhao Dan},
    title = {A Boosting Framework for Financial Distress Prediction Based on Imbalanced Data},
    booktitle = {Proceedings of the 20th International Conference on Intelligent Computing (ICIC 2024)},
    month = {August},
    date = {5-8},
    year = {2024},
    address = {Tianjin, China},
    pages = {-},
}