Performance Optimization of Imbalanced Intrusion Detection Data Classification Based on Voting Approach
Authors:
Guannan Wen and Zhenzhou An
Conference:
ICIC 2025 Posters, Ningbo, China, July 26-29, 2025
Pages:
1026-1043
Keywords:
Intrusion detection, Machine learning, Ensemble learning
Abstract
Intrusion detection systems IDSs often suffer from the category imbalance problem, i.e., malicious traffic is much less than normal traffic, which results in inefficient system detection. This paper proposed a data generation technique incorporating Pearson's correlation coefficient to solve this problem. At the same time, feature selection based on chi-square distribution and integrated learning based on voting method are also used for model optimization. In the data generation stage, the Pearson correlation coefficient-based sample similarity calculation was introduced to improve the Borderline-SMOTE method to generate high-quality data and reduce the negative impact of low-quality samples on the classification model. Different experiments were conducted for multiple machine learning methods in the model optimization phase and selected the most effective combination. Experiments on three public datasets, UNSW-NB15, NSL-KDD, and CIC-IDS-2017, proved the effectiveness of the method, especially in the detection of a few categories, which achieved significant improvement, especially in the UNSW-NB15 dataset, the F1 scores of the few categories of Analysis and Backdoor had 42$ $ and 39$ $ improvement, respectively. In addition, a new evaluation metric, Mean Category Accuracy MCA , was proposed, which provides a more balanced assessment of the detection performance of all attack types.
BibTeX Citation:
@inproceedings{ICIC2025,
author = {Guannan Wen and Zhenzhou An},
title = {Performance Optimization of Imbalanced Intrusion Detection Data Classification Based on Voting Approach},
booktitle = {Proceedings of the 21st International Conference on Intelligent Computing (ICIC 2025)},
month = {July},
date = {26-29},
year = {2025},
address = {Ningbo, China},
pages = {1026-1043},
note = {Poster Volume â… }
doi = {
10.65286/icic.v21i1.23529}
}