Key frame extraction based on sparse coding with deep frame features
Authors:
Yujie Li, Ning Liu, Depeng Chen, Shuxue Ding, and Benying Tan
Conference:
ICIC 2025 Posters, Ningbo, China, July 26-29, 2025
Pages:
485-498
Keywords:
Key frame extraction, Sparse coding, Deep learning, Feature extraction, YOLO-MLP.
Abstract
Key frame extraction based on sparse coding can present the entire video with a small number of key frames, reducing the redundancy of the video. However, existing sparse coding-based methods use raw video frame features, which leads to high computational complexity and significant time consumption. In this paper, we propose a novel key frame extraction method based on sparse coding and deep frame features KSC-DFF to address these challenges. First, we obtain deep frame features using a deep neural network, which can reduce the dimensionality of the input video data and generate deep frame features such as the main object features of the frame. To automatically extract deep frame features, a YOLO-based deep neural network called YOLO-MLP was designed for video feature extraction. Then, we used sparse coding to extract key frames based on deep frame features, which can reduce information redundancy and computation time while maintaining high accuracy. Experimental results on SumMe demonstrate that the proposed KSC- DFF outperforms the existing methods with an increase of 49.4 and a time reduction of nearly 98 compared to the conventional sparse coding-based method SMRS.
BibTeX Citation:
@inproceedings{ICIC2025,
author = {Yujie Li, Ning Liu, Depeng Chen, Shuxue Ding, and Benying Tan},
title = {Key frame extraction based on sparse coding with deep frame features},
booktitle = {Proceedings of the 21st International Conference on Intelligent Computing (ICIC 2025)},
month = {July},
date = {26-29},
year = {2025},
address = {Ningbo, China},
pages = {485-498},
}