TFRS: A task-level feature rectification and separation method for few-shot video action recognition.
Qin, Yanfei; Liu, Baolin.
Affiliation
  • Qin Y; School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, PR China.
  • Liu B; School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, PR China. Electronic address: liubaolin@ustb.edu.cn.
Neural Netw ; 176: 106326, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38688066
ABSTRACT
Few-shot video action recognition (FS-VAR) is a challenging task that requires models with enough expressive power to identify previously unseen classes from only a few labeled examples. However, because the number of support samples is limited, model performance is highly sensitive to the distribution of the sampled data. The support data are not representative enough to cover the entire class, and the support features may contain shared information that confuses the classifier, leading to biased classification. To address this difficulty, we present a task-level feature rectification and separation (TFRS) method that effectively resolves the sample bias issue. Our main idea is to leverage prior information from base classes to rectify the support samples while removing the commonality of task-level features. This enhances the distinguishability and separability of features in the feature space. Furthermore, TFRS offers a straightforward yet versatile solution that can be seamlessly integrated into various established FS-VAR frameworks. Integrating TFRS into several existing methods yields significant performance gains, with competitive results on datasets such as UCF101, Kinetics, SSv2, and HMDB51.
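The abstract does not give the formulation, so the following is only an illustrative sketch of the two ideas it names, not the authors' method. It assumes each support class is summarized by one feature vector, that "prior information from base classes" can be approximated by averaging the k most cosine-similar base-class mean features, and that "removing commonality" can be approximated by subtracting the task-wide mean feature. The function name `rectify_and_separate` and the parameters `k` and `alpha` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def rectify_and_separate(support, base_means, k=2, alpha=0.5):
    """Hypothetical sketch: rectify support features, then separate them.

    support:    one feature vector per novel class in the task
    base_means: one mean feature vector per base class
    k, alpha:   assumed hyperparameters (neighbours used, mixing weight)
    """
    rectified = []
    for proto in support:
        # Rectification (assumed form): pull the support feature toward
        # the average of its k most similar base-class means.
        ranked = sorted(base_means, key=lambda b: cosine(proto, b), reverse=True)
        prior = mean_vector(ranked[:k])
        rectified.append(
            [alpha * p + (1 - alpha) * q for p, q in zip(proto, prior)]
        )
    # Separation (assumed form): subtract the component shared by every
    # class in this task, i.e. the task-level mean feature.
    common = mean_vector(rectified)
    return [[x - c for x, c in zip(vec, common)] for vec in rectified]


# Toy 2-way task with 3-dimensional features and three base classes.
support = [[1.0, 0.2, 0.0], [0.1, 1.0, 0.0]]
base_means = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
separated = rectify_and_separate(support, base_means, k=1)
```

After the subtraction step, the features of a task sum to zero along every dimension, so whatever the classes share no longer contributes to distances between them. The actual TFRS operations may differ substantially from this sketch.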

Full text: 1 | Collection: 01-internacional | Database: MEDLINE | Main subject: Video Recording / Pattern Recognition, Automated | Limits: Humans | Language: English | Journal: Neural Netw | Journal subject: Neurology | Year: 2024 | Document type: Article | Country of publication: United States