Spectral Decomposition and Transformation for Cross-domain Few-shot Learning.

Liu, Yicong; Zou, Yixiong; Li, Ruixuan; Li, Yuhua

Liu, Yicong; Zou, Yixiong; Li, Ruixuan; Li, Yuhua.

Affiliation

Liu Y; School of Computer Science and Technology, Huazhong University of Science and Technology, 1037 Luoyu Road, Wuhan, 430070, Hubei, China.
Zou Y; School of Computer Science and Technology, Huazhong University of Science and Technology, 1037 Luoyu Road, Wuhan, 430070, Hubei, China. Electronic address: yixiongz@hust.edu.cn.
Li R; School of Computer Science and Technology, Huazhong University of Science and Technology, 1037 Luoyu Road, Wuhan, 430070, Hubei, China.
Li Y; School of Computer Science and Technology, Huazhong University of Science and Technology, 1037 Luoyu Road, Wuhan, 430070, Hubei, China.

Neural Netw ; 179: 106536, 2024 Jul 14.

Article de En | MEDLINE | ID: mdl-39089156

ABSTRACT

ABSTRACT

Cross-domain few-shot Learning (CDFSL) is proposed to first pre-train deep models on a source domain dataset where sufficient data is available, and then generalize models to target domains to learn from only limited data. However, the gap between the source and target domains greatly hampers the generalization and target-domain few-shot finetuning. To address this problem, we analyze the domain gap from the aspect of frequency-domain analysis. We find the domain gap could be reflected by the compositions of source-domain spectra, and the lack of compositions in the source datasets limits the generalization. Therefore, we aim to expand the coverage of spectra composition in the source datasets to help the source domain cover a larger range of possible target-domain information, to mitigate the domain gap. To achieve this goal, we propose the Spectral Decomposition and Transformation (SDT) method, which first randomly decomposes the spectrogram of the source datasets into orthogonal bases, and then randomly samples different coordinates in the space formed by these bases. We integrate the above process into a data augmentation module, and further design a two-stream network to handle augmented images and original images respectively. Experimental results show that our method achieves state-of-the-art performance in the CDFSL benchmark dataset.

Mots clés

Cross-domain few-shot learning; Frequency-domain analysis

Texte intégral

Ajouter à My VHL

Imprimer

XML

PubMed Links

Recherche sur Google

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Langue: En Journal: Neural Netw Sujet du journal: NEUROLOGIA Année: 2024 Type de document: Article Pays d'affiliation: Chine

Texte intégral

Ajouter à My VHL

Imprimer

XML

PubMed Links

Recherche sur Google