首页|GCSTG: Generating Class-Confusion-Aware Samples With a Tree-Structure Graph for Few-Shot Object Detection

GCSTG: Generating Class-Confusion-Aware Samples With a Tree-Structure Graph for Few-Shot Object Detection

扫码查看
Few-Shot Object Detection (FSOD) aims to detect the objects of novel classes using only a few manually annotated samples. With the few novel class samples, learning the inter-class relationships among foreground and constructing the corresponding class hierarchy in FSOD is a challenging task. The poor construction of the class hierarchy will result in the inter-class confusion problem, which has been identified as a primary cause of inferior performance in novel classes by recent FSOD methods. In this work, we further find that the intra-super-class confusion, where samples are misclassified as classes within their associated super-classes, is the main challenge in solving the confusion problem. To solve this issue, this work generates class-confusion-aware samples with a pre-defined tree-structure graph, for helping models to construct a precise class hierarchy. In precise, for generating class-confusion-aware samples, we add the noise into available samples and update the noise to maximize confidence scores on associated confusion categories of samples. Then, a confusion-aware curriculum learning strategy is proposed to make generated samples gradually participate in the training, which benefits the model convergence while learning the generated samples. Experimental results show that our method can be used as a plug-in in recent FSOD methods and consistently improve the model performance.

Object detectionData augmentationSemanticsNoiseTVAccuracyHorsesDetectorsData modelsComputational modeling

Longrong Yang、Hanbin Zhao、Hongliang Li、Liang Qiao、Ziwei Yang、Xi Li

展开 >

College of Computer Science and Technology, Zhejiang University, Hangzhou, China

School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China

Hikvision Research Institute, Hangzhou, China

2025

IEEE transactions on image processing

IEEE transactions on image processing

ISSN:
年,卷(期):2025.34(1)
  • 70