Abstract
Object detection in very high resolution (VHR) remote sensing images is a crucial task that has seen remarkable progress with the development of deep learning techniques. However, deep learning-based methods rely heavily on the quality and quantity of labeled data. Although few-shot object detection (FSOD) can mitigate this dependency, existing methods still face challenges, including domain shifts between base and novel classes, misclassification due to class similarities, and a limited ability to extract effective information from a few samples. VHR remote sensing images exacerbate these issues because of their greater intra-class diversity and weaker inter-class separability. To address these issues, we propose a new FSOD network, the Adaptive Meta-Knowledge Transfer Network (AMTN). AMTN adaptively transfers meta-knowledge from the source domain to the target domain by effectively extracting valid information and maintaining stronger discriminative ability between objects of similar classes in low-shot scenarios. Specifically, because the higher frequency-domain resolution of VHR remote sensing images provides richer frequency-domain information, we employ a Spatial-Frequency Joint Enhancement (SFJE) model to achieve dual enhancement of query image features by fusing information across the spatial and frequency domains. Moreover, we propose the Adaptive Reweighting (AR) loss to enhance the sensitivity of novel class detection in limited-sample scenarios. Finally, we propose the Sim-Meta loss to enhance the discriminability of similar classes. AMTN has demonstrated effectiveness and stability through multiple rounds of experiments on the large-scale DIOR dataset while also achieving rapid transfer of meta-knowledge. Copyright © 2024 The Author(s).
Original language | English |
---|---|
Article number | 103675 |
Journal | International Journal of Applied Earth Observation and Geoinformation |
Volume | 127 |
Early online date | Feb 2024 |
DOIs | https://doi.org/10.1016/j.jag.2024.103675 |
Publication status | Published - Mar 2024 |
Citation
Chen, X., Jiang, W., Qi, H., Liu, M., Ma, H., Yu, P. L. H., Wen, Y., Ham, Z., Zhang, S., & Cao, G. (2024). Adaptive meta-knowledge transfer network for few-shot object detection in very high resolution remote sensing images. International Journal of Applied Earth Observation and Geoinformation, 127, Article 103675. https://doi.org/10.1016/j.jag.2024.103675
Keywords
- Few-shot learning
- Object detection
- Convolutional neural network
- Deep learning
- Remote sensing