关键词:
Object detection
摘要:
Camouflaged object detection (COD) aims to identify target objects in complex scenes with extremely high similarity to their surroundings, and has significant applications in military, medical, and other fields. This paper proposes a hierarchical feature-enhanced aggregation network (HFANet) for COD, aiming to address the situations that the target object is highly similar to the background. First, we adopt the pyramid vision Transformer model as the backbone for feature extraction. On top of it, the object-region amplification module and deep interaction guidance module are stacked to enhance the perception of camouflaged objects in complex scenes. Second, an enhanced receptive field module is designed to improve edge perception of camouflaged objects. At last, a multi-scale interactive fusion module is designed by cross-scale connection through adjacent layers, effectively improving the accuracy of COD. The proposed method is evaluated on three challenging datasets: CAMO, CHAMELEON, and COD10K. Evaluation results demonstrate superior performance compared to the state-of-the-art methods. © Shanghai Jiao Tong University 2024.