posted on 2022-01-07, 16:38authored byD Liang, Q Geng, Z Wei, D Vorontsov, E Kim, M Wei, Huiyu Zhou
Object detection has made tremendous strides in computer vision. Small object detection with appearance degradation is a prominent challenge, especially for aerial observations. To collect sufficient positive/negative samples for heuristic training, most object detectors preset region anchors in order to calculate Intersection-over-Union (IoU) against the ground-truthed data. In this case, small objects are frequently abandoned or mislabeled. In this paper, we present an effective Dynamic Enhancement Anchor (DEA) network to construct a novel training sample generator. Different from the other state-of-the-art techniques, the proposed network leverages a sample discriminator to realize interactive sample screening between an anchor-based unit and an anchor-free unit to generate eligible samples. Besides, multi-task joint training with a conservative anchor-based inference scheme enhances the performance of the proposed model while reducing computational complexity. The proposed scheme supports both oriented and horizontal object detection tasks. Extensive experiments on two challenging aerial benchmarks (i.e., DOTA and HRSC2016) indicate that our method achieves state-of-the-art performance in accuracy with moderate inference speed and computational overhead for training. On DOTA, DEA-Net surpasses the other state-of-the-art by 0.40% mean-Average-Precision (mAP) for oriented object detection with a weaker backbone network (ResNet-101vsResNet-152) and 3.08% mean-Average-Precision (mAP)for horizontal object detection with the same backbone. OnHRSC2016, it surpasses the previous best model by 1.1% using only 3 horizontal anchors.
History
Author affiliation
School of Computing and Mathematical Sciences, University of Leicester
Version
AM (Accepted Manuscript)
Published in
IEEE Transactions on Geoscience and Remote Sensing
Volume
60
Publisher
Institute of Electrical and Electronics Engineers (IEEE)