Multi-patch Feature Pyramid Network for Weakly Supervised Object Detection in Optical Remote Sensing Images

Shamsolmoali, P; Chanussot, J; Zareapoor, M; Zhou, Huiyu; Yang, J

journal contribution

posted on 2021-09-03, 10:23 authored by P Shamsolmoali, J Chanussot, M Zareapoor, Huiyu Zhou, J Yang

Object detection is a challenging task in remote sensing because objects occupy only a few pixels in the images, and models must learn object locations and detection simultaneously. Although established approaches perform well on objects of regular size, they perform poorly on small objects or get stuck in local minima (e.g., false object parts). Two issues stand in their way. First, existing methods struggle to detect small objects reliably because of complicated backgrounds. Second, most standard methods use hand-crafted features and do not work well on objects with missing parts. We address these issues by proposing a new architecture, the multiple patch feature pyramid network (MPFP-Net). Unlike current models, which pursue only the most discriminative patches during training, MPFP-Net divides the patches into class-affiliated subsets in which the patches are related; based on the primary loss function, a sequence of smooth loss functions is determined for the subsets to improve the model's ability to collect small object parts. To enhance the feature representation for patch selection, we introduce an effective method to regularize the residual values and make the fusion transition layers strictly norm-preserving. The network contains bottom-up and crosswise connections that fuse features of different scales, achieving better accuracy than several state-of-the-art object detection models. The developed architecture is also more efficient than the baselines.

Funding

NSFC (No: 61876107, U1803261) and Committee of Science and Technology, Shanghai (No. 19510711200)

History

Author affiliation

School of Informatics

Version

AM (Accepted Manuscript)

Published in

IEEE Transactions on Geoscience and Remote Sensing

Volume

60

Publisher

Institute of Electrical and Electronics Engineers

issn

0196-2892

eissn

1558-0644

Acceptance date

2021-08-18

Copyright date

2021

Available date

2021-09-03

Publisher DOI

https://doi.org/10.1109/TGRS.2021.3106442

Language

en

Publisher version

https://ieeexplore.ieee.org/document/9524844

Keywords

Multiple patch learning; multi-scale objects detection; feature fusion; remote sensing images
