Skip to content

Latest commit

 

History

History

oriented_rcnn_beyond

Oriented R-CNN and Beyond

Xingxing Xie, Gong Cheng, Jiabao Wang, Ke li, Xiwen Yao, Junwei Han,

Introduction

illustration

Currently, two-stage oriented detectors are superior to single-stage competitors in accuracy, but the step of generating oriented proposals is still time-consuming, thus hindering the inference speed. This paper proposes an Oriented Region Proposal Network (Oriented RPN) to produce high-quality oriented proposals in a nearly cost-free manner. To this end, we present a novel representation manner of oriented objects, named midpoint offset representation, which avoids the complicated design of oriented proposal generation network. Built on Oriented RPN, we develop a simple yet effective oriented object detection framework, called Oriented R-CNN, which could accurately and efficiently detect oriented objects. Moreover, we extend Oriented R-CNN to the task of instance segmentation and realize a new proposal-based instance segmentation method, termed Oriented Mask R-CNN. Without bells and whistles, Oriented R-CNN achieves state-of-the-art accuracy on all 7 commonly-used oriented object detection datasets for the tasks of aerial object detection and scene text detection. More importantly, our method has the fastest speed among all detectors. For instance segmentation, Oriented Mask R-CNN also achieves the top results on the large-scale aerial instance segmentation dataset, named iSAID. We hope our methods could serve as solid baselines for oriented object detection and instance segmentation.

Datasets

First, Users need to pre-process isaid dataset, Please follow the tutorial of iSAID_Devkit to split images and get the coco format json file.

Results and models

Backbone Lr schd ms rr mask mAP Baidu Yun Google Drive
R50-FPN 1x - - 38.91 uni7 Model
R101-FPN 1x - - 39.22 nxj3 Model
R152-FPN 1x - - 40.15 6qw8 Model

Citation

@ARTICLE{orcnn_beyond,
  author={Xie, Xingxing and Cheng, Gong and Wang, Jiabao and Li, Ke and Han, Junwei},
  journal={International Journal of Computer Vision}, 
  title={Oriented R-CNN and Beyond}, 
  year={2024},
  pages={1-23},
  doi={https://doi.org/10.1007/s11263-024-01989-w}
}