Some questions about paper #2

DuinoDu · 2017-07-12T06:47:35Z

Hey, much thanks for your great work. About the paper, I have some questions if you don't mind.

For each scale feature maps, there is a seperated classifier and regressor to get class-specific score and bounding box regression. So for four scales, there are four classifiers and regressors. This might bring repeated computation. I wonder if these operations on different scales can merge in some way.
I find that objectness prior is much like rpn(region proposal network). The only difference is that objectness prior only produces a score without bbreg, which is included in rpn. I wonder if I am wrong. Please give me some tips about the differences.
For the last classifier and regressor, one uses two convs while the other uses two inceptions. I wonder the reason why you choose them.
Thanks again. If disturbed, please forgive.

mattdingmeng · 2017-07-12T21:38:58Z

@DuinoDu for the second comment, I agree with you. The objectness prior is very close to the rpn in faster rcnn. I think the main contribution of this paper is the reverse connection, which combine different scaled feature map to detect objects in different size.

chengshuai · 2017-07-13T04:00:19Z

@DuinoDu @mattdingmeng

The reverse connection is similar with the idea of the paper(the Feature Pyramid Networks for Object Detection and Deconvoluiton SSD).

taokong · 2017-07-18T04:05:15Z

@DuinoDu
For the first question, we find that not sharing features could get better detection results. Maybe you can have try about sharing weights with four scales.
The objectness prior is modified from RPN. The original RPN will do bbox regression to get better localization, however, the anchor's location will be changed after bbox reg. So Faster R-CNN must use ROI-Pooling to extract features on these changed anchors. Thus the detection module will bring repeated computations.
@mattdingmeng @chengshuai
Yes, the idea of reverse connection is similar with DSSD, FPN and TDM. In fact, the four works are developed amost at the same period. RON and FPN are both accepted by cvpr2017.

DuinoDu · 2017-07-18T04:36:09Z

Thanks!

kl456123 · 2017-07-23T15:10:20Z

I want to know why it is faster than Faster R-CNN.
Who can help me ,thanks a lot

luuuyi · 2017-08-08T06:05:55Z

@kl456123 the author mentioned it that use ROI-Pooling can bring extra computation. Meanwhile, I think discarding Fully Connection Layer also can accelerate the speed of train and inference.

guiyang882 · 2017-09-22T08:50:35Z

I want to study about the small target detection in large scale scene.
But I find that, the CNN feature Map is very important, If the CNN base model can't find the target, the regression has no meaning.
Could you give me some tips about how to advance the CNN feature model ?

twmht · 2018-02-07T15:26:04Z

@taokong

by the way, I have sent an email to you to ask some questions about hypernet (https://arxiv.org/abs/1604.00600). Please take a look if you have time:)

taokong closed this as completed Jul 29, 2017

taokong reopened this Aug 10, 2017

peyer mentioned this issue Jan 15, 2018

Train RON on KITTI Blocked in pythonlayer::forward #26

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some questions about paper #2

Some questions about paper #2

DuinoDu commented Jul 12, 2017

mattdingmeng commented Jul 12, 2017

chengshuai commented Jul 13, 2017

taokong commented Jul 18, 2017

DuinoDu commented Jul 18, 2017

kl456123 commented Jul 23, 2017

luuuyi commented Aug 8, 2017

guiyang882 commented Sep 22, 2017

twmht commented Feb 7, 2018

Some questions about paper #2

Some questions about paper #2

Comments

DuinoDu commented Jul 12, 2017

mattdingmeng commented Jul 12, 2017

chengshuai commented Jul 13, 2017

taokong commented Jul 18, 2017

DuinoDu commented Jul 18, 2017

kl456123 commented Jul 23, 2017

luuuyi commented Aug 8, 2017

guiyang882 commented Sep 22, 2017

twmht commented Feb 7, 2018