Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some questions about paper #2

Open
DuinoDu opened this issue Jul 12, 2017 · 8 comments
Open

Some questions about paper #2

DuinoDu opened this issue Jul 12, 2017 · 8 comments

Comments

@DuinoDu
Copy link

DuinoDu commented Jul 12, 2017

Hey, much thanks for your great work. About the paper, I have some questions if you don't mind.

  1. For each scale feature maps, there is a seperated classifier and regressor to get class-specific score and bounding box regression. So for four scales, there are four classifiers and regressors. This might bring repeated computation. I wonder if these operations on different scales can merge in some way.
  2. I find that objectness prior is much like rpn(region proposal network). The only difference is that objectness prior only produces a score without bbreg, which is included in rpn. I wonder if I am wrong. Please give me some tips about the differences.
  3. For the last classifier and regressor, one uses two convs while the other uses two inceptions. I wonder the reason why you choose them.
    Thanks again. If disturbed, please forgive.
@mattdingmeng
Copy link

@DuinoDu for the second comment, I agree with you. The objectness prior is very close to the rpn in faster rcnn. I think the main contribution of this paper is the reverse connection, which combine different scaled feature map to detect objects in different size.

@chengshuai
Copy link

@DuinoDu @mattdingmeng

The reverse connection is similar with the idea of the paper(the Feature Pyramid Networks for Object Detection and Deconvoluiton SSD).

@taokong
Copy link
Owner

taokong commented Jul 18, 2017

@DuinoDu
For the first question, we find that not sharing features could get better detection results. Maybe you can have try about sharing weights with four scales.
The objectness prior is modified from RPN. The original RPN will do bbox regression to get better localization, however, the anchor's location will be changed after bbox reg. So Faster R-CNN must use ROI-Pooling to extract features on these changed anchors. Thus the detection module will bring repeated computations.
@mattdingmeng @chengshuai
Yes, the idea of reverse connection is similar with DSSD, FPN and TDM. In fact, the four works are developed amost at the same period. RON and FPN are both accepted by cvpr2017.

@DuinoDu
Copy link
Author

DuinoDu commented Jul 18, 2017

Thanks!

@kl456123
Copy link

I want to know why it is faster than Faster R-CNN.
Who can help me ,thanks a lot

@taokong taokong closed this as completed Jul 29, 2017
@luuuyi
Copy link

luuuyi commented Aug 8, 2017

@kl456123 the author mentioned it that use ROI-Pooling can bring extra computation. Meanwhile, I think discarding Fully Connection Layer also can accelerate the speed of train and inference.

@taokong taokong reopened this Aug 10, 2017
@guiyang882
Copy link

I want to study about the small target detection in large scale scene.
But I find that, the CNN feature Map is very important, If the CNN base model can't find the target, the regression has no meaning.
Could you give me some tips about how to advance the CNN feature model ?

@twmht
Copy link

twmht commented Feb 7, 2018

@taokong

by the way, I have sent an email to you to ask some questions about hypernet (https://arxiv.org/abs/1604.00600). Please take a look if you have time:)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants