Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the SPP layer #2

Open
WanliOuyang opened this issue Mar 21, 2018 · 20 comments
Open

About the SPP layer #2

WanliOuyang opened this issue Mar 21, 2018 · 20 comments

Comments

@WanliOuyang
Copy link

TypeError: Expected binary or unicode string , got None
in line: spp = tf.reshape(max_pool, [num_sample, -1])
How to resolve it ?

@peace195
Copy link
Owner

peace195 commented Mar 21, 2018

Which tensorflow version do you use?

You should use tensorflow version 1.4.0 or higher.

@WanliOuyang
Copy link
Author

ok, My tensorflow version 1.1.0. maybe is lower.

@WanliOuyang
Copy link
Author

How should the code be modified if it runs on a relatively low version of tensorflow?
Thanks

@peace195
Copy link
Owner

You can search the structure of tf.reshape function in tensorflow 1.1.0 then modify my code using this structure.

Anyway, you should upgrade your tensorflow to newest version and prepare the dataset carefully.

@WanliOuyang
Copy link
Author

Is the model able to ascend to the pool of 3D data? for example, videos

@peace195
Copy link
Owner

Only 2D data.

Best regards,

Binh Do

@WanliOuyang
Copy link
Author

@peace195 When I use a fixed batch size value(such as batch_size=4), the code can run, but when training, the precision of the training keeps fluctuating on a very small number(such as the mean accuracy is 0.16). Why is this?

@peace195
Copy link
Owner

peace195 commented Apr 9, 2018

@WanliOuyang I'm so sorry because I was busy these days. Did you solve that problem? I'm very appreciated that you might help me to fix it. I think you can try with large batch size and small learning rate.

@peace195 peace195 reopened this Apr 9, 2018
@CXSunshine
Copy link

I also want to use this to deal with 3D data.I am confused with the input of the SPP(such as the size of the imag)

@hbb21st
Copy link

hbb21st commented Jun 5, 2018

Hi, many thanks for your code, I have a small question about the way deal with your x = tf.placeholder('float', shape = x_train.get_shape()), as TRAINING_ITERATIONS going on, the leaking mem happens and memory usage quickly increase, how you overcome it?

@tkone2018
Copy link

Hi,thanks for your code,do you use the multi_scale image inputs in your experiment?

@peace195
Copy link
Owner

peace195 commented Jul 20, 2018 via email

@tkone2018
Copy link

Ok,thank you.I want to use like this network to realize the image retrieval ,but I don't know how to build my own network ,can you help me ? Thank you very much.

@peace195
Copy link
Owner

peace195 commented Jul 20, 2018 via email

@tkone2018
Copy link

Ok, thank you

@tkone2018
Copy link

Hello, h_strd =math.floor(float(previous_conv_size[0]) / out_pool_size[i])
w_strd = math.floor(float(previous_conv_size[1]) / out_pool_size[i])
the stride size should be math.floor not math.ceil acoording to the Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition paper.Maybe you think math.ceil is also right.

@ProjectDimlight
Copy link

Hello, I suggest that the shape of the tf.nn.max_pool in the for-loop should be:
ksize = [1, h_size, w_size, 1]
instead of :
ksize = [1, h_size, h_size, 1]
As the input is always in the shape of a square, this might not be a big issue, but somehow confusing?

@zjz5250
Copy link

zjz5250 commented Jan 5, 2020

hello,can the input data hava different shape?

@peace195
Copy link
Owner

peace195 commented Jan 6, 2020

@zjz5250 zj Yes.

@zjz5250
Copy link

zjz5250 commented Jan 6, 2020 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

7 participants