LuNet (train from scratch) #44

Open
kailigo opened this issue Jun 21, 2018 · 4 comments

Comments

@kailigo

kailigo commented Jun 21, 2018

Nice work. Would it be possible to release the implementation of the proposed LuNet? Thanks.

@lucasb-eyer
Member

Hi, it was initially not planned (too much code to port) but I had to do it anyways for my thesis, so I can confirm that I have a working LuNet reproducing the paper's result in this codebase.

I will make a PR adding it to this repo sometime next week; I have a few other things to do first. Ping me here again if there's no news by the middle of next week.

@kailigo
Author

kailigo commented Jun 21, 2018

Thanks for the reply. I will send you a reminder next week.

Did you try to train from scratch using resnet50? I tried, but failed. Without good initialization, the model tends to get stuck in bad local minima, resulting in quite low accuracy. I did try tuning the learning rate, but still failed to get reasonable results.

I guess you must have tried training from scratch with resnet50. How did your experiments go? Do you have any suggestions for improving it? Thanks.

@lucasb-eyer
Member

Yeah, I agree with you re: initialization. And to make matters worse, I discovered problems with TF's ortho init, which we use for the embedding layer. I'll soon make another PR which replaces it with Xavier init, which is also what we originally used in the paper. You could try that already.
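For reference, Xavier (Glorot) uniform initialization can be sketched without any framework. This is only a minimal illustration of the technique, not the code from this repo or the planned PR; the function name `glorot_uniform` and the layer sizes are assumptions made up for the example.

```python
import math
import random


def glorot_uniform(fan_in, fan_out, seed=None):
    """Return a fan_in x fan_out weight matrix (a list of rows) drawn from
    U(-limit, limit), where limit = sqrt(6 / (fan_in + fan_out))."""
    rng = random.Random(seed)
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-limit, limit) for _ in range(fan_out)]
            for _ in range(fan_in)]


# Hypothetical sizes: a 1024-d feature vector projected to a 128-d embedding.
weights = glorot_uniform(fan_in=1024, fan_out=128, seed=0)
```

The limit is chosen so that the weight variance is 2 / (fan_in + fan_out), which keeps activation and gradient scales roughly balanced across the layer. In TensorFlow the equivalent is usually obtained by passing a Glorot/Xavier initializer to the embedding layer instead of an orthogonal one.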

But no, as far as I remember, we haven't tried getting best possible results with resnet50 from scratch, and I'm not sure if it's a worthwhile time investment. But if you do try, I'd be happy to hear about your results.

@percarlsen

Hi, I can't seem to find the LuNet implementation. Do you still plan on uploading the code?
Thanks
