Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Classic attention baseline model #17

Open
zbetmen1 opened this issue Dec 29, 2018 · 4 comments
Open

Classic attention baseline model #17

zbetmen1 opened this issue Dec 29, 2018 · 4 comments

Comments

@zbetmen1
Copy link

Hi Jianshu, must say I'm impressed with your work and both WAP, DenseNet and TAP papers and advances.
Have you done any experiments with CROHME using classic attention (Bahdanau style) and VGG style encoder (similar to the one used in WAP paper) and without 8 dimensional features?
I've tried to use this as baseline model, however this model is not able to generalize well and results in huge and quick overfitting. My intention is to inspect/study effects of each of the steps that are taken towards state of the art architecture.
I'm just checking with you since I speculate it might have been one of the steps you took when you began your work.

@JianshuZhang
Copy link
Owner

JianshuZhang commented Dec 29, 2018 via email

@zbetmen1
Copy link
Author

zbetmen1 commented Dec 29, 2018 via email

@zbetmen1
Copy link
Author

zbetmen1 commented Jan 1, 2019

Hi Jianshu, its me again. Just one quick question to check if I've understood correctl. In your TAP work classic attention almost without modification (I've seen in some of your comments that second GRU used is just minor improvement) trains correctly and produces reasonable results (~42% expression rate)?
I get that traces carry much more information than offline images, however this sounds very interesting to me because RNNs are usually much harder to train than CNNs (this is related to encoders used in TAP/WAP). Do you have some insight that I'm not aware of regarding this? :)

@JianshuZhang
Copy link
Owner

JianshuZhang commented Jan 1, 2019 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants