Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

questions about performance of sequential mnist experiment #6

Open
ziyuwwang opened this issue Aug 4, 2020 · 2 comments
Open

questions about performance of sequential mnist experiment #6

ziyuwwang opened this issue Aug 4, 2020 · 2 comments

Comments

@ziyuwwang
Copy link

Hello, I ran the code with setting "num_units=6, k=4" and I just cannot reach the accuracy reported in readme.md. Could you provide exact hyper-parameters of the sequential mnist experiment?
Another issue is that I noticed there is no embedding of input in sequential mnist experiment and input size is set to 1. Would it be ok with this situation? For example, a pixel with value 0 is indistinguishable from null input.
Looking forward to your reply!

@dido1998
Copy link
Owner

dido1998 commented Aug 5, 2020

Hi, could you share the results that you are getting currently? I agree that the pixels will not be distinguishable from the null input and in the official implementation the null vector is appended after using a linear transformation on the input first before appending the null vector. You can also try that, but I was able to get these results without using a transformation.

@ziyuwwang
Copy link
Author

It's strange that I get a result about "0.77, 0.55, 0.31" for test resolutions 1616, 1919, 24*24 with exact the code you released. After I add an embedding layer to transform the input into vectors of 600 dimensions and change the learning rate to 0.0001, the result turns to be around "0.88, 0.70, 0.44" which is closed to the result you report in readme.md.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants