Any ideas about structure optimization? #72

Open
Crestina2001 opened this issue Jul 8, 2024 · 0 comments

The model is too large to fit on my RTX 4090. Is there any way to shrink it while affecting performance as little as possible?

I don't know which parameters affect the model size the most, or how to adjust them.

I am working on HKO-7, which is 500×500×1.

In addition, could anyone explain the intuition behind cuboid attention? I wonder whether there is a way to avoid the 3-D structure, which is computationally expensive. I believe there should be an elegant and computationally cheap way to handle spatiotemporal data.
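
To make the cost concern concrete, here is a rough back-of-envelope sketch comparing the number of query-key pairs in full 3-D self-attention against attention restricted to local, non-overlapping cuboids. It only counts pairs; it is not taken from the repository's code. The function names, the token grid (10 frames, 125×125 spatial tokens after some downsampling) and the cuboid size (2, 5, 5) are all hypothetical values chosen for illustration.

```python
def attention_pairs_full(t: int, h: int, w: int) -> int:
    """Query-key pairs when every token attends to every token."""
    n = t * h * w
    return n * n


def attention_pairs_cuboid(t: int, h: int, w: int,
                           ct: int, ch: int, cw: int) -> int:
    """Query-key pairs when tokens attend only within their own
    non-overlapping (ct, ch, cw) cuboid. Assumes exact divisibility."""
    if t % ct or h % ch or w % cw:
        raise ValueError("grid must be divisible by the cuboid size")
    n = t * h * w
    tokens_per_cuboid = ct * ch * cw
    # Each of the n tokens attends to the tokens in its own cuboid.
    return n * tokens_per_cuboid


if __name__ == "__main__":
    # Hypothetical token grid and cuboid size, for illustration only.
    T, H, W = 10, 125, 125
    full = attention_pairs_full(T, H, W)
    local = attention_pairs_cuboid(T, H, W, 2, 5, 5)
    print(f"full attention pairs:   {full:,}")
    print(f"cuboid attention pairs: {local:,}")
    print(f"reduction factor:       {full // local}")
```

Under these assumptions the pair count grows quadratically with the number of tokens for full attention but only linearly for local cuboids, so shrinking the token grid (for example, stronger downsampling) or the cuboid size reduces attention cost. How this translates into actual memory use depends on the implementation.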
