ONNX export failed #212
Then if i change:
to
I get another error:
It would seem that during normal inference EDIT:
Changing:
at line 432
seems to resolve everything, but i dunno, this seems like a hack.
@pfeatherstone ah yea, think i may have a solution. threw in some fixes (but by no means for all configurations), let me know if that works
@pfeatherstone you set
I now get the following error during export:
@lucidrains sorry to bother again. But it would be really cool to get this working with ONNX. At some point I might submit a PR which adds CI/CD. Some unit tests would go a long way
@pfeatherstone can you try it without shift tokens?
@pfeatherstone yea, i know some others have already gotten onnx to work in production, so it definitely works for some configurations, just not all. the repository at this point prioritizes simplicity; it is not worth bending over backwards to make onnx work for all settings.
@lucidrains No, it didn't work either
ah alright, i'll have to circle back to this some other time |
OK cool. to be honest, once i've nailed down the configurations i want, i might write from scratch keeping exactly what i need, then it will probably be easier to debug the onnx export. |
@pfeatherstone yes exactly, that is how i intended it to be |
I've tried so many configurations and it turns out i only really need:
Here is a repro:
The export fails with message:
"b, n, device, num_mems, has_memory_tokens, emb_frac_gradient = *x.shape, x.device, self.num_memory_tokens, self.num_memory_tokens > 0, self.emb_frac_gradient
ValueError: too many values to unpack (expected 6)"
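For readers hitting the same message: the failing line star-unpacks `x.shape` into exactly two names (`b, n`), so it can only succeed when `x` is 2-D. A minimal sketch of that mechanism in plain Python follows. The helper `unpack_like_forward` and the tuple standing in for `x.shape` are hypothetical, and whether a 3-D input is what triggered the error in this issue is an assumption, since the repro code is not preserved here.

```python
def unpack_like_forward(shape, device="cpu", num_memory_tokens=0, emb_frac_gradient=1.0):
    # Hypothetical stand-in for the failing assignment: `shape` plays the role of x.shape.
    # Six names on the left, so *shape must contribute exactly two values (b, n).
    b, n, device, num_mems, has_memory_tokens, emb_frac_gradient = (
        *shape, device, num_memory_tokens, num_memory_tokens > 0, emb_frac_gradient
    )
    return b, n


# A 2-D shape (batch, sequence length) unpacks cleanly.
ok = unpack_like_forward((2, 1024))

# A 3-D shape (batch, sequence length, feature dim) yields seven values for six names.
try:
    unpack_like_forward((2, 1024, 512))
    message = None
except ValueError as err:
    message = str(err)

print(ok)
print(message)
```

If this is indeed the cause, the input passed at export time has one more dimension than the module expects at that line, so checking the rank of the dummy input handed to the exporter is a reasonable first step.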