[Bug Report] Multi-GPU device ordinal issue #837
Labels
bug
Something isn't working
complexity-high
Very complicated changes for people to address who are quite familiar with the code
multi-gpu
If you are submitting a bug report, please fill in the following details and use the tag [bug].
Describe the bug
If you run n_devices=3 for llama2-7b from a HookedTransformer, you will receive an issue that the device index specified is out of the ordinal range.
From what I can tell this could be related to how layers are being assigned GPU, and then fetched.
Code example
System Info
Describe the characteristic of your environment:
Additional context
Checklist
The text was updated successfully, but these errors were encountered: