40 GB model #6

Open
DHOFM opened this issue Mar 23, 2023 · 2 comments

Comments

DHOFM commented Mar 23, 2023

Hi,
thanks for your nice repo. You mention 2x RTX 3090 for the 65B model:
> The following hardware is needed to run different models in MiniLLM:
>
> | Model | GPU Memory Requirements | Compatible GPUs |
> | --- | --- | --- |
> | llama-7b-4bit | 6GB | RTX 2060, 3050, 3060 |
> | llama-13b-4bit | 10GB | GTX 1080, RTX 2060, 3060, 3080 |
> | llama-30b-4bit | 20GB | RTX 3080, A5000, 3090, 4090, V100 |
> | llama-65b-4bit | 40GB | A100, 2x3090, 2x4090, A40, A6000 |

So when I try the 65B version with 2x RTX 3090, I get an OOM error. How can I use both GPUs?

Kind regards,

Dirk

@kuleshov (Owner) commented

Ah, sorry, I haven't implemented that feature yet :) Happy to merge a PR if someone does that first.
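
For anyone who wants to take a stab at a PR: below is a rough sketch of the general idea using Hugging Face Accelerate's automatic device map, not MiniLLM's loader. The checkpoint path is a placeholder, and a full-precision 65B model still wouldn't fit in 2x24 GB, so this only illustrates the multi-GPU dispatch mechanism; the 4-bit weights would need to be sharded across the GPUs in the same way.

```python
# Minimal sketch, NOT MiniLLM's API: shard an HF-format LLaMA checkpoint
# across all visible GPUs with Accelerate's device_map="auto".
# The model path is a placeholder, and fp16 65B weights are far too large
# for 2x24 GB; a 4-bit-aware loader would still need equivalent support.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "path/to/llama-65b-hf"  # hypothetical HF-format checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    device_map="auto",          # split layers across both GPUs automatically
    torch_dtype=torch.float16,  # fp16 here; 4-bit needs a quantized loader
)

prompt = "The llama is a domesticated South American"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```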

@kuleshov (Owner) commented

For what it's worth, I was running the 65B model on a single A6000 (48 GB).
