Describe the Issue
Since the latest update, every model I run above 11B has problems, regardless of backend (Vulkan, CLBlast, CuBLAS, and all the legacy modes). With a character card injected, the AI crashes with a VRAM overflow instead of using the CPU and GPU together.
Essentially, instead of the model being split between GPU and CPU, it loads onto the GPU only and crashes.
Additional Information:
I'm on Ubuntu 24.04 (Cinnamon), fully updated, with the latest SillyTavern as well. For reference, version 1.73 works well with 11B and 13B models by falling back to the CPU, whereas now a 13B model won't fit on my 10 GB RTX 3080 LHR. To help gauge specs: Ryzen 7 5700G, 128 GB DDR4.
(I don't open GitHub issues often enough to know if I'm doing this right. Sorry.)
I've tried everything from the suggested maximum of 49 layers down to 33, with the same result: no matter the layer offload setting, it reports a failure and falls back to the CPU-only backend.