
Installation requirements #89

Open · arthurv opened this issue Sep 13, 2024 · 4 comments

Comments

arthurv commented Sep 13, 2024

Hi,

I tried to install ktransformers on a clean install of Linux Mint 22 (based on Ubuntu 24.04), and there are a few things that I had to add:

```
pip install numpy
pip install cpufeature
pip install flash_attn
conda install -c conda-forge libstdcxx-ng
```

Please update the pip dependencies.

Are there any plans to increase the number of quants supported for Deepseek-Coder-V2-Instruct-0724?

Azure-Tang (Contributor) commented

Okay, we will update these packages in our next release.

Regarding quantizations, we now support multiple quantization methods (Qx_K and IQ4_XS). What else would you like?

arthurv (Author) commented Sep 15, 2024

I have a system with 192GB DRAM and 48GB VRAM (2x 3090). Would it be able to handle 128k context with these specs? Would it be able to handle Q5_K_M or Q6_K_M?

Also, I can only set max_new_tokens in local_chat, not in the ktransformers server, and I can't set the total context size anywhere.

devprimed commented

Having this issue as well: not being able to set --max_new_tokens in the container breaks downstream projects that require longer output lengths.

Azure-Tang (Contributor) commented

> I have a system with 192GB DRAM and 48GB VRAM (2x 3090). Would it be able to handle 128k context with these specs? Would it be able to handle Q5_K_M or Q6_K_M?
>
> Also, I can only set max_new_tokens in local_chat, not in the ktransformers server, and I can't set the total context size anywhere.

Yes, we support Q5_K_M and Q6_K_M.

As for setting max_new_tokens in the container: sorry for the inconvenience, this is not yet supported. If you're building from source, you can modify the max_new_tokens parameter in ktransformers/server/backend/args.py. We will include this update in the next Docker release.
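
For anyone building from source in the meantime, here is a minimal sketch of the kind of edit described above. Only the file path and the max_new_tokens name come from this thread; the class name, the other field, and the default values are assumptions for illustration, so adapt it to whatever the real file defines:

```python
# Hypothetical excerpt of ktransformers/server/backend/args.py.
# Only the path and the max_new_tokens name come from the maintainer's
# comment; the class and remaining fields are illustrative assumptions.
from dataclasses import dataclass

@dataclass
class ConfigArgs:
    # Assumed field, shown only for context.
    model_path: str = ""

    # Default cap on generated tokens per request. Raising it lets the
    # server produce longer outputs, bounded by your available memory.
    max_new_tokens: int = 4096  # bump from the shipped (smaller) default
```

After editing, reinstall from source (e.g. pip install -e .) so the server picks up the new default.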
