int8 for pytorch #70

Open
boehm-e opened this issue Sep 19, 2024 · 3 comments

boehm-e commented Sep 19, 2024

Due diligence

  • I have done my due diligence in trying to find the answer myself.

Topic

The PyTorch implementation

Question

Hi there,
First, I would like to thank you for this amazing project :)

Question: I would like to know whether an int8 version of the models is planned for PyTorch.

I see it is supported for MLX and Rust.
I tried running `python -m moshi.server -h`, but I don't see any parameter for loading the model in lower precision. With the default settings I get an OOM on my RTX 4060 (8 GB).

Thank you for the great work!
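In case it helps others in the meantime, one generic way to experiment with int8 in plain PyTorch is to swap `nn.Linear` layers for bitsandbytes 8-bit linears. This is only a sketch, not a supported moshi code path, and `load_moshi_model` below is a hypothetical placeholder:

```python
import torch
import bitsandbytes as bnb

def to_int8_linears(module: torch.nn.Module) -> None:
    """Recursively replace nn.Linear layers with bitsandbytes 8-bit linears."""
    for name, child in module.named_children():
        if isinstance(child, torch.nn.Linear):
            qlin = bnb.nn.Linear8bitLt(
                child.in_features,
                child.out_features,
                bias=child.bias is not None,
                has_fp16_weights=False,  # store/run the weights in int8
            )
            # Copy the float weights; with has_fp16_weights=False they are
            # quantized to int8 when the module is moved to a CUDA device.
            qlin.load_state_dict(child.state_dict())
            setattr(module, name, qlin)
        else:
            to_int8_linears(child)

# Hypothetical usage -- load_moshi_model is a placeholder, not a real API:
# model = load_moshi_model()
# to_int8_linears(model)
# model.to("cuda")  # int8 quantization happens on this device transfer
```

Note this only shrinks the linear-layer weights; activations and the KV cache are untouched.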

boehm-e added the question label on Sep 19, 2024
kyutai-labs deleted a comment on Sep 19, 2024
adefossez (Collaborator) commented Sep 19, 2024

I would strongly recommend against downloading and running code from random people commenting on issues. For safety reasons I have deleted the comment.

At the moment only the Rust backend supports int8 quantization. That might change in the future. In any case, it still probably wouldn't fit on an 8 GB GPU, as there are also the weights of the Mimi codec, the depth transformer, and the KV cache.
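For rough scale, a back-of-envelope sketch (the ~7B parameter count for the temporal transformer, and every other figure here, is an assumption):

```python
# Back-of-envelope VRAM estimate; every figure here is an assumption.
temporal_params = 7e9  # assumed ~7B-parameter temporal transformer
int8_bytes_per_param = 1

weights_gb = temporal_params * int8_bytes_per_param / 1e9
print(f"int8 temporal transformer weights alone: ~{weights_gb:.0f} GB")
# ~7 GB before adding the Mimi codec weights, the depth transformer,
# and a KV cache that grows with sequence length -- so 8 GB is tight.
```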

adefossez added the may_implement label on Sep 19, 2024
adefossez self-assigned this on Sep 19, 2024
boehm-e (Author) commented Sep 19, 2024

@adefossez, thank you for your answer.
Yes, I tried the Rust backend and also got an OOM.
I assume a lot of people have an 8 GB GPU.
To your knowledge, would it be possible to run the model with int4 quantization?
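For rough scale, extending the same back-of-envelope arithmetic to int4 (still assuming a ~7B-parameter temporal transformer):

```python
# Same rough arithmetic at int4 (0.5 bytes per parameter); assumptions only.
temporal_params = 7e9  # assumed ~7B-parameter temporal transformer
weights_gb = temporal_params * 0.5 / 1e9
print(f"int4 temporal transformer weights: ~{weights_gb:.1f} GB")
# ~3.5 GB of weights leaves headroom on 8 GB, but Mimi, the depth
# transformer, and the KV cache still have to fit on top.
```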

meicale commented Sep 22, 2024

> I would strongly recommend against downloading and running code from random people commenting on issues. For safety reasons I have deleted the comment.
>
> At the moment only the Rust backend supports int8 quantization. That might change in the future. In any case, it still probably wouldn't fit on an 8 GB GPU, as there are also the weights of the Mimi codec, the depth transformer, and the KV cache.

What is the minimum GPU memory requirement? Would you mind providing an int8 PyTorch version, and if so, when? Thank you for your wonderful work!
