Any plan about W4A8 #873

Arcmoon-Hu · 2024-10-29T09:05:47Z

Are there any plans for W4A8 support?

robertgshaw2-neuralmagic · 2024-11-20T21:21:20Z

Did you run into an issue using this in llm-compressor?

We are working on extending the machete kernels in vllm to support W4A8 and will hook up the compressed-tensors models once this is complete.

RameshArvind · 2024-12-03T01:22:42Z

Are you suggesting that W4A8 already works in llm-compressor? For instance, if we pass scheme=W4A8 in this example, will it work? If so, an example recipe would be great!

Couldn't find an example

Arcmoon-Hu added the enhancement (New feature or request) label on Oct 29, 2024
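
For readers unfamiliar with the notation discussed in this thread: W4A8 means 4-bit weights combined with 8-bit activations. The sketch below only illustrates that arithmetic in plain Python. It is not llm-compressor's recipe format or the machete kernel; the function names (`quantize_symmetric`, `w4a8_dot`) are hypothetical, and symmetric per-tensor scaling is an assumption made for simplicity (real schemes may use per-group or per-token scales).

```python
def quantize_symmetric(values, num_bits):
    """Map floats to signed integers of width num_bits with one shared scale.

    Hypothetical helper for illustration; assumes symmetric per-tensor scaling.
    """
    qmax = 2 ** (num_bits - 1) - 1   # 7 for 4-bit, 127 for 8-bit
    qmin = -qmax - 1                 # -8 for 4-bit, -128 for 8-bit
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer step, then clamp to the representable range.
    q = [min(qmax, max(qmin, round(v / scale))) for v in values]
    return q, scale


def w4a8_dot(weights, activations):
    """Dot product with 4-bit weights and 8-bit activations."""
    qw, w_scale = quantize_symmetric(weights, 4)      # "W4"
    qa, a_scale = quantize_symmetric(activations, 8)  # "A8"
    # Accumulate in integers, then rescale back to floating point once.
    acc = sum(w * a for w, a in zip(qw, qa))
    return acc * w_scale * a_scale


weights = [0.7, -0.35, 0.1, 0.0]
activations = [1.0, 2.0, -0.5, 4.0]

exact = sum(w * a for w, a in zip(weights, activations))
approx = w4a8_dot(weights, activations)
print(f"exact={exact:.4f} w4a8={approx:.4f}")
```

The gap between the two printed values is the quantization error, which comes mostly from the coarse 4-bit weight grid; this is why practical W4A8 schemes tend to use finer-grained scales than the single per-tensor scale assumed here.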
