PoC: Block weight quantize tool for LLM [skip ci] #13758

hseok-oh · 2024-08-26T11:17:19Z

Block quantization for LLM: FullyConnected, Gather
Decide quantize type by circle-quantizer parameter: --block_quantize_weights (Q4_0, Q8_0)
Skip quantization by circle-quantizer parameter: --skipsize_block_quantize (default: 0)

Caution: It's for PoC of circle format and test model generation. Not for compiler implementation.
#13742 #13743

- Blockwise quantization for LLM: FullyConnected, Gather - Decide quantize type by circle-quantizer parameter: `--quantize_weights_chunk` (Q4_0, Q8_0) - Skip quantization by circle-quantizer parameter: `--skip_chunkquant_size` (default: 0) ONE-DCO-1.0-Signed-off-by: Hyeongseok Oh <[email protected]>

hseok-oh added the PR/NO TEST Tell CI to not run test label Aug 26, 2024

hseok-oh force-pushed the draft/weight_quant_llm branch from 621f3f6 to 29b28a5 Compare August 26, 2024 11:19

hseok-oh changed the title ~~PoC: Blckwise weight quantize tool for LLM [skip ci]~~ PoC: Chunk weight quantize tool for LLM [skip ci] Aug 26, 2024

hseok-oh force-pushed the draft/weight_quant_llm branch 3 times, most recently from b23a54b to 47eede8 Compare August 27, 2024 05:33

hseok-oh changed the title ~~PoC: Chunk weight quantize tool for LLM [skip ci]~~ PoC: Block weight quantize tool for LLM [skip ci] Aug 27, 2024

hseok-oh force-pushed the draft/weight_quant_llm branch from 93887e0 to fb99730 Compare August 27, 2024 08:16

hseok-oh mentioned this pull request Aug 29, 2024

[tools] Implement temporary block quantization tool #13830

Closed

hseok-oh force-pushed the draft/weight_quant_llm branch from f26e7cc to 84cef0e Compare September 6, 2024 02:03

hseok-oh force-pushed the draft/weight_quant_llm branch 2 times, most recently from efac650 to 750278f Compare October 11, 2024 08:11

hseok-oh force-pushed the draft/weight_quant_llm branch from 750278f to 2971542 Compare October 11, 2024 08:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PoC: Block weight quantize tool for LLM [skip ci] #13758

PoC: Block weight quantize tool for LLM [skip ci] #13758

hseok-oh commented Aug 26, 2024 •

edited

Loading

PoC: Block weight quantize tool for LLM [skip ci] #13758

Are you sure you want to change the base?

PoC: Block weight quantize tool for LLM [skip ci] #13758

Conversation

hseok-oh commented Aug 26, 2024 • edited Loading

hseok-oh commented Aug 26, 2024 •

edited

Loading