Supports W8A8 quantization for more models #2850

Merged
lvhan028 merged 2 commits into InternLM:main from AllentDan:w8a8-llm on Dec 4, 2024