Add convenience Functions to save/load quantized Model to/from Disk #411

marcnnn · 2024-12-07T19:24:22Z

Loading quantized models from HF is not possible yet.

So I was searching for an easy way to save models after quantization, as easy as loading them from HF.

And then a function to load the file again.

jonatanklosko · 2024-12-09T08:52:53Z

This probably belongs in Axon rather than Bumblebee, since we need a way to store `%Axon.ModelState{}`. For the model itself, maybe there should be a way to quantize only the model, so the quantization can be replicated without altering the params (that way we can still build the model instead of storing it). cc @seanmor5
