Scaling and productionizing Transformers with millions of parameters is difficult!
Addressing this, Hugging Face 🤗 released a new tool called Optimum (https://huggingface.co/blog/hardware-partners-program), which aims to speed up the inference of Transformers 🏎️!
This notebook demonstrates some experiments on quantizing HF pre-trained models for sentiment analysis 🎭 and summarization 🤏.
It also compares the performance of Optimum-LPOT quantization (Intel's Low Precision Optimization Tool), ONNX Runtime quantization, and the unquantized baseline model.
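To ground the comparison, here is a minimal, stdlib-only sketch of the symmetric int8 quantization idea that tools like ONNX Runtime and LPOT apply to model weights. This is purely illustrative: real toolchains additionally handle per-channel scales, zero points, calibration data, and operator fusion, none of which are shown here.

```python
def quantize_int8(weights):
    """Map float weights to int8 codes using one symmetric scale factor."""
    # Largest magnitude maps to 127; guard against an all-zero tensor.
    scale = max(abs(w) for w in weights) / 127 or 1.0
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 codes."""
    return [v * scale for v in q]

weights = [0.42, -1.27, 0.08, 0.9]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
print(q)        # small integer codes (4x smaller than float32 storage)
print(restored) # approximations within half a quantization step of the originals
```

The storage saving (int8 vs. float32) and the bounded rounding error shown here are the core trade-off the notebook's experiments measure at full model scale.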
It's recommended to run this notebook on Google Cloud AI Platform with an N2-standard-4 machine. For ease of use, you can also follow this link for a Colab version 👇: