Ternary VGG on CIFAR10 with Fully Homomorphic Encryption

Description

In this directory, we provide Python code for training, from scratch, a VGG-like neural network on CIFAR-10 using Brevitas. We also provide a script to run the neural network in the Fully Homomorphic Encryption (FHE) setting.

Original files can be found in the Brevitas repository. The model in the models/ folder has a few modifications from the original to make it compatible with Concrete ML:

  • MaxPool layers have been replaced by AvgPool layers. Max pooling is a costly operation in FHE, so we replace it with a cheaper one, average pooling.
  • Quantization is applied after each AvgPool layer, as Concrete ML needs this to capture the quantization parameters. A Brevitas QuantIdentity layer achieves this.
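
To illustrate why average pooling is the cheaper choice, here is a minimal sketch in plain Python (not code from this repository) that computes both poolings over 2x2 windows of a small integer feature map. Average pooling needs only additions and one scaling by a constant, which are linear operations. Max pooling needs comparisons between inputs, which are non-linear and, in TFHE-style schemes such as the one Concrete ML builds on, are typically evaluated with table lookups. The helper names `windows_2x2`, `avg_pool_2x2` and `max_pool_2x2` are illustrative.

```python
def windows_2x2(fmap):
    """Yield, row by row, the non-overlapping 2x2 windows of a 2D list."""
    for r in range(0, len(fmap) - 1, 2):
        row = []
        for c in range(0, len(fmap[0]) - 1, 2):
            row.append(
                [fmap[r][c], fmap[r][c + 1], fmap[r + 1][c], fmap[r + 1][c + 1]]
            )
        yield row


def avg_pool_2x2(fmap):
    # Additions plus one division by a constant: linear in the inputs.
    return [[sum(w) / 4 for w in row] for row in windows_2x2(fmap)]


def max_pool_2x2(fmap):
    # Comparisons between inputs: non-linear in the inputs.
    return [[max(w) for w in row] for row in windows_2x2(fmap)]


feature_map = [
    [1, 3, 2, 0],
    [5, 7, 4, 2],
    [0, 2, 8, 6],
    [4, 2, 6, 4],
]
avg_pooled = avg_pool_2x2(feature_map)  # [[4.0, 2.0], [2.0, 6.0]]
max_pooled = max_pool_2x2(feature_map)  # [[7, 4], [4, 8]]
```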

Installation

To use this code, you need to have Python 3.8 and install the following dependencies:

pip install -r requirements.txt

Training and inference

The files in this section are almost identical to the originals. Here we train a VGG-like neural network using an example available in the Brevitas GitHub repository.

To train the neural network:

python3 bnn_pynq_train.py --data ./data --experiments ./experiments

To evaluate the trained model:

python3 bnn_pynq_train.py --evaluate --resume ./experiments/CNV_2W2A_2W2A_20221114_131345/checkpoints/best.tar

Simulation in Concrete ML

In Concrete ML, you can test your model in simulation before running it in FHE, so that you don't pay the cost of FHE runtime during development.

You can launch this evaluation as follows:

python3 evaluate_torch_cml.py

It evaluates the model with Torch and with Concrete ML in simulation mode (a representation of the FHE circuit running in the clear) and compares the results.
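
Such a comparison boils down to computing top-1 accuracy for each backend and, optionally, how often the two backends agree. A minimal sketch in plain Python follows. The labels, predictions and function names are made up for illustration and are not taken from `evaluate_torch_cml.py`.

```python
def top1_accuracy(predictions, labels):
    """Fraction of samples whose predicted class equals the label."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def agreement(predictions_a, predictions_b):
    """Fraction of samples on which two backends predict the same class."""
    same = sum(a == b for a, b in zip(predictions_a, predictions_b))
    return same / len(predictions_a)


# Toy data for illustration only: class indices for 8 images.
labels = [3, 8, 8, 0, 6, 6, 1, 6]
torch_predictions = [3, 8, 8, 0, 6, 6, 1, 2]
simulated_predictions = [3, 8, 0, 0, 6, 6, 1, 2]

torch_accuracy = top1_accuracy(torch_predictions, labels)
simulated_accuracy = top1_accuracy(simulated_predictions, labels)
backend_agreement = agreement(torch_predictions, simulated_predictions)
```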

Optionally, you can change the number of rounding bits applied to the model (default: 6) as follows:

python3 evaluate_torch_cml.py --rounding_threshold_bits 8

You can also pass several rounding_threshold_bits values to check the final accuracy for each:

python3 evaluate_torch_cml.py --rounding_threshold_bits 1 2 3 4 5 6 7 8
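
As a rough picture of what rounding does, the sketch below keeps only the most significant bits of an integer accumulator by rounding it to the nearest multiple of a power of two. This is plain Python for illustration, not the Concrete ML implementation. The function name `round_to_msbs` and the assumption of a non-negative accumulator of known bit width are simplifications made here.

```python
def round_to_msbs(value, total_bits, kept_bits):
    """Round a non-negative integer so only its `kept_bits` top bits vary.

    `total_bits` is the assumed bit width of the accumulator. The low
    `total_bits - kept_bits` bits are rounded away (round half up).
    """
    lsbs_to_remove = max(total_bits - kept_bits, 0)
    if lsbs_to_remove == 0:
        return value
    step = 1 << lsbs_to_remove
    return ((value + step // 2) // step) * step


accumulator = 0b1011_0110_1101  # 2925, a 12-bit value
rounded_6 = round_to_msbs(accumulator, total_bits=12, kept_bits=6)  # 2944
rounded_8 = round_to_msbs(accumulator, total_bits=12, kept_bits=8)  # 2928
```

Keeping fewer bits gives a coarser value but a smaller input to whatever non-linear step follows, which is the trade-off the accuracy sweep above is meant to explore.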

Fully Homomorphic Encryption (FHE)

Once the model has been shown to reach satisfactory accuracy in simulation, it can be compiled and run in FHE:

python3 evaluation_one_example_fhe.py

Here, an image from the CIFAR-10 dataset is randomly chosen and preprocessed. The data is then quantized, encrypted, and passed to the FHE circuit, which evaluates the encrypted image. The result, also encrypted, is then decrypted and compared with the expected output from the clear inference.
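
The quantization step maps floating-point inputs to small integers before encryption, and the inverse mapping recovers approximate floats. The sketch below assumes uniform affine quantization and is written in plain Python. The scale, zero point and bit width are arbitrary values chosen for illustration, and the function names are hypothetical. Encryption, FHE execution and decryption are not shown, as they require the Concrete ML runtime.

```python
def quantize(values, scale, zero_point, n_bits):
    """Map floats to unsigned integers in [0, 2**n_bits - 1]."""
    q_max = (1 << n_bits) - 1
    return [min(max(round(v / scale) + zero_point, 0), q_max) for v in values]


def dequantize(q_values, scale, zero_point):
    """Map integers back to approximate floats."""
    return [(q - zero_point) * scale for q in q_values]


# Illustrative normalized pixel values; scale and zero point are assumptions.
pixels = [-1.0, -0.5, 0.0, 0.5, 1.0]
q_pixels = quantize(pixels, scale=0.25, zero_point=4, n_bits=4)
recovered = dequantize(q_pixels, scale=0.25, zero_point=4)
```

In this toy case every input lies exactly on the quantization grid, so `recovered` equals `pixels`. Real inputs generally incur a small rounding error.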

Warning: this execution can be quite costly.

While it is the ambition of Concrete ML to execute such large CNNs in reasonable time on various hardware accelerators, execution times on a CPU are currently very high: more than 10 hours on a large many-core machine. This is a work in progress and will be improved significantly in future releases.

Accuracy and performance

Runtime                  Rounding   Accuracy (%)
VGG Torch                None       88.7
VGG FHE (simulation*)    None       88.7
VGG FHE (simulation*)    8 bits     88.3
VGG FHE (simulation*)    7 bits     88.3
VGG FHE (simulation*)    6 bits     87.5
VGG FHE (simulation*)    5 bits     84.9
VGG FHE                  NA**       NA**

Rounding all accumulators to 6 bits offers a large speed-up in FHE (TBD), while the accuracy loss compared to the original model is only 1.2 percentage points (88.7 vs. 87.5).

* Simulation is used to evaluate the accuracy in the clear for faster debugging.
** Expected to match the VGG FHE simulation. Assessing the actual FHE accuracy on a subset of images is a work in progress.
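
For reference, the accuracy cost of each rounding setting can be obtained by subtracting the simulated accuracy from the Torch baseline. A small sketch using only the figures reported in the table:

```python
baseline = 88.7  # Torch accuracy from the table, in percent

# Simulated FHE accuracy (percent) per rounding setting, from the table.
simulated = {8: 88.3, 7: 88.3, 6: 87.5, 5: 84.9}

# Accuracy drop in percentage points, rounded to one decimal place.
drops = {bits: round(baseline - acc, 1) for bits, acc in simulated.items()}

for bits, drop in drops.items():
    print(f"{bits} bits: -{drop} points")
```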