Code for the NeurIPS 2024 submission:
"DAGER: Extracting Text from Gradients with Language Model Priors"
- Install Anaconda.
- Create the conda environment:
conda env create -f environment.yml -n dager
- Activate the created environment:
conda activate dager
- Create the necessary folders:
mkdir -p models models_cache
- Download the required files from Megadrive and store the fine-tuned GPT-2 model and LoRA models in the models folder.
We use HuggingFace for obtaining all datasets and models. However, the LLaMa-2 (7B) model and the ECHR dataset require a specific setup.
To install the ECHR dataset, follow these steps:
wget https://huggingface.co/datasets/glnmario/ECHR/resolve/main/ECHR_Dataset.csv
mkdir ./models_cache/datasets--glnmario--ECHR
mv ECHR_Dataset.csv ./models_cache/datasets--glnmario--ECHR/
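To sanity-check that the CSV is in place, you can load it directly with the datasets library. This is a minimal sketch: the loading call is illustrative, and the repository's own loader may resolve the cached file differently.

```python
# Hypothetical sanity check that the ECHR CSV is in place; the repository's own
# loader may resolve the cached path differently.
from datasets import load_dataset

echr = load_dataset(
    "csv",
    data_files="./models_cache/datasets--glnmario--ECHR/ECHR_Dataset.csv",
    split="train",
)
print(echr)  # prints the number of rows and the column names
```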
For using the LLaMa-2 (7B) model, you need to create a HuggingFace profile and request access to the model from Meta. Then, create an API token that allows read access to relevant repositories. Export this token into an environment variable as shown below:
export HF_TOKEN=<huggingface_api_token>
NOTE: if you have any issues connecting to HuggingFace, we recommend using the huggingface-cli as follows:
huggingface-cli download <model> --cache-dir ./models_cache
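For reference, the snippet below sketches how the exported token and the local cache directory can be used from Python. The model ID meta-llama/Llama-2-7b-hf and the loading calls are illustrative assumptions; our scripts may load the model differently.

```python
# Minimal sketch: authenticating to HuggingFace and caching LLaMa-2 locally.
# The model ID and loading calls are assumptions, not the exact code in our scripts.
import os
from huggingface_hub import login
from transformers import AutoModelForCausalLM, AutoTokenizer

login(token=os.environ["HF_TOKEN"])  # uses the exported HF_TOKEN

model_id = "meta-llama/Llama-2-7b-hf"  # assumed gated model ID
tokenizer = AutoTokenizer.from_pretrained(model_id, cache_dir="./models_cache")
model = AutoModelForCausalLM.from_pretrained(model_id, cache_dir="./models_cache")
```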
We provide a modified version of TAG and LAMP inside the lamp folder, which we used to run the baselines for GPT-2. For experiments on BERT, we recommend using the original code. Further instructions can be found in the README.md inside the respective repositories.
As general guidance, for each experiment we define a rank tolerance parameter RANK_TOL to deal with numerical instabilities. This parameter has to be tweaked depending on the batch size, with any value within an order of magnitude of the "true value" resulting in equivalent behaviour. Throughout our experiments, as a rule of thumb we followed the guidelines below:
- RANK_TOL = $10^{-7}$ for batch sizes 1 and 2
- RANK_TOL = $10^{-8}$ for batch sizes 4-16
- RANK_TOL = $10^{-9}$ for batch sizes 32-128
We set a default rank tolerance of None for automatic inference (as specified in the NumPy documentation).
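For intuition, the snippet below illustrates the NumPy semantics that the tolerance follows (tol=None triggers automatic inference). It is only an illustration of the thresholding behaviour, not necessarily the exact call made in our code.

```python
# Sketch of how a rank tolerance behaves, following the NumPy semantics
# referenced above (tol=None lets NumPy infer the threshold automatically).
import numpy as np

A = np.random.randn(16, 768) @ np.random.randn(768, 768)  # gradient-like matrix

rank_auto = np.linalg.matrix_rank(A)               # tol=None: automatic inference
rank_fixed = np.linalg.matrix_rank(A, tol=1e-8)    # rule-of-thumb value for batch sizes 4-16

print(rank_auto, rank_fixed)
```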
- DATASET - the dataset to use. Must be one of cola, sst2, rotten_tomatoes.
- BATCH_SIZE - the number of sentences in a batch.
To run GPT-2:
./scripts/gpt2.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL>
To run GPT-2 Large:
./scripts/gpt2-large.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL>
To run GPT-2 Finetuned:
./scripts/gpt2-ft.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL>
To run GPT-2 on the next-token prediction task:
./scripts/gpt2.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL> --task next_token_pred
To run GPT-2 using the Frobenius norm loss:
./scripts/gpt2.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL> --loss mse
To run GPT-2 using a ReLU activation:
./scripts/gpt2.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL> --hidden_act relu
To run LLaMa-2:
./scripts/llama.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL>
To run BERT (with all heuristics enabled):
./scripts/bert.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL>
To run BERT (with heuristics disabled):
./scripts/bert.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL> --l1_filter all --l2_filter overlap
To run DAGER on GPT-2 under LoRA finetuning:
./scripts/lora.sh DATASET BATCH_SIZE <--rank_tol RANK_TOL>
NOTE: for the experiment on LLaMa-2 on Rotten Tomatoes at batch size 128, we recommend the following command for faster execution and better numerical stability:
./llama.sh rotten_tomatoes 128 --rank_tol 1e-11 --l1_span_thresh 1e-4 --l2_span_thresh 5e-10
Furthermore, for the experiment on long sequences using the glnmario/ECHR dataset, it is recommended to set a lower rank tolerance and a lower span threshold (we used RANK_TOL=$10^{-9}$ and l1_span_thresh=$10^{-4}$).
By default, the algorithm is run on the Rotten Tomatoes dataset with a batch size of 16.
- AVG_EPOCHS - How many training iterations of the FedAvg algorithm are run.
- B_MINI - The minibatch size.
- LEARNING_RATE - The learning rate for the single SGD step.
To run DAGER on GPT-2 with the FedAvg algorithm:
./scripts/fed_avg.sh AVG_EPOCHS B_MINI LEARNING_RATE
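For reference, the sketch below shows, in simplified PyTorch, how the three parameters interact in a FedAvg-style local update. Function and variable names are illustrative and do not mirror the actual implementation behind fed_avg.sh.

```python
# Illustrative FedAvg client update: AVG_EPOCHS local passes of SGD with
# minibatches of size B_MINI and step size LEARNING_RATE. Names are illustrative.
import torch

def local_update(model, loss_fn, batch, avg_epochs, b_mini, learning_rate):
    optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)
    inputs, labels = batch
    for _ in range(avg_epochs):
        # iterate over the client batch in minibatches of size b_mini
        for i in range(0, len(inputs), b_mini):
            optimizer.zero_grad()
            loss = loss_fn(model(inputs[i:i + b_mini]), labels[i:i + b_mini])
            loss.backward()
            optimizer.step()  # one SGD step per minibatch
    return model.state_dict()  # updated weights shared with the server
```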
We completed this study by logging our results in Neptune. The effect of the rank thresholding can be retrieved by setting the scores of all runs with logs/batch_tokens > <model_embedding_dim> to 0 and recomputing the aggregate statistics.
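As a sketch, assuming the Neptune runs table has been exported to a CSV with (hypothetical) columns logs/batch_tokens and score, the post-processing amounts to:

```python
# Hypothetical post-processing of an exported Neptune runs table; the column
# names and file name are assumptions and may differ from the actual logging schema.
import pandas as pd

runs = pd.read_csv("neptune_runs_export.csv")  # assumed export of the runs table
embedding_dim = 768  # e.g. GPT-2 base

# Zero out the score of any run whose token count exceeds the embedding dimension,
# then recompute the aggregate statistics.
runs.loc[runs["logs/batch_tokens"] > embedding_dim, "score"] = 0.0
print(runs["score"].mean())
```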
To run a test on the effect of the span check filter across various thresholds:
python ./token_filtering.py
This study concerns finding the optimal rank threshold and showcases what happens when it is set too high or too low. It requires the ECHR dataset - see the Setting up HuggingFace section for setup instructions.
- RANK_CUTOFF - the amount by which we lower the maximum rank with respect to the embedding dimension before applying the cutoff. For example, for RANK_CUTOFF=20 and an embedding dimension of 768, the maximum rank becomes 748.
./scripts/dec_feasibility.sh RANK_CUTOFF
In this study we examine how DAGER performs on encoders with and without heuristics across different sequence lengths at batch size 4.
- END_INPUT - at what sequence length the search should finish
To run the study on BERT with all heuristics:
./scripts/enc_feasibility.sh 4 END_INPUT
To run the study on BERT with no heuristics:
./scripts/enc_feasibility_no_heuristics.sh 4 END_INPUT
In order to fine-tune GPT-2, you need scikit-learn as an additional prerequisite. Simply run either:
pip install scikit-learn
or
conda install -c conda-forge scikit-learn
and follow the instructions below.
- DATASET - the dataset to use. Must be one of cola, sst2, rotten_tomatoes.
- SIGMA - the amount of Gaussian noise with which to train, e.g., 0.001. To train without the defense, set it to 0.0 (see the sketch after the training command below).
- NUM_EPOCHS - the number of epochs to train for, e.g., 2.
- To train your own network:
python3 train.py --dataset DATASET --batch_size 32 --noise SIGMA --num_epochs NUM_EPOCHS --save_every 100 --model_path MODEL
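For context, the SIGMA defense corresponds to perturbing the gradients with Gaussian noise before each optimizer step. The sketch below shows the idea in plain PyTorch; it is illustrative and not the exact implementation in train.py.

```python
# Sketch of the Gaussian-noise defense: add noise of standard deviation sigma
# to every gradient before the optimizer step. Illustrative only.
import torch

def noisy_step(model, optimizer, loss, sigma):
    optimizer.zero_grad()
    loss.backward()
    if sigma > 0.0:
        for p in model.parameters():
            if p.grad is not None:
                p.grad.add_(torch.randn_like(p.grad) * sigma)
    optimizer.step()
```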
The models are stored under finetune/<TRAINING_METHOD>_<EPOCH>, where TRAINING_METHOD can be either lora or full, and EPOCH is the corresponding checkpoint.