Install Python 3.8 first.
To install the required libraries for our repo, run:
pip install -r requirements.txt
To get a PyTorch build matched to your CUDA installation, install it before running the above command.
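For example, a CUDA 11.8 build can typically be installed like this (check pytorch.org for the exact command matching your CUDA version):
pip install torch --index-url https://download.pytorch.org/whl/cu118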
To install Java using conda, run:
conda install openjdk=11
Check out the TODO section in the doc.
Currently, only BM25 retrieval is supported; however, it was shown to perform best in In-Context RALM.
Java 11 is required to run BM25 retrieval with Pyserini.
python prepare_retrieval_data.py \
--retrieval_type sparse \
--tokenizer_name $MODEL_NAME \
--max_length 1024 \
--dataset_path wikitext \
--dataset_name wikitext-103-v1 \
--dataset_split [validation, test] \
--index_name wikipedia-dpr \
--forbidden_titles_path ralm/retrievers/wikitext103_forbidden_titles.txt \
--stride 4 \
--output_file $RETRIEVAL_FILE \
--num_tokens_for_query 32 \
--num_docs 16
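Here, $MODEL_NAME and $RETRIEVAL_FILE are shell variables you set yourself; for example (hypothetical values):
export MODEL_NAME=gpt2
export RETRIEVAL_FILE=retrieval_wikitext103_validation.json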
All reranking requires a retrieval file. First run retrieval and then run reranking.
- ZS-Col-LLMs: gpt2, gpt2-medium, gpt2-large, gpt2-xl, bert-base-uncased
- ColBERT:
- BERT:
- Contriever:
Zero-Shot Col-LLMs uses any pretrained LM as a reranker and computes query-document similarity with the MaxSim operator. It can rerank documents using either intermediate-layer embeddings or attention-projection embeddings.
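As a minimal illustration of the MaxSim operator (a sketch of standard late-interaction scoring, not necessarily the repo's exact implementation; the cosine normalization is an assumption):

import torch

def maxsim_score(query_emb: torch.Tensor, doc_emb: torch.Tensor) -> torch.Tensor:
    # query_emb: (num_query_tokens, dim); doc_emb: (num_doc_tokens, dim).
    # Normalizing makes the dot products cosine similarities (an assumption here).
    q = torch.nn.functional.normalize(query_emb, dim=-1)
    d = torch.nn.functional.normalize(doc_emb, dim=-1)
    sim = q @ d.T  # token-level similarity matrix, (num_query_tokens, num_doc_tokens)
    # For each query token, keep its best-matching document token, then sum.
    return sim.max(dim=-1).values.sum()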
To run reranking with Zero-Shot Col-LLMs:
python rerank_retrieval_data.py \
--reranking_type zs-llms \
--model_name gpt2 \
--batch_size 1 \
--output_file $OUTPUT_DIR \
--retrieved_file $RETRIEVAL_FILE \
--max_length 256 \
--num_docs_to_rank 16
Rather than using the embedding dimensions directly, we can also use the attention matrices to project embeddings into subspaces, which can then be used to rerank the documents. To do this, pass the --attention flag.
python rerank_retrieval_data.py \
--reranking_type zs-llms \
--attention \
--model_name gpt2 \
--batch_size 1 \
--output_file $OUTPUT_DIR \
--retrieved_file $RETRIEVAL_FILE \
--max_length 256 \
--num_docs_to_rank 16
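One plausible reading of the --attention variant above (a hypothetical sketch; the choice of query/key weights and the exact projection the repo uses are assumptions): hidden states are projected through an attention head's query/key matrices, and the MaxSim interaction is computed in that head's subspace.

import torch

def attention_maxsim(query_h, doc_h, w_q, w_k):
    # query_h: (Lq, dim); doc_h: (Ld, dim); w_q, w_k: (dim, head_dim).
    q = query_h @ w_q  # project query hidden states into the head's subspace
    k = doc_h @ w_k    # project document hidden states likewise
    sim = q @ k.T      # (Lq, Ld) similarities in the projected subspace
    return sim.max(dim=-1).values.sum()  # same MaxSim reduction as before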
The --similarity flag can be set to either max or avg, to run MaxSim or AvgSim, respectively.
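In terms of the token-level similarity matrix sim from the MaxSim sketch above, the two options differ only in how document tokens are reduced (the AvgSim form here is an assumption based on the name):

max_sim = sim.max(dim=-1).values.sum()  # MaxSim: best document token per query token
avg_sim = sim.mean(dim=-1).sum()        # AvgSim (assumed): average over document tokens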
We can also use an out-of-domain ColBERT-style reranker trained on MS MARCO. While Zero-Shot Col-LLMs can use any model, this method specifically relies on LMs pretrained with the ColBERT objective. These towers are BERT-based.
To run reranking with the ColBERT objective:
BERT reranking uses the more common coarse-grained reranking with a pretrained BERT tower.
To run reranking with the BERT objective:
Contriever is another coarse-grained reranking method, introduced in this paper.
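For reference, a minimal sketch of Contriever-style coarse-grained scoring, where each text is embedded as a single mean-pooled vector and relevance is a dot product (using the public facebook/contriever checkpoint; this illustrates the method, not necessarily the repo's exact code):

import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("facebook/contriever")
model = AutoModel.from_pretrained("facebook/contriever")

def embed(texts):
    # Mean-pool the last hidden states over non-padding tokens.
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state        # (B, L, dim)
    mask = batch["attention_mask"].unsqueeze(-1)         # (B, L, 1)
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)  # (B, dim)

query_emb = embed(["when was wikipedia launched?"])
doc_embs = embed(["Wikipedia was launched in 2001.", "BM25 is a ranking function."])
scores = (query_emb @ doc_embs.T).squeeze(0)  # higher = more relevant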
To run reranking with a Contriever Model:
- GPT-2: gpt2, gpt2-medium, gpt2-large, gpt2-xl
- GPT-Neo: EleutherAI/gpt-neo-1.3B, EleutherAI/gpt-neo-2.7B, EleutherAI/gpt-j-6B
- OPT: facebook/opt-125m, facebook/opt-350m, facebook/opt-1.3b, facebook/opt-2.7b, facebook/opt-6.7b, facebook/opt-13b, facebook/opt-30b, facebook/opt-66b
To run evaluation on models without retrieval, please use the following command (you can increase stride to 32 for faster evaluation):
python eval_lm.py \
--model_name $MODEL_NAME \
--dataset_path wikitext \
--dataset_name wikitext-103-v1 \
--dataset_split [validation, test] \
--output_dir $OUTPUT_DIR \
--stride 4 \
--max_length 1024 \
[--model_parallelism]
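For intuition on stride: evaluation uses a sliding window of max_length tokens, and each step scores only the tokens that are new since the previous window, so a larger stride means fewer forward passes (faster) at the cost of less left context per scored token on average. A simplified sketch of this standard sliding-window loop (assumed to mirror eval_lm.py in spirit, not in detail):

import torch

def sliding_window_nll(model, input_ids, max_length=1024, stride=4):
    # input_ids: (1, seq_len). Each window covers up to max_length tokens;
    # labels of -100 mask everything except the newly scored tokens.
    seq_len = input_ids.size(1)
    nll_sum, n_scored, prev_end = 0.0, 0, 0
    for begin in range(0, seq_len, stride):
        end = min(begin + max_length, seq_len)
        trg_len = end - prev_end  # number of newly scored tokens in this window
        window = input_ids[:, begin:end]
        targets = window.clone()
        targets[:, :-trg_len] = -100  # score only the last trg_len tokens
        with torch.no_grad():
            loss = model(window, labels=targets).loss  # mean NLL over scored tokens
        nll_sum += loss.item() * trg_len  # approximate; ignores window-boundary off-by-one
        n_scored += trg_len
        prev_end = end
        if end == seq_len:
            break
    return nll_sum / n_scored  # average NLL per token; perplexity = exp of this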
To run models with retrieval, use the $RETRIEVAL_FILE output from the prepare_retrieval_data.py script:
python eval_lm.py \
--model_name $MODEL_NAME \
--dataset_path wikitext \
--dataset_name wikitext-103-v1 \
--dataset_split [validation, test] \
--output_dir $OUTPUT_DIR \
--stride 4 \
--max_length 1024 \
[--model_parallelism] \
--retrieved_file $RETRIEVAL_FILE
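Conceptually, following In-Context RALM, retrieval augmentation just prepends the retrieved passage to the left context of each scored window. A hedged sketch (the truncation policy here is illustrative; the repo may handle overflow differently):

import torch

def prepend_retrieved(tokenizer, doc_text, window_ids, max_length=1024):
    # Tokenize the retrieved passage and place it before the window's left context.
    doc_ids = tokenizer(doc_text, return_tensors="pt").input_ids
    # Illustrative budget: keep the passage intact and trim the oldest window
    # context to fit (assumes the passage itself fits within max_length).
    budget = max_length - doc_ids.size(1)
    return torch.cat([doc_ids, window_ids[:, -budget:]], dim=1)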
Note: Our main retrieval flow assumes you want to use the top-scored passage from your retrieval file (--ranking_strategy first).
To run models with reranking, use the $RERANK_FILE output from the rerank_retrieval_data.py script.
Then run:
python eval_lm.py \
--model_name $MODEL_NAME \
--dataset_path wikitext \
--dataset_name wikitext-103-v1 \
--dataset_split [validation, test] \
--output_dir $OUTPUT_DIR \
--stride 4 \
--max_length 1024 \
[--model_parallelism] \
--retrieved_file $RERANK_FILE \
--ranking_strategy first-rerank \
--layer -1 \
--num_docs_to_rank 16 \
--ranking_logprob_past_tokens 16
Note: The reranked file does not store documents in sorted order, so --ranking_strategy first-rerank dynamically finds the top-scoring document. The --layer flag is only used by ranking strategies that require a layer.
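In other words, the top document is chosen at evaluation time, conceptually something like this (the retrieved_docs and rerank_score field names are illustrative, not necessarily the file's actual schema):

top_doc = max(example["retrieved_docs"], key=lambda d: d["rerank_score"])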
The summarize_results.py script can be used to summarize runs and create graphics.
This codebase was forked from In-Context RALM.