A Retrieval-Augmented Generation (RAG) system for question answering over a code repository, developed as part of an application for a JetBrains internship.
Prerequisites:
- Python 3.9+
- Conda (for environment management)
Clone the repository and install dependencies:
```bash
git clone https://github.com/AStroCvijo/llm_listwise_reranker_for_coderag.git
cd llm_listwise_reranker_for_coderag
conda create -n rag python=3.9
conda activate rag
pip install -r requirements.txt --index-url https://download.pytorch.org/whl/cu124 --extra-index-url https://pypi.org/simple --extra-index-url https://pypi.ngc.nvidia.com
```
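Since only OpenAI models are currently implemented (see below), an OpenAI API key is presumably required before running the system. It would typically be supplied via an environment variable; the exact variable name is an assumption here, so check the code for the configuration it expects:

```bash
export OPENAI_API_KEY="sk-..."  # assumed variable name; adjust to match the project's configuration
```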
To run the application in UI mode:

```bash
python main.py -ui
```
To evaluate the model on the provided test set:
```bash
chmod +x eval.sh
./eval.sh
```
The following command-line arguments are available:

| Argument | Description | Default | Options |
|---|---|---|---|
| `-k`, `--top_k` | Number of results retrieved per query | 30 | Any positive integer |
| `-cs`, `--chunk_size` | Text chunk size for indexing | 1200 | Any positive integer |
| `-co`, `--chunk_overlap` | Overlapping tokens between chunks | 200 | Any non-negative integer |
| `-ls`, `--llm_summary` | Enable document summarization | True | True, False |
| `-em`, `--embedding_model` | Embedding model selection | text-embedding-3-large | text-embedding-3-large, text-embedding-3-small, text-embedding-ada-002 |
| `-m`, `--llm` | LLM model to use | gpt-4o-mini | gpt-3.5-turbo, gpt-4o-mini |
| `-ui`, `--user_interface` | Enable UI mode | False | True, False |
| `-v`, `--verbose` | Enable debugging output | False | True, False |
| `-e`, `--eval` | Enable evaluation mode | False | True, False |
| `-ru`, `--repo_url` | Repository URL for indexing | https://github.com/viarotel-org/escrcpy | Any valid repo URL |
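For example, a run that indexes the default repository with smaller chunks and retrieves 20 results per query might look like the following (whether the boolean flags expect explicit `True`/`False` values depends on how the argument parser is set up):

```bash
python main.py -ru https://github.com/viarotel-org/escrcpy -k 20 -cs 1000 -co 150
```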
For detailed experiments and evaluations of this RAG system, please refer to the following document:

It provides comprehensive insights into the performance, benchmarks, and configurations tested during development of the LLM Listwise Reranker for CodeRAG, including comparisons of embedding models, chunk sizes, and other parameters, to help you understand the system's capabilities and tune it for your specific use case.
Currently, `utils/handlers.py` contains the handlers for the LLM and embedding models, but only OpenAI models are implemented. To use a different LLM or embedding model, modify the logic in `utils/handlers.py` accordingly; a sketch of one possible extension follows the note below.
Note: The LLMs used should be compatible with LangChain and, preferably, support structured output. Additionally, they should have a context length of at least 16,385 tokens for optimal performance.
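As an illustration, here is a minimal sketch of what a Hugging Face extension for the embedding side might look like, assuming a handler that maps a model name to a LangChain embeddings object. The actual function names and signatures in `utils/handlers.py` may differ:

```python
# Hypothetical sketch; the real handler structure in utils/handlers.py may differ.
from langchain_openai import OpenAIEmbeddings
from langchain_huggingface import HuggingFaceEmbeddings  # requires: pip install langchain-huggingface


def get_embedding_model(name: str):
    """Return a LangChain embeddings object for the given model name."""
    if name.startswith("text-embedding"):
        # OpenAI embedding models, as currently supported by the project
        return OpenAIEmbeddings(model=name)
    # Assumed extension: treat any other name as a Hugging Face model id,
    # e.g. "sentence-transformers/all-MiniLM-L6-v2"
    return HuggingFaceEmbeddings(model_name=name)
```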
Future work:
- Implement proper contextual embedding
- Hugging Face model handling