StructRAG

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

0. Environment

python 3.8.19
vllm 0.6.3.post1
pip install -r requirement.txt

1. Data Preparation

please follow Loong/README.md

2. StructRAG Inference

# 1. launch llm api server
model_path = "/mnt/data/lizhuoqun/hf_models/Qwen2-72B-Instruct"
CUDA_VISIBLE_DEVICES=0,1,2,3 && OUTLINES_CACHE_DIR=tmp && nohup python -m vllm.entrypoints.openai.api_server --model ${model_path} --served-model-name Qwen --tensor-parallel-size 4 --port 1225 --disable-custom-all-reduce > vllm.log
# 2. run StructRAG
python main.py --url {url_of_api_server} # output will be in ./eval_results/qwen/loong
# 3. transform model output to Loong results format
python do_merge_each_batch.py # results will be in ./Loong/output/qwen

3. Results Evaluation

cd Loong/src && bash run.sh

4. Router Training (optional)

Qwen2-72B-Instruct has already achieved good routing performance under the few-shot examples setting. If wish to further improve routing accuracy, we can train the 7B model using the DPO algorithm:

bash train_router/train.sh

After training, deploy the output model as an API using vllm, and obtain url_of_router. When running StructRAG, use the following command:

python main.py --url {url_of_api_server} --router_url {url_of_router}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Loong		Loong
prompts		prompts
train_router		train_router
utils		utils
.gitignore		.gitignore
README.md		README.md
do_merge_each_batch.py		do_merge_each_batch.py
main.py		main.py
requirements.txt		requirements.txt
router.py		router.py
structurizer.py		structurizer.py
utilizer.py		utilizer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StructRAG

0. Environment

1. Data Preparation

2. StructRAG Inference

3. Results Evaluation

4. Router Training (optional)

About

Releases

Packages

Languages

icip-cas/StructRAG

Folders and files

Latest commit

History

Repository files navigation

StructRAG

0. Environment

1. Data Preparation

2. StructRAG Inference

3. Results Evaluation

4. Router Training (optional)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages