The code in this repository supplements our recent paper that is published at IJCLR 2024 and available on arXiv: https://arxiv.org/abs/2409.00861
We integrated stark_main in this directory from https://github.com/snap-stanford/stark/tree/main for benchmarking.
We added bridge_to_llm_consultant.py to stark_main -> models and added it in stark_main -> models -> init.py in this directory. 4StepFocus can be used directly without the stark_main as in main.py in the root directory.
Add a file .env
with your OpenAI api key OPENAI_API_KEY=sk-XXXXXXXXXXXXXXXXXXXXXXXXXX
in the main directory.
See https://github.com/snap-stanford/stark/tree/main for instructions how to download benchmarking data and candidate embeddings.
Install requirements from requirements.txt
python -m stark_main.eval
For example:
python -m stark_main.eval --dataset mag --model LLMConsultant --split test --output_dir output --llm_model gpt-4o-2024-05-13 --emb_dir emb --llm_topk 20 --max_retry 2 --save_pred --test_ratio 0.1
python -m main
Configurations can to be set in main function in main.py in this root directory.