llm-mcts

This project explores Monte Carlo Tree Search (MCTS) for code-generation tasks. We first test our method on the Human-Eval dataset, then extend it to the Verilog-Eval dataset. For a detailed explanation of the experiments, see the accompanying blog at /web/index.html, also hosted at https://localhost:3000/web/index.html.
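As a rough illustration of the selection step that MCTS is usually built around, here is a minimal sketch assuming the standard UCB1 rule. The `Node` class, its fields, and the exploration constant `c` are hypothetical names chosen for this example; the code in src/mcts.py may be organised quite differently.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical tree node: one partial program plus visit statistics."""
    text: str
    visits: int = 0
    total_reward: float = 0.0
    children: list["Node"] = field(default_factory=list)


def ucb1(parent: Node, child: Node, c: float = 1.4) -> float:
    """UCB1 score: mean reward plus an exploration bonus."""
    if child.visits == 0:
        return math.inf  # unvisited children are always tried first
    mean = child.total_reward / child.visits
    return mean + c * math.sqrt(math.log(parent.visits) / child.visits)


def select_child(parent: Node) -> Node:
    """Pick the child with the highest UCB1 score."""
    return max(parent.children, key=lambda ch: ucb1(parent, ch))
```

In a code-generation setting the reward would typically come from running the candidate program against its tests, so a rarely visited branch can still be selected over one with a higher average reward.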

Env Setup

This project uses conda to manage its Python environment and packages. To install all required libraries, run the following:

conda env create -f environment.yml
conda activate llm-mcts

Human-Eval

We use a modified human-eval dataset/environment from https://github.com/arunpatro/human-eval. This fork contains updated code for Python 3.10 and also extends the error feedback to include the traceback.

git clone https://github.com/arunpatro/human-eval
pip install -e ./human-eval

Check out nbs/humaneval.ipynb for a demo.

Run experiments

python src/baselines.py
python src/mcts.py

Verilog-Eval

We use a modified verilog-eval dataset/environment from https://github.com/arunpatro/verilog-eval. This fork contains updated code for Python 3.10 and also extends the error feedback to include the traceback, along with vcdcat for further waveform analysis.

git clone https://github.com/arunpatro/verilog-eval
pip install -e ./verilog-eval
git clone https://github.com/cirosantilli/vcdvcd
pip install -e ./vcdvcd

Set up Icarus Verilog

Executing tests from the verilog-eval dataset requires a local installation of iverilog. Follow the installation steps for your platform to set it up. Once this is done, run the following to verify that everything is working correctly:

iverilog -V
vvp -V
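The same check can be done from Python before launching a long experiment. The following is a minimal sketch using only the standard library; the helper names `find_tools` and `version_banner` are hypothetical and are not part of this repository.

```python
import shutil
import subprocess


def find_tools(names=("iverilog", "vvp")):
    """Map each tool name to its location on PATH, or None if it is missing."""
    return {name: shutil.which(name) for name in names}


def version_banner(tool):
    """Return the first line printed by `<tool> -V`, or None if unavailable."""
    path = shutil.which(tool)
    if path is None:
        return None
    result = subprocess.run(
        [path, "-V"], capture_output=True, text=True, timeout=10
    )
    output = (result.stdout or result.stderr).strip()
    return output.splitlines()[0] if output else None


if __name__ == "__main__":
    for tool, path in find_tools().items():
        print(f"{tool}: {path or 'NOT FOUND'}")
```

If either tool is reported as missing, fix the Icarus Verilog installation before running the Verilog experiments.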

Run experiments

PYTHONPATH="./verilog" python src/baselines.py verilog
PYTHONPATH="./verilog" python src/mcts.py verilog

