- HF LLaMA: https://github.com/huggingface/transformers/tree/main/src/transformers/models/llama
- Annotated BERT: https://github.com/w32zhong/annotated-bert
Example inference run and sample output:
python inference.py ~/llama-models/7B-hgf-new/ --debug=False
Creating model ...
Loading model shard: pytorch_model-00002-of-00002.bin
Loading model shard: pytorch_model-00001-of-00002.bin
Prompt: My name is Mariama, my favorite
2016 film is La La Land and my favorite food is chocolate chip cookies. I love being active and
am always looking for new things to do around Chicago. I am currently a junior majoring in
Communication with a focus in Strategic Communication and a minor in Spanish. After graduation,
I plan to move to a city with a good public transportation system, get a job and enjoy life. I
am so excited to be a part of the Communication Interns this summer and look forward to learning
about the industry and developing skills that will help me in the future.
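For reference, here is a minimal sketch of what such an inference script might boil down to, using the HF LLaMA classes linked above. The checkpoint path and prompt are taken from the run above; the sampling parameters are assumptions, and `device_map="auto"` assumes the `accelerate` package is installed:

```python
# Minimal generation sketch, assuming an HF-format LLaMA checkpoint
# at ~/llama-models/7B-hgf-new/ (the path used in the run above).
import os
import torch
from transformers import LlamaForCausalLM, LlamaTokenizer

model_path = os.path.expanduser("~/llama-models/7B-hgf-new/")
tokenizer = LlamaTokenizer.from_pretrained(model_path)
model = LlamaForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.float16,  # fp16 weights to fit a 7B model on one GPU
    device_map="auto",          # requires the accelerate package
)

prompt = "My name is Mariama, my favorite"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128, do_sample=True, top_p=0.9)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```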
wandb login  # authenticate Weights & Biases for experiment logging
conda create --name llmm -c conda-forge python=3.8
conda activate llmm
if true; then  # flip to false to install vLLM (the else branch) instead of flash-attn
pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu118
python -c 'import torch; print(torch.cuda.is_available())'  # is a CUDA GPU visible?
python -c 'import torch; print(torch.version.cuda)'  # CUDA version torch was built against
python -c 'import sys; print(sys.version)'  # Python version
python -c 'import torch; print(torch.backends.cudnn.enabled)'  # cuDNN enabled?
python -c 'import torch; print(torch.__version__)'  # torch version
python -c 'import torch; d = torch.device("cuda"); print(torch.cuda.get_device_properties(d))'  # GPU name, memory, compute capability
python -c 'import torch; print(torch.cuda.get_arch_list())'  # CUDA architectures this torch build supports
conda install cuda -c nvidia/label/cuda-11.8.0  # CUDA toolkit; must match the torch build (cu118 above)!
pip3 install packaging  # build-time dependency of flash-attn
unset CUDA_HOME  # avoid a stale CUDA_HOME shadowing the toolkit just installed
pip3 install flash-attn==2.3.0
else
pip install vllm  # alternative: use vLLM as the inference backend
fi
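If the flash-attn branch was taken, a quick smoke test confirms the fused kernel actually runs on the GPU. The sketch below exercises flash-attn's public `flash_attn_func`; the tensor shapes are illustrative:

```python
# flash-attn smoke test: one fused attention call on the GPU.
# Assumes a CUDA device and the flash-attn 2.x install from above.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 128, 8, 64  # illustrative shapes
q, k, v = (
    torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
    for _ in range(3)
)
out = flash_attn_func(q, k, v, causal=True)  # output has the same shape as q
print(out.shape)
```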
pip3 install transformers==4.33.1
pip3 install deepspeed==0.10.3
pip3 install peft==0.4.0
pip3 install -r requirements.txt
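With peft==0.4.0 in place, LoRA adapters can be attached to the model in a few lines. A sketch follows; the rank, alpha, and target modules are illustrative choices for LLaMA, not settings taken from this repo, and the checkpoint path is hypothetical:

```python
# Sketch: wrap a LLaMA model with LoRA adapters using peft.
from peft import LoraConfig, TaskType, get_peft_model
from transformers import LlamaForCausalLM

model = LlamaForCausalLM.from_pretrained("path/to/7B-hgf-new")  # hypothetical path
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # illustrative adapter rank
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # LLaMA attention projections
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```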
git submodule init
git submodule update
cd ..
git clone [email protected]:w32zhong/Progressive-Hint.git
git clone [email protected]:hendrycks/math.git
For Slurm usage, see the instructions at https://watgpu.cs.uwaterloo.ca/slurm.html or https://docs.alliancecan.ca/wiki/Using_GPUs_with_Slurm.
To see the time limit or allocated resources for a job, find its job ID with squeue, then query it with scontrol:
squeue  # list queued and running jobs (shows job IDs)
scontrol show job -dd 483 | grep TimeLimit  # time limit for job 483
scontrol show job -dd 479 | grep TRES=  # allocated resources (GPUs, CPUs, memory) for job 479
salloc --gres=gpu:5 --cpus-per-task=8 --mem=250G --time=20:00:00  # interactive allocation: 5 GPUs, 8 CPUs, 250 GB RAM, 20 hours
sacct --starttime=2023-10-18  # accounting for jobs since 2023-10-18, with final states (COMPLETED, FAILED, CANCELLED, ...)
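For non-interactive runs, the same resources can be requested from a batch script. sbatch parses the `#SBATCH` comment directives regardless of the interpreter, so a Python script can be submitted directly with `sbatch job.py`; the resource values below are placeholders, not this project's settings:

```python
#!/usr/bin/env python
#SBATCH --job-name=llmm-job     # placeholder job name
#SBATCH --gres=gpu:1
#SBATCH --cpus-per-task=8
#SBATCH --mem=64G
#SBATCH --time=02:00:00

# The body runs on the allocated compute node; this one just checks
# that the GPU requested above is actually visible to torch.
import torch

print("CUDA available:", torch.cuda.is_available())
print("visible GPUs:", torch.cuda.device_count())
```

Once submitted, the job can be inspected with the squeue/scontrol commands above.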