
Case-Based or Rule-Based: How Do Transformers Do the Math?

In this work, we explore whether LLMs perform case-based or rule-based reasoning.

⭐ Official code for our paper "Case-Based or Rule-Based: How Do Transformers Do the Math?" (arXiv:2402.17709).

Requirements

A tested combination of Python packages that runs the code successfully is listed in requirements.txt. Install them with:

pip install -r requirements.txt

Replication of Leave-Square-Out

To replicate our main Leave-Square-Out experiments, first download the GPT-2 or GPT-2 Medium models and place them in ./pretrained_models. Then run train.py to fine-tune the pre-trained models.
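
For convenience, here is a minimal sketch of one way to fetch the checkpoints, assuming the standard gpt2 and gpt2-medium repositories on the Hugging Face Hub; the directory layout below mirrors the path above, so adjust it if train.py expects something different.

# Minimal sketch: download GPT-2 checkpoints from the Hugging Face Hub
# into ./pretrained_models (the Hub repo ids "gpt2" and "gpt2-medium"
# are assumptions; adapt the layout to whatever train.py expects).
from huggingface_hub import snapshot_download

for model_id in ("gpt2", "gpt2-medium"):
    snapshot_download(repo_id=model_id, local_dir=f"./pretrained_models/{model_id}")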

Datasets

We provide the datasets for our main experiments in ./datasets. Each dataset directory contains a figure, data_split.png, showing its train-test split.
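
As a rough illustration of what such a split looks like, the sketch below holds out a square block of the two-operand addition grid as the test set. The grid size and square position are arbitrary choices for the example, not the settings used in the paper; see data_split.png in each dataset for the actual splits.

# Sketch of a leave-square-out split on an a + b grid: every (a, b) pair
# inside a held-out square goes to the test set, everything else to training.
# The bounds here (operands 0-99, a 30x30 square starting at (40, 40)) are
# illustrative only.
train, test = [], []
for a in range(100):
    for b in range(100):
        example = f"{a}+{b}={a + b}"
        if 40 <= a < 70 and 40 <= b < 70:  # inside the held-out square
            test.append(example)
        else:
            train.append(example)

print(len(train), len(test))  # 9100 train / 900 test for these bounds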

Llama

We adopt the FastChat framework to fine-tune Llama-7B; see ./llama for the scripts.

Citation

If you use this code in your research, please cite our paper:

@misc{hu2024casebased,
      title={Case-Based or Rule-Based: How Do Transformers Do the Math?},
      author={Yi Hu and Xiaojuan Tang and Haotong Yang and Muhan Zhang},
      year={2024},
      eprint={2402.17709},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}