Misaligning Reasoning with Answers - A Framework for Assessing LLM CoT Robustness [arXiv]
Enyi Jiang, Changming Xu, Nischay Singh, Gagandeep Singh
We recommend first creating a conda environment, activating it, and installing the dependencies from the provided requirements.txt:
conda create --name MATCHA
conda activate MATCHA
pip install -r requirements.txt
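As an optional sanity check after installation (this assumes PyTorch and Hugging Face Transformers are among the pinned requirements, which is not confirmed here), verify that both import cleanly:
python -c "import torch, transformers; print(torch.__version__, transformers.__version__)"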
Run the embedding-space attack on an open-source model:
python inference_emb_attack.py --dataset="gsm8k" --model="llama3-8B" --method="few_shot_cot" --qes_limit=0 --prompt_path="./basic_cot_prompts/math_word_problems" --random_seed=42 --multipath=1 --basic_cot True
Run the token-level attack:
python inference_tok_random_dual.py --dataset="gsm8k" --model="llama3-8B" --method="few_shot_cot" --qes_limit=0 --prompt_path="./basic_cot_prompts/math_word_problems" --random_seed=42 --multipath=1 --basic_cot True
Transfer attacks to closed-source models (queried via API):
python close_source_transfer.py --dataset="gsm8k" --model="gpt-3.5-turbo" --model2='deepseek' --method="few_shot_cot" --qes_limit=0 --prompt_path="basic_cot_prompts/math_word_problems" --random_seed=42 --multipath=1 --temperature=0.7 --basic_cot True --api_time_interval=2
Arguments:
--dataset: the dataset to evaluate on. Choices: [gsm8k, strategyqa, singleeq]
--model: the open-source model. Choices: ["llama3-8B", "mistral", "zephyr", "qwen", "deepseek"]
--method: the prompting method (few_shot_cot)
--qes_limit: the number of test questions
--prompt_path: the path to the prompt file
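For a quick smoke test, any model from the choices above can be substituted and the run can be shortened with a small --qes_limit (hypothetical values below; this assumes a positive --qes_limit caps the number of evaluated questions):
python inference_emb_attack.py --dataset="gsm8k" --model="mistral" --method="few_shot_cot" --qes_limit=10 --prompt_path="./basic_cot_prompts/math_word_problems" --random_seed=42 --multipath=1 --basic_cot True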
Parts of the code in this repo are based on
If you find this work useful, please cite the paper/repo:
@article{jiang2025misaligning,
title={Misaligning Reasoning with Answers--A Framework for Assessing LLM CoT Robustness},
author={Jiang, Enyi and Xu, Changming and Singh, Nischay and Singh, Gagandeep},
journal={arXiv preprint arXiv:2505.17406},
year={2025}
}