Problem reproducing baichuan2 MMLU results #194

zhanghan1992 · 2023-11-02T09:59:48Z

Evaluation code used: https://github.com/baichuan-inc/Baichuan-7B/blob/main/evaluation/evaluate_mmlu.py

Results for llama2-13-hf and baichuan2-13b-base tested with bf16 precision:
llama2-13-hf: 0.550
baichuan2-13b-base: 0.564

I then changed one line of code to test with fp32:
#model = AutoModelForCausalLM.from_pretrained(args.model, torch_dtype=torch.bfloat16, device_map="auto",trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(args.model, device_map="auto",trust_remote_code=True)
llama2-13-hf: 0.554
baichuan2-13b-base: 0.590

Could someone explain why the baichuan2 results differ so much between bf16 and fp32 precision?
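For context, here is a minimal sketch of one way reduced precision could in principle change a multiple-choice prediction. It is not a confirmed explanation for the gap above. It emulates bfloat16 rounding with the standard library only and applies it to made-up logits for the choices A to D. The helper names `to_bf16` and `argmax` and the logit values are hypothetical and do not come from evaluate_mmlu.py. In a real model the rounding error also accumulates across layers, which this sketch does not attempt to show.

```python
import struct


def to_bf16(x: float) -> float:
    """Round x to the nearest bfloat16 value (round-to-nearest-even).

    Assumes x is a finite, normal number; NaN/inf handling is omitted.
    """
    # Reinterpret the float32 encoding of x as a 32-bit unsigned integer.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # bfloat16 keeps only the upper 16 bits of a float32; round the rest away.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    bits = ((bits + rounding_bias) & 0xFFFFFFFF) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def argmax(values):
    """Index of the largest value; ties resolve to the first index."""
    return max(range(len(values)), key=lambda i: values[i])


# Hypothetical logits for choices A, B, C, D (illustrative values only).
logits_fp32 = [12.51, 12.53, 9.0, 8.2]
logits_bf16 = [to_bf16(v) for v in logits_fp32]

if __name__ == "__main__":
    print("fp32 logits:", logits_fp32, "-> choice", "ABCD"[argmax(logits_fp32)])
    print("bf16 logits:", logits_bf16, "-> choice", "ABCD"[argmax(logits_bf16)])
```

Between 8 and 16, adjacent bfloat16 values are 0.0625 apart, so the two leading logits collapse to the same value and the selected choice changes. Whether anything like this accounts for the 0.564 vs 0.590 difference would need to be checked against the model's actual logits.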
