This project fine-tunes the Llama-3-8B-Instruct model with 4-bit quantization on a financial question-answering dataset, in order to improve the model's ability to answer financial consulting questions.
According to the usage tips for Llama 3, training the model in float16 is not recommended and is known to produce NaNs; the model should therefore be trained in bfloat16.
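As an illustration of this point, the base model could be loaded for 4-bit training with bfloat16 as the compute dtype. This is a minimal sketch using transformers and bitsandbytes, not the exact configuration that train.py builds from loftq_4bit.yaml:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 4-bit NF4 quantization with bfloat16 compute, per the Llama 3 usage tip above.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,  # avoid float16, which can produce NaNs
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
```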
Request access to the Llama-3-8B-Instruct model, then fill in your Hugging Face token in the loftq_4bit.yaml and benchmark.sh files.
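To confirm that the token has actually been granted access to the gated repository, a quick check along the following lines can be run first. This verification step is an optional suggestion, not part of the project scripts, and "hf_xxx" is a placeholder for your real token:

```python
from huggingface_hub import login, snapshot_download

# Log in with the same token you place in loftq_4bit.yaml and benchmark.sh.
login(token="hf_xxx")  # placeholder token

# Fetching a single small file succeeds only if gated access was granted.
snapshot_download(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    allow_patterns=["config.json"],
)
```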
Run the 4-bit fine-tuning:

python train.py loftq_4bit.yaml
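The file name loftq_4bit.yaml points to LoftQ-style 4-bit initialization. A minimal sketch of such a setup with peft is shown below; the rank, alpha, and target modules are illustrative assumptions and not necessarily what train.py reads from the YAML:

```python
import torch
from transformers import AutoModelForCausalLM
from peft import LoftQConfig, LoraConfig, get_peft_model

# LoftQ needs the full-precision weights for its initialization,
# so the base model is loaded in bfloat16 rather than pre-quantized.
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# 4-bit LoftQ initialization; r, lora_alpha, and target_modules are assumptions.
lora_config = LoraConfig(
    init_lora_weights="loftq",
    loftq_config=LoftQConfig(loftq_bits=4),
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()
```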
Run the evaluation benchmark:

sh benchmark.sh
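After training, the adapter can also be spot-checked on a financial question. This is a hedged example that assumes the adapter was saved to a local directory named output/; the path and the prompt are illustrative only:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Meta-Llama-3-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# "output/" is a placeholder for wherever train.py writes the LoRA adapter.
model = PeftModel.from_pretrained(base, "output/")

messages = [
    {"role": "user", "content": "What factors should I weigh before buying a bond fund?"}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```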