Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

train_llava 训练好以后出现空格 #185

Open
wrsnice opened this issue Oct 10, 2024 · 2 comments
Open

train_llava 训练好以后出现空格 #185

wrsnice opened this issue Oct 10, 2024 · 2 comments
Labels
good first issue Good for newcomers llava

Comments

@wrsnice
Copy link

wrsnice commented Oct 10, 2024

image

不管英文还是中文数据训练,都出现空格,并且推理信息很少,跟你发布的模型差异很大。

只使用了LLava的数据,没使用另外两个。训练了1个epoch,loss在2.6左右。

@yuanzhoulvpi2017
Copy link
Owner

这是你电脑字体问题吧

@yuanzhoulvpi2017
Copy link
Owner

别人也有反映,因此我看了一下。
出现这个错误,主要是因为preprocessor_config.json文件下: "processor_class": "LlavaProcessor",❌,这种是错误的。

这个值正确的为: "processor_class": "LlavaProcessor",

对比一下

错误的效果
截屏2024-12-15 17 54 18

正确的效果
截屏2024-12-15 17 52 48

建议调整一下这个参数设置,对应好之后,再训练一下。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers llava
Projects
None yet
Development

No branches or pull requests

2 participants