Feature/support qwenvl glm4-v phi3-v(conflict resolving) #4377

marko1616 · 2024-06-19T12:55:07Z

What does this PR do?

Fixes #4375

Before submitting

Did you read the contributor guideline?
Did you write any new necessary tests?

marko1616 · 2024-06-20T13:22:07Z

终于还差一个image的padding处理就能做好训练支持了。

marko1616 · 2024-06-20T13:27:09Z

@hiyouga 改的比较多捏，有空帮忙看看这个实现思路行不行。谢谢。

src/llamafactory/chat/hf_engine.py

src/llamafactory/data/loader.py

src/llamafactory/data/processors/processor_utils.py

src/llamafactory/model/adapter.py

marko1616 · 2024-06-21T08:27:13Z

成功跑了训练。

BUAADreamer · 2024-07-01T10:10:40Z

暂时先不要引入更多模型，把现有的三个模型完善好，控制diff🤗 @marko1616

marko1616 · 2024-07-01T11:07:38Z

暂时先不要引入更多模型，把现有的三个模型完善好，控制diff🤗 @marko1616

oaky，现在确实在测试别的训练模式。

marko1616 · 2024-07-11T09:26:53Z

你说的这个版本应该是可以运行的。glm4v没有使用任何新特性好像。如果遇到问题欢迎随时报告。 Get Outlook for Android<https://aka.ms/AAb9ysg>

…

________________________________ From: Coding Steven ***@***.***> Sent: Thursday, July 11, 2024 11:37:17 AM To: hiyouga/LLaMA-Factory ***@***.***> Cc: marko1616 ***@***.***>; Mention ***@***.***> Subject: Re: [hiyouga/LLaMA-Factory] Feature/support qwenvl glm4-v phi3-v(tested) (PR #4377) glm4v官方的demo要求的transformer版本低于您的transformer版本要求,导致无法兼容，我现在是4.40.2，请问下您的版本？ ― Reply to this email directly, view it on GitHub<#4377 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AKZ2M5MWLAEZCHBEZBIAKUDZLX4W3AVCNFSM6AAAAABJSALHYGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRRHE2TGMZQGA>. You are receiving this because you were mentioned.Message ID: ***@***.***>

src/llamafactory/chat/vllm_engine.py

chocoded · 2024-07-30T09:54:16Z

在sft训练时，eval步骤出现问题：
File ".../TuningFactory/src/llmtuner/train/sft/workflow.py", line 98, in run_sft
metrics = trainer.evaluate(metric_key_prefix="eval", **gen_kwargs)

...

File ".../glm-4v-9b/modeling_chatglm.py", line 1035, in forward
full_attention_mask = self.get_masks(inputs_embeds, past_key_values, padding_mask=attention_mask)

File ".../glm-4v-9b/modeling_chatglm.py", line 835, in get_masks
full_attention_mask = full_attention_mask * padding_mask.unsqueeze(1)
RuntimeError: The size of tensor a (1623) must match the size of tensor b (24) at non-singleton dimension 2

input_ids:
[151331, 151333, 151336, 198, 151339, 151329, 151340, 98598, 98992, 100555, 101052, 101939, 11314, 151337, 198, 111000, 127102, 98993, 114571, 98362, 104343, 1773, 151329]

可能的原因：

...
data_collator = DataCollatorForSeq2Seq(
        tokenizer=tokenizer,
        pad_to_multiple_of=8 if tokenizer.padding_side == "right" else None,  # for shift short attention
        label_pad_token_id=IGNORE_INDEX if data_args.ignore_pad_token_for_loss else tokenizer.pad_token_id,
)
...

在 sft/workflow.py 文件 59 行处设置了 pad_to_multiple_of=8，如果 attention_mask 长度不为 8 的倍数，则末尾需要用 0 填充，在 modeling_chatglm.py 第 1035 行处需要计算 full_attention_mask，eval 时不会在之前步骤重新计算 attention_mask，导致 inputs_embeds 和 attention_mask 长度不匹配，从而出现错误。

可能的修复：
在 glm4v 模型中，设置 pad_to_multiple_of=1，即不做填充，这也是我看到的 ms-swift 的做法。

marko1616 · 2024-07-31T13:04:29Z

v-9b/modeling_chatglm.py", line 1035, in forward full_attention_mask = self.get_masks(inputs_embeds, past_key_values, padding_mask=attention_mask)

File ".../glm-4v-9b/modeling_chatglm.py", line 835, in get_masks full_attention_mask = full_attention_mask * padding_mask.unsqueeze(1) RuntimeError: The size of tensor a (1623) must match the size of tensor b (24) at non-singleton dimension 2

input_ids: [151331, 151333, 151336, 198, 151339, 151329, 151340, 98598, 98992, 100555, 101052, 101939, 11314, 151337, 198, 111000,

真的非常感谢你指出了这个问题与详细的分析，我会更加仔细的查看对应的功能实现，并尽快给出可用的commit。

chocoded · 2024-08-01T07:54:26Z

补充：
glm4v 模型在无监督的情况下 train 的步骤会出现问题，原因是 _encode_unsupervised_example 中 encode 过程没有保证 input_ids 和 labels 长度对齐，在 modeling_chatglm.py 中 1216 行处计算 loss 时会报错。

marko1616 · 2024-08-01T08:24:01Z

补充： glm4v 模型在无监督的情况下 train 的步骤会出现问题，原因是 _encode_unsupervised_example 中 encode 过程没有保证 input_ids 和 labels 长度对齐，在 modeling_chatglm.py 中 1216 行处计算 loss 时会报错。

方便加个聊天方式交流一下吗？（非常感谢你完成了这些测试，我今天就会进行修正）

chocoded · 2024-08-01T08:35:13Z

方便加个聊天方式交流一下吗？（非常感谢你完成了这些测试，我今天就会进行修正）

行，我的qq是227154737

marko1616 · 2024-08-01T08:48:46Z

方便加个聊天方式交流一下吗？（非常感谢你完成了这些测试，我今天就会进行修正）

行，我的qq是227154737

OK加了

marko1616 and others added 4 commits June 19, 2024 14:11

Basic support for webui.

fbf19f8

Basic support for GLM4V

95b8a1d

Merge branch 'hiyouga:main' into feature/Support-Qwenvl

61a0880

Pass ruff check.

8044804

hiyouga added the pending This problem is yet to be addressed label Jun 19, 2024

Half of sft support and bug fix.

c58be83

marko1616 commented Jun 20, 2024

View reviewed changes

src/llamafactory/chat/hf_engine.py Show resolved Hide resolved

marko1616 commented Jun 20, 2024

View reviewed changes

src/llamafactory/data/loader.py Outdated Show resolved Hide resolved

marko1616 commented Jun 20, 2024

View reviewed changes

src/llamafactory/data/processors/processor_utils.py Show resolved Hide resolved

marko1616 commented Jun 20, 2024

View reviewed changes

src/llamafactory/model/adapter.py Outdated Show resolved Hide resolved

GLM4v lora sft support

4b01584

Little fix

c233520

marko1616 changed the title Feature/support qwenvl glm4-v *WORKING DO NOT MERGE* Feature/support qwenvl glm4-v (tested) Jun 23, 2024

hiyouga and others added 2 commits June 25, 2024 02:58

Merge branch 'main' into feature/Support-Qwenvl

078c85d

Fix requirements.txt

67542a0

BUAADreamer self-requested a review June 28, 2024 17:05

BUAADreamer and others added 6 commits June 29, 2024 01:45

fix conflict

e6aa967

QwenVL sft & webui train buxfix.

f698b43

phi3v infer support & rename.

3fa3a0b

Add rm,pt,ppo,kto,dpo support for glm4v(Not tested).

06823f4

Merge branch 'hiyouga:main' into feature/Support-Qwenvl

40e817c

little fix

4e4f959

marko1616 changed the title ~~Feature/support qwenvl glm4-v (tested)~~ Feature/support qwenvl glm4-v phi3-v(tested) Jun 30, 2024

marko1616 and others added 2 commits July 1, 2024 01:15

Pass ruff

4f564a1

Merge branch 'main' into feature/Support-Qwenvl

5065e87

marko1616 added 4 commits July 3, 2024 21:16

Phi3v lora sft fix.

4146242

fix get_template.

70ac8ea

Update for unsupervised dataset.

ea60231

Phi3v dataset processor fix.

b932bc0

zjysteven reviewed Jul 17, 2024

View reviewed changes

src/llamafactory/chat/vllm_engine.py Show resolved Hide resolved

Merge branch 'main' into feature/Support-Qwenvl

36932dd

marko1616 changed the title ~~Feature/support qwenvl glm4-v phi3-v(tested)~~ Feature/support qwenvl glm4-v phi3-v(conflict resolving) Jul 18, 2024

marko1616 added 3 commits July 19, 2024 03:53

Conflict fix

3c2ecba

RLHF support.

3f9ccb3

glm4v pairwise dataset support

9c6587e

marko1616 requested a review from zjysteven July 19, 2024 20:40

BUAADreamer removed the request for review from zjysteven July 21, 2024 02:26

Merge branch 'main' into feature/Support-Qwenvl

cfe0652

marko1616 and others added 2 commits August 20, 2024 22:01

Merge branch 'main' into feature/Support-Qwenvl

19a4cf7

Name fix.

e9d902b

marko1616 temporarily deployed to tests August 22, 2024 04:38 — with GitHub Actions Inactive

ruff pass.

65b64be

marko1616 marked this pull request as draft September 3, 2024 09:53

hiyouga force-pushed the main branch from 5569125 to b4c7dd3 Compare October 29, 2024 07:32

marko1616 closed this Dec 31, 2024

marko1616 deleted the feature/Support-Qwenvl branch December 31, 2024 12:25

hiyouga added wontfix This will not be worked on and removed pending This problem is yet to be addressed labels Dec 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/support qwenvl glm4-v phi3-v(conflict resolving) #4377

Feature/support qwenvl glm4-v phi3-v(conflict resolving) #4377

marko1616 commented Jun 19, 2024 •

edited

Loading

marko1616 commented Jun 20, 2024

marko1616 commented Jun 20, 2024

marko1616 commented Jun 21, 2024

BUAADreamer commented Jul 1, 2024 •

edited

Loading

marko1616 commented Jul 1, 2024

marko1616 commented Jul 11, 2024 via email

chocoded commented Jul 30, 2024

marko1616 commented Jul 31, 2024

chocoded commented Aug 1, 2024

marko1616 commented Aug 1, 2024 •

edited

Loading

chocoded commented Aug 1, 2024

marko1616 commented Aug 1, 2024

Feature/support qwenvl glm4-v phi3-v(conflict resolving) #4377

Feature/support qwenvl glm4-v phi3-v(conflict resolving) #4377

Conversation

marko1616 commented Jun 19, 2024 • edited Loading

What does this PR do?

Before submitting

marko1616 commented Jun 20, 2024

marko1616 commented Jun 20, 2024

marko1616 commented Jun 21, 2024

BUAADreamer commented Jul 1, 2024 • edited Loading

marko1616 commented Jul 1, 2024

marko1616 commented Jul 11, 2024 via email

chocoded commented Jul 30, 2024

marko1616 commented Jul 31, 2024

chocoded commented Aug 1, 2024

marko1616 commented Aug 1, 2024 • edited Loading

chocoded commented Aug 1, 2024

marko1616 commented Aug 1, 2024

marko1616 commented Jun 19, 2024 •

edited

Loading

BUAADreamer commented Jul 1, 2024 •

edited

Loading

marko1616 commented Aug 1, 2024 •

edited

Loading