-
Notifications
You must be signed in to change notification settings - Fork 5k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
使用API模型能力变差
bug
Something isn't working
pending
This problem is yet to be addressed
#7010
opened Feb 20, 2025 by
zzysos
1 task done
mac本用llama-factory如何用MPS ?
enhancement
New feature or request
pending
This problem is yet to be addressed
#7001
opened Feb 19, 2025 by
catman002
1 task done
A800 7*80g 全参微调qwen-2.5-32b OOM?
bug
Something isn't working
pending
This problem is yet to be addressed
#6999
opened Feb 19, 2025 by
coinfist-lucian
1 task done
minicpm_2_6o全量微调验证集eval_loss不计算不打印,也不绘制eval_loss图
bug
Something isn't working
pending
This problem is yet to be addressed
#6988
opened Feb 18, 2025 by
Maflyflyy
1 task done
[help]如何添加规则到数据集的每个条目中,并且不影响返回值
bug
Something isn't working
pending
This problem is yet to be addressed
#6984
opened Feb 18, 2025 by
frankyuan
1 task done
怎么返回输出token的prob呢?
enhancement
New feature or request
pending
This problem is yet to be addressed
#6980
opened Feb 18, 2025 by
PapaMadeleine2022
1 task done
converting model Error: unknown data type: I32
bug
Something isn't working
pending
This problem is yet to be addressed
#6971
opened Feb 17, 2025 by
nemoisfash
1 task done
rm模型代码更改问题
bug
Something isn't working
pending
This problem is yet to be addressed
#6967
opened Feb 17, 2025 by
zll0032
1 task done
用deepspeed zero-3-offload去微调DeepSeek-R1-Distill-Qwen-32B,系统卡住,长时间无反应
bug
Something isn't working
pending
This problem is yet to be addressed
#6964
opened Feb 17, 2025 by
erichuazhou
1 task done
[HELP] Runable solution of RTX 5090 GPU + Linux Driver version + Pytorch version + Deepspeed version for LLM finetuning
bug
Something isn't working
pending
This problem is yet to be addressed
#6958
opened Feb 17, 2025 by
0781532
1 task done
单机多卡 resume_from_checkpoint 时报错 assert len(self.ckpt_list) > 0
bug
Something isn't working
pending
This problem is yet to be addressed
#6955
opened Feb 16, 2025 by
Cassieyy
1 task done
Error in the process of fine-tuning qwen2.5vl-7b evaluate&predict data = [self.dataset[idx] for idx in possibly_batched_index] KeyError: 0
bug
Something isn't working
pending
This problem is yet to be addressed
#6947
opened Feb 14, 2025 by
illusionnnnnnn
1 task done
使用unsloth报错
bug
Something isn't working
pending
This problem is yet to be addressed
#6945
opened Feb 14, 2025 by
Harris-Xie
1 task done
Training Qwen/Qwen2.5-Coder-32B-Instruct model OOM
bug
Something isn't working
pending
This problem is yet to be addressed
#6942
opened Feb 14, 2025 by
mertunsall
1 task done
rm lora训练完成后,执行export合并后结果模型并不是一个reward model。
bug
Something isn't working
pending
This problem is yet to be addressed
#6934
opened Feb 14, 2025 by
yangyang6666
1 task done
安装完成后执行 llamafactory-cli -v 报 fatal error: stdlib.h: No such file or directory #include_next <stdlib.h>,CentOS7
bug
Something isn't working
pending
This problem is yet to be addressed
#6932
opened Feb 14, 2025 by
lunza
1 task done
0.9.2版本训练deepseek3问题
bug
Something isn't working
pending
This problem is yet to be addressed
#6923
opened Feb 13, 2025 by
TexasRangers86
1 task done
failed to docker build
bug
Something isn't working
pending
This problem is yet to be addressed
#6922
opened Feb 13, 2025 by
Danee-wawawa
1 task done
希望支持 Tencent-Hunyuan-7B
enhancement
New feature or request
pending
This problem is yet to be addressed
#6919
opened Feb 12, 2025 by
xenv
1 task done
Qwen2.5-VL-7B-Instruct推理bug:RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
bug
Something isn't working
pending
This problem is yet to be addressed
#6910
opened Feb 12, 2025 by
Felixvillas
1 task done
deepseek微调后进行推理输出混乱
bug
Something isn't working
pending
This problem is yet to be addressed
#6908
opened Feb 12, 2025 by
HelloWorld506
1 task done
the cutoff of multimodal input sequence
enhancement
New feature or request
pending
This problem is yet to be addressed
#6891
opened Feb 11, 2025 by
JJJYmmm
1 task done
sft断点续训时0卡上有显存不释放
bug
Something isn't working
pending
This problem is yet to be addressed
#6880
opened Feb 10, 2025 by
wufenglailai
1 task done
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.