Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Artifacts in Long Video Reconstruction with VAE (e.g., 225 Frames, Sample Rate = 2) #9

Open
WilliamLLee opened this issue Feb 12, 2025 · 2 comments

Comments

@WilliamLLee
Copy link

When using VAE to reconstruct long videos (e.g., 225 frames) with a sample_rate of 2 (or more), artifacts tend to appear towards the end of the video. Has anyone encountered this issue, and if so, what might be causing it? How can these artifacts be mitigated?

Image
@WilliamLLee
Copy link
Author

This issue seems to arise from the tiling problem. For the same video clip, when tiling is enabled, artifacts appear; when tiling is disabled, the reconstruction works as expected. I am using the WFVAE inference code version included in Open-Sora-Plan. You can find it here: WFVAE Inference Code.

Could you please let me know if there is any difference between the WFVAE inference code in this repository and the one in Open-Sora-Plan? If there is no difference, I suspect there might be a bug in the tiling implementation within the WFVAE inference code. I am currently debugging it, and I wanted to check if you have encountered and solved this issue before?

@qqingzheng
Copy link
Collaborator

qqingzheng commented Feb 13, 2025

Hi, it is caused by the setting of tiling size. You should choose a suitable tiling size for a specific frame number.

Open-Sora-Plan: https://github.com/PKU-YuanGroup/Open-Sora-Plan/blob/main/opensora/models/causalvideovae/model/vae/modeling_wfvae.py#L348

WF-VAE: https://github.com/PKU-YuanGroup/WF-VAE/blob/main/causalvideovae/model/vae/modeling_wfvae.py#L331

Fo high sample rate scenarios, I suggest using lpips weight = 0.1 and enabling dynamic sample (which adopted by CogVideoX) fine-tuning WF-VAE to achieve better results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants