You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When using VAE to reconstruct long videos (e.g., 225 frames) with a sample_rate of 2 (or more), artifacts tend to appear towards the end of the video. Has anyone encountered this issue, and if so, what might be causing it? How can these artifacts be mitigated?
The text was updated successfully, but these errors were encountered:
This issue seems to arise from the tiling problem. For the same video clip, when tiling is enabled, artifacts appear; when tiling is disabled, the reconstruction works as expected. I am using the WFVAE inference code version included in Open-Sora-Plan. You can find it here: WFVAE Inference Code.
Could you please let me know if there is any difference between the WFVAE inference code in this repository and the one in Open-Sora-Plan? If there is no difference, I suspect there might be a bug in the tiling implementation within the WFVAE inference code. I am currently debugging it, and I wanted to check if you have encountered and solved this issue before?
Fo high sample rate scenarios, I suggest using lpips weight = 0.1 and enabling dynamic sample (which adopted by CogVideoX) fine-tuning WF-VAE to achieve better results.
When using VAE to reconstruct long videos (e.g., 225 frames) with a sample_rate of 2 (or more), artifacts tend to appear towards the end of the video. Has anyone encountered this issue, and if so, what might be causing it? How can these artifacts be mitigated?
The text was updated successfully, but these errors were encountered: