Noisy results with "order == 1" (trying to replicate DDIM results) #36

WikiChao · 2023-04-24T04:27:43Z

Hi authors,

Thank you for the nice paper and clear code and documentation!!

I am trying DPM-Solver in my project for sampling acceleration. Previously, I could obtain reasonable results with DDIM (steps=10, 100, ...), but the results I get with DPM-Solver are pretty bad. Could you give some suggestions on the implementation?

Here are the details of my model:
(1) Training: DDPM (L1 loss, predicting noise), T=1000, UNet with additional condition inputs, trained on audio data.
(2) Beta schedule: sigmoid schedule (following https://arxiv.org/abs/2212.11972)

Code snippet that uses DPM-solver in my project:

    self.betas = sigmoid_beta_schedule(timesteps=1000)
    self.noise_schedule = NoiseScheduleVP(schedule='discrete', betas=self.betas)
    self.model_fn = model_wrapper(
        self.net,
        self.noise_schedule,
        model_type="noise",  # or "x_start" or "v" or "score"
        model_kwargs={},
    )
    self.dpm_solver = DPM_Solver(self.model_fn, self.noise_schedule, algorithm_type="dpmsolver")

After the definition:

    x_T = torch.randn(input.shape, device = "cuda")
    pred = self.dpm_solver.sample(
        x_T,
        condition,
        steps=20,
        order=1,
        skip_type="time_uniform",
        method="singlestep",
    )
    pred = unnormalize_to_zero_to_one(pred)

Thanks a lot!

LuChengTHU · 2023-04-24T10:38:15Z

Hi @WikiChao , does your code contain this line?: https://github.com/LuChengTHU/dpm-solver/blob/main/dpm_solver_pytorch.py#L105

If so, could you please print the first 5 and last 5 items of log_alphas?
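A minimal sketch of that check, assuming log_alphas is half the cumulative sum of log(1 - beta), which follows from alpha_t = sqrt(prod(1 - beta_i)). The linear schedule below is only a hypothetical stand-in for the sigmoid schedule in the question, and the helper names are made up for illustration:

```python
import math

def linear_betas(timesteps, beta_start=1e-4, beta_end=2e-2):
    """Hypothetical stand-in for the sigmoid beta schedule used in the issue."""
    step = (beta_end - beta_start) / (timesteps - 1)
    return [beta_start + i * step for i in range(timesteps)]

def log_alphas_from_betas(betas):
    """log(alpha_t) = 0.5 * cumulative sum of log(1 - beta_i)."""
    out, running = [], 0.0
    for beta in betas:
        running += math.log(1.0 - beta)
        out.append(0.5 * running)
    return out

def log_snr(log_alpha):
    """Half-log-SNR lambda = log(alpha) - log(sigma), with sigma^2 = 1 - alpha^2."""
    return log_alpha - 0.5 * math.log(1.0 - math.exp(2.0 * log_alpha))

betas = linear_betas(1000)
log_alphas = log_alphas_from_betas(betas)

print("first 5 log_alphas:", log_alphas[:5])
print("last 5 log_alphas: ", log_alphas[-5:])
print("lambda at t=T:     ", log_snr(log_alphas[-1]))
```

Very negative values at the end of the list mean alpha is close to zero at t=T, which is the regime where the log-SNR becomes extreme and numerical issues are most likely.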

WikiChao · 2023-04-25T19:02:12Z

Thanks for the prompt reply! The trick did help. It seems I was using an earlier version that was missing that line of code.

The results make sense now, but they are still worse than DDIM. I have tried different settings, e.g., "multistep" or "singlestep", "order = 2 or 3", "steps = 10 to 100", but I cannot beat DDIM. Are there any tricks for choosing hyperparameters, for example, clipping the log-SNR at different values?

Thanks a lot!

Chao

LuChengTHU · 2023-04-26T15:56:53Z

Hi @WikiChao ,

"but they are still worse than DDIM": In fact, order=1 is exactly DDIM. You can try to reproduce the DDIM results by manually setting the timesteps in

dpm-solver/dpm_solver_pytorch.py

Line 453 in 5c6ee9f

    def get_time_steps(self, skip_type, t_T, t_0, N, device):

to be the same as in your DDIM code, to check which part is missing. I guess you need to tune the timesteps carefully.
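To illustrate why the timestep grids can differ, here is a rough sketch that compares two grids for 10 steps. It rests on assumptions not confirmed in this thread: that "time_uniform" is an even spacing from t_T = 1 down to t_0 = 1/1000, that continuous t maps to a discrete index via (t - 1/N) * N, and that the DDIM code uses one common convention (integer steps from T-1 down to -1). All function names are hypothetical:

```python
def linspace(start, stop, num):
    """Evenly spaced values from start to stop, inclusive."""
    step = (stop - start) / (num - 1)
    return [start + i * step for i in range(num)]

def dpm_time_uniform(steps, total_n=1000, t_T=1.0):
    """Assumed 'time_uniform' grid: even spacing in continuous time."""
    t_0 = 1.0 / total_n
    return linspace(t_T, t_0, steps + 1)

def to_discrete_index(t, total_n=1000):
    """Assumed mapping of continuous t in [1/N, 1] to an index in [0, N-1]."""
    return (t - 1.0 / total_n) * total_n

def ddim_grid(steps, total_n=1000):
    """One common DDIM convention (an assumption): T-1 down to -1."""
    times = linspace(-1.0, total_n - 1.0, steps + 1)
    return [int(t) for t in reversed(times)]

dpm_idx = [to_discrete_index(t) for t in dpm_time_uniform(10)]
ddim_idx = ddim_grid(10)

for a, b in zip(dpm_idx, ddim_idx):
    print(f"DPM-Solver index {a:8.2f}   DDIM index {b:5d}")
```

Under these assumptions the two grids start at the same index but drift apart and end at different points, so matching the grid used in the DDIM code is a reasonable first thing to check.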
