Unstable critic training #1

atagle123 · 2025-01-12T13:11:07Z

Hi, great work! I'm trying to reproduce the results on AntMaze and MuJoCo, but critic training becomes unstable (the loss explodes) in certain domains. Could you provide the batch size used for each experiment to help avoid these instabilities? Also, do you use cosine annealing in the critic training? I really appreciate it. Thank you!
