Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More details of the "baseline acoustic codec" #2

Open
hbwu-ntu opened this issue Sep 2, 2024 · 1 comment
Open

More details of the "baseline acoustic codec" #2

hbwu-ntu opened this issue Sep 2, 2024 · 1 comment

Comments

@hbwu-ntu
Copy link

hbwu-ntu commented Sep 2, 2024

Hi, thank you for the amazing work. May I ask two questions about Table 1?

  1. Could you please provide more detailed descriptions about the baseline acoustic codec in your paper? Does it
  • exclude the $S$ in the encoder side but still try to predict the $\hat{S}$
  • remove the entire blue block in Figure 1
  1. About the Encodec and DAC, do you use their released ckpts, or do you train counterparts with the same dataset (LibriSpeech)?
@zhenye234
Copy link
Owner

Thank you for your interest and your questions!

1, Yes, the baseline acoustic codec excludes the entire blue block in Figure 1.
2, We used their released checkpoints for both Encodec and DAC.
Looking forward to further discussions with you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants