More details of the "baseline acoustic codec" #2

hbwu-ntu · 2024-09-02T12:20:17Z

Hi, thank you for the amazing work. May I ask two questions about Table 1?

Could you please describe the baseline acoustic codec in your paper in more detail? Does it

- exclude $S$ on the encoder side but still try to predict $\hat{S}$, or
- remove the entire blue block in Figure 1?

For Encodec and DAC, did you use their released checkpoints, or did you train counterparts on the same dataset (LibriSpeech)?


zhenye234 · 2024-09-02T12:49:14Z

Thank you for your interest and your questions!

1. Yes, the baseline acoustic codec excludes the entire blue block in Figure 1.
2. We used their released checkpoints for both Encodec and DAC.

Looking forward to further discussions with you!
