Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

多通道输出的标签问题 #1

Open
wanghao0225 opened this issue Jul 29, 2022 · 1 comment
Open

多通道输出的标签问题 #1

wanghao0225 opened this issue Jul 29, 2022 · 1 comment

Comments

@wanghao0225
Copy link

您好,看了您的论文《EMBEDDING AND BEAMFORMING: ALL-NEURAL CAUSAL BEAMFORMER FOR MULTICHANNEL SPEECH ENHANCEMENT》和对应的代码,有一个疑问:多通道的target是什么?看论文和代码都没有具体说,您是以某一个通道作为整体的target还是多个通道有多个target,然后分别进行处理的?

@Andong-Li-speech
Copy link
Owner

您好,看了您的论文《EMBEDDING AND BEAMFORMING: ALL-NEURAL CAUSAL BEAMFORMER FOR MULTICHANNEL SPEECH ENHANCEMENT》和对应的代码,有一个疑问:多通道的target是什么?看论文和代码都没有具体说,您是以某一个通道作为整体的target还是多个通道有多个target,然后分别进行处理的?

您好,因为我们最后做的是一个filter-and-sum的操作,因此输入到输出是一个MISO的过程,用到的标签是参考通道的目标语音。如果您要利用多个通道目标得到多个通道输出,有两种方式,一个是MIMO,另一个是利用圆阵的旋转不变性依次推理M次(M代表通道数)。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants