Change the repository type filter
All
Repositories list
28 repositories
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
- The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction"
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
SAR-SSL
PublicA python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]- A library built for easier audio self-supervised training, downstream tasks evaluation
RVAE-EM
PublicOfficial PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]FullSubNet
PublicPyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."McNet
PublicThe official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023RCT
PublicNarrowband_DeepFiltering
PublicRTF_InterFrameSpecSub
PublicRS_noisePSD
PublicDP_RTF_SSL
Publicbss_ctf_lasso
Publicdereverb_ctf_nonneg
PublicBSS_CTF_EM
PublicLSTM-noisePSD
Publicctf_mint
PublicOnlineSSL_DPRTF_EG
PublicSMIF_online_dereverb
Public