Audio-WestlakeU

VINP Public
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification'

Audio-WestlakeU/VINP’s past year of commit activity

Python 5 MIT 2 0 0 Updated Feb 20, 2025
FS-EEND Public
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction"

Audio-WestlakeU/FS-EEND’s past year of commit activity

Python 115 MIT 5 6 0 Updated Feb 18, 2025
RVAE-EM Public
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

Audio-WestlakeU/RVAE-EM’s past year of commit activity

Python 42 MIT 4 0 0 Updated Jan 25, 2025
NBSS Public
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Audio-WestlakeU/NBSS’s past year of commit activity

Python 255 MIT 31 22 0 Updated Jan 1, 2025
UMA-ASR Public
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).

Audio-WestlakeU/UMA-ASR’s past year of commit activity

Shell 22 5 1 0 Updated Dec 17, 2024
RealMAN Public
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Audio-WestlakeU/RealMAN’s past year of commit activity

Python 112 12 4 0 Updated Dec 11, 2024
FN-SSL Public
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

Audio-WestlakeU/FN-SSL’s past year of commit activity

Python 102 11 2 0 Updated Dec 9, 2024
ATST-SED Public
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

Audio-WestlakeU/ATST-SED’s past year of commit activity

Jupyter Notebook 120 MIT 13 1 0 Updated Oct 15, 2024
SAR-SSL Public
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]

Audio-WestlakeU/SAR-SSL’s past year of commit activity

Python 34 MIT 1 2 0 Updated Oct 11, 2024
ATST-RCT Public
ATST-RCT model for DCASE 2022 task4.

Audio-WestlakeU/ATST-RCT’s past year of commit activity

Python 2 0 0 0 Updated Sep 19, 2024

View all repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Audio-WestlakeU

Pinned Loading

Repositories

People

Top languages

Most used topics