Skip to content
Change the repository type filter

All

    Repositories list

    • Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization
      Jupyter Notebook
      MIT License
      2500Updated Dec 21, 2024Dec 21, 2024
    • sgmse

      Public
      Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
      Python
      MIT License
      76541100Updated Dec 21, 2024Dec 21, 2024
    • Python
      MIT License
      0100Updated Dec 19, 2024Dec 19, 2024
    • gen-se

      Public
      Investigating Training Objectives for Generative Speech Enhancement
      HTML
      0300Updated Dec 11, 2024Dec 11, 2024
    • 2sderev

      Public
      Two-stage Dereverberation Algorithm using DNN-supported multi-channel linear filtering and single-channel non-linear post-filtering
      Python
      2300Updated Oct 21, 2024Oct 21, 2024
    • buddy

      Public
      BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
      Python
      43710Updated Oct 18, 2024Oct 18, 2024
    • HTML
      0200Updated Sep 24, 2024Sep 24, 2024
    • Generation scripts for EARS-WHAM and EARS-Reverb
      Python
      32700Updated Sep 16, 2024Sep 16, 2024
    • storm

      Public
      StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
      Python
      MIT License
      2619520Updated Sep 13, 2024Sep 13, 2024
    • HTML
      0000Updated Jun 25, 2024Jun 25, 2024
    • HTML
      0000Updated Apr 24, 2024Apr 24, 2024
    • driftrec

      Public
      DriftRec: Adapting diffusion models to blind image restoration tasks
      Python
      0420Updated Feb 23, 2024Feb 23, 2024
    • Python
      145021Updated Feb 9, 2024Feb 9, 2024
    • derevdps

      Public
      Python
      11310Updated Jan 12, 2024Jan 12, 2024
    • sgmse_crp

      Public
      Python
      GNU Affero General Public License v3.0
      22110Updated Jan 9, 2024Jan 9, 2024
    • TODO
      Python
      MIT License
      83711Updated Nov 1, 2023Nov 1, 2023
    • livepty

      Public
      Live Iterative Ptychography with projection-based algorithms
      Python
      GNU General Public License v3.0
      0300Updated Sep 21, 2023Sep 21, 2023
    • diffphase

      Public
      DiffPhase: Generative Diffusion-based STFT Phase Retrieval
      Python
      MIT License
      01100Updated Sep 21, 2023Sep 21, 2023
    • Python
      MIT License
      2500Updated Jun 6, 2023Jun 6, 2023
    • Python
      MIT License
      01510Updated Mar 30, 2023Mar 30, 2023
    • Continous Phoneme Recognition based on Audio-Visual Modality Fusion
      Python
      MIT License
      1200Updated Jul 23, 2022Jul 23, 2022
    • Repository for the paper "Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement".
      Python
      2610Updated Aug 30, 2021Aug 30, 2021
    • Re-implementation of the paper "An End-to-End Multimodal Voice Activity Detection Using WaveNet Encoder and Residual Networks"
      Python
      1600Updated Aug 3, 2021Aug 3, 2021
    • MATLAB
      MIT License
      0400Updated May 29, 2021May 29, 2021
    • This is the repository of the paper
      Jupyter Notebook
      0800Updated May 7, 2021May 7, 2021
    • stcn-nmf

      Public
      VAE and STCN with NMF for single-channel speech enhancement
      Python
      MIT License
      41400Updated Mar 24, 2021Mar 24, 2021
    • Compressed Long Short-Term Memory (CLSTM) Keras Layer
      Python
      Other
      0000Updated Nov 22, 2020Nov 22, 2020
    • Dual-Path RNN for Single-Channel Speech Separation (in Keras-Tensorflow2)
      Python
      MIT License
      63420Updated Jun 2, 2020Jun 2, 2020
    • mp-gtf

      Public
      Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python
      Python
      Other
      74630Updated Apr 30, 2020Apr 30, 2020