YuanGongND

Follow

Yuan Gong YuanGongND

Follow

Research Scientist, MIT CSAIL

389 followers · 2 following

MIT
Cambridge, MA
23:29 (UTC -05:00)
yuangongnd.github.io

Achievements

Achievements

Pinned Loading

ltu ltu Public

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 396 38
whisper-at whisper-at Public

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 343 28
gopt gopt Public

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Python 154 28
cav-mae cav-mae Public

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 242 22
ssast ssast Public

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python 366 60
ast ast Public

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1.2k 220