Open-source datasets and deep learning models for separating sounds.
-
Audio from YFCC100M videos for mixture-invariant training (MixIT).
-
Audio-visual YFCC100M with annotations for on-screen sound separation with AudioScope.
-
Audio-visual YFCC100M with annotations for on-screen sound separation with AudioScopeV2.
-
Synthetic AMI for speech separation in meeting room scenarios.