Skip to content

Latest commit

 

History

History
33 lines (22 loc) · 1.03 KB

tutorial for WebUI-1.6-Version.md

File metadata and controls

33 lines (22 loc) · 1.03 KB

Tutorial for WebUI 1.6 Version

We have added one new feature

  • We have added an audio-grounding feature that tracks the sound-making object within the video's soundtrack.

audio-grounding

1. Clone the audio-spectrum transformer (AST) model to ./ast_master and download the pretrained model

git clone https://github.com/YuanGongND/ast.git ast_master
wget -O ./ast_master/pretrained_models/audio_mdl.pth https://www.dropbox.com/s/cv4knew8mvbrnvq/audioset_0.4593.pth?dl=1

2. Switch to Audio-Tab by clicking the audio-grounding Tab

3. Upload video or use example directly

4. Alter the number of labels and the confidence interval of the resulting labels according to your personal preference

step 1: detect the label of the sound making object

step 2: ground the sound making object

5. Get mask of selected object by clicking Detect button

6. Track in video