Hello, may I ask if ATST Frame can publicly disclose the models and scripts used for inference? #12
Comments
Hi,
Thanks for your reply~!
Hi, I will let you know once the upload has finished ; )
@Angelalilyer You could try this checkpoint file, hope it helps!
Thank you so much!!
Is there complete inference code available? I tried to modify "audiossl/audiossl/methods/atstframe/downstream/train_strong.py", but it kept reporting errors.
Did you solve the problem?
I tried to write the inference code myself, but I couldn't get it to output predicted labels. May I ask if there is relatively complete inference code for ATST Frame (AudioSet strong label)? Thanks again for your help!
I wrote a quick solution in a new pull request, #13. Can you test it?
Thanks!!
The strong AudioSet includes some extra MIDs that are excluded from the original AudioSet ontology; you could refer to the official page and download the mid_to_display_name.tsv file. According to the file,
Thank you very much for your help! The problem has been solved~
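For anyone landing here with the same question, the MID lookup mentioned above can be sketched roughly as follows. This is not code from the repository: it assumes mid_to_display_name.tsv has two tab-separated columns (MID, display name) and no header row, and the `load_mid_to_name` helper, the sample rows, and `class_order` are purely illustrative.

```python
import csv
import io


def load_mid_to_name(tsv_file):
    """Read a two-column TSV (MID, display name) into a dict.

    Assumption: no header row; rows with fewer than two columns are skipped.
    """
    mapping = {}
    for row in csv.reader(tsv_file, delimiter="\t"):
        if len(row) < 2:
            continue
        mid, name = row[0].strip(), row[1].strip()
        mapping[mid] = name
    return mapping


# Illustrative stand-in for the downloaded file; with the real file, use
# open("mid_to_display_name.tsv", newline="") instead of io.StringIO.
sample = "/m/09x0r\tSpeech\n/m/0bt9lr\tDog\n"
mid_to_name = load_mid_to_name(io.StringIO(sample))

# Hypothetical class order of the model's output dimensions.
class_order = ["/m/0bt9lr", "/m/09x0r"]

# Fall back to the raw MID when a class is missing from the file.
labels = [mid_to_name.get(mid, mid) for mid in class_order]
print(labels)
```

Falling back to the raw MID keeps the output usable even when a class is missing from the mapping file.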
I only found the training code, and my test dataset is unlabeled. I would like to try using ATST Frame to detect sound events in a new dataset. Perhaps you have inference code and models? Looking forward to your reply!
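As a rough illustration of what "detecting sound events" on unlabeled audio involves once a model has produced frame-level class probabilities, here is a minimal post-processing sketch. It is not the repository's inference code and does not load ATST Frame: the `decode_events` helper, the 0.5 threshold, and the 0.04 s frame hop are assumptions to be replaced with the values from your own setup.

```python
def decode_events(probs, class_names, threshold=0.5, frame_hop_s=0.04):
    """Turn frame-level probabilities into (label, onset_s, offset_s) events.

    probs: list of frames, each a list of per-class probabilities.
    threshold and frame_hop_s are assumed values, not taken from the repo.
    """
    events = []
    for c, name in enumerate(class_names):
        onset = None  # index of the frame where the current event started
        for t, frame in enumerate(probs):
            active = frame[c] >= threshold
            if active and onset is None:
                onset = t
            elif not active and onset is not None:
                events.append((name, onset * frame_hop_s, t * frame_hop_s))
                onset = None
        # Close an event that is still active at the end of the clip.
        if onset is not None:
            events.append((name, onset * frame_hop_s, len(probs) * frame_hop_s))
    return events


# Toy probabilities for two classes over four frames (made-up numbers).
toy_probs = [[0.9, 0.1], [0.8, 0.2], [0.3, 0.7], [0.2, 0.9]]
for label, start, end in decode_events(toy_probs, ["Speech", "Dog"]):
    print(f"{label}: {start:.2f}s to {end:.2f}s")
```

In practice, a median filter or per-class thresholds are often applied before this step; they are left out here to keep the sketch short.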