How to detect the noise, breaks and multi-person speak in a audio? #673

Open
4 tasks done
TomSuen opened this issue Dec 27, 2024 · 2 comments
Labels
question Further information is requested

Comments

TomSuen commented Dec 27, 2024

Checks

  • This template is only for questions, not feature requests or bug reports.
  • I have thoroughly reviewed the project documentation and read the related paper(s).
  • I have searched for existing issues, including closed ones, no similar questions.
  • I confirm that I am using English to submit this report in order to facilitate communication.

Question details

Hello, I am not in the audio field. I have a reference audio clip from which I have removed the BGM and reverberation to a certain extent, but the result of feeding it into voice cloning is still not good. Is there a better way to detect whether the reference audio contains noise, distortion, or multiple people speaking?

TomSuen added the question label Dec 27, 2024

sam4muzix commented Dec 27, 2024

Removing BGM and reverb from audio also strips out or damages many frequency ranges, which makes the result harder for the model to analyze. So it's better to use a different source that contains only voice. Whisper can still transcribe the processed audio, but for the audio itself a raw, voice-only dataset is recommended.
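
For the detection part of the question, a rough first-pass check could look like the sketch below. It is not part of this project and is only an illustration: the function names (`clipping_ratio`, `frame_levels_db`, `level_gap_db`), the 0.99 threshold, and the 1024-sample frame are all assumptions. It flags clipping (one common form of distortion) and measures the gap between loud and quiet frames as a crude stand-in for signal-to-noise ratio. It does not detect multiple speakers; that would need a speaker diarization tool.

```python
import math
from typing import List, Sequence


def clipping_ratio(samples: Sequence[float], threshold: float = 0.99) -> float:
    """Fraction of samples at or above `threshold` of full scale.

    Assumes float audio normalized to [-1.0, 1.0].
    """
    if not samples:
        return 0.0
    clipped = sum(1 for s in samples if abs(s) >= threshold)
    return clipped / len(samples)


def frame_levels_db(samples: Sequence[float], frame_len: int = 1024) -> List[float]:
    """RMS level in dB for each full, non-overlapping frame."""
    levels = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(s * s for s in frame) / frame_len)
        # Floor the RMS so digital silence does not produce log10(0).
        levels.append(20.0 * math.log10(max(rms, 1e-10)))
    return levels


def level_gap_db(samples: Sequence[float], frame_len: int = 1024) -> float:
    """Gap between loud frames (about the 90th percentile) and quiet
    frames (about the 10th percentile), in dB.

    A small gap suggests the pauses between words are not much quieter
    than the speech, i.e. a high noise floor. This is a heuristic,
    not a calibrated SNR measurement.
    """
    levels = sorted(frame_levels_db(samples, frame_len))
    if not levels:
        raise ValueError("audio is shorter than one frame")
    low = levels[int(0.1 * (len(levels) - 1))]
    high = levels[int(0.9 * (len(levels) - 1))]
    return high - low
```

Here `samples` is a mono clip already loaded as floats by whatever audio loader you use. A `clipping_ratio` well above zero or a small `level_gap_db` would be a hint to look for a cleaner reference recording; where to draw the line is something to tune on your own clips.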

mlndlesslydev commented

This project helped me a lot, but it's a bit of a pain to install and it only works on Linux.
