Experiment with adding speaker diarization #45
Labels: good first issue · hacktoberfest-accepted · help wanted · high priority
Speaker diarization means annotating a transcript to show which words were spoken by which speaker.
There are Python tools that do this. It would be great to try them out and see whether any of them would work for our project; a quick sketch with one candidate follows.
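One widely used candidate is pyannote.audio. Here's a minimal sketch of what trying it out might look like, assuming the 3.x pipeline API and a Hugging Face access token; the exact model name and argument names vary by version, so treat all of them as placeholders:

```python
# Sketch only: evaluating pyannote.audio as a diarization candidate.
# The model name, token requirement, and file name are assumptions.
from pyannote.audio import Pipeline

pipeline = Pipeline.from_pretrained(
    "pyannote/speaker-diarization-3.1",   # gated model; requires a HF token
    use_auth_token="YOUR_HUGGINGFACE_TOKEN",
)

diarization = pipeline("meeting.wav")

# Each turn is a time span attributed to one speaker label.
for turn, _, speaker in diarization.itertracks(yield_label=True):
    print(f"{turn.start:.1f}s - {turn.end:.1f}s: {speaker}")
```

If this works well, the speaker turns could then be aligned against our transcript timestamps to tag each word.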
We may also end up implementing our own speaker diarization, either in this repo or in a separate repo that we depend on. I attended a talk last night about how News UK did this with their own dynamic clustering of vectorized speaker embeddings: they used the large Whisper model to transcribe their audio files and then implemented speaker diarization with their own algorithm. I vaguely recall they used https://github.com/NVIDIA/NeMo for the auto-clustering. A rough sketch of that transcribe-then-cluster pattern follows below.
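For illustration, here is a sketch of the general approach combining openai-whisper with scikit-learn's agglomerative clustering. The `embed_segment` helper is hypothetical; a real version would come from a speaker-embedding model (e.g. NVIDIA NeMo or speechbrain), and the file name and clustering threshold are arbitrary placeholders, not the News UK implementation:

```python
# Sketch only: transcribe with Whisper, then assign speakers by
# clustering one speaker embedding per transcript segment.
import numpy as np
import whisper
from sklearn.cluster import AgglomerativeClustering

def embed_segment(audio_path: str, start: float, end: float) -> np.ndarray:
    """Hypothetical: return a fixed-size speaker embedding for audio[start:end]."""
    raise NotImplementedError("plug in a real speaker-embedding model here")

model = whisper.load_model("large")        # transcription
result = model.transcribe("meeting.wav")   # includes timestamped segments

# One embedding per Whisper segment.
embeddings = np.stack([
    embed_segment("meeting.wav", seg["start"], seg["end"])
    for seg in result["segments"]
])

# n_clusters=None lets the (placeholder) distance threshold decide
# how many speakers there are.
labels = AgglomerativeClustering(
    n_clusters=None, distance_threshold=1.0
).fit_predict(embeddings)

for seg, label in zip(result["segments"], labels):
    print(f"[{seg['start']:.1f}-{seg['end']:.1f}] SPEAKER_{label}: {seg['text'].strip()}")
```

The appeal of this route is that we keep full control over the clustering step, at the cost of maintaining it ourselves.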
Contributions welcome from anyone who wants to play with this!