Replies: 4 comments 1 reply

Could be combined with the existing audio detection label of `speech` to efficiently determine which clips should have speech-to-text applied.

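A rough sketch of how that could look from the outside: pull only events labelled `speech` from Frigate's HTTP API and hand their clips to a transcriber. The endpoint and query parameter names here are assumptions (they can vary between Frigate versions), and `transcribe()` is just a placeholder.

```python
# Sketch: only transcribe clips that Frigate's audio detection labelled "speech".
# NOTE: endpoint and parameter names are assumptions; check your Frigate version.
import requests

FRIGATE = "http://frigate.local:5000"  # assumption: your Frigate host


def speech_events(limit=20):
    # Assumed query parameters ("labels", "has_clip"); adjust as needed.
    resp = requests.get(
        f"{FRIGATE}/api/events",
        params={"labels": "speech", "has_clip": 1, "limit": limit},
    )
    resp.raise_for_status()
    return resp.json()


def download_clip(event_id, dest):
    # Assumed clip endpoint serving the event recording as MP4.
    with requests.get(f"{FRIGATE}/api/events/{event_id}/clip.mp4", stream=True) as r:
        r.raise_for_status()
        with open(dest, "wb") as f:
            for chunk in r.iter_content(chunk_size=1 << 16):
                f.write(chunk)


for event in speech_events():
    path = f"/tmp/{event['id']}.mp4"
    download_clip(event["id"], path)
    # transcribe(path)  # placeholder: hand off to whisper/faster-whisper/etc.
```
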
This could also be helpful: https://github.com/Carleslc/AudioToText. I would really like this feature!

Moonshine looks small, fast, and available as ONNX models. I wonder if real-time word triggers could be a thing as part of audio detection? https://www.reddit.com/r/LocalLLaMA/comments/1hh5y87/moonshine_web_realtime_inbrowser_speech/

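Not tied to Moonshine specifically, but a minimal sketch of what a word trigger on top of any speech-to-text output could look like; the trigger words and the text stream below are made up for illustration, and the STT model is assumed to be producing the chunks elsewhere.

```python
# Sketch: scan each chunk of transcribed text for trigger words and fire a callback.
from typing import Callable, Iterable

TRIGGERS = {"help", "fire", "intruder"}  # hypothetical trigger words


def watch_for_triggers(text_chunks: Iterable[str],
                       on_trigger: Callable[[str, str], None]) -> None:
    """Check each transcribed chunk against the trigger set."""
    for chunk in text_chunks:
        words = {w.strip(".,!?").lower() for w in chunk.split()}
        for hit in words & TRIGGERS:
            on_trigger(hit, chunk)


# Example usage with a fake transcript stream:
if __name__ == "__main__":
    fake_stream = ["someone is at the door", "Help, the gate is open!"]
    watch_for_triggers(fake_stream,
                       lambda word, ctx: print(f"trigger '{word}': {ctx}"))
```
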
The yt-wsp.sh script over at whisper.cpp could be pretty helpful for anyone who doesn't want to wait for an implementation in Frigate. Right now it downloads a YouTube video and works on the MP4 from there, but with minimal modifications it could certainly do the same for recordings produced by Frigate: it generates an SRT file and then bakes it into the original video.

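For anyone wanting to try that flow on a local recording without adapting the script, here is a rough Python sketch of the same steps, assuming ffmpeg and a whisper.cpp build are available; the binary and model paths are placeholders for your own setup.

```python
# Sketch of the yt-wsp.sh flow applied to a local recording: extract 16 kHz mono
# WAV with ffmpeg, run whisper.cpp to get an SRT, then mux the subtitles into the MP4.
import subprocess
from pathlib import Path

WHISPER_BIN = "./whisper-cli"              # assumption: path to the whisper.cpp CLI
WHISPER_MODEL = "models/ggml-base.en.bin"  # assumption: a downloaded ggml model


def subtitle_recording(clip: str, out: str) -> None:
    clip_path = Path(clip)
    wav = clip_path.with_suffix(".wav")
    srt_base = clip_path.with_suffix("")   # whisper.cpp appends ".srt" itself

    # 1. Extract audio in the format whisper.cpp expects (16 kHz mono PCM).
    subprocess.run(["ffmpeg", "-y", "-i", clip, "-ar", "16000", "-ac", "1",
                    "-c:a", "pcm_s16le", str(wav)], check=True)

    # 2. Transcribe and write an SRT file next to the clip.
    subprocess.run([WHISPER_BIN, "-m", WHISPER_MODEL, "-f", str(wav),
                    "-osrt", "-of", str(srt_base)], check=True)

    # 3. Bake the subtitles into a new MP4 as a mov_text track.
    subprocess.run(["ffmpeg", "-y", "-i", clip, "-i", f"{srt_base}.srt",
                    "-c", "copy", "-c:s", "mov_text", out], check=True)


subtitle_recording("front_door-1700000000.mp4", "front_door-subtitled.mp4")
```
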
Using faster-whisper or something similar, process complete video segments and either add a caption track (probably easier to play back later) or a caption file that can then be added to Frigate search.
It might be possible to leverage existing projects to add this capability:
https://github.com/McCloudS/subgen

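A minimal sketch of the caption-file half of that idea with faster-whisper, writing an SRT file for a recorded segment; the model size and compute type are just example choices, and faster-whisper decodes audio via PyAV, so common video containers can usually be passed in directly.

```python
# Sketch: transcribe a recorded segment with faster-whisper and write an SRT file.
from faster_whisper import WhisperModel


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def write_captions(recording: str, srt_path: str) -> None:
    model = WhisperModel("small", device="cpu", compute_type="int8")
    segments, _info = model.transcribe(recording, vad_filter=True)
    with open(srt_path, "w", encoding="utf-8") as f:
        for i, seg in enumerate(segments, start=1):
            f.write(f"{i}\n"
                    f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
                    f"{seg.text.strip()}\n\n")


write_captions("recording.mp4", "recording.srt")
```
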