Replies: 1 comment
-
Could be combined with the existing audio detection label of speech to efficiently determine which clips should have speech to text applied. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Using faster whisper or something similar, process complete video segments and ether add a caption track (probably easier to play back later) or caption file that can then be added to frigate search.
It might be possible to leverage existing projects to add this capability.
https://github.com/McCloudS/subgen
Beta Was this translation helpful? Give feedback.
All reactions