go-whisper golang bindings #1

djthorpe · 2022-12-01T19:21:25Z

Create bindings for https://github.com/ggerganov/whisper.cpp

Simple golang bindings with tests
Some examples (main, sample) based off of these
Integrate with ffmpeg for audio conversion
Some sort of real-time translation
gRPC and/or websocket API
Docker image of a speech-to-text service

djthorpe · 2022-12-18T10:46:34Z

chrisbward · 2022-12-20T10:40:04Z

Great work!

Keen on realtime translation and a way of calling out/streaming the output to another app - gRPC seems the best option for this

djthorpe · 2022-12-20T11:45:19Z

Yeah thanks.

I'm doing the audio downsampling to 16 kHz at the moment in a different repository (go-media).

The realtime transcription and translation should be fairly straightforward to add, but they are still quite experimental, even in whisper.cpp.

It will take me a while to get to the gRPC microservice :-(

djthorpe · 2023-01-06T21:34:40Z

Added a "stream" command for the start of real-time streaming, but:

Thread safety: Needs some work to ensure the same model can be used in the process method across threads/goroutines
Ring buffer: Implement a ring buffer for continuous audio samples
Overlaps: Need some word overlaps to ensure we don't lose words between sample windows
Silence: Don't process audio when silence is fed in. Ideally chunk windows when there is a largish (>1s) silence
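
The ring buffer, overlap and silence items above could be sketched roughly as follows. This is a minimal illustration and not code from go-whisper: the names `RingBuffer`, `Window` and `IsSilent`, and the RMS threshold, are all hypothetical. The mutex only protects the buffer itself; it does not address sharing a whisper model across goroutines.

```go
package main

import (
	"math"
	"sync"
)

// RingBuffer keeps the most recent audio samples in a fixed-size circular
// buffer. Hypothetical helper for illustration, not part of go-whisper.
type RingBuffer struct {
	mu    sync.Mutex
	data  []float32
	start int // index of the oldest buffered sample
	size  int // number of buffered samples
}

// NewRingBuffer returns a buffer holding at most capacity samples.
func NewRingBuffer(capacity int) *RingBuffer {
	if capacity < 1 {
		capacity = 1
	}
	return &RingBuffer{data: make([]float32, capacity)}
}

// Write appends samples, overwriting the oldest ones once the buffer is full.
func (r *RingBuffer) Write(samples []float32) {
	r.mu.Lock()
	defer r.mu.Unlock()
	for _, s := range samples {
		end := (r.start + r.size) % len(r.data)
		r.data[end] = s
		if r.size < len(r.data) {
			r.size++
		} else {
			r.start = (r.start + 1) % len(r.data)
		}
	}
}

// Window returns a copy of the oldest n samples, then advances the read
// position by n-overlap so that the last overlap samples are repeated at the
// start of the next window. It returns nil if fewer than n samples are
// buffered or the arguments are invalid.
func (r *RingBuffer) Window(n, overlap int) []float32 {
	r.mu.Lock()
	defer r.mu.Unlock()
	if n <= 0 || overlap < 0 || overlap >= n || r.size < n {
		return nil
	}
	out := make([]float32, n)
	for i := range out {
		out[i] = r.data[(r.start+i)%len(r.data)]
	}
	step := n - overlap
	r.start = (r.start + step) % len(r.data)
	r.size -= step
	return out
}

// IsSilent reports whether the root-mean-square level of samples is below
// threshold. The threshold is an assumption and would need tuning.
func IsSilent(samples []float32, threshold float64) bool {
	if len(samples) == 0 {
		return true
	}
	var sum float64
	for _, s := range samples {
		sum += float64(s) * float64(s)
	}
	return math.Sqrt(sum/float64(len(samples))) < threshold
}
```

A capture goroutine would call `Write` as samples arrive, while the processing loop repeatedly calls `Window` and skips any window for which `IsSilent` returns true. Overlapping by a fixed number of samples is only an approximation of the word overlap described above.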

There are also some issues with the segmenting in the main package (repeated segments come out!) which need fixing.

djthorpe · 2024-07-30T07:21:51Z

djthorpe · 2024-07-31T08:08:48Z

Also:

Fix client so that it works with text/event-stream for both downloading models and transcription
Logging is generally not working. Fix it so that all messages apart from errors are suppressed unless in debug mode. Also log requests
Add some metrics in there somewhere
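
For the text/event-stream item, a client-side reader could look roughly like the sketch below. It follows the general server-sent events line format (`event:` and `data:` fields, with a blank line ending each event) and is not the go-whisper client; the names `Event`, `ReadEvents` and `ParseString` are hypothetical, and the thread does not say which event names or payloads the server actually sends.

```go
package main

import (
	"bufio"
	"io"
	"strings"
)

// Event is one parsed server-sent event. Hypothetical type for illustration.
type Event struct {
	Name string // value of the "event" field, empty if absent
	Data string // "data" lines joined with newlines
}

// ReadEvents parses a text/event-stream body from r and calls fn once per
// event. Lines longer than bufio.Scanner's default limit (64 KiB) make it
// return an error; a real client may need a larger buffer.
func ReadEvents(r io.Reader, fn func(Event)) error {
	scanner := bufio.NewScanner(r)
	var ev Event
	var data []string
	for scanner.Scan() {
		line := scanner.Text()
		switch {
		case line == "":
			// A blank line ends the event; dispatch only if it carried data.
			if len(data) > 0 {
				ev.Data = strings.Join(data, "\n")
				fn(ev)
			}
			ev, data = Event{}, nil
		case strings.HasPrefix(line, ":"):
			// Comment line, often used as a keep-alive; ignore it.
		default:
			field, value, _ := strings.Cut(line, ":")
			value = strings.TrimPrefix(value, " ")
			switch field {
			case "event":
				ev.Name = value
			case "data":
				data = append(data, value)
			}
		}
	}
	return scanner.Err()
}

// ParseString is a convenience wrapper that collects all events in s.
func ParseString(s string) ([]Event, error) {
	var events []Event
	err := ReadEvents(strings.NewReader(s), func(e Event) {
		events = append(events, e)
	})
	return events, err
}
```

In a client, `ReadEvents` would be handed the response body of a request whose response has the `text/event-stream` content type, so that download progress or transcription segments can be handled as they arrive instead of after the whole response is read.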

djthorpe · 2024-07-31T09:12:20Z

Also:

Fix resampling of raw audio in the go-media code, so we can again ingest WAV files without "input changed" errors

djthorpe · 2024-08-08T08:24:16Z

Simplified Dockerfile and now uses the base images from here as a base:

https://github.com/mutablelogic/docker-llamacpp

This is still not working; I now need to have the ffmpeg shared libraries included in the runtime image. I'm considering whether to just copy the libraries over from the build image, or to build and install the ffmpeg libraries from source.

paradoxe35 · 2024-12-20T19:14:32Z

Hey @djthorpe,

Thank you for your excellent work on ggerganov/whisper.cpp#269. However, it seems the binding you developed isn't compatible with the latest version of whisper.cpp.

Do you have plans to update it soon, or is there an updated version available somewhere?

djthorpe · 2024-12-20T19:46:52Z

Hi @paradoxe35 how are you?

Actually this repository contains updated bindings, and I would like to merge them into whisper.cpp at some point. You can find them under github.com/mutablelogic/go-whisper/sys/whisper but I haven't tested them recently. Let me know if that's useful to you?

paradoxe35 · 2024-12-20T23:09:00Z

Thank you, @djthorpe, for your feedback. Unfortunately, I couldn't run github.com/mutablelogic/go-whisper/sys/whisper. To simplify things, I've decided to use the previous version of whisper.cpp that is compatible with the existing binding.

djthorpe self-assigned this Dec 1, 2022
