Replace AVS with custom ASR service #20

sskorol · 2020-09-13T13:27:12Z

In the dev manual you mentioned:

It's also a good example showing how to utilize the librespeaker. Users can implement their own server application / daemon to invoke librespeaker.

Is there any reference on how to use respeakerd w/o AVS? I just want to apply DSP algorithms (AGC, NS, AEC, etc.) to the input audio stream captured from Respeaker Core V2, and redirect the filtered audio as a byte array via web sockets to my ASR server. Is there any similar example? Or maybe you can provide a short description of what should be changed in the existing code to support such a scenario?

P.S. I saw python client in a separate repo. But it doesn't use any DSP.

Would be greatly appreciated any help.

The text was updated successfully, but these errors were encountered:

sskorol · 2020-11-08T17:36:28Z

@fanjm95, @jerryyip maybe you have some thoughts folks?

spidey99 · 2020-11-26T04:59:25Z

Bump!

I'm trying to create an always listening device, so circumventing the wake-word mentality, and want to pass the audio down stream for processing. I'm having a heck of a time peeling back the layers. I'm looking for an example similar to above.

sskorol · 2020-11-29T15:21:10Z

@spidey99 seems like this repo is dead and not maintained anymore. Moreover, main contributors don't answer even to emails. I didn't find any help on official forum as well. Unfortunately, they flushed such a perspective idea down the toilet.

I spent a lot of time poking around these repos and their dependencies. Finally, I decided to avoid wasting time on this particular project anymore. Actually, I believe the entire idea of re-using Respeaker Core hardware with AVS is a dead-end, as it makes no sense to buy a $99 board to get another Alexa (assuming Echo Dot is much cheaper, especially on Black Friday).

For me, Seeed Studio had to concentrate on a software part that allows developers all over the world to easily connect their own SST/TTS services. It would make more sense for people who are willing to make an offline ASR solution based on languages that aren't supported by Amazon or Google. That's why I decided to focus my effort on extending librespeaker samples.

Now I have a working prototype, which can stream audio chunks to custom WebSocket ASR server. Technically, there are 2 transports implemented in this repo: WS and MQTT. So we can send audio data the way we want.

However, I'm not a C++ developer. My primary language is Java/TS. So there are still lots of things I want to improve. Unfortunately, can't do it right now due to a lack of C++ expertise. So if you have any ideas or suggestions, PRs are always welcome. I hope there will be more people who want to resurrect and improve this idea. As it's really hard to do it alone.

songtaoshi · 2021-06-28T07:01:57Z

Maybe I am late and not quite understanding the context, but I think you can just use pyaudio get the stream and push it into your ASR service.

sskorol · 2021-06-28T09:23:03Z

@songtaoshi if you just get the stream from pyaudio, there won't be any DSP algorithms applied at all. It makes no sense to send a raw audio stream to ASR w/o preprocessing. This board's value is only in DSP (NS, BF, AEC, etc.) that could be achieved only programmatically via librespeaker. I don't believe anyone wants to use a $99 hardware just as a usb mic array. There are much cheaper alternatives for this.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace AVS with custom ASR service #20

Replace AVS with custom ASR service #20

sskorol commented Sep 13, 2020 •

edited

Loading

sskorol commented Nov 8, 2020

spidey99 commented Nov 26, 2020

sskorol commented Nov 29, 2020

songtaoshi commented Jun 28, 2021

sskorol commented Jun 28, 2021 •

edited

Loading

Replace AVS with custom ASR service #20

Replace AVS with custom ASR service #20

Comments

sskorol commented Sep 13, 2020 • edited Loading

sskorol commented Nov 8, 2020

spidey99 commented Nov 26, 2020

sskorol commented Nov 29, 2020

songtaoshi commented Jun 28, 2021

sskorol commented Jun 28, 2021 • edited Loading

sskorol commented Sep 13, 2020 •

edited

Loading

sskorol commented Jun 28, 2021 •

edited

Loading